* QNAP TS-209 broken by "orion/kirkwood: reset PCIe unit on boot"
@ 2010-10-19 19:05 Martin Michlmayr
  2010-10-19 19:32 ` Nicolas Pitre
  0 siblings, 1 reply; 10+ messages in thread
From: Martin Michlmayr @ 2010-10-19 19:05 UTC (permalink / raw)
  To: linux-arm-kernel

I noticed that 2.6.36-rc no longer boots on my QNAP TS-209.  I get
some output but then the machine resets.

I've bisected it down to the patch "orion/kirkwood: reset PCIe unit on
boot".  I remember this patch was controversial when it was proposed.
Looks like it breaks at least one orion5x machine.  Maybe the patch
should be reverted for now or only be called on kirkwood.

I remember that Marvell was looking into this issue.  Saeed, have you
had a chance to investigate this more in the meantime?  See
http://marc.info/?l=linux-ide&m=124822863706181&w=2 for the original
problem.



Starting kernel ...

Uncompressing Linux... done, booting the kernel.
Linux version 2.6.36-rc8 (tbm at pergolesi) (gcc version 4.1.3 20080704 (prerelease) (Debian 4.1.2-25)) #91 PREEMPT Tue Oct 19 17:55:31 UTC 2010
CPU: Feroceon [41069260] revision 0 (ARMv5TEJ), cr=a0053177
CPU: VIVT data cache, VIVT instruction cache
Machine: QNAP TS-109/TS-209
Clearing invalid memory bank 0KB at 0x00000000
Clearing invalid memory bank 0KB at 0x00000000
Clearing invalid memory bank 0KB at 0x00000000
Ignoring unrecognised tag 0x00000000
Ignoring unrecognised tag 0x00000000
Ignoring unrecognised tag 0x00000000
Ignoring unrecognised tag 0x41000403
bootconsole [earlycon0] enabled
Memory policy: ECC disabled, Data cache writeback
Built 1 zonelists in Zone order, mobility grouping on.  Total pages: 32512
Kernel command line: console=ttyS0,115200n8 root=/dev/sda2 earlyprintk
PID hash table entries: 512 (order: -1, 2048 bytes)
Dentry cache hash table entries: 16384 (order: 4, 65536 bytes)
Inode-cache hash table entries: 8192 (order: 3, 32768 bytes)
Memory: 128MB = 128MB total
Memory: 126240k/126240k available, 4832k reserved, 0K highmem
Virtual kernel memory layout:
    vector  : 0xffff0000 - 0xffff1000   (   4 kB)
    fixmap  : 0xfff00000 - 0xfffe0000   ( 896 kB)
    DMA     : 0xffc00000 - 0xffe00000   (   2 MB)
    vmalloc : 0xc8800000 - 0xfd800000   ( 848 MB)
    lowmem  : 0xc0000000 - 0xc8000000   ( 128 MB)
    modules : 0xbf000000 - 0xc0000000   (  16 MB)
      .init : 0xc0008000 - 0xc0027000   ( 124 kB)
      .text : 0xc0027000 - 0xc0370000   (3364 kB)
      .data : 0xc0370000 - 0xc038da40   ( 119 kB)
SLUB: Genslabs=11, HWalign=32, Order=0-3, MinObjects=0, CPUs=1, Nodes=1
Hierarchical RCU implementation.
	RCU-based detection of stalled CPUs is disabled.
	Verbose stalled-CPUs detection is disabled.
NR_IRQS:64
Calibrating delay loop... 332.59 BogoMIPS (lpj=1662976)
pid_max: default: 32768 minimum: 301
Mount-cache hash table entries: 512
CPU: Testing write buffer coherency: ok
NET: Registered protocol family 16
Orion ID: MV88F5182-A2. TCLK=166666667.
tsx09: found ethernet mac address 00:08:9b:ac:6a:04.

         __  __                      _ _
        |  \/  | __ _ _ ____   _____| | |
        | |\/| |/ _` | '__\ \ / / _ \ | |
        | |  | | (_| | |   \ V /  __/ | |
        |_|  |_|\__,_|_|    \_/ \___|_|_|
 _   _     ____              _
| | | |   | __ )  ___   ___ | |_ 
| | | |___|  _ \ / _ \ / _ \| __| 
| |_| |___| |_) | (_) | (_) | |_ 
 \___/    |____/ \___/ \___/ \__|  ** LOADER **

 QNAP SYSTEMS, INC.

-- 
Martin Michlmayr
http://www.cyrius.com/


* QNAP TS-209 broken by "orion/kirkwood: reset PCIe unit on boot"
  2010-10-19 19:05 QNAP TS-209 broken by "orion/kirkwood: reset PCIe unit on boot" Martin Michlmayr
@ 2010-10-19 19:32 ` Nicolas Pitre
  2010-10-21 13:39   ` Saeed Bishara
  0 siblings, 1 reply; 10+ messages in thread
From: Nicolas Pitre @ 2010-10-19 19:32 UTC (permalink / raw)
  To: linux-arm-kernel

On Tue, 19 Oct 2010, Martin Michlmayr wrote:

> I noticed that 2.6.36-rc no longer boots on my QNAP TS-209.  I get
> some output but then the machine resets.
> 
> I've bisected it down to the patch "orion/kirkwood: reset PCIe unit on
> boot".  I remember this patch was controversial when it was proposed.
> Looks like it breaks at least one orion5x machine.  Maybe the patch
> should be reverted for now or only be called on kirkwood.

According to the log, you did agree with that patch at the time.

Keeping it causes breakage.  Removing it causes breakage too.  So I'd 
much prefer if a solution to solve both breakages was merged instead.

> I remember that Marvell was looking into this issue.  Saeed, have you
> had a chance to investigate this more in the meantime?  See
> http://marc.info/?l=linux-ide&m=124822863706181&w=2 for the original
> problem.

Nicolas


* QNAP TS-209 broken by "orion/kirkwood: reset PCIe unit on boot"
  2010-10-19 19:32 ` Nicolas Pitre
@ 2010-10-21 13:39   ` Saeed Bishara
  2010-10-21 20:25     ` Nicolas Pitre
  0 siblings, 1 reply; 10+ messages in thread
From: Saeed Bishara @ 2010-10-21 13:39 UTC (permalink / raw)
  To: linux-arm-kernel

>
>> I noticed that 2.6.36-rc no longer boots on my QNAP TS-209.  I get
>> some output but then the machine resets.
>> 
>> I've bisected it down to the patch "orion/kirkwood: reset PCIe unit on
>> boot".  I remember this patch was controversial when it was proposed.
>> Looks like it breaks at least one orion5x machine.  Maybe the patch
>> should be reverted for now or only be called on kirkwood.
>
>According to the log, you did agree with that patch at the time.
>
>Keeping it causes breakage.  Removing it causes breakage too.  So I'd 
>much prefer if a solution to solve both breakages was merged instead.
We couldn't pinpoint the root cause of this issue; in fact, we failed
to reproduce it.
Nico, I suggest applying orion_pcie_reset() only on the boards where
the issue has been observed.
saeed


* QNAP TS-209 broken by "orion/kirkwood: reset PCIe unit on boot"
  2010-10-21 13:39   ` Saeed Bishara
@ 2010-10-21 20:25     ` Nicolas Pitre
  2010-10-24 11:13       ` Saeed Bishara
  2010-10-25 15:12       ` Martin Michlmayr
  0 siblings, 2 replies; 10+ messages in thread
From: Nicolas Pitre @ 2010-10-21 20:25 UTC (permalink / raw)
  To: linux-arm-kernel

On Thu, 21 Oct 2010, Saeed Bishara wrote:

> >
> >> I noticed that 2.6.36-rc no longer boots on my QNAP TS-209.  I get
> >> some output but then the machine resets.
> >> 
> >> I've bisected it down to the patch "orion/kirkwood: reset PCIe unit on
> >> boot".  I remember this patch was controversial when it was proposed.
> >> Looks like it breaks at least one orion5x machine.  Maybe the patch
> >> should be reverted for now or only be called on kirkwood.
> >
> >According to the log, you did agree with that patch at the time.
> >
> >Keeping it causes breakage.  Removing it causes breakage too.  So I'd 
> >much prefer if a solution to solve both breakages was merged instead.
> We couldn't pinpoint the root cause of this issue; in fact, we failed
> to reproduce it.
> Nico, I suggest applying orion_pcie_reset() only on the boards where
> the issue has been observed.
> saeed

OK, so I did, and pushed the result to the Orion git tree 
(git://git.kernel.org/pub/scm/linux/kernel/git/nico/orion.git).

Can I have confirmation that 1) new breakages are gone, and 2) the 
devices where this reset did fix problems are still OK with this change?


Nicolas


* QNAP TS-209 broken by "orion/kirkwood: reset PCIe unit on boot"
  2010-10-21 20:25     ` Nicolas Pitre
@ 2010-10-24 11:13       ` Saeed Bishara
  2010-10-25 15:12       ` Martin Michlmayr
  1 sibling, 0 replies; 10+ messages in thread
From: Saeed Bishara @ 2010-10-24 11:13 UTC (permalink / raw)
  To: linux-arm-kernel

 

>-----Original Message-----
>From: Nicolas Pitre [mailto:nico at fluxnic.net] 
>Sent: Thursday, October 21, 2010 10:25 PM
>To: Saeed Bishara
>Cc: Martin Michlmayr; Olaf Rempel; buytenh at wantstofly.org; 
>brl at pcpool00.mathematik.uni-freiburg.de; 
>linux-arm-kernel at lists.infradead.org
>Subject: RE: QNAP TS-209 broken by "orion/kirkwood: reset PCIe 
>unit on boot"
>
>On Thu, 21 Oct 2010, Saeed Bishara wrote:
>
>> >
>> >> I noticed that 2.6.36-rc no longer boots on my QNAP TS-209.  I get
>> >> some output but then the machine resets.
>> >> 
>> >> I've bisected it down to the patch "orion/kirkwood: reset PCIe unit on
>> >> boot".  I remember this patch was controversial when it was proposed.
>> >> Looks like it breaks at least one orion5x machine.  Maybe the patch
>> >> should be reverted for now or only be called on kirkwood.
>> >
>> >According to the log, you did agree with that patch at the time.
>> >
>> >Keeping it causes breakage.  Removing it causes breakage too.  So I'd
>> >much prefer if a solution to solve both breakages was merged instead.
>> We couldn't pinpoint the root cause of this issue; in fact, we failed
>> to reproduce it.
>> Nico, I suggest applying orion_pcie_reset() only on the boards where
>> the issue has been observed.
>> saeed
>
>OK, so I did, and pushed the result to the Orion git tree 
>(git://git.kernel.org/pub/scm/linux/kernel/git/nico/orion.git).
>
>Can I have confirmation that 1) new breakages are gone, and 2) the 
>devices where this reset did fix problems are still OK with this change?
The original issue was reported on a Synology DS409 (also Kirkwood-based),
but that board is not supported in mainline yet.
saeed
>
>
>Nicolas
>


* QNAP TS-209 broken by "orion/kirkwood: reset PCIe unit on boot"
  2010-10-21 20:25     ` Nicolas Pitre
  2010-10-24 11:13       ` Saeed Bishara
@ 2010-10-25 15:12       ` Martin Michlmayr
  2010-10-27 11:16         ` Martin Michlmayr
  1 sibling, 1 reply; 10+ messages in thread
From: Martin Michlmayr @ 2010-10-25 15:12 UTC (permalink / raw)
  To: linux-arm-kernel

* Nicolas Pitre <nico@fluxnic.net> [2010-10-21 16:25]:
> Can I have confirmation that 1) new breakages are gone, and 2) the
> devices where this reset did fix problems are still OK with this change?

I can confirm 1) on my QNAP TS-209.  I also booted a kernel with the
patch on my TS-419P but I never had a reliable way to trigger the
issue.  Maybe Olaf or Bernhard can help with this.
-- 
Martin Michlmayr
http://www.cyrius.com/


* QNAP TS-209 broken by "orion/kirkwood: reset PCIe unit on boot"
  2010-10-25 15:12       ` Martin Michlmayr
@ 2010-10-27 11:16         ` Martin Michlmayr
  2010-10-27 12:57           ` Nicolas Pitre
  0 siblings, 1 reply; 10+ messages in thread
From: Martin Michlmayr @ 2010-10-27 11:16 UTC (permalink / raw)
  To: linux-arm-kernel

* Martin Michlmayr <tbm@cyrius.com> [2010-10-25 16:12]:
> I can confirm 1) on my QNAP TS-209.  I also booted a kernel with the
> patch on my TS-419P but I never had a reliable way to trigger the
> issue.  Maybe Olaf or Bernhard can help with this.

A Debian user has confirmed in the meantime that the patch helps on
the TS-419P.

-- 
Martin Michlmayr
http://www.cyrius.com/


* QNAP TS-209 broken by "orion/kirkwood: reset PCIe unit on boot"
  2010-10-27 11:16         ` Martin Michlmayr
@ 2010-10-27 12:57           ` Nicolas Pitre
  2010-10-27 14:40             ` Martin Michlmayr
  0 siblings, 1 reply; 10+ messages in thread
From: Nicolas Pitre @ 2010-10-27 12:57 UTC (permalink / raw)
  To: linux-arm-kernel

On Wed, 27 Oct 2010, Martin Michlmayr wrote:

> * Martin Michlmayr <tbm@cyrius.com> [2010-10-25 16:12]:
> > I can confirm 1) on my QNAP TS-209.  I also booted a kernel with the
> > patch on my TS-419P but I never had a reliable way to trigger the
> > issue.  Maybe Olaf or Bernhard can help with this.
> 
> A Debian user has confirmed in the meantime that the patch helps on
> the TS-419P.

Helped what, exactly?


Nicolas


* QNAP TS-209 broken by "orion/kirkwood: reset PCIe unit on boot"
  2010-10-27 12:57           ` Nicolas Pitre
@ 2010-10-27 14:40             ` Martin Michlmayr
  2010-10-27 19:31               ` Nicolas Pitre
  0 siblings, 1 reply; 10+ messages in thread
From: Martin Michlmayr @ 2010-10-27 14:40 UTC (permalink / raw)
  To: linux-arm-kernel

* Nicolas Pitre <nico@fluxnic.net> [2010-10-27 08:57]:
> > A Debian user has confirmed in the meantime that the patch helps on
> > the TS-419P.
> 
> Helped what, exactly?

The SATA problems on TS-419P are gone, i.e. this confirms 2) of your
"Can I have confirmation that 1) new breakages are gone, and 2) the
devices where this reset did fix problems are still OK with this
change?"

-- 
Martin Michlmayr
http://www.cyrius.com/


* QNAP TS-209 broken by "orion/kirkwood: reset PCIe unit on boot"
  2010-10-27 14:40             ` Martin Michlmayr
@ 2010-10-27 19:31               ` Nicolas Pitre
  0 siblings, 0 replies; 10+ messages in thread
From: Nicolas Pitre @ 2010-10-27 19:31 UTC (permalink / raw)
  To: linux-arm-kernel

On Wed, 27 Oct 2010, Martin Michlmayr wrote:

> * Nicolas Pitre <nico@fluxnic.net> [2010-10-27 08:57]:
> > > A Debian user has confirmed in the meantime that the patch helps on
> > > the TS-419P.
> > 
> > Helped what, exactly?
> 
> The SATA problems on TS-419P are gone, i.e. this confirms 2) of your
> "Can I have confirmation that 1) new breakages are gone, and 2) the
> devices where this reset did fix problems are still OK with this
> change?"

OK.  I'd like to have confirmation that what remains of the workaround 
is enough to cover those machines where it is needed.  But given that 
most feedback I get so far is "I was never able to reproduce the issue 
myself in the first place" then I'll simply push this change as is.


Nicolas
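
The resolution discussed in this thread, calling the PCIe reset only on
boards where it was observed to help, can be sketched in plain C as
follows.  This is a minimal userspace sketch under stated assumptions,
not the change that was pushed to the Orion tree: the quirk table,
board_needs_pcie_reset(), board_pcie_init() and fake_pcie_reset() are
hypothetical names, and fake_pcie_reset() merely stands in for
orion_pcie_reset(), whose real signature is not shown in the thread.

```c
#include <assert.h>
#include <stddef.h>
#include <string.h>

/* Hypothetical per-board quirk table; entries are illustrative only. */
struct board_quirk {
	const char *name;
	int needs_pcie_reset;
};

static const struct board_quirk board_quirks[] = {
	{ "QNAP TS-209",  0 },	/* reported to reset during boot with the patch */
	{ "QNAP TS-419P", 1 },	/* SATA problems reported gone with the reset */
};

static int pcie_reset_count;	/* counts calls to the stand-in reset */

/* Stand-in for the real reset routine; it only records that it ran. */
static void fake_pcie_reset(void)
{
	pcie_reset_count++;
}

/* Return 1 if the named board is listed as needing the reset, else 0. */
static int board_needs_pcie_reset(const char *name)
{
	size_t i;

	assert(name != NULL);
	for (i = 0; i < sizeof(board_quirks) / sizeof(board_quirks[0]); i++)
		if (strcmp(board_quirks[i].name, name) == 0)
			return board_quirks[i].needs_pcie_reset;
	return 0;	/* unknown boards: leave the PCIe unit alone */
}

/* Board init sketch: reset the PCIe unit only where it is known to help. */
static void board_pcie_init(const char *name)
{
	if (board_needs_pcie_reset(name))
		fake_pcie_reset();
	/* the normal PCIe setup would follow here */
}
```

Calling board_pcie_init("QNAP TS-209") leaves pcie_reset_count
unchanged, while board_pcie_init("QNAP TS-419P") increments it once.
Defaulting unknown boards to "no reset" mirrors the suggestion to limit
the workaround to boards where the issue was actually seen.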

