* raid10 kernel panic on sparc64
@ 2007-04-02  0:15 ` Jan Engelhardt
  0 siblings, 0 replies; 14+ messages in thread
From: Jan Engelhardt @ 2007-04-02  0:15 UTC (permalink / raw)
  To: linux-raid; +Cc: sparclinux

Hi,


just when I did
# mdadm -C /dev/md2 -b internal -e 1.0 -l 10 -n 4 /dev/sd[cdef]4
(created)
# mdadm -D /dev/md2
Killed

dmesg filled up with a kernel oops. A few seconds later, the box
locked solid. Since I was only logged in via ssh and there is not (yet)
any way to reset it remotely, this is all I can give right now,
the last 80x25 screen:

l4: 0000000000000000 l5: 0000000000000000 l6: 0000000000000000
l7: 0000000000000i0: fffff8007f218d18 i1: fffff8002e3d9608
i2: 000000000047f974 i3: 0000000000000i4: 0000000000000000
i5: 00000000006e2800 i6: fffff80008c12a41 i7: 0000000000526
I7: <elv_next_request+0x94/0x188>
Caller[00000000005263e8]: elv_next_request+0x94/0x188
Caller[0000000010086618]: scsi_request_fn+0x60/0x3f4 [scsi_mod]
Caller[0000000000529b70]: __generic_unplug_device+0x34/0x3c
Caller[000000000052a7d4]: generic_unplug_device+0x14/0x2c
Caller[0000000000526e48]: blk_backing_dev_unplug+0x20/0x28
Caller[00000000004a464c]: block_sync_page+0x64/0x6c
Caller[000000000047f9d0]: sync_page+0x64/0x74
Caller[0000000000677e48]: __wait_on_bit_lock+0x58/0x90
Caller[000000000047f86c]: __lock_page+0x54/0x5c
Caller[00000000004802ec]: do_generic_mapping_read+0x204/0x49c
Caller[0000000000480d68]: __generic_file_aio_read+0x120/0x18c
Caller[0000000000481fdc]: generic_file_read+0x70/0x94
Caller[00000000004a3920]: vfs_read+0xa0/0x14c
Caller[00000000004a3c8c]: sys_read+0x34/0x60
Caller[0000000000406c54]: linux_sparc_syscall32+0x3c/0x40
Caller[000000000003c6b4]: 0x3c6bc
Instruction DUMP: 921022bd  7c0e4ea2  90122098 <91d02005> 80a0a020  1848000c
80 [0000000010281cdc] sync_request+0x898/0x8e4 [raid10]
 [00000000005f6fb4] md_do_sync+0x454/0x89c
 [00000000005f69ec] md_thread+0x100/0x11c

Kernel is kernel-smp-2.6.16-1.2128sp4.sparc64.rpm from Aurora Corona.
Perhaps it helps, otherwise hold your breath until I reproduce it.


Thanks,
Jan
-- 


* Re: raid10 kernel panic on sparc64
  2007-04-02  0:15 ` Jan Engelhardt
@ 2007-04-02  3:13   ` David Miller
  -1 siblings, 0 replies; 14+ messages in thread
From: David Miller @ 2007-04-02  3:13 UTC (permalink / raw)
  To: jengelh; +Cc: linux-raid, sparclinux

From: Jan Engelhardt <jengelh@linux01.gwdg.de>
Date: Mon, 2 Apr 2007 02:15:57 +0200 (MEST)

> just when I did
> # mdadm -C /dev/md2 -b internal -e 1.0 -l 10 -n 4 /dev/sd[cdef]4
> (created)
> # mdadm -D /dev/md2
> Killed
> 
> dmesg filled up with a kernel oops. A few seconds later, the box
> locked solid. Since I was only in by ssh and there is not (yet) any
> possibility to reset it remotely, this is all I can give right now,
> the last 80x25 screen:

Unfortunately the beginning of the OOPS is the most important part,
since it says where exactly the kernel died. The rest of the log you
showed only gives half the registers and the rest of the call trace.
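
To illustrate what the partial log does still contain, here is a small sketch that pulls the function names out of the `Caller[...]` lines of a saved trace. The helper name `oops_callers` is hypothetical, and the pattern only assumes the `Caller[addr]: symbol+0xoff/0xsize` layout seen in the quoted screen:

```shell
# oops_callers: hypothetical helper, reads oops text on stdin and prints
# one symbol per "Caller[addr]: symbol+0xoff/0xsize" line, in trace order.
# Register lines and entries without a symbol (e.g. "0x3c6bc") are skipped.
oops_callers() {
    sed -n 's/^Caller\[[0-9a-f]*]: \([A-Za-z0-9_]*\)+0x.*/\1/p'
}
```

Something like `dmesg | oops_callers` would then list the call chain, but it cannot recover the faulting PC or the trap type, which are only printed at the start of the oops.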

Please try to capture the whole thing.
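
One way to capture the whole thing on a box that is only reachable over ssh might be netconsole, which sends kernel messages over UDP to a second machine. A minimal sketch, assuming the network driver in use supports netpoll; the helper name `netconsole_param` and every address, port, interface and MAC below are placeholders, not values from this report:

```shell
# Build the module parameter in the documented form
#   netconsole=[src-port]@[src-ip]/[dev],[tgt-port]@<tgt-ip>/[tgt-macaddr]
# netconsole_param is a hypothetical helper; all values passed to it
# here are placeholders.
netconsole_param() {
    printf 'netconsole=%s@%s/%s,%s@%s/%s\n' "$1" "$2" "$3" "$4" "$5" "$6"
}

# On the receiving machine (netcat option syntax varies between variants):
#   nc -u -l -p 6666 | tee oops.log
# On the box under test, as root, before reproducing the crash:
#   dmesg -n 8    # let all kernel messages reach the console
#   modprobe netconsole "$(netconsole_param 6665 192.168.0.10 eth0 6666 192.168.0.20 00:11:22:33:44:55)"
```

The receiver should then see the oops from its first line, even if the box locks up afterwards.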

Please also provide hardware type information, which you
should give in any bug report like this.


* Re: raid10 kernel panic on sparc64
  2007-04-02  0:15 ` Jan Engelhardt
@ 2007-04-12 21:26   ` David Miller
  -1 siblings, 0 replies; 14+ messages in thread
From: David Miller @ 2007-04-12 21:26 UTC (permalink / raw)
  To: jengelh; +Cc: linux-raid, sparclinux

From: Jan Engelhardt <jengelh@linux01.gwdg.de>
Date: Mon, 2 Apr 2007 02:15:57 +0200 (MEST)

> Kernel is kernel-smp-2.6.16-1.2128sp4.sparc64.rpm from Aurora Corona.
> Perhaps it helps, otherwise hold your breath until I reproduce it.

Jan, if you can reproduce this with the current 2.6.20 vanilla
kernel I'd be very interested in a full trace so that I can
try to fix this.

With the combination of an old kernel and only part of the
crash trace, there isn't much I can do with this report.


* Re: raid10 kernel panic on sparc64
  2007-04-12 21:26   ` David Miller
@ 2007-04-12 21:28     ` Jan Engelhardt
  -1 siblings, 0 replies; 14+ messages in thread
From: Jan Engelhardt @ 2007-04-12 21:28 UTC (permalink / raw)
  To: David Miller; +Cc: linux-raid, sparclinux


On Apr 12 2007 14:26, David Miller wrote:
>From: Jan Engelhardt <jengelh@linux01.gwdg.de>
>Date: Mon, 2 Apr 2007 02:15:57 +0200 (MEST)
>
>> Kernel is kernel-smp-2.6.16-1.2128sp4.sparc64.rpm from Aurora Corona.
>> Perhaps it helps, otherwise hold your breath until I reproduce it.
>
>Jan, if you can reproduce this with the current 2.6.20 vanilla
>kernel I'd be very interested in a full trace so that I can
>try to fix this.
>
>With the combination of an old kernel and only part of the
>crash trace, there isn't much I can do with this report.

Hi David,

I have not forgotten this issue, but the fact that there is no serial
console attached right now makes it kinda hard to get the system back up
in case I let it oops. Apologies for the delay.


Jan
-- 


* Re: raid10 kernel panic on sparc64
  2007-04-12 21:26   ` David Miller
@ 2007-05-26 15:10     ` Jan Engelhardt
  -1 siblings, 0 replies; 14+ messages in thread
From: Jan Engelhardt @ 2007-05-26 15:10 UTC (permalink / raw)
  To: David Miller; +Cc: linux-raid, sparclinux


On Apr 12 2007 14:26, David Miller wrote:
>
>> Kernel is kernel-smp-2.6.16-1.2128sp4.sparc64.rpm from Aurora Corona.
>> Perhaps it helps, otherwise hold your breath until I reproduce it.
>
>Jan, if you can reproduce this with the current 2.6.20 vanilla
>kernel I'd be very interested in a full trace so that I can
>try to fix this.
>
>With the combination of an old kernel and only part of the
>crash trace, there isn't much I can do with this report.

Does not seem to happen under 2.6.21-1.3149.al3.2smp anymore.


	Jan
-- 


* Re: raid10 kernel panic on sparc64
  2007-05-26 15:10     ` Jan Engelhardt
@ 2007-05-26 22:50       ` David Miller
  -1 siblings, 0 replies; 14+ messages in thread
From: David Miller @ 2007-05-26 22:50 UTC (permalink / raw)
  To: jengelh; +Cc: linux-raid, sparclinux

From: Jan Engelhardt <jengelh@linux01.gwdg.de>
Date: Sat, 26 May 2007 17:10:30 +0200 (MEST)

> 
> On Apr 12 2007 14:26, David Miller wrote:
> >
> >> Kernel is kernel-smp-2.6.16-1.2128sp4.sparc64.rpm from Aurora Corona.
> >> Perhaps it helps, otherwise hold your breath until I reproduce it.
> >
> >Jan, if you can reproduce this with the current 2.6.20 vanilla
> >kernel I'd be very interested in a full trace so that I can
> >try to fix this.
> >
> >With the combination of an old kernel and only part of the
> >crash trace, there isn't much I can do with this report.
> 
> Does not seem to happen under 2.6.21-1.3149.al3.2smp anymore.

Thanks for following up on this, Jan.

I'd personally really appreciate reports against upstream
instead of dist kernels in the future, and I'm sure the
linux-raid maintainers feel similarly :-)


* Re: raid10 kernel panic on sparc64
  2007-05-26 22:50       ` David Miller
@ 2007-05-29 20:11         ` Jan Engelhardt
  -1 siblings, 0 replies; 14+ messages in thread
From: Jan Engelhardt @ 2007-05-29 20:11 UTC (permalink / raw)
  To: David Miller; +Cc: linux-raid, sparclinux


On May 26 2007 15:50, David Miller wrote:
>> >
>> >> Kernel is kernel-smp-2.6.16-1.2128sp4.sparc64.rpm from Aurora Corona.
>> >> Perhaps it helps, otherwise hold your breath until I reproduce it.
>> >
>> >Jan, if you can reproduce this with the current 2.6.20 vanilla
>> >kernel I'd be very interested in a full trace so that I can
>> >try to fix this.
>> >
>> >With the combination of an old kernel and only part of the
>> >crash trace, there isn't much I can do with this report.
>> 
>> Does not seem to happen under 2.6.21-1.3149.al3.2smp anymore.
>
>Thanks for following up on this Jan.
>
>I'd personally really appreciate reports against upstream
>instead of dist kernels in the future, and I'm sure the
>linux-raid maintainers feel similarly :-)

The days when compiling a full kernel on 400-800 MHz boxes
took less than 45 minutes are unfortunately over.


	Jan
-- 

