* system hang on HDIO_DRIVE_RESET! help! @ 2003-02-26 16:45 rain.wang 2003-02-26 19:44 ` Alan Cox 0 siblings, 1 reply; 12+ messages in thread From: rain.wang @ 2003-02-26 16:45 UTC (permalink / raw) To: linux-kernel Hi, I did HDIO_DRIVE_RESET ioctl, but system hung without any response, only printed some mesages from kernel(v2.4.20): hda: DMA disabled hda: ide_set_handler: handler not null; old=c01ce300, new=c01d4400 bug: kernel timer added twice at c01ce102 would you please help me with it? Regards rain.w ^ permalink raw reply [flat|nested] 12+ messages in thread
* Re: system hang on HDIO_DRIVE_RESET! help! 2003-02-26 16:45 system hang on HDIO_DRIVE_RESET! help! rain.wang @ 2003-02-26 19:44 ` Alan Cox 2003-02-28 5:04 ` rain.wang 0 siblings, 1 reply; 12+ messages in thread From: Alan Cox @ 2003-02-26 19:44 UTC (permalink / raw) To: rain.wang; +Cc: Linux Kernel Mailing List On Wed, 2003-02-26 at 16:45, rain.wang wrote: > Hi, > I did HDIO_DRIVE_RESET ioctl, but system hung without any response, > only printed some mesages from kernel(v2.4.20): > > hda: DMA disabled > hda: ide_set_handler: handler not null; old=c01ce300, new=c01d4400 > bug: kernel timer added twice at c01ce102 > > would you please help me with it? Does this still occur on 2.4.21pre. It should be fixed now ^ permalink raw reply [flat|nested] 12+ messages in thread
* Re: system hang on HDIO_DRIVE_RESET! help! 2003-02-26 19:44 ` Alan Cox @ 2003-02-28 5:04 ` rain.wang 2003-02-28 13:35 ` Alan Cox 2003-03-04 13:22 ` rain.wang 0 siblings, 2 replies; 12+ messages in thread From: rain.wang @ 2003-02-28 5:04 UTC (permalink / raw) To: Alan Cox; +Cc: Linux Kernel Mailing List Alan Cox wrote: > On Wed, 2003-02-26 at 16:45, rain.wang wrote: > > Hi, > > I did HDIO_DRIVE_RESET ioctl, but system hung without any response, > > only printed some mesages from kernel(v2.4.20): > > > > hda: DMA disabled > > hda: ide_set_handler: handler not null; old=c01ce300, new=c01d4400 > > bug: kernel timer added twice at c01ce102 > > > > would you please help me with it? > > Does this still occur on 2.4.21pre. It should be fixed now I had tested 'hdparm -w /dev/hda' under 2.4.21-pre4, but problem sill exist, just same message as in 2.4.20. rain.w ^ permalink raw reply [flat|nested] 12+ messages in thread
* Re: system hang on HDIO_DRIVE_RESET! help! 2003-02-28 5:04 ` rain.wang @ 2003-02-28 13:35 ` Alan Cox 2003-02-28 13:30 ` rain.wang 2003-03-04 13:22 ` rain.wang 1 sibling, 1 reply; 12+ messages in thread From: Alan Cox @ 2003-02-28 13:35 UTC (permalink / raw) To: rain.wang; +Cc: Linux Kernel Mailing List On Fri, 2003-02-28 at 05:04, rain.wang wrote: > > Does this still occur on 2.4.21pre. It should be fixed now > > I had tested 'hdparm -w /dev/hda' under 2.4.21-pre4, but problem sill exist, > > just same message as in 2.4.20. What controller are you using and I'll look into it a bit further ^ permalink raw reply [flat|nested] 12+ messages in thread
* Re: system hang on HDIO_DRIVE_RESET! help! 2003-02-28 13:35 ` Alan Cox @ 2003-02-28 13:30 ` rain.wang 0 siblings, 0 replies; 12+ messages in thread From: rain.wang @ 2003-02-28 13:30 UTC (permalink / raw) To: Alan Cox; +Cc: Linux Kernel Mailing List Alan Cox wrote: > On Fri, 2003-02-28 at 05:04, rain.wang wrote: > > > Does this still occur on 2.4.21pre. It should be fixed now > > > > I had tested 'hdparm -w /dev/hda' under 2.4.21-pre4, but problem sill exist, > > > > just same message as in 2.4.20. > > What controller are you using and I'll look into it a bit further Intel 82801AA host controller, and I found when I disabled DMA before doing drive reset, system wouldn't hang at most time. It seemed not tight related with host chip, does it? rain.w ^ permalink raw reply [flat|nested] 12+ messages in thread
* Re: system hang on HDIO_DRIVE_RESET! help! 2003-02-28 5:04 ` rain.wang 2003-02-28 13:35 ` Alan Cox @ 2003-03-04 13:22 ` rain.wang 2003-03-04 15:27 ` Alan Cox 1 sibling, 1 reply; 12+ messages in thread From: rain.wang @ 2003-03-04 13:22 UTC (permalink / raw) To: Alan Cox, Linux Kernel Mailing List "rain.wang" wrote: > Alan Cox wrote: > > > On Wed, 2003-02-26 at 16:45, rain.wang wrote: > > > Hi, > > > I did HDIO_DRIVE_RESET ioctl, but system hung without any response, > > > only printed some mesages from kernel(v2.4.20): > > > > > > hda: DMA disabled > > > hda: ide_set_handler: handler not null; old=c01ce300, new=c01d4400 > > > bug: kernel timer added twice at c01ce102 > > > > > > would you please help me with it? > > > > Does this still occur on 2.4.21pre. It should be fixed now > > I had tested 'hdparm -w /dev/hda' under 2.4.21-pre4, but problem sill exist, > > just same message as in 2.4.20. > > rain.w Hi Alan, I had tested 'hdparm -w /dev/hda' under 2.4.25-pre5-ac1, system crashed with kernel oops message: kernel BUG at ide-iops:1046! ... can this be resolved? rain.w ^ permalink raw reply [flat|nested] 12+ messages in thread
* Re: system hang on HDIO_DRIVE_RESET! help! 2003-03-04 13:22 ` rain.wang @ 2003-03-04 15:27 ` Alan Cox 2003-03-07 6:04 ` rain.wang 0 siblings, 1 reply; 12+ messages in thread From: Alan Cox @ 2003-03-04 15:27 UTC (permalink / raw) To: rain.wang; +Cc: Linux Kernel Mailing List On Tue, 2003-03-04 at 13:22, rain.wang wrote: > I had tested 'hdparm -w /dev/hda' under 2.4.25-pre5-ac1, system > crashed > with > kernel oops message: > kernel BUG at ide-iops:1046! > ... > > can this be resolved? Once I understand what the problems all are yes. The BUG() is good, it confirms that what we are both seeing is the same thing - the reset is managing to issue two commands to the controller at the same time. ^ permalink raw reply [flat|nested] 12+ messages in thread
* Re: system hang on HDIO_DRIVE_RESET! help! 2003-03-04 15:27 ` Alan Cox @ 2003-03-07 6:04 ` rain.wang 2003-03-07 12:58 ` Alan Cox 0 siblings, 1 reply; 12+ messages in thread From: rain.wang @ 2003-03-07 6:04 UTC (permalink / raw) To: Alan Cox; +Cc: Linux Kernel Mailing List Alan Cox wrote: > On Tue, 2003-03-04 at 13:22, rain.wang wrote: > > I had tested 'hdparm -w /dev/hda' under 2.4.25-pre5-ac1, system > > crashed > > with > > kernel oops message: > > kernel BUG at ide-iops:1046! > > ... > > > > can this be resolved? > > Once I understand what the problems all are yes. The BUG() is good, it > confirms that what we are both seeing is the same thing - the reset is > managing to issue two commands to the controller at the same time. Hi, thank you, Alan. I tested pre5-ac2 patch and that seems all ok. rain.w ^ permalink raw reply [flat|nested] 12+ messages in thread
* Re: system hang on HDIO_DRIVE_RESET! help! 2003-03-07 6:04 ` rain.wang @ 2003-03-07 12:58 ` Alan Cox 2003-03-14 8:28 ` rain.wang 0 siblings, 1 reply; 12+ messages in thread From: Alan Cox @ 2003-03-07 12:58 UTC (permalink / raw) To: rain.wang; +Cc: Linux Kernel Mailing List On Fri, 2003-03-07 at 06:04, rain.wang wrote: > > Once I understand what the problems all are yes. The BUG() is good, it > > confirms that what we are both seeing is the same thing - the reset is > > managing to issue two commands to the controller at the same time. > > Hi, > thank you, Alan. I tested pre5-ac2 patch and that seems all ok. Thanks for the confirmation it is fixed ^ permalink raw reply [flat|nested] 12+ messages in thread
* Re: system hang on HDIO_DRIVE_RESET! help! 2003-03-07 12:58 ` Alan Cox @ 2003-03-14 8:28 ` rain.wang 2003-03-14 9:13 ` Andre Hedrick 0 siblings, 1 reply; 12+ messages in thread From: rain.wang @ 2003-03-14 8:28 UTC (permalink / raw) To: Alan Cox; +Cc: Linux Kernel Mailing List Alan Cox wrote: > On Fri, 2003-03-07 at 06:04, rain.wang wrote: > > > Once I understand what the problems all are yes. The BUG() is good, it > > > confirms that what we are both seeing is the same thing - the reset is > > > managing to issue two commands to the controller at the same time. > > > > Hi, > > thank you, Alan. I tested pre5-ac2 patch and that seems all ok. > > Thanks for the confirmation it is fixed Hi Alan, for 2.4.21-pre5-ac2 and -ac3 patch also. there's still problem on reset. when I do 'hdparm -w /dev/hda' once after another, all seems ok. but when I make a shell script and let 'hdparm -w' run in several times loop, system would always crashed at the second time and left oops messages: kernel BUG at ide.c:1700! ... so, if any bugs still locking there? rain.w ^ permalink raw reply [flat|nested] 12+ messages in thread
* Re: system hang on HDIO_DRIVE_RESET! help! 2003-03-14 8:28 ` rain.wang @ 2003-03-14 9:13 ` Andre Hedrick 2003-03-14 14:21 ` Alan Cox 0 siblings, 1 reply; 12+ messages in thread From: Andre Hedrick @ 2003-03-14 9:13 UTC (permalink / raw) To: rain.wang; +Cc: Alan Cox, Linux Kernel Mailing List Rain, The only way to deal with this is to treat the operations a failed and punch them back out to block for clean up. Now we failed the a command. However, I think I need to set a default block hook during the reset process for the drive, channel, hba ... depending on the magnitude of the wrecking ball generated. I need to offline Alan for this core dump. The hang is in the clean ups after the reset. I suspect the driver/hba is in DMA and drive is not. Cheers, Andre Hedrick LAD Storage Consulting Group ------------------------------------ Pokemon (n), A Jamaican proctologist ------------------------------------ On Fri, 14 Mar 2003, rain.wang wrote: > Alan Cox wrote: > > > On Fri, 2003-03-07 at 06:04, rain.wang wrote: > > > > Once I understand what the problems all are yes. The BUG() is good, it > > > > confirms that what we are both seeing is the same thing - the reset is > > > > managing to issue two commands to the controller at the same time. > > > > > > Hi, > > > thank you, Alan. I tested pre5-ac2 patch and that seems all ok. > > > > Thanks for the confirmation it is fixed > > Hi Alan, > for 2.4.21-pre5-ac2 and -ac3 patch also. > there's still problem on reset. when I do 'hdparm -w /dev/hda' once > after another, all seems ok. but when I make a shell script and let > 'hdparm -w' run in several times loop, system would always crashed > at the second time and left oops messages: > kernel BUG at ide.c:1700! > ... > so, if any bugs still locking there? > > rain.w > > > - > To unsubscribe from this list: send the line "unsubscribe linux-kernel" in > the body of a message to majordomo@vger.kernel.org > More majordomo info at http://vger.kernel.org/majordomo-info.html > Please read the FAQ at http://www.tux.org/lkml/ > ^ permalink raw reply [flat|nested] 12+ messages in thread
* Re: system hang on HDIO_DRIVE_RESET! help! 2003-03-14 9:13 ` Andre Hedrick @ 2003-03-14 14:21 ` Alan Cox 0 siblings, 0 replies; 12+ messages in thread From: Alan Cox @ 2003-03-14 14:21 UTC (permalink / raw) To: Andre Hedrick; +Cc: rain.wang, Linux Kernel Mailing List On Fri, 2003-03-14 at 09:13, Andre Hedrick wrote: > Rain, > > The only way to deal with this is to treat the operations a failed and > punch them back out to block for clean up. Now we failed the a command. > However, I think I need to set a default block hook during the reset > process for the drive, channel, hba ... depending on the magnitude of the > wrecking ball generated. I need to offline Alan for this core dump. I fixed one set of races with resets and it doesnt suprise me there is another right now. ^ permalink raw reply [flat|nested] 12+ messages in thread
end of thread, other threads:[~2003-03-14 13:02 UTC | newest] Thread overview: 12+ messages (download: mbox.gz / follow: Atom feed) -- links below jump to the message on this page -- 2003-02-26 16:45 system hang on HDIO_DRIVE_RESET! help! rain.wang 2003-02-26 19:44 ` Alan Cox 2003-02-28 5:04 ` rain.wang 2003-02-28 13:35 ` Alan Cox 2003-02-28 13:30 ` rain.wang 2003-03-04 13:22 ` rain.wang 2003-03-04 15:27 ` Alan Cox 2003-03-07 6:04 ` rain.wang 2003-03-07 12:58 ` Alan Cox 2003-03-14 8:28 ` rain.wang 2003-03-14 9:13 ` Andre Hedrick 2003-03-14 14:21 ` Alan Cox
This is a public inbox, see mirroring instructions for how to clone and mirror all data and code used for this inbox; as well as URLs for NNTP newsgroup(s).