linux-kernel.vger.kernel.org archive mirror
* 2.4.23-pac1 (and others) issue with PDC20265
@ 2003-12-03 21:10 tabris
  2003-12-03 23:55 ` tabris
  2003-12-04  1:08 ` tabris
  0 siblings, 2 replies; 3+ messages in thread
From: tabris @ 2003-12-03 21:10 UTC (permalink / raw)
  To: linux-kernel

-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1

I have an ASUS A7V266-E motherboard with an AMD AthlonXP 1800+ CPU and
several hard drives of various makes.

I've tested a few different kernels, all (afaik) with preempt:
2.4.22-10mdk, 2.4.23-pac1+preempt+lowlatency, and 2.4.22-ac4+preempt.

What I'm getting are deadlocks, which I think are DMA issues, when copying
between hdf and hdg.
According to /proc/ide/*/model:
hda is a WDC AC28400R
hdb is a Maxtor 93652U8
hde is a WDC WD200BB-32BSA0
hdf is a Maxtor 4D060H3
hdg is a Maxtor 4D060H3 (just installed today, likely used)

I've only gotten ONE error message:
Dec  3 12:51:12 tabriel kernel: hdf: dma_timer_expiry: dma status == 0x61

and that was with 2.4.22-10mdk.

The rest all deadlock without ANY error messages.
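
For reference, the "dma status" value in that message looks like the IDE
bus-master status register. Below is a minimal Python sketch for decoding
it, assuming the standard bus-master IDE bit layout. The table and function
names are made up for illustration and are not taken from the kernel.

```python
# Bit layout of the bus-master IDE status register (assumed standard layout).
BM_STATUS_BITS = {
    0x01: "bus master active",
    0x02: "DMA error",
    0x04: "interrupt raised",
    0x20: "drive 0 DMA capable",
    0x40: "drive 1 DMA capable",
}

def decode_dma_status(value):
    """Return the names of the bits set in a bus-master status byte."""
    return [name for bit, name in sorted(BM_STATUS_BITS.items()) if value & bit]

print(decode_dma_status(0x61))
```

Read this way, 0x61 would mean the DMA engine was still marked active, with
neither the error bit nor the interrupt bit set, when the timer expired.
That would fit a transfer that never completed, though it does not say why.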

Help?

- --
tabris
Currently copying from hdf -> hdg with DMA turned off on hdf, going at
about 1MB/minute with some spikes.
- -
"Life would be much simpler and things would get done much faster if it
weren't for other people"
		-- Blore
-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.2.3 (GNU/Linux)

iD8DBQE/zlFI1U5ZaPMbKQcRAlRBAJ9vJp2ujbmf8KcytXH6E3e0mG+P7gCeL951
p4JfhV5Y+eRpaq289AHRZ0Y=
=Zefz
-----END PGP SIGNATURE-----



* Re: 2.4.23-pac1 (and others) issue with PDC20265
  2003-12-03 21:10 2.4.23-pac1 (and others) issue with PDC20265 tabris
@ 2003-12-03 23:55 ` tabris
  2003-12-04  1:08 ` tabris
  1 sibling, 0 replies; 3+ messages in thread
From: tabris @ 2003-12-03 23:55 UTC (permalink / raw)
  To: linux-kernel

-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1

On Wednesday 03 December 2003 4:10 pm, tabris wrote:
> I have an ASUS A7V266-E motherboard, AMD AthlonXP 1800+ CPU.
> several hard drives of various makes.
>
> I've tested a couple different kernels... all (afaik) with preempt.
> I've tried 2.4.22-10mdk, plus 2.4.23-pac1+preempt+lowlatency, and also
> a 2.4.22-ac4+preempt
I should also mention the slow and lost interrupts when copying from
hdf->hdg without DMA: I have to run ntpdate in a bash loop to keep the
clock going, since it loses 6-12 seconds every 15 seconds. (HZ=1000 and
the jif64 patch by Tim Schmielau are applied, ported to -ac & -pac by me.)
The mouse is sluggish as well.
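
To put the clock loss in perspective, here is a rough Python sketch of the
arithmetic. It assumes that every lost second corresponds to HZ missed timer
ticks, which is a simplification; the function name is hypothetical.

```python
HZ = 1000  # timer ticks per second, as stated in the message

def lost_fraction(lost_seconds, window_seconds=15):
    """Fraction of wall-clock time that the system clock fails to count."""
    return lost_seconds / window_seconds

for lost in (6, 12):
    frac = lost_fraction(lost)
    missed_ticks = lost * HZ  # assumption: one missed tick per lost 1/HZ second
    print(f"{lost}s lost per 15s: {frac:.0%} of time, ~{missed_ticks} missed ticks")
```

Under that assumption, the clock is missing somewhere between 40% and 80%
of its timer interrupts while the copy runs.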

Sorry for the very grotty grammar. -ENOCAFFEINE
- --
tabris
- -
They went rushing down that freeway,
Messed around and got lost.
They didn't care... they were just dying to get off,
And it was life in the fast lane.
		-- Eagles, "Life in the Fast Lane"
-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.2.3 (GNU/Linux)

iD8DBQE/znfe1U5ZaPMbKQcRAvNZAJ9laNiExPTel4n27GochvQ+GI2vKQCcD1KT
n15JAhq9QGQyoLRe9oK+oEo=
=6Ya+
-----END PGP SIGNATURE-----



* Re: 2.4.23-pac1 (and others) issue with PDC20265
  2003-12-03 21:10 2.4.23-pac1 (and others) issue with PDC20265 tabris
  2003-12-03 23:55 ` tabris
@ 2003-12-04  1:08 ` tabris
  1 sibling, 0 replies; 3+ messages in thread
From: tabris @ 2003-12-04  1:08 UTC (permalink / raw)
  To: linux-kernel

[-- Attachment #1: clearsigned data --]
[-- Type: Text/Plain, Size: 1374 bytes --]

-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1

On Wednesday 03 December 2003 4:10 pm, tabris wrote:
> I have an ASUS A7V266-E motherboard, AMD AthlonXP 1800+ CPU.
> several hard drives of various makes.
>
> I've tested a couple different kernels... all (afaik) with preempt.
> I've tried 2.4.22-10mdk, plus 2.4.23-pac1+preempt+lowlatency, and also
> a 2.4.22-ac4+preempt
>
> What I'm getting are deadlocks on what I think are dma issues when
> copying btwn hdf and hdg.
> according to /proc/ide/*/model
> hda is a WDC AC28400R
> hdb is a Maxtor 93652U8
> hde is a WDC WD200BB-32BSA0
> hdf is a Maxtor 4D060H3
> hdg is a Maxtor 4D060H3 (just installed today, likely used)
>

OK, on some advice, I reorganized the drives to:
hda is a WDC AC28400R - reiserfs
hdb is a Maxtor 93652U8 - reiserfs
.
hdf is a Maxtor 4D060H3 - reiserfs
hdg is a Maxtor 4D060H3 (just installed today, likely used) - xfs
.
hdh is a WDC WD200BB-32BSA0 - reiserfs

Now, instead of deadlocks, I got an oops, which is attached along with the
syslog.

- --
tabris
who probably has to reboot now.
- -
The problem with this country is that there is no death penalty for
incompetence.
-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.2.3 (GNU/Linux)

iD8DBQE/zoj21U5ZaPMbKQcRAqkeAKCD6k95rJJcNXJGp5bzPMCWaFNIdACghRwg
THwZDggMRqQu1YMKUATFnuI=
=4Wzg
-----END PGP SIGNATURE-----

[-- Attachment #2: oops-20031203-ksymoops.log --]
[-- Type: text/plain, Size: 3962 bytes --]

ksymoops 2.4.9 on i686 2.4.23-tab2+jif64.  Options used
     -V (default)
     -k /proc/ksyms (default)
     -l /proc/modules (default)
     -o /lib/modules/2.4.23-tab2+jif64/ (default)
     -m /boot/System.map-2.4.23-tab2+jif64 (default)

Warning: You did not tell me where to find symbol information.  I will
assume that the log matches the kernel and modules that are running
right now and I'll use the default options above for symbol resolution.
If the current kernel and/or modules do not match the log, you can get
more accurate output by telling me the kernel version and where to find
map, modules, ksyms etc.  ksymoops -h explains the options.

Dec  3 19:57:53 tabriel kernel: kernel BUG at prints.c:341!
Dec  3 19:57:53 tabriel kernel: invalid operand: 0000
Dec  3 19:57:53 tabriel kernel: CPU:    0
Dec  3 19:57:53 tabriel kernel: EIP:    0010:[reiserfs_panic+69/128]    Not tainted
Dec  3 19:57:53 tabriel kernel: EIP:    0010:[<c01928a5>]    Not tainted
Using defaults from ksymoops -t elf32-i386 -a i386
Dec  3 19:57:53 tabriel kernel: EFLAGS: 00010286
Dec  3 19:57:53 tabriel kernel: eax: 00000039   ebx: df549c00   ecx: c15a202c   edx: 00000000
Dec  3 19:57:53 tabriel kernel: esi: 00000009   edi: 00000009   ebp: df549c00   esp: c15a3e0c
Dec  3 19:57:53 tabriel kernel: ds: 0018   es: 0018   ss: 0018
Dec  3 19:57:53 tabriel kernel: Process kupdated (pid: 6, stackpage=c15a3000)
Dec  3 19:57:53 tabriel kernel: Stack: c0280611 c032b080 c0329d60 e0a822fc c01a3aaf df549c00 c0291160 e0a82324
Dec  3 19:57:53 tabriel kernel:        c0145eb5 0000000c dee03000 c15a2000 c15a2000 0000000a 00000000 cf515cc0
Dec  3 19:57:53 tabriel kernel:        df549c00 00000008 cef69000 00000000 c01a8620 df549c00 e0a822fc 00000001
Dec  3 19:57:53 tabriel kernel: Call Trace:    [flush_commit_list+1119/1200] [fsync_buffers_list+437/544] [do_journal_end+2064/3072] [get_cnode+122/144] [flush_old_commits+309/464]
Dec  3 19:57:53 tabriel kernel: Call Trace:    [<c01a3aaf>] [<c0145eb5>] [<c01a8620>] [<c01a328a>] [<c01a7555>]
Dec  3 19:57:53 tabriel kernel:   [<c015deb5>] [<c018f230>] [<c0149b79>] [<c0148b64>] [<c0148ebc>] [<c0107562>]
Dec  3 19:57:53 tabriel kernel:   [<c0148de0>] [<c0105000>] [<c010581b>] [<c0148de0>]
Dec  3 19:57:53 tabriel kernel: Code: 0f 0b 55 01 24 06 28 c0 85 db b8 2d 06 28 c0 74 0c 0f b7 43


>>EIP; c01928a5 <reiserfs_panic+45/80>   <=====

>>ebx; df549c00 <_end+1f1fb994/20569df4>
>>ecx; c15a202c <_end+1253dc0/20569df4>
>>ebp; df549c00 <_end+1f1fb994/20569df4>
>>esp; c15a3e0c <_end+1255ba0/20569df4>

Trace; c01a3aaf <flush_commit_list+45f/4b0>
Trace; c0145eb5 <fsync_buffers_list+1b5/220>
Trace; c01a8620 <do_journal_end+810/c00>
Trace; c01a328a <get_cnode+7a/90>
Trace; c01a7555 <flush_old_commits+135/1d0>
Trace; c015deb5 <__sync_one+b5/19a>
Trace; c018f230 <reiserfs_write_super+50/90>
Trace; c0149b79 <sync_supers+159/190>
Trace; c0148b64 <sync_old_buffers+34/d0>
Trace; c0148ebc <kupdate+dc/1c0>
Trace; c0107562 <ret_from_fork+6/20>
Trace; c0148de0 <kupdate+0/1c0>
Trace; c0105000 <_stext+0/0>
Trace; c010581b <arch_kernel_thread+2b/40>
Trace; c0148de0 <kupdate+0/1c0>

Code;  c01928a5 <reiserfs_panic+45/80>
00000000 <_EIP>:
Code;  c01928a5 <reiserfs_panic+45/80>   <=====
   0:   0f 0b                     ud2a      <=====
Code;  c01928a7 <reiserfs_panic+47/80>
   2:   55                        push   %ebp
Code;  c01928a8 <reiserfs_panic+48/80>
   3:   01 24 06                  add    %esp,(%esi,%eax,1)
Code;  c01928ab <reiserfs_panic+4b/80>
   6:   28 c0                     sub    %al,%al
Code;  c01928ad <reiserfs_panic+4d/80>
   8:   85 db                     test   %ebx,%ebx
Code;  c01928af <reiserfs_panic+4f/80>
   a:   b8 2d 06 28 c0            mov    $0xc028062d,%eax
Code;  c01928b4 <reiserfs_panic+54/80>
   f:   74 0c                     je     1d <_EIP+0x1d>
Code;  c01928b6 <reiserfs_panic+56/80>
  11:   0f b7 43 00               movzwl 0x0(%ebx),%eax


1 warning issued.  Results may not be reliable.
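
When comparing this ksymoops output with the raw syslog, note that the raw
oops prints symbol offsets in decimal (reiserfs_panic+69/128) while ksymoops
prints them in hex (reiserfs_panic+45/80). A small Python sketch of the
conversion follows; the function name is made up for illustration.

```python
import re

def to_ksymoops_form(entry):
    """Convert 'symbol+DEC/DEC' (raw oops) to 'symbol+HEX/HEX' (ksymoops)."""
    match = re.fullmatch(r"(\w+)\+(\d+)/(\d+)", entry)
    if match is None:
        raise ValueError(f"unrecognized entry: {entry!r}")
    symbol = match.group(1)
    offset = int(match.group(2))
    size = int(match.group(3))
    return f"{symbol}+{offset:x}/{size:x}"

print(to_ksymoops_form("reiserfs_panic+69/128"))  # prints reiserfs_panic+45/80
```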

[-- Attachment #3: oops-20031203.log --]
[-- Type: text/plain, Size: 4654 bytes --]

Dec  3 19:56:50 tabriel kernel: hdh: dma_intr: status=0xd1 { Busy }
Dec  3 19:56:50 tabriel kernel:
Dec  3 19:56:50 tabriel kernel: hdg: DMA disabled
Dec  3 19:56:50 tabriel kernel: hdh: DMA disabled
Dec  3 19:56:50 tabriel kernel: PDC202XX: Secondary channel reset.
Dec  3 19:56:50 tabriel kernel: PDC202XX: Primary channel reset.
Dec  3 19:56:50 tabriel kernel: ide3: reset: master: error (0x00?)
Dec  3 19:57:13 tabriel kernel: hde: dma_timer_expiry: dma status == 0x21
Dec  3 19:57:23 tabriel kernel: hde: error waiting for DMA
Dec  3 19:57:23 tabriel kernel: hde: dma timeout retry: status=0x58 { DriveReady SeekComplete DataRequest }
Dec  3 19:57:23 tabriel kernel:
Dec  3 19:57:23 tabriel kernel: hde: status timeout: status=0xd0 { Busy }
Dec  3 19:57:23 tabriel kernel:
Dec  3 19:57:23 tabriel kernel: PDC202XX: Primary channel reset.
Dec  3 19:57:23 tabriel kernel: PDC202XX: Secondary channel reset.
Dec  3 19:57:23 tabriel kernel: hde: drive not ready for command
Dec  3 19:57:23 tabriel kernel: ide2: reset: master: error (0x00?)
Dec  3 19:57:23 tabriel kernel: blk: queue c0344f58, I/O limit 4095Mb (mask 0xffffffff)
Dec  3 19:57:33 tabriel kernel: hdh: lost interrupt
Dec  3 19:57:43 tabriel kernel: hdh: lost interrupt
Dec  3 19:57:43 tabriel kernel: hde: dma_timer_expiry: dma status == 0x21
Dec  3 19:57:53 tabriel kernel: hdh: lost interrupt
Dec  3 19:57:53 tabriel kernel: hde: error waiting for DMA
Dec  3 19:57:53 tabriel kernel: hde: dma timeout retry: status=0x58 { DriveReady SeekComplete DataRequest }
Dec  3 19:57:53 tabriel kernel:
Dec  3 19:57:53 tabriel kernel: hde: status timeout: status=0xd0 { Busy }
Dec  3 19:57:53 tabriel kernel:
Dec  3 19:57:53 tabriel kernel: PDC202XX: Primary channel reset.
Dec  3 19:57:53 tabriel kernel: PDC202XX: Secondary channel reset.
Dec  3 19:57:53 tabriel kernel: hde: drive not ready for command
Dec  3 19:57:53 tabriel kernel: ide2: reset: master: error (0x00?)
Dec  3 19:57:53 tabriel kernel: end_request: I/O error, dev 21:01 (hde), sector 25120
Dec  3 19:57:53 tabriel kernel: journal-615: buffer write failed
Dec  3 19:57:53 tabriel kernel:  (device ide2(33,1))
Dec  3 19:57:53 tabriel kernel: kernel BUG at prints.c:341!
Dec  3 19:57:53 tabriel kernel: invalid operand: 0000
Dec  3 19:57:53 tabriel kernel: CPU:    0
Dec  3 19:57:53 tabriel kernel: EIP:    0010:[reiserfs_panic+69/128]    Not tainted
Dec  3 19:57:53 tabriel kernel: EIP:    0010:[<c01928a5>]    Not tainted
Dec  3 19:57:53 tabriel kernel: EFLAGS: 00010286
Dec  3 19:57:53 tabriel kernel: eax: 00000039   ebx: df549c00   ecx: c15a202c   edx: 00000000
Dec  3 19:57:53 tabriel kernel: esi: 00000009   edi: 00000009   ebp: df549c00   esp: c15a3e0c
Dec  3 19:57:53 tabriel kernel: ds: 0018   es: 0018   ss: 0018
Dec  3 19:57:53 tabriel kernel: Process kupdated (pid: 6, stackpage=c15a3000)
Dec  3 19:57:53 tabriel kernel: Stack: c0280611 c032b080 c0329d60 e0a822fc c01a3aaf df549c00 c0291160 e0a82324
Dec  3 19:57:53 tabriel kernel:        c0145eb5 0000000c dee03000 c15a2000 c15a2000 0000000a 00000000 cf515cc0
Dec  3 19:57:53 tabriel kernel:        df549c00 00000008 cef69000 00000000 c01a8620 df549c00 e0a822fc 00000001
Dec  3 19:57:53 tabriel kernel: Call Trace:    [flush_commit_list+1119/1200] [fsync_buffers_list+437/544] [do_journal_end+2064/3072] [get_cnode+122/144] [flush_old_commits+309/464]
Dec  3 19:57:53 tabriel kernel: Call Trace:    [<c01a3aaf>] [<c0145eb5>] [<c01a8620>] [<c01a328a>] [<c01a7555>]
Dec  3 19:57:53 tabriel kernel:   [__sync_one+181/410] [reiserfs_write_super+80/144] [sync_supers+345/400] [sync_old_buffers+52/208] [kupdate+220/448] [ret_from_fork+6/32]
Dec  3 19:57:53 tabriel kernel:   [<c015deb5>] [<c018f230>] [<c0149b79>] [<c0148b64>] [<c0148ebc>] [<c0107562>]
Dec  3 19:57:53 tabriel kernel:   [kupdate+0/448] [_stext+0/96] [arch_kernel_thread+43/64] [kupdate+0/448]
Dec  3 19:57:53 tabriel kernel:   [<c0148de0>] [<c0105000>] [<c010581b>] [<c0148de0>]
Dec  3 19:57:53 tabriel kernel:
Dec  3 19:57:53 tabriel kernel: Code: 0f 0b 55 01 24 06 28 c0 85 db b8 2d 06 28 c0 74 0c 0f b7 43
Dec  3 19:57:53 tabriel kernel:  <6>note: kupdated[6] exited with preempt_count 1
Dec  3 19:58:03 tabriel kernel: hdh: lost interrupt
Dec  3 19:58:43 tabriel last message repeated 4 times
Dec  3 19:59:53 tabriel last message repeated 7 times
Dec  3 20:00:53 tabriel last message repeated 6 times
Dec  3 20:01:00 tabriel CROND[3017]: (root) CMD (nice -n 19 run-parts /etc/cron.hourly)
Dec  3 20:01:03 tabriel kernel: hdh: lost interrupt
Dec  3 20:01:43 tabriel last message repeated 4 times
Dec  3 20:02:53 tabriel last message repeated 7 times
Dec  3 20:03:33 tabriel last message repeated 4 times
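
The status bytes in the log above can be decoded by hand. The Python sketch
below assumes the usual ATA status register bit layout. The names
DriveReady, SeekComplete, DataRequest and Busy come from the log; the other
names are assumptions. It also assumes, based on 0xd1 being printed as
{ Busy } alone, that the remaining bits are not reported while Busy is set.

```python
# ATA status register bits, highest bit first.
STATUS_BITS = [
    (0x80, "Busy"),
    (0x40, "DriveReady"),
    (0x20, "DeviceFault"),     # assumed name, not seen in the log
    (0x10, "SeekComplete"),
    (0x08, "DataRequest"),
    (0x04, "CorrectedError"),  # assumed name, not seen in the log
    (0x02, "Index"),           # assumed name, not seen in the log
    (0x01, "Error"),           # assumed name, not seen in the log
]

def decode_status(value):
    """Return the flag names for an ATA status byte."""
    if value & 0x80:
        # Assumption: while Busy is set the other bits are not reported.
        return ["Busy"]
    return [name for bit, name in STATUS_BITS if value & bit]

print(decode_status(0x58))  # ['DriveReady', 'SeekComplete', 'DataRequest']
print(decode_status(0xd1))  # ['Busy']
```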
