linux-kernel.vger.kernel.org archive mirror
 help / color / mirror / Atom feed
* PROBLEM: kernel BUG at mm/rmap.c:486 - kernel 2.6.15-r1
@ 2006-01-28  3:20 Ken MacFerrin
  2006-01-28 15:48 ` Hugh Dickins
                   ` (2 more replies)
  0 siblings, 3 replies; 21+ messages in thread
From: Ken MacFerrin @ 2006-01-28  3:20 UTC (permalink / raw)
  To: linux-kernel

I started getting hard lockups on my desktop PC with the error "kernel 
BUG at mm/rmap.c:487" starting with kernel 2.6.13 and continuing through 
2.6.14.  After switching to 2.6.15 the lockups have continued with the 
message "kernel BUG at mm/rmap.c:486".

The frequency and circumstance are completely random which originally 
had me suspecting bad memory but after running Memtest86+ for over 12 
hours without error I'm at a loss.

I'm running the binary Nvidia driver so I'll understand if I can't get 
help here but in searching through the list archives it would seem I'm 
not alone and I am willing to try any patches that may help diagnose the 
issue.  The crash happens at least daily and I've seen no difference in 
running kernels with or without PREEMPT enabled.

The machine is a P4 3.00GHz with 2048MB PC3200 Unbuffered RAM on an ASUS 
motherboard with an ICH5 chipset.  XFX GF 6600GT video card, 600W power 
supply and plenty of cooling.

Below I've included output from ver_linux, syslog and lspci.

Thanks,
Ken

------------------------------------------------------------
### ver_linux output ###
------------------------------------------------------------
mm-home1 linux # sh scripts/ver_linux
If some fields are empty or look unusual you may have an old version.
Compare to the current minimal requirements in Documentation/Changes.

Linux mm-home1 2.6.15-gentoo-r1 #4 SMP PREEMPT Wed Jan 25 16:50:47 MST 
2006 i686 Intel(R) Pentium(R) 4 CPU 3.00GHz GenuineIntel GNU/Linux

Gnu C                  3.3.6
Gnu make               3.80
binutils               2.16.1
util-linux             2.12r
mount                  2.12r
module-init-tools      3.2.1
e2fsprogs              1.38
jfsutils               1.1.8
reiserfsprogs          3.6.19
reiser4progs           line
xfsprogs               2.6.25
PPP                    2.4.2
Linux C Library        2.3.5
Dynamic linker (ldd)   2.3.5
Procps                 3.2.5
Net-tools              1.60
Kbd                    1.12
Sh-utils               5.2.1
udev                   081
Modules Loaded         ipt_limit iptable_mangle ipt_LOG ipt_MASQUERADE 
ip_nat ipt_TOS ipt_REJECT ip_conntrack_irc ip_conntrack_ftp ipt_state 
ip_conntrack iptable_filter ip_tables snd_seq_midi snd_emu10k1_synth 
snd_emux_synth snd_seq_virmidi snd_seq_midi_emul snd_emu10k1 snd_rawmidi 
snd_ac97_codec snd_ac97_bus snd_util_mem snd_hwdep eth1394 nls_utf8 
rfcomm bnep l2cap bluetooth dv1394 video1394 raw1394 ohci1394 ieee1394 
3c59x marvell loop nvidia ntfs rtc tsdev
------------------------------------------------------------


------------------------------------------------------------
### /var/log/messages output ###
------------------------------------------------------------
Jan 26 00:26:09 mm-home1 ------------[ cut here ]------------
Jan 26 00:26:09 mm-home1 kernel BUG at mm/rmap.c:486!
Jan 26 00:26:09 mm-home1 invalid operand: 0000 [#1]
Jan 26 00:26:09 mm-home1 PREEMPT SMP
Jan 26 00:26:09 mm-home1 Modules linked in: ipt_limit iptable_mangle 
ipt_LOG ipt_MASQUERADE ip_nat ipt_TOS ipt_REJECT ip_conntrack_irc 
ip_conntrack_ftp ipt_state ip_conntrack iptable_filter ip_tables 
snd_seq_midi snd_emu10k1_synth snd_emux_synth snd_seq_virmidi 
snd_seq_midi_emul snd_emu10k1 snd_rawmidi snd_ac97_codec snd_ac97_bus 
snd_util_mem snd_hwdep eth1394 nls_utf8 rfcomm bnep l2cap bluetooth 
dv1394 video1394 raw1394 ohci1394 ieee1394 3c59x marvell loop nvidia 
ntfs rtc tsdev
Jan 26 00:26:09 mm-home1 CPU:    0
Jan 26 00:26:09 mm-home1 EIP:    0060:[<c0147c05>]    Tainted: P      VLI
Jan 26 00:26:09 mm-home1 EFLAGS: 00010286   (2.6.15-gentoo-r1)
Jan 26 00:26:09 mm-home1 EIP is at page_remove_rmap+0x16/0x2a
Jan 26 00:26:09 mm-home1 eax: ffffffff   ebx: c1c4c404   ecx: 00000038 
  edx: c1c4c404
Jan 26 00:26:09 mm-home1 esi: 00000020   edi: fffb5c4c   ebp: b7713000 
  esp: c5cfbdc4
Jan 26 00:26:09 mm-home1 ds: 007b   es: 007b   ss: 0068
Jan 26 00:26:09 mm-home1 Process thunderbird-bin (pid: 15527, 
threadinfo=c5cfa000 task=cd69d030)
Jan 26 00:26:09 mm-home1 Stack: c0142203 c1c4c404 c1c18348 00000000 
fffffffe f7016180 b7780000 cd69fb74
Jan 26 00:26:09 mm-home1 00000000 cd69fb74 c014238c c220e900 c67a664c 
cd69fb74 b7712000 b7780000
Jan 26 00:26:09 mm-home1 c5cfbe48 00000000 b7780000 cd69fb74 00000001 
c67a664c b7712000 b7780000
Jan 26 00:26:09 mm-home1 Call Trace:
Jan 26 00:26:09 mm-home1 [<c0142203>] zap_pte_range+0x166/0x250
Jan 26 00:26:09 mm-home1 [<c014238c>] unmap_page_range+0x9f/0xed
Jan 26 00:26:09 mm-home1 [<c01424c4>] unmap_vmas+0xea/0x1f1
Jan 26 00:26:09 mm-home1 [<c0146376>] exit_mmap+0x6c/0xff
Jan 26 00:26:09 mm-home1 [<c0116add>] mmput+0x21/0x7a
Jan 26 00:26:09 mm-home1 [<c011ae5a>] do_exit+0x174/0x379
Jan 26 00:26:09 mm-home1 [<c011b120>] sys_exit_group+0x0/0x11
Jan 26 00:26:09 mm-home1 [<c0122c2f>] get_signal_to_deliver+0x2b0/0x2d8
Jan 26 00:26:09 mm-home1 [<c01026dd>] do_signal+0x50/0xc2
Jan 26 00:26:09 mm-home1 [<c012007b>] del_timer+0x9/0x4b
Jan 26 00:26:09 mm-home1 [<c015b291>] pipe_poll+0x1a/0x8e
Jan 26 00:26:09 mm-home1 [<c01395cb>] free_hot_cold_page+0x7a/0x10b
Jan 26 00:26:09 mm-home1 [<c016073c>] poll_freewait+0x37/0x3e
Jan 26 00:26:09 mm-home1 [<c0161224>] sys_poll+0x1dd/0x1e9
Jan 26 00:26:09 mm-home1 [<c0160743>] __pollwait+0x0/0x9b
Jan 26 00:26:09 mm-home1 [<c0102777>] do_notify_resume+0x28/0x39
Jan 26 00:26:09 mm-home1 [<c0102942>] work_notifysig+0x13/0x19
Jan 26 00:26:09 mm-home1 kdm[14992]: X server for display :0 terminated 
unexpectedly
Jan 26 00:26:09 mm-home1 Code: 42 08 0f 94 c0 84 c0 74 0b 6a 01 6a 10 e8 
14 23 ff ff 58 5a c3 8b 54 24 04 f0 83 42 08 ff 0f 98 c0 84 c0 74 19 8b 
42 08 40 79 08 <0f> 0b e6 01 12 82 3e c0 6a ff 6a 10 e8 ea 22 ff ff 59 
58 c3 55
Jan 26 00:26:09 mm-home1 <1>Fixing recursive fault but reboot is needed!
Jan 26 00:26:09 mm-home1 scheduling while atomic: 
thunderbird-bin/0x00000003/15527
Jan 26 00:26:09 mm-home1 [<c03bbb17>] schedule+0x43/0xac0
Jan 26 00:26:09 mm-home1 [<c0119174>] vprintk+0x1f8/0x232
Jan 26 00:26:09 mm-home1 [<c01035eb>] show_trace+0x1e/0x6e
Jan 26 00:26:09 mm-home1 [<c0147c19>] try_to_unmap_one+0x0/0x18c
Jan 26 00:26:09 mm-home1 [<c011ad95>] do_exit+0xaf/0x379
Jan 26 00:26:09 mm-home1 [<c01039fc>] do_trap+0x0/0xc1
Jan 26 00:26:09 mm-home1 [<c0103c5d>] do_invalid_op+0x0/0x86
Jan 26 00:26:09 mm-home1 [<c0103cd7>] do_invalid_op+0x7a/0x86
Jan 26 00:26:09 mm-home1 [<c01203ef>] update_wall_time+0xa/0x32
Jan 26 00:26:09 mm-home1 [<c01206bb>] do_timer+0x2e/0xa3
Jan 26 00:26:09 mm-home1 [<c0147c05>] page_remove_rmap+0x16/0x2a
Jan 26 00:26:09 mm-home1 [<c0134af7>] handle_IRQ_event+0x20/0x4c
Jan 26 00:26:09 mm-home1 [<c0134bcc>] __do_IRQ+0xa9/0xdd
Jan 26 00:26:09 mm-home1 [<c0134bf2>] __do_IRQ+0xcf/0xdd
Jan 26 00:26:09 mm-home1 [<c010486a>] do_IRQ+0x1e/0x24
Jan 26 00:26:09 mm-home1 [<c010327e>] common_interrupt+0x1a/0x20
Jan 26 00:26:09 mm-home1 [<c013007b>] search_module_extables+0x86/0x87
Jan 26 00:26:09 mm-home1 [<c01033d7>] error_code+0x4f/0x54
Jan 26 00:26:09 mm-home1 [<c013007b>] search_module_extables+0x86/0x87
Jan 26 00:26:09 mm-home1 [<c0147c05>] page_remove_rmap+0x16/0x2a
Jan 26 00:26:09 mm-home1 [<c0142203>] zap_pte_range+0x166/0x250
Jan 26 00:26:09 mm-home1 [<c014238c>] unmap_page_range+0x9f/0xed
Jan 26 00:26:09 mm-home1 [<c01424c4>] unmap_vmas+0xea/0x1f1
Jan 26 00:26:09 mm-home1 [<c0146376>] exit_mmap+0x6c/0xff
Jan 26 00:26:09 mm-home1 [<c0116add>] mmput+0x21/0x7a
Jan 26 00:26:09 mm-home1 [<c011ae5a>] do_exit+0x174/0x379
Jan 26 00:26:09 mm-home1 [<c011b120>] sys_exit_group+0x0/0x11
Jan 26 00:26:09 mm-home1 [<c0122c2f>] get_signal_to_deliver+0x2b0/0x2d8
Jan 26 00:26:09 mm-home1 [<c01026dd>] do_signal+0x50/0xc2
Jan 26 00:26:09 mm-home1 [<c012007b>] del_timer+0x9/0x4b
Jan 26 00:26:09 mm-home1 [<c015b291>] pipe_poll+0x1a/0x8e
Jan 26 00:26:09 mm-home1 [<c01395cb>] free_hot_cold_page+0x7a/0x10b
Jan 26 00:26:09 mm-home1 [<c016073c>] poll_freewait+0x37/0x3e
Jan 26 00:26:09 mm-home1 [<c0161224>] sys_poll+0x1dd/0x1e9
Jan 26 00:26:09 mm-home1 [<c0160743>] __pollwait+0x0/0x9b
Jan 26 00:26:09 mm-home1 [<c0102777>] do_notify_resume+0x28/0x39
Jan 26 00:26:09 mm-home1 [<c0102942>] work_notifysig+0x13/0x19
Jan 26 00:26:09 mm-home1 kde(pam_unix)[14996]: session closed for user krm
Jan 26 00:26:09 mm-home1 Unable to handle kernel NULL pointer 
dereference at virtual address 00000020
Jan 26 00:26:09 mm-home1 printing eip:
Jan 26 00:26:09 mm-home1 c025c4ef
Jan 26 00:26:09 mm-home1 *pde = 00000000
Jan 26 00:26:09 mm-home1 Oops: 0000 [#2]
Jan 26 00:26:09 mm-home1 PREEMPT SMP
Jan 26 00:26:09 mm-home1 Modules linked in: ipt_limit iptable_mangle 
ipt_LOG ipt_MASQUERADE ip_nat ipt_TOS ipt_REJECT ip_conntrack_irc 
ip_conntrack_ftp ipt_state ip_conntrack iptable_filter ip_tables 
snd_seq_midi snd_emu10k1_synth snd_emux_synth snd_seq_virmidi 
snd_seq_midi_emul snd_emu10k1 snd_rawmidi snd_ac97_codec snd_ac97_bus 
snd_util_mem snd_hwdep eth1394 nls_utf8 rfcomm bnep l2cap bluetooth 
dv1394 video1394 raw1394 ohci1394 ieee1394 3c59x marvell loop nvidia 
ntfs rtc tsdev
Jan 26 00:26:09 mm-home1 CPU:    1
Jan 26 00:26:09 mm-home1 EIP:    0060:[<c025c4ef>]    Tainted: P      VLI
Jan 26 00:26:09 mm-home1 EFLAGS: 00010297   (2.6.15-gentoo-r1)
Jan 26 00:26:09 mm-home1 EIP is at get_index+0x1b/0x3b
Jan 26 00:26:09 mm-home1 eax: cdd231e4   ebx: ce94bdf4   ecx: ce94bdf0 
  edx: ffffffd8
Jan 26 00:26:09 mm-home1 esi: f71bdcfc   edi: ce94bdf0   ebp: cdd231e4 
  esp: ce94bdd4
Jan 26 00:26:09 mm-home1 ds: 007b   es: 007b   ss: 0068
Jan 26 00:26:09 mm-home1 Process artsd (pid: 15963, threadinfo=ce94a000 
task=f71df550)
Jan 26 00:26:09 mm-home1 Stack: c6625c4c c025c7da cdd231e4 00000000 
ce94bdf0 ce94bdf4 c0141aa8 c6d32eac
Jan 26 00:26:09 mm-home1 c6d72d46 cdd231c8 cdd231f4 c237ec80 f71bdcd4 
c0144919 cdd231e4 f71bdcfc
Jan 26 00:26:09 mm-home1 cdd231c8 f71bdcd4 f7b6af3c b3b00000 00000000 
c0141b73 f71bdcd4 f71bdcd4
Jan 26 00:26:09 mm-home1 Call Trace:
Jan 26 00:26:09 mm-home1 [<c025c7da>] prio_tree_remove+0x30/0xaf
Jan 26 00:26:09 mm-home1 [<c0141aa8>] free_pgd_range+0xc5/0x14a
Jan 26 00:26:09 mm-home1 [<c0144919>] unlink_file_vma+0x27/0x3a
Jan 26 00:26:09 mm-home1 [<c0141b73>] free_pgtables+0x46/0x84
Jan 26 00:26:09 mm-home1 [<c0146395>] exit_mmap+0x8b/0xff
Jan 26 00:26:09 mm-home1 [<c0116add>] mmput+0x21/0x7a
Jan 26 00:26:09 mm-home1 [<c0159498>] exec_mmap+0x19d/0x1bb
Jan 26 00:26:09 mm-home1 [<c0159a74>] flush_old_exec+0x552/0x74d
Jan 26 00:26:09 mm-home1 [<c01592f2>] kernel_read+0x38/0x41
Jan 26 00:26:09 mm-home1 [<c0177027>] load_elf_binary+0x4cd/0xb64
Jan 26 00:26:09 mm-home1 [<c0140ed7>] kunmap_high+0x13/0x80
Jan 26 00:26:09 mm-home1 [<c0140f27>] kunmap_high+0x63/0x80
Jan 26 00:26:09 mm-home1 [<c0158f70>] copy_strings+0x1e5/0x1f2
Jan 26 00:26:09 mm-home1 [<c0176b5a>] load_elf_binary+0x0/0xb64
Jan 26 00:26:09 mm-home1 [<c0159f07>] search_binary_handler+0xd6/0x269
Jan 26 00:26:09 mm-home1 [<c015a205>] do_execve+0x16b/0x207
Jan 26 00:26:09 mm-home1 [<c010170d>] sys_execve+0x2c/0x6d
Jan 26 00:26:09 mm-home1 [<c01028b1>] syscall_call+0x7/0xb
Jan 26 00:26:09 mm-home1 Code: ff d6 ba 01 00 00 00 58 5b 89 d0 5e c3 90 
90 90 53 8b 44 24 08 8b 54 24 0c 8b 4c 24 10 66 83 78 06 00 8b 5c 24 14 
74 17 83 ea 28 <8b> 42 48 89 01 8b 42 08 2b 42 04 c1 e8 0c 03 42 48 48 
eb 08 8b
Jan 26 00:26:09 mm-home1 <6>note: artsd[15963] exited with preempt_count 2
Jan 26 00:26:09 mm-home1 Unable to handle kernel paging request at 
virtual address 005d7533
Jan 26 00:26:09 mm-home1 printing eip:
Jan 26 00:26:09 mm-home1 c01407f2
Jan 26 00:26:09 mm-home1 *pde = 00000000
Jan 26 00:26:09 mm-home1 Oops: 0002 [#3]
Jan 26 00:26:09 mm-home1 PREEMPT SMP
Jan 26 00:26:09 mm-home1 Modules linked in: ipt_limit iptable_mangle 
ipt_LOG ipt_MASQUERADE ip_nat ipt_TOS ipt_REJECT ip_conntrack_irc 
ip_conntrack_ftp ipt_state ip_conntrack iptable_filter ip_tables 
snd_seq_midi snd_emu10k1_synth snd_emux_synth snd_seq_virmidi 
snd_seq_midi_emul snd_emu10k1 snd_rawmidi snd_ac97_codec snd_ac97_bus 
snd_util_mem snd_hwdep eth1394 nls_utf8 rfcomm bnep l2cap bluetooth 
dv1394 video1394 raw1394 ohci1394 ieee1394 3c59x marvell loop nvidia 
ntfs rtc tsdev
Jan 26 00:26:09 mm-home1 CPU:    0
Jan 26 00:26:09 mm-home1 EIP:    0060:[<c01407f2>]    Tainted: P      VLI
Jan 26 00:26:09 mm-home1 EFLAGS: 00210207   (2.6.15-gentoo-r1)
Jan 26 00:26:09 mm-home1 EIP is at vma_prio_tree_remove+0x68/0xbf
Jan 26 00:26:09 mm-home1 eax: c6f57b44   ebx: 005d7507   ecx: f700b514 
  edx: 005d752f
Jan 26 00:26:09 mm-home1 esi: c70be4ec   edi: f700b4ec   ebp: ce911e78 
  esp: ce943f04
Jan 26 00:26:09 mm-home1 ds: 007b   es: 007b   ss: 0068
Jan 26 00:26:09 mm-home1 Process kicker (pid: 15118, threadinfo=ce942000 
task=f7096030)
Jan 26 00:26:09 mm-home1 Stack: ce911e5c ce911e88 f3d0f480 c70be4ec 
c0144919 c70be4ec ce911e78 ce911e5c
Jan 26 00:26:09 mm-home1 c70be4ec c70be494 b63fe000 00000000 c0141b73 
c70be4ec c70be4ec ce943f64
Jan 26 00:26:09 mm-home1 c6d7c9bc f7045300 00000100 c0146395 ce943f64 
c6d7c9bc 00000000 00000000
Jan 26 00:26:09 mm-home1 Call Trace:
Jan 26 00:26:09 mm-home1 [<c0144919>] unlink_file_vma+0x27/0x3a
Jan 26 00:26:09 mm-home1 [<c0141b73>] free_pgtables+0x46/0x84
Jan 26 00:26:09 mm-home1 [<c0146395>] exit_mmap+0x8b/0xff
Jan 26 00:26:09 mm-home1 [<c0116add>] mmput+0x21/0x7a
Jan 26 00:26:09 mm-home1 [<c011ae5a>] do_exit+0x174/0x379
Jan 26 00:26:09 mm-home1 [<c011b120>] sys_exit_group+0x0/0x11
Jan 26 00:26:09 mm-home1 [<c01028b1>] syscall_call+0x7/0xb
Jan 26 00:26:09 mm-home1 Code: 5d e9 dd bf 11 00 39 77 34 74 08 0f 0b 7d 
00 a5 81 3e c0 83 7e 30 00 74 3a 8d 4f 28 8b 57 28 31 db 39 ca 74 11 8b 
41 04 8d 5a d8 <89> 42 04 89 10 89 49 04 89 4f 28 8d 47 28 50 8d 46 28 
50 55 e8
Jan 26 00:26:09 mm-home1 <1>Fixing recursive fault but reboot is needed!
Jan 26 00:26:09 mm-home1 scheduling while atomic: kicker/0x00000002/15118
Jan 26 00:26:09 mm-home1 [<c03bbb17>] schedule+0x43/0xac0
Jan 26 00:26:09 mm-home1 [<c0119174>] vprintk+0x1f8/0x232
Jan 26 00:26:09 mm-home1 [<c01035eb>] show_trace+0x1e/0x6e
Jan 26 00:26:09 mm-home1 [<c0140806>] vma_prio_tree_remove+0x7c/0xbf
Jan 26 00:26:09 mm-home1 [<c011ad95>] do_exit+0xaf/0x379
Jan 26 00:26:09 mm-home1 [<c01039fc>] do_trap+0x0/0xc1
Jan 26 00:26:09 mm-home1 [<c0111ecf>] do_page_fault+0x377/0x4a9
Jan 26 00:26:09 mm-home1 [<c0111b58>] do_page_fault+0x0/0x4a9
Jan 26 00:26:09 mm-home1 [<c01033d7>] error_code+0x4f/0x54
Jan 26 00:26:09 mm-home1 [<c01407f2>] vma_prio_tree_remove+0x68/0xbf
Jan 26 00:26:09 mm-home1 [<c0144919>] unlink_file_vma+0x27/0x3a
Jan 26 00:26:09 mm-home1 [<c0141b73>] free_pgtables+0x46/0x84
Jan 26 00:26:09 mm-home1 [<c0146395>] exit_mmap+0x8b/0xff
Jan 26 00:26:09 mm-home1 [<c0116add>] mmput+0x21/0x7a
Jan 26 00:26:09 mm-home1 [<c011ae5a>] do_exit+0x174/0x379
Jan 26 00:26:09 mm-home1 [<c011b120>] sys_exit_group+0x0/0x11
Jan 26 00:26:09 mm-home1 [<c01028b1>] syscall_call+0x7/0xb
Jan 26 00:26:11 mm-home1 kdm[14992]: X server for display :0 terminated 
unexpectedly
Jan 26 00:26:11 mm-home1 kdm[14992]: Unable to fire up local display :0; 
disabling.
------------------------------------------------------------


------------------------------------------------------------
### lspci -vv output ###
------------------------------------------------------------
mm-home1 linux # lspci -vv
00:00.0 Host bridge: Intel Corporation 82865G/PE/P DRAM 
Controller/Host-Hub Interface (rev 02)
         Subsystem: ASUSTeK Computer Inc. P5P800-MX Mainboard
         Control: I/O- Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- 
ParErr- Stepping- SERR- FastB2B-
         Status: Cap+ 66MHz- UDF- FastB2B+ ParErr- DEVSEL=fast >TAbort- 
<TAbort- <MAbort+ >SERR- <PERR-
         Latency: 0
         Region 0: Memory at e0000000 (32-bit, prefetchable) [size=256M]
         Capabilities: [e4] Vendor Specific Information
         Capabilities: [a0] AGP version 3.0
                 Status: RQ=32 Iso- ArqSz=2 Cal=2 SBA+ ITACoh- GART64- 
HTrans- 64bit- FW+ AGP3+ Rate=x4,x8
                 Command: RQ=1 ArqSz=0 Cal=2 SBA+ AGP+ GART64- 64bit- 
FW+ Rate=x8

00:01.0 PCI bridge: Intel Corporation 82865G/PE/P PCI to AGP Controller 
(rev 02) (prog-if 00 [Normal decode])
         Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- 
ParErr- Stepping- SERR+ FastB2B-
         Status: Cap- 66MHz+ UDF- FastB2B+ ParErr- DEVSEL=fast >TAbort- 
<TAbort- <MAbort- >SERR- <PERR-
         Latency: 64
         Bus: primary=00, secondary=01, subordinate=01, sec-latency=64
         Memory behind bridge: fa900000-fe9fffff
         Prefetchable memory behind bridge: bff00000-dfefffff
         Secondary status: 66MHz+ FastB2B+ ParErr- DEVSEL=medium 
 >TAbort- <TAbort- <MAbort+ <SERR- <PERR-
         BridgeCtl: Parity- SERR- NoISA- VGA+ MAbort- >Reset- FastB2B-

00:1d.0 USB Controller: Intel Corporation 82801EB/ER (ICH5/ICH5R) USB 
UHCI Controller #1 (rev 02) (prog-if 00 [UHCI])
         Subsystem: ASUSTeK Computer Inc. P5P800-MX Mainboard
         Control: I/O+ Mem- BusMaster+ SpecCycle- MemWINV- VGASnoop- 
ParErr- Stepping- SERR- FastB2B-
         Status: Cap- 66MHz- UDF- FastB2B+ ParErr- DEVSEL=medium 
 >TAbort- <TAbort- <MAbort- >SERR- <PERR-
         Latency: 0
         Interrupt: pin A routed to IRQ 19
         Region 4: I/O ports at eec0 [size=32]

00:1d.1 USB Controller: Intel Corporation 82801EB/ER (ICH5/ICH5R) USB 
UHCI Controller #2 (rev 02) (prog-if 00 [UHCI])
         Subsystem: ASUSTeK Computer Inc. P5P800-MX Mainboard
         Control: I/O+ Mem- BusMaster+ SpecCycle- MemWINV- VGASnoop- 
ParErr- Stepping- SERR- FastB2B-
         Status: Cap- 66MHz- UDF- FastB2B+ ParErr- DEVSEL=medium 
 >TAbort- <TAbort- <MAbort- >SERR- <PERR-
         Latency: 0
         Interrupt: pin B routed to IRQ 20
         Region 4: I/O ports at ef00 [size=32]

00:1d.2 USB Controller: Intel Corporation 82801EB/ER (ICH5/ICH5R) USB 
UHCI Controller #3 (rev 02) (prog-if 00 [UHCI])
         Subsystem: ASUSTeK Computer Inc. P5P800-MX Mainboard
         Control: I/O+ Mem- BusMaster+ SpecCycle- MemWINV- VGASnoop- 
ParErr- Stepping- SERR- FastB2B-
         Status: Cap- 66MHz- UDF- FastB2B+ ParErr- DEVSEL=medium 
 >TAbort- <TAbort- <MAbort- >SERR- <PERR-
         Latency: 0
         Interrupt: pin C routed to IRQ 17
         Region 4: I/O ports at ef20 [size=32]

00:1d.3 USB Controller: Intel Corporation 82801EB/ER (ICH5/ICH5R) USB 
UHCI Controller #4 (rev 02) (prog-if 00 [UHCI])
         Subsystem: ASUSTeK Computer Inc. P5P800-MX Mainboard
         Control: I/O+ Mem- BusMaster+ SpecCycle- MemWINV- VGASnoop- 
ParErr- Stepping- SERR- FastB2B-
         Status: Cap- 66MHz- UDF- FastB2B+ ParErr- DEVSEL=medium 
 >TAbort- <TAbort- <MAbort- >SERR- <PERR-
         Latency: 0
         Interrupt: pin A routed to IRQ 19
         Region 4: I/O ports at ef40 [size=32]

00:1d.7 USB Controller: Intel Corporation 82801EB/ER (ICH5/ICH5R) USB2 
EHCI Controller (rev 02) (prog-if 20 [EHCI])
         Subsystem: ASUSTeK Computer Inc. P5P800-MX Mainboard
         Control: I/O- Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- 
ParErr- Stepping- SERR+ FastB2B-
         Status: Cap+ 66MHz- UDF- FastB2B+ ParErr- DEVSEL=medium 
 >TAbort- <TAbort- <MAbort- >SERR- <PERR-
         Latency: 0
         Interrupt: pin D routed to IRQ 18
         Region 0: Memory at febff800 (32-bit, non-prefetchable) [size=1K]
         Capabilities: [50] Power Management version 2
                 Flags: PMEClk- DSI- D1- D2- AuxCurrent=375mA 
PME(D0+,D1-,D2-,D3hot+,D3cold+)
                 Status: D0 PME-Enable- DSel=0 DScale=0 PME-
         Capabilities: [58] Debug port

00:1e.0 PCI bridge: Intel Corporation 82801 PCI Bridge (rev c2) (prog-if 
00 [Normal decode])
         Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- 
ParErr- Stepping- SERR+ FastB2B-
         Status: Cap- 66MHz- UDF- FastB2B+ ParErr- DEVSEL=fast >TAbort- 
<TAbort- <MAbort- >SERR- <PERR-
         Latency: 0
         Bus: primary=00, secondary=02, subordinate=02, sec-latency=64
         I/O behind bridge: 0000d000-0000dfff
         Memory behind bridge: fea00000-feafffff
         Prefetchable memory behind bridge: 88000000-880fffff
         Secondary status: 66MHz- FastB2B+ ParErr- DEVSEL=medium 
 >TAbort- <TAbort- <MAbort- <SERR- <PERR-
         BridgeCtl: Parity- SERR+ NoISA- VGA- MAbort- >Reset- FastB2B-

00:1f.0 ISA bridge: Intel Corporation 82801EB/ER (ICH5/ICH5R) LPC 
Interface Bridge (rev 02)
         Control: I/O+ Mem+ BusMaster+ SpecCycle+ MemWINV- VGASnoop- 
ParErr- Stepping- SERR- FastB2B-
         Status: Cap- 66MHz- UDF- FastB2B+ ParErr- DEVSEL=medium 
 >TAbort- <TAbort- <MAbort- >SERR- <PERR-
         Latency: 0

00:1f.1 IDE interface: Intel Corporation 82801EB/ER (ICH5/ICH5R) IDE 
Controller (rev 02) (prog-if 8a [Master SecP PriP])
         Subsystem: ASUSTeK Computer Inc. P5P800-MX Mainboard
         Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- 
ParErr- Stepping- SERR- FastB2B-
         Status: Cap- 66MHz- UDF- FastB2B+ ParErr- DEVSEL=medium 
 >TAbort- <TAbort- <MAbort- >SERR- <PERR-
         Latency: 0
         Interrupt: pin A routed to IRQ 17
         Region 0: I/O ports at <unassigned>
         Region 1: I/O ports at <unassigned>
         Region 2: I/O ports at <unassigned>
         Region 3: I/O ports at <unassigned>
         Region 4: I/O ports at fc00 [size=16]
         Region 5: Memory at 88100000 (32-bit, non-prefetchable) [size=1K]

00:1f.3 SMBus: Intel Corporation 82801EB/ER (ICH5/ICH5R) SMBus 
Controller (rev 02)
         Subsystem: ASUSTeK Computer Inc. P4P800 Mainboard
         Control: I/O+ Mem- BusMaster- SpecCycle- MemWINV- VGASnoop- 
ParErr- Stepping- SERR- FastB2B-
         Status: Cap- 66MHz- UDF- FastB2B+ ParErr- DEVSEL=medium 
 >TAbort- <TAbort- <MAbort- >SERR- <PERR-
         Interrupt: pin B routed to IRQ 10
         Region 4: I/O ports at 0400 [size=32]

01:00.0 VGA compatible controller: nVidia Corporation NV43 [GeForce 
6600/GeForce 6600 GT] (rev a2) (prog-if 00 [VGA])
         Subsystem: XFX Pine Group Inc. Unknown device 2165
         Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- 
ParErr- Stepping- SERR+ FastB2B-
         Status: Cap+ 66MHz+ UDF- FastB2B+ ParErr- DEVSEL=medium 
 >TAbort- <TAbort- <MAbort- >SERR- <PERR-
         Latency: 248 (1250ns min, 250ns max)
         Interrupt: pin A routed to IRQ 19
         Region 0: Memory at fd000000 (32-bit, non-prefetchable) [size=16M]
         Region 1: Memory at c0000000 (32-bit, prefetchable) [size=256M]
         Region 2: Memory at fc000000 (32-bit, non-prefetchable) [size=16M]
         [virtual] Expansion ROM at fe9e0000 [disabled] [size=128K]
         Capabilities: [60] Power Management version 2
                 Flags: PMEClk- DSI- D1- D2- AuxCurrent=0mA 
PME(D0-,D1-,D2-,D3hot-,D3cold-)
                 Status: D0 PME-Enable- DSel=0 DScale=0 PME-
         Capabilities: [44] AGP version 3.0
                 Status: RQ=256 Iso- ArqSz=0 Cal=3 SBA+ ITACoh- GART64- 
HTrans- 64bit- FW+ AGP3+ Rate=x4,x8
                 Command: RQ=32 ArqSz=2 Cal=0 SBA+ AGP+ GART64- 64bit- 
FW+ Rate=x8

02:05.0 Ethernet controller: Marvell Technology Group Ltd. 88E8001 
Gigabit Ethernet Controller (rev 13)
         Subsystem: ASUSTeK Computer Inc. Marvell 88E8001 Gigabit 
Ethernet Controller (Asus)
         Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV+ VGASnoop- 
ParErr- Stepping- SERR+ FastB2B-
         Status: Cap+ 66MHz+ UDF- FastB2B+ ParErr- DEVSEL=medium 
 >TAbort- <TAbort- <MAbort- >SERR- <PERR-
         Latency: 64 (5750ns min, 7750ns max), Cache Line Size 04
         Interrupt: pin A routed to IRQ 16
         Region 0: Memory at feafc000 (32-bit, non-prefetchable) [size=16K]
         Region 1: I/O ports at d800 [size=256]
         Expansion ROM at 88000000 [disabled] [size=128K]
         Capabilities: [48] Power Management version 2
                 Flags: PMEClk- DSI- D1+ D2+ AuxCurrent=0mA 
PME(D0+,D1+,D2+,D3hot+,D3cold+)
                 Status: D0 PME-Enable- DSel=0 DScale=1 PME-
         Capabilities: [50] Vital Product Data

02:0a.0 FireWire (IEEE 1394): Texas Instruments TSB12LV26 IEEE-1394 
Controller (Link) (prog-if 10 [OHCI])
         Subsystem: Accton Technology Corporation Unknown device 1394
         Control: I/O- Mem+ BusMaster+ SpecCycle- MemWINV+ VGASnoop- 
ParErr- Stepping- SERR+ FastB2B-
         Status: Cap+ 66MHz- UDF- FastB2B- ParErr- DEVSEL=medium 
 >TAbort- <TAbort- <MAbort- >SERR- <PERR-
         Latency: 64 (500ns min, 1000ns max), Cache Line Size 04
         Interrupt: pin A routed to IRQ 16
         Region 0: Memory at feafb800 (32-bit, non-prefetchable) [size=2K]
         Region 1: Memory at feaf4000 (32-bit, non-prefetchable) [size=16K]
         Capabilities: [44] Power Management version 1
                 Flags: PMEClk- DSI- D1- D2+ AuxCurrent=0mA 
PME(D0-,D1-,D2+,D3hot+,D3cold-)
                 Status: D0 PME-Enable- DSel=0 DScale=0 PME-

02:0b.0 Multimedia audio controller: Creative Labs SB Live! EMU10k1 (rev 0a)
         Subsystem: Creative Labs Unknown device 8066
         Control: I/O+ Mem- BusMaster+ SpecCycle- MemWINV- VGASnoop- 
ParErr- Stepping- SERR+ FastB2B-
         Status: Cap+ 66MHz- UDF- FastB2B+ ParErr- DEVSEL=medium 
 >TAbort- <TAbort- <MAbort- >SERR- <PERR-
         Latency: 64 (500ns min, 5000ns max)
         Interrupt: pin A routed to IRQ 18
         Region 0: I/O ports at df80 [size=32]
         Capabilities: [dc] Power Management version 1
                 Flags: PMEClk- DSI- D1+ D2+ AuxCurrent=0mA 
PME(D0-,D1-,D2-,D3hot-,D3cold-)
                 Status: D0 PME-Enable- DSel=0 DScale=0 PME-

02:0b.1 Input device controller: Creative Labs SB Live! MIDI/Game Port 
(rev 0a)
         Subsystem: Creative Labs Gameport Joystick
         Control: I/O+ Mem- BusMaster+ SpecCycle- MemWINV- VGASnoop- 
ParErr- Stepping- SERR+ FastB2B-
         Status: Cap+ 66MHz- UDF- FastB2B+ ParErr- DEVSEL=medium 
 >TAbort- <TAbort- <MAbort- >SERR- <PERR-
         Latency: 64
         Region 0: I/O ports at dff0 [size=8]
         Capabilities: [dc] Power Management version 1
                 Flags: PMEClk- DSI- D1+ D2+ AuxCurrent=0mA 
PME(D0-,D1-,D2-,D3hot-,D3cold-)
                 Status: D0 PME-Enable- DSel=0 DScale=0 PME-

02:0c.0 Ethernet controller: 3Com Corporation 3c905B 100BaseTX [Cyclone] 
(rev 30)
         Subsystem: 3Com Corporation 3C905B Fast Etherlink XL 10/100
         Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV+ VGASnoop- 
ParErr- Stepping- SERR+ FastB2B-
         Status: Cap+ 66MHz- UDF- FastB2B- ParErr- DEVSEL=medium 
 >TAbort- <TAbort- <MAbort- >SERR- <PERR-
         Latency: 64 (2500ns min, 2500ns max), Cache Line Size 04
         Interrupt: pin A routed to IRQ 21
         Region 0: I/O ports at dc00 [size=128]
         Region 1: Memory at feafb400 (32-bit, non-prefetchable) [size=128]
         Expansion ROM at 88020000 [disabled] [size=128K]
         Capabilities: [dc] Power Management version 1
                 Flags: PMEClk- DSI- D1+ D2+ AuxCurrent=0mA 
PME(D0-,D1+,D2+,D3hot+,D3cold+)
                 Status: D0 PME-Enable- DSel=0 DScale=0 PME-
------------------------------------------------------------
### END ###
------------------------------------------------------------


^ permalink raw reply	[flat|nested] 21+ messages in thread

* Re: PROBLEM: kernel BUG at mm/rmap.c:486 - kernel 2.6.15-r1
  2006-01-28  3:20 PROBLEM: kernel BUG at mm/rmap.c:486 - kernel 2.6.15-r1 Ken MacFerrin
@ 2006-01-28 15:48 ` Hugh Dickins
  2006-02-02  1:17   ` Ken MacFerrin
  2006-01-28 18:31 ` Jesper Juhl
  2006-01-28 19:13 ` Alistair John Strachan
  2 siblings, 1 reply; 21+ messages in thread
From: Hugh Dickins @ 2006-01-28 15:48 UTC (permalink / raw)
  To: Ken MacFerrin; +Cc: linux-kernel

On Fri, 27 Jan 2006, Ken MacFerrin wrote:
> I started getting hard lockups on my desktop PC with the error "kernel BUG at
> mm/rmap.c:487" starting with kernel 2.6.13 and continuing through 2.6.14.
> After switching to 2.6.15 the lockups have continued with the message "kernel
> BUG at mm/rmap.c:486".

That's progress, we're hoping to get it to vanish at line 0 eventually ;)

> The frequency and circumstance are completely random which originally had me
> suspecting bad memory but after running Memtest86+ for over 12 hours without
> error I'm at a loss.
> 
> I'm running the binary Nvidia driver so I'll understand if I can't get help
> here but in searching through the list archives it would seem I'm not alone
> and I am willing to try any patches that may help diagnose the issue.  The
> crash happens at least daily and I've seen no difference in running kernels
> with or without PREEMPT enabled.
> 
> The machine is a P4 3.00GHz with 2048MB PC3200 Unbuffered RAM on an ASUS
> motherboard with an ICH5 chipset.  XFX GF 6600GT video card, 600W power supply
> and plenty of cooling.

You raise several worthwhile points there, I needn't repeat them back to you.

Here's the 2.6.15 version of the patch I traditionally send out for this
(smaller than for earlier releases because of several advances in 2.6.15).

Please apply and let me know of all "Bad page state" and "Bad rmap"
messages you get.  Our record at nailing these problems is not good,
but the patch should at least let you go on running for much longer.

Thanks,
Hugh

--- 2.6.15/include/linux/rmap.h	2006-01-03 03:21:10.000000000 +0000
+++ linux/include/linux/rmap.h	2006-01-03 17:08:21.000000000 +0000
@@ -72,7 +72,7 @@ void __anon_vma_link(struct vm_area_stru
  */
 void page_add_anon_rmap(struct page *, struct vm_area_struct *, unsigned long);
 void page_add_file_rmap(struct page *);
-void page_remove_rmap(struct page *);
+void page_remove_rmap(struct page *, struct vm_area_struct *, unsigned long);
 
 /**
  * page_dup_rmap - duplicate pte mapping to a page
--- 2.6.15/mm/filemap_xip.c	2006-01-03 03:21:10.000000000 +0000
+++ linux/mm/filemap_xip.c	2006-01-03 17:08:21.000000000 +0000
@@ -189,7 +189,7 @@ __xip_unmap (struct address_space * mapp
 			/* Nuke the page table entry. */
 			flush_cache_page(vma, address, pte_pfn(*pte));
 			pteval = ptep_clear_flush(vma, address, pte);
-			page_remove_rmap(page);
+			page_remove_rmap(page, vma, address);
 			dec_mm_counter(mm, file_rss);
 			BUG_ON(pte_dirty(pteval));
 			pte_unmap_unlock(pte, ptl);
--- 2.6.15/mm/fremap.c	2006-01-03 03:21:10.000000000 +0000
+++ linux/mm/fremap.c	2006-01-03 17:08:21.000000000 +0000
@@ -33,7 +33,7 @@ static int zap_pte(struct mm_struct *mm,
 		if (page) {
 			if (pte_dirty(pte))
 				set_page_dirty(page);
-			page_remove_rmap(page);
+			page_remove_rmap(page, vma, addr);
 			page_cache_release(page);
 		}
 	} else {
--- 2.6.15/mm/memory.c	2006-01-03 03:21:10.000000000 +0000
+++ linux/mm/memory.c	2006-01-03 17:08:21.000000000 +0000
@@ -656,7 +656,7 @@ static unsigned long zap_pte_range(struc
 					mark_page_accessed(page);
 				file_rss--;
 			}
-			page_remove_rmap(page);
+			page_remove_rmap(page, vma, addr);
 			tlb_remove_page(tlb, page);
 			continue;
 		}
@@ -1484,7 +1484,7 @@ gotten:
 	page_table = pte_offset_map_lock(mm, pmd, address, &ptl);
 	if (likely(pte_same(*page_table, orig_pte))) {
 		if (old_page) {
-			page_remove_rmap(old_page);
+			page_remove_rmap(old_page, vma, address);
 			if (!PageAnon(old_page)) {
 				dec_mm_counter(mm, file_rss);
 				inc_mm_counter(mm, anon_rss);
--- 2.6.15/mm/rmap.c	2006-01-03 03:21:10.000000000 +0000
+++ linux/mm/rmap.c	2006-01-03 17:08:21.000000000 +0000
@@ -480,10 +480,40 @@ void page_add_file_rmap(struct page *pag
  *
  * The caller needs to hold the pte lock.
  */
-void page_remove_rmap(struct page *page)
+void page_remove_rmap(struct page *page,
+	struct vm_area_struct *vma, unsigned long address)
 {
+	struct address_space *mapping = NULL;
+	unsigned long index;
+
+	index = (address - vma->vm_start) >> PAGE_SHIFT;
+	index += vma->vm_pgoff;
+
+	if (PageAnon(page))
+		mapping = (void *) vma->anon_vma + PAGE_MAPPING_ANON;
+	else if (page->mapping)
+		mapping = vma->vm_file? vma->vm_file->f_mapping: (void *)(-1);
+
+	if (page_mapcount(page) <= 0 || page_count(page) <= 0 ||
+	    (mapping && (mapping != page->mapping || index != page->index))) {
+		pgd_t *pgd = pgd_offset(vma->vm_mm, address);
+		pud_t *pud = pud_offset(pgd, address);
+		pmd_t *pmd = pmd_offset(pud, address);
+		unsigned long ptpfn = pmd_val(*pmd) >> PAGE_SHIFT;
+
+		printk(KERN_ERR "Bad rmap: "
+			"page %p flags %lx count %d mapcount %d\n",
+			page, page->flags,
+			page_count(page), page_mapcount(page));
+		printk(KERN_ERR "  %s addr %lx ptpfn %lx vm_flags %lx\n",
+			current->comm, address, ptpfn, vma->vm_flags);
+		printk(KERN_ERR "  page mapping %p %lx vma mapping %p %lx\n",
+			page->mapping, page->index, mapping, index);
+		get_page(page);	/* corrupt, so leak rather than free it */
+		return;
+	}
+
 	if (atomic_add_negative(-1, &page->_mapcount)) {
-		BUG_ON(page_mapcount(page) < 0);
 		/*
 		 * It would be tidy to reset the PageAnon mapping here,
 		 * but that might overwrite a racing page_add_anon_rmap
@@ -562,7 +592,7 @@ static int try_to_unmap_one(struct page 
 	} else
 		dec_mm_counter(mm, file_rss);
 
-	page_remove_rmap(page);
+	page_remove_rmap(page, vma, address);
 	page_cache_release(page);
 
 out_unmap:
@@ -652,7 +682,7 @@ static void try_to_unmap_cluster(unsigne
 		if (pte_dirty(pteval))
 			set_page_dirty(page);
 
-		page_remove_rmap(page);
+		page_remove_rmap(page, vma, address);
 		page_cache_release(page);
 		dec_mm_counter(mm, file_rss);
 		(*mapcount)--;

^ permalink raw reply	[flat|nested] 21+ messages in thread

* Re: PROBLEM: kernel BUG at mm/rmap.c:486 - kernel 2.6.15-r1
  2006-01-28  3:20 PROBLEM: kernel BUG at mm/rmap.c:486 - kernel 2.6.15-r1 Ken MacFerrin
  2006-01-28 15:48 ` Hugh Dickins
@ 2006-01-28 18:31 ` Jesper Juhl
  2006-01-29 22:12   ` Ken MacFerrin
  2006-01-28 19:13 ` Alistair John Strachan
  2 siblings, 1 reply; 21+ messages in thread
From: Jesper Juhl @ 2006-01-28 18:31 UTC (permalink / raw)
  To: Ken MacFerrin; +Cc: linux-kernel

On 1/28/06, Ken MacFerrin <lists@macferrin.com> wrote:
> I started getting hard lockups on my desktop PC with the error "kernel
> BUG at mm/rmap.c:487" starting with kernel 2.6.13 and continuing through
> 2.6.14.  After switching to 2.6.15 the lockups have continued with the
> message "kernel BUG at mm/rmap.c:486".
>
> The frequency and circumstance are completely random which originally
> had me suspecting bad memory but after running Memtest86+ for over 12
> hours without error I'm at a loss.
>
> I'm running the binary Nvidia driver so I'll understand if I can't get
> help here but in searching through the list archives it would seem I'm
> not alone and I am willing to try any patches that may help diagnose the
> issue.  The crash happens at least daily and I've seen no difference in
> running kernels with or without PREEMPT enabled.
>
If you don't actually *need* accelerated 3D (or if you could do
without it for a while), switching to the "nv" driver for a few
days/weeks would be interresting. If the crashes go away that would
point towards the nvidia driver, if they don't go away we'll get a
nice untainted crash report.

--
Jesper Juhl <jesper.juhl@gmail.com>
Don't top-post  http://www.catb.org/~esr/jargon/html/T/top-post.html
Plain text mails only, please      http://www.expita.com/nomime.html

^ permalink raw reply	[flat|nested] 21+ messages in thread

* Re: PROBLEM: kernel BUG at mm/rmap.c:486 - kernel 2.6.15-r1
  2006-01-28  3:20 PROBLEM: kernel BUG at mm/rmap.c:486 - kernel 2.6.15-r1 Ken MacFerrin
  2006-01-28 15:48 ` Hugh Dickins
  2006-01-28 18:31 ` Jesper Juhl
@ 2006-01-28 19:13 ` Alistair John Strachan
  2006-03-12  0:06   ` Patrick B�rjesson
  2 siblings, 1 reply; 21+ messages in thread
From: Alistair John Strachan @ 2006-01-28 19:13 UTC (permalink / raw)
  To: Ken MacFerrin; +Cc: linux-kernel

On Saturday 28 January 2006 03:20, Ken MacFerrin wrote:
> I started getting hard lockups on my desktop PC with the error "kernel
> BUG at mm/rmap.c:487" starting with kernel 2.6.13 and continuing through
> 2.6.14.  After switching to 2.6.15 the lockups have continued with the
> message "kernel BUG at mm/rmap.c:486".
>
> The frequency and circumstance are completely random which originally
> had me suspecting bad memory but after running Memtest86+ for over 12
> hours without error I'm at a loss.
>
> I'm running the binary Nvidia driver so I'll understand if I can't get
> help here but in searching through the list archives it would seem I'm
> not alone and I am willing to try any patches that may help diagnose the
> issue.  The crash happens at least daily and I've seen no difference in
> running kernels with or without PREEMPT enabled.
>
> The machine is a P4 3.00GHz with 2048MB PC3200 Unbuffered RAM on an ASUS
> motherboard with an ICH5 chipset.  XFX GF 6600GT video card, 600W power
> supply and plenty of cooling.

Ken,

Just to let you know, I've had the same problem on x86-64. It's an incredibly 
rare fault here and I've not been able to reproduce it. However, I cannot 
help but notice that all of the reporters so far have been running the binary 
NVIDIA driver, including myself.

I would not be surprised if running without the NVIDIA driver eliminated the 
problem.

-- 
Cheers,
Alistair.

'No sense being pessimistic, it probably wouldn't work anyway.'
Third year Computer Science undergraduate.
1F2 55 South Clerk Street, Edinburgh, UK.

^ permalink raw reply	[flat|nested] 21+ messages in thread

* Re: PROBLEM: kernel BUG at mm/rmap.c:486 - kernel 2.6.15-r1
  2006-01-28 18:31 ` Jesper Juhl
@ 2006-01-29 22:12   ` Ken MacFerrin
  2006-01-30  0:56     ` Ken MacFerrin
  0 siblings, 1 reply; 21+ messages in thread
From: Ken MacFerrin @ 2006-01-29 22:12 UTC (permalink / raw)
  To: Jesper Juhl, s0348365, hugh; +Cc: linux-kernel

Jesper Juhl wrote:
> On 1/28/06, Ken MacFerrin <lists@macferrin.com> wrote:
> 
>>I started getting hard lockups on my desktop PC with the error "kernel
>>BUG at mm/rmap.c:487" starting with kernel 2.6.13 and continuing through
>>2.6.14.  After switching to 2.6.15 the lockups have continued with the
>>message "kernel BUG at mm/rmap.c:486".
>>
>>The frequency and circumstance are completely random which originally
>>had me suspecting bad memory but after running Memtest86+ for over 12
>>hours without error I'm at a loss.
>>
>>I'm running the binary Nvidia driver so I'll understand if I can't get
>>help here but in searching through the list archives it would seem I'm
>>not alone and I am willing to try any patches that may help diagnose the
>>issue.  The crash happens at least daily and I've seen no difference in
>>running kernels with or without PREEMPT enabled.
>>
> 
> If you don't actually *need* accelerated 3D (or if you could do
> without it for a while), switching to the "nv" driver for a few
> days/weeks would be interresting. If the crashes go away that would
> point towards the nvidia driver, if they don't go away we'll get a
> nice untainted crash report.
> 

Thanks to all for the response.  In hopes of helping to isolate this I 
will move back over to the "nv" driver to see if I can recreate the 
problem and get a clean bug report before applying Hugh's patch.

This crash currently happens daily for me so I should be able to test 
this relatively quickly.
-Ken

^ permalink raw reply	[flat|nested] 21+ messages in thread

* Re: PROBLEM: kernel BUG at mm/rmap.c:486 - kernel 2.6.15-r1
  2006-01-29 22:12   ` Ken MacFerrin
@ 2006-01-30  0:56     ` Ken MacFerrin
  2006-01-30 16:46       ` Alistair John Strachan
  0 siblings, 1 reply; 21+ messages in thread
From: Ken MacFerrin @ 2006-01-30  0:56 UTC (permalink / raw)
  To: Jesper Juhl, s0348365, hugh; +Cc: linux-kernel

Ken MacFerrin wrote:
> Jesper Juhl wrote:
> 
>> On 1/28/06, Ken MacFerrin <lists@macferrin.com> wrote:
>>
>>> I started getting hard lockups on my desktop PC with the error "kernel
>>> BUG at mm/rmap.c:487" starting with kernel 2.6.13 and continuing through
>>> 2.6.14.  After switching to 2.6.15 the lockups have continued with the
>>> message "kernel BUG at mm/rmap.c:486".
>>>
>>> The frequency and circumstance are completely random which originally
>>> had me suspecting bad memory but after running Memtest86+ for over 12
>>> hours without error I'm at a loss.
>>>
>>> I'm running the binary Nvidia driver so I'll understand if I can't get
>>> help here but in searching through the list archives it would seem I'm
>>> not alone and I am willing to try any patches that may help diagnose the
>>> issue.  The crash happens at least daily and I've seen no difference in
>>> running kernels with or without PREEMPT enabled.
>>>
>>
>> If you don't actually *need* accelerated 3D (or if you could do
>> without it for a while), switching to the "nv" driver for a few
>> days/weeks would be interresting. If the crashes go away that would
>> point towards the nvidia driver, if they don't go away we'll get a
>> nice untainted crash report.
>>
> 
> Thanks to all for the response.  In hopes of helping to isolate this I 
> will move back over to the "nv" driver to see if I can recreate the 
> problem and get a clean bug report before applying Hugh's patch.
> 
> This crash currently happens daily for me so I should be able to test 
> this relatively quickly.
> -Ken

Unfortunately it seems that the "nv" driver in Xorg does not currently 
support multiple displays on a single video card with dual heads.  Not 
being able to at least run xinerama is a deal breaker for me so I'm back 
to the binary nvidia driver using twinview.  At this point I will apply 
Hugh's patch and post any further "Bad page state" and "Bad rmap"
messages as instructed.

Thanks,
Ken


^ permalink raw reply	[flat|nested] 21+ messages in thread

* Re: PROBLEM: kernel BUG at mm/rmap.c:486 - kernel 2.6.15-r1
  2006-01-30  0:56     ` Ken MacFerrin
@ 2006-01-30 16:46       ` Alistair John Strachan
  0 siblings, 0 replies; 21+ messages in thread
From: Alistair John Strachan @ 2006-01-30 16:46 UTC (permalink / raw)
  To: Ken MacFerrin; +Cc: Jesper Juhl, hugh, linux-kernel

On Monday 30 January 2006 00:56, Ken MacFerrin wrote:
[snip]
> Unfortunately it seems that the "nv" driver in Xorg does not currently
> support multiple displays on a single video card with dual heads.  Not
> being able to at least run xinerama is a deal breaker for me so I'm back
> to the binary nvidia driver using twinview.  At this point I will apply
> Hugh's patch and post any further "Bad page state" and "Bad rmap"
> messages as instructed.

Indeed, Hugh's patch is the best way to proceed with discovering who the 
culprits are. Using the "nv" driver was not really meant as a debugging aide, 
but as a last resort if the patch yields nothing useful.

-- 
Cheers,
Alistair.

'No sense being pessimistic, it probably wouldn't work anyway.'
Third year Computer Science undergraduate.
1F2 55 South Clerk Street, Edinburgh, UK.

^ permalink raw reply	[flat|nested] 21+ messages in thread

* Re: PROBLEM: kernel BUG at mm/rmap.c:486 - kernel 2.6.15-r1
  2006-01-28 15:48 ` Hugh Dickins
@ 2006-02-02  1:17   ` Ken MacFerrin
  2006-02-02 15:54     ` Hugh Dickins
  0 siblings, 1 reply; 21+ messages in thread
From: Ken MacFerrin @ 2006-02-02  1:17 UTC (permalink / raw)
  To: Hugh Dickins; +Cc: linux-kernel

Hugh Dickins wrote:
> On Fri, 27 Jan 2006, Ken MacFerrin wrote:
> 
>>I started getting hard lockups on my desktop PC with the error "kernel BUG at
>>mm/rmap.c:487" starting with kernel 2.6.13 and continuing through 2.6.14.
>>After switching to 2.6.15 the lockups have continued with the message "kernel
>>BUG at mm/rmap.c:486".
> 
> 
> That's progress, we're hoping to get it to vanish at line 0 eventually ;)
> 
> 
>>The frequency and circumstance are completely random which originally had me
>>suspecting bad memory but after running Memtest86+ for over 12 hours without
>>error I'm at a loss.
>>
>>I'm running the binary Nvidia driver so I'll understand if I can't get help
>>here but in searching through the list archives it would seem I'm not alone
>>and I am willing to try any patches that may help diagnose the issue.  The
>>crash happens at least daily and I've seen no difference in running kernels
>>with or without PREEMPT enabled.
>>
>>The machine is a P4 3.00GHz with 2048MB PC3200 Unbuffered RAM on an ASUS
>>motherboard with an ICH5 chipset.  XFX GF 6600GT video card, 600W power supply
>>and plenty of cooling.
> 
> 
> You raise several worthwhile points there, I needn't repeat them back to you.
> 
> Here's the 2.6.15 version of the patch I traditionally send out for this
> (smaller than for earlier releases because of several advances in 2.6.15).
> 
> Please apply and let me know of all "Bad page state" and "Bad rmap"
> messages you get.  Our record at nailing these problems is not good,
> but the patch should at least let you go on running for much longer.
> 
> Thanks,
> Hugh


Well, unfortunately I'm back again.  I applied your patch last night but 
had another crash again today.  My X session crashed and dropped me into 
the console, which then froze, requiring a hard reboot.  I was only able 
to capture the output below because of remote logging.  This time I did 
not get the specific "BUG at mm/rmap.c" message I had in my previous 
report but do have some "Bad rmap..." messages as you can see below.

Again, I'm happy to test any patches or suggestions for isolating the 
problem.

Thanks,
Ken

-------------------------------------
### /var/log/messages output ###
-------------------------------------
Feb  1 17:01:01 mm-home1 cron[31322]: (root) CMD (/usr/bin/updatedb)
Feb  1 17:04:13 mm-home1 __find_get_block_slow() failed. block=1410, 
b_blocknr=71213169107797378
Feb  1 17:04:13 mm-home1 b_state=0x00000029, b_size=4096
Feb  1 17:04:13 mm-home1 device blocksize: 4096
Feb  1 17:04:13 mm-home1 Unable to handle kernel NULL pointer 
dereference at virtual address 00000000
Feb  1 17:04:13 mm-home1 printing eip:
Feb  1 17:04:13 mm-home1 c019fe9b
Feb  1 17:04:13 mm-home1 *pde = 00000000
Feb  1 17:04:13 mm-home1 Oops: 0000 [#1]
Feb  1 17:04:13 mm-home1 PREEMPT SMP
Feb  1 17:04:13 mm-home1 Modules linked in: ipt_limit iptable_mangle 
ipt_LOG ipt_MASQUERADE ip_nat ipt_TOS ipt_REJECT ip_conntrack_irc 
ip_conntrack_ftp ipt_state ip_conntrack iptable_filter ip_tables 
snd_seq_midi snd_emu10k1_synth snd_emux_synth snd_seq_virmidi 
snd_seq_midi_emul snd_emu10k1 snd_rawmidi snd_ac97_codec snd_ac97_bus 
snd_util_mem snd_hwdep eth1394 nls_utf8 rfcomm bnep l2cap bluetooth 
dv1394 video1394 raw1394 ohci1394 ieee1394 3c59x marvell loop nvidia 
ntfs rtc tsdev
Feb  1 17:04:13 mm-home1 CPU:    0
Feb  1 17:04:13 mm-home1 EIP:    0060:[<c019fe9b>]    Tainted: P      VLI
Feb  1 17:04:13 mm-home1 EFLAGS: 00210282   (2.6.15-gentoo-r1)
Feb  1 17:04:13 mm-home1 EIP is at flush_commit_list+0x229/0x3ef
Feb  1 17:04:13 mm-home1 eax: 00000000   ebx: f8825000   ecx: c15274cc 
  edx: 00000000
Feb  1 17:04:13 mm-home1 esi: e227ae00   edi: f8825000   ebp: 0000000d 
  esp: f7e11e1c
Feb  1 17:04:13 mm-home1 ds: 007b   es: 007b   ss: 0068
Feb  1 17:04:13 mm-home1 Process pdflush (pid: 164, threadinfo=f7e10000 
task=f7c15030)
Feb  1 17:04:13 mm-home1 Stack: f781e000 f7856c00 f781e000 f7856c00 
00000000 00000000 00000000 00000000
Feb  1 17:04:13 mm-home1 ead379c8 00002000 f8825000 c01a3e62 f784f800 
e227ae00 00000001 00002032
Feb  1 17:04:13 mm-home1 00000000 c17d4f18 f7856c00 f7a97f0c 000003fa 
0011fcb3 00000046 e227ae00
Feb  1 17:04:13 mm-home1 Call Trace:
Feb  1 17:04:13 mm-home1 [<c01a3e62>] do_journal_end+0x880/0x8b6
Feb  1 17:04:13 mm-home1 [<c013b7ee>] pdflush+0x0/0x32
Feb  1 17:04:13 mm-home1 [<c01a2e46>] journal_end_sync+0x61/0x67
Feb  1 17:04:13 mm-home1 [<c0194276>] reiserfs_sync_fs+0x31/0x56
Feb  1 17:04:13 mm-home1 [<c01942a6>] reiserfs_write_super+0xb/0xe
Feb  1 17:04:13 mm-home1 [<c0155e3a>] sync_supers+0x79/0xdc
Feb  1 17:04:13 mm-home1 [<c013b018>] wb_kupdate+0x21/0xe2
Feb  1 17:04:13 mm-home1 [<c013b759>] __pdflush+0xe9/0x17e
Feb  1 17:04:13 mm-home1 [<c013b81b>] pdflush+0x2d/0x32
Feb  1 17:04:13 mm-home1 [<c013aff7>] wb_kupdate+0x0/0xe2
Feb  1 17:04:13 mm-home1 [<c0129a6f>] kthread+0x7c/0xa6
Feb  1 17:04:13 mm-home1 [<c01299f3>] kthread+0x0/0xa6
Feb  1 17:04:13 mm-home1 [<c0100ef5>] kernel_thread_helper+0x5/0xb
Feb  1 17:04:13 mm-home1 Code: 73 14 8b 44 24 04 89 d1 8b 54 24 30 03 4b 
0c 8b 58 0c ff 72 0c 89 c8 99 52 51 ff 73 10 e8 19 27 fb ff 89 44 24 28 
89 c2 83 c4 10 <8b> 00 a8 04 75 07 8b 42 0c 85 c0 75 07 52 e8 db 14 fb 
ff 58 8b
Feb  1 17:04:13 mm-home1 Badness in do_exit at kernel/exit.c:796
Feb  1 17:04:13 mm-home1 [<c011ad1e>] do_exit+0x38/0x379
Feb  1 17:04:13 mm-home1 [<c01039fc>] do_trap+0x0/0xc1
Feb  1 17:04:13 mm-home1 [<c0111ecf>] do_page_fault+0x377/0x4a9
Feb  1 17:04:13 mm-home1 [<c0118f79>] printk+0xe/0x11
Feb  1 17:04:13 mm-home1 [<c0111b58>] do_page_fault+0x0/0x4a9
Feb  1 17:04:13 mm-home1 [<c01033d7>] error_code+0x4f/0x54
Feb  1 17:04:13 mm-home1 [<c015007b>] wait_on_retry_sync_kiocb+0x6/0x38
Feb  1 17:04:13 mm-home1 [<c019fe9b>] flush_commit_list+0x229/0x3ef
Feb  1 17:04:13 mm-home1 [<c01a3e62>] do_journal_end+0x880/0x8b6
Feb  1 17:04:13 mm-home1 [<c013b7ee>] pdflush+0x0/0x32
Feb  1 17:04:13 mm-home1 [<c01a2e46>] journal_end_sync+0x61/0x67
Feb  1 17:04:13 mm-home1 [<c0194276>] reiserfs_sync_fs+0x31/0x56
Feb  1 17:04:13 mm-home1 [<c01942a6>] reiserfs_write_super+0xb/0xe
Feb  1 17:04:13 mm-home1 [<c0155e3a>] sync_supers+0x79/0xdc
Feb  1 17:04:13 mm-home1 [<c013b018>] wb_kupdate+0x21/0xe2
Feb  1 17:04:13 mm-home1 [<c013b759>] __pdflush+0xe9/0x17e
Feb  1 17:04:13 mm-home1 [<c013b81b>] pdflush+0x2d/0x32
Feb  1 17:04:13 mm-home1 [<c013aff7>] wb_kupdate+0x0/0xe2
Feb  1 17:04:13 mm-home1 [<c0129a6f>] kthread+0x7c/0xa6
Feb  1 17:04:13 mm-home1 [<c01299f3>] kthread+0x0/0xa6
Feb  1 17:04:13 mm-home1 [<c0100ef5>] kernel_thread_helper+0x5/0xb
Feb  1 17:04:14 mm-home1 kdm[10322]: X server for display :0 terminated 
unexpectedly
Feb  1 17:04:14 mm-home1 su(pam_unix)[11478]: session closed for user root
Feb  1 17:04:14 mm-home1 Bad rmap: page c1ee7ee0 flags c0000014 count 1 
mapcount 0
Feb  1 17:04:14 mm-home1 firefox-bin addr b5313000 ptpfn 69515 vm_flags 
100077
Feb  1 17:04:14 mm-home1 page mapping 00000000 95d4 vma mapping 00000000 
b5313
Feb  1 17:04:14 mm-home1 kde(pam_unix)[10326]: session closed for user krm
Feb  1 17:04:19 mm-home1 Unable to handle kernel paging request at 
virtual address 00180000
Feb  1 17:04:19 mm-home1 printing eip:
Feb  1 17:04:19 mm-home1 c0135fa7
Feb  1 17:04:19 mm-home1 *pde = 00000000
Feb  1 17:04:19 mm-home1 Oops: 0000 [#2]
Feb  1 17:04:19 mm-home1 PREEMPT SMP
Feb  1 17:04:19 mm-home1 Modules linked in: ipt_limit iptable_mangle 
ipt_LOG ipt_MASQUERADE ip_nat ipt_TOS ipt_REJECT ip_conntrack_irc 
ip_conntrack_ftp ipt_state ip_conntrack iptable_filter ip_tables 
snd_seq_midi snd_emu10k1_synth snd_emux_synth snd_seq_virmidi 
snd_seq_midi_emul snd_emu10k1 snd_rawmidi snd_ac97_codec snd_ac97_bus 
snd_util_mem snd_hwdep eth1394 nls_utf8 rfcomm bnep l2cap bluetooth 
dv1394 video1394 raw1394 ohci1394 ieee1394 3c59x marvell loop nvidia 
ntfs rtc tsdev
Feb  1 17:04:19 mm-home1 CPU:    1
Feb  1 17:04:19 mm-home1 EIP:    0060:[<c0135fa7>]    Tainted: P      VLI
Feb  1 17:04:19 mm-home1 EFLAGS: 00210097   (2.6.15-gentoo-r1)
Feb  1 17:04:19 mm-home1 EIP is at find_get_pages+0x33/0x54
Feb  1 17:04:19 mm-home1 eax: c000086c   ebx: 00000002   ecx: 00000001 
  edx: 00180000
Feb  1 17:04:19 mm-home1 esi: f59908f8   edi: f7d8fe58   ebp: 00000000 
  esp: f7d8fe18
Feb  1 17:04:19 mm-home1 ds: 007b   es: 007b   ss: 0068
Feb  1 17:04:19 mm-home1 Process kswapd0 (pid: 165, threadinfo=f7d8e000 
task=c23b5550)
Feb  1 17:04:19 mm-home1 Stack: f7d8fe50 00000000 00000031 c013e91f 
f59908f8 00000000 0000000e f7d8fe58
Feb  1 17:04:19 mm-home1 f5990840 c013ed61 f7d8fe50 f59908f8 00000000 
0000000e 00000000 00000000
Feb  1 17:04:19 mm-home1 c21f51c4 00180000 c04c8328 00200046 f65783e8 
f65783f0 0000001e 00200046
Feb  1 17:04:19 mm-home1 Call Trace:
Feb  1 17:04:19 mm-home1 [<c013e91f>] pagevec_lookup+0x1a/0x21
Feb  1 17:04:19 mm-home1 [<c013ed61>] invalidate_mapping_pages+0xa0/0xb5
Feb  1 17:04:19 mm-home1 [<c011cf66>] irq_exit+0x32/0x3d
Feb  1 17:04:19 mm-home1 [<c010330c>] apic_timer_interrupt+0x1c/0x24
Feb  1 17:04:19 mm-home1 [<c013ed83>] invalidate_inode_pages+0xd/0x11
Feb  1 17:04:19 mm-home1 [<c016630d>] prune_icache+0xca/0x17f
Feb  1 17:04:19 mm-home1 [<c01663da>] shrink_icache_memory+0x18/0x30
Feb  1 17:04:19 mm-home1 [<c013f150>] shrink_slab+0x13a/0x1a7
Feb  1 17:04:19 mm-home1 [<c0140285>] balance_pgdat+0x20a/0x320
Feb  1 17:04:19 mm-home1 [<c03bc665>] schedule+0xa59/0xac0
Feb  1 17:04:19 mm-home1 [<c0140493>] kswapd+0xf8/0xfd
Feb  1 17:04:19 mm-home1 [<c0129e42>] autoremove_wake_function+0x0/0x3a
Feb  1 17:04:19 mm-home1 [<c010278e>] ret_from_fork+0x6/0x14
Feb  1 17:04:19 mm-home1 [<c0129e42>] autoremove_wake_function+0x0/0x3a
Feb  1 17:04:19 mm-home1 [<c014039b>] kswapd+0x0/0xfd
Feb  1 17:04:19 mm-home1 [<c0100ef5>] kernel_thread_helper+0x5/0xb
Feb  1 17:04:19 mm-home1 Code: 7c 24 1c 8d 46 10 e8 e3 78 28 00 8d 46 04 
ff 74 24 18 ff 74 24 18 57 50 e8 80 71 12 00 31 c9 83 c4 10 89 c3 39 c1 
73 16 8b 14 8f <8b> 02 f6 c4 40 74 03 8b 52 0c f0 ff 42 04 41 39 d9 72 
ea 8d 46
Feb  1 17:04:19 mm-home1 <6>note: kswapd0[165] exited with preempt_count 1
Feb  1 17:04:19 mm-home1 (krm-11020): GConf server is not in use, 
shutting down.
Feb  1 17:04:21 mm-home1 kdm: :0[15897]: Abnormal termination of greeter 
for display :0, code 127, signal 0
Feb  1 17:04:29 mm-home1 login(pam_unix)[10286]: session opened for user 
root by LOGIN(uid=0)
Feb  1 17:06:45 mm-home1 __find_get_block_slow() failed. block=1681, 
b_blocknr=23362423066986129
Feb  1 17:06:45 mm-home1 b_state=0x00000029, b_size=4096
-----------------------------
### EOF ###

^ permalink raw reply	[flat|nested] 21+ messages in thread

* Re: PROBLEM: kernel BUG at mm/rmap.c:486 - kernel 2.6.15-r1
  2006-02-02  1:17   ` Ken MacFerrin
@ 2006-02-02 15:54     ` Hugh Dickins
  2006-02-02 21:31       ` Ken MacFerrin
  0 siblings, 1 reply; 21+ messages in thread
From: Hugh Dickins @ 2006-02-02 15:54 UTC (permalink / raw)
  To: Ken MacFerrin; +Cc: linux-kernel

On Wed, 1 Feb 2006, Ken MacFerrin wrote:
> 
> Well, unfortunately I'm back again.  I applied your patch last night but had
> another crash again today.  My X session crashed and dropped me into the
> console, which then froze, requiring a hard reboot.  I was only able to
> capture the output below because of remote logging.  This time I did not get
> the specific "BUG at mm/rmap.c" message I had in my previous report

Yes, that's replaced by "Bad rmap..." by my patch.

> but do have some "Bad rmap..." messages as you can see below.

Which in many cases allow the system to continue undisturbed;
but unfortunately not in your case, which is in a nastier state.
And only one "Bad rmap...", so not a lot I could glean from it.

> Feb  1 17:01:01 mm-home1 cron[31322]: (root) CMD (/usr/bin/updatedb)

Okay, so plenty of disk and cache activity then.
Were you doing anything interesting at the graphics end?

> Feb  1 17:04:13 mm-home1 __find_get_block_slow() failed. block=1410,
> b_blocknr=71213169107797378

Or in hex, block=0x582 b_blocknr=0x00fd000000000582: something has
corrupted the upper short of the bufheader's block number with 0xfd.

> Feb  1 17:04:13 mm-home1 Unable to handle kernel NULL pointer dereference at
> virtual address 00000000
> Feb  1 17:04:13 mm-home1 EIP is at flush_commit_list+0x229/0x3ef
> Feb  1 17:04:13 mm-home1 Process pdflush (pid: 164, threadinfo=f7e10000
> task=f7c15030)

And ReiserFS is justifiably surprised that no bufheader could be
found for one of its journal pages.

> Feb  1 17:04:13 mm-home1 Badness in do_exit at kernel/exit.c:796

Concomitant fallout from the above fault.

> Feb  1 17:04:14 mm-home1 kdm[10322]: X server for display :0 terminated
> unexpectedly

Nothing to say why that was, but we already know the system is bad.

> Feb  1 17:04:14 mm-home1 Bad rmap: page c1ee7ee0 flags c0000014 count 1
> mapcount 0
> Feb  1 17:04:14 mm-home1 firefox-bin addr b5313000 ptpfn 69515 vm_flags 100077
> Feb  1 17:04:14 mm-home1 page mapping 00000000 95d4 vma mapping 00000000 b5313

A page is being unmapped which was not recorded as being mapped.  Could be
page table corruption.  I'd been hoping for a sequence of these, and would
then have looked for some commonality, but it's an isolated occurrence.
Probably related to the bufheader corruption.

> Feb  1 17:04:19 mm-home1 Unable to handle kernel paging request at virtual
> address 00180000
> Feb  1 17:04:19 mm-home1 Process kswapd0 (pid: 165, threadinfo=f7d8e000 
> Feb  1 17:04:19 mm-home1 EIP is at find_get_pages+0x33/0x54

The radix-tree lookup found 0x00180000 where it should have found a struct
page pointer or NULL: something has corrupted the upper short with 0x18.

> Feb  1 17:04:19 mm-home1 <6>note: kswapd0[165] exited with preempt_count 1

Concomitant fallout from the above fault.

> Feb  1 17:04:19 mm-home1 (krm-11020): GConf server is not in use, shutting
> down.
> Feb  1 17:04:21 mm-home1 kdm: :0[15897]: Abnormal termination of greeter for
> display :0, code 127, signal 0

Things are getting worse.

> Feb  1 17:04:29 mm-home1 login(pam_unix)[10286]: session opened for user root
> by LOGIN(uid=0)
> Feb  1 17:06:45 mm-home1 __find_get_block_slow() failed. block=1681,
> b_blocknr=23362423066986129

Or in hex, block=0x691 b_blocknr=0x0053000000000691: something has
corrupted the upper short of the bufheader's block number with 0x53.

Well, you're getting plenty of memory corruption, and there's some pattern
to it (bits 8-11 each time), but I can't guess where it's coming from,
I'm afraid.  The "Bad rmap", my speciality, looks merely incidental
to the more general memory corruption.

I know you already said you really need to use the nVidia driver for
xinerama, but it has to be suspect #1.  Any chance of doing without it
just for a day, to see what happens then?  Or would that force you into
such a different work pattern that it would prove nothing?

After that, the next thing to try is going back to 2.6.12: I think you
said this bad behaviour started with 2.6.13 (but I may be quite wrong
to assume that you were running 2.6.12 before).  Perhaps the problem
lies with your hardware, but started to manifest around the time you
moved to 2.6.13, we do need to rule that out.

Hugh

^ permalink raw reply	[flat|nested] 21+ messages in thread

* Re: PROBLEM: kernel BUG at mm/rmap.c:486 - kernel 2.6.15-r1
  2006-02-02 15:54     ` Hugh Dickins
@ 2006-02-02 21:31       ` Ken MacFerrin
  0 siblings, 0 replies; 21+ messages in thread
From: Ken MacFerrin @ 2006-02-02 21:31 UTC (permalink / raw)
  To: Hugh Dickins; +Cc: linux-kernel, dspring

Hugh Dickins wrote:

>>Feb  1 17:01:01 mm-home1 cron[31322]: (root) CMD (/usr/bin/updatedb) 
> Okay, so plenty of disk and cache activity then.
> Were you doing anything interesting at the graphics end?

Nope.. just had a couple vncviewer sessions, firefox, thunderbird, a few 
superkarmba applets and a couple konsole windows.  I'm typically running 
KDE 3.5.0 on two 19" flatpanels via the DVI ports on a dual-head GF 
6600GT card.

One thing I have noticed in the past is I would often get the crash as 
soon as I resumed from a locked screen.  Xscreensaver is set to kick on 
after 20 mintues and the screensaver would be running fine when I sit 
back down, but as soon as I gave a mouse/keyboard input it would lockup 
with a garbled screen.  This time however I was actively using the 
machine when it crashed.

[snip]

>>Feb  1 17:04:14 mm-home1 kdm[10322]: X server for display :0 terminated
>>unexpectedly
> 
> Nothing to say why that was, but we already know the system is bad.

Yep, this is when I get the garbled screen.  Sometimes it will stop 
responding to any input at this point, others it allow me to 
Ctrl+Alt+F10 into the console.  This time I was able to drop to console.

[snip]

>>Feb  1 17:04:29 mm-home1 login(pam_unix)[10286]: session opened for user root
>>by LOGIN(uid=0)
>>Feb  1 17:06:45 mm-home1 __find_get_block_slow() failed. block=1681,
>>b_blocknr=23362423066986129

This was after dropping to console.  It let me login to root but before 
being able to view the logs it started spewing out strings of errors 
that are scrolling too quickly to read and do not get captured by syslog 
(either local or remote).  I've learned from experience that this is the 
time to do a hard reboot or it starts trashing up the filesystem.  I've 
had to run "fsck-reiserfs --rebuild-tree" more times than I'd prefer.

> 
> Or in hex, block=0x691 b_blocknr=0x0053000000000691: something has
> corrupted the upper short of the bufheader's block number with 0x53.
> 
> Well, you're getting plenty of memory corruption, and there's some pattern
> to it (bits 8-11 each time), but I can't guess where it's coming from,
> I'm afraid.  The "Bad rmap", my speciality, looks merely incidental
> to the more general memory corruption.
> 
> I know you already said you really need to use the nVidia driver for
> xinerama, but it has to be suspect #1.  Any chance of doing without it
> just for a day, to see what happens then?  Or would that force you into
> such a different work pattern that it would prove nothing?
> 
> After that, the next thing to try is going back to 2.6.12: I think you
> said this bad behaviour started with 2.6.13 (but I may be quite wrong
> to assume that you were running 2.6.12 before).  Perhaps the problem
> lies with your hardware, but started to manifest around the time you
> moved to 2.6.13, we do need to rule that out.
> 
> Hugh

I agree. I will run with the kernel "nv" driver on a single monitor over 
the weekend to see if I can recreate the problem.  Failing that I'll 
give 2.6.12 another shot.  A couple other datapoints that may be worth note:

1) David Spring posted the following message on this thread yesterday 
that would seem to point away from the binary nvidia driver:

"It's not the nv drivers - or at least not just them.  I'm getting this 
bug once or twice a day on a mini-ITX (C3 533Mhz processor) based server 
which doesn't even have X installed.  For me, it appeared sometime after 
2.6.12.  I'm now running with gentoo 2.6.15-r1 with Hugh's recently 
posted patch,and waiting 8-|
Dave Spring"

If Dave is able to post a syslog with his errors then it would provide 
an untainted report.


2) I have also found this thread from the Nvidia forum that would seem 
to point towards the nvidia driver.  Although unlike this person, whose 
troubles only started with 2.6.15-rc3, I have experienced this bug since 
the 2.6.13 series.
http://www.nvnews.net/vbulletin/archive/index.php/t-60711.html

Thanks,
Ken

^ permalink raw reply	[flat|nested] 21+ messages in thread

* Re: PROBLEM: kernel BUG at mm/rmap.c:486 - kernel 2.6.15-r1
  2006-01-28 19:13 ` Alistair John Strachan
@ 2006-03-12  0:06   ` Patrick B�rjesson
  2006-03-12  2:06     ` Alistair John Strachan
                       ` (2 more replies)
  0 siblings, 3 replies; 21+ messages in thread
From: Patrick B�rjesson @ 2006-03-12  0:06 UTC (permalink / raw)
  To: linux-kernel; +Cc: Alistair John Strachan

[-- Attachment #1: Type: text/plain, Size: 16399 bytes --]

> Just to let you know, I've had the same problem on x86-64. It's an
> incredibly rare fault here and I've not been able to reproduce it.
> However, I cannot help but notice that all of the reporters so far
> have been running the binary NVIDIA driver, including myself.
> 
> I would not be surprised if running without the NVIDIA driver
> eliminated the problem.

Not running either with the NVIDIA driver or with x86-64 on the machine
I'm getting this on, but I get it fairly often (as in: today I've
probably gotten it at least 5-10 times). It seems it's pretty bound by
either high CPU or disk usage, since I've always gotten it while
compiling stuff so far. Although my system doesn't hard lock if I get
this error; I can at least run most commands and ssh into it.

Not sure if everything in the dmesg output is relevant, but one can
never be sure, so I'm pasting the entire one. Flame on if I'm spamming
you ;)

Oh btw. I'm not subscribed to the mailinglist, so if you'd be kind
enough to CC me I'd be grateful.

Hope this helps,
Patrick B�rjesson

---------- dmesg output ----------
Linux version 2.6.15-gentoo-r7 (root@scrapheap) (gcc version 3.4.5 (Gentoo 3.4.5-r1, ssp-3.4.5-1.0, pie-8.7.9)) #3 PREEMPT Sat Mar 11 23:46:29 CET 2006
BIOS-provided physical RAM map:
 BIOS-e820: 0000000000000000 - 000000000009fc00 (usable)
 BIOS-e820: 000000000009fc00 - 00000000000a0000 (reserved)
 BIOS-e820: 00000000000f0000 - 0000000000100000 (reserved)
 BIOS-e820: 0000000000100000 - 000000001fffc000 (usable)
 BIOS-e820: 000000001fffc000 - 000000001ffff000 (ACPI data)
 BIOS-e820: 000000001ffff000 - 0000000020000000 (ACPI NVS)
 BIOS-e820: 00000000fec00000 - 00000000fec01000 (reserved)
 BIOS-e820: 00000000fee00000 - 00000000fee01000 (reserved)
 BIOS-e820: 00000000ffff0000 - 0000000100000000 (reserved)
511MB LOWMEM available.
On node 0 totalpages: 131068
  DMA zone: 4096 pages, LIFO batch:0
  DMA32 zone: 0 pages, LIFO batch:0
  Normal zone: 126972 pages, LIFO batch:31
  HighMem zone: 0 pages, LIFO batch:0
DMI 2.3 present.
ACPI: RSDP (v000 ASUS                                  ) @ 0x000f5e40
ACPI: RSDT (v001 ASUS   A7S333   0x42302e31 MSFT 0x31313031) @ 0x1fffc000
ACPI: FADT (v001 ASUS   A7S333   0x42302e31 MSFT 0x31313031) @ 0x1fffc0b2
ACPI: BOOT (v001 ASUS   A7S333   0x42302e31 MSFT 0x31313031) @ 0x1fffc030
ACPI: MADT (v001 ASUS   A7S333   0x42302e31 MSFT 0x31313031) @ 0x1fffc058
ACPI: DSDT (v001   ASUS A7S333   0x00001000 MSFT 0x0100000b) @ 0x00000000
Allocating PCI resources starting at 30000000 (gap: 20000000:dec00000)
Built 1 zonelists
Kernel command line: root=/dev/hdc3 udev video=vesafb:ywrap,mtrr,1024x768-16@85 splash=silent,theme:emergence CONSOLE=/dev/tty1 quiet
Initializing CPU#0
PID hash table entries: 2048 (order: 11, 32768 bytes)
Detected 1311.863 MHz processor.
Using tsc for high-res timesource
Console: colour VGA+ 80x25
Dentry cache hash table entries: 131072 (order: 7, 524288 bytes)
Inode-cache hash table entries: 65536 (order: 6, 262144 bytes)
Memory: 514508k/524272k available (2480k kernel code, 9208k reserved, 684k data, 168k init, 0k highmem)
Checking if this processor honours the WP bit even in supervisor mode... Ok.
Calibrating delay using timer specific routine.. 2627.25 BogoMIPS (lpj=5254503)
Mount-cache hash table entries: 512
CPU: After generic identify, caps: 0383fbff c1cbfbff 00000000 00000000 00000000 00000000 00000000
CPU: After vendor identify, caps: 0383fbff c1cbfbff 00000000 00000000 00000000 00000000 00000000
CPU: L1 I Cache: 64K (64 bytes/line), D cache 64K (64 bytes/line)
CPU: L2 Cache: 64K (64 bytes/line)
CPU: After all inits, caps: 0383fbff c1cbfbff 00000000 00000020 00000000 00000000 00000000
Intel machine check architecture supported.
Intel machine check reporting enabled on CPU#0.
mtrr: v2.0 (20020519)
CPU: AMD Duron(tm) Processor stepping 01
Enabling fast FPU save and restore... done.
Enabling unmasked SIMD FPU exception support... done.
Checking 'hlt' instruction... OK.
ACPI: setting ELCR to 0200 (from 0c60)
checking if image is initramfs... it is
Freeing initrd memory: 789k freed
NET: Registered protocol family 16
ACPI: bus type pci registered
PCI: PCI BIOS revision 2.10 entry at 0xf19a0, last bus=1
PCI: Using configuration type 1
ACPI: Subsystem revision 20050902
ACPI: Interpreter enabled
ACPI: Using PIC for interrupt routing
ACPI: PCI Interrupt Link [LNKA] (IRQs 3 4 5 6 7 10 *11 12 14 15)
ACPI: PCI Interrupt Link [LNKB] (IRQs 3 4 5 6 7 *10 11 12 14 15)
ACPI: PCI Interrupt Link [LNKC] (IRQs 3 4 5 6 7 10 11 12 14 15) *0, disabled.
ACPI: PCI Interrupt Link [LNKD] (IRQs 3 4 *5 6 7 10 11 12 14 15)
ACPI: PCI Interrupt Link [LNKE] (IRQs 3 4 5 6 7 10 11 12 14 15) *0, disabled.
ACPI: PCI Interrupt Link [LNKF] (IRQs 3 4 5 6 7 10 11 12 14 15) *0, disabled.
ACPI: PCI Interrupt Link [LNKG] (IRQs 3 4 5 6 7 10 11 12 14 15) *0, disabled.
ACPI: PCI Interrupt Link [LNKH] (IRQs 3 4 5 *6 7 10 11 12 14 15)
ACPI: PCI Root Bridge [PCI0] (0000:00)
PCI: Probing PCI hardware (bus 00)
ACPI: Assume root bridge [\_SB_.PCI0] bus is 0
Enabling SiS 96x SMBus.
Boot video device is 0000:01:00.0
ACPI: PCI Interrupt Routing Table [\_SB_.PCI0._PRT]
ACPI: PCI Interrupt Routing Table [\_SB_.PCI0.PCI1._PRT]
Linux Plug and Play Support v0.97 (c) Adam Belay
pnp: PnP ACPI init
pnp: PnP ACPI: found 15 devices
SCSI subsystem initialized
usbcore: registered new driver usbfs
usbcore: registered new driver hub
PCI: Using ACPI for IRQ routing
PCI: If a device doesn't work, try "pci=routeirq".  If it helps, post a report
pnp: 00:01: ioport range 0xe400-0xe47f could not be reserved
pnp: 00:01: ioport range 0xe480-0xe4ff has been reserved
pnp: 00:01: ioport range 0xe600-0xe61f has been reserved
pnp: 00:01: ioport range 0x480-0x48f has been reserved
pnp: 00:0e: ioport range 0x290-0x297 has been reserved
pnp: 00:0e: ioport range 0x500-0x507 has been reserved
PCI: Bridge: 0000:00:01.0
  IO window: disabled.
  MEM window: e7000000-e7ffffff
  PREFETCH window: ef700000-febfffff
PCI: Setting latency timer of device 0000:00:01.0 to 64
Simple Boot Flag at 0x3a set to 0x1
Machine check exception polling timer started.
audit: initializing netlink socket (disabled)
audit(1142120887.156:1): initialized
NTFS driver 2.1.25 [Flags: R/O].
Initializing Cryptographic API
io scheduler noop registered
io scheduler anticipatory registered
io scheduler deadline registered
io scheduler cfq registered
lp: driver loaded but no devices found
Real Time Clock Driver v1.12
Linux agpgart interface v0.101 (c) Dave Jones
agpgart: Detected SiS 745 chipset
agpgart: AGP aperture is 64M @ 0xe8000000
[drm] Initialized drm 1.0.0 20040925
vesafb: unrecognized option mtrr
vesafb: NVidia Corporation, NV17 () Board, Chip Rev A2 (OEM: NVidia)
vesafb: VBE version: 3.0
vesafb: protected mode interface info at c000:ede0
vesafb: pmi: set display start = c00cee25, set palette = c00ceeaa
vesafb: pmi: ports = b4c3 b503 ba03 c003 c103 c403 c503 c603 c703 c803 c903 cc03 ce03 cf03 d003 d103 d203 d303 d403 d503 da03 ff03 
vesafb: VBIOS/hardware supports DDC2 transfers
      Display is GTF capable
vesafb: monitor limits: vf = 160 Hz, hf = 96 kHz, clk = 250 MHz
vesafb: scrolling: ywrap using protected mode interface, yres_virtual=3750
Console: switching to colour frame buffer device 128x48
fbsplash: console 0 using theme 'emergence'
fbsplash: switched splash state to 'on' on console 0
vesafb: framebuffer at 0xf0000000, mapped to 0xe0880000, using 7500k, total 65536k
fb0: VESA VGA frame buffer device
ACPI: Power Button (FF) [PWRF]
ACPI: Power Button (CM) [PWRB]
PNP: PS/2 Controller [PNP0303:PS2K,PNP0f13:PS2M] at 0x60,0x64 irq 1,12
serio: i8042 AUX port at 0x60,0x64 irq 12
serio: i8042 KBD port at 0x60,0x64 irq 1
Serial: 8250/16550 driver $Revision: 1.90 $ 4 ports, IRQ sharing disabled
serial8250: ttyS0 at I/O 0x3f8 (irq = 4) is a 16550A
serial8250: ttyS1 at I/O 0x2f8 (irq = 3) is a 16550A
00:0a: ttyS0 at I/O 0x3f8 (irq = 4) is a 16550A
00:0b: ttyS1 at I/O 0x2f8 (irq = 3) is a 16550A
parport: PnPBIOS parport detected.
parport0: PC-style at 0x378 (0x778), irq 7 [PCSPP(,...)]
lp0: using parport0 (interrupt-driven).
mice: PS/2 mouse device common for all mice
input: AT Translated Set 2 keyboard as /class/input/input0
input: ImExPS/2 Generic Explorer Mouse as /class/input/input1
floppy0: no floppy controllers found
RAMDISK driver initialized: 16 RAM disks of 4096K size 1024 blocksize
loop: loaded (max 8 devices)
ne2k-pci.c:v1.03 9/22/2003 D. Becker/P. Gortmaker
  http://www.scyld.com/network/ne2k-pci.html
ACPI: PCI Interrupt Link [LNKD] enabled at IRQ 5
PCI: setting IRQ 5 as level-triggered
ACPI: PCI Interrupt 0000:00:0c.0[A] -> Link [LNKD] -> GSI 5 (level, low) -> IRQ 5
eth0: RealTek RTL-8029 found at 0xa800, IRQ 5, 00:00:21:E4:85:F0.
Uniform Multi-Platform E-IDE driver Revision: 7.00alpha2
ide: Assuming 33MHz system bus speed for PIO modes; override with idebus=xx
SIS5513: IDE controller at PCI slot 0000:00:02.5
SIS5513: chipset revision 208
SIS5513: not 100% native mode: will probe irqs later
SIS5513: SiS745 ATA 100 (2nd gen) controller
    ide0: BM-DMA at 0xd800-0xd807, BIOS settings: hda:DMA, hdb:DMA
    ide1: BM-DMA at 0xd808-0xd80f, BIOS settings: hdc:DMA, hdd:DMA
Probing IDE interface ide0...
hda: Maxtor 91000D8, ATA DISK drive
hdb: WDC WD400BB-60AUA1, ATA DISK drive
ide0 at 0x1f0-0x1f7,0x3f6 on irq 14
Probing IDE interface ide1...
hdc: Maxtor 6E040L0, ATA DISK drive
hdd: SONY CD-RW CRX225E, ATAPI CD/DVD-ROM drive
ide1 at 0x170-0x177,0x376 on irq 15
hda: max request size: 128KiB
hda: 19545120 sectors (10007 MB) w/512KiB Cache, CHS=19390/16/63, UDMA(33)
hda: cache flushes not supported
 hda: hda1
hdb: max request size: 128KiB
hdb: 78165360 sectors (40020 MB) w/2048KiB Cache, CHS=65535/16/63, UDMA(100)
hdb: cache flushes not supported
 hdb: hdb1 hdb2 < hdb5 hdb6 >
hdc: max request size: 128KiB
hdc: 80293248 sectors (41110 MB) w/2048KiB Cache, CHS=65535/16/63, UDMA(33)
hdc: cache flushes supported
 hdc: hdc1 hdc2 hdc3 hdc4
hdd: ATAPI 40X CD-ROM CD-R/RW drive, 2048kB Cache, UDMA(33)
Uniform CD-ROM driver Revision: 3.20
libata version 1.20 loaded.
usbcore: registered new driver usblp
drivers/usb/class/usblp.c: v0.13: USB Printer Device Class driver
Initializing USB Mass Storage driver...
usbcore: registered new driver usb-storage
USB Mass Storage support registered.
usbcore: registered new driver usbhid
drivers/usb/input/hid-core.c: v2.6:USB HID core driver
NET: Registered protocol family 2
IP route cache hash table entries: 8192 (order: 3, 32768 bytes)
TCP established hash table entries: 32768 (order: 5, 131072 bytes)
TCP bind hash table entries: 32768 (order: 5, 131072 bytes)
TCP: Hash tables configured (established 32768 bind 32768)
TCP reno registered
TCP bic registered
NET: Registered protocol family 1
NET: Registered protocol family 17
Using IPI Shortcut mode
ACPI wakeup devices: 
PCI0 PCI1 PS2K PS2M USB0 USB1 MC97 
ACPI: (supports S0 S1 S4 S5)
EXT3-fs: mounted filesystem with ordered data mode.
VFS: Mounted root (ext3 filesystem) readonly.
Freeing unused kernel memory: 168k freed
kjournald starting.  Commit interval 5 seconds
Adding 999328k swap on /dev/hdc2.  Priority:-1 extents:1 across:999328k
EXT3 FS on hdc3, internal journal
kjournald starting.  Commit interval 5 seconds
EXT3 FS on hdc4, internal journal
EXT3-fs: mounted filesystem with ordered data mode.
ohci_hcd: 2005 April 22 USB 1.1 'Open' Host Controller (OHCI) Driver (PCI)
ACPI: PCI Interrupt 0000:00:02.2[D] -> Link [LNKD] -> GSI 5 (level, low) -> IRQ 5
ohci_hcd 0000:00:02.2: OHCI Host Controller
ohci_hcd 0000:00:02.2: new USB bus registered, assigned bus number 1
ohci_hcd 0000:00:02.2: irq 5, io mem 0xe6800000
hub 1-0:1.0: USB hub found
hub 1-0:1.0: 3 ports detected
ACPI: PCI Interrupt Link [LNKH] enabled at IRQ 6
PCI: setting IRQ 6 as level-triggered
ACPI: PCI Interrupt 0000:00:02.3[A] -> Link [LNKH] -> GSI 6 (level, low) -> IRQ 6
ohci_hcd 0000:00:02.3: OHCI Host Controller
ohci_hcd 0000:00:02.3: new USB bus registered, assigned bus number 2
ohci_hcd 0000:00:02.3: irq 6, io mem 0xe6000000
hub 2-0:1.0: USB hub found
hub 2-0:1.0: 3 ports detected
usb 1-1: new full speed USB device using ohci_hcd and address 2
hub 1-1:1.0: USB hub found
hub 1-1:1.0: 4 ports detected
usb 1-1.4: new full speed USB device using ohci_hcd and address 3
scsi0 : SCSI emulation for USB Mass Storage devices
usb-storage: device found at 3
usb-storage: waiting for device to settle before scanning
  Vendor: Generic   Model: USB SD Reader     Rev: 1.00
  Type:   Direct-Access                      ANSI SCSI revision: 00
sd 0:0:0:0: Attached scsi removable disk sda
sd 0:0:0:0: Attached scsi generic sg0 type 0
  Vendor: Generic   Model: USB CF Reader     Rev: 1.01
  Type:   Direct-Access                      ANSI SCSI revision: 00
sd 0:0:0:1: Attached scsi removable disk sdb
sd 0:0:0:1: Attached scsi generic sg1 type 0
  Vendor: Generic   Model: USB SM Reader     Rev: 1.02
  Type:   Direct-Access                      ANSI SCSI revision: 00
sd 0:0:0:2: Attached scsi removable disk sdc
sd 0:0:0:2: Attached scsi generic sg2 type 0
  Vendor: Generic   Model: USB MS Reader     Rev: 1.03
  Type:   Direct-Access                      ANSI SCSI revision: 00
sd 0:0:0:3: Attached scsi removable disk sdd
sd 0:0:0:3: Attached scsi generic sg3 type 0
usb-storage: device scan complete
------------[ cut here ]------------
kernel BUG at mm/rmap.c:486!
invalid operand: 0000 [#1]
PREEMPT 
Modules linked in: ehci_hcd ohci_hcd
CPU:    0
EIP:    0060:[<c0152fd0>]    Not tainted VLI
EFLAGS: 00010286   (2.6.15-gentoo-r7) 
EIP is at page_remove_rmap+0x30/0x40
eax: ffffffff   ebx: cbb07b8c   ecx: 0001dac2   edx: c13b5840
esi: b7ee3000   edi: c13b5840   ebp: c8f5be30   esp: c8f5bd8c
ds: 007b   es: 007b   ss: 0068
Process i686-pc-linux-g (pid: 5448, threadinfo=c8f5a000 task=de1f1ab0)
Stack: 00003ff4 cbb07b8c c014c0b6 c13b5840 b7ee3000 1dac2025 1dac2025 00000000 
       fffffffb defc31e0 b7fbc000 d3448b80 b7fbc000 c8f5be30 c014c265 c04490b4 
       df6f738c d3448b7c b7ed3000 b7fbc000 c8f5be30 00000000 b7fbbfff d3448b7c 
Call Trace:
 [<c014c0b6>] zap_pte_range+0x156/0x250
 [<c014c265>] unmap_page_range+0xb5/0x140
 [<c014c3df>] unmap_vmas+0xef/0x1f0
 [<c01511c7>] exit_mmap+0x77/0x110
 [<c01165d7>] mmput+0x37/0xb0
 [<c011b38b>] do_exit+0xeb/0x440
 [<c011b754>] do_group_exit+0x34/0xa0
 [<c0125657>] get_signal_to_deliver+0x217/0x360
 [<c010307e>] do_signal+0x6e/0x150
 [<c014d0e5>] do_wp_page+0x1e5/0x2e0
 [<c0167c5c>] vfs_lstat+0x1c/0x70
 [<c0113607>] do_page_fault+0x2a7/0x5a8
 [<c0113360>] do_page_fault+0x0/0x5a8
 [<c0103197>] do_notify_resume+0x37/0x3c
 [<c0103356>] work_notifysig+0x13/0x19
Code: 24 0c 83 42 08 ff 0f 98 c0 84 c0 74 1a 8b 42 08 40 78 18 c7 44 24 04 ff ff ff ff c7 04 24 10 00 00 00 e8 44 ef fe ff 83 c4 08 c3 <0f> 0b e6 01 ef 1f 38 c0 eb de 8d b6 00 00 00 00 83 ec 2c 89 5c 
 <1>Fixing recursive fault but reboot is needed!
scheduling while atomic: i686-pc-linux-g/0x00000002/5448
 [<c036a1a7>] schedule+0x587/0x660
 [<c0103356>] work_notifysig+0x13/0x19
 [<c0152fe5>] try_to_unmap_one+0x5/0x230
 [<c011b52a>] do_exit+0x28a/0x440
 [<c01190b7>] printk+0x17/0x20
 [<c0103ff0>] do_invalid_op+0x0/0xb0
 [<c0103c65>] die+0x185/0x190
 [<c0104092>] do_invalid_op+0xa2/0xb0
 [<c0152fd0>] page_remove_rmap+0x30/0x40
 [<c0176bd5>] dput+0x55/0x290
 [<c016d37e>] __link_path_walk+0xbce/0xf70
 [<c0140ab4>] bad_range+0x34/0x50
 [<c01034ef>] error_code+0x4f/0x54
 [<c0152fd0>] page_remove_rmap+0x30/0x40
 [<c014c0b6>] zap_pte_range+0x156/0x250
 [<c014c265>] unmap_page_range+0xb5/0x140
 [<c014c3df>] unmap_vmas+0xef/0x1f0
 [<c01511c7>] exit_mmap+0x77/0x110
 [<c01165d7>] mmput+0x37/0xb0
 [<c011b38b>] do_exit+0xeb/0x440
 [<c011b754>] do_group_exit+0x34/0xa0
 [<c0125657>] get_signal_to_deliver+0x217/0x360
 [<c010307e>] do_signal+0x6e/0x150
 [<c014d0e5>] do_wp_page+0x1e5/0x2e0
 [<c0167c5c>] vfs_lstat+0x1c/0x70
 [<c0113607>] do_page_fault+0x2a7/0x5a8
 [<c0113360>] do_page_fault+0x0/0x5a8
 [<c0103197>] do_notify_resume+0x37/0x3c
 [<c0103356>] work_notifysig+0x13/0x19

-- 
/  ()  The ASCII Ribbon Campaign - against HTML Email
\  /\   and proprietary formats.

[-- Attachment #2: Type: application/pgp-signature, Size: 191 bytes --]

^ permalink raw reply	[flat|nested] 21+ messages in thread

* Re: PROBLEM: kernel BUG at mm/rmap.c:486 - kernel 2.6.15-r1
  2006-03-12  0:06   ` Patrick B�rjesson
@ 2006-03-12  2:06     ` Alistair John Strachan
  2006-03-12  9:05     ` Arjan van de Ven
  2006-03-12 12:41     ` Nick Piggin
  2 siblings, 0 replies; 21+ messages in thread
From: Alistair John Strachan @ 2006-03-12  2:06 UTC (permalink / raw)
  To: psycho; +Cc: linux-kernel

[-- Warning: decoded text below may be mangled, UTF-8 assumed --]
[-- Attachment #1: Type: text/plain; charset="utf-8", Size: 1276 bytes --]

On Sunday 12 March 2006 00:06, Patrick Börjesson wrote:
> > Just to let you know, I've had the same problem on x86-64. It's an
> > incredibly rare fault here and I've not been able to reproduce it.
> > However, I cannot help but notice that all of the reporters so far
> > have been running the binary NVIDIA driver, including myself.
> >
> > I would not be surprised if running without the NVIDIA driver
> > eliminated the problem.
>
> Not running either with the NVIDIA driver or with x86-64 on the machine
> I'm getting this on, but I get it fairly often (as in: today I've
> probably gotten it at least 5-10 times). It seems it's pretty bound by
> either high CPU or disk usage, since I've always gotten it while
> compiling stuff so far. Although my system doesn't hard lock if I get
> this error; I can at least run most commands and ssh into it.

Please don't do anything! A reproducible test case without NVIDIA loaded is 
exactly what we've been waiting for.

Please search the archives for Hugh Dickins's patch for 2.6.15 which enables 
additional rmap debug. Then try to reproduce the fault.

-- 
Cheers,
Alistair.

'No sense being pessimistic, it probably wouldn't work anyway.'
Third year Computer Science undergraduate.
1F2 55 South Clerk Street, Edinburgh, UK.

^ permalink raw reply	[flat|nested] 21+ messages in thread

* Re: PROBLEM: kernel BUG at mm/rmap.c:486 - kernel 2.6.15-r1
  2006-03-12  0:06   ` Patrick B�rjesson
  2006-03-12  2:06     ` Alistair John Strachan
@ 2006-03-12  9:05     ` Arjan van de Ven
  2006-03-12 13:12       ` Patrick Börjesson
  2006-03-12 12:41     ` Nick Piggin
  2 siblings, 1 reply; 21+ messages in thread
From: Arjan van de Ven @ 2006-03-12  9:05 UTC (permalink / raw)
  To: psycho; +Cc: linux-kernel, Alistair John Strachan

On Sun, 2006-03-12 at 01:06 +0100, Patrick =?utf-8?Q?B=F6rjesson?=
wrote:
> > Just to let you know, I've had the same problem on x86-64. It's an
> > incredibly rare fault here and I've not been able to reproduce it.
> > However, I cannot help but notice that all of the reporters so far
> > have been running the binary NVIDIA driver, including myself.
> > 
> > I would not be surprised if running without the NVIDIA driver
> > eliminated the problem.
> 
> Not running either with the NVIDIA driver or with x86-64 on the machine
> I'm getting this on, but I get it fairly often (as in: today I've
> probably gotten it at least 5-10 times). It seems it's pretty bound by
> either high CPU or disk usage, since I've always gotten it while
> compiling stuff so far. Although my system doesn't hard lock if I get
> this error; I can at least run most commands and ssh into it.


just to rule the last issue out: this machine survives memtest86 ?



^ permalink raw reply	[flat|nested] 21+ messages in thread

* Re: PROBLEM: kernel BUG at mm/rmap.c:486 - kernel 2.6.15-r1
  2006-03-12  0:06   ` Patrick B�rjesson
  2006-03-12  2:06     ` Alistair John Strachan
  2006-03-12  9:05     ` Arjan van de Ven
@ 2006-03-12 12:41     ` Nick Piggin
  2 siblings, 0 replies; 21+ messages in thread
From: Nick Piggin @ 2006-03-12 12:41 UTC (permalink / raw)
  To: psycho; +Cc: linux-kernel, Alistair John Strachan

Patrick wrote:

> ------------[ cut here ]------------
> kernel BUG at mm/rmap.c:486!
> EIP is at page_remove_rmap+0x30/0x40
>  [<c014c0b6>] zap_pte_range+0x156/0x250

This could easily be something that's already been fixed by now,
or it may still be a problem... can you test with the most recent
kernel (2.6.16-rc6)? Thanks.

-- 
SUSE Labs, Novell Inc.
Send instant messages to your online friends http://au.messenger.yahoo.com 

^ permalink raw reply	[flat|nested] 21+ messages in thread

* Re: PROBLEM: kernel BUG at mm/rmap.c:486 - kernel 2.6.15-r1
  2006-03-12  9:05     ` Arjan van de Ven
@ 2006-03-12 13:12       ` Patrick Börjesson
  0 siblings, 0 replies; 21+ messages in thread
From: Patrick Börjesson @ 2006-03-12 13:12 UTC (permalink / raw)
  To: Arjan van de Ven; +Cc: linux-kernel, Alistair John Strachan

[-- Attachment #1: Type: text/plain, Size: 1359 bytes --]

On 2006-03-12 10:05, Arjan van de Ven uttered these thoughts:
> On Sun, 2006-03-12 at 01:06 +0100, Patrick =?utf-8?Q?B=F6rjesson?=
> wrote:
> > > Just to let you know, I've had the same problem on x86-64. It's an
> > > incredibly rare fault here and I've not been able to reproduce it.
> > > However, I cannot help but notice that all of the reporters so far
> > > have been running the binary NVIDIA driver, including myself.
> > > 
> > > I would not be surprised if running without the NVIDIA driver
> > > eliminated the problem.
> > 
> > Not running either with the NVIDIA driver or with x86-64 on the machine
> > I'm getting this on, but I get it fairly often (as in: today I've
> > probably gotten it at least 5-10 times). It seems it's pretty bound by
> > either high CPU or disk usage, since I've always gotten it while
> > compiling stuff so far. Although my system doesn't hard lock if I get
> > this error; I can at least run most commands and ssh into it.
> 
> 
> just to rule the last issue out: this machine survives memtest86 ?

I'm sorry to have wasted your time; no, it didn't =\ 
Didn't think to run it before since the mem-stick in the machine is
pretty new (not more than 2 months). 

Sorry again,
Patrick Börjesson

-- 
/  ()  The ASCII Ribbon Campaign - against HTML Email
\  /\   and proprietary formats.

[-- Attachment #2: Type: application/pgp-signature, Size: 191 bytes --]

^ permalink raw reply	[flat|nested] 21+ messages in thread

* Re: PROBLEM: kernel BUG at mm/rmap.c:486 - kernel 2.6.15-r1
  2006-03-13 18:59       ` Ken MacFerrin
@ 2006-03-14 19:04         ` Ken MacFerrin
  0 siblings, 0 replies; 21+ messages in thread
From: Ken MacFerrin @ 2006-03-14 19:04 UTC (permalink / raw)
  To: Ken MacFerrin
  Cc: Alistair John Strachan, Dave Spring, linux-kernel, Hugh Dickins

Ken MacFerrin wrote:
> Ken MacFerrin wrote:
> 
>> Alistair John Strachan wrote:
>>
>>> On Thursday 09 February 2006 23:55, Dave Spring wrote:
>>>
>>>> Just for closure's sake:
>>>> This turned out to be a hardware problem.
>>>> Memtest86+ http://www.memtest.org/ found an intermittent and
>>>> pattern-sensitive memory error,
>>>> and only appearing at one or two random locations within the 256M 
>>>> module.
>>>> Replacing the dodgy RAM module did the trick.
>>>
>>>
>>>
>>>
>>> Thanks Dave. Any update on your problem Ken? I'm keen to hear whether 
>>> you had crashes without the NVIDIA driver loaded.
>>>
>>
>> Sorry, I got called out of town last weekend so I didn't get a chance 
>> to try this out yet..
>> -Ken
> 
> 
> As a follow-up to close out this thread.  I only had a chance to test 
> the nv driver for a short time before needing to go back to the xinerama 
> capabilities of the Nvidia driver again.  I subsequently had a severe 
> crash that beat up the filesystem pretty badly so I did a data backup 
> and a clean install of Gentoo/KDE3.5 (kernel 2.6.15-r1) along with the 
> binary Nvidia driver (1.0.8178-r3) and have not had the problem re-occur 
> since.  The new install is using the same hardware and kernel config 
> which has been stable for over a week of uptime now.  This would lead me 
> to believe my previous install suffered from some evil filesystem 
> gremlin that had snuck in from an earlier crash and continued to pop up 
> to cause havok versus a genuine kernel bug.
> 
> I appreciate the help and feedback in trying to get this figured out.
> 
> Thanks,
> Ken

It would appear I spoke just a day too soon...  I had the following 
crash this morning with several kernel bugs as listed below.
-Ken

mm-home1 ~ # uname -a
Linux mm-home1 2.6.15-gentoo-r1 #1 SMP PREEMPT Thu Mar 9 16:05:55 MST 
2006 i686 Intel(R) Pentium(R) 4 CPU 3.00GHz GenuineIntel GNU/Linux


----------- /var/log/messages --------------
Mar 14 09:18:12 mm-home1 Unable to handle kernel paging request at 
virtual address 00610000
Mar 14 09:18:12 mm-home1 printing eip:
Mar 14 09:18:12 mm-home1 c0141abd
Mar 14 09:18:12 mm-home1 *pde = 00000000
Mar 14 09:18:12 mm-home1 Oops: 0000 [#1]
Mar 14 09:18:12 mm-home1 PREEMPT SMP
Mar 14 09:18:12 mm-home1 Modules linked in: lp vmnet vmmon snd_seq_midi 
snd_emu10k1_synth snd_emux_synth snd_seq_virmidi snd_seq_midi_emul 
snd_pcm_oss snd_mixer_oss snd_seq_dummy snd_seq_oss snd_seq_midi_event 
snd_seq eth1394 nls_utf8 rfcomm bnep l2cap snd_emu10k1 snd_rawmidi 
snd_seq_device snd_ac97_codec snd_pcm snd_timer snd_ac97_bus 
snd_page_alloc snd_util_mem snd_hwdep dv1394 video1394 raw1394 ohci1394 
ieee1394 w83627hf hwmon_vid i2c_isa eeprom i2c_dev i2c_i801 3c59x loop 
nvidia ntfs rtc dm_mod hci_usb bluetooth tsdev
Mar 14 09:18:12 mm-home1 CPU:    0
Mar 14 09:18:12 mm-home1 EIP:    0060:[<c0141abd>]    Tainted: PF     VLI
Mar 14 09:18:12 mm-home1 EFLAGS: 00013006   (2.6.15-gentoo-r1)
Mar 14 09:18:12 mm-home1 EIP is at find_get_page+0x2e/0x4d
Mar 14 09:18:12 mm-home1 eax: 00610000   ebx: 00610000   ecx: 00000000 
  edx: 00610000
Mar 14 09:18:12 mm-home1 esi: f595e1d8   edi: f595e110   ebp: 00020ac6 
  esp: f6773e8c
Mar 14 09:18:12 mm-home1 ds: 007b   es: 007b   ss: 0068
Mar 14 09:18:12 mm-home1 Process vmware-vmx (pid: 9484, 
threadinfo=f6772000 task=f7afaa70)
Mar 14 09:18:12 mm-home1 Stack: f595e1cc 00020ac6 00030000 00000000 
c0142b86 f595e1c8 00020ac6 00020ab0
Mar 14 09:18:12 mm-home1 00000020 d2b7b544 00000002 00000000 f7072bcc 
f595e1c8 f7072b80 c0142ad5
Mar 14 09:18:12 mm-home1 f6773f08 acfc6000 d2b7b544 c0152f1e d2b7b544 
acfc6000 f6773f08 d2b7b56c
Mar 14 09:18:12 mm-home1 Call Trace:
Mar 14 09:18:12 mm-home1 [<c0142b86>] filemap_nopage+0xb1/0x375
Mar 14 09:18:12 mm-home1 [<c0142ad5>] filemap_nopage+0x0/0x375
Mar 14 09:18:12 mm-home1 [<c0152f1e>] do_no_page+0x85/0x2ba
Mar 14 09:18:12 mm-home1 [<c0142ad5>] filemap_nopage+0x0/0x375
Mar 14 09:18:12 mm-home1 [<c0153460>] __handle_mm_fault+0x263/0x302
Mar 14 09:18:12 mm-home1 [<c0115738>] do_page_fault+0x1b7/0x557
Mar 14 09:18:12 mm-home1 [<c0115581>] do_page_fault+0x0/0x557
Mar 14 09:18:12 mm-home1 [<c0103a0f>] error_code+0x4f/0x54
Mar 14 09:18:12 mm-home1 Code: ec 08 8b 5c 24 14 8d 73 10 83 c3 04 89 f0 
e8 c3 97 2f 00 8b 44 24 18 89 1c 24 89 44 24 04 e8 7a 2c 17 00 85 c0 89 
c3 74 0d 89 c2 <8b> 00 f6 c4 40 75 13 f0 ff 42 04 89 f0 e8 76 99 2f 00 
89 d8 83
Mar 14 09:18:12 mm-home1 <6>note: vmware-vmx[9484] exited with 
preempt_count 1
Mar 14 09:18:50 mm-home1 syslog-ng[8497]: Connection broken to 
AF_INET(localhost:5149), reopening in 60 seconds
Mar 14 09:18:56 mm-home1 smartd[9805]: Device: /dev/hda, SMART Usage 
Attribute: 194 Temperature_Celsius changed from 128 to 117
Mar 14 09:19:51 mm-home1 syslog-ng[8497]: Connection broken to 
AF_INET(localhost:5149), reopening in 60 seconds
Mar 14 09:20:01 mm-home1 cron[1181]: (root) CMD (test -x 
/usr/sbin/run-crons && /usr/sbin/run-crons )
Mar 14 09:30:01 mm-home1 cron[17880]: (root) CMD (test -x 
/usr/sbin/run-crons && /usr/sbin/run-crons )
Mar 14 09:35:05 mm-home1 Bad page state at prep_new_page (in process 
'superkaramba', page c2069230)
Mar 14 09:35:05 mm-home1 flags:0xc0000000 mapping:00000000 mapcount:1 
count:1
Mar 14 09:35:05 mm-home1 Backtrace:
Mar 14 09:35:05 mm-home1 [<c01458e8>] bad_page+0x84/0xbc
Mar 14 09:35:05 mm-home1 [<c0145d4d>] prep_new_page+0x27/0x80
Mar 14 09:35:05 mm-home1 [<c0146354>] buffered_rmqueue+0x11f/0x275
Mar 14 09:35:05 mm-home1 [<c014661e>] get_page_from_freelist+0xa7/0xbf
Mar 14 09:35:05 mm-home1 [<c014668c>] __alloc_pages+0x56/0x300
Mar 14 09:35:05 mm-home1 [<c0103944>] apic_timer_interrupt+0x1c/0x24
Mar 14 09:35:05 mm-home1 [<c0115188>] pte_alloc_one+0x11/0x12
Mar 14 09:35:05 mm-home1 [<c015094f>] __pte_alloc+0x2b/0xb6
Mar 14 09:35:05 mm-home1 [<c0150deb>] copy_pte_range+0x2d8/0x2ec
Mar 14 09:35:05 mm-home1 [<c0150ebe>] copy_page_range+0xbf/0x112
Mar 14 09:35:05 mm-home1 [<c011bb10>] copy_mm+0x28d/0x38f
Mar 14 09:35:05 mm-home1 [<c011c57a>] copy_process+0x458/0xed0
Mar 14 09:35:05 mm-home1 [<c011d0f2>] do_fork+0x74/0x1bd
Mar 14 09:35:05 mm-home1 [<c02b7324>] copy_to_user+0x42/0x5c
Mar 14 09:35:05 mm-home1 [<c0101a6e>] sys_clone+0x3e/0x42
Mar 14 09:35:05 mm-home1 [<c0102e8f>] sysenter_past_esp+0x54/0x75
Mar 14 09:35:05 mm-home1 Trying to fix it up, but a reboot is needed
Mar 14 09:35:05 mm-home1 ------------[ cut here ]------------
Mar 14 09:35:05 mm-home1 kernel BUG at mm/rmap.c:486!
Mar 14 09:35:05 mm-home1 invalid operand: 0000 [#2]
Mar 14 09:35:05 mm-home1 PREEMPT SMP
Mar 14 09:35:05 mm-home1 Modules linked in: lp vmnet vmmon snd_seq_midi 
snd_emu10k1_synth snd_emux_synth snd_seq_virmidi snd_seq_midi_emul 
snd_pcm_oss snd_mixer_oss snd_seq_dummy snd_seq_oss snd_seq_midi_event 
snd_seq eth1394 nls_utf8 rfcomm bnep l2cap snd_emu10k1 snd_rawmidi 
snd_seq_device snd_ac97_codec snd_pcm snd_timer snd_ac97_bus 
snd_page_alloc snd_util_mem snd_hwdep dv1394 video1394 raw1394 ohci1394 
ieee1394 w83627hf hwmon_vid i2c_isa eeprom i2c_dev i2c_i801 3c59x loop 
nvidia ntfs rtc dm_mod hci_usb bluetooth tsdev
Mar 14 09:35:05 mm-home1 CPU:    0
Mar 14 09:35:05 mm-home1 EIP:    0060:[<c0158335>]    Tainted: PF   B VLI
Mar 14 09:35:05 mm-home1 EFLAGS: 00010286   (2.6.15-gentoo-r1)
Mar 14 09:35:05 mm-home1 EIP is at page_remove_rmap+0x33/0x3d
Mar 14 09:35:05 mm-home1 eax: ffffffff   ebx: fffb5548   ecx: c2069230 
  edx: c2069230
Mar 14 09:35:05 mm-home1 esi: b6152000   edi: c2069230   ebp: cdbffdd4 
  esp: cdbffd30
Mar 14 09:35:05 mm-home1 ds: 007b   es: 007b   ss: 0068
Mar 14 09:35:05 mm-home1 Process superkaramba (pid: 26323, 
threadinfo=cdbfe000 task=dab25a70)
Mar 14 09:35:05 mm-home1 Stack: c1fc9558 cdbffdd4 c0151105 c2069230 
b6152000 74b2c045 74b2c045 c192e548
Mar 14 09:35:05 mm-home1 fffffffc ffffffff de784580 b6156000 cdad9b60 
b6156000 cdbffdd4 c01512b9
Mar 14 09:35:05 mm-home1 c220e900 e1eae544 cdad9b60 b614e000 b6156000 
cdbffdd4 00000000 b6155fff
Mar 14 09:35:05 mm-home1 Call Trace:
Mar 14 09:35:05 mm-home1 [<c0151105>] zap_pte_range+0x1f4/0x2f4
Mar 14 09:35:05 mm-home1 [<c01512b9>] unmap_page_range+0xb4/0x13a
Mar 14 09:35:05 mm-home1 [<c015142a>] unmap_vmas+0xeb/0x24c
Mar 14 09:35:05 mm-home1 [<c015649a>] exit_mmap+0x93/0x13e
Mar 14 09:35:05 mm-home1 [<c011b564>] mmput+0x38/0x9b
Mar 14 09:35:05 mm-home1 [<c016df32>] exec_mmap+0xfd/0x1f1
Mar 14 09:35:05 mm-home1 [<c016e5d4>] flush_old_exec+0x50c/0x898
Mar 14 09:35:05 mm-home1 [<c01629ec>] vfs_read+0x162/0x1b3
Mar 14 09:35:05 mm-home1 [<c016de26>] kernel_read+0x50/0x5f
Mar 14 09:35:05 mm-home1 [<c0192506>] load_elf_binary+0x3ae/0xd10
Mar 14 09:35:05 mm-home1 [<c014ff49>] page_address+0xa6/0xc7
Mar 14 09:35:05 mm-home1 [<c014f874>] kunmap_high+0x7c/0xa0
Mar 14 09:35:05 mm-home1 [<c0192158>] load_elf_binary+0x0/0xd10
Mar 14 09:35:05 mm-home1 [<c016ec6e>] search_binary_handler+0xd3/0x2fd
Mar 14 09:35:05 mm-home1 [<c016f036>] do_execve+0x19e/0x240
Mar 14 09:35:05 mm-home1 [<c0101af3>] sys_execve+0x46/0x93
Mar 14 09:35:05 mm-home1 [<c0102e8f>] sysenter_past_esp+0x54/0x75
Mar 14 09:35:05 mm-home1 Code: 83 42 08 ff 0f 98 c0 84 c0 74 1c 8b 42 08 
83 c0 01 78 18 c7 44 24 04 ff ff ff ff c7 04 24 10 00 00 00 e8 ab e9 fe 
ff 83 c4 08 c3 <0f> 0b e6 01 c2 b1 45 c0 eb de 83 ec 2c 89 7c 24 24 89 
5c 24 1c
Mar 14 09:35:05 mm-home1 <6>note: superkaramba[26323] exited with 
preempt_count 3
Mar 14 09:35:44 mm-home1 Unable to handle kernel paging request at 
virtual address 00100104
Mar 14 09:35:44 mm-home1 printing eip:
Mar 14 09:35:44 mm-home1 c01462a4
Mar 14 09:35:44 mm-home1 *pde = 00000000
Mar 14 09:35:44 mm-home1 Oops: 0002 [#3]
Mar 14 09:35:44 mm-home1 PREEMPT SMP
Mar 14 09:35:44 mm-home1 Modules linked in: lp vmnet vmmon snd_seq_midi 
snd_emu10k1_synth snd_emux_synth snd_seq_virmidi snd_seq_midi_emul 
snd_pcm_oss snd_mixer_oss snd_seq_dummy snd_seq_oss snd_seq_midi_event 
snd_seq eth1394 nls_utf8 rfcomm bnep l2cap snd_emu10k1 snd_rawmidi 
snd_seq_device snd_ac97_codec snd_pcm snd_timer snd_ac97_bus 
snd_page_alloc snd_util_mem snd_hwdep dv1394 video1394 raw1394 ohci1394 
ieee1394 w83627hf hwmon_vid i2c_isa eeprom i2c_dev i2c_i801 3c59x loop 
nvidia ntfs rtc dm_mod hci_usb bluetooth tsdev
Mar 14 09:35:44 mm-home1 CPU:    0
Mar 14 09:35:44 mm-home1 EIP:    0060:[<c01462a4>]    Tainted: PF   B VLI
Mar 14 09:35:44 mm-home1 EFLAGS: 00010002   (2.6.15-gentoo-r1)
Mar 14 09:35:44 mm-home1 EIP is at buffered_rmqueue+0x6f/0x275
Mar 14 09:35:44 mm-home1 eax: c206924c   ebx: c04baa80   ecx: c04bab10 
  edx: 00100100
Mar 14 09:35:44 mm-home1 esi: c04bab00   edi: e237e000   ebp: 00000246 
  esp: e237fe6c
Mar 14 09:35:44 mm-home1 ds: 007b   es: 007b   ss: 0068
Mar 14 09:35:44 mm-home1 Process amarokapp (pid: 16519, 
threadinfo=e237e000 task=f7b10030)
Mar 14 09:35:44 mm-home1 Stack: e237e000 00000400 00000000 c0142677 
effc0314 f7efcdcc 00000000 c2069230
Mar 14 09:35:44 mm-home1 c04bb128 00000044 00000000 00000003 c014661e 
c04baa80 00000000 000280d2
Mar 14 09:35:44 mm-home1 00000003 00000044 c04bb128 f7b10030 000280d2 
c04baa80 c014668c 000280d2
Mar 14 09:35:44 mm-home1 Call Trace:
Mar 14 09:35:44 mm-home1 [<c0142677>] __generic_file_aio_read+0x1bf/0x227
Mar 14 09:35:44 mm-home1 [<c014661e>] get_page_from_freelist+0xa7/0xbf
Mar 14 09:35:44 mm-home1 [<c014668c>] __alloc_pages+0x56/0x300
Mar 14 09:35:44 mm-home1 [<c0152d4a>] do_anonymous_page+0x50/0x19f
Mar 14 09:35:44 mm-home1 [<c0153307>] __handle_mm_fault+0x10a/0x302
Mar 14 09:35:44 mm-home1 [<c0115738>] do_page_fault+0x1b7/0x557
Mar 14 09:35:44 mm-home1 [<c0115581>] do_page_fault+0x0/0x557
Mar 14 09:35:44 mm-home1 [<c0103a0f>] error_code+0x4f/0x54
Mar 14 09:35:44 mm-home1 Code: 8d b3 80 00 00 00 9c 5d fa 8b 83 80 00 00 
00 3b 46 04 0f 8e 23 01 00 00 85 c0 74 28 8b 46 10 8d 48 e4 89 4c 24 1c 
8b 48 04 8b 10 <89> 4a 04 89 11 c7 40 04 00 02 20 00 c7 00 00 01 10 00 
83 ab 80
Mar 14 09:35:44 mm-home1 <6>note: amarokapp[16519] exited with 
preempt_count 1
Mar 14 09:35:54 mm-home1 syslog-ng[8497]: Connection broken to 
AF_INET(localhost:5149), reopening in 60 seconds
Mar 14 09:36:25 mm-home1 Unable to handle kernel paging request at 
virtual address 00100104
Mar 14 09:36:25 mm-home1 printing eip:
Mar 14 09:36:25 mm-home1 c01462a4
Mar 14 09:36:25 mm-home1 *pde = 00000000
Mar 14 09:36:25 mm-home1 Oops: 0002 [#4]
Mar 14 09:36:25 mm-home1 PREEMPT SMP
Mar 14 09:36:25 mm-home1 Modules linked in: lp vmnet vmmon snd_seq_midi 
snd_emu10k1_synth snd_emux_synth snd_seq_virmidi snd_seq_midi_emul 
snd_pcm_oss snd_mixer_oss snd_seq_dummy snd_seq_oss snd_seq_midi_event 
snd_seq eth1394 nls_utf8 rfcomm bnep l2cap snd_emu10k1 snd_rawmidi 
snd_seq_device snd_ac97_codec snd_pcm snd_timer snd_ac97_bus 
snd_page_alloc snd_util_mem snd_hwdep dv1394 video1394 raw1394 ohci1394 
ieee1394 w83627hf hwmon_vid i2c_isa eeprom i2c_dev i2c_i801 3c59x loop 
nvidia ntfs rtc dm_mod hci_usb bluetooth tsdev
Mar 14 09:36:25 mm-home1 CPU:    0
Mar 14 09:36:25 mm-home1 EIP:    0060:[<c01462a4>]    Tainted: PF   B VLI
Mar 14 09:36:25 mm-home1 EFLAGS: 00010002   (2.6.15-gentoo-r1)
Mar 14 09:36:25 mm-home1 EIP is at buffered_rmqueue+0x6f/0x275
Mar 14 09:36:25 mm-home1 eax: c206924c   ebx: c04baa80   ecx: c04bab10 
  edx: 00100100
Mar 14 09:36:25 mm-home1 esi: c04bab00   edi: e28fc000   ebp: 00000246 
  esp: e28fde6c
Mar 14 09:36:25 mm-home1 ds: 007b   es: 007b   ss: 0068
Mar 14 09:36:25 mm-home1 Process mlnet (pid: 16435, threadinfo=e28fc000 
task=c0e1d550)
Mar 14 09:36:25 mm-home1 Stack: eab6e090 00000000 e28fdf38 cb8c16b8 
c01cfd53 eab6e090 00000000 c2069230
Mar 14 09:36:25 mm-home1 c04bb128 00000044 00000000 00000003 c014661e 
c04baa80 00000000 000280d2
Mar 14 09:36:25 mm-home1 00000003 00000044 c04bb128 c0e1d550 000280d2 
c04baa80 c014668c 000280d2
Mar 14 09:36:25 mm-home1 Call Trace:
Mar 14 09:36:25 mm-home1 [<c01cfd53>] reiserfs_permission+0x27/0x2b
Mar 14 09:36:25 mm-home1 [<c014661e>] get_page_from_freelist+0xa7/0xbf
Mar 14 09:36:25 mm-home1 [<c014668c>] __alloc_pages+0x56/0x300
Mar 14 09:36:25 mm-home1 [<c0152d4a>] do_anonymous_page+0x50/0x19f
Mar 14 09:36:25 mm-home1 [<c0161c9c>] nameidata_to_filp+0x37/0x4f
Mar 14 09:36:25 mm-home1 [<c0153307>] __handle_mm_fault+0x10a/0x302
Mar 14 09:36:25 mm-home1 [<c0115738>] do_page_fault+0x1b7/0x557
Mar 14 09:36:25 mm-home1 [<c0115581>] do_page_fault+0x0/0x557
Mar 14 09:36:25 mm-home1 [<c0103a0f>] error_code+0x4f/0x54
Mar 14 09:36:25 mm-home1 Code: 8d b3 80 00 00 00 9c 5d fa 8b 83 80 00 00 
00 3b 46 04 0f 8e 23 01 00 00 85 c0 74 28 8b 46 10 8d 48 e4 89 4c 24 1c 
8b 48 04 8b 10 <89> 4a 04 89 11 c7 40 04 00 02 20 00 c7 00 00 01 10 00 
83 ab 80
Mar 14 09:36:25 mm-home1 <6>note: mlnet[16435] exited with preempt_count 1
Mar 14 09:36:54 mm-home1 syslog-ng[8497]: Connection broken to 
AF_INET(localhost:5149), reopening in 60 seconds
Mar 14 09:37:55 mm-home1 syslog-ng[8497]: Connection broken to 
AF_INET(localhost:5149), reopening in 60 seconds
Mar 14 09:38:55 mm-home1 syslog-ng[8497]: Connection broken to 
AF_INET(localhost:5149), reopening in 60 seconds
Mar 14 09:39:19 mm-home1 ------------[ cut here ]------------
Mar 14 09:39:19 mm-home1 kernel BUG at mm/page_alloc.c:761!
Mar 14 09:39:19 mm-home1 invalid operand: 0000 [#5]
Mar 14 09:39:19 mm-home1 PREEMPT SMP
Mar 14 09:39:19 mm-home1 Modules linked in: lp vmnet vmmon snd_seq_midi 
snd_emu10k1_synth snd_emux_synth snd_seq_virmidi snd_seq_midi_emul 
snd_pcm_oss snd_mixer_oss snd_seq_dummy snd_seq_oss snd_seq_midi_event 
snd_seq eth1394 nls_utf8 rfcomm bnep l2cap snd_emu10k1 snd_rawmidi 
snd_seq_device snd_ac97_codec snd_pcm snd_timer snd_ac97_bus 
snd_page_alloc snd_util_mem snd_hwdep dv1394 video1394 raw1394 ohci1394 
ieee1394 w83627hf hwmon_vid i2c_isa eeprom i2c_dev i2c_i801 3c59x loop 
nvidia ntfs rtc dm_mod hci_usb bluetooth tsdev
Mar 14 09:39:19 mm-home1 CPU:    0
Mar 14 09:39:19 mm-home1 EIP:    0060:[<c0146486>]    Tainted: PF   B VLI
Mar 14 09:39:19 mm-home1 EFLAGS: 00010202   (2.6.15-gentoo-r1)
Mar 14 09:39:19 mm-home1 EIP is at buffered_rmqueue+0x251/0x275
Mar 14 09:39:19 mm-home1 eax: 00000001   ebx: c04baa80   ecx: 00038000 
  edx: 00000001
Mar 14 09:39:19 mm-home1 esi: c04bab00   edi: ddb92000   ebp: 00000246 
  esp: ddb93e6c
Mar 14 09:39:19 mm-home1 ds: 007b   es: 007b   ss: 0068
Mar 14 09:39:19 mm-home1 Process thunderbird-bin (pid: 26335, 
threadinfo=ddb92000 task=f786fa70)
Mar 14 09:39:19 mm-home1 Stack: c04baa80 c04baaf4 00000000 c0142677 
00000000 00000001 00000000 c04baaf4
Mar 14 09:39:19 mm-home1 c04bb128 00000044 00000000 00000003 c014661e 
c04baa80 00000000 000280d2
Mar 14 09:39:19 mm-home1 00000003 00000044 c04bb128 f786fa70 000280d2 
c04baa80 c014668c 000280d2
Mar 14 09:39:19 mm-home1 Call Trace:
Mar 14 09:39:19 mm-home1 [<c0142677>] __generic_file_aio_read+0x1bf/0x227
Mar 14 09:39:19 mm-home1 [<c014661e>] get_page_from_freelist+0xa7/0xbf
Mar 14 09:39:19 mm-home1 [<c014668c>] __alloc_pages+0x56/0x300
Mar 14 09:39:19 mm-home1 [<c0154869>] vma_adjust+0x1ff/0x38a
Mar 14 09:39:19 mm-home1 [<c0152d4a>] do_anonymous_page+0x50/0x19f
Mar 14 09:39:19 mm-home1 [<c0153307>] __handle_mm_fault+0x10a/0x302
Mar 14 09:39:19 mm-home1 [<c0115738>] do_page_fault+0x1b7/0x557
Mar 14 09:39:19 mm-home1 [<c0115581>] do_page_fault+0x0/0x557
Mar 14 09:39:19 mm-home1 [<c0103a0f>] error_code+0x4f/0x54
Mar 14 09:39:19 mm-home1 Code: 3c 00 40 00 00 0f 84 42 ff ff ff 8b 44 24 
38 8b 54 24 1c 89 44 24 04 89 14 24 e8 a6 f4 ff ff 8b 44 24 1c 83 c4 20 
5b 5e 5f 5d c3 <0f> 0b f9 02 aa ab 45 c0 e9 63 fe ff ff e8 bf 3a 2f 00 
e9 31 fe
Mar 14 09:39:19 mm-home1 <1>Unable to handle kernel paging request at 
virtual address 00100104
Mar 14 09:39:19 mm-home1 printing eip:
Mar 14 09:39:19 mm-home1 c01461a9
Mar 14 09:39:19 mm-home1 *pde = 00000000
Mar 14 09:39:19 mm-home1 Oops: 0002 [#6]
Mar 14 09:39:19 mm-home1 PREEMPT SMP
Mar 14 09:39:19 mm-home1 Modules linked in: lp vmnet vmmon snd_seq_midi 
snd_emu10k1_synth snd_emux_synth snd_seq_virmidi snd_seq_midi_emul 
snd_pcm_oss snd_mixer_oss snd_seq_dummy snd_seq_oss snd_seq_midi_event 
snd_seq eth1394 nls_utf8 rfcomm bnep l2cap snd_emu10k1 snd_rawmidi 
snd_seq_device snd_ac97_codec snd_pcm snd_timer snd_ac97_bus 
snd_page_alloc snd_util_mem snd_hwdep dv1394 video1394 raw1394 ohci1394 
ieee1394 w83627hf hwmon_vid i2c_isa eeprom i2c_dev i2c_i801 3c59x loop 
nvidia ntfs rtc dm_mod hci_usb bluetooth tsdev
Mar 14 09:39:19 mm-home1 CPU:    0
Mar 14 09:39:19 mm-home1 EIP:    0060:[<c01461a9>]    Tainted: PF   B VLI
Mar 14 09:39:19 mm-home1 EFLAGS: 00010082   (2.6.15-gentoo-r1)
Mar 14 09:39:19 mm-home1 EIP is at free_hot_cold_page+0xe6/0x161
Mar 14 09:39:19 mm-home1 eax: 00100100   ebx: c21e28c4   ecx: c04bab10 
  edx: c21e28e0
Mar 14 09:39:19 mm-home1 esi: c04baa80   edi: ddb92000   ebp: c04bab00 
  esp: ddb93bd8
Mar 14 09:39:19 mm-home1 ds: 007b   es: 007b   ss: 0068
Mar 14 09:39:19 mm-home1 Process thunderbird-bin (pid: 26335, 
threadinfo=ddb92000 task=f786fa70)
Mar 14 09:39:19 mm-home1 Stack: 00000034 00000001 01cb3f60 000000b7 
00000282 c04baa80 00000001 ddb93c14
Mar 14 09:39:19 mm-home1 c04baa80 00000008 c01469dc 00000002 c04baa80 
c014c689 ddb93c14 00000002
Mar 14 09:39:19 mm-home1 00000000 c1fc393c c21e28c4 7f2e9067 c1ee422c 
ffffffff 00000000 d51e5d00
Mar 14 09:39:19 mm-home1 Call Trace:
Mar 14 09:39:19 mm-home1 [<c01469dc>] __pagevec_free+0x16/0x1e
Mar 14 09:39:19 mm-home1 [<c014c689>] release_pages+0x161/0x176
Mar 14 09:39:19 mm-home1 [<c01512b9>] unmap_page_range+0xb4/0x13a
Mar 14 09:39:19 mm-home1 [<c0159d90>] free_pages_and_swap_cache+0x5d/0x83
Mar 14 09:39:19 mm-home1 [<c0151553>] unmap_vmas+0x214/0x24c
Mar 14 09:39:19 mm-home1 [<c015649a>] exit_mmap+0x93/0x13e
Mar 14 09:39:19 mm-home1 [<c011b564>] mmput+0x38/0x9b
Mar 14 09:39:19 mm-home1 [<c01205e2>] do_exit+0xfc/0x41d
Mar 14 09:39:19 mm-home1 [<c011e04a>] printk+0x17/0x1b
Mar 14 09:39:19 mm-home1 [<c010414a>] do_trap+0x0/0x11d
Mar 14 09:39:19 mm-home1 [<c01044a4>] do_invalid_op+0x0/0xab
Mar 14 09:39:19 mm-home1 [<c0104546>] do_invalid_op+0xa2/0xab
Mar 14 09:39:19 mm-home1 [<c0146486>] buffered_rmqueue+0x251/0x275
Mar 14 09:39:19 mm-home1 [<c014244c>] file_read_actor+0x92/0xfe
Mar 14 09:39:19 mm-home1 [<c01422e4>] do_generic_mapping_read+0x4b7/0x58d
Mar 14 09:39:19 mm-home1 [<c0103a0f>] error_code+0x4f/0x54
Mar 14 09:39:19 mm-home1 [<c014007b>] audit_avc_path+0x93/0x9b
Mar 14 09:39:19 mm-home1 [<c0146486>] buffered_rmqueue+0x251/0x275
Mar 14 09:39:19 mm-home1 [<c0142677>] __generic_file_aio_read+0x1bf/0x227
Mar 14 09:39:19 mm-home1 [<c014661e>] get_page_from_freelist+0xa7/0xbf
Mar 14 09:39:19 mm-home1 [<c014668c>] __alloc_pages+0x56/0x300
Mar 14 09:39:19 mm-home1 [<c0154869>] vma_adjust+0x1ff/0x38a
Mar 14 09:39:19 mm-home1 [<c0152d4a>] do_anonymous_page+0x50/0x19f
Mar 14 09:39:19 mm-home1 [<c0153307>] __handle_mm_fault+0x10a/0x302
Mar 14 09:39:19 mm-home1 [<c0115738>] do_page_fault+0x1b7/0x557
Mar 14 09:39:19 mm-home1 [<c0115581>] do_page_fault+0x0/0x557
Mar 14 09:39:19 mm-home1 [<c0103a0f>] error_code+0x4f/0x54
Mar 14 09:39:19 mm-home1 Code: 8d 14 76 c1 e0 07 03 44 24 14 8d 34 d0 8d 
ae 80 00 00 00 9c 8f 44 24 10 fa 8b 86 90 00 00 00 8d 53 1c 8d 8e 90 00 
00 00 89 43 1c <89> 50 04 89 4a 04 89 96 90 00 00 00 8b 86 80 00 00 00 
83 c0 01
Mar 14 09:39:19 mm-home1 <1>Fixing recursive fault but reboot is needed!
Mar 14 09:39:19 mm-home1 scheduling while atomic: 
thunderbird-bin/0x00000002/26335
Mar 14 09:39:19 mm-home1 [<c0439c45>] schedule+0xa21/0xd33
Mar 14 09:39:19 mm-home1 [<c01231f3>] tasklet_action+0x63/0xc2
Mar 14 09:39:19 mm-home1 [<c0122e73>] __do_softirq+0x6b/0xd8
Mar 14 09:39:19 mm-home1 [<c012088e>] do_exit+0x3a8/0x41d
Mar 14 09:39:19 mm-home1 [<c011007b>] __acpi_map_table+0xb/0xd6
Mar 14 09:39:19 mm-home1 [<c010414a>] do_trap+0x0/0x11d
Mar 14 09:39:19 mm-home1 [<c0115581>] do_page_fault+0x0/0x557
Mar 14 09:39:19 mm-home1 [<c0115581>] do_page_fault+0x0/0x557
Mar 14 09:39:19 mm-home1 [<c0115912>] do_page_fault+0x391/0x557
Mar 14 09:39:19 mm-home1 [<c0115581>] do_page_fault+0x0/0x557
Mar 14 09:39:19 mm-home1 [<c0103a0f>] error_code+0x4f/0x54
Mar 14 09:39:19 mm-home1 [<c01461a9>] free_hot_cold_page+0xe6/0x161
Mar 14 09:39:19 mm-home1 [<c01469dc>] __pagevec_free+0x16/0x1e
Mar 14 09:39:19 mm-home1 [<c014c689>] release_pages+0x161/0x176
Mar 14 09:39:19 mm-home1 [<c01512b9>] unmap_page_range+0xb4/0x13a
Mar 14 09:39:19 mm-home1 [<c0159d90>] free_pages_and_swap_cache+0x5d/0x83
Mar 14 09:39:19 mm-home1 [<c0151553>] unmap_vmas+0x214/0x24c
Mar 14 09:39:19 mm-home1 [<c015649a>] exit_mmap+0x93/0x13e
Mar 14 09:39:19 mm-home1 [<c011b564>] mmput+0x38/0x9b
Mar 14 09:39:19 mm-home1 [<c01205e2>] do_exit+0xfc/0x41d
Mar 14 09:39:19 mm-home1 [<c011e04a>] printk+0x17/0x1b
Mar 14 09:39:19 mm-home1 [<c010414a>] do_trap+0x0/0x11d
Mar 14 09:39:19 mm-home1 [<c01044a4>] do_invalid_op+0x0/0xab
Mar 14 09:39:19 mm-home1 [<c0104546>] do_invalid_op+0xa2/0xab
Mar 14 09:39:19 mm-home1 [<c0146486>] buffered_rmqueue+0x251/0x275
Mar 14 09:39:19 mm-home1 [<c014244c>] file_read_actor+0x92/0xfe
Mar 14 09:39:19 mm-home1 [<c01422e4>] do_generic_mapping_read+0x4b7/0x58d
Mar 14 09:39:19 mm-home1 [<c0103a0f>] error_code+0x4f/0x54
Mar 14 09:39:19 mm-home1 [<c014007b>] audit_avc_path+0x93/0x9b
Mar 14 09:39:19 mm-home1 [<c0146486>] buffered_rmqueue+0x251/0x275
Mar 14 09:39:19 mm-home1 [<c0142677>] __generic_file_aio_read+0x1bf/0x227
Mar 14 09:39:19 mm-home1 [<c014661e>] get_page_from_freelist+0xa7/0xbf
Mar 14 09:39:19 mm-home1 [<c014668c>] __alloc_pages+0x56/0x300
Mar 14 09:39:19 mm-home1 [<c0154869>] vma_adjust+0x1ff/0x38a
Mar 14 09:39:19 mm-home1 [<c0152d4a>] do_anonymous_page+0x50/0x19f
Mar 14 09:39:19 mm-home1 [<c0153307>] __handle_mm_fault+0x10a/0x302
Mar 14 09:39:19 mm-home1 [<c0115738>] do_page_fault+0x1b7/0x557
Mar 14 09:39:19 mm-home1 [<c0115581>] do_page_fault+0x0/0x557
Mar 14 09:39:19 mm-home1 [<c0103a0f>] error_code+0x4f/0x54
Mar 14 09:39:19 mm-home1 Unable to handle kernel paging request at 
virtual address 00100104
Mar 14 09:39:19 mm-home1 printing eip:
Mar 14 09:39:19 mm-home1 c014629f
Mar 14 09:39:19 mm-home1 *pde = 00000000
Mar 14 09:39:19 mm-home1 Oops: 0000 [#7]
Mar 14 09:39:19 mm-home1 PREEMPT SMP
Mar 14 09:39:19 mm-home1 Modules linked in: lp vmnet vmmon snd_seq_midi 
snd_emu10k1_synth snd_emux_synth snd_seq_virmidi snd_seq_midi_emul 
snd_pcm_oss snd_mixer_oss snd_seq_dummy snd_seq_oss snd_seq_midi_event 
snd_seq eth1394 nls_utf8 rfcomm bnep l2cap snd_emu10k1 snd_rawmidi 
snd_seq_device snd_ac97_codec snd_pcm snd_timer snd_ac97_bus 
snd_page_alloc snd_util_mem snd_hwdep dv1394 video1394 raw1394 ohci1394 
ieee1394 w83627hf hwmon_vid i2c_isa eeprom i2c_dev i2c_i801 3c59x loop 
nvidia ntfs rtc dm_mod hci_usb bluetooth tsdev
Mar 14 09:39:19 mm-home1 CPU:    0
Mar 14 09:39:19 mm-home1 EIP:    0060:[<c014629f>]    Tainted: PF   B VLI
Mar 14 09:39:19 mm-home1 EFLAGS: 00010002   (2.6.15-gentoo-r1)
Mar 14 09:39:19 mm-home1 EIP is at buffered_rmqueue+0x6a/0x275
Mar 14 09:39:19 mm-home1 eax: 00100100   ebx: c04baa80   ecx: 001000e4 
  edx: c04baa80
Mar 14 09:39:19 mm-home1 esi: c04bab00   edi: e2706000   ebp: 00000246 
  esp: e2707e6c
Mar 14 09:39:19 mm-home1 ds: 007b   es: 007b   ss: 0068
Mar 14 09:39:19 mm-home1 Process kicker (pid: 16457, threadinfo=e2706000 
task=c0e50a70)
Mar 14 09:39:19 mm-home1 Stack: 000200d0 00000000 c0e50a70 c0132eb2 
e2707e7c e2707e7c 00000000 001000e4
Mar 14 09:39:19 mm-home1 c04bb128 00000044 00000000 00000003 c014661e 
c04baa80 00000000 000284d2
Mar 14 09:39:19 mm-home1 00000003 00000044 c04bb128 c0e50a70 000084d2 
c04baa80 c014668c 000284d2
Mar 14 09:39:19 mm-home1 Call Trace:
Mar 14 09:39:19 mm-home1 [<c0132eb2>] autoremove_wake_function+0x0/0x57
Mar 14 09:39:19 mm-home1 [<c014661e>] get_page_from_freelist+0xa7/0xbf
Mar 14 09:39:19 mm-home1 [<c014668c>] __alloc_pages+0x56/0x300
Mar 14 09:39:19 mm-home1 [<c0154869>] vma_adjust+0x1ff/0x38a
Mar 14 09:39:19 mm-home1 [<c0115188>] pte_alloc_one+0x11/0x12
Mar 14 09:39:19 mm-home1 [<c015094f>] __pte_alloc+0x2b/0xb6
Mar 14 09:39:19 mm-home1 [<c01533e9>] __handle_mm_fault+0x1ec/0x302
Mar 14 09:39:19 mm-home1 [<c0156339>] do_brk+0x1c7/0x295
Mar 14 09:39:19 mm-home1 [<c0115738>] do_page_fault+0x1b7/0x557
Mar 14 09:39:19 mm-home1 [<c0115581>] do_page_fault+0x0/0x557
Mar 14 09:39:19 mm-home1 [<c0103a0f>] error_code+0x4f/0x54
Mar 14 09:39:19 mm-home1 Code: 24 18 8d 1c c2 8d b3 80 00 00 00 9c 5d fa 
8b 83 80 00 00 00 3b 46 04 0f 8e 23 01 00 00 85 c0 74 28 8b 46 10 8d 48 
e4 89 4c 24 1c <8b> 48 04 8b 10 89 4a 04 89 11 c7 40 04 00 02 20 00 c7 
00 00 01
Mar 14 09:39:19 mm-home1 <6>note: kicker[16457] exited with preempt_count 1
Mar 14 09:39:19 mm-home1 Unable to handle kernel paging request at 
virtual address 00100104
Mar 14 09:39:19 mm-home1 printing eip:
Mar 14 09:39:19 mm-home1 c01461a9
Mar 14 09:39:19 mm-home1 *pde = 00000000
Mar 14 09:39:19 mm-home1 Oops: 0002 [#8]
Mar 14 09:39:19 mm-home1 PREEMPT SMP
Mar 14 09:39:19 mm-home1 Modules linked in: lp vmnet vmmon snd_seq_midi 
snd_emu10k1_synth snd_emux_synth snd_seq_virmidi snd_seq_midi_emul 
snd_pcm_oss snd_mixer_oss snd_seq_dummy snd_seq_oss snd_seq_midi_event 
snd_seq eth1394 nls_utf8 rfcomm bnep l2cap snd_emu10k1 snd_rawmidi 
snd_seq_device snd_ac97_codec snd_pcm snd_timer snd_ac97_bus 
snd_page_alloc snd_util_mem snd_hwdep dv1394 video1394 raw1394 ohci1394 
ieee1394 w83627hf hwmon_vid i2c_isa eeprom i2c_dev i2c_i801 3c59x loop 
nvidia ntfs rtc dm_mod hci_usb bluetooth tsdev
Mar 14 09:39:19 mm-home1 CPU:    0
Mar 14 09:39:19 mm-home1 EIP:    0060:[<c01461a9>]    Tainted: PF   B VLI
Mar 14 09:39:19 mm-home1 EFLAGS: 00010082   (2.6.15-gentoo-r1)
Mar 14 09:39:19 mm-home1 EIP is at free_hot_cold_page+0xe6/0x161
Mar 14 09:39:19 mm-home1 eax: 00100100   ebx: c1f90810   ecx: c04bab10 
  edx: c1f9082c
Mar 14 09:39:19 mm-home1 esi: c04baa80   edi: e2706000   ebp: c04bab00 
  esp: e2707c48
Mar 14 09:39:19 mm-home1 ds: 007b   es: 007b   ss: 0068
Mar 14 09:39:19 mm-home1 Process kicker (pid: 16457, threadinfo=e2706000 
task=c0e50a70)
Mar 14 09:39:19 mm-home1 Stack: 00000034 00000001 01cb3f60 000000b7 
00000282 c04baa80 00000001 e2707c84
Mar 14 09:39:19 mm-home1 c04baa80 00000008 c01469dc 00000002 c04baa80 
c014c689 e2707c84 00000002
Mar 14 09:39:19 mm-home1 00000000 c1ec59e0 c1f90810 650af065 c1f25b18 
fffffffe 00000000 f7a5e080
Mar 14 09:39:19 mm-home1 Call Trace:
Mar 14 09:39:19 mm-home1 [<c01469dc>] __pagevec_free+0x16/0x1e
Mar 14 09:39:19 mm-home1 [<c014c689>] release_pages+0x161/0x176
Mar 14 09:39:19 mm-home1 [<c01512b9>] unmap_page_range+0xb4/0x13a
Mar 14 09:39:19 mm-home1 [<c0159d90>] free_pages_and_swap_cache+0x5d/0x83
Mar 14 09:39:19 mm-home1 [<c0151553>] unmap_vmas+0x214/0x24c
Mar 14 09:39:19 mm-home1 [<c015649a>] exit_mmap+0x93/0x13e
Mar 14 09:39:19 mm-home1 [<c011b564>] mmput+0x38/0x9b
Mar 14 09:39:19 mm-home1 [<c01205e2>] do_exit+0xfc/0x41d
Mar 14 09:39:19 mm-home1 [<c010414a>] do_trap+0x0/0x11d
Mar 14 09:39:19 mm-home1 [<c0115581>] do_page_fault+0x0/0x557
Mar 14 09:39:19 mm-home1 [<c0115581>] do_page_fault+0x0/0x557
Mar 14 09:39:19 mm-home1 [<c0115912>] do_page_fault+0x391/0x557
Mar 14 09:39:19 mm-home1 [<c0115581>] do_page_fault+0x0/0x557
Mar 14 09:39:19 mm-home1 [<c0103a0f>] error_code+0x4f/0x54
Mar 14 09:39:19 mm-home1 [<c014629f>] buffered_rmqueue+0x6a/0x275
Mar 14 09:39:19 mm-home1 [<c0132eb2>] autoremove_wake_function+0x0/0x57
Mar 14 09:39:19 mm-home1 [<c014661e>] get_page_from_freelist+0xa7/0xbf
Mar 14 09:39:19 mm-home1 [<c014668c>] __alloc_pages+0x56/0x300
Mar 14 09:39:19 mm-home1 [<c0154869>] vma_adjust+0x1ff/0x38a
Mar 14 09:39:19 mm-home1 [<c0115188>] pte_alloc_one+0x11/0x12
Mar 14 09:39:19 mm-home1 [<c015094f>] __pte_alloc+0x2b/0xb6
Mar 14 09:39:19 mm-home1 [<c01533e9>] __handle_mm_fault+0x1ec/0x302
Mar 14 09:39:19 mm-home1 [<c0156339>] do_brk+0x1c7/0x295
Mar 14 09:39:19 mm-home1 [<c0115738>] do_page_fault+0x1b7/0x557
Mar 14 09:39:19 mm-home1 [<c0115581>] do_page_fault+0x0/0x557
Mar 14 09:39:19 mm-home1 [<c0103a0f>] error_code+0x4f/0x54
Mar 14 09:39:19 mm-home1 Code: 8d 14 76 c1 e0 07 03 44 24 14 8d 34 d0 8d 
ae 80 00 00 00 9c 8f 44 24 10 fa 8b 86 90 00 00 00 8d 53 1c 8d 8e 90 00 
00 00 89 43 1c <89> 50 04 89 4a 04 89 96 90 00 00 00 8b 86 80 00 00 00 
83 c0 01
Mar 14 09:39:19 mm-home1 <1>Fixing recursive fault but reboot is needed!
Mar 14 09:39:19 mm-home1 scheduling while atomic: kicker/0x00000003/16457
Mar 14 09:39:19 mm-home1 [<c0439c45>] schedule+0xa21/0xd33
Mar 14 09:39:19 mm-home1 [<c0122e73>] __do_softirq+0x6b/0xd8
Mar 14 09:39:19 mm-home1 [<c012088e>] do_exit+0x3a8/0x41d
Mar 14 09:39:19 mm-home1 [<c011007b>] __acpi_map_table+0xb/0xd6
Mar 14 09:39:19 mm-home1 [<c010414a>] do_trap+0x0/0x11d
Mar 14 09:39:19 mm-home1 [<c0115581>] do_page_fault+0x0/0x557
Mar 14 09:39:19 mm-home1 [<c0115581>] do_page_fault+0x0/0x557
Mar 14 09:39:19 mm-home1 [<c0115912>] do_page_fault+0x391/0x557
Mar 14 09:39:19 mm-home1 [<c03f0030>] tcp_send_dupack+0x93/0xfc
Mar 14 09:39:19 mm-home1 [<c0115581>] do_page_fault+0x0/0x557
Mar 14 09:39:19 mm-home1 [<c0103a0f>] error_code+0x4f/0x54
Mar 14 09:39:19 mm-home1 [<c01461a9>] free_hot_cold_page+0xe6/0x161
Mar 14 09:39:19 mm-home1 [<c01469dc>] __pagevec_free+0x16/0x1e
Mar 14 09:39:19 mm-home1 [<c014c689>] release_pages+0x161/0x176
Mar 14 09:39:19 mm-home1 [<c01512b9>] unmap_page_range+0xb4/0x13a
Mar 14 09:39:19 mm-home1 [<c0159d90>] free_pages_and_swap_cache+0x5d/0x83
Mar 14 09:39:19 mm-home1 [<c0151553>] unmap_vmas+0x214/0x24c
Mar 14 09:39:19 mm-home1 [<c015649a>] exit_mmap+0x93/0x13e
Mar 14 09:39:19 mm-home1 [<c011b564>] mmput+0x38/0x9b
Mar 14 09:39:19 mm-home1 [<c01205e2>] do_exit+0xfc/0x41d
Mar 14 09:39:19 mm-home1 [<c010414a>] do_trap+0x0/0x11d
Mar 14 09:39:19 mm-home1 [<c0115581>] do_page_fault+0x0/0x557
Mar 14 09:39:19 mm-home1 [<c0115581>] do_page_fault+0x0/0x557
Mar 14 09:39:19 mm-home1 [<c0115912>] do_page_fault+0x391/0x557
Mar 14 09:39:19 mm-home1 [<c0115581>] do_page_fault+0x0/0x557
Mar 14 09:39:19 mm-home1 [<c0103a0f>] error_code+0x4f/0x54
Mar 14 09:39:19 mm-home1 [<c014629f>] buffered_rmqueue+0x6a/0x275
Mar 14 09:39:19 mm-home1 [<c0132eb2>] autoremove_wake_function+0x0/0x57
Mar 14 09:39:19 mm-home1 [<c014661e>] get_page_from_freelist+0xa7/0xbf
Mar 14 09:39:19 mm-home1 [<c014668c>] __alloc_pages+0x56/0x300
Mar 14 09:39:19 mm-home1 [<c0154869>] vma_adjust+0x1ff/0x38a
Mar 14 09:39:19 mm-home1 [<c0115188>] pte_alloc_one+0x11/0x12
Mar 14 09:39:19 mm-home1 [<c015094f>] __pte_alloc+0x2b/0xb6
Mar 14 09:39:19 mm-home1 [<c01533e9>] __handle_mm_fault+0x1ec/0x302
Mar 14 09:39:19 mm-home1 [<c0156339>] do_brk+0x1c7/0x295
Mar 14 09:39:19 mm-home1 [<c0115738>] do_page_fault+0x1b7/0x557
Mar 14 09:39:19 mm-home1 [<c0115581>] do_page_fault+0x0/0x557
Mar 14 09:39:19 mm-home1 [<c0103a0f>] error_code+0x4f/0x54
Mar 14 09:39:30 mm-home1 Unable to handle kernel paging request at 
virtual address 00100104
Mar 14 09:39:30 mm-home1 printing eip:
Mar 14 09:39:30 mm-home1 c01461a9
Mar 14 09:39:30 mm-home1 *pde = 00000000
Mar 14 09:39:30 mm-home1 Oops: 0002 [#9]
Mar 14 09:39:30 mm-home1 PREEMPT SMP
Mar 14 09:39:30 mm-home1 Modules linked in: lp vmnet vmmon snd_seq_midi 
snd_emu10k1_synth snd_emux_synth snd_seq_virmidi snd_seq_midi_emul 
snd_pcm_oss snd_mixer_oss snd_seq_dummy snd_seq_oss snd_seq_midi_event 
snd_seq eth1394 nls_utf8 rfcomm bnep l2cap snd_emu10k1 snd_rawmidi 
snd_seq_device snd_ac97_codec snd_pcm snd_timer snd_ac97_bus 
snd_page_alloc snd_util_mem snd_hwdep dv1394 video1394 raw1394 ohci1394 
ieee1394 w83627hf hwmon_vid i2c_isa eeprom i2c_dev i2c_i801 3c59x loop 
nvidia ntfs rtc dm_mod hci_usb bluetooth tsdev
Mar 14 09:39:30 mm-home1 CPU:    0
Mar 14 09:39:30 mm-home1 EIP:    0060:[<c01461a9>]    Tainted: PF   B VLI
Mar 14 09:39:30 mm-home1 EFLAGS: 00010082   (2.6.15-gentoo-r1)
Mar 14 09:39:30 mm-home1 EIP is at free_hot_cold_page+0xe6/0x161
Mar 14 09:39:30 mm-home1 eax: 00100100   ebx: c1ec17ec   ecx: c04bab10 
  edx: c1ec1808
Mar 14 09:39:30 mm-home1 esi: c04baa80   edi: e2382000   ebp: c04bab00 
  esp: e2383e4c
Mar 14 09:39:30 mm-home1 ds: 007b   es: 007b   ss: 0068
Mar 14 09:39:30 mm-home1 Process vncviewer (pid: 16529, 
threadinfo=e2382000 task=f7a57a70)
Mar 14 09:39:30 mm-home1 Stack: 00000034 00000001 00000007 714b8067 
00000282 c04baa80 00000004 e2383e88
Mar 14 09:39:30 mm-home1 c04baa80 00000008 c01469dc 00000005 c04baa80 
c014c689 e2383e88 00000005
Mar 14 09:39:30 mm-home1 00000000 c1f159d8 c1f3a77c c1f15b64 c1fed534 
c1ec17ec 00000000 eedb3b80
Mar 14 09:39:30 mm-home1 Call Trace:
Mar 14 09:39:30 mm-home1 [<c01469dc>] __pagevec_free+0x16/0x1e
Mar 14 09:39:30 mm-home1 [<c014c689>] release_pages+0x161/0x176
Mar 14 09:39:30 mm-home1 [<c01512b9>] unmap_page_range+0xb4/0x13a
Mar 14 09:39:30 mm-home1 [<c0159d90>] free_pages_and_swap_cache+0x5d/0x83
Mar 14 09:39:30 mm-home1 [<c0151553>] unmap_vmas+0x214/0x24c
Mar 14 09:39:30 mm-home1 [<c015649a>] exit_mmap+0x93/0x13e
Mar 14 09:39:30 mm-home1 [<c011b564>] mmput+0x38/0x9b
Mar 14 09:39:30 mm-home1 [<c01205e2>] do_exit+0xfc/0x41d
Mar 14 09:39:30 mm-home1 [<c012096e>] do_group_exit+0x3c/0xa6
Mar 14 09:39:30 mm-home1 [<c0102e8f>] sysenter_past_esp+0x54/0x75
Mar 14 09:39:30 mm-home1 Code: 8d 14 76 c1 e0 07 03 44 24 14 8d 34 d0 8d 
ae 80 00 00 00 9c 8f 44 24 10 fa 8b 86 90 00 00 00 8d 53 1c 8d 8e 90 00 
00 00 89 43 1c <89> 50 04 89 4a 04 89 96 90 00 00 00 8b 86 80 00 00 00 
83 c0 01
Mar 14 09:39:30 mm-home1 <1>Fixing recursive fault but reboot is needed!
Mar 14 09:39:30 mm-home1 scheduling while atomic: vncviewer/0x00000002/16529
Mar 14 09:39:30 mm-home1 [<c0439c45>] schedule+0xa21/0xd33
Mar 14 09:39:30 mm-home1 [<c0102e8f>] sysenter_past_esp+0x54/0x75
Mar 14 09:39:30 mm-home1 [<c01461bd>] free_hot_cold_page+0xfa/0x161
Mar 14 09:39:30 mm-home1 [<c012088e>] do_exit+0x3a8/0x41d
Mar 14 09:39:30 mm-home1 [<c011e04a>] printk+0x17/0x1b
Mar 14 09:39:30 mm-home1 [<c010414a>] do_trap+0x0/0x11d
Mar 14 09:39:30 mm-home1 [<c0115581>] do_page_fault+0x0/0x557
Mar 14 09:39:30 mm-home1 [<c0115581>] do_page_fault+0x0/0x557
Mar 14 09:39:30 mm-home1 [<c0115912>] do_page_fault+0x391/0x557
Mar 14 09:39:30 mm-home1 [<c0115581>] do_page_fault+0x0/0x557
Mar 14 09:39:30 mm-home1 [<c0103a0f>] error_code+0x4f/0x54
Mar 14 09:39:30 mm-home1 [<c029007b>] udf_get_pblock_virt15+0x143/0x159
Mar 14 09:39:30 mm-home1 [<c01461a9>] free_hot_cold_page+0xe6/0x161
Mar 14 09:39:30 mm-home1 [<c01469dc>] __pagevec_free+0x16/0x1e
Mar 14 09:39:30 mm-home1 [<c014c689>] release_pages+0x161/0x176
Mar 14 09:39:30 mm-home1 [<c01512b9>] unmap_page_range+0xb4/0x13a
Mar 14 09:39:30 mm-home1 [<c0159d90>] free_pages_and_swap_cache+0x5d/0x83
Mar 14 09:39:30 mm-home1 [<c0151553>] unmap_vmas+0x214/0x24c
Mar 14 09:39:30 mm-home1 [<c015649a>] exit_mmap+0x93/0x13e
Mar 14 09:39:30 mm-home1 [<c011b564>] mmput+0x38/0x9b
Mar 14 09:39:30 mm-home1 [<c01205e2>] do_exit+0xfc/0x41d
Mar 14 09:39:30 mm-home1 [<c012096e>] do_group_exit+0x3c/0xa6
Mar 14 09:39:30 mm-home1 [<c0102e8f>] sysenter_past_esp+0x54/0x75
Mar 14 09:39:30 mm-home1 Unable to handle kernel paging request at 
virtual address 00100104
Mar 14 09:39:30 mm-home1 printing eip:
Mar 14 09:39:30 mm-home1 c014629f
Mar 14 09:39:30 mm-home1 *pde = 00000000
Mar 14 09:39:30 mm-home1 Oops: 0000 [#10]
Mar 14 09:39:30 mm-home1 PREEMPT SMP
Mar 14 09:39:30 mm-home1 Modules linked in: lp vmnet vmmon snd_seq_midi 
snd_emu10k1_synth snd_emux_synth snd_seq_virmidi snd_seq_midi_emul 
snd_pcm_oss snd_mixer_oss snd_seq_dummy snd_seq_oss snd_seq_midi_event 
snd_seq eth1394 nls_utf8 rfcomm bnep l2cap snd_emu10k1 snd_rawmidi 
snd_seq_device snd_ac97_codec snd_pcm snd_timer snd_ac97_bus 
snd_page_alloc snd_util_mem snd_hwdep dv1394 video1394 raw1394 ohci1394 
ieee1394 w83627hf hwmon_vid i2c_isa eeprom i2c_dev i2c_i801 3c59x loop 
nvidia ntfs rtc dm_mod hci_usb bluetooth tsdev
Mar 14 09:39:30 mm-home1 CPU:    0
Mar 14 09:39:30 mm-home1 EIP:    0060:[<c014629f>]    Tainted: PF   B VLI
Mar 14 09:39:30 mm-home1 EFLAGS: 00010002   (2.6.15-gentoo-r1)
Mar 14 09:39:30 mm-home1 EIP is at buffered_rmqueue+0x6a/0x275
Mar 14 09:39:30 mm-home1 eax: 00100100   ebx: c04baa80   ecx: 001000e4 
  edx: c04baa80
Mar 14 09:39:30 mm-home1 esi: c04bab00   edi: e2914000   ebp: 00000246 
  esp: e2915e6c
Mar 14 09:39:30 mm-home1 ds: 007b   es: 007b   ss: 0068
Mar 14 09:39:30 mm-home1 Process artsd (pid: 26339, threadinfo=e2914000 
task=f70dea70)
Mar 14 09:39:30 mm-home1 Stack: 00000000 01cb3f60 000000b7 01cabf60 
f7a4bb00 b7816000 00000000 001000e4
Mar 14 09:39:30 mm-home1 c04bb128 00000044 00000000 00000003 c014661e 
c04baa80 00000000 000280d2
Mar 14 09:39:30 mm-home1 00000003 00000044 c04bb128 f70dea70 000280d2 
c04baa80 c014668c 000280d2
Mar 14 09:39:30 mm-home1 Call Trace:
Mar 14 09:39:30 mm-home1 [<c014661e>] get_page_from_freelist+0xa7/0xbf
Mar 14 09:39:30 mm-home1 [<c014668c>] __alloc_pages+0x56/0x300
Mar 14 09:39:30 mm-home1 [<c0154869>] vma_adjust+0x1ff/0x38a
Mar 14 09:39:30 mm-home1 [<c0117838>] try_to_wake_up+0x332/0x410
Mar 14 09:39:30 mm-home1 [<c0152d4a>] do_anonymous_page+0x50/0x19f
Mar 14 09:39:30 mm-home1 [<c0153307>] __handle_mm_fault+0x10a/0x302
Mar 14 09:39:30 mm-home1 [<c0156339>] do_brk+0x1c7/0x295
Mar 14 09:39:30 mm-home1 [<c0115738>] do_page_fault+0x1b7/0x557
Mar 14 09:39:30 mm-home1 [<c0115581>] do_page_fault+0x0/0x557
Mar 14 09:39:30 mm-home1 [<c0103a0f>] error_code+0x4f/0x54
Mar 14 09:39:30 mm-home1 Code: 24 18 8d 1c c2 8d b3 80 00 00 00 9c 5d fa 
8b 83 80 00 00 00 3b 46 04 0f 8e 23 01 00 00 85 c0 74 28 8b 46 10 8d 48 
e4 89 4c 24 1c <8b> 48 04 8b 10 89 4a 04 89 11 c7 40 04 00 02 20 00 c7 
00 00 01
Mar 14 09:39:30 mm-home1 <6>note: artsd[26339] exited with preempt_count 1
Mar 14 09:39:44 mm-home1 Bad page state at prep_new_page (in process 
'X', page c2069230)
Mar 14 09:39:44 mm-home1 flags:0xc0000010 mapping:00000000 mapcount:-1 
count:1
Mar 14 09:39:44 mm-home1 Backtrace:
Mar 14 09:39:44 mm-home1 [<c01458e8>] bad_page+0x84/0xbc
Mar 14 09:39:44 mm-home1 [<c0145d4d>] prep_new_page+0x27/0x80
Mar 14 09:39:44 mm-home1 [<c0146354>] buffered_rmqueue+0x11f/0x275
Mar 14 09:39:44 mm-home1 [<c014661e>] get_page_from_freelist+0xa7/0xbf
Mar 14 09:39:44 mm-home1 [<c014668c>] __alloc_pages+0x56/0x300
Mar 14 09:39:44 mm-home1 [<c0152d4a>] do_anonymous_page+0x50/0x19f
Mar 14 09:39:44 mm-home1 [<c0153307>] __handle_mm_fault+0x10a/0x302
Mar 14 09:39:44 mm-home1 [<c0115738>] do_page_fault+0x1b7/0x557
Mar 14 09:39:44 mm-home1 [<c0115581>] do_page_fault+0x0/0x557
Mar 14 09:39:44 mm-home1 [<c0103a0f>] error_code+0x4f/0x54
Mar 14 09:39:44 mm-home1 Trying to fix it up, but a reboot is needed
Mar 14 09:39:44 mm-home1 Unable to handle kernel paging request at 
virtual address 00100104
Mar 14 09:39:44 mm-home1 printing eip:
Mar 14 09:39:44 mm-home1 c01461a9
Mar 14 09:39:44 mm-home1 *pde = 00000000
Mar 14 09:39:44 mm-home1 Oops: 0002 [#11]
Mar 14 09:39:44 mm-home1 PREEMPT SMP
Mar 14 09:39:44 mm-home1 Modules linked in: lp vmnet vmmon snd_seq_midi 
snd_emu10k1_synth snd_emux_synth snd_seq_virmidi snd_seq_midi_emul 
snd_pcm_oss snd_mixer_oss snd_seq_dummy snd_seq_oss snd_seq_midi_event 
snd_seq eth1394 nls_utf8 rfcomm bnep l2cap snd_emu10k1 snd_rawmidi 
snd_seq_device snd_ac97_codec snd_pcm snd_timer snd_ac97_bus 
snd_page_alloc snd_util_mem snd_hwdep dv1394 video1394 raw1394 ohci1394 
ieee1394 w83627hf hwmon_vid i2c_isa eeprom i2c_dev i2c_i801 3c59x loop 
nvidia ntfs rtc dm_mod hci_usb bluetooth tsdev
Mar 14 09:39:44 mm-home1 CPU:    0
Mar 14 09:39:44 mm-home1 EIP:    0060:[<c01461a9>]    Tainted: PF   B VLI
Mar 14 09:39:44 mm-home1 EFLAGS: 00013082   (2.6.15-gentoo-r1)
Mar 14 09:39:44 mm-home1 EIP is at free_hot_cold_page+0xe6/0x161
Mar 14 09:39:44 mm-home1 eax: 00100100   ebx: c21d3510   ecx: c04bab10 
  edx: c21d352c
Mar 14 09:39:44 mm-home1 esi: c04baa80   edi: f54ac000   ebp: c04bab00 
  esp: f54ade94
Mar 14 09:39:44 mm-home1 ds: 007b   es: 007b   ss: 0068
Mar 14 09:39:44 mm-home1 Process X (pid: 9988, threadinfo=f54ac000 
task=f7b28a70)
Mar 14 09:39:44 mm-home1 Stack: 00000034 00000001 c09ff06f c09ff06f 
00003282 c04baa80 00000000 f54aded0
Mar 14 09:39:44 mm-home1 c04baa80 00000001 c01469dc 00000001 c04baa80 
c014c689 f54aded0 00000001
Mar 14 09:39:44 mm-home1 00000000 c21d3510 c0150671 0000000c ffffffff 
b6c00000 c0150806 c220e900
Mar 14 09:39:44 mm-home1 Call Trace:
Mar 14 09:39:44 mm-home1 [<c01469dc>] __pagevec_free+0x16/0x1e
Mar 14 09:39:44 mm-home1 [<c014c689>] release_pages+0x161/0x176
Mar 14 09:39:44 mm-home1 [<c0150671>] free_pte_range+0x62/0xf2
Mar 14 09:39:44 mm-home1 [<c0150806>] free_pgd_range+0x105/0x195
Mar 14 09:39:44 mm-home1 [<c0159d90>] free_pages_and_swap_cache+0x5d/0x83
Mar 14 09:39:44 mm-home1 [<c0155da7>] unmap_region+0x13b/0x152
Mar 14 09:39:44 mm-home1 [<c01560a2>] do_munmap+0x10f/0x179
Mar 14 09:39:44 mm-home1 [<c0156151>] sys_munmap+0x45/0x66
Mar 14 09:39:44 mm-home1 [<c0102e8f>] sysenter_past_esp+0x54/0x75
Mar 14 09:39:44 mm-home1 Code: 8d 14 76 c1 e0 07 03 44 24 14 8d 34 d0 8d 
ae 80 00 00 00 9c 8f 44 24 10 fa 8b 86 90 00 00 00 8d 53 1c 8d 8e 90 00 
00 00 89 43 1c <89> 50 04 89 4a 04 89 96 90 00 00 00 8b 86 80 00 00 00 
83 c0 01
Mar 14 09:39:44 mm-home1 <6>note: X[9988] exited with preempt_count 2
Mar 14 09:39:44 mm-home1 scheduling while atomic: X/0x00000002/9988
Mar 14 09:39:44 mm-home1 [<c0439c45>] schedule+0xa21/0xd33
Mar 14 09:39:44 mm-home1 [<c011e429>] release_console_sem+0xb8/0xba
Mar 14 09:39:44 mm-home1 [<c011e1e8>] vprintk+0x19a/0x2b2
Mar 14 09:39:44 mm-home1 [<c043ab6f>] rwsem_down_read_failed+0x88/0x17b
Mar 14 09:39:44 mm-home1 [<c0121938>] .text.lock.exit+0x27/0x87
Mar 14 09:39:44 mm-home1 [<c01205e2>] do_exit+0xfc/0x41d
Mar 14 09:39:44 mm-home1 [<c010414a>] do_trap+0x0/0x11d
Mar 14 09:39:44 mm-home1 [<c0115581>] do_page_fault+0x0/0x557
Mar 14 09:39:44 mm-home1 [<c0115581>] do_page_fault+0x0/0x557
Mar 14 09:39:44 mm-home1 [<c0115912>] do_page_fault+0x391/0x557
Mar 14 09:39:44 mm-home1 [<c012711e>] update_wall_time+0x10/0x3b
Mar 14 09:39:44 mm-home1 [<c0115581>] do_page_fault+0x0/0x557
Mar 14 09:39:44 mm-home1 [<c0103a0f>] error_code+0x4f/0x54
Mar 14 09:39:44 mm-home1 [<c01461a9>] free_hot_cold_page+0xe6/0x161
Mar 14 09:39:44 mm-home1 [<c01469dc>] __pagevec_free+0x16/0x1e
Mar 14 09:39:44 mm-home1 [<c014c689>] release_pages+0x161/0x176
Mar 14 09:39:44 mm-home1 [<c0150671>] free_pte_range+0x62/0xf2
Mar 14 09:39:44 mm-home1 [<c0150806>] free_pgd_range+0x105/0x195
Mar 14 09:39:44 mm-home1 [<c0159d90>] free_pages_and_swap_cache+0x5d/0x83
Mar 14 09:39:44 mm-home1 [<c0155da7>] unmap_region+0x13b/0x152
Mar 14 09:39:44 mm-home1 [<c01560a2>] do_munmap+0x10f/0x179
Mar 14 09:39:44 mm-home1 [<c0156151>] sys_munmap+0x45/0x66
Mar 14 09:39:44 mm-home1 [<c0102e8f>] sysenter_past_esp+0x54/0x75
---------------- end ----------------

^ permalink raw reply	[flat|nested] 21+ messages in thread

* Re: PROBLEM: kernel BUG at mm/rmap.c:486 - kernel 2.6.15-r1
  2006-02-11  2:15     ` Ken MacFerrin
@ 2006-03-13 18:59       ` Ken MacFerrin
  2006-03-14 19:04         ` Ken MacFerrin
  0 siblings, 1 reply; 21+ messages in thread
From: Ken MacFerrin @ 2006-03-13 18:59 UTC (permalink / raw)
  To: Ken MacFerrin
  Cc: Alistair John Strachan, Dave Spring, linux-kernel, Hugh Dickins

Ken MacFerrin wrote:
> Alistair John Strachan wrote:
> 
>> On Thursday 09 February 2006 23:55, Dave Spring wrote:
>>
>>> Just for closure's sake:
>>> This turned out to be a hardware problem.
>>> Memtest86+ http://www.memtest.org/ found an intermittent and
>>> pattern-sensitive memory error,
>>> and only appearing at one or two random locations within the 256M 
>>> module.
>>> Replacing the dodgy RAM module did the trick.
>>
>>
>>
>> Thanks Dave. Any update on your problem Ken? I'm keen to hear whether 
>> you had crashes without the NVIDIA driver loaded.
>>
> 
> Sorry, I got called out of town last weekend so I didn't get a chance to 
> try this out yet..
> -Ken

As a follow-up to close out this thread.  I only had a chance to test 
the nv driver for a short time before needing to go back to the xinerama 
capabilities of the Nvidia driver again.  I subsequently had a severe 
crash that beat up the filesystem pretty badly so I did a data backup 
and a clean install of Gentoo/KDE3.5 (kernel 2.6.15-r1) along with the 
binary Nvidia driver (1.0.8178-r3) and have not had the problem re-occur 
since.  The new install is using the same hardware and kernel config 
which has been stable for over a week of uptime now.  This would lead me 
to believe my previous install suffered from some evil filesystem 
gremlin that had snuck in from an earlier crash and continued to pop up 
to cause havok versus a genuine kernel bug.

I appreciate the help and feedback in trying to get this figured out.

Thanks,
Ken

^ permalink raw reply	[flat|nested] 21+ messages in thread

* Re: PROBLEM: kernel BUG at mm/rmap.c:486 - kernel 2.6.15-r1
  2006-02-10  0:13   ` Alistair John Strachan
@ 2006-02-11  2:15     ` Ken MacFerrin
  2006-03-13 18:59       ` Ken MacFerrin
  0 siblings, 1 reply; 21+ messages in thread
From: Ken MacFerrin @ 2006-02-11  2:15 UTC (permalink / raw)
  To: Alistair John Strachan; +Cc: Dave Spring, linux-kernel, Hugh Dickins

Alistair John Strachan wrote:
> On Thursday 09 February 2006 23:55, Dave Spring wrote:
> 
>>Just for closure's sake:
>> This turned out to be a hardware problem.
>>Memtest86+ http://www.memtest.org/ found an intermittent and
>>pattern-sensitive memory error,
>>and only appearing at one or two random locations within the 256M module.
>>Replacing the dodgy RAM module did the trick.
> 
> 
> Thanks Dave. Any update on your problem Ken? I'm keen to hear whether you had 
> crashes without the NVIDIA driver loaded.
> 

Sorry, I got called out of town last weekend so I didn't get a chance to 
try this out yet..
-Ken

^ permalink raw reply	[flat|nested] 21+ messages in thread

* Re: PROBLEM: kernel BUG at mm/rmap.c:486 - kernel 2.6.15-r1
  2006-02-09 23:55 ` Dave Spring
@ 2006-02-10  0:13   ` Alistair John Strachan
  2006-02-11  2:15     ` Ken MacFerrin
  0 siblings, 1 reply; 21+ messages in thread
From: Alistair John Strachan @ 2006-02-10  0:13 UTC (permalink / raw)
  To: Dave Spring; +Cc: linux-kernel, Hugh Dickins, Ken MacFerrin

On Thursday 09 February 2006 23:55, Dave Spring wrote:
> Just for closure's sake:
>  This turned out to be a hardware problem.
> Memtest86+ http://www.memtest.org/ found an intermittent and
> pattern-sensitive memory error,
> and only appearing at one or two random locations within the 256M module.
> Replacing the dodgy RAM module did the trick.

Thanks Dave. Any update on your problem Ken? I'm keen to hear whether you had 
crashes without the NVIDIA driver loaded.

-- 
Cheers,
Alistair.

'No sense being pessimistic, it probably wouldn't work anyway.'
Third year Computer Science undergraduate.
1F2 55 South Clerk Street, Edinburgh, UK.

^ permalink raw reply	[flat|nested] 21+ messages in thread

* Re: PROBLEM: kernel BUG at mm/rmap.c:486 - kernel 2.6.15-r1
  2006-02-01 18:22 Dave Spring
@ 2006-02-09 23:55 ` Dave Spring
  2006-02-10  0:13   ` Alistair John Strachan
  0 siblings, 1 reply; 21+ messages in thread
From: Dave Spring @ 2006-02-09 23:55 UTC (permalink / raw)
  To: linux-kernel

Just for closure's sake:
 This turned out to be a hardware problem.
Memtest86+ http://www.memtest.org/ found an intermittent and 
pattern-sensitive memory error,
and only appearing at one or two random locations within the 256M module.
Replacing the dodgy RAM module did the trick.
Dave Spring wrote:

>>> running kernels with or without PREEMPT enabled.
>>>
>> If you don't actually *need* accelerated 3D (or if you could do
>> without it for a while), switching to the "nv" driver for a few
>> days/weeks would be interresting. If the crashes go away that would
>> point towards the nvidia driver, if they don't go away we'll get a
>> nice untainted crash report.
>
>
> It's not the nv drivers - or at least not just them.
> I'm getting this bug once or twice a day on a mini-ITX (C3 533Mhz 
> processor) based server which doesn't even have X installed.
> For me, it appeared sometime after 2.6.12.
> I'm now running with gentoo 2.6.15-r1 with Hugh's recently posted patch,
> and waiting 8-|
>
> Dave Spring



^ permalink raw reply	[flat|nested] 21+ messages in thread

* Re: PROBLEM: kernel BUG at mm/rmap.c:486 - kernel 2.6.15-r1
@ 2006-02-01 18:22 Dave Spring
  2006-02-09 23:55 ` Dave Spring
  0 siblings, 1 reply; 21+ messages in thread
From: Dave Spring @ 2006-02-01 18:22 UTC (permalink / raw)
  To: linux-kernel

>> running kernels with or without PREEMPT enabled.
>>
>If you don't actually *need* accelerated 3D (or if you could do
>without it for a while), switching to the "nv" driver for a few
>days/weeks would be interresting. If the crashes go away that would
>point towards the nvidia driver, if they don't go away we'll get a
>nice untainted crash report.

It's not the nv drivers - or at least not just them.
I'm getting this bug once or twice a day on a mini-ITX 
(C3 533Mhz processor) based server which doesn't even have X installed.
For me, it appeared sometime after 2.6.12.
I'm now running with gentoo 2.6.15-r1 with Hugh's recently posted patch,
and waiting 8-|

Dave Spring


^ permalink raw reply	[flat|nested] 21+ messages in thread

end of thread, other threads:[~2006-03-14 19:05 UTC | newest]

Thread overview: 21+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2006-01-28  3:20 PROBLEM: kernel BUG at mm/rmap.c:486 - kernel 2.6.15-r1 Ken MacFerrin
2006-01-28 15:48 ` Hugh Dickins
2006-02-02  1:17   ` Ken MacFerrin
2006-02-02 15:54     ` Hugh Dickins
2006-02-02 21:31       ` Ken MacFerrin
2006-01-28 18:31 ` Jesper Juhl
2006-01-29 22:12   ` Ken MacFerrin
2006-01-30  0:56     ` Ken MacFerrin
2006-01-30 16:46       ` Alistair John Strachan
2006-01-28 19:13 ` Alistair John Strachan
2006-03-12  0:06   ` Patrick B�rjesson
2006-03-12  2:06     ` Alistair John Strachan
2006-03-12  9:05     ` Arjan van de Ven
2006-03-12 13:12       ` Patrick Börjesson
2006-03-12 12:41     ` Nick Piggin
2006-02-01 18:22 Dave Spring
2006-02-09 23:55 ` Dave Spring
2006-02-10  0:13   ` Alistair John Strachan
2006-02-11  2:15     ` Ken MacFerrin
2006-03-13 18:59       ` Ken MacFerrin
2006-03-14 19:04         ` Ken MacFerrin

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).