On Tue, May 06, 2014 at 12:57:08PM -0400, Vince Weaver wrote: > On Mon, 5 May 2014, Vince Weaver wrote: > > > On Mon, 5 May 2014, Vince Weaver wrote: > > > > > Meanwhile the haswell and AMD machines have been fuzzing away without > > > issue, I don't know why the core2 machine is always the trouble maker. > > > > The haswell has been fuzzing 12 hours with only a NMI dazed/confused > > message. > > So the Haswell seemed to still be going strong after 24-hours, but then I > killed the fuzzer with control-C and got this. > > ^C > [87536.479011] ------------[ cut here ]------------ > [87536.484553] WARNING: CPU: 1 PID: 11978 at lib/list_debug.c:59 __list_del_entry+0xa1/0xd0() > [87536.493994] list_del corruption. prev->next should be ffff8800ce684810, but was 6b6b6b6b6b6b6b6b > [87536.503915] Modules linked in: fuse x86_pkg_temp_thermal intel_powerclamp coretemp kvm snd_hda_codec_hdmi crct10dif_pclmul crc32_pclmul ghash_clmulni_intel snd_hda_codec_realtek snd_hda_codec_generic i915 aesni_intel snd_hda_intel snd_hda_controller snd_hda_codec snd_hwdep snd_pcm snd_seq snd_timer snd_seq_device tpm_tis ppdev snd aes_x86_64 parport_pc tpm evdev mei_me drm_kms_helper iTCO_wdt drm soundcore lrw gf128mul glue_helper iTCO_vendor_support wmi ablk_helper i2c_algo_bit button battery processor mei psmouse parport pcspkr serio_raw cryptd i2c_i801 video i2c_core lpc_ich mfd_core sd_mod sr_mod crc_t10dif cdrom crct10dif_common ahci libahci ehci_pci e1000e libata xhci_hcd ehci_hcd ptp crc32c_intel usbcore scsi_mod pps_core usb_common fan thermal thermal_sys > [87536.581372] CPU: 1 PID: 11978 Comm: perf_fuzzer Tainted: G W 3.15.0-rc1+ #104 > [87536.590762] Hardware name: LENOVO 10AM000AUS/SHARKBAY, BIOS FBKT72AUS 01/26/2014 > [87536.599435] 0000000000000009 ffff880117b57ad8 ffffffff81649ca0 ffff880117b57b20 > [87536.608228] ffff880117b57b10 ffffffff810646ad ffff8800ce684800 ffff880036a64000 > [87536.616970] ffff8800ce684810 ffff8800ce684800 0000000000000001 ffff880117b57b70 > [87536.625688] Call Trace: > [87536.629039] [] dump_stack+0x45/0x56 > [87536.635247] [] warn_slowpath_common+0x7d/0xa0 > [87536.642374] [] warn_slowpath_fmt+0x4c/0x50 > [87536.649211] [] __list_del_entry+0xa1/0xd0 > [87536.655953] [] list_del_event+0xe4/0xf0 > [87536.662477] [] perf_remove_from_context+0xb0/0x120 > [87536.670005] [] perf_event_release_kernel+0x3f/0x80 > [87536.677530] [] put_event+0xd3/0x100 > [87536.683702] [] ? put_event+0x30/0x100 > [87536.690047] [] perf_release+0x15/0x20 > [87536.696292] [] __fput+0xdc/0x1e0 > [87536.702191] [] ____fput+0xe/0x10 > [87536.708038] [] task_work_run+0xc4/0xe0 > [87536.714503] [] do_exit+0x2cc/0xa50 > [87536.720546] [] ? get_signal_to_deliver+0x249/0x650 > [87536.728117] [] do_group_exit+0x4c/0xc0 > [87536.734480] [] get_signal_to_deliver+0x291/0x650 > [87536.741814] [] do_signal+0x48/0x990 > [87536.747877] [] ? finish_task_switch+0x7d/0x120 > [87536.754994] [] ? _raw_spin_unlock_irq+0x27/0x40 > [87536.762243] [] ? finish_task_switch+0x7d/0x120 > [87536.769465] [] ? finish_task_switch+0x3f/0x120 > [87536.776622] [] do_notify_resume+0x70/0xa0 > [87536.783323] [] retint_signal+0x48/0x8c > [87536.789726] ---[ end trace 2b5a3d32e8d767a7 ]--- > [87537.231116] ------------[ cut here ]------------ Of course it did :/ This thing can't ever _just_ work.. My WSM is playing silly buggers and prefers the endless loop (which you saw on Core2 iirc) when I press ^C. I'll see if I can make it do something useful.. No immediate ideas though.