linux-kernel.vger.kernel.org archive mirror
 help / color / mirror / Atom feed
* Re: Catching NForce2 lockup with NMI watchdog - found?
@ 2003-12-06  0:55 Ian Kumlien
  2003-12-06  1:52 ` Craig Bradney
  2003-12-08  3:12 ` Bob
  0 siblings, 2 replies; 30+ messages in thread
From: Ian Kumlien @ 2003-12-06  0:55 UTC (permalink / raw)
  To: linux-kernel; +Cc: cbradney, prakashpublic, cheuche+lkml

[-- Attachment #1: Type: text/plain, Size: 1601 bytes --]

Craig Bradney wrote:
> Sounds great.. maybe you have come across something. Yes, the CPU
> Disconnect function arrived in your BIOS in revision of 2003/03/27
> "6.Adds"CPU Disconnect Function" to adjust C1 disconnects. The Chipset
> does not support C2 disconnect; thus, disable C2 function."

I doubt thats related, i run ACPI with powersave anyways... 

> For me though.. Im on an ASUS A7N8X Deluxe v2 BIOS 1007. From what I can
> see the CPU Disconnect isnt even in the Uber BIOS 1007 for this ASUS
> that has been discussed.

I don't have it either... 

I'm more hopeful about the patch from Mathieu <cheuche+lkml () free ! fr>...

           CPU0
  0:     267486    IO-APIC-edge  timer
  1:       9654    IO-APIC-edge  keyboard
  2:          0          XT-PIC  cascade
  8:          1    IO-APIC-edge  rtc
  9:          0   IO-APIC-level  acpi
 14:      28252    IO-APIC-edge  ide0
 15:        103    IO-APIC-edge  ide1
 16:     251712   IO-APIC-level  eth0
 17:      90632   IO-APIC-level  EMU10K1
 19:     415529   IO-APIC-level  nvidia
 20:          0   IO-APIC-level  usb-ohci
 21:        153   IO-APIC-level  ehci_hcd
 22:      58257   IO-APIC-level  usb-ohci
NMI:        479
LOC:     265875
ERR:          0
MIS:          0

this far and it feels like a closer match to what windows does from what
i have read on the ml. 

I haven't even come close to testing this yet, I've only been up 45 mins
but i'll leave it running and do what i usually do when it hangs... =)

I'll get back to you about how it goes... 

-- 
Ian Kumlien <pomac@vapor.com>

[-- Attachment #2: This is a digitally signed message part --]
[-- Type: application/pgp-signature, Size: 189 bytes --]

^ permalink raw reply	[flat|nested] 30+ messages in thread
* RE: merged in bk5 Re: Catching NForce2 lockup with NMI watchdog - found?
@ 2003-12-09 22:57 b
  2003-12-10  4:48 ` Bob
  0 siblings, 1 reply; 30+ messages in thread
From: b @ 2003-12-09 22:57 UTC (permalink / raw)
  To: recbo, linux-kernel

Is this stuff going to be merged into 2.4 soon? I'd like
to try a 2.4.23/24-bk with these patches.


 >From: Bob
 >Subject: merged in bk5 Re: Catching NForce2 lockup with NMI
 >
 >if you're following this thread, good news--
 >
 >nforce2 fixups have been merged in
 >linux-2.6.0-test11-bk5.patch
 >>  -bk snapshot (patch-2.6.0-test11-bk5)
 >
 >nforce2-disconnect-quirk.patch
 >>  [x86] fix lockups with APIC support on nForce2
 >>
 >>nforce2-apic.patch
 >>  [x86] do not wrongly override mp_ExtINT IRQ
 >
 >plus promise and sis fixes so I don't need to pay
 >for a 3ware controller ;-)   that was another
 >show-stopper for me earlier
 >
 >> We're all trying to get acpi, apic, lapic, io-apic working
 >> when turned on in cmos/bios and kernel.
 >>



^ permalink raw reply	[flat|nested] 30+ messages in thread
* Re: Catching NForce2 lockup with NMI watchdog - found?
@ 2003-12-08  3:21 Ross Dickson
  2003-12-08 11:36 ` Craig Bradney
  0 siblings, 1 reply; 30+ messages in thread
From: Ross Dickson @ 2003-12-08  3:21 UTC (permalink / raw)
  To: linux-kernel; +Cc: ross, recbo, B.Zolnierkiewicz

On Monday 08 of December 2003 04:08, Bob wrote: 
 > >>Sounds great.. maybe you have come across something. Yes, the CPU 
 > >>Disconnect function arrived in your BIOS in revision of 2003/03/27 
 > >>"6.Adds"CPU Disconnect Function" to adjust C1 disconnects. The Chipset 
 > >>does not support C2 disconnect; thus, disable C2 function." 
 > >> 
 > >>For me though.. Im on an ASUS A7N8X Deluxe v2 BIOS 1007. From what I can 
 > >>see the CPU Disconnect isnt even in the Uber BIOS 1007 for this ASUS 
 > >>that has been discussed. 
 > >> 
 > >>Craig 
 > >
 > >I don't have that in MSI K7N2 MCP2-T near the 
 > >agp and fsb spread spectrum items or anywhere 
 >> else. 
>Use athcool: 
>         http://members.jcom.home.ne.jp/jacobi/linux/softwares.html#athcool 
> or apply kernel patch (2.4 and 2.6 versions were posted already). 
>--bart 

Please take a look at 

Fixes for nforce2 hard lockup, apic, io-apic, udma133 covered

in mailing list.

I approached it from another angle regarding delaying the apic ack in local timer irq
and achieved stability. It would be good to have others try it. Ian Kumlien is also
reporting success so far.
 

^ permalink raw reply	[flat|nested] 30+ messages in thread
* Re: Catching NForce2 lockup with NMI watchdog - found?
@ 2003-12-06 19:33 Ian Kumlien
  2003-12-07  0:20 ` Craig Bradney
  0 siblings, 1 reply; 30+ messages in thread
From: Ian Kumlien @ 2003-12-06 19:33 UTC (permalink / raw)
  To: linux-kernel


[-- Attachment #1.1: Type: text/plain, Size: 386 bytes --]

Hi, i'm now running this patch and it survived my grep in /usr/src.

It's mainly a correction of the apic patch and the ACPI halt disconnect
patch that was originally done for 2.6...

I'll get back to you about uptime, but i think this is it... 

Although i would prefer a not so workaroundish approach =)

-- 
Ian Kumlien <pomac () vapor ! com> -- http://pomac.netswarm.net

[-- Attachment #1.2: Type: text/x-diff, Size: 1680 bytes --]

--- arch/i386/kernel/mpparse.c.orig	2003-11-28 19:26:19.000000000 +0100
+++ arch/i386/kernel/mpparse.c	2003-12-06 19:35:15.000000000 +0100
@@ -1140,7 +1140,8 @@
 	 */
 	for (i = 0; i < mp_irq_entries; i++) {
 		if ((mp_irqs[i].mpc_dstapic == intsrc.mpc_dstapic) 
-			&& (mp_irqs[i].mpc_srcbusirq == intsrc.mpc_srcbusirq)) {
+			&& (mp_irqs[i].mpc_srcbusirq == intsrc.mpc_srcbusirq)
+			&& (mp_irqs[i].mpc_irqtype == intsrc.mpc_irqtype)) {
 			mp_irqs[i] = intsrc;
 			found = 1;
 			break;
--- arch/i386/kernel/pci-pc.c.orig	2003-12-06 19:32:44.000000000 +0100
+++ arch/i386/kernel/pci-pc.c	2003-12-06 19:33:55.000000000 +0100
@@ -1328,6 +1328,18 @@
 		dev->transparent = 1;
 }
 
+/*
+ * Halt Disconnect and Stop Grant Disconnect (bit 4 at offset 0x6F)
+ * must be disabled when APIC is used (or lockups will happen).
+ */
+static void __devinit pci_fixup_nforce2_disconnect(struct pci_dev *d)
+{
+	u8 t;
+
+	pci_read_config_byte(d, 0x6F, &t);
+	pci_write_config_byte(d, 0x6F, (t & 0xef));
+}
+
 struct pci_fixup pcibios_fixups[] = {
 	{ PCI_FIXUP_HEADER,	PCI_VENDOR_ID_INTEL,	PCI_DEVICE_ID_INTEL_82451NX,	pci_fixup_i450nx },
 	{ PCI_FIXUP_HEADER,	PCI_VENDOR_ID_INTEL,	PCI_DEVICE_ID_INTEL_82454GX,	pci_fixup_i450gx },
@@ -1343,6 +1355,7 @@
 	{ PCI_FIXUP_HEADER,	PCI_VENDOR_ID_VIA,	PCI_DEVICE_ID_VIA_8367_0,	pci_fixup_via_northbridge_bug },
 	{ PCI_FIXUP_HEADER,	PCI_VENDOR_ID_NCR,	PCI_DEVICE_ID_NCR_53C810,	pci_fixup_ncr53c810 },
 	{ PCI_FIXUP_HEADER,	PCI_VENDOR_ID_INTEL,	PCI_ANY_ID,			pci_fixup_transparent_bridge },
+	{ PCI_FIXUP_HEADER,	PCI_VENDOR_ID_NVIDIA,	PCI_DEVICE_ID_NVIDIA_NFORCE2,	pci_fixup_nforce2_disconnect },
 	{ 0 }
 };
 

[-- Attachment #2: This is a digitally signed message part --]
[-- Type: application/pgp-signature, Size: 189 bytes --]

^ permalink raw reply	[flat|nested] 30+ messages in thread
* Re: Catching NForce2 lockup with NMI watchdog - found?
@ 2003-12-06  2:08 Ian Kumlien
  2003-12-06 13:36 ` Craig Bradney
  0 siblings, 1 reply; 30+ messages in thread
From: Ian Kumlien @ 2003-12-06  2:08 UTC (permalink / raw)
  To: linux-kernel; +Cc: cbradney

[-- Attachment #1: Type: text/plain, Size: 993 bytes --]

Craig Bradney wrote:
> All the interrupts are the same...except:
> 0, timer is now IO-APIC-edge.

Same here... 

> Im not getting any NMI counts.. should I use nmi-watchdog=1?

I got nmi counts with nmi_watchdog=2...  I never tested with =1... if
you get nmi's 1 lemme know.

> Ian, from looking back, you have an A7N8X-X bios 1007.
> Interesting that my USB hcis are still sharing IRQs there.

Your? i only see one... But you share it with sound and eth0... 

> Any idea how I can get them apart, or if I should try.

You could always move eth0 to a different slot. Other than that, you can
do manual config for the irq's in the bios, but it shouldn't be
needed...

> My system was pretty stable as I've stated.. but the patch has changed
> things slightly re the timer.

As i stated in my prev email, i had to do 2 full greps at a sizable
amount of data to recreate the crash... =P

And, please CC since i'm not on this ml =P
-- 
Ian Kumlien <pomac@vapor.com>

[-- Attachment #2: This is a digitally signed message part --]
[-- Type: application/pgp-signature, Size: 189 bytes --]

^ permalink raw reply	[flat|nested] 30+ messages in thread
* Re: Catching NForce2 lockup with NMI watchdog - found?
@ 2003-12-06  1:58 Ian Kumlien
  0 siblings, 0 replies; 30+ messages in thread
From: Ian Kumlien @ 2003-12-06  1:58 UTC (permalink / raw)
  To: linux-kernel; +Cc: cbradney, prakashpublic, cheuche+lkml

[-- Attachment #1: Type: text/plain, Size: 1679 bytes --]

Quoting myself... 
> I'm more hopeful about the patch from Mathieu <cheuche+lkml () free ! fr>...

>           CPU0
>   0:     267486    IO-APIC-edge  timer
>   1:       9654    IO-APIC-edge  keyboard
>   2:          0          XT-PIC  cascade
>   8:          1    IO-APIC-edge  rtc
>   9:          0   IO-APIC-level  acpi
>  14:      28252    IO-APIC-edge  ide0
>  15:        103    IO-APIC-edge  ide1
>  16:     251712   IO-APIC-level  eth0
>  17:      90632   IO-APIC-level  EMU10K1
>  19:     415529   IO-APIC-level  nvidia
>  20:          0   IO-APIC-level  usb-ohci
>  21:        153   IO-APIC-level  ehci_hcd
>  22:      58257   IO-APIC-level  usb-ohci
> NMI:        479
> LOC:     265875
> ERR:          0
> MIS:          0

> this far and it feels like a closer match to what windows does from what
> i have read on the ml. 

I think that this is what we want, ie know how windows handles the spic
since i just bet that all the mb manuf. ppl only care about windows and
anything else is secondary. [Can we get some more info from nvidia about
differences in the setup?]

> I haven't even come close to testing this yet, I've only been up 45 mins
> but i'll leave it running and do what i usually do when it hangs... =)

And that some great 2 hours, everything was dandy, screen refreshes
faster (moving windows with contents was snappier and you saw less
trailing refreshes)... but it ended in a beeeeeeeep deadlock.

I later reproduced it again in console mode... It required 2 full grep
-rne test * in my /usr/src, that is, 2.6.0-test11 and 2.4.23*2 + some
rpm's... all in all: 624M + 219M

-- 
Ian Kumlien <pomac@vapor.com>

[-- Attachment #2: This is a digitally signed message part --]
[-- Type: application/pgp-signature, Size: 189 bytes --]

^ permalink raw reply	[flat|nested] 30+ messages in thread
* RE: Catching NForce2 lockup with NMI watchdog
@ 2003-12-05 20:56 Allen Martin
  2003-12-05 23:49 ` Catching NForce2 lockup with NMI watchdog - found? Prakash K. Cheemplavam
  0 siblings, 1 reply; 30+ messages in thread
From: Allen Martin @ 2003-12-05 20:56 UTC (permalink / raw)
  To: 'Jesse Allen'; +Cc: linux-kernel

> -----Original Message-----
> From: Jesse Allen [mailto:the3dfxdude@hotmail.com] 
> Sent: Friday, December 05, 2003 12:36 PM
>
> Do you know whether the nforce2's with apic support the timer 
> (IRQ 0) in 
> IO-APIC mode?  To me, it seems like a bug:
> "Dec  4 20:13:11 tesore kernel: ..MP-BIOS bug: 8254 timer not 
> connected to 
> IO-APIC"
> (This message originates in arch/i386/kernel/io_apic.c)
> 

Yes, Win 9x/2k/XP use the system timer on irq0 and have no problem.  I
haven't looked at this yet.

-Allen

^ permalink raw reply	[flat|nested] 30+ messages in thread

end of thread, other threads:[~2003-12-10  4:29 UTC | newest]

Thread overview: 30+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2003-12-06  0:55 Catching NForce2 lockup with NMI watchdog - found? Ian Kumlien
2003-12-06  1:52 ` Craig Bradney
2003-12-06  4:00   ` Mike Fedyk
2003-12-06  8:23   ` Josh McKinney
2003-12-08  3:12 ` Bob
  -- strict thread matches above, loose matches on Subject: below --
2003-12-09 22:57 merged in bk5 " b
2003-12-10  4:48 ` Bob
2003-12-08  3:21 Ross Dickson
2003-12-08 11:36 ` Craig Bradney
2003-12-08 13:34   ` Ross Dickson
2003-12-08 17:40   ` Bob
2003-12-06 19:33 Ian Kumlien
2003-12-07  0:20 ` Craig Bradney
2003-12-07  0:29   ` Ian Kumlien
2003-12-07 10:32     ` Craig Bradney
2003-12-07 14:54       ` Ian Kumlien
2003-12-06  2:08 Ian Kumlien
2003-12-06 13:36 ` Craig Bradney
2003-12-06 13:47   ` Bartlomiej Zolnierkiewicz
2003-12-06 14:42   ` Ian Kumlien
2003-12-06 15:33     ` Craig Bradney
2003-12-06 15:58       ` Ian Kumlien
2003-12-06  1:58 Ian Kumlien
2003-12-05 20:56 Catching NForce2 lockup with NMI watchdog Allen Martin
2003-12-05 23:49 ` Catching NForce2 lockup with NMI watchdog - found? Prakash K. Cheemplavam
2003-12-05 23:55   ` Prakash K. Cheemplavam
2003-12-06  0:15   ` Craig Bradney
2003-12-06  0:21     ` Prakash K. Cheemplavam
2003-12-06  0:37       ` Craig Bradney
2003-12-08  3:08         ` Bob
2003-12-08  3:06           ` Bartlomiej Zolnierkiewicz
2003-12-08  3:03     ` Bob

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).