From mboxrd@z Thu Jan 1 00:00:00 1970 Received: from [140.186.70.92] (port=45970 helo=eggs.gnu.org) by lists.gnu.org with esmtp (Exim 4.43) id 1Pgedr-0000oF-VZ for qemu-devel@nongnu.org; Sat, 22 Jan 2011 09:40:41 -0500 Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1Pgedq-0008JP-Ha for qemu-devel@nongnu.org; Sat, 22 Jan 2011 09:40:39 -0500 Received: from hall.aurel32.net ([88.191.126.93]:34456) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1Pgedp-0008Ig-QH for qemu-devel@nongnu.org; Sat, 22 Jan 2011 09:40:38 -0500 Date: Sat, 22 Jan 2011 15:40:34 +0100 From: Aurelien Jarno Subject: Re: [Qemu-devel] Re: [PATCH 8/8] ahci: fix !msi interrupts Message-ID: <20110122144034.GC3960@hall.aurel32.net> References: <4D346834.8000903@siemens.com> <4D3468A0.3060709@suse.de> <4D346C31.8060300@siemens.com> <4D346F5B.7090502@suse.de> <4D35587F.3050503@redhat.com> <2006A7EF-07DC-4DAA-B6ED-88D24D6B7978@suse.de> <4D358E71.3040905@siemens.com> <20110122131303.GI27195@hall.aurel32.net> <20110122141406.GC21090@laped.lan> MIME-Version: 1.0 Content-Type: text/plain; charset=iso-8859-15 Content-Disposition: inline In-Reply-To: Sender: Aurelien Jarno List-Id: qemu-devel.nongnu.org List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , To: Alexander Graf Cc: Kevin Wolf , Joerg Roedel , qemu-devel Developers , Gerd Hoffmann , Jan Kiszka , "Edgar E.Iglesias" , Sebastian Herbszt On Sat, Jan 22, 2011 at 03:32:36PM +0100, Alexander Graf wrote: > > On 22.01.2011, at 15:14, Edgar E. Iglesias wrote: > > > On Sat, Jan 22, 2011 at 02:13:03PM +0100, Aurelien Jarno wrote: > >> On Tue, Jan 18, 2011 at 01:58:25PM +0100, Jan Kiszka wrote: > >>> On 2011-01-18 13:05, Alexander Graf wrote: > >>>> > >>>> On 18.01.2011, at 10:08, Gerd Hoffmann wrote: > >>>> > >>>>> Hi, > >>>>> > >>>>>>> Worse might also be that unknown issue that force you to inject an IRQ > >>>>>>> here. We don't know. That's probably worst. > >>>>>> > >>>>>> Well, IIRC the issue was that usually a level high interrupt line would > >>>>>> simply retrigger an interrupt after enabling the interrupt line in the > >>>>>> APIC again. > >>>>> > >>>>> edge triggered interrupts wouldn't though. > >>>> > >>>> The code change doesn't change anything for edge triggered interrupts - they work fine. Only !msi (== level) ones are broken. > >>>> > >>>>> > >>>>>> Unless my memory completely fails on me, that didn't happen, > >>>>>> so I added the manual retrigger on a partial command ACK in ahci code. > >>>>> > >>>>> That re-trigger smells like you are not clearing the interrupt line where you should. For starters try calling ahci_check_irq() after guest writes to PORT_IRQ_STAT. > >>>> > >>>> The problem happened when I had the following: > >>>> > >>>> ahci irq bits = 0 > >>>> > >>>> ahci irq bits = 1 | 2 > >>>> irq line trigger > >>>> guest clears 1 > >>>> ahci bits = 2 > >>>> > >>>> > >>>> On normal hardware, the high state of the irq line would simply trigger another interrupt in the guest when it gets ACKed on the LAPIC. Somehow it doesn't get triggered here. I don't see why clearing the interrupt line would help? It's not what real hardware would do, no? > >>>> > >>> > >>> No, real devices would simply leave the line asserted as long as there > >>> is a reason to. > >>> > >>> So again my question: With which irqchips do you see this effect, kvm's > >>> in-kernel model and/or qemu's user space model? If there is an issue > >>> with retriggering a CPU interrupt while the source is still asserted, > >>> that probably needs to be fixed. > >>> > >> > >> I guess it might be related to a problem identified long time ago on > >> some targets, and that leads to the following #ifdef in i8259.c: > >> > >> | /* all targets should do this rather than acking the IRQ in the cpu */ > >> | #if defined(TARGET_MIPS) || defined(TARGET_PPC) || defined(TARGET_ALPHA) > >> > >> For your information it has been fixed on MIPS in commit > >> 4de9b249d37c1b382cc3e5a21fad1b4a11cec2fa. > >> > >> Basically when an interrupt line is high, the interrupt is getting > >> delivered to the CPU. This part works correctly on x86. The CPU will > >> take a corresponding action, basically either disabling the interrupt > >> at the CPU or controller level or doing something on the device so that > >> it lower its IRQ line. This new IRQ line level should then propagate > >> through the various controllers, which should also lower their IRQ line > >> if no other interrupt line are active. This ACK process should then > >> continue up to the CPU. > > > > I totally agree. > > > > > >> For x86 the interrupt state is cleared as soon as the interrupt is > >> signaled to the CPU (see cpu-exec.c line 407), therefore if an interrupt > >> is still pending, it won't be seen by the CPU. It's probably what you > >> observed with AHCI. > > > > Yes, essentially, the CPU_INTERRUPT_HARD signal is an input to the CPU. > > The CPU cannot drive it directly. To lower it, it must take some kind > > of indirect action (IO or whatever) to clear the condition that is > > forcing it high. Any assignments to clear or set the CPU_INTERRUPT_HARD > > signal from within the CPU core are likely wrong.. > > > > FWIW, PPC code in cpu-exec.c:443 looks suspicious aswell... > > How's that suspicious? As long as pending_interrupts is only reset on actual interrupt delivery, everything's fine. Interrupts like the decrementor are not level based on some PPCs, so the actual semantics are implementation dependent. > It's suspicious because it's not the right place to do that. It should be done in the interrupt controller. This seems to be already done in hw/ppc.c when updating pending_interrupts, so this code seems to be useless. -- Aurelien Jarno GPG: 1024D/F1BCDB73 aurelien@aurel32.net http://www.aurel32.net