From mboxrd@z Thu Jan 1 00:00:00 1970 Received: from eggs.gnu.org ([2001:4830:134:3::10]:33687) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1bHlYh-0003Eh-5L for qemu-devel@nongnu.org; Tue, 28 Jun 2016 01:27:40 -0400 Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1bHlYc-0002Dc-TZ for qemu-devel@nongnu.org; Tue, 28 Jun 2016 01:27:38 -0400 Received: from [59.151.112.132] (port=2013 helo=heian.cn.fujitsu.com) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1bHlYb-0002DI-RH for qemu-devel@nongnu.org; Tue, 28 Jun 2016 01:27:34 -0400 References: <1464315131-25834-1-git-send-email-zhoujie2011@cn.fujitsu.com> <20160527100655.60db8206@t450s.home> <30d1cd95-7f67-29cf-c55e-0565364d89ff@cn.fujitsu.com> <41b0c187-ade0-182e-46b5-afd3e99f1e36@cn.fujitsu.com> <20160620103226.0ff61b21@ul30vt.home> <20160620211306.66a6b249@t450s.home> <576935FC.1080503@easystack.cn> <20160621084443.330f932d@t450s.home> <20160621215626.71c99582@t450s.home> <113474d2-8408-db49-e7ef-8c6b736af866@cn.fujitsu.com> <468b752b-a161-902b-d4cc-489dfa18c21e@cn.fujitsu.com> <20160622094236.515549fa@t450s.home> <7746532f-2fad-1304-0df7-7cd25ba761af@cn.fujitsu.com> <20160627095418.659e6e5f@t450s.home> <20160627215808.1531a774@t450s.home> From: Zhou Jie Message-ID: <7912dad0-0e37-603d-fdfe-bb4950b55f28@cn.fujitsu.com> Date: Tue, 28 Jun 2016 13:27:21 +0800 MIME-Version: 1.0 In-Reply-To: <20160627215808.1531a774@t450s.home> Content-Type: text/plain; charset="UTF-8"; format=flowed Content-Transfer-Encoding: 7bit Subject: Re: [Qemu-devel] [PATCH v8 11/12] vfio: register aer resume notification handler for aer resume List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , To: Alex Williamson Cc: izumi.taku@jp.fujitsu.com, caoj.fnst@cn.fujitsu.com, Chen Fan , qemu-devel@nongnu.org, mst@redhat.com Hi Alex, On 2016/6/28 11:58, Alex Williamson wrote: > On Tue, 28 Jun 2016 11:26:33 +0800 > Zhou Jie wrote: > >> Hi Alex, >> >>> The INTx/MSI part needs further definition for the user. Are we >>> actually completely tearing down interrupts with the expectation that >>> the user will re-enable them or are we just masking them such that the >>> user needs to unmask? Also note that not all devices support DisINTx. >> >> After reset, the "Bus Master Enable" bit of "Command Register" >> should be cleared, so MSI/MSI- X interrupt Messages is still disabled. >> After reset, the "Interrupt Disable" bit of "Command Register" >> should be cleared, so INTx interrupts is enabled. >> If the device doesn't support INTx, "Interrupt Disable" bit will >> hardware to 0, it is OK here. >> >> After fatal-error occurs, the user should reset the device and >> reinitialize the device. >> So I disable the interrupt before host reset the device, >> and let user to do the reinitialization. > > I'm dubious here. When DisINTx is not supported by the device or it's > marked broken in host quirks, then we can't trust the device to stop > sending INTx. It's hardwired to zero, meaning that it doesn't work or > it's been found to be broken in other ways. So COMMAND register > masking is not sufficient for all devices. For Endpoints that generate INTx interrupts, this bit is required. For Endpoints that do not generate IN Tx interrupts this bit is optional. If not implemented, this bit must be hardwired to 0b. For Root Ports, Switch Ports, and Bridges that generate INTx interrupts on their own behalf, this bit is required. The above is from "7.5.1.1." of "PCI Express Base Specification 3.1a". So I think "Interrupt Disable" bit must be supported by the device which can generate INTx interrupts. > Also, any time we start > changing the state of the device from what the user expects, we risk > consistency problems. We need to consider how the user last saw the > device and whether we can legitimately expect them to handle the device > in a new state. If we expect the user to re-initialize the device then > would it be more correct to teardown all interrupt signaling such that > the device is effectively in the same state as initial handoff when the > vfio device fd is opened? Before the user re-initialize the device, host has reseted the device. The interrupt status will be cleared by hardware. So the hardware is the same as the state when the vfio device fd is opened. > How will the user know when the device is > ready to be reset? Which of the ioctls that you're blocking can they > poll w/o any unwanted side-effects or awkward interactions? Should > flag bits in the device info ioctl indicate not only support for this > behavior but also the current status? Thanks, I can block the reset ioctl and config write. I will not add flag for the device current status, because I don't depend on user to prevent awkward interactions. Sincerely Zhou Jie