Date: Mon, 27 Jun 2016 21:58:08 -0600
From: Alex Williamson
To: Zhou Jie
Cc: izumi.taku@jp.fujitsu.com, caoj.fnst@cn.fujitsu.com, Chen Fan,
 qemu-devel@nongnu.org, mst@redhat.com
Subject: Re: [Qemu-devel] [PATCH v8 11/12] vfio: register aer resume
 notification handler for aer resume
Message-ID: <20160627215808.1531a774@t450s.home>
References: <1464315131-25834-1-git-send-email-zhoujie2011@cn.fujitsu.com>
 <20160527100655.60db8206@t450s.home>
 <30d1cd95-7f67-29cf-c55e-0565364d89ff@cn.fujitsu.com>
 <41b0c187-ade0-182e-46b5-afd3e99f1e36@cn.fujitsu.com>
 <20160620103226.0ff61b21@ul30vt.home>
 <20160620211306.66a6b249@t450s.home>
 <576935FC.1080503@easystack.cn>
 <20160621084443.330f932d@t450s.home>
 <20160621215626.71c99582@t450s.home>
 <113474d2-8408-db49-e7ef-8c6b736af866@cn.fujitsu.com>
 <468b752b-a161-902b-d4cc-489dfa18c21e@cn.fujitsu.com>
 <20160622094236.515549fa@t450s.home>
 <7746532f-2fad-1304-0df7-7cd25ba761af@cn.fujitsu.com>
 <20160627095418.659e6e5f@t450s.home>

On Tue, 28 Jun 2016 11:26:33 +0800
Zhou Jie wrote:

> Hi Alex,
>
> > The INTx/MSI part needs further definition for the user.
> > Are we actually completely tearing down interrupts with the
> > expectation that the user will re-enable them or are we just masking
> > them such that the user needs to unmask? Also note that not all
> > devices support DisINTx.
>
> After reset, the "Bus Master Enable" bit of the "Command Register"
> should be cleared, so MSI/MSI-X interrupt messages are still disabled.
> After reset, the "Interrupt Disable" bit of the "Command Register"
> should be cleared, so INTx interrupts are enabled.
> If the device doesn't support INTx, the "Interrupt Disable" bit will
> be hardwired to 0, which is OK here.
>
> After a fatal error occurs, the user should reset the device and
> reinitialize it.
> So I disable the interrupt before the host resets the device,
> and let the user do the reinitialization.

I'm dubious here. When DisINTx is not supported by the device or it's
marked broken in host quirks, then we can't trust the device to stop
sending INTx. The bit is either hardwired to zero, meaning that it
doesn't work, or it's been found to be broken in other ways. So COMMAND
register masking is not sufficient for all devices.

Also, any time we start changing the state of the device from what the
user expects, we risk consistency problems. We need to consider how the
user last saw the device and whether we can legitimately expect them to
handle the device in a new state. If we expect the user to re-initialize
the device, then would it be more correct to tear down all interrupt
signaling such that the device is effectively in the same state as
initial handoff when the vfio device fd is opened?

How will the user know when the device is ready to be reset? Which of
the ioctls that you're blocking can they poll w/o any unwanted
side-effects or awkward interactions? Should flag bits in the device
info ioctl indicate not only support for this behavior but also the
current status?

Thanks,

Alex