From mboxrd@z Thu Jan 1 00:00:00 1970
Return-Path:
Received: from aserp2120.oracle.com ([141.146.126.78]:55216 "EHLO
 aserp2120.oracle.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org
 with ESMTP id S1725901AbeHPQaG (ORCPT );
 Thu, 16 Aug 2018 12:30:06 -0400
Subject: Re: [PATCH 1/1] PCI/AER: prevent pcie_do_fatal_recovery from using
 device after it is removed
To: Benjamin Herrenschmidt , poza@codeaurora.org
Cc: okaya@kernel.org, bhelgaas@google.com, keith.busch@intel.com,
 linux-pci@vger.kernel.org, linux-pci-owner@vger.kernel.org, Sam Bobroff
References: <1534179088-44219-1-git-send-email-thomas.tai@oracle.com>
 <1534179088-44219-2-git-send-email-thomas.tai@oracle.com>
 <51f4b387d9bd96a42d526a6a029fc43b@codeaurora.org>
 <903394c04d6ad468ed06dc0a779200e7555345a7.camel@kernel.crashing.org>
 <6cb069038530757f31f3dd60328c7e30@codeaurora.org>
 <42bd39aef30fe24bfc48d378e1f5d35d@codeaurora.org>
 <54d19e0e3d44bedf247853144c6bbfed5561a125.camel@kernel.crashing.org>
 <6382ee9197b781f5f5680aa60176f937669a0a13.camel@kernel.crashing.org>
From: Thomas Tai
Message-ID:
Date: Thu, 16 Aug 2018 09:30:21 -0400
MIME-Version: 1.0
In-Reply-To: <6382ee9197b781f5f5680aa60176f937669a0a13.camel@kernel.crashing.org>
Content-Type: text/plain; charset=utf-8; format=flowed
Sender: linux-pci-owner@vger.kernel.org
List-ID:

[ ... ]
>
>> and all we have to do is discuss and evolve it or change it
>> we can catch up on webex, (Sinan is going to be there in Plumber's
>> conference, I might not be able to join there, as we have bring-up
>> coming)
>
> Ok, I'll try to get there. Let's plan at least a BOF or two if not a
> microconf.
>
> To setup a webex let's first list who needs to attend and respective
> timezones so we can figure out a time. I'm in Australia east coast.

Hi guys,

I see that there is a lot of discussion regarding the error handling.
May I join the webex? Hopefully I can be helpful.
I'm in Canada east coast (Ottawa, Ontario, Canada)

Thank you,
Thomas

>
>>>> The way DPC used to behave in 2016, is still the same; which involved
>>>> removing and re-enumerating the devices.
>>>
>>> Which is mostly useless for anything that isn't a network device.
>>>
>>> We've been doing EEH for something like 15 to 20 years, so we have a
>>> long experience with what it takes to get PCI(e) devices to recover on
>>> enterprise systems.
>>>
>>> Removing and re-enumerating is one of the very worst thing you can do
>>> in that area.
>>>
>>> Cheers,
>>> Ben.
>