From mboxrd@z Thu Jan 1 00:00:00 1970 From: "Guo, Jia" Subject: Re: [PATCH V20 2/4] eal: add failure handler mechanism for hot plug Date: Thu, 3 May 2018 11:17:27 +0800 Message-ID: References: <1498711073-42917-1-git-send-email-jia.guo@intel.com> <1524058689-4954-1-git-send-email-jia.guo@intel.com> <1524058689-4954-3-git-send-email-jia.guo@intel.com> <2601191342CEEE43887BDE71AB977258AE918E93@IRSMSX102.ger.corp.intel.com> Mime-Version: 1.0 Content-Type: text/plain; charset=windows-1252; format=flowed Content-Transfer-Encoding: 7bit Cc: "jblunck@infradead.org" , "shreyansh.jain@nxp.com" , "dev@dpdk.org" , "Zhang, Helin" To: "Ananyev, Konstantin" , "stephen@networkplumber.org" , "Richardson, Bruce" , "Yigit, Ferruh" , "gaetan.rivet@6wind.com" , "Wu, Jingjing" , "thomas@monjalon.net" , "motih@mellanox.com" , "matan@mellanox.com" , "Van Haaren, Harry" , "Tan, Jianfeng" Return-path: Received: from mga06.intel.com (mga06.intel.com [134.134.136.31]) by dpdk.org (Postfix) with ESMTP id CD5C820BD for ; Thu, 3 May 2018 05:17:31 +0200 (CEST) In-Reply-To: <2601191342CEEE43887BDE71AB977258AE918E93@IRSMSX102.ger.corp.intel.com> List-Id: DPDK patches and discussions List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: dev-bounces@dpdk.org Sender: "dev" On 4/21/2018 12:16 AM, Ananyev, Konstantin wrote: >>> + >>> +static void sigbus_handler(int signum __rte_unused, siginfo_t *info, >>> + void *ctx __rte_unused) >>> +{ >>> + int ret; >>> + >>> + RTE_LOG(ERR, EAL, "SIGBUS error, fault address:%p\n", info->si_addr); >>> + ret = dev_uev_failure_process(NULL, info->si_addr); >> As now you can try to mmap/munmap same address from two or more different threads >> you probably need some synchronization here. >> Something simple as spinlock seems to be enough here. >> We might have one per device or might be even a global one would be ok here. >> >>> + if (!ret) >>> + RTE_LOG(DEBUG, EAL, >>> + "SIGBUS error is because of hot unplug!\n"); > Also if sigbus handler wasn't able to fix things - failure addr doesn't belong to > any devices, or remaping fails - we probably should invoke previously installed handler > or just apply default action. > Konstantin i think just exception here by exit for apply default action, and info that is a normal sigbus error should be ok. >>> +} >>> +