From mboxrd@z Thu Jan 1 00:00:00 1970 From: "Zhang, Qi Z" Subject: Re: [PATCH] net/ixgbe: fix crash when on remove Date: Tue, 26 Feb 2019 05:34:50 +0000 Message-ID: <039ED4275CED7440929022BC67E706115333A9D7@SHSMSX103.ccr.corp.intel.com> References: <1550026132-9244-1-git-send-email-wangyunjian@huawei.com> Mime-Version: 1.0 Content-Type: text/plain; charset="us-ascii" Content-Transfer-Encoding: quoted-printable Cc: "xudingke@huawei.com" , "stable@dpdk.org" To: wangyunjian , "dev@dpdk.org" Return-path: In-Reply-To: <1550026132-9244-1-git-send-email-wangyunjian@huawei.com> Content-Language: en-US List-Id: DPDK patches and discussions List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: dev-bounces@dpdk.org Sender: "dev" Hi: > -----Original Message----- > From: wangyunjian [mailto:wangyunjian@huawei.com] > Sent: Wednesday, February 13, 2019 10:49 AM > To: dev@dpdk.org > Cc: Zhang, Qi Z ; xudingke@huawei.com; Yunjian Wang > ; stable@dpdk.org > Subject: [dpdk-dev] [PATCH] net/ixgbe: fix crash when on remove >=20 > From: Yunjian Wang >=20 > The nic's interrupt source has some active handler, when the port remove.= We > should cancel the delay handler before remove dev to prevent executing th= e > delay handler. Agree, thanks to capture this. >=20 > Call Trace: > #0 ixgbe_disable_intr (hw=3D0x0, hw=3D0x0) > at /usr/src/debug/dpdk-18.11/drivers/net/ixgbe/ixgbe_ethdev.c:852 > #1 ixgbe_dev_interrupt_delayed_handler (param=3D0xadb9c0 > ) > at /usr/src/debug/dpdk-18.11/drivers/net/ixgbe/ixgbe_ethdev.c:4386 > #2 0x00007f05782147af in eal_alarm_callback (arg=3D) > at /usr/src/debug/dpdk-18.11/lib/librte_eal/linuxapp/eal/ > eal_alarm.c:90 > #3 0x00007f057821320a in eal_intr_process_interrupts (nfds=3D1, > events=3D0x7f056cbf3e88) at /usr/src/debug/dpdk-18.11/lib/ > librte_eal/linuxapp/eal/eal_interrupts.c:838 > #4 eal_intr_handle_interrupts (totalfds=3D, pfd=3D18) > at /usr/src/debug/dpdk-18.11/lib/librte_eal/linuxapp/eal/ > eal_interrupts.c:885 > #5 eal_intr_thread_main (arg=3D) > at /usr/src/debug/dpdk-18.11/lib/librte_eal/linuxapp/eal/ > eal_interrupts.c:965 > #6 0x00007f05708a0e45 in start_thread () from /usr/lib64/libpthread.so= .0 > #7 0x00007f056eb4ab5d in clone () from /usr/lib64/libc.so.6 >=20 > Fixes: 2866c5f1b87e ("ixgbe: support port hotplug") > Cc: stable@dpdk.org >=20 > Signed-off-by: Yunjian Wang > --- > drivers/net/ixgbe/ixgbe_ethdev.c | 3 +++ > 1 file changed, 3 insertions(+) >=20 > diff --git a/drivers/net/ixgbe/ixgbe_ethdev.c > b/drivers/net/ixgbe/ixgbe_ethdev.c > index 7493110..e9533e5 100644 > --- a/drivers/net/ixgbe/ixgbe_ethdev.c > +++ b/drivers/net/ixgbe/ixgbe_ethdev.c > @@ -1336,6 +1336,9 @@ struct rte_ixgbe_xstats_name_off { > rte_delay_ms(100); > } while (retries++ < (10 + IXGBE_LINK_UP_TIME)); >=20 > + /* cancel the delay handler before remove dev */ > + rte_eal_alarm_cancel(ixgbe_dev_interrupt_delayed_handler, eth_dev); > + I think it will be more safe to move this call ahead, the delayed handler may invoked application callback which may also invoke = the ethdev API, but at this moment, we already reset ethdev, we still have = chance to get problem, right? Is it better that we add this call at the beginning of dev_close? Regards Qi > /* uninitialize PF if max_vfs not zero */ > ixgbe_pf_host_uninit(eth_dev); >=20 > -- > 1.8.3.1 >=20