From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1753767AbcFOXLx (ORCPT ); Wed, 15 Jun 2016 19:11:53 -0400 Received: from ale.deltatee.com ([207.54.116.67]:43184 "EHLO ale.deltatee.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1752739AbcFOXLu (ORCPT ); Wed, 15 Jun 2016 19:11:50 -0400 To: "Jiang, Dave" , "Allen.Hubbe@emc.com" , "jdmason@kudzu.us" References: <1466026416.16234.262.camel@intel.com> <5761D4C5.8090208@deltatee.com> <1466029459.16234.263.camel@intel.com> Cc: "linux-kernel@vger.kernel.org" , "shuahkh@osg.samsung.com" , "sudipm.mukherjee@gmail.com" , "linux-kselftest@vger.kernel.org" , "arnd@arndb.de" , "linux-ntb@googlegroups.com" From: Logan Gunthorpe Message-ID: <5761E0A0.4060505@deltatee.com> Date: Wed, 15 Jun 2016 17:11:28 -0600 User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:38.0) Gecko/20100101 Icedove/38.8.0 MIME-Version: 1.0 In-Reply-To: <1466029459.16234.263.camel@intel.com> Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit X-SA-Exim-Connect-IP: 172.16.1.111 X-SA-Exim-Rcpt-To: linux-ntb@googlegroups.com, arnd@arndb.de, linux-kselftest@vger.kernel.org, sudipm.mukherjee@gmail.com, shuahkh@osg.samsung.com, linux-kernel@vger.kernel.org, jdmason@kudzu.us, Allen.Hubbe@emc.com, dave.jiang@intel.com X-SA-Exim-Mail-From: logang@deltatee.com Subject: Re: [PATCH v3 10/10] ntb_perf: clear link_is_up flag when the link goes down. X-SA-Exim-Version: 4.2.1 (built Mon, 26 Dec 2011 16:24:06 +0000) X-SA-Exim-Scanned: Yes (on ale.deltatee.com) Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Hey, On 15/06/16 04:24 PM, Jiang, Dave wrote: > On Wed, 2016-06-15 at 16:20 -0600, Logan Gunthorpe wrote: >> Hey, >> >> Actually, I have to retract this patch. After some more thorough >> testing >> I'm finding an issue: >> >> When you remove and re-install the ntb_perf module very quickly, >> ntb_perf will occasionally miss the link up event. This is because >> the >> link_cleanup work gets delayed long enough that it gets scheduled >> after >> the link up event gets sent. It then cancels the link work that >> should >> have occurred. Without this patch, it never happens because >> link_is_up >> never returns to false. >> >> I think the correct solution is to just remove the link_cleanup work >> and >> do those actions immediately on receipt of the event. If there's >> agreement on this I can re-spin it again. > > I'm ok with that. This is not an issue with ntb_transport? Looks like I can get something similar to happen in ntb_transport. However, it's much rarer and takes significantly more tries to get it to occur. It does appear to correctly set its link_is_up to false when the link goes down. I'm not sure I'm quite clear on the flow in ntb_transport and don't have time right now to study it so I'll have to let that be someone else's (fairly minor) issue. Logan