From mboxrd@z Thu Jan 1 00:00:00 1970 From: "Ertman, David M" Subject: RE: [RFC v1 01/19] net/i40e: Add peer register/unregister to struct i40e_netdev_priv Date: Fri, 22 Feb 2019 20:13:58 +0000 Message-ID: <2B0E3F215D1AB84DA946C8BEE234CCC97B11DF23@ORSMSX101.amr.corp.intel.com> References: <20190215171107.6464-1-shiraz.saleem@intel.com> <20190215171107.6464-2-shiraz.saleem@intel.com> <20190215172233.GC30706@ziepe.ca> <9DD61F30A802C4429A01CA4200E302A7A5A471B8@fmsmsx124.amr.corp.intel.com> <20190221193523.GO17500@ziepe.ca> Mime-Version: 1.0 Content-Type: text/plain; charset="us-ascii" Content-Transfer-Encoding: 8BIT Return-path: In-Reply-To: <20190221193523.GO17500@ziepe.ca> Content-Language: en-US Sender: netdev-owner@vger.kernel.org To: Jason Gunthorpe , "Saleem, Shiraz" Cc: "dledford@redhat.com" , "davem@davemloft.net" , "linux-rdma@vger.kernel.org" , "netdev@vger.kernel.org" , "Ismail, Mustafa" , "Kirsher, Jeffrey T" , "Patil, Kiran" List-Id: linux-rdma@vger.kernel.org > -----Original Message----- > From: Jason Gunthorpe [mailto:jgg@ziepe.ca] > Sent: Thursday, February 21, 2019 11:35 AM > To: Saleem, Shiraz > Cc: dledford@redhat.com; davem@davemloft.net; linux- > rdma@vger.kernel.org; netdev@vger.kernel.org; Ismail, Mustafa > ; Kirsher, Jeffrey T ; > Patil, Kiran ; Ertman, David M > > Subject: Re: [RFC v1 01/19] net/i40e: Add peer register/unregister to struct > i40e_netdev_priv > > On Thu, Feb 21, 2019 at 02:19:33AM +0000, Saleem, Shiraz wrote: > > >Subject: Re: [RFC v1 01/19] net/i40e: Add peer register/unregister to > > >struct i40e_netdev_priv > > > > > >On Fri, Feb 15, 2019 at 11:10:48AM -0600, Shiraz Saleem wrote: > > >> Expose the register/unregister function pointers in the struct > > >> i40e_netdev_priv which is accesible via the netdev_priv() interface > > >> in the RDMA driver. On a netdev notification in the RDMA driver, > > >> the appropriate LAN driver register/unregister functions are > > >> invoked from the struct i40e_netdev_priv structure, > > > > > >Why? In later patches we get an entire device_add() based thing. Why > > >do you need two things? > > > > > >The RDMA driver should bind to the thing that device_add created and > > >from there reliably get the netdev. It should not listen to netdev notifiers for > attachment. > > > > In the new IDC mechanism between ice<->irdma, the LAN driver setups up > > the device for us and attaches it to a software bus via device_add() based > mechanism. > > However, RDMA driver binds to the device only when the LAN 'register' > > function is called in irdma. > > That doesn't make sense. The PCI driver should always create the required > struct device attachment point when attachment is becomes possible. > > > There is no ordering guarantee in which irdma, i40e and ice modules load. > > The netdev notifier is for the case where the irdma loads before i40e > > or ice. > > You are supposed to use the driver core to handle this ordering. > > The pci driver creates the attachment points in the correct order, when they > are ready for use, and the driver core will automatically attach registered > device drivers to the attachement points, no matter the module load loader. > > You will have a netdev and a rdma attachment point, sounds like the RDMA one > is created once the netdev is happy. > > Maybe what you are missing is a struct device_driver? > > Jason I am assuming that the term PCI driver is being used to mean the PCI subsystem in the kernel. If this assumption is wrong, please disregard the next paragraph, but the following points will still apply. The bus that the irdma driver is registering with is a software (pseudo) bus and not a hardware bus, and since this software bus is being defined by the LAN driver, the bus does not exist until a LAN driver is loaded, up, and ready to receive registration from the irdma peer. The PCI driver does not have anything to with this bus, and has no ability to perform the described functions. The irdma driver cannot register with the software bus unless it registers with the LAN driver that controls the bus. The LAN driver's register function will call "driver_register(&drv->driver)" for the registering irdma driver. Since the irdma driver is a consolidated driver (supports both ice and i40e LAN drivers), we cannot guarantee that a given LAN driver will load before the irdma driver. Even if we use module dependencies to make irdma depend on (ice || i40e), we have to consider the situation where a machine will have both an ice supported LAN device and an i40e supported LAN device in it. In this case, the load order could be (e.g.) i40e -> irdma -> ice. The irdma driver can check all present netdevs when it loads to find the one that has the correct function pointers in it, but it will have no way of knowing that a new software bus was created by the second LAN driver to load. This is why irdma is listening for netdev notifiers, so that whenever a new netdev appears from a LAN driver loading after irdma, the irdma driver can evaluate whether the new netdev was created by a LAN driver supported by irdma driver. If I have misunderstood your concerns, I apologize and look forward to clarification. Thanks, -Dave Ertman