From mboxrd@z Thu Jan 1 00:00:00 1970 From: "Paul Grun" Subject: RE: When IBoE will be merged to upstream? Date: Mon, 19 Jul 2010 22:28:24 -0700 Message-ID: <004e01cb27cc$61807010$24815030$@com> References: <20100625155755.GC4630@obsidianresearch.com> <4C32D0BD.3030902@Voltaire.com> <4C3417FA.7000600@Voltaire.com> <20100711043449.GA31404@obsidianresearch.com> <20100712162131.GB15392@obsidianresearch.com> Mime-Version: 1.0 Content-Type: text/plain; charset="US-ASCII" Content-Transfer-Encoding: 7bit Return-path: In-Reply-To: <20100712162131.GB15392-ePGOBjL8dl3ta4EC/59zMFaTQe2KTcn/@public.gmane.org> Content-Language: en-us Sender: linux-rdma-owner-u79uwXL29TY76Z2rM5mHXA@public.gmane.org To: 'Jason Gunthorpe' , 'Liran Liss' Cc: 'Or Gerlitz' , 'Roland Dreier' , "'Hefty, Sean'" , 'Aleksey Senin' , 'linux-rdma' , monis-smomgflXvOZWk0Htik3J/w@public.gmane.org, alekseys-smomgflXvOZWk0Htik3J/w@public.gmane.org, yiftahs-smomgflXvOZWk0Htik3J/w@public.gmane.org, 'Tziporet Koren' , alexr-smomgflXvOZWk0Htik3J/w@public.gmane.org List-Id: linux-rdma@vger.kernel.org > -----Original Message----- > From: linux-rdma-owner-u79uwXL29TY76Z2rM5mHXA@public.gmane.org [mailto:linux-rdma- > owner-u79uwXL29TY76Z2rM5mHXA@public.gmane.org] On Behalf Of Jason Gunthorpe > Sent: Monday, July 12, 2010 9:22 AM > To: Liran Liss > Cc: Or Gerlitz; Roland Dreier; Hefty, Sean; Aleksey Senin; linux-rdma; > monis-smomgflXvOZWk0Htik3J/w@public.gmane.org; alekseys-smomgflXvOZWk0Htik3J/w@public.gmane.org; yiftahs-smomgflXvOZWk0Htik3J/w@public.gmane.org; Tziporet > Koren; alexr-smomgflXvOZWk0Htik3J/w@public.gmane.org > Subject: Re: When IBoE will be merged to upstream? > > On Mon, Jul 12, 2010 at 10:58:19AM +0300, Liran Liss wrote: > > >> "...A verbs consumer using a RoCE network relies strictly > >> > on so-called > >> > > Layer 3 addressing (GIDs); layer 2 addresses (e.g. subnet local > >> > > identifiers) are not passed across the verbs interface..." > >> > > >> > Ah, hmm, well, I was on that list during this time and I don't > >> think > >> > this statement means what you are saying it does :) > >> > > > > ?? It doesn't get any clearer than this. > > 'subnet local identidifer' == LID > > The text is saying that the specification does not use any of the LID > fields in the verbs interface, that is it. It isn't talking about MAC > addresses. > This is incorrect. The intent of the text is that a verbs consumer deals ONLY in layer 3 addresses (GIDs). It doesn't make any sense to restrict the interpretation to mean 'LIDs', since LIDs are a NOP for RoCE. CA16-17: When accessing the services of a RoCE verbs provider, the source and destination identifiers contained in the address vector shall consist of GIDs; the address vector shall not contain layer 2 references (e.g. local addresses). Layer 2 references include source and destination local identifiers and LID Path Bits. "layer 2 references" refers to LIDs, MAC IDs or any other form of layer 2 address. If the wording of the text is insufficiently clear, please post a comment on comment tracker on the IBTA website. Nevertheless, that is the intent. > Exactly how and where the MAC address comes about was never decided, Correct. The IBTA IBXoE WG felt that defining the mapping from GID to MAC ID should be a function of the underlying fabric (Ethernet) and thus was out of scope for us to define the mapping mechanism. > and at least some participants thought it should be a 1:1 algorithmic > mapping from the GID. > > Ditto for VLANs, how and where the vlan tag comes about is not part of > the spec. > > > Good idea! This is exactly what we do today for addresses that the > > user explicitly declares as link-local addresses. But, we can't > > mandate an overload of the GID in a way that it prevents its use as > > a true L3 address (eventually routable). > > We are very unlikely to see routable IBoE, ever.. But, even if we do > get there some day then we could extend the AH. > Please keep in mind that IB routing is not yet defined. Although the RoCE spec doesn't explicitly say so, the intent is that routing for RoCE can be defined once the work on IB routing has been completed. > BTW, I absolutely hate the mixing of 'Sometimes it is a IPv4, > sometimes it is a GID, and sometimes it is an IPv6' in the same > field. That is just so nasty. The GID is a GID, don't overload it in > an ambiguous way to mean 2 other things! I am unaware of any overloading, at least in the RoCE spec. > > > > create_ah does not accept any sort of source address > > > specifier > > > > You are wrong -- sgid_index specifies it. > > So, what do you propose to put in sgid_index? It isn't big enough to > store an IPv6 address. You can't exactly number every IP assigned to > every ethernet interface. > > The other fields you mention are not a supserset of socket parameters, > they are only IPv6 parameters, IPv4 uses a different set. > > > Jason, bottom line, I think that we both agree that the rdmacm > > should do the address resolution. The difference is that by having > > the rdmacm initially only bind to the device and complete the > > resolution later (by a call from create_ah()), we don't change the > > user API for *all* gid types. Having addressed your concerns > > regarding resolution below the Verbs, we continue to believe that > > this is the best approach. > > Again, I don't see how what I've outlined changes the API in any > way. > > Doing two routing lookups for the same connection is bad design, it is > racey. L2 parameters have to flow from the first routing lookup in > RDMA-CM to everything else. > > Liran, I don't think you have at all come close to addressing my > concerns, you still haven't explained how a full route lookup is even > possible in create_ah, for instance. Let alone my other concerns! > > Jason > -- > To unsubscribe from this list: send the line "unsubscribe linux-rdma" in > the body of a message to majordomo-u79uwXL29TY76Z2rM5mHXA@public.gmane.org > More majordomo info at http://vger.kernel.org/majordomo-info.html Paul (Chair, IBTA IBXoE WG) -- To unsubscribe from this list: send the line "unsubscribe linux-rdma" in the body of a message to majordomo-u79uwXL29TY76Z2rM5mHXA@public.gmane.org More majordomo info at http://vger.kernel.org/majordomo-info.html