All of lore.kernel.org
 help / color / mirror / Atom feed
From: Vladimir Oltean <olteanv@gmail.com>
To: Ido Schimmel <idosch@idosch.org>
Cc: netdev@vger.kernel.org, Andrew Lunn <andrew@lunn.ch>,
	Florian Fainelli <f.fainelli@gmail.com>,
	Vivien Didelot <vivien.didelot@gmail.com>,
	Jiri Pirko <jiri@resnulli.us>, DENG Qingfang <dqfext@gmail.com>,
	Tobias Waldekranz <tobias@waldekranz.com>,
	George McCollister <george.mccollister@gmail.com>,
	Vlad Yasevich <vyasevich@gmail.com>,
	Roopa Prabhu <roopa@nvidia.com>,
	Nikolay Aleksandrov <nikolay@nvidia.com>
Subject: Re: [RFC PATCH v2 net-next 05/17] net: bridge: implement unicast filtering for the bridge device
Date: Thu, 24 Feb 2022 15:52:41 +0200	[thread overview]
Message-ID: <20220224135241.ne6c64segpt6azed@skbuf> (raw)
In-Reply-To: <YheGlwjp849dhcpq@shredder>

On Thu, Feb 24, 2022 at 03:22:31PM +0200, Ido Schimmel wrote:
> On Tue, Feb 22, 2022 at 07:18:10PM +0200, Vladimir Oltean wrote:
> > On Tue, Feb 22, 2022 at 06:54:13PM +0200, Ido Schimmel wrote:
> > > On Tue, Feb 22, 2022 at 01:21:53PM +0200, Vladimir Oltean wrote:
> > > > Hi Ido,
> > > > 
> > > > On Mon, 1 Mar 2021 at 17:22, Ido Schimmel <idosch@idosch.org> wrote:
> > > > >
> > > > > On Wed, Feb 24, 2021 at 01:43:38PM +0200, Vladimir Oltean wrote:
> > > > > > From: Vladimir Oltean <vladimir.oltean@nxp.com>
> > > > > >
> > > > > > The bridge device currently goes into promiscuous mode when it has an
> > > > > > upper with a different MAC address than itself. But it could do better:
> > > > > > it could sync the MAC addresses of its uppers to the software FDB, as
> > > > > > local entries pointing to the bridge itself. This is compatible with
> > > > > > switchdev, since drivers are now instructed to trap these MAC addresses
> > > > > > to the CPU.
> > > > > >
> > > > > > Note that the dev_uc_add API does not propagate VLAN ID, so this only
> > > > > > works for VLAN-unaware bridges.
> > > > >
> > > > > IOW, it breaks VLAN-aware bridges...
> > > > >
> > > > > I understand that you do not want to track bridge uppers, but once you
> > > > > look beyond L2 you will need to do it anyway.
> > > > >
> > > > > Currently, you only care about getting packets with specific DMACs to
> > > > > the CPU. With L3 offload you will need to send these packets to your
> > > > > router block instead and track other attributes of these uppers such as
> > > > > their MTU so that the hardware will know to generate MTU exceptions. In
> > > > > addition, the hardware needs to know the MAC addresses of these uppers
> > > > > so that it will rewrite the SMAC of forwarded packets.
> > > > 
> > > > Ok, let's say I want to track bridge uppers. How can I track the changes to
> > > > those interfaces' secondary addresses, in a way that keeps the association
> > > > with their VLAN ID, if those uppers are VLAN interfaces?
> > > 
> > > Hi,
> > > 
> > > I'm not sure what you mean by "secondary addresses", but the canonical
> > > way that I'm familiar with of adding MAC addresses to a netdev is to use
> > > macvlan uppers. For example:
> > > 
> > > # ip link add name br0 up type bridge vlan_filtering 1
> > > # ip link add link br0 name br0.10 type vlan id 10
> > > # ip link add link br0.10 name br0.10-v address 00:11:22:33:44:55 type macvlan mode private
> > > 
> > > keepalived uses it in VRRP virtual MAC mode (for example):
> > > https://github.com/acassen/keepalived/blob/master/doc/NOTE_vrrp_vmac.txt
> > > 
> > > In the software data path, this will result in br0 transitioning to
> > > promisc mode and passing all the packets to upper devices that will
> > > filter them.
> > > 
> > > In the hardware data path, you can apply promisc mode by flooding to
> > > your CPU port (I believe this is what you are trying to avoid) or
> > > install an FDB entry <00:11:22:33:44:55,10> that points to your CPU
> > > port.
> > 
> > Maybe the terminology is not the best, but by secondary addresses I mean
> > struct net_device :: uc and mc. To my knowledge, the MAC address of
> > vlan/macvlan uppers is not the only way in which these lists can be
> > populated. There is also AF_PACKET UAPI for PACKET_MR_MULTICAST and
> > PACKET_MR_UNICAST, and this ends up calling dev_mc_add() and
> > dev_uc_add(). User space may use this API to add a secondary address to
> > a VLAN upper interface of a bridge.
> 
> OK, I see the problem... So you want the bridge to support
> 'IFF_UNICAST_FLT' by installing local FDB entries? I see two potential
> problems:
> 
> 1. For VLAN-unaware bridges this is trivial as VLAN information is of no
> use. For VLAN-aware bridges we either need to communicate VLAN
> information from upper layers or install a local FDB entry per each
> configured VLAN (wasteful...). Note that VLAN information will not
> always be available (in PACKET_MR_UNICAST, for example), in which case a
> local FDB entry will need to be configured per each existing VLAN in
> order to maintain existing behavior. Which lead to me think about the
> second problem...
>
> 2. The bigger problem that I see is that if the bridge starts supporting
> 'IFF_UNICAST_FLT' by installing local FDB entries, then packets that
> were previously locally received and flooded will only be locally
> received. Only locally receiving them makes sense, but I don't know what
> will break if we change the existing behavior... Maybe this needs to be
> guarded by a new bridge option?

I think it boils down to whether PACKET_MR_UNICAST on br0 is equivalent to
'bridge fdb add dev br0 self permanent' or not. Theoretically, the
former means "if a packet enters the local termination path of br0,
don't drop it", while the other means "direct this MAC DA only towards
the local termination path of br0". I.o.w. the difference between "copy
to CPU" and "trap to CPU".

If we agree they aren't equivalent, and we also agree that a macvlan on
top of a bridge wants "trap to CPU" instead of "copy to CPU", I think
the only logical conclusion is that the communication mechanism between
the bridge and the macvlan that we're looking for doesn't exist -
dev_uc_add() does something slightly different.

Which is why I want to better understand your idea of having the bridge
track upper interfaces.

Essentially, it isn't the bridge local FDB entries that I have a problem with.
"Locally terminated packets that are also flooded on other bridge ports"
is a problem that DSA users have tried to get rid of for years, I didn't
hear a single complaint after we started fixing that. To me, a bridge
VLAN is by definition an L2 broadcast domain and MAC addresses should be
unique. I can't imagine what would break if we'd make the bridge deliver
the packets only to their known destination.

  reply	other threads:[~2022-02-24 13:52 UTC|newest]

Thread overview: 34+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2021-02-24 11:43 [RFC PATCH v2 net-next 00/17] RX filtering in DSA Vladimir Oltean
2021-02-24 11:43 ` [RFC PATCH v2 net-next 01/17] net: dsa: reference count the host mdb addresses Vladimir Oltean
2021-02-26  9:20   ` Tobias Waldekranz
2021-02-24 11:43 ` [RFC PATCH v2 net-next 02/17] net: dsa: reference count the host fdb addresses Vladimir Oltean
2021-02-24 11:43 ` [RFC PATCH v2 net-next 03/17] net: dsa: install the host MDB and FDB entries in the master's RX filter Vladimir Oltean
2021-02-24 11:43 ` [RFC PATCH v2 net-next 04/17] net: dsa: install the port MAC addresses as host fdb entries Vladimir Oltean
2021-02-24 11:43 ` [RFC PATCH v2 net-next 05/17] net: bridge: implement unicast filtering for the bridge device Vladimir Oltean
2021-03-01 15:22   ` Ido Schimmel
2022-02-22 11:21     ` Vladimir Oltean
2022-02-22 16:54       ` Ido Schimmel
2022-02-22 17:18         ` Vladimir Oltean
2022-02-24 13:22           ` Ido Schimmel
2022-02-24 13:52             ` Vladimir Oltean [this message]
2022-03-01 16:20               ` Ido Schimmel
2022-03-02 11:17                 ` Vladimir Oltean
2021-02-24 11:43 ` [RFC PATCH v2 net-next 06/17] net: dsa: add addresses obtained from RX filtering to host addresses Vladimir Oltean
2021-02-26 10:59   ` Tobias Waldekranz
2021-02-26 13:28     ` Vladimir Oltean
2021-02-26 22:44       ` Tobias Waldekranz
2021-02-24 11:43 ` [RFC PATCH v2 net-next 07/17] net: bridge: switchdev: refactor br_switchdev_fdb_notify Vladimir Oltean
2021-02-24 11:43 ` [RFC PATCH v2 net-next 08/17] net: bridge: switchdev: include local flag in FDB notifications Vladimir Oltean
2021-02-24 11:43 ` [RFC PATCH v2 net-next 09/17] net: bridge: switchdev: send FDB notifications for host addresses Vladimir Oltean
2021-02-24 11:43 ` [RFC PATCH v2 net-next 10/17] net: dsa: include bridge addresses which are local in the host fdb list Vladimir Oltean
2021-02-24 11:43 ` [RFC PATCH v2 net-next 11/17] net: dsa: include fdb entries pointing to bridge " Vladimir Oltean
2021-02-24 11:43 ` [RFC PATCH v2 net-next 12/17] net: dsa: sync static FDB entries on foreign interfaces to hardware Vladimir Oltean
2021-02-24 11:43 ` [RFC PATCH v2 net-next 13/17] net: dsa: mv88e6xxx: Request assisted learning on CPU port Vladimir Oltean
2021-02-24 11:43 ` [RFC PATCH v2 net-next 14/17] net: dsa: replay port and host-joined mdb entries when joining the bridge Vladimir Oltean
2021-02-24 11:43 ` [RFC PATCH v2 net-next 15/17] net: dsa: replay port and local fdb " Vladimir Oltean
2021-02-26 12:23   ` Tobias Waldekranz
2021-02-26 18:08     ` Vladimir Oltean
2021-02-24 11:43 ` [RFC PATCH v2 net-next 16/17] net: bridge: switchdev: let drivers inform which bridge ports are offloaded Vladimir Oltean
2021-02-24 14:25   ` kernel test robot
2021-02-24 11:43 ` [RFC PATCH v2 net-next 17/17] net: bridge: offloaded ports are always promiscuous Vladimir Oltean
2021-02-24 15:21   ` kernel test robot

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=20220224135241.ne6c64segpt6azed@skbuf \
    --to=olteanv@gmail.com \
    --cc=andrew@lunn.ch \
    --cc=dqfext@gmail.com \
    --cc=f.fainelli@gmail.com \
    --cc=george.mccollister@gmail.com \
    --cc=idosch@idosch.org \
    --cc=jiri@resnulli.us \
    --cc=netdev@vger.kernel.org \
    --cc=nikolay@nvidia.com \
    --cc=roopa@nvidia.com \
    --cc=tobias@waldekranz.com \
    --cc=vivien.didelot@gmail.com \
    --cc=vyasevich@gmail.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.