Re: Netlink NLM_F_DUMP_INTR flag lost

* Re: Netlink NLM_F_DUMP_INTR flag lost
       [not found] <20220615171113.7d93af3e@pirotess>
@ 2022-06-15 16:00 ` Jakub Kicinski
  2022-06-16 15:10   ` Ismael Luceno
  0 siblings, 1 reply; 19+ messages in thread
From: Jakub Kicinski @ 2022-06-15 16:00 UTC (permalink / raw)
  To: Ismael Luceno; +Cc: David S. Miller, Paolo Abeni, David Ahern, netdev

CC: netdev ML

On Wed, 15 Jun 2022 17:11:13 +0200 Ismael Luceno wrote:
> It seems a RTM_GETADDR request with AF_UNSPEC has a corner case where
> the NLM_F_DUMP_INTR flag is lost.
> 
> After a change in an address table, if a packet has been fully filled
> just previous, and if the end of the table is found at the same time,
> then the next packet should be flagged, which works fine when it's
> NLMSG_DONE, but gets clobbered when another table is to be dumped next.

Could you describe how it gets clobbered? You mean that prev_seq gets
updated somewhere without setting the flag or something overwrites
nlmsg_flags? Or we set _INTR on an empty skb which never ends up
getting sent? Or..

> A customer noticed the issue using kubernetes, when a large
> number of short-lived containers would push the system constantly
> towards this corner case.
> 
> I'm entertaining the following options:
> 
> 1) introduce a new packet type just to convey flags in cases like this.
> 2) preserve the flag and apply it to the NLMSG_DONE packet.
> 3) flag the first packet of the following table.
> 
> I don't like option 2 and 3 because we can't tell which table is
> affected, which I'm guessing programs might be relying on.
> 
> Option 1 adds a little bit of overhead, but enables us to tell which
> table is affected, and can be ignored by existing software that doesn't
> understand it, so IMHO it's the least disruptive option.
> 
> I want to have a little discussion before introducing a patch, since
> option 1 might have other implications I'm not aware of...

^ permalink raw reply	[flat|nested] 19+ messages in thread