netdev.vger.kernel.org archive mirror
From: Larysa Zaremba <larysa.zaremba@intel.com>
To: Stanislav Fomichev <sdf@google.com>
Cc: <bpf@vger.kernel.org>, <ast@kernel.org>, <daniel@iogearbox.net>,
	<andrii@kernel.org>, <martin.lau@linux.dev>, <song@kernel.org>,
	<yhs@fb.com>, <john.fastabend@gmail.com>, <kpsingh@kernel.org>,
	<haoluo@google.com>, <jolsa@kernel.org>,
	David Ahern <dsahern@gmail.com>, Jakub Kicinski <kuba@kernel.org>,
	Willem de Bruijn <willemb@google.com>,
	"Jesper Dangaard Brouer" <brouer@redhat.com>,
	Anatoly Burakov <anatoly.burakov@intel.com>,
	Alexander Lobakin <alexandr.lobakin@intel.com>,
	Magnus Karlsson <magnus.karlsson@gmail.com>,
	Maryam Tahhan <mtahhan@redhat.com>, <xdp-hints@xdp-project.net>,
	<netdev@vger.kernel.org>
Subject: Re: [PATCH bpf-next v2 06/20] ice: Support HW timestamp hint
Date: Mon, 10 Jul 2023 17:49:27 +0200
Message-ID: <ZKwohzanCVIFwrxN@lincoln>
In-Reply-To: <CAKH8qBvrSJF0HppJ9OVF5wRDP-qV6uVfkWBvPR9=-SpRoyvDJQ@mail.gmail.com>

On Thu, Jul 06, 2023 at 09:39:29AM -0700, Stanislav Fomichev wrote:
> On Thu, Jul 6, 2023 at 7:27 AM Larysa Zaremba <larysa.zaremba@intel.com> wrote:
> >
> > On Wed, Jul 05, 2023 at 10:30:56AM -0700, Stanislav Fomichev wrote:
> > > On 07/03, Larysa Zaremba wrote:
> > > > Use previously refactored code and create a function
> > > > that allows XDP code to read HW timestamp.
> > > >
> > > > Also, move cached_phctime into packet context, this way this data still
> > > > stays in the ring structure, just at the different address.
> > > >
> > > > HW timestamp is the first supported hint in the driver,
> > > > so also add xdp_metadata_ops.
> > > >
> > > > Signed-off-by: Larysa Zaremba <larysa.zaremba@intel.com>
> > > > ---
> > > >  drivers/net/ethernet/intel/ice/ice.h          |  2 ++
> > > >  drivers/net/ethernet/intel/ice/ice_ethtool.c  |  2 +-
> > > >  drivers/net/ethernet/intel/ice/ice_lib.c      |  2 +-
> > > >  drivers/net/ethernet/intel/ice/ice_main.c     |  1 +
> > > >  drivers/net/ethernet/intel/ice/ice_ptp.c      |  2 +-
> > > >  drivers/net/ethernet/intel/ice/ice_txrx.h     |  2 +-
> > > >  drivers/net/ethernet/intel/ice/ice_txrx_lib.c | 24 +++++++++++++++++++
> > > >  7 files changed, 31 insertions(+), 4 deletions(-)
> > > >
> > > > diff --git a/drivers/net/ethernet/intel/ice/ice.h b/drivers/net/ethernet/intel/ice/ice.h
> > > > index 4ba3d99439a0..7a973a2229f1 100644
> > > > --- a/drivers/net/ethernet/intel/ice/ice.h
> > > > +++ b/drivers/net/ethernet/intel/ice/ice.h
> > > > @@ -943,4 +943,6 @@ static inline void ice_clear_rdma_cap(struct ice_pf *pf)
> > > >     set_bit(ICE_FLAG_UNPLUG_AUX_DEV, pf->flags);
> > > >     clear_bit(ICE_FLAG_RDMA_ENA, pf->flags);
> > > >  }
> > > > +
> > > > +extern const struct xdp_metadata_ops ice_xdp_md_ops;
> > > >  #endif /* _ICE_H_ */
> > > > diff --git a/drivers/net/ethernet/intel/ice/ice_ethtool.c b/drivers/net/ethernet/intel/ice/ice_ethtool.c
> > > > index 8d5cbbd0b3d5..3c3b9cbfbcd3 100644
> > > > --- a/drivers/net/ethernet/intel/ice/ice_ethtool.c
> > > > +++ b/drivers/net/ethernet/intel/ice/ice_ethtool.c
> > > > @@ -2837,7 +2837,7 @@ ice_set_ringparam(struct net_device *netdev, struct ethtool_ringparam *ring,
> > > >             /* clone ring and setup updated count */
> > > >             rx_rings[i] = *vsi->rx_rings[i];
> > > >             rx_rings[i].count = new_rx_cnt;
> > > > -           rx_rings[i].cached_phctime = pf->ptp.cached_phc_time;
> > > > +           rx_rings[i].pkt_ctx.cached_phctime = pf->ptp.cached_phc_time;
> > > >             rx_rings[i].desc = NULL;
> > > >             rx_rings[i].rx_buf = NULL;
> > > >             /* this is to allow wr32 to have something to write to
> > > > diff --git a/drivers/net/ethernet/intel/ice/ice_lib.c b/drivers/net/ethernet/intel/ice/ice_lib.c
> > > > index 00e3afd507a4..eb69b0ac7956 100644
> > > > --- a/drivers/net/ethernet/intel/ice/ice_lib.c
> > > > +++ b/drivers/net/ethernet/intel/ice/ice_lib.c
> > > > @@ -1445,7 +1445,7 @@ static int ice_vsi_alloc_rings(struct ice_vsi *vsi)
> > > >             ring->netdev = vsi->netdev;
> > > >             ring->dev = dev;
> > > >             ring->count = vsi->num_rx_desc;
> > > > -           ring->cached_phctime = pf->ptp.cached_phc_time;
> > > > +           ring->pkt_ctx.cached_phctime = pf->ptp.cached_phc_time;
> > > >             WRITE_ONCE(vsi->rx_rings[i], ring);
> > > >     }
> > > >
> > > > diff --git a/drivers/net/ethernet/intel/ice/ice_main.c b/drivers/net/ethernet/intel/ice/ice_main.c
> > > > index 93979ab18bc1..f21996b812ea 100644
> > > > --- a/drivers/net/ethernet/intel/ice/ice_main.c
> > > > +++ b/drivers/net/ethernet/intel/ice/ice_main.c
> > > > @@ -3384,6 +3384,7 @@ static void ice_set_ops(struct ice_vsi *vsi)
> > > >
> > > >     netdev->netdev_ops = &ice_netdev_ops;
> > > >     netdev->udp_tunnel_nic_info = &pf->hw.udp_tunnel_nic;
> > > > +   netdev->xdp_metadata_ops = &ice_xdp_md_ops;
> > > >     ice_set_ethtool_ops(netdev);
> > > >
> > > >     if (vsi->type != ICE_VSI_PF)
> > > > diff --git a/drivers/net/ethernet/intel/ice/ice_ptp.c b/drivers/net/ethernet/intel/ice/ice_ptp.c
> > > > index a31333972c68..70697e4829dd 100644
> > > > --- a/drivers/net/ethernet/intel/ice/ice_ptp.c
> > > > +++ b/drivers/net/ethernet/intel/ice/ice_ptp.c
> > > > @@ -1038,7 +1038,7 @@ static int ice_ptp_update_cached_phctime(struct ice_pf *pf)
> > > >             ice_for_each_rxq(vsi, j) {
> > > >                     if (!vsi->rx_rings[j])
> > > >                             continue;
> > > > -                   WRITE_ONCE(vsi->rx_rings[j]->cached_phctime, systime);
> > > > +                   WRITE_ONCE(vsi->rx_rings[j]->pkt_ctx.cached_phctime, systime);
> > > >             }
> > > >     }
> > > >     clear_bit(ICE_CFG_BUSY, pf->state);
> > > > diff --git a/drivers/net/ethernet/intel/ice/ice_txrx.h b/drivers/net/ethernet/intel/ice/ice_txrx.h
> > > > index d0ab2c4c0c91..4237702a58a9 100644
> > > > --- a/drivers/net/ethernet/intel/ice/ice_txrx.h
> > > > +++ b/drivers/net/ethernet/intel/ice/ice_txrx.h
> > > > @@ -259,6 +259,7 @@ enum ice_rx_dtype {
> > > >
> > > >  struct ice_pkt_ctx {
> > > >     const union ice_32b_rx_flex_desc *eop_desc;
> > > > +   u64 cached_phctime;
> > > >  };
> > > >
> > > >  struct ice_xdp_buff {
> > > > @@ -354,7 +355,6 @@ struct ice_rx_ring {
> > > >     struct ice_tx_ring *xdp_ring;
> > > >     struct xsk_buff_pool *xsk_pool;
> > > >     dma_addr_t dma;                 /* physical address of ring */
> > > > -   u64 cached_phctime;
> > > >     u16 rx_buf_len;
> > > >     u8 dcb_tc;                      /* Traffic class of ring */
> > > >     u8 ptp_rx;
> > > > diff --git a/drivers/net/ethernet/intel/ice/ice_txrx_lib.c b/drivers/net/ethernet/intel/ice/ice_txrx_lib.c
> > > > index beb1c5bb392a..463d9e5cbe05 100644
> > > > --- a/drivers/net/ethernet/intel/ice/ice_txrx_lib.c
> > > > +++ b/drivers/net/ethernet/intel/ice/ice_txrx_lib.c
> > > > @@ -546,3 +546,27 @@ void ice_finalize_xdp_rx(struct ice_tx_ring *xdp_ring, unsigned int xdp_res,
> > > >                     spin_unlock(&xdp_ring->tx_lock);
> > > >     }
> > > >  }
> > > > +
> > > > +/**
> > > > + * ice_xdp_rx_hw_ts - HW timestamp XDP hint handler
> > > > + * @ctx: XDP buff pointer
> > > > + * @ts_ns: destination address
> > > > + *
> > > > + * Copy HW timestamp (if available) to the destination address.
> > > > + */
> > > > +static int ice_xdp_rx_hw_ts(const struct xdp_md *ctx, u64 *ts_ns)
> > > > +{
> > > > +   const struct ice_xdp_buff *xdp_ext = (void *)ctx;
> > > > +   u64 cached_time;
> > > > +
> > > > +   cached_time = READ_ONCE(xdp_ext->pkt_ctx.cached_phctime);
> > >
> > > I believe we have to have something like the following here:
> > >
> > > if (!ts_ns)
> > >       return -EINVAL;
> > >
> > > IOW, I don't think verifier guarantees that those pointer args are
> > > non-NULL.
> >
> > Oh, that's a shame.
> >
> > > Same for the other ice kfunc you're adding and veth changes.
> > >
> > > Can you also fix it for the existing veth kfuncs? (or lmk if you prefer me
> > > to fix it).
> >
> > I think I can send fixes for RX hash and timestamp in veth separately, before
> > v3 of this patchset, code probably doesn't intersect.
> >
> > But argument checks in kfuncs are a little bit a gray area for me, whether they
> > should be sent to stable tree or not?
> 
> Add a Fixes tag and they will get into the stable trees automatically I believe?

What about declaring XDP hints kfuncs with

BTF_ID_FLAGS(func, name, KF_TRUSTED_ARGS)

instead of BTF_ID_FLAGS(func, name, 0)
?

I have just tested this, and xdp_metadata passes fine (so both stack and
data_meta destination pointers work). If I replace &timestamp with NULL, the
verifier rejects the program with a descriptive message, "Possibly NULL pointer
passed to trusted arg1", so it serves our purpose. I do not see many ways this
could limit users, but it definitely benefits driver developers.
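
To make the idea concrete, here is a toy userspace sketch of what the flag
buys us: the NULL check moves out of the handler and into one place that is
driven by the registration flag. This is only a model under loud assumptions.
All toy_* names and TOY_KF_TRUSTED_ARGS are hypothetical, and the real verifier
rejects the whole program at load time rather than failing a call at run time.

```c
#include <errno.h>
#include <stddef.h>
#include <stdint.h>

/* Toy flag; a hypothetical stand-in for the kernel's KF_TRUSTED_ARGS. */
#define TOY_KF_TRUSTED_ARGS (1u << 0)

/* Hypothetical descriptor pairing a handler with its registration flags. */
struct toy_kfunc {
	int (*fn)(uint64_t *ts_ns);
	unsigned int flags;
};

/* Handler without any NULL check of its own. */
static int toy_rx_timestamp(uint64_t *ts_ns)
{
	*ts_ns = 1234;
	return 0;
}

/*
 * Central gate: with the trusted-args flag set, a NULL argument is
 * rejected before the handler runs, so the handler never sees it.
 */
static int toy_call(const struct toy_kfunc *kf, uint64_t *ts_ns)
{
	if ((kf->flags & TOY_KF_TRUSTED_ARGS) && !ts_ns)
		return -EACCES;

	return kf->fn(ts_ns);
}
```

With the flag set, toy_call() refuses a NULL destination and the handler stays
free of argument checks; without the flag, the handler would dereference NULL.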

The only concern I see is that if we ever decide to allow NULL arguments for 
kfuncs, we'd need to add support for a "_or_null" suffix [0]. But it doesn't 
sound too hard?

I have dug into this, because adding

if (unlikely(!hash || !rss_type))
	return -EINVAL;

or something similar to every .xmo_ handler in existence starts to look ugly.
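
For comparison, a minimal userspace sketch of the per-handler approach, showing
how the same guard has to be repeated in each callback. The mock_* functions and
struct mock_pkt_ctx are hypothetical stand-ins, not the real ice or veth code.

```c
#include <errno.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical stand-in for per-packet driver state; not the real ice structs. */
struct mock_pkt_ctx {
	uint64_t hw_ts;
	uint32_t hash;
	uint32_t rss_type;
};

/* Timestamp handler: must guard its destination pointer by hand. */
static int mock_rx_hw_ts(const struct mock_pkt_ctx *ctx, uint64_t *ts_ns)
{
	if (!ts_ns)
		return -EINVAL;

	*ts_ns = ctx->hw_ts;
	return 0;
}

/* Hash handler: the same guard again, now for two pointers. */
static int mock_rx_hash(const struct mock_pkt_ctx *ctx, uint32_t *hash,
			uint32_t *rss_type)
{
	if (!hash || !rss_type)
		return -EINVAL;

	*hash = ctx->hash;
	*rss_type = ctx->rss_type;
	return 0;
}
```

Every new hint would need another copy of this guard in every driver that
implements it.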

[0] 
https://lore.kernel.org/lkml/20230120054441.arj5h6yrnh5jsrgr@MacBook-Pro-6.local.dhcp.thefacebook.com/
