From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on aws-us-west-2-korg-lkml-1.web.codeaurora.org X-Spam-Level: X-Spam-Status: No, score=-10.5 required=3.0 tests=BAYES_00, DKIM_ADSP_CUSTOM_MED,DKIM_INVALID,DKIM_SIGNED,FREEMAIL_FORGED_FROMDOMAIN, FREEMAIL_FROM,HEADER_FROM_DIFFERENT_DOMAINS,INCLUDES_CR_TRAILER, INCLUDES_PATCH,MAILING_LIST_MULTI,SPF_HELO_NONE,SPF_PASS autolearn=ham autolearn_force=no version=3.4.0 Received: from mail.kernel.org (mail.kernel.org [198.145.29.99]) by smtp.lore.kernel.org (Postfix) with ESMTP id 5C1B0C4338F for ; Thu, 19 Aug 2021 05:27:28 +0000 (UTC) Received: from mails.dpdk.org (mails.dpdk.org [217.70.189.124]) by mail.kernel.org (Postfix) with ESMTP id BF5E96044F for ; Thu, 19 Aug 2021 05:27:27 +0000 (UTC) DMARC-Filter: OpenDMARC Filter v1.4.1 mail.kernel.org BF5E96044F Authentication-Results: mail.kernel.org; dmarc=fail (p=none dis=none) header.from=gmail.com Authentication-Results: mail.kernel.org; spf=pass smtp.mailfrom=dpdk.org Received: from [217.70.189.124] (localhost [127.0.0.1]) by mails.dpdk.org (Postfix) with ESMTP id 9AE254067E; Thu, 19 Aug 2021 07:27:26 +0200 (CEST) Received: from mail-il1-f173.google.com (mail-il1-f173.google.com [209.85.166.173]) by mails.dpdk.org (Postfix) with ESMTP id 5602840141 for ; Thu, 19 Aug 2021 07:27:25 +0200 (CEST) Received: by mail-il1-f173.google.com with SMTP id x5so4754648ill.3 for ; Wed, 18 Aug 2021 22:27:25 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20161025; h=mime-version:references:in-reply-to:from:date:message-id:subject:to :cc; bh=iumh3X4LlvpyvMlklVgeidCLZ9jzu65s06Gx6u5LNgE=; b=IGqrL6wXmc7DnmVHJfk+9xPBhaLMhIUapzrmyVuwC6iSc7brtg0g4gonjQYqB4EkNY qc4TdQIqH7KeKp5QmyUM+8q23WXsVfDtotClAHb1+2So6zMpg5CQM1OTolYvpQg3s+hT a9jP1byDQIXr+8mjj1b1+BRuD0qE6ul8a8QJF29ywCaSHbt9sThk6QCWpy+3CLEXn8Yr 9fSxz7WHFuiguiupGXVFri/1FVUUrKAM1RdZqj+PyN8rw2+qSixl5gG6S0rLRR3cFq4I 9HjCtc4833ylvaQBIKhAOprJy8xIFpbihuEZY5KeLC74v38U9LtvdoknuqMt5WjsGYjK ri+g== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:mime-version:references:in-reply-to:from:date :message-id:subject:to:cc; bh=iumh3X4LlvpyvMlklVgeidCLZ9jzu65s06Gx6u5LNgE=; b=FH34DmLBnu+Sv9EUYcEE6GrhRWhdDWZzftc+TjS8QwnI3R65uf/jqOs4wZi6utVFGl cclZxTpUT1phMx2cgHHGvDoS/as+KMVokpEo6lAuA+dMR3W77j+48U1vPeRYkn97Ymu4 T1rWYGHNNSZ9MHG/yphcB0s6dmqt2NIVGk0wfwfBlLaQpPTFipz3mz6hfQZawWXGBPdd u4jvAz70/w6B4yA7k/aUXO+DqUIMVetJ/g7VA6WDWLR5wWvvLOleOQ1uHLr2eELfIOp2 ihl7xCPeUxiWAxi/OSJEwEOqQ1a8rsS4ApJruFaVYgsClyKC4eTGOJha/Cb/UpPNWPTj iopw== X-Gm-Message-State: AOAM530LrdNr0ITVEdAKThfO1mqkHqINcE4nZjqfJhQPehblue3dKQII pgdeXFPLzI8GqTZT//BjgABopWXx0x4XU2N4AIM= X-Google-Smtp-Source: ABdhPJxf88Q/tuR5TtJnMwFVcnONt0cbSbFJ3i8zRm5t0j0rdpqCmQhFvKExDaZdi42d4TQYidyxsU5pYLtdLWwMVMQ= X-Received: by 2002:a92:d70f:: with SMTP id m15mr8524506iln.162.1629350844583; Wed, 18 Aug 2021 22:27:24 -0700 (PDT) MIME-Version: 1.0 References: <20210727034204.20649-1-xuemingl@nvidia.com> <20210811140418.393264-1-xuemingl@nvidia.com> In-Reply-To: From: Jerin Jacob Date: Thu, 19 Aug 2021 10:56:58 +0530 Message-ID: To: "Xueming(Steven) Li" Cc: dpdk-dev , Ferruh Yigit , NBU-Contact-Thomas Monjalon , Andrew Rybchenko Content-Type: text/plain; charset="UTF-8" Subject: Re: [dpdk-dev] [PATCH v2 01/15] ethdev: introduce shared Rx queue X-BeenThere: dev@dpdk.org X-Mailman-Version: 2.1.29 Precedence: list List-Id: DPDK patches and discussions List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: dev-bounces@dpdk.org Sender: "dev" On Wed, Aug 18, 2021 at 4:44 PM Xueming(Steven) Li wrote: > > > > > -----Original Message----- > > From: Jerin Jacob > > Sent: Tuesday, August 17, 2021 11:12 PM > > To: Xueming(Steven) Li > > Cc: dpdk-dev ; Ferruh Yigit ; NBU-Contact-Thomas Monjalon ; > > Andrew Rybchenko > > Subject: Re: [PATCH v2 01/15] ethdev: introduce shared Rx queue > > > > On Tue, Aug 17, 2021 at 5:01 PM Xueming(Steven) Li wrote: > > > > > > > > > > > > > -----Original Message----- > > > > From: Jerin Jacob > > > > Sent: Tuesday, August 17, 2021 5:33 PM > > > > To: Xueming(Steven) Li > > > > Cc: dpdk-dev ; Ferruh Yigit ; > > > > NBU-Contact-Thomas Monjalon ; Andrew Rybchenko > > > > > > > > Subject: Re: [PATCH v2 01/15] ethdev: introduce shared Rx queue > > > > > > > > On Wed, Aug 11, 2021 at 7:34 PM Xueming Li wrote: > > > > > > > > > > In current DPDK framework, each RX queue is pre-loaded with mbufs > > > > > for incoming packets. When number of representors scale out in a > > > > > switch domain, the memory consumption became significant. Most > > > > > important, polling all ports leads to high cache miss, high > > > > > latency and low throughput. > > > > > > > > > > This patch introduces shared RX queue. Ports with same > > > > > configuration in a switch domain could share RX queue set by specifying sharing group. > > > > > Polling any queue using same shared RX queue receives packets from > > > > > all member ports. Source port is identified by mbuf->port. > > > > > > > > > > Port queue number in a shared group should be identical. Queue > > > > > index is > > > > > 1:1 mapped in shared group. > > > > > > > > > > Share RX queue must be polled on single thread or core. > > > > > > > > > > Multiple groups is supported by group ID. > > > > > > > > > > Signed-off-by: Xueming Li > > > > > Cc: Jerin Jacob > > > > > --- > > > > > Rx queue object could be used as shared Rx queue object, it's > > > > > important to clear all queue control callback api that using queue object: > > > > > https://mails.dpdk.org/archives/dev/2021-July/215574.html > > > > > > > > > #undef RTE_RX_OFFLOAD_BIT2STR > > > > > diff --git a/lib/ethdev/rte_ethdev.h b/lib/ethdev/rte_ethdev.h > > > > > index d2b27c351f..a578c9db9d 100644 > > > > > --- a/lib/ethdev/rte_ethdev.h > > > > > +++ b/lib/ethdev/rte_ethdev.h > > > > > @@ -1047,6 +1047,7 @@ struct rte_eth_rxconf { > > > > > uint8_t rx_drop_en; /**< Drop packets if no descriptors are available. */ > > > > > uint8_t rx_deferred_start; /**< Do not start queue with rte_eth_dev_start(). */ > > > > > uint16_t rx_nseg; /**< Number of descriptions in rx_seg array. > > > > > */ > > > > > + uint32_t shared_group; /**< Shared port group index in > > > > > + switch domain. */ > > > > > > > > Not to able to see anyone setting/creating this group ID test application. > > > > How this group is created? > > > > > > Nice catch, the initial testpmd version only support one default group(0). > > > All ports that supports shared-rxq assigned in same group. > > > > > > We should be able to change "--rxq-shared" to "--rxq-shared-group" to > > > support group other than default. > > > > > > To support more groups simultaneously, need to consider testpmd > > > forwarding stream core assignment, all streams in same group need to stay on same core. > > > It's possible to specify how many ports to increase group number, but > > > user must schedule stream affinity carefully - error prone. > > > > > > On the other hand, one group should be sufficient for most customer, > > > the doubt is whether it valuable to support multiple groups test. > > > > Ack. One group is enough in testpmd. > > > > My question was more about who and how this group is created, Should n't we need API to create shared_group? If we do the > > following, at least, I can think, how it can be implemented in SW or other HW. > > > > - Create aggregation queue group > > - Attach multiple Rx queues to the aggregation queue group > > - Pull the packets from the queue group(which internally fetch from the Rx queues _attached_) > > > > Does the above kind of sequence, break your representor use case? > > Seems more like a set of EAL wrapper. Current API tries to minimize the application efforts to adapt shared-rxq. > - step 1, not sure how important it is to create group with API, in rte_flow, group is created on demand. Which rte_flow pattern/action for this? > - step 2, currently, the attaching is done in rte_eth_rx_queue_setup, specify offload and group in rx_conf struct. > - step 3, define a dedicate api to receive packets from shared rxq? Looks clear to receive packets from shared rxq. > currently, rxq objects in share group is same - the shared rxq, so the eth callback eth_rx_burst_t(rxq_obj, mbufs, n) could > be used to receive packets from any ports in group, normally the first port(PF) in group. > An alternative way is defining a vdev with same queue number and copy rxq objects will make the vdev a proxy of > the shared rxq group - this could be an helper API. > > Anyway the wrapper doesn't break use case, step 3 api is more clear, need to understand how to implement efficiently. Are you doing this feature based on any HW support or it just pure SW thing, If it is SW, It is better to have just new vdev for like drivers/net/bonding/. This we can help aggregate multiple Rxq across the multiple ports of same the driver. > > > > > > > > > > > > > > > > > > > > > /** > > > > > * Per-queue Rx offloads to be set using DEV_RX_OFFLOAD_* flags. > > > > > * Only offloads set on rx_queue_offload_capa or > > > > > rx_offload_capa @@ -1373,6 +1374,12 @@ struct rte_eth_conf { > > > > > #define DEV_RX_OFFLOAD_OUTER_UDP_CKSUM 0x00040000 > > > > > #define DEV_RX_OFFLOAD_RSS_HASH 0x00080000 > > > > > #define RTE_ETH_RX_OFFLOAD_BUFFER_SPLIT 0x00100000 > > > > > +/** > > > > > + * Rx queue is shared among ports in same switch domain to save > > > > > +memory, > > > > > + * avoid polling each port. Any port in group can be used to receive packets. > > > > > + * Real source port number saved in mbuf->port field. > > > > > + */ > > > > > +#define RTE_ETH_RX_OFFLOAD_SHARED_RXQ 0x00200000 > > > > > > > > > > #define DEV_RX_OFFLOAD_CHECKSUM (DEV_RX_OFFLOAD_IPV4_CKSUM | \ > > > > > DEV_RX_OFFLOAD_UDP_CKSUM | \ > > > > > -- > > > > > 2.25.1 > > > > >