bpf.vger.kernel.org archive mirror
From: Yonghong Song <yhs@fb.com>
To: Brian Vazquez <brianvv@google.com>,
	Brian Vazquez <brianvv.kernel@gmail.com>,
	Alexei Starovoitov <ast@kernel.org>,
	"Daniel Borkmann" <daniel@iogearbox.net>,
	"David S . Miller" <davem@davemloft.net>
Cc: Andrii Nakryiko <andrii.nakryiko@gmail.com>,
	Stanislav Fomichev <sdf@google.com>,
	Petar Penkov <ppenkov@google.com>,
	Willem de Bruijn <willemb@google.com>,
	"linux-kernel@vger.kernel.org" <linux-kernel@vger.kernel.org>,
	"netdev@vger.kernel.org" <netdev@vger.kernel.org>,
	"bpf@vger.kernel.org" <bpf@vger.kernel.org>
Subject: Re: [PATCH v5 bpf-next 0/9] add bpf batch ops to process more than 1 elem
Date: Wed, 15 Jan 2020 20:13:07 +0000
Message-ID: <0ed19302-a43c-c04e-110e-eb1f0a72146f@fb.com>
In-Reply-To: <20200115184308.162644-1-brianvv@google.com>



On 1/15/20 10:42 AM, Brian Vazquez wrote:
> This patch series introduces batch ops that can be added to bpf maps to
> lookup/lookup_and_delete/update/delete more than 1 element at a time.
> This is especially useful when syscall overhead is a problem, and in the
> case of hmap it will provide a reliable way of traversing them.
> 
> The implementation includes a generic approach that could potentially be
> used by any bpf map and adds it to arraymap. It also includes a specific
> implementation for hashmaps, which are traversed using buckets instead
> of keys.
> 
> The bpf syscall subcommands introduced are:
> 
>    BPF_MAP_LOOKUP_BATCH
>    BPF_MAP_LOOKUP_AND_DELETE_BATCH
>    BPF_MAP_UPDATE_BATCH
>    BPF_MAP_DELETE_BATCH
> 
> The UAPI attribute is:
> 
>    struct { /* struct used by BPF_MAP_*_BATCH commands */
>           __aligned_u64   in_batch;       /* start batch,
>                                            * NULL to start from beginning
>                                            */
>           __aligned_u64   out_batch;      /* output: next start batch */
>           __aligned_u64   keys;
>           __aligned_u64   values;
>           __u32           count;          /* input/output:
>                                            * input: # of key/value
>                                            * elements
>                                            * output: # of filled elements
>                                            */
>           __u32           map_fd;
>           __u64           elem_flags;
>           __u64           flags;
>    } batch;
> 
> 
> in_batch and out_batch are only used for lookup and lookup_and_delete since
> those are the only two operations that attempt to traverse the map.
> 
> update/delete batch ops should provide the keys/values that the user
> wants to modify.
> 
> Here are the previous discussions on the batch processing:
>   - https://lore.kernel.org/bpf/20190724165803.87470-1-brianvv@google.com/
>   - https://lore.kernel.org/bpf/20190829064502.2750303-1-yhs@fb.com/
>   - https://lore.kernel.org/bpf/20190906225434.3635421-1-yhs@fb.com/
> 
> Changelog since v4:
>   - Remove unnecessary checks from libbpf API (Andrii Nakryiko)
>   - Move DECLARE_LIBBPF_OPTS with all var declarations (Andrii Nakryiko)
>   - Change bucket internal buffer size to 5 entries (Yonghong Song)
>   - Fix some minor bugs in hashtab batch ops implementation (Yonghong Song)
> 
> Changelog since v3:
>   - Do not use copy_to_user inside atomic region (Yonghong Song)
>   - Use _opts approach on libbpf APIs (Andrii Nakryiko)
>   - Drop generic_map_lookup_and_delete_batch support
>   - Free malloc-ed memory in tests (Yonghong Song)
>   - Reverse christmas tree (Yonghong Song)
>   - Add acked labels
> 
> Changelog since v2:
>   - Add generic batch support for lpm_trie and test it (Yonghong Song)
>   - Use define MAP_LOOKUP_RETRIES for retries (John Fastabend)
>   - Return errors directly and remove labels (Yonghong Song)
>   - Insert new API functions into libbpf alphabetically (Yonghong Song)
>   - Change hlist_nulls_for_each_entry_rcu to
>     hlist_nulls_for_each_entry_safe in htab batch ops (Yonghong Song)
> 
> Changelog since v1:
>   - Fix SOB ordering and remove Co-authored-by tag (Alexei Starovoitov)
> 
> Changelog since RFC:
>   - Change batch to in_batch and out_batch to support more flexible opaque
>     values to iterate the bpf maps.
>   - Remove update/delete specific batch ops for htab and use the generic
>     implementations instead.
> 
> Brian Vazquez (5):
>    bpf: add bpf_map_{value_size,update_value,map_copy_value} functions
>    bpf: add generic support for lookup batch op
>    bpf: add generic support for update and delete batch ops
>    bpf: add lookup and update batch ops to arraymap
>    selftests/bpf: add batch ops testing to array bpf map
> 
> Yonghong Song (4):
>    bpf: add batch ops to all htab bpf map
>    tools/bpf: sync uapi header bpf.h
>    libbpf: add libbpf support to batch ops
>    selftests/bpf: add batch ops testing for htab and htab_percpu map
> 
>   include/linux/bpf.h                           |  18 +
>   include/uapi/linux/bpf.h                      |  21 +
>   kernel/bpf/arraymap.c                         |   2 +
>   kernel/bpf/hashtab.c                          | 264 +++++++++
>   kernel/bpf/syscall.c                          | 554 ++++++++++++++----
>   tools/include/uapi/linux/bpf.h                |  21 +
>   tools/lib/bpf/bpf.c                           |  58 ++
>   tools/lib/bpf/bpf.h                           |  22 +
>   tools/lib/bpf/libbpf.map                      |   4 +
>   .../bpf/map_tests/array_map_batch_ops.c       | 129 ++++
>   .../bpf/map_tests/htab_map_batch_ops.c        | 283 +++++++++
>   11 files changed, 1248 insertions(+), 128 deletions(-)
>   create mode 100644 tools/testing/selftests/bpf/map_tests/array_map_batch_ops.c
>   create mode 100644 tools/testing/selftests/bpf/map_tests/htab_map_batch_ops.c

Thanks for the work! LGTM. Ack for the whole series.

Acked-by: Yonghong Song <yhs@fb.com>
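
The in_batch/out_batch contract quoted above (NULL in_batch starts from
the beginning, out_batch is the next start, count is capacity on input
and the number of filled elements on output) can be illustrated with a
small userspace model. This is only a sketch: toy_lookup_batch,
toy_drain, TOY_MAX and toy_values are hypothetical names, the "map" is a
plain array, and returning -ENOENT once the end is reached is an assumed
convention for this model, not a statement of what the kernel code in
this series does.

```c
#include <assert.h>
#include <errno.h>
#include <stddef.h>
#include <stdint.h>

/* Toy stand-in for an array map: keys are the indices 0..TOY_MAX-1. */
#define TOY_MAX 10
static uint64_t toy_values[TOY_MAX];

/*
 * Hypothetical model of a lookup-batch call (not the kernel code):
 *   in_batch  == NULL starts from the beginning
 *   out_batch receives the cursor to pass as the next in_batch
 *   count     in: capacity of keys/values, out: elements filled
 * Returns 0 if more elements may remain, -ENOENT once the end of the
 * map has been reached (elements may still have been filled).
 */
static int toy_lookup_batch(const uint32_t *in_batch, uint32_t *out_batch,
                            uint32_t *keys, uint64_t *values, uint32_t *count)
{
	uint32_t start = in_batch ? *in_batch : 0;
	uint32_t cap = *count, n = 0;

	*count = 0;
	if (start >= TOY_MAX)
		return -ENOENT;

	while (n < cap && start + n < TOY_MAX) {
		keys[n] = start + n;
		values[n] = toy_values[start + n];
		n++;
	}

	*count = n;
	*out_batch = start + n;
	return (start + n == TOY_MAX) ? -ENOENT : 0;
}

/*
 * Walk the whole toy map in chunks of `chunk` elements and return the
 * sum of all values; *seen receives the number of elements visited.
 */
static uint64_t toy_drain(uint32_t chunk, uint32_t *seen)
{
	uint32_t keys[TOY_MAX];
	uint64_t values[TOY_MAX], sum = 0;
	uint32_t cursor = 0, *in = NULL; /* NULL: start from the beginning */
	int err;

	assert(chunk > 0 && chunk <= TOY_MAX);
	*seen = 0;
	do {
		uint32_t count = chunk;

		err = toy_lookup_batch(in, &cursor, keys, values, &count);
		for (uint32_t i = 0; i < count; i++)
			sum += values[i];
		*seen += count;
		in = &cursor; /* feed out_batch back in as the next in_batch */
	} while (err == 0);

	return sum;
}
```

The caller never interprets the cursor; it only hands the previous
out_batch back as in_batch, which is why the cover letter can treat the
value as opaque and let hashmaps use bucket positions instead of keys.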



Thread overview: 13+ messages
2020-01-15 18:42 [PATCH v5 bpf-next 0/9] add bpf batch ops to process more than 1 elem Brian Vazquez
2020-01-15 18:43 ` [PATCH v5 bpf-next 1/9] bpf: add bpf_map_{value_size,update_value,map_copy_value} functions Brian Vazquez
2020-01-15 18:43 ` [PATCH v5 bpf-next 2/9] bpf: add generic support for lookup batch op Brian Vazquez
2020-01-15 18:43 ` [PATCH v5 bpf-next 3/9] bpf: add generic support for update and delete batch ops Brian Vazquez
2020-01-15 18:43 ` [PATCH v5 bpf-next 4/9] bpf: add lookup and update batch ops to arraymap Brian Vazquez
2020-01-15 18:43 ` [PATCH v5 bpf-next 5/9] bpf: add batch ops to all htab bpf map Brian Vazquez
2020-01-15 18:43 ` [PATCH v5 bpf-next 6/9] tools/bpf: sync uapi header bpf.h Brian Vazquez
2020-01-15 18:43 ` [PATCH v5 bpf-next 7/9] libbpf: add libbpf support to batch ops Brian Vazquez
2020-01-15 20:20   ` Andrii Nakryiko
2020-01-15 18:43 ` [PATCH v5 bpf-next 8/9] selftests/bpf: add batch ops testing for htab and htab_percpu map Brian Vazquez
2020-01-15 18:43 ` [PATCH v5 bpf-next 9/9] selftests/bpf: add batch ops testing to array bpf map Brian Vazquez
2020-01-15 20:13 ` Yonghong Song [this message]
2020-01-15 22:09   ` [PATCH v5 bpf-next 0/9] add bpf batch ops to process more than 1 elem Alexei Starovoitov
