From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on aws-us-west-2-korg-lkml-1.web.codeaurora.org X-Spam-Level: X-Spam-Status: No, score=-11.4 required=3.0 tests=DKIMWL_WL_MED,DKIM_SIGNED, DKIM_VALID,DKIM_VALID_AU,HEADER_FROM_DIFFERENT_DOMAINS,MAILING_LIST_MULTI, SPF_HELO_NONE,SPF_PASS,USER_AGENT_GIT,USER_IN_DEF_DKIM_WL autolearn=no autolearn_force=no version=3.4.0 Received: from mail.kernel.org (mail.kernel.org [198.145.29.99]) by smtp.lore.kernel.org (Postfix) with ESMTP id D1968C43603 for ; Wed, 11 Dec 2019 22:34:09 +0000 (UTC) Received: from vger.kernel.org (vger.kernel.org [209.132.180.67]) by mail.kernel.org (Postfix) with ESMTP id A520F214AF for ; Wed, 11 Dec 2019 22:34:09 +0000 (UTC) Authentication-Results: mail.kernel.org; dkim=pass (2048-bit key) header.d=google.com header.i=@google.com header.b="C4dXYq2U" Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1726494AbfLKWeJ (ORCPT ); Wed, 11 Dec 2019 17:34:09 -0500 Received: from mail-pj1-f74.google.com ([209.85.216.74]:51976 "EHLO mail-pj1-f74.google.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1726487AbfLKWeI (ORCPT ); Wed, 11 Dec 2019 17:34:08 -0500 Received: by mail-pj1-f74.google.com with SMTP id fa4so54488pjb.18 for ; Wed, 11 Dec 2019 14:34:08 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=20161025; h=date:message-id:mime-version:subject:from:to:cc; bh=7ijxfaNlMj4qkP2wK7qiyfpUIttfUVcdL6s+H3bAfgM=; b=C4dXYq2UNwyTBGI4U0bauSxXbOpRXVhlUzgmFzgmbB/SZMRQg54x4GGbJIg492KUnE p1SkuqlWe20pcwLxJgZ0gL1pRg9iiwuy/C+J5GiCUgxJlS8VZIH/3e5UMEETC4uZbn6N c0JA36BU0apkX763JG6qu8wEjJnTxGeWixDJE+bPh4DQjD51AK5WtzAN/bRGYy6r+PZ5 63eI83o1skB3e/+Cb/tOJ/RBq4xDz6W9iErwjv9xgzArErCxpO/s5FQD0K1aoUQywtzc f0Cias4Jw/lOhjS1iY/VrsTeLS9NWBMwl+LxfyVqvYpcZ4lrTCpgIAgO6NjO0RUxUpyC LsUg== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:date:message-id:mime-version:subject:from:to:cc; bh=7ijxfaNlMj4qkP2wK7qiyfpUIttfUVcdL6s+H3bAfgM=; b=rcC1K6Z9qr1+Weg2xut/60Ybi7oGMiOlzRQ4GwrKsLEYkHcZKl9hzldy8GTVKcimen bw7Cpw3wYiIKSEOXa6xqLwnIveUHHJQmM2R8vMNRQr7lukwV4PTg26OiStpcgP1dNxFP Ao9UE9XhVJV1v+vGbdEOSFKCVg2WifbNyGlKqfHaCHbo7oVlYx1SJaEifurSmpBRtKNe +IZDwKFzOHXrLykB0bsWC3IDnqDOsrZroBydpr2fqXP+Bqye6ITO3ZzfqLx41qbKfZCY ZBmhkJ1dF5vBgyJ97zjJ+fVEtqOOt6Pc0yl1YJ6kv6zjuu+W2iIwa1274v2v4Y2iFNAv /azw== X-Gm-Message-State: APjAAAXvnmVSVLrmbvkRLPIcn326nmuoMLbvau7Uf5d3w5UJpeNDdl4h KFDDiqz5T4owh0KkT7qLsBuVTvxjIzPA X-Google-Smtp-Source: APXvYqy2U/Fr/JiD6HSj0oj5vwPWZMh+kH9t1DkKuuAONvQ53YQ1ZpoAIDRUQsdQ6FFTDDWQu5BQF5rBSMq6 X-Received: by 2002:a63:1c1f:: with SMTP id c31mr6745147pgc.292.1576103647800; Wed, 11 Dec 2019 14:34:07 -0800 (PST) Date: Wed, 11 Dec 2019 14:33:33 -0800 Message-Id: <20191211223344.165549-1-brianvv@google.com> Mime-Version: 1.0 X-Mailer: git-send-email 2.24.1.735.g03f4e72817-goog Subject: [PATCH v3 bpf-next 00/11] add bpf batch ops to process more than 1 elem From: Brian Vazquez To: Brian Vazquez , Brian Vazquez , Alexei Starovoitov , Daniel Borkmann , "David S . Miller" Cc: Yonghong Song , Stanislav Fomichev , Petar Penkov , Willem de Bruijn , linux-kernel@vger.kernel.org, netdev@vger.kernel.org, bpf@vger.kernel.org Content-Type: text/plain; charset="UTF-8" Sender: bpf-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: bpf@vger.kernel.org This patch series introduce batch ops that can be added to bpf maps to lookup/lookup_and_delete/update/delete more than 1 element at the time, this is specially useful when syscall overhead is a problem and in case of hmap it will provide a reliable way of traversing them. The implementation inclues a generic approach that could potentially be used by any bpf map and adds it to arraymap, it also includes the specific implementation of hashmaps which are traversed using buckets instead of keys. The bpf syscall subcommands introduced are: BPF_MAP_LOOKUP_BATCH BPF_MAP_LOOKUP_AND_DELETE_BATCH BPF_MAP_UPDATE_BATCH BPF_MAP_DELETE_BATCH The UAPI attribute is: struct { /* struct used by BPF_MAP_*_BATCH commands */ __aligned_u64 in_batch; /* start batch, * NULL to start from beginning */ __aligned_u64 out_batch; /* output: next start batch */ __aligned_u64 keys; __aligned_u64 values; __u32 count; /* input/output: * input: # of key/value * elements * output: # of filled elements */ __u32 map_fd; __u64 elem_flags; __u64 flags; } batch; in_batch and out_batch are only used for lookup and lookup_and_delete since those are the only two operations that attempt to traverse the map. update/delete batch ops should provide the keys/values that user wants to modify. Here are the previous discussions on the batch processing: - https://lore.kernel.org/bpf/20190724165803.87470-1-brianvv@google.com/ - https://lore.kernel.org/bpf/20190829064502.2750303-1-yhs@fb.com/ - https://lore.kernel.org/bpf/20190906225434.3635421-1-yhs@fb.com/ Changelog sinve v2: - Add generic batch support for lpm_trie and test it (Yonghong Song) - Use define MAP_LOOKUP_RETRIES for retries (John Fastabend) - Return errors directly and remove labels (Yonghong Song) - Insert new API functions into libbpf alphabetically (Yonghong Song) - Change hlist_nulls_for_each_entry_rcu to hlist_nulls_for_each_entry_safe in htab batch ops (Yonghong Song) Changelog since v1: - Fix SOB ordering and remove Co-authored-by tag (Alexei Starovoitov) Changelog since RFC: - Change batch to in_batch and out_batch to support more flexible opaque values to iterate the bpf maps. - Remove update/delete specific batch ops for htab and use the generic implementations instead. Brian Vazquez (7): bpf: add bpf_map_{value_size,update_value,map_copy_value} functions bpf: add generic support for lookup and lookup_and_delete batch ops bpf: add generic support for update and delete batch ops bpf: add lookup and updated batch ops to arraymap bpf: add generic_batch_ops to lpm_trie map selftests/bpf: add batch ops testing to array bpf map selftests/bpf: add batch ops testing to lpm_trie bpf map Yonghong Song (4): bpf: add batch ops to all htab bpf map tools/bpf: sync uapi header bpf.h libbpf: add libbpf support to batch ops selftests/bpf: add batch ops testing for htab and htab_percpu map include/linux/bpf.h | 21 + include/uapi/linux/bpf.h | 21 + kernel/bpf/arraymap.c | 2 + kernel/bpf/hashtab.c | 242 ++++++++ kernel/bpf/lpm_trie.c | 4 + kernel/bpf/syscall.c | 562 ++++++++++++++---- tools/include/uapi/linux/bpf.h | 21 + tools/lib/bpf/bpf.c | 61 ++ tools/lib/bpf/bpf.h | 14 + tools/lib/bpf/libbpf.map | 4 + .../bpf/map_tests/array_map_batch_ops.c | 119 ++++ .../bpf/map_tests/htab_map_batch_ops.c | 269 +++++++++ .../bpf/map_tests/trie_map_batch_ops.c | 235 ++++++++ 13 files changed, 1451 insertions(+), 124 deletions(-) create mode 100644 tools/testing/selftests/bpf/map_tests/array_map_batch_ops.c create mode 100644 tools/testing/selftests/bpf/map_tests/htab_map_batch_ops.c create mode 100644 tools/testing/selftests/bpf/map_tests/trie_map_batch_ops.c -- 2.24.1.735.g03f4e72817-goog