From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on aws-us-west-2-korg-lkml-1.web.codeaurora.org Received: from vger.kernel.org (vger.kernel.org [23.128.96.18]) by smtp.lore.kernel.org (Postfix) with ESMTP id 05B56C433F5 for ; Fri, 11 Mar 2022 16:24:57 +0000 (UTC) Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1350300AbiCKQZ6 (ORCPT ); Fri, 11 Mar 2022 11:25:58 -0500 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:37572 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1350255AbiCKQZr (ORCPT ); Fri, 11 Mar 2022 11:25:47 -0500 Received: from mail-wr1-x430.google.com (mail-wr1-x430.google.com [IPv6:2a00:1450:4864:20::430]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 1B4A010856D for ; Fri, 11 Mar 2022 08:24:35 -0800 (PST) Received: by mail-wr1-x430.google.com with SMTP id r10so13742699wrp.3 for ; Fri, 11 Mar 2022 08:24:35 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20210112; h=from:to:cc:subject:date:message-id:in-reply-to:references :mime-version:content-transfer-encoding; bh=VkP407ey7sSKihQ8LsbKfHv3jstOQBCZaBhwp07qZIU=; b=NXwnVU51GJzaQT61LEnHs9xU39W5ZNyxjcDXLacr6fs6Q/LTduP5WGtZLrA5YK9zSG KR4GLMC/nE8qKQI3cDV+nM0TGJRq8tp+B6zacaSDX6x+vbKT1goRDtIWjvjkkeX0aONG IHw3ZPyYKZfx2Q2vcTZgvypAqZu1jTCZ32kTxVESvJiqpw2VpCD09+erAVLja4iSjNj4 zZIcSoeHlGtIuimIaUg+gZLgSF1myFJQeaYt09m/3p8+KrSGwkdseEOWXMo+77unwJlM Ph2WsRpvK97i2gVOtoyJJm8npUKFeIAxKoZUQxNj/sP7EskWyO9ty23kxpfS58qxGq+D o5Fw== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20210112; h=x-gm-message-state:from:to:cc:subject:date:message-id:in-reply-to :references:mime-version:content-transfer-encoding; bh=VkP407ey7sSKihQ8LsbKfHv3jstOQBCZaBhwp07qZIU=; b=4dnDCDLPylxxIhh7gKD/rUpnb3GG4BRTxe+Pmw+8Qay2ao/xskXc2ZOrDCF3uhXFe4 89AWNduQYIj0t7kZkhq+OU3lG5StaNDyHV0x4TizVYjOBDzu65ymYc2QAbQBN/QkJb6i HyRG/yBH8VcBt/1DZrWpHRa8svObjNUS+F1uCDgejuAPak4VjAnE3AIcr/wtzliMxr2S cybAC8r9sG0/8kVU1+E9DWn7AonmzQGooV3ftTVQEtFYMyasJaYBjf6WSP1q7VpRfECL iC9xnwYCAJdi+GXIhtO9iQIGNapSp9lEEqN9Y4kNNXZuF6D7xpKoi+u4TAJgiK+4cA9e N06w== X-Gm-Message-State: AOAM530eqkOmt7v0kOEkpcToVNmCiRo4meRnnvKvnbXeomi4/ixQy3Gu rYIHqrvRKSfmOvlGOZZ5qE34Pa6S4LdCcg== X-Google-Smtp-Source: ABdhPJxzN/Xtmnr9ssvL7iqp5/d+jKlD3LwscWw5KHNy8I6QuK4wLG7lAFVX/vCtvD+Aq+SeRB1Zxw== X-Received: by 2002:adf:dc91:0:b0:1f0:728c:8faf with SMTP id r17-20020adfdc91000000b001f0728c8fafmr8046192wrj.287.1647015873266; Fri, 11 Mar 2022 08:24:33 -0800 (PST) Received: from vm.nix.is (vm.nix.is. [2a01:4f8:120:2468::2]) by smtp.gmail.com with ESMTPSA id f22-20020a1cc916000000b00380d3e49e89sm7318667wmb.22.2022.03.11.08.24.32 (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Fri, 11 Mar 2022 08:24:32 -0800 (PST) From: =?UTF-8?q?=C3=86var=20Arnfj=C3=B6r=C3=B0=20Bjarmason?= To: git@vger.kernel.org Cc: Junio C Hamano , Derrick Stolee , Jonathan Tan , Jonathan Nieder , Albert Cui , "Robin H . Johnson" , Teng Long , =?UTF-8?q?=C3=86var=20Arnfj=C3=B6r=C3=B0=20Bjarmason?= Subject: [RFC PATCH v2 03/13] bundle-uri client: add "bundle-uri" parsing + tests Date: Fri, 11 Mar 2022 17:24:15 +0100 Message-Id: X-Mailer: git-send-email 2.35.1.1337.g7e32d794afe In-Reply-To: References: MIME-Version: 1.0 Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: 8bit Precedence: bulk List-ID: X-Mailing-List: git@vger.kernel.org Add a "test-tool bundle-uri parse" which parses the format defined in the newly specified "bundle-uri" command. As note in the "bundle-uri" section in protocol-v2.txt we haven't specified any key-values yet, just URI lines, but we should parse their format for conformity with the spec. We need to make sure our future client doesn't die if this optional data is ever provided by the server, and that we've covered all the edge cases with these key-values in our specification. Let's add and test a bundle_uri_parse_line() to do that. Signed-off-by: Ævar Arnfjörð Bjarmason --- Makefile | 1 + bundle-uri.c | 124 +++++++++++++++++++++++++++++ bundle-uri.h | 16 ++++ t/helper/test-bundle-uri.c | 83 +++++++++++++++++++ t/helper/test-tool.c | 1 + t/helper/test-tool.h | 1 + t/t5750-bundle-uri-parse.sh | 153 ++++++++++++++++++++++++++++++++++++ 7 files changed, 379 insertions(+) create mode 100644 t/helper/test-bundle-uri.c create mode 100755 t/t5750-bundle-uri-parse.sh diff --git a/Makefile b/Makefile index 5a3d35109a1..4fec8e5af09 100644 --- a/Makefile +++ b/Makefile @@ -696,6 +696,7 @@ PROGRAMS += $(patsubst %.o,git-%$X,$(PROGRAM_OBJS)) TEST_BUILTINS_OBJS += test-advise.o TEST_BUILTINS_OBJS += test-bitmap.o TEST_BUILTINS_OBJS += test-bloom.o +TEST_BUILTINS_OBJS += test-bundle-uri.o TEST_BUILTINS_OBJS += test-chmtime.o TEST_BUILTINS_OBJS += test-config.o TEST_BUILTINS_OBJS += test-crontab.o diff --git a/bundle-uri.c b/bundle-uri.c index ff054ddc690..9827fc5da17 100644 --- a/bundle-uri.c +++ b/bundle-uri.c @@ -53,3 +53,127 @@ int bundle_uri_command(struct repository *r, return 0; } + +/** + * General API for {transport,connect}.c etc. + */ +int bundle_uri_parse_line(struct string_list *bundle_uri, const char *line) +{ + size_t i; + struct string_list columns = STRING_LIST_INIT_DUP; + const char *uri; + struct string_list *uri_columns = NULL; + int ret = 0; + + if (!strlen(line)) + return error(_("bundle-uri: got an empty line")); + + /* + * Right now we don't understand anything beyond the first SP, + * but let's be tolerant and ignore any future unknown + * fields. See the "MUST" note about "bundle-feature-key" in + * Documentation/technical/protocol-v2.txt + */ + if (string_list_split(&columns, line, ' ', -1) < 1) + return error(_("bundle-uri: line not in SP-delimited format: %s"), line); + + /* + * We represent a "[ ...]" line with the URI + * being the .string in a string list, and the .util being an + * optional string list of key (.string) and values + * (.util). If the top-level .util is NULL there's no + * key-value pairs.... + */ + uri = columns.items[0].string; + if (!strlen(uri)) { + ret = error(_("bundle-uri: got an empty URI component")); + goto cleanup; + } + + /* + * ... we're going to need that non-NULL .util . + */ + if (columns.nr > 1) { + uri_columns = xcalloc(1, sizeof(struct string_list)); + string_list_init_dup(uri_columns); + } + + /* + * Let's parse the optional "kv" format, even if we don't + * understand any of the keys or values yet. + */ + for (i = 1; i < columns.nr; i++) { + struct string_list kv = STRING_LIST_INIT_DUP; + const char *arg = columns.items[i].string; + int fields = string_list_split(&kv, arg, '=', 2); + int err = 0; + + switch (fields) { + case 0: + BUG("should have no fields=0"); + case 1: + if (!strlen(arg)) { + err = error("bundle-uri: column %lu: got an empty attribute (full line was '%s')", + i, line); + break; + } + /* + * We could dance around with + * string_list_append_nodup() and skip + * string_list_clear(&kv, 0) here, but let's + * keep it simple. + */ + string_list_append(uri_columns, arg); + break; + case 2: + { + const char *k = kv.items[0].string; + const char *v = kv.items[1].string; + + string_list_append(uri_columns, k)->util = xstrdup(v); + break; + } + default: + err = error("bundle-uri: column %lu: '%s' more than one '=' character (full line was '%s')", + i, arg, line); + break; + } + + string_list_clear(&kv, 0); + if (err) { + ret = err; + break; + } + } + + + /* + * Per the spec we'll only consider bundle-uri lines OK if + * there were no parsing problems, even if the problems were + * with attributes whose content we don't understand. + */ + if (ret && uri_columns) { + string_list_clear(uri_columns, 1); + free(uri_columns); + } else if (!ret) { + string_list_append(bundle_uri, uri)->util = uri_columns; + } + +cleanup: + string_list_clear(&columns, 0); + return ret; +} + +static void bundle_uri_string_list_clear_cb(void *util, const char *string) +{ + struct string_list *fields = util; + if (!fields) + return; + string_list_clear(fields, 1); + free(fields); +} + +void bundle_uri_string_list_clear(struct string_list *bundle_uri) +{ + string_list_clear_func(bundle_uri, bundle_uri_string_list_clear_cb); +} diff --git a/bundle-uri.h b/bundle-uri.h index 5a7e556a0ba..be6d1df97ff 100644 --- a/bundle-uri.h +++ b/bundle-uri.h @@ -3,6 +3,7 @@ #include "repository.h" #include "pkt-line.h" #include "strbuf.h" +#include "string-list.h" /** * API used by serve.[ch]. @@ -10,4 +11,19 @@ int bundle_uri_advertise(struct repository *r, struct strbuf *value); int bundle_uri_command(struct repository *r, struct packet_reader *request); +/** + * General API for {transport,connect}.c etc. + */ + +/** + * bundle_uri_parse_line() returns 0 when a valid bundle-uri has been + * added to `bundle_uri`, <0 on error. + */ +int bundle_uri_parse_line(struct string_list *bundle_uri, const char *line); + +/** + * Clear the `bundle_uri` list. Just a very thin wrapper on + * string_list_clear(). + */ +void bundle_uri_string_list_clear(struct string_list *bundle_uri); #endif /* BUNDLE_URI_H */ diff --git a/t/helper/test-bundle-uri.c b/t/helper/test-bundle-uri.c new file mode 100644 index 00000000000..805a86c0130 --- /dev/null +++ b/t/helper/test-bundle-uri.c @@ -0,0 +1,83 @@ +#include "test-tool.h" +#include "parse-options.h" +#include "bundle-uri.h" +#include "strbuf.h" +#include "string-list.h" + +static int cmd__bundle_uri_parse(int argc, const char **argv) +{ + const char *usage[] = { + "test-tool bundle-uri parse util; + + fprintf(stdout, "%s", item->string); + if (!kv) { + fprintf(stdout, "\n"); + continue; + } + for_each_string_list_item(kv_item, kv) { + const char *k = kv_item->string; + const char *v = kv_item->util; + + if (v) + fprintf(stdout, " [kv: %s => %s]", k, v); + else + fprintf(stdout, " [attr: %s]", k); + } + fprintf(stdout, "\n"); + } + strbuf_release(&sb); + + bundle_uri_string_list_clear(&list); + + return err < 0 ? 1 : 0; +usage: + usage_with_options(usage, options); +} + +int cmd__bundle_uri(int argc, const char **argv) +{ + const char *usage[] = { + "test-tool bundle-uri []", + NULL + }; + struct option options[] = { + OPT_END(), + }; + + argc = parse_options(argc, argv, NULL, options, usage, + PARSE_OPT_STOP_AT_NON_OPTION | + PARSE_OPT_KEEP_ARGV0); + if (argc == 1) + goto usage; + + if (!strcmp(argv[1], "parse")) + return cmd__bundle_uri_parse(argc - 1, argv + 1); + error("there is no test-tool bundle-uri tool '%s'", argv[1]); + +usage: + usage_with_options(usage, options); +} diff --git a/t/helper/test-tool.c b/t/helper/test-tool.c index e6ec69cf326..dc73e68f329 100644 --- a/t/helper/test-tool.c +++ b/t/helper/test-tool.c @@ -17,6 +17,7 @@ static struct test_cmd cmds[] = { { "advise", cmd__advise_if_enabled }, { "bitmap", cmd__bitmap }, { "bloom", cmd__bloom }, + { "bundle-uri", cmd__bundle_uri }, { "chmtime", cmd__chmtime }, { "config", cmd__config }, { "crontab", cmd__crontab }, diff --git a/t/helper/test-tool.h b/t/helper/test-tool.h index 20756eefdda..927b6b418cd 100644 --- a/t/helper/test-tool.h +++ b/t/helper/test-tool.h @@ -7,6 +7,7 @@ int cmd__advise_if_enabled(int argc, const char **argv); int cmd__bitmap(int argc, const char **argv); int cmd__bloom(int argc, const char **argv); +int cmd__bundle_uri(int argc, const char **argv); int cmd__chmtime(int argc, const char **argv); int cmd__config(int argc, const char **argv); int cmd__crontab(int argc, const char **argv); diff --git a/t/t5750-bundle-uri-parse.sh b/t/t5750-bundle-uri-parse.sh new file mode 100755 index 00000000000..70fd1b398e9 --- /dev/null +++ b/t/t5750-bundle-uri-parse.sh @@ -0,0 +1,153 @@ +#!/bin/sh + +test_description="Test bundle-uri bundle_uri_parse_line()" + +TEST_NO_CREATE_REPO=1 +TEST_PASSES_SANITIZE_LEAK=true +. ./test-lib.sh + +test_expect_success 'bundle_uri_parse_line() just URIs' ' + cat >in <<-\EOF && + http://example.com/bundle.bdl + https://example.com/bundle.bdl + file:///usr/share/git/bundle.bdl + EOF + + # For the simple case + cp in expect && + + test-tool bundle-uri parse actual 2>err && + test_must_be_empty err && + test_cmp expect actual +' + +test_expect_success 'bundle_uri_parse_line() with attributes' ' + cat >in <<-\EOF && + http://example.com/bundle1.bdl attr + http://example.com/bundle2.bdl ibute + EOF + + cat >expect <<-\EOF && + http://example.com/bundle1.bdl [attr: attr] + http://example.com/bundle2.bdl [attr: ibute] + EOF + + test-tool bundle-uri parse actual 2>err && + test_must_be_empty err && + test_cmp expect actual +' + +test_expect_success 'bundle_uri_parse_line() with attributes and key-value attributes' ' + cat >in <<-\EOF && + http://example.com/bundle1.bdl x a=b y c=d z e=f a=b + EOF + + + cat >expect <<-\EOF && + http://example.com/bundle1.bdl [attr: x] [kv: a => b] [attr: y] [kv: c => d] [attr: z] [kv: e => f] [kv: a => b] + EOF + + test-tool bundle-uri parse actual 2>err && + test_must_be_empty err && + test_cmp expect actual +' + +test_expect_success 'bundle_uri_parse_line() parsing edge cases: extra SP' ' + cat >in <<-\EOF && + http://example.com/bundle1.bdl one-space + http://example.com/bundle2.bdl two-space + http://example.com/bundle3.bdl three-space + EOF + + cat >err.expect <<-\EOF && + error: bundle-uri: column 1: got an empty attribute (full line was '\''http://example.com/bundle2.bdl two-space'\'') + error: bad line: '\''http://example.com/bundle2.bdl two-space'\'' + error: bundle-uri: column 1: got an empty attribute (full line was '\''http://example.com/bundle3.bdl three-space'\'') + error: bad line: '\''http://example.com/bundle3.bdl three-space'\'' + EOF + + cat >expect <<-\EOF && + http://example.com/bundle1.bdl [attr: one-space] + EOF + + test_must_fail test-tool bundle-uri parse actual 2>err.actual && + test_cmp err.expect err.actual && + test_cmp expect actual +' + +test_expect_success 'bundle_uri_parse_line() parsing edge cases: empty lines' ' + cat >in <<-\EOF && + http://example.com/bundle1.bdl + + http://example.com/bundle2.bdl a=b + + http://example.com/bundle3.bdl + EOF + + cat >err.expect <<-\EOF && + error: bundle-uri: got an empty line + error: bad line: '\'''\'' + error: bundle-uri: got an empty line + error: bad line: '\'''\'' + EOF + + # We fail, but try to continue parsing regardless + cat >expect <<-\EOF && + http://example.com/bundle1.bdl + http://example.com/bundle2.bdl [kv: a => b] + http://example.com/bundle3.bdl + EOF + + test_must_fail test-tool bundle-uri parse actual 2>err.actual && + test_cmp err.expect err.actual && + test_cmp expect actual +' + +test_expect_success 'bundle_uri_parse_line() parsing edge cases: empty URIs' ' + sed "s/> //" >in <<-\EOF && + http://example.com/bundle1.bdl + > a=b + http://example.com/bundle3.bdl a=b + EOF + + cat >err.expect <<-\EOF && + error: bundle-uri: got an empty URI component + error: bad line: '\'' a=b'\'' + EOF + + # We fail, but try to continue parsing regardless + cat >expect <<-\EOF && + http://example.com/bundle1.bdl + http://example.com/bundle3.bdl [kv: a => b] + EOF + + test_must_fail test-tool bundle-uri parse actual 2>err.actual && + test_cmp err.expect err.actual && + test_cmp expect actual +' + +test_expect_success 'bundle_uri_parse_line() parsing edge cases: multiple = in key-values' ' + cat >in <<-\EOF && + http://example.com/bundle1.bdl k=v=extra + http://example.com/bundle2.bdl a=b k=v=extra c=d + http://example.com/bundle3.bdl kv=ok + EOF + + cat >err.expect <<-\EOF && + error: bundle-uri: column 1: '\''k=v=extra'\'' more than one '\''='\'' character (full line was '\''http://example.com/bundle1.bdl k=v=extra'\'') + error: bad line: '\''http://example.com/bundle1.bdl k=v=extra'\'' + error: bundle-uri: column 2: '\''k=v=extra'\'' more than one '\''='\'' character (full line was '\''http://example.com/bundle2.bdl a=b k=v=extra c=d'\'') + error: bad line: '\''http://example.com/bundle2.bdl a=b k=v=extra c=d'\'' + EOF + + # We fail, but try to continue parsing regardless + cat >expect <<-\EOF && + http://example.com/bundle3.bdl [kv: kv => ok] + EOF + + test_must_fail test-tool bundle-uri parse actual 2>err.actual && + test_cmp err.expect err.actual && + test_cmp expect actual +' + +test_done -- 2.35.1.1337.g7e32d794afe