From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on aws-us-west-2-korg-lkml-1.web.codeaurora.org Received: from vger.kernel.org (vger.kernel.org [23.128.96.18]) by smtp.lore.kernel.org (Postfix) with ESMTP id 6711FC433F5 for ; Fri, 4 Mar 2022 14:03:23 +0000 (UTC) Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S239755AbiCDOEJ (ORCPT ); Fri, 4 Mar 2022 09:04:09 -0500 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:39978 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S239483AbiCDOEH (ORCPT ); Fri, 4 Mar 2022 09:04:07 -0500 Received: from mail-ot1-x334.google.com (mail-ot1-x334.google.com [IPv6:2607:f8b0:4864:20::334]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id D63274C405 for ; Fri, 4 Mar 2022 06:03:19 -0800 (PST) Received: by mail-ot1-x334.google.com with SMTP id a7-20020a9d5c87000000b005ad1467cb59so7511810oti.5 for ; Fri, 04 Mar 2022 06:03:19 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=github.com; s=google; h=message-id:date:mime-version:user-agent:subject:content-language :from:to:cc:references:in-reply-to:content-transfer-encoding; bh=lvuOZBRA3RhdHllAWzbAmVJJjYL8GAj9F6eruABxjtw=; b=ODFkMEDy1xFqxEZe2XAyvzeH7flU727swXCXjtGjAnRiZSbsYEQns4Kw7ouT1Xhyx6 zx0iLDgewCTrLshCB4kSaFEBg1YnaCAHukyBeHGC2JDewAJaHgKShYTgT1vnYKQCyhtv SUCzM7Qm9ZoL3B517iKEvTpRxfeaKBOVbmecfVWkafiEZu1gTai71Xdr6mJTfiDqZNsm DzRKb3MTVsvDIFmRGecgKBIuQ3E4VxUs/AtXC+sxovhKInJXx5S+xqNsqmXcwDXPZAAH Q58+WKsX8PIchkqaIRvErqwr2QgluTblVgs7yrfinHd+ASBjETpZwkbQwStPFz2lpb8P b1GQ== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20210112; h=x-gm-message-state:message-id:date:mime-version:user-agent:subject :content-language:from:to:cc:references:in-reply-to :content-transfer-encoding; bh=lvuOZBRA3RhdHllAWzbAmVJJjYL8GAj9F6eruABxjtw=; b=4zWWJCNIO2hmF7l8PN2BndKGsBnQlJoOa77ES6wE1JariFNUSxKffPFRDkxEfv2eNa UtpQqKuzL2EnQcti87U4TAvD5nl39S2QmVjHa73javcoE0W3tLM/pTZOFNyKC6c0KMLq 7V0zMhQF4C/YOgXrt6Cd59v7yCmAjaU8Ab56REs2LDeis5T/yQWL6DHVqtE5HJP4typ1 ErRGN4oYvM4XQc/uNfCa9iL+CCz3eYsWuLsjiNpc4R1UGO2QbJ4YM8YvHUv5sbz+C7Ee B6TGDu6oAxveXeZxap10QDlFpVvE93pCJb1l98gHdkFs3Rsf0EbMEee9q/HOK6c8Qp8e ED4w== X-Gm-Message-State: AOAM533ka8HlgLDibLjH2oF24QhJGQvTOBpIT6ptk6Jx/T5I8q+8VOvX lDRIkNFWt4r0un5CWVFXFF2D X-Google-Smtp-Source: ABdhPJxPI3WfdsDzGL6YIujjlnZQ3dquJE67nSTL9B3P5chuVXUdgIJ4Prc9vkrF8+UZZotSRs2ucw== X-Received: by 2002:a05:6830:1d8f:b0:5af:14a6:4b03 with SMTP id y15-20020a0568301d8f00b005af14a64b03mr22903844oti.145.1646402599014; Fri, 04 Mar 2022 06:03:19 -0800 (PST) Received: from [192.168.1.110] ([99.85.27.166]) by smtp.gmail.com with ESMTPSA id 36-20020a9d0ba7000000b005ad59f1f783sm2445660oth.3.2022.03.04.06.03.17 (version=TLS1_3 cipher=TLS_AES_128_GCM_SHA256 bits=128/128); Fri, 04 Mar 2022 06:03:18 -0800 (PST) Message-ID: <06ea3190-32d0-c792-0ae9-c5600305f158@github.com> Date: Fri, 4 Mar 2022 09:03:15 -0500 MIME-Version: 1.0 User-Agent: Mozilla/5.0 (Windows NT 10.0; Win64; x64; rv:91.0) Gecko/20100101 Thunderbird/91.6.1 Subject: Re: [PATCH 3/7] commit-graph: start parsing generation v2 (again) Content-Language: en-US From: Derrick Stolee To: Patrick Steinhardt Cc: Derrick Stolee via GitGitGadget , git@vger.kernel.org, me@ttaylorr.com, gitster@pobox.com, abhishekkumar8222@gmail.com References: <1b9912f7-87be-2520-bb53-9e23529ad233@github.com> <33deae83-1afd-1645-82f3-5af14f14094d@github.com> In-Reply-To: <33deae83-1afd-1645-82f3-5af14f14094d@github.com> Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: 7bit Precedence: bulk List-ID: X-Mailing-List: git@vger.kernel.org On 3/3/2022 11:00 AM, Derrick Stolee wrote: > On 3/3/2022 6:19 AM, Patrick Steinhardt wrote: >> On Wed, Mar 02, 2022 at 09:57:17AM -0500, Derrick Stolee wrote: >>> On 3/2/2022 8:57 AM, Patrick Steinhardt wrote: >>>> On Tue, Mar 01, 2022 at 10:25:46AM -0500, Derrick Stolee wrote: >>>>> On 3/1/2022 9:53 AM, Patrick Steinhardt wrote: >>> >>>>>> Hum. I have re-verified, and this indeed seems to play out. So I must've >>>>>> accidentally ran all my testing with the state generated without the >>>>>> final patch which fixes the corruption. I do see lots of the following >>>>>> warnings, but overall I can verify and write the commit-graph just fine: >>>>>> >>>>>> commit-graph generation for commit c80a42de8803e2d77818d0c82f88e748d7f9425f is 1623362063 < 1623362139 >>>>> >>>>> But I'm not able to generate these warnings from either version. I >>>>> tried generating different levels of a split commit-graph, but >>>>> could not reproduce it. If you have reproduction steps using current >>>>> 'master' (or any released Git version) and the four patches here, >>>>> then I would love to get a full understanding of your errors. >>>>> >>>>> Thanks, >>>>> -Stolee >>>> >>>> I haven't yet been able to reproduce it with publicly available data, >>>> but with the internal references I'm able to evoke the warnings >>>> reliably. It only works when I have two repositories connected via >>>> alternates, when generating the commit-graph in the linked-to repo >>>> first, and then generating the commit-graph in the linking repo. >>>> >>>> The following recipe allows me to reproduce, but rely on private data: >>>> >>>> $ git --version >>>> git version 2.35.1 >>>> >>>> # The pool repository is the one we're linked to from the fork. >>>> $ cd "$pool" >>>> $ rm -rf objects/info/commit-graph objects/info/commit-graph >>>> $ git commit-graph write --split >>>> >>>> $ cd "$fork" >>>> $ rm -rf objects/info/commit-graph objects/info/commit-graph >>>> $ git commit-graph write --split >>>> >>>> $ git commit-graph verify --no-progress >>>> $ echo $? >>>> 0 >>>> >>>> # This is 715d08a9e51251ad8290b181b6ac3b9e1f9719d7 with your full v2 >>>> # applied on top. >>>> $ ~/Development/git/bin-wrappers/git --version >>>> git version 2.35.1.358.g7ede1bea24 >>>> >>>> $ ~/Development/git/bin-wrappers/git commit-graph verify --no-progress >>>> commit-graph generation for commit 06a91bac00ed11128becd48d5ae77eacd8f24c97 is 1623273624 < 1623273710 >>>> commit-graph generation for commit 0ae91029f27238e8f8e109c6bb3907f864dda14f is 1622151146 < 1622151220 >>>> commit-graph generation for commit 0d4582a33d8c8e3eb01adbf564f5e1deeb3b56a2 is 1631045222 < 1631045225 >>>> commit-graph generation for commit 0daf8976439d7e0bb9710c5ee63b570580e0dc03 is 1620347739 < 1620347789 >>>> commit-graph generation for commit 0e0ee8ffb3fa22cee7d28e21cbd6df26454932cf is 1623783297 < 1623783380 >>>> commit-graph generation for commit 0f08ab3de6ec115ea8a956a1996cb9759e640e74 is 1621543278 < 1621543339 >>>> commit-graph generation for commit 133ed0319b5a66ae0c2be76e5a887b880452b111 is 1620949864 < 1620949915 >>>> commit-graph generation for commit 1341b3e6c63343ae94a8a473fa057126ddd4669a is 1637344364 < 1637344384 >>>> commit-graph generation for commit 15bdfc501c2c9f23e9353bf6e6a5facd9c32a07a is 1623348103 < 1623348133 >>>> ... >>>> $ echo $? >>>> 1 >>>> >>>> When generating commit-graphs with your patches applied the `verify` >>>> step works alright. >>>> >>>> I've also by accident stumbled over the original error again: >>>> >>>> fatal: commit-graph requires overflow generation data but has none >>>> >>>> This time it's definitely not caused by generating commit-graphs with an >>>> in-between state of your patch series because the data comes straight >>>> from production with no changes to the commit-graphs performed by >>>> myself. There we're running Git v2.33.1 with a couple of backported >>>> patches (see [1]). While those patches cause us to make more use of the >>>> commit-graph, none modify the way we generate them. >>>> >>>> Of note is that the commit-graph contains references to commits which >>>> don't exist in the ODB anymore. >>>> >>>> Patrick >>>> >>>> [1]: https://gitlab.com/gitlab-org/gitlab-git/-/commits/pks-v2.33.1.gl3 >>> >>> Thank you for your diligence here, Patrick. I really appreciate the >>> work you're putting in to verify the situation. >>> >>> Since our repro relies on private information, but is consistent, I >>> wonder if we should take the patch below, which starts to ignore the >>> older generation number v2 data and only writes freshly-computed >>> numbers. >>> >>> Thanks, >>> -Stolee >> >> Thanks. With your patch below the `fatal:` error is gone, but I'm still >> seeing the same errors with regards to the commit-graph generations. > > This is disappointing and unexpected. Thanks for verifying. > >> So to summarize my findings: >> >> - This bug occurs when writing commit-graphs with v2.35.1, but >> reading them with your patches. >> >> - This bug occurs when I have two repositories connected via an >> alternates file. I haven't yet been able to reproduce it in a >> single repository that is not connected to a separate ODB. > > This is an interesting distinction. One that I didn't think would > matter, but I'll look into the code to see how that could affect > things. > >> - This bug only occurs when I first generate the commit-graph in the >> repository I'm borrowing objects from. >> >> - This bug only occurs when I write commit-graphs with `--split` in >> both repositories. "Normal" commit-graphs don't have this issue, >> and neither can I see it with `--split=replace` or mixed-type >> commit-graphs. >> >> Beware, the following explanation is based on my very basic >> understanding of the commit-graph code and thus more likely to be wrong >> than right: >> >> With the old Git version, we've been mis-parsing the generation because >> `read_generation_data` wasn't ever set. As a result it can happen that >> the second split commit-graph we're generating computes its own >> generation numbers from the wrong starting point because it uses the >> mis-parsed generation numbers from the parent commit-graph. >> >> With your patches, we start to correctly account for overflows and would >> thus end up with a different value for the generation depending on where >> we parse the commit from: if we parse it from the first commit-graph it >> would be correct because it's contains the "root" of the generation >> numbers. But if we parse a commit from the second commit-graph we may >> have a mismatch because the generation numbers in there may have been >> derived from generation numbers mis-parsed from the first commit-graph. >> And because these would be wrong in case there was an overflow it is >> clear that the new corrected generation number may be wrong, as well. > > Hm. My expectation was that the older layers of the split commit-graph > would have read_generation_data disabled (because the new Git version > cannot read the GDAT chunk) and then the validate_mixed_generation_chain() > method would remove read_generation_data from all of the graphs in the > list. > > Combining this with your thoughts on cross-alternate split commit-graphs, > this makes me think we should try this: > > --- >8 --- > > diff --git a/commit-graph.c b/commit-graph.c > index fb2ced0bd6..74c6534f56 100644 > --- a/commit-graph.c > +++ b/commit-graph.c > @@ -609,8 +609,6 @@ struct commit_graph *read_commit_graph_one(struct repository *r, > if (!g) > g = load_commit_graph_chain(r, odb); > > - validate_mixed_generation_chain(g); > - > return g; > } > > @@ -668,7 +666,13 @@ static int prepare_commit_graph(struct repository *r) > !r->objects->commit_graph && odb; > odb = odb->next) > prepare_commit_graph_one(r, odb); > - return !!r->objects->commit_graph; > + > + if (r->objects->commit_graph) { > + validate_mixed_generation_chain(r->objects->commit_graph); > + return 1; > + } > + > + return 0; > } > > int generation_numbers_enabled(struct repository *r) > > > --- >8 --- > > Notice that I'm moving the validate_mixed_generation_chain() call > out of read_commit_graph_one() and into prepare_commit_graph(). To > my understanding, this _should_ have an equivalent end state as the > old code, but might be worth trying just as a quick check. > > I will continue investigating and try to reproduce with this > additional constraint of working across an alternate. My attempts to reproduce this across an alternate have failed. I tried running the following test against Git without these patches, then verify with the newer version of Git. (I also have generated a few new layers on top with these patches, and they correctly drop the GDA2 and GDO2 chunks when the lower layers "don't have gen v2".) test_description='commit-graph with offsets across alternates' . ./test-lib.sh if ! test_have_prereq TIME_IS_64BIT || ! test_have_prereq TIME_T_IS_64BIT then skip_all='skipping 64-bit timestamp tests' test_done fi UNIX_EPOCH_ZERO="@0 +0000" FUTURE_DATE="@4147483646 +0000" GIT_TEST_COMMIT_GRAPH_CHANGED_PATHS=0 test_expect_success 'generate alternate split commit-graph' ' git init alternate && ( cd alternate && test_commit --date "$UNIX_EPOCH_ZERO" 1 && test_commit --date "$FUTURE_DATE" 2 && git commit-graph write --reachable && test_commit --date "$UNIX_EPOCH_ZERO" 3 && test_commit --date "$FUTURE_DATE" 4 && git commit-graph write --reachable --split=no-merge ) && git clone --shared alternate fork && ( cd fork && test_commit --date "$UNIX_EPOCH_ZERO" 5 && test_commit --date "$FUTURE_DATE" 6 && git commit-graph write --reachable --split=no-merge && test_commit --date "$UNIX_EPOCH_ZERO" 7 && test_commit --date "$FUTURE_DATE" 8 && git commit-graph write --reachable --split=no-merge ) ' test_done My testing after running this with -d allows me to reliably see these layers being created with GDAT and GDOV chunks. Running the 'git commit-graph verify' command with the new code does not show those errors, even after adding commits and another layer to the split commit-graph. I look forward to any additional insights you might have here. Thanks, -Stolee