From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on aws-us-west-2-korg-lkml-1.web.codeaurora.org X-Spam-Level: X-Spam-Status: No, score=-9.0 required=3.0 tests=BAYES_00, HEADER_FROM_DIFFERENT_DOMAINS,MAILING_LIST_MULTI,MENTIONS_GIT_HOSTING, SPF_HELO_NONE,SPF_PASS autolearn=ham autolearn_force=no version=3.4.0 Received: from mail.kernel.org (mail.kernel.org [198.145.29.99]) by smtp.lore.kernel.org (Postfix) with ESMTP id 8EEC5C433DF for ; Tue, 11 Aug 2020 05:22:29 +0000 (UTC) Received: from vger.kernel.org (vger.kernel.org [23.128.96.18]) by mail.kernel.org (Postfix) with ESMTP id 72939206C3 for ; Tue, 11 Aug 2020 05:22:29 +0000 (UTC) Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1726182AbgHKFW2 (ORCPT ); Tue, 11 Aug 2020 01:22:28 -0400 Received: from cloud.peff.net ([104.130.231.41]:54808 "EHLO cloud.peff.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1726020AbgHKFW2 (ORCPT ); Tue, 11 Aug 2020 01:22:28 -0400 Received: (qmail 26520 invoked by uid 109); 11 Aug 2020 05:22:28 -0000 Received: from Unknown (HELO peff.net) (10.0.1.2) by cloud.peff.net (qpsmtpd/0.94) with ESMTP; Tue, 11 Aug 2020 05:22:28 +0000 Authentication-Results: cloud.peff.net; auth=none Received: (qmail 13349 invoked by uid 111); 11 Aug 2020 05:22:27 -0000 Received: from coredump.intra.peff.net (HELO sigill.intra.peff.net) (10.0.0.2) by peff.net (qpsmtpd/0.94) with (TLS_AES_256_GCM_SHA384 encrypted) ESMTPS; Tue, 11 Aug 2020 01:22:27 -0400 Authentication-Results: peff.net; auth=none Date: Tue, 11 Aug 2020 01:22:26 -0400 From: Jeff King To: =?utf-8?B?xJBvw6BuIFRy4bqnbiBDw7RuZw==?= Danh Cc: Junio C Hamano , git@vger.kernel.org Subject: Re: [RFC PATCH 0/2] extend --abbrev support to diff-patch format Message-ID: <20200811052226.GA82699@coredump.intra.peff.net> References: <20200810100038.GB37030@coredump.intra.peff.net> <20200810123148.GB17119@danh.dev> <20200810152705.GA61606@coredump.intra.peff.net> <20200811003359.GD17119@danh.dev> MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Disposition: inline Content-Transfer-Encoding: 8bit In-Reply-To: <20200811003359.GD17119@danh.dev> Sender: git-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: git@vger.kernel.org On Tue, Aug 11, 2020 at 07:33:59AM +0700, Đoàn Trần Công Danh wrote: > > Yeah, that's what I was getting at: if you care about robust > > machine-readability, then the full index is the best solution. Reading > > between the lines, I think the argument may be "using --full-index is > > too long and therefore ugly, so people like the short-ish names but with > > a bit of extra safety". > > My argument was people can either easily fetch the patch via HTTP like: > > curl -LO https://github.com/git/git/commit/eb12adc74cf22add318f884072be2071d181abaa.patch > > or take it from a mailing list archive, bugzilla, instead of > cloning a full repository. With those options, we can't say, > "we prefer full-index, please send us the patch with full-index > instead". OK. But then how would they use "--abbrev" in that case? I.e., isn't it too late at that point (especially in the mailing list archive case) to do change anything in the formatting of the patch? Maybe I'm confused... > > There's an extra challenge here, which is that you have to convince the > > sender to use the extra --abbrev option, even though they themselves > > won't be the ones running into the problem when applying. > > Not really, since the sender tree is usually larger than the archived > tree, their abbrev is usually long enough, and the receiver will use > --abbrev to lengthen their abbrev to reduce the noise instead. Now I'm doubly confused. If the sender has the larger tree then they'll have the larger abbrev. So what's the problem? Going back to re-read your earlier responses...So...this _isn't_ a problem within Git itself? It's only about people trying to compare textual patches byte-for-byte and seeing different index lines? If that's the case, then it seems to me that the byte comparison is the problem here. If I have: index 1234abcd..5678bcde and index 1234abcd87..5678bcde65 those should be considered equivalent to see if two patches are plausibly the same. And I think tools like git-cherry, etc, would do that (and we provide git-patch-id for that purpose, too). > > Yeah, I certainly don't mind the extra flexibility between "full" and > > "default" for "index" lines. I do wonder if people want to configure the > > abbreviations for those lines separately from other parts. I don't know > > that I've ever particularly cared about that flexibility, but the fact > > that they were set up separately all those years ago makes me think > > somebody might. > > I don't think people particularly care about the index line (and to > the extent, its length) that much, since the default is number is > actually a minimum number, if Git can't differentiate object with that > number of characters, Git will show a longer object names anyway. > > I think most people scripts will put a regex for: > > /index [a-z0-9]{7,}\.\.[a-z0-9]{7,} [0-7]{6}/ > > Or even: > > /index [a-z0-9]+\.\.[a-z0-9]+ [0-7]+/ > > For the former case, we could change the code in 2/2 to set the minimum > default to DEFAULT_ABBREV instead of MINIMUM_ABBREV? > > For the historical case that users put both --full-index and --abbrev > into there scripts, we still keep our promise to not break their > script by always respect --full-index, regardless of --abbrev. I care less about scripting (as you note, anything consuming abbreviated objects has to handle longer-than-minimum names anyway), and was more wondering whether anybody really cared that: git log --abbrev=30 -p kept the short index lines (e.g., because they're easier to read). But I'm having trouble coming up with a plausible reason somebody would want long object names in earlier lines like "Merge:" but not in the patch index lines. And already we respect --abbrev for --raw, so it's not like the diff code isn't already affected. Making "-p" consistent with all the rest of it is probably worth doing regardless. -Peff