* [PATCH] diff: teach --stat to ignore uninteresting modifications @ 2020-08-01 23:33 Matthew Rogers via GitGitGadget 2020-08-02 0:31 ` Junio C Hamano 2020-08-14 22:45 ` [PATCH v2] " Matthew Rogers via GitGitGadget 0 siblings, 2 replies; 9+ messages in thread From: Matthew Rogers via GitGitGadget @ 2020-08-01 23:33 UTC (permalink / raw) To: git; +Cc: peff, Matthew Rogers, Matthew Rogers From: Matthew Rogers <mattr94@gmail.com> Sometimes when diffing, files may show as being momdified even when there are no interesting diffs to show. This happens naturally when using options such as --ignore-space-change. We don't want to prevent the display of all files that have 0 effective diffs since they could be the result of a rename, permission change, or other similar operation that may still be of interest so we special case additions and deletions as they are always interesting. Signed-off-by: Matthew Rogers <mattr94@gmail.com> --- diff: teach --stat to ignore uninteresting modifications This patch is based on the discussion these email threads: https://lore.kernel.org/git/1484704915.2096.16.camel@mattmccutchen.net/ https://lore.kernel.org/git/CAOjrSZtQPQ8Xxuz+7SGykR8Q-gFDEZANSE5yQASqKjpbUAq_5Q@mail.gmail.com/ With the code mostly taken from this specific message: https://lore.kernel.org/git/20170118111705.6bqzkklluikda3r5@sigill.intra.peff.net/ The summary is that when running git diff --stat in combination with --ignore-all-space or similar options, you'll see many lines of the form: some-file.txt | 0 which can be misleading when you are explicitly telling git to "ignore all space" or something similar. To rectify this issue, this patch categorizes all files that are modified but have no effective changes as not fit to display to the user. Published-As: https://github.com/gitgitgadget/git/releases/tag/pr-689%2FROGERSM94%2Fzero-diffs-v1 Fetch-It-Via: git fetch https://github.com/gitgitgadget/git pr-689/ROGERSM94/zero-diffs-v1 Pull-Request: https://github.com/gitgitgadget/git/pull/689 diff.c | 36 +++++++++++++++++++++++++++++------- t/t4015-diff-whitespace.sh | 8 +++++++- 2 files changed, 36 insertions(+), 8 deletions(-) diff --git a/diff.c b/diff.c index d24aaa3047..9868f82ece 100644 --- a/diff.c +++ b/diff.c @@ -3153,16 +3153,19 @@ static void show_dirstat_by_line(struct diffstat_t *data, struct diff_options *o gather_dirstat(options, &dir, changed, "", 0); } +static void free_diffstat_file(struct diffstat_file *f) +{ + free(f->print_name); + free(f->name); + free(f->from_name); + free(f); +} + void free_diffstat_info(struct diffstat_t *diffstat) { int i; - for (i = 0; i < diffstat->nr; i++) { - struct diffstat_file *f = diffstat->files[i]; - free(f->print_name); - free(f->name); - free(f->from_name); - free(f); - } + for (i = 0; i < diffstat->nr; i++) + free_diffstat_file(diffstat->files[i]); free(diffstat->files); } @@ -3718,6 +3721,25 @@ static void builtin_diffstat(const char *name_a, const char *name_b, if (xdi_diff_outf(&mf1, &mf2, discard_hunk_line, diffstat_consume, diffstat, &xpp, &xecfg)) die("unable to generate diffstat for %s", one->path); + + if (DIFF_FILE_VALID(one) && DIFF_FILE_VALID(two)) { + struct diffstat_file *file = + diffstat->files[diffstat->nr - 1]; + /* + * Omit diffstats of modified files where nothing changed. + * Even if !same_contents, this might be the case due to + * ignoring whitespace changes, etc. + * + * But note that we special-case additions and deletions, + * as adding an empty file, for example is still of interest. + */ + if (p->status == DIFF_STATUS_MODIFIED + && !file->added + && !file->deleted) { + free_diffstat_file(file); + diffstat->nr--; + } + } } diff_free_filespec_data(one); diff --git a/t/t4015-diff-whitespace.sh b/t/t4015-diff-whitespace.sh index 88d3026894..32c1b967f9 100755 --- a/t/t4015-diff-whitespace.sh +++ b/t/t4015-diff-whitespace.sh @@ -789,7 +789,7 @@ test_expect_success 'checkdiff allows new blank lines' ' git diff --check ' -test_expect_success 'whitespace-only changes not reported' ' +test_expect_success 'whitespace-only changes not reported (diff)' ' git reset --hard && echo >x "hello world" && git add x && @@ -799,6 +799,12 @@ test_expect_success 'whitespace-only changes not reported' ' test_must_be_empty actual ' +test_expect_success 'whitespace-only changes not reported (diffstat)' ' + # reuse state from previous test + git diff --stat -b >actual && + test_must_be_empty actual +' + test_expect_success 'whitespace-only changes reported across renames' ' git reset --hard && for i in 1 2 3 4 5 6 7 8 9; do echo "$i$i$i$i$i$i"; done >x && base-commit: e8ab941b671da6890181aea5b5755d1d9eea24ec -- gitgitgadget ^ permalink raw reply related [flat|nested] 9+ messages in thread
* Re: [PATCH] diff: teach --stat to ignore uninteresting modifications 2020-08-01 23:33 [PATCH] diff: teach --stat to ignore uninteresting modifications Matthew Rogers via GitGitGadget @ 2020-08-02 0:31 ` Junio C Hamano 2020-08-02 1:51 ` Matt Rogers 2020-08-14 22:45 ` [PATCH v2] " Matthew Rogers via GitGitGadget 1 sibling, 1 reply; 9+ messages in thread From: Junio C Hamano @ 2020-08-02 0:31 UTC (permalink / raw) To: Matthew Rogers via GitGitGadget; +Cc: git, peff, Matthew Rogers "Matthew Rogers via GitGitGadget" <gitgitgadget@gmail.com> writes: > + if (DIFF_FILE_VALID(one) && DIFF_FILE_VALID(two)) { > + struct diffstat_file *file = > + diffstat->files[diffstat->nr - 1]; > + /* > + * Omit diffstats of modified files where nothing changed. > + * Even if !same_contents, this might be the case due to > + * ignoring whitespace changes, etc. > + * > + * But note that we special-case additions and deletions, > + * as adding an empty file, for example is still of interest. > + */ > + if (p->status == DIFF_STATUS_MODIFIED > + && !file->added > + && !file->deleted) { > + free_diffstat_file(file); > + diffstat->nr--; > + } > + } > } > > diff_free_filespec_data(one); There are some "trailing whitespace" errors detected by "git am/apply". > diff --git a/t/t4015-diff-whitespace.sh b/t/t4015-diff-whitespace.sh > index 88d3026894..32c1b967f9 100755 > --- a/t/t4015-diff-whitespace.sh > +++ b/t/t4015-diff-whitespace.sh > @@ -789,7 +789,7 @@ test_expect_success 'checkdiff allows new blank lines' ' > git diff --check > ' > > -test_expect_success 'whitespace-only changes not reported' ' > +test_expect_success 'whitespace-only changes not reported (diff)' ' > git reset --hard && > echo >x "hello world" && > git add x && > @@ -799,6 +799,12 @@ test_expect_success 'whitespace-only changes not reported' ' > test_must_be_empty actual > ' > > +test_expect_success 'whitespace-only changes not reported (diffstat)' ' > + # reuse state from previous test > + git diff --stat -b >actual && > + test_must_be_empty actual > +' > + This is a "let's show off our shiny new toy" test, which shows that the code change covered the case you are interested in changing. We'd also need tests that makes sure that the effect of the code change is not seen when it should not trigger. For example, if we further change mode bits of file 'x' (which has a whitespace-only changes applied in the test in the previous hunk), e.g. git update-index --chmod=+x x && git diff --stat -b --cached >actual should that be counted as a file with 0-line change that is worth reporting, or is it hidden? I _think_ the new code will do a wrong thing here. That is, - If the change truly is only mode bits and one and two have "same_contents", the new code is bypassed, and we'll continue to show "0 lines changed, but the file is worth reporting". - If the change is whitespace-only change plus mode bits, i.e. one and two do not have "same_contents", the new code triggers and the stat output is suppressed for the path. ^ permalink raw reply [flat|nested] 9+ messages in thread
* Re: [PATCH] diff: teach --stat to ignore uninteresting modifications 2020-08-02 0:31 ` Junio C Hamano @ 2020-08-02 1:51 ` Matt Rogers 0 siblings, 0 replies; 9+ messages in thread From: Matt Rogers @ 2020-08-02 1:51 UTC (permalink / raw) To: Junio C Hamano Cc: Matthew Rogers via GitGitGadget, Git Mailing List, Jeff King > > For example, if we further change mode bits of file 'x' (which has a > whitespace-only changes applied in the test in the previous hunk), > e.g. > > git update-index --chmod=+x x && > git diff --stat -b --cached >actual > > should that be counted as a file with 0-line change that is worth > reporting, or is it hidden? I _think_ the new code will do a wrong > thing here. That is, > > - If the change truly is only mode bits and one and two have > "same_contents", the new code is bypassed, and we'll continue to > show "0 lines changed, but the file is worth reporting". > This first one is the behavior I was going for, I think it's most reasonable > - If the change is whitespace-only change plus mode bits, i.e. one > and two do not have "same_contents", the new code triggers and > the stat output is suppressed for the path. > I just ran some tests on my end, and you're correct here, I'll try to come up with a reroll that addresses this and adds this as a test condition. -- Matthew Rogers ^ permalink raw reply [flat|nested] 9+ messages in thread
* [PATCH v2] diff: teach --stat to ignore uninteresting modifications 2020-08-01 23:33 [PATCH] diff: teach --stat to ignore uninteresting modifications Matthew Rogers via GitGitGadget 2020-08-02 0:31 ` Junio C Hamano @ 2020-08-14 22:45 ` Matthew Rogers via GitGitGadget 2020-08-17 21:39 ` Junio C Hamano 2020-08-20 0:41 ` [PATCH v3] " Matthew Rogers via GitGitGadget 1 sibling, 2 replies; 9+ messages in thread From: Matthew Rogers via GitGitGadget @ 2020-08-14 22:45 UTC (permalink / raw) To: git; +Cc: peff, Matthew Rogers, Matthew Rogers From: Matthew Rogers <mattr94@gmail.com> Sometimes when diffing, files may show as being momdified even when there are no interesting diffs to show. This happens naturally when using options such as --ignore-space-change. We don't want to prevent the display of all files that have 0 effective diffs since they could be the result of a rename, permission change, or other similar operation that may still be of interest so we special case additions and deletions as they are always interesting. Signed-off-by: Matthew Rogers <mattr94@gmail.com> --- diff: teach --stat to ignore uninteresting modifications This patch is based on the discussion these email threads: https://lore.kernel.org/git/1484704915.2096.16.camel@mattmccutchen.net/ https://lore.kernel.org/git/CAOjrSZtQPQ8Xxuz+7SGykR8Q-gFDEZANSE5yQASqKjpbUAq_5Q@mail.gmail.com/ With the code mostly taken from this specific message: https://lore.kernel.org/git/20170118111705.6bqzkklluikda3r5@sigill.intra.peff.net/ The summary is that when running git diff --stat in combination with --ignore-all-space or similar options, you'll see many lines of the form: some-file.txt | 0 which can be misleading when you are explicitly telling git to "ignore all space" or something similar. To rectify this issue, this patch categorizes all files that are modified but have no effective changes as not fit to display to the user. New in V2: * I've added a test covering the rename case with whitespace-changes and permissions changes * I've also updated the logic in builtin_diffstat to include that logic as well Published-As: https://github.com/gitgitgadget/git/releases/tag/pr-689%2FROGERSM94%2Fzero-diffs-v2 Fetch-It-Via: git fetch https://github.com/gitgitgadget/git pr-689/ROGERSM94/zero-diffs-v2 Pull-Request: https://github.com/gitgitgadget/git/pull/689 Range-diff vs v1: 1: 640c864ac4 ! 1: 6c5db18618 diff: teach --stat to ignore uninteresting modifications @@ diff.c: static void builtin_diffstat(const char *name_a, const char *name_b, + * But note that we special-case additions and deletions, + * as adding an empty file, for example is still of interest. + */ -+ if (p->status == DIFF_STATUS_MODIFIED ++ if ((p->status == DIFF_STATUS_MODIFIED) + && !file->added -+ && !file->deleted) { ++ && !file->deleted ++ && one->mode == two->mode) { + free_diffstat_file(file); + diffstat->nr--; + } @@ t/t4015-diff-whitespace.sh: test_expect_success 'whitespace-only changes not rep test_must_be_empty actual ' +-test_expect_success 'whitespace-only changes reported across renames' ' +test_expect_success 'whitespace-only changes not reported (diffstat)' ' + # reuse state from previous test + git diff --stat -b >actual && + test_must_be_empty actual +' + - test_expect_success 'whitespace-only changes reported across renames' ' ++test_expect_success 'whitespace changes with modification reported (diffstat)' ' ++ git reset --hard && ++ echo >x "hello world" && ++ git update-index --chmod=+x x && ++ git diff --stat --cached -b >actual && ++ cat <<-EOF >expect && ++ x | 0 ++ 1 file changed, 0 insertions(+), 0 deletions(-) ++ EOF ++ test_cmp expect actual ++' ++ ++test_expect_success 'whitespace-only changes reported across renames (diffstat)' ' git reset --hard && for i in 1 2 3 4 5 6 7 8 9; do echo "$i$i$i$i$i$i"; done >x && + git add x && ++ git commit -m "base" && ++ sed -e "5s/^/ /" x >z && ++ git rm x && ++ git add z && ++ git diff -w -M --cached --stat >actual && ++ cat <<-EOF >expect && ++ x => z | 0 ++ 1 file changed, 0 insertions(+), 0 deletions(-) ++ EOF ++ test_cmp expect actual ++' ++ ++test_expect_success 'whitespace-only changes reported across renames' ' ++ git reset --hard HEAD~1 && ++ for i in 1 2 3 4 5 6 7 8 9; do echo "$i$i$i$i$i$i"; done >x && ++ git add x && + hash_x=$(git hash-object x) && + before=$(git rev-parse --short "$hash_x") && + git commit -m "base" && diff.c | 37 ++++++++++++++++++++++++++++++------- t/t4015-diff-whitespace.sh | 38 ++++++++++++++++++++++++++++++++++++-- 2 files changed, 66 insertions(+), 9 deletions(-) diff --git a/diff.c b/diff.c index f9709de7b4..131903fa3a 100644 --- a/diff.c +++ b/diff.c @@ -3153,16 +3153,19 @@ static void show_dirstat_by_line(struct diffstat_t *data, struct diff_options *o gather_dirstat(options, &dir, changed, "", 0); } +static void free_diffstat_file(struct diffstat_file *f) +{ + free(f->print_name); + free(f->name); + free(f->from_name); + free(f); +} + void free_diffstat_info(struct diffstat_t *diffstat) { int i; - for (i = 0; i < diffstat->nr; i++) { - struct diffstat_file *f = diffstat->files[i]; - free(f->print_name); - free(f->name); - free(f->from_name); - free(f); - } + for (i = 0; i < diffstat->nr; i++) + free_diffstat_file(diffstat->files[i]); free(diffstat->files); } @@ -3718,6 +3721,26 @@ static void builtin_diffstat(const char *name_a, const char *name_b, if (xdi_diff_outf(&mf1, &mf2, discard_hunk_line, diffstat_consume, diffstat, &xpp, &xecfg)) die("unable to generate diffstat for %s", one->path); + + if (DIFF_FILE_VALID(one) && DIFF_FILE_VALID(two)) { + struct diffstat_file *file = + diffstat->files[diffstat->nr - 1]; + /* + * Omit diffstats of modified files where nothing changed. + * Even if !same_contents, this might be the case due to + * ignoring whitespace changes, etc. + * + * But note that we special-case additions and deletions, + * as adding an empty file, for example is still of interest. + */ + if ((p->status == DIFF_STATUS_MODIFIED) + && !file->added + && !file->deleted + && one->mode == two->mode) { + free_diffstat_file(file); + diffstat->nr--; + } + } } diff_free_filespec_data(one); diff --git a/t/t4015-diff-whitespace.sh b/t/t4015-diff-whitespace.sh index 88d3026894..8bdaa0a693 100755 --- a/t/t4015-diff-whitespace.sh +++ b/t/t4015-diff-whitespace.sh @@ -789,7 +789,7 @@ test_expect_success 'checkdiff allows new blank lines' ' git diff --check ' -test_expect_success 'whitespace-only changes not reported' ' +test_expect_success 'whitespace-only changes not reported (diff)' ' git reset --hard && echo >x "hello world" && git add x && @@ -799,10 +799,44 @@ test_expect_success 'whitespace-only changes not reported' ' test_must_be_empty actual ' -test_expect_success 'whitespace-only changes reported across renames' ' +test_expect_success 'whitespace-only changes not reported (diffstat)' ' + # reuse state from previous test + git diff --stat -b >actual && + test_must_be_empty actual +' + +test_expect_success 'whitespace changes with modification reported (diffstat)' ' + git reset --hard && + echo >x "hello world" && + git update-index --chmod=+x x && + git diff --stat --cached -b >actual && + cat <<-EOF >expect && + x | 0 + 1 file changed, 0 insertions(+), 0 deletions(-) + EOF + test_cmp expect actual +' + +test_expect_success 'whitespace-only changes reported across renames (diffstat)' ' git reset --hard && for i in 1 2 3 4 5 6 7 8 9; do echo "$i$i$i$i$i$i"; done >x && git add x && + git commit -m "base" && + sed -e "5s/^/ /" x >z && + git rm x && + git add z && + git diff -w -M --cached --stat >actual && + cat <<-EOF >expect && + x => z | 0 + 1 file changed, 0 insertions(+), 0 deletions(-) + EOF + test_cmp expect actual +' + +test_expect_success 'whitespace-only changes reported across renames' ' + git reset --hard HEAD~1 && + for i in 1 2 3 4 5 6 7 8 9; do echo "$i$i$i$i$i$i"; done >x && + git add x && hash_x=$(git hash-object x) && before=$(git rev-parse --short "$hash_x") && git commit -m "base" && base-commit: 878e727637ec5815ccb3301eb994a54df95b21b8 -- gitgitgadget ^ permalink raw reply related [flat|nested] 9+ messages in thread
* Re: [PATCH v2] diff: teach --stat to ignore uninteresting modifications 2020-08-14 22:45 ` [PATCH v2] " Matthew Rogers via GitGitGadget @ 2020-08-17 21:39 ` Junio C Hamano 2020-08-17 22:37 ` Matt Rogers 2020-08-20 0:41 ` [PATCH v3] " Matthew Rogers via GitGitGadget 1 sibling, 1 reply; 9+ messages in thread From: Junio C Hamano @ 2020-08-17 21:39 UTC (permalink / raw) To: Matthew Rogers via GitGitGadget; +Cc: git, peff, Matthew Rogers "Matthew Rogers via GitGitGadget" <gitgitgadget@gmail.com> writes: > From: Matthew Rogers <mattr94@gmail.com> > > Sometimes when diffing, files may show as being momdified even when momdified? mummified? ah, modified. > there are no interesting diffs to show. This happens naturally when > using options such as --ignore-space-change. Read the next paragraph and notice that it explains the cases where the patch does not want not to show, and then read the above again to realize that the above does not say anything about what it wants to do to cases the next paragraph does not cover. It only says such a case often happens when --ignore-space-change is used. When options like --ignore-space-change is in use, files with modification can have no interesting textual changes worth showing. In such cases, "git diff --stat" shows 0 lines of additions and deletions. Teach "git diff --stat" not to show such a path in its output, which would be more natural. perhaps? > We don't want to prevent > the display of all files that have 0 effective diffs since they could > be the result of a rename, permission change, or other similar operation > that may still be of interest so we special case additions and deletions > as they are always interesting. Yup. That makes sense. It would be nice if this does not have to be implemented as a list of exceptions, though. Rather, a more targetted "omit output only in this narrow case" would be nicer, but the check with the mode bits should do at lesat for now. > diff --git a/diff.c b/diff.c > index f9709de7b4..131903fa3a 100644 > --- a/diff.c > +++ b/diff.c > @@ -3153,16 +3153,19 @@ static void show_dirstat_by_line(struct diffstat_t *data, struct diff_options *o > gather_dirstat(options, &dir, changed, "", 0); > } > > +static void free_diffstat_file(struct diffstat_file *f) > +{ > + free(f->print_name); > + free(f->name); > + free(f->from_name); > + free(f); > +} > + > void free_diffstat_info(struct diffstat_t *diffstat) > { > int i; > - for (i = 0; i < diffstat->nr; i++) { > - struct diffstat_file *f = diffstat->files[i]; > - free(f->print_name); > - free(f->name); > - free(f->from_name); > - free(f); > - } > + for (i = 0; i < diffstat->nr; i++) > + free_diffstat_file(diffstat->files[i]); > free(diffstat->files); > } > > @@ -3718,6 +3721,26 @@ static void builtin_diffstat(const char *name_a, const char *name_b, > if (xdi_diff_outf(&mf1, &mf2, discard_hunk_line, > diffstat_consume, diffstat, &xpp, &xecfg)) > die("unable to generate diffstat for %s", one->path); > + > + if (DIFF_FILE_VALID(one) && DIFF_FILE_VALID(two)) { > + struct diffstat_file *file = > + diffstat->files[diffstat->nr - 1]; > + /* > + * Omit diffstats of modified files where nothing changed. > + * Even if !same_contents, this might be the case due to > + * ignoring whitespace changes, etc. > + * > + * But note that we special-case additions and deletions, * renames and mode changes without any content changes, > + * as adding an empty file, for example is still of interest. > + */ > + if ((p->status == DIFF_STATUS_MODIFIED) > + && !file->added > + && !file->deleted > + && one->mode == two->mode) { > + free_diffstat_file(file); > + diffstat->nr--; > + } > + } > } > > diff_free_filespec_data(one); > diff --git a/t/t4015-diff-whitespace.sh b/t/t4015-diff-whitespace.sh > index 88d3026894..8bdaa0a693 100755 > --- a/t/t4015-diff-whitespace.sh > +++ b/t/t4015-diff-whitespace.sh > @@ -789,7 +789,7 @@ test_expect_success 'checkdiff allows new blank lines' ' > git diff --check > ' > > -test_expect_success 'whitespace-only changes not reported' ' > +test_expect_success 'whitespace-only changes not reported (diff)' ' > git reset --hard && > echo >x "hello world" && > git add x && > @@ -799,10 +799,44 @@ test_expect_success 'whitespace-only changes not reported' ' > test_must_be_empty actual > ' > > -test_expect_success 'whitespace-only changes reported across renames' ' > +test_expect_success 'whitespace-only changes not reported (diffstat)' ' > + # reuse state from previous test > + git diff --stat -b >actual && > + test_must_be_empty actual > +' > + > +test_expect_success 'whitespace changes with modification reported (diffstat)' ' > + git reset --hard && > + echo >x "hello world" && > + git update-index --chmod=+x x && > + git diff --stat --cached -b >actual && > + cat <<-EOF >expect && > + x | 0 > + 1 file changed, 0 insertions(+), 0 deletions(-) > + EOF > + test_cmp expect actual > +' > + > +test_expect_success 'whitespace-only changes reported across renames (diffstat)' ' > git reset --hard && > for i in 1 2 3 4 5 6 7 8 9; do echo "$i$i$i$i$i$i"; done >x && > git add x && > + git commit -m "base" && > + sed -e "5s/^/ /" x >z && > + git rm x && > + git add z && > + git diff -w -M --cached --stat >actual && > + cat <<-EOF >expect && > + x => z | 0 > + 1 file changed, 0 insertions(+), 0 deletions(-) > + EOF > + test_cmp expect actual > +' > + > +test_expect_success 'whitespace-only changes reported across renames' ' > + git reset --hard HEAD~1 && > + for i in 1 2 3 4 5 6 7 8 9; do echo "$i$i$i$i$i$i"; done >x && > + git add x && > hash_x=$(git hash-object x) && > before=$(git rev-parse --short "$hash_x") && > git commit -m "base" && > > base-commit: 878e727637ec5815ccb3301eb994a54df95b21b8 ^ permalink raw reply [flat|nested] 9+ messages in thread
* Re: [PATCH v2] diff: teach --stat to ignore uninteresting modifications 2020-08-17 21:39 ` Junio C Hamano @ 2020-08-17 22:37 ` Matt Rogers 2020-08-17 22:37 ` Matt Rogers 0 siblings, 1 reply; 9+ messages in thread From: Matt Rogers @ 2020-08-17 22:37 UTC (permalink / raw) To: Junio C Hamano Cc: Matthew Rogers via GitGitGadget, Git Mailing List, Jeff King On Mon, Aug 17, 2020 at 5:39 PM Junio C Hamano <gitster@pobox.com> wrote: > > "Matthew Rogers via GitGitGadget" <gitgitgadget@gmail.com> writes: > > > From: Matthew Rogers <mattr94@gmail.com> > > > > Sometimes when diffing, files may show as being momdified even when > > momdified? mummified? ah, modified. > > > there are no interesting diffs to show. This happens naturally when > > using options such as --ignore-space-change. > > Read the next paragraph and notice that it explains the cases where > the patch does not want not to show, and then read the above again > to realize that the above does not say anything about what it wants > to do to cases the next paragraph does not cover. It only says such > a case often happens when --ignore-space-change is used. > > When options like --ignore-space-change is in use, files > with modification can have no interesting textual changes > worth showing. In such cases, "git diff --stat" shows 0 > lines of additions and deletions. Teach "git diff --stat" > not to show such a path in its output, which would be more > natural. > > perhaps? > > > We don't want to prevent > > the display of all files that have 0 effective diffs since they could > > be the result of a rename, permission change, or other similar operation > > that may still be of interest so we special case additions and deletions > > as they are always interesting. > > Yup. That makes sense. I'll send a reroll with the message improved as you suggested, as well as updating > > It would be nice if this does not have to be implemented as a list > of exceptions, though. Rather, a more targetted "omit output only > in this narrow case" would be nicer, but the check with the mode > bits should do at lesat for now. > > > diff --git a/diff.c b/diff.c > > index f9709de7b4..131903fa3a 100644 > > --- a/diff.c > > +++ b/diff.c > > @@ -3153,16 +3153,19 @@ static void show_dirstat_by_line(struct diffstat_t *data, struct diff_options *o > > gather_dirstat(options, &dir, changed, "", 0); > > } > > > > +static void free_diffstat_file(struct diffstat_file *f) > > +{ > > + free(f->print_name); > > + free(f->name); > > + free(f->from_name); > > + free(f); > > +} > > + > > void free_diffstat_info(struct diffstat_t *diffstat) > > { > > int i; > > - for (i = 0; i < diffstat->nr; i++) { > > - struct diffstat_file *f = diffstat->files[i]; > > - free(f->print_name); > > - free(f->name); > > - free(f->from_name); > > - free(f); > > - } > > + for (i = 0; i < diffstat->nr; i++) > > + free_diffstat_file(diffstat->files[i]); > > free(diffstat->files); > > } > > > > @@ -3718,6 +3721,26 @@ static void builtin_diffstat(const char *name_a, const char *name_b, > > if (xdi_diff_outf(&mf1, &mf2, discard_hunk_line, > > diffstat_consume, diffstat, &xpp, &xecfg)) > > die("unable to generate diffstat for %s", one->path); > > + > > + if (DIFF_FILE_VALID(one) && DIFF_FILE_VALID(two)) { > > + struct diffstat_file *file = > > + diffstat->files[diffstat->nr - 1]; > > + /* > > + * Omit diffstats of modified files where nothing changed. > > + * Even if !same_contents, this might be the case due to > > + * ignoring whitespace changes, etc. > > + * > > + * But note that we special-case additions and deletions, > > * renames and mode changes without any content changes, > > > + * as adding an empty file, for example is still of interest. > > + */ > > + if ((p->status == DIFF_STATUS_MODIFIED) > > + && !file->added > > + && !file->deleted > > + && one->mode == two->mode) { > > + free_diffstat_file(file); > > + diffstat->nr--; > > + } > > + } > > } > > > > diff_free_filespec_data(one); > > diff --git a/t/t4015-diff-whitespace.sh b/t/t4015-diff-whitespace.sh > > index 88d3026894..8bdaa0a693 100755 > > --- a/t/t4015-diff-whitespace.sh > > +++ b/t/t4015-diff-whitespace.sh > > @@ -789,7 +789,7 @@ test_expect_success 'checkdiff allows new blank lines' ' > > git diff --check > > ' > > > > -test_expect_success 'whitespace-only changes not reported' ' > > +test_expect_success 'whitespace-only changes not reported (diff)' ' > > git reset --hard && > > echo >x "hello world" && > > git add x && > > @@ -799,10 +799,44 @@ test_expect_success 'whitespace-only changes not reported' ' > > test_must_be_empty actual > > ' > > > > -test_expect_success 'whitespace-only changes reported across renames' ' > > +test_expect_success 'whitespace-only changes not reported (diffstat)' ' > > + # reuse state from previous test > > + git diff --stat -b >actual && > > + test_must_be_empty actual > > +' > > + > > +test_expect_success 'whitespace changes with modification reported (diffstat)' ' > > + git reset --hard && > > + echo >x "hello world" && > > + git update-index --chmod=+x x && > > + git diff --stat --cached -b >actual && > > + cat <<-EOF >expect && > > + x | 0 > > + 1 file changed, 0 insertions(+), 0 deletions(-) > > + EOF > > + test_cmp expect actual > > +' > > + > > +test_expect_success 'whitespace-only changes reported across renames (diffstat)' ' > > git reset --hard && > > for i in 1 2 3 4 5 6 7 8 9; do echo "$i$i$i$i$i$i"; done >x && > > git add x && > > + git commit -m "base" && > > + sed -e "5s/^/ /" x >z && > > + git rm x && > > + git add z && > > + git diff -w -M --cached --stat >actual && > > + cat <<-EOF >expect && > > + x => z | 0 > > + 1 file changed, 0 insertions(+), 0 deletions(-) > > + EOF > > + test_cmp expect actual > > +' > > + > > +test_expect_success 'whitespace-only changes reported across renames' ' > > + git reset --hard HEAD~1 && > > + for i in 1 2 3 4 5 6 7 8 9; do echo "$i$i$i$i$i$i"; done >x && > > + git add x && > > hash_x=$(git hash-object x) && > > before=$(git rev-parse --short "$hash_x") && > > git commit -m "base" && > > > > base-commit: 878e727637ec5815ccb3301eb994a54df95b21b8 -- Matthew Rogers ^ permalink raw reply [flat|nested] 9+ messages in thread
* Re: [PATCH v2] diff: teach --stat to ignore uninteresting modifications 2020-08-17 22:37 ` Matt Rogers @ 2020-08-17 22:37 ` Matt Rogers 0 siblings, 0 replies; 9+ messages in thread From: Matt Rogers @ 2020-08-17 22:37 UTC (permalink / raw) To: Junio C Hamano Cc: Matthew Rogers via GitGitGadget, Git Mailing List, Jeff King ...As well as updating the comment. (sorry for the double mail I fat fingered ctrl+enter while typing) On Mon, Aug 17, 2020 at 6:37 PM Matt Rogers <mattr94@gmail.com> wrote: > > On Mon, Aug 17, 2020 at 5:39 PM Junio C Hamano <gitster@pobox.com> wrote: > > > > "Matthew Rogers via GitGitGadget" <gitgitgadget@gmail.com> writes: > > > > > From: Matthew Rogers <mattr94@gmail.com> > > > > > > Sometimes when diffing, files may show as being momdified even when > > > > momdified? mummified? ah, modified. > > > > > there are no interesting diffs to show. This happens naturally when > > > using options such as --ignore-space-change. > > > > Read the next paragraph and notice that it explains the cases where > > the patch does not want not to show, and then read the above again > > to realize that the above does not say anything about what it wants > > to do to cases the next paragraph does not cover. It only says such > > a case often happens when --ignore-space-change is used. > > > > When options like --ignore-space-change is in use, files > > with modification can have no interesting textual changes > > worth showing. In such cases, "git diff --stat" shows 0 > > lines of additions and deletions. Teach "git diff --stat" > > not to show such a path in its output, which would be more > > natural. > > > > perhaps? > > > > > We don't want to prevent > > > the display of all files that have 0 effective diffs since they could > > > be the result of a rename, permission change, or other similar operation > > > that may still be of interest so we special case additions and deletions > > > as they are always interesting. > > > > Yup. That makes sense. > > I'll send a reroll with the message improved as you suggested, as well > as updating > > > > > It would be nice if this does not have to be implemented as a list > > of exceptions, though. Rather, a more targetted "omit output only > > in this narrow case" would be nicer, but the check with the mode > > bits should do at lesat for now. > > > > > diff --git a/diff.c b/diff.c > > > index f9709de7b4..131903fa3a 100644 > > > --- a/diff.c > > > +++ b/diff.c > > > @@ -3153,16 +3153,19 @@ static void show_dirstat_by_line(struct diffstat_t *data, struct diff_options *o > > > gather_dirstat(options, &dir, changed, "", 0); > > > } > > > > > > +static void free_diffstat_file(struct diffstat_file *f) > > > +{ > > > + free(f->print_name); > > > + free(f->name); > > > + free(f->from_name); > > > + free(f); > > > +} > > > + > > > void free_diffstat_info(struct diffstat_t *diffstat) > > > { > > > int i; > > > - for (i = 0; i < diffstat->nr; i++) { > > > - struct diffstat_file *f = diffstat->files[i]; > > > - free(f->print_name); > > > - free(f->name); > > > - free(f->from_name); > > > - free(f); > > > - } > > > + for (i = 0; i < diffstat->nr; i++) > > > + free_diffstat_file(diffstat->files[i]); > > > free(diffstat->files); > > > } > > > > > > @@ -3718,6 +3721,26 @@ static void builtin_diffstat(const char *name_a, const char *name_b, > > > if (xdi_diff_outf(&mf1, &mf2, discard_hunk_line, > > > diffstat_consume, diffstat, &xpp, &xecfg)) > > > die("unable to generate diffstat for %s", one->path); > > > + > > > + if (DIFF_FILE_VALID(one) && DIFF_FILE_VALID(two)) { > > > + struct diffstat_file *file = > > > + diffstat->files[diffstat->nr - 1]; > > > + /* > > > + * Omit diffstats of modified files where nothing changed. > > > + * Even if !same_contents, this might be the case due to > > > + * ignoring whitespace changes, etc. > > > + * > > > + * But note that we special-case additions and deletions, > > > > * renames and mode changes without any content changes, > > > > > + * as adding an empty file, for example is still of interest. > > > + */ > > > + if ((p->status == DIFF_STATUS_MODIFIED) > > > + && !file->added > > > + && !file->deleted > > > + && one->mode == two->mode) { > > > + free_diffstat_file(file); > > > + diffstat->nr--; > > > + } > > > + } > > > } > > > > > > diff_free_filespec_data(one); > > > diff --git a/t/t4015-diff-whitespace.sh b/t/t4015-diff-whitespace.sh > > > index 88d3026894..8bdaa0a693 100755 > > > --- a/t/t4015-diff-whitespace.sh > > > +++ b/t/t4015-diff-whitespace.sh > > > @@ -789,7 +789,7 @@ test_expect_success 'checkdiff allows new blank lines' ' > > > git diff --check > > > ' > > > > > > -test_expect_success 'whitespace-only changes not reported' ' > > > +test_expect_success 'whitespace-only changes not reported (diff)' ' > > > git reset --hard && > > > echo >x "hello world" && > > > git add x && > > > @@ -799,10 +799,44 @@ test_expect_success 'whitespace-only changes not reported' ' > > > test_must_be_empty actual > > > ' > > > > > > -test_expect_success 'whitespace-only changes reported across renames' ' > > > +test_expect_success 'whitespace-only changes not reported (diffstat)' ' > > > + # reuse state from previous test > > > + git diff --stat -b >actual && > > > + test_must_be_empty actual > > > +' > > > + > > > +test_expect_success 'whitespace changes with modification reported (diffstat)' ' > > > + git reset --hard && > > > + echo >x "hello world" && > > > + git update-index --chmod=+x x && > > > + git diff --stat --cached -b >actual && > > > + cat <<-EOF >expect && > > > + x | 0 > > > + 1 file changed, 0 insertions(+), 0 deletions(-) > > > + EOF > > > + test_cmp expect actual > > > +' > > > + > > > +test_expect_success 'whitespace-only changes reported across renames (diffstat)' ' > > > git reset --hard && > > > for i in 1 2 3 4 5 6 7 8 9; do echo "$i$i$i$i$i$i"; done >x && > > > git add x && > > > + git commit -m "base" && > > > + sed -e "5s/^/ /" x >z && > > > + git rm x && > > > + git add z && > > > + git diff -w -M --cached --stat >actual && > > > + cat <<-EOF >expect && > > > + x => z | 0 > > > + 1 file changed, 0 insertions(+), 0 deletions(-) > > > + EOF > > > + test_cmp expect actual > > > +' > > > + > > > +test_expect_success 'whitespace-only changes reported across renames' ' > > > + git reset --hard HEAD~1 && > > > + for i in 1 2 3 4 5 6 7 8 9; do echo "$i$i$i$i$i$i"; done >x && > > > + git add x && > > > hash_x=$(git hash-object x) && > > > before=$(git rev-parse --short "$hash_x") && > > > git commit -m "base" && > > > > > > base-commit: 878e727637ec5815ccb3301eb994a54df95b21b8 > > > > -- > Matthew Rogers -- Matthew Rogers ^ permalink raw reply [flat|nested] 9+ messages in thread
* [PATCH v3] diff: teach --stat to ignore uninteresting modifications 2020-08-14 22:45 ` [PATCH v2] " Matthew Rogers via GitGitGadget 2020-08-17 21:39 ` Junio C Hamano @ 2020-08-20 0:41 ` Matthew Rogers via GitGitGadget 2020-08-20 0:56 ` Junio C Hamano 1 sibling, 1 reply; 9+ messages in thread From: Matthew Rogers via GitGitGadget @ 2020-08-20 0:41 UTC (permalink / raw) To: git; +Cc: peff, Matthew Rogers, Matthew Rogers From: Matthew Rogers <mattr94@gmail.com> When options such as --ignore-space-change are in use, files with modifications can have no interesting textual changes worth showing. In such cases, "git diff --stat" shows 0 lines of additions and deletions. Teach "git diff --stat" not to show such a path in its output, which would be more natural. However, we don't want to prevent the display of all files that have 0 effective diffs since they could be the result of a rename, permission change, or other similar operation that may still be of interest so we special case additions and deletions as they are always interesting. Signed-off-by: Matthew Rogers <mattr94@gmail.com> --- diff: teach --stat to ignore uninteresting modifications This patch is based on the discussion these email threads: https://lore.kernel.org/git/1484704915.2096.16.camel@mattmccutchen.net/ https://lore.kernel.org/git/CAOjrSZtQPQ8Xxuz+7SGykR8Q-gFDEZANSE5yQASqKjpbUAq_5Q@mail.gmail.com/ With the code mostly taken from this specific message: https://lore.kernel.org/git/20170118111705.6bqzkklluikda3r5@sigill.intra.peff.net/ The summary is that when running git diff --stat in combination with --ignore-all-space or similar options, you'll see many lines of the form: some-file.txt | 0 which can be misleading when you are explicitly telling git to "ignore all space" or something similar. To rectify this issue, this patch categorizes all files that are modified but have no effective changes as not fit to display to the user. New in V2: * I've added a test covering the rename case with whitespace-changes and permissions changes * I've also updated the logic in builtin_diffstat to include that logic as well Published-As: https://github.com/gitgitgadget/git/releases/tag/pr-689%2FROGERSM94%2Fzero-diffs-v3 Fetch-It-Via: git fetch https://github.com/gitgitgadget/git pr-689/ROGERSM94/zero-diffs-v3 Pull-Request: https://github.com/gitgitgadget/git/pull/689 Range-diff vs v2: 1: 6c5db18618 ! 1: 7c3113846e diff: teach --stat to ignore uninteresting modifications @@ Metadata ## Commit message ## diff: teach --stat to ignore uninteresting modifications - Sometimes when diffing, files may show as being momdified even when - there are no interesting diffs to show. This happens naturally when - using options such as --ignore-space-change. We don't want to prevent - the display of all files that have 0 effective diffs since they could - be the result of a rename, permission change, or other similar operation - that may still be of interest so we special case additions and deletions - as they are always interesting. + When options such as --ignore-space-change are in use, files with + modifications can have no interesting textual changes worth showing. In + such cases, "git diff --stat" shows 0 lines of additions and deletions. + Teach "git diff --stat" not to show such a path in its output, which + would be more natural. + + However, we don't want to prevent the display of all files that have 0 + effective diffs since they could be the result of a rename, permission + change, or other similar operation that may still be of interest so we + special case additions and deletions as they are always interesting. Signed-off-by: Matthew Rogers <mattr94@gmail.com> @@ diff.c: static void builtin_diffstat(const char *name_a, const char *name_b, + * Even if !same_contents, this might be the case due to + * ignoring whitespace changes, etc. + * -+ * But note that we special-case additions and deletions, -+ * as adding an empty file, for example is still of interest. ++ * But note that we special-case additions, deletions, ++ * renames, and mode changes as adding an empty file, ++ * for example is still of interest. + */ + if ((p->status == DIFF_STATUS_MODIFIED) + && !file->added diff.c | 38 +++++++++++++++++++++++++++++++------- t/t4015-diff-whitespace.sh | 38 ++++++++++++++++++++++++++++++++++++-- 2 files changed, 67 insertions(+), 9 deletions(-) diff --git a/diff.c b/diff.c index f9709de7b4..4f54b41395 100644 --- a/diff.c +++ b/diff.c @@ -3153,16 +3153,19 @@ static void show_dirstat_by_line(struct diffstat_t *data, struct diff_options *o gather_dirstat(options, &dir, changed, "", 0); } +static void free_diffstat_file(struct diffstat_file *f) +{ + free(f->print_name); + free(f->name); + free(f->from_name); + free(f); +} + void free_diffstat_info(struct diffstat_t *diffstat) { int i; - for (i = 0; i < diffstat->nr; i++) { - struct diffstat_file *f = diffstat->files[i]; - free(f->print_name); - free(f->name); - free(f->from_name); - free(f); - } + for (i = 0; i < diffstat->nr; i++) + free_diffstat_file(diffstat->files[i]); free(diffstat->files); } @@ -3718,6 +3721,27 @@ static void builtin_diffstat(const char *name_a, const char *name_b, if (xdi_diff_outf(&mf1, &mf2, discard_hunk_line, diffstat_consume, diffstat, &xpp, &xecfg)) die("unable to generate diffstat for %s", one->path); + + if (DIFF_FILE_VALID(one) && DIFF_FILE_VALID(two)) { + struct diffstat_file *file = + diffstat->files[diffstat->nr - 1]; + /* + * Omit diffstats of modified files where nothing changed. + * Even if !same_contents, this might be the case due to + * ignoring whitespace changes, etc. + * + * But note that we special-case additions, deletions, + * renames, and mode changes as adding an empty file, + * for example is still of interest. + */ + if ((p->status == DIFF_STATUS_MODIFIED) + && !file->added + && !file->deleted + && one->mode == two->mode) { + free_diffstat_file(file); + diffstat->nr--; + } + } } diff_free_filespec_data(one); diff --git a/t/t4015-diff-whitespace.sh b/t/t4015-diff-whitespace.sh index 88d3026894..8bdaa0a693 100755 --- a/t/t4015-diff-whitespace.sh +++ b/t/t4015-diff-whitespace.sh @@ -789,7 +789,7 @@ test_expect_success 'checkdiff allows new blank lines' ' git diff --check ' -test_expect_success 'whitespace-only changes not reported' ' +test_expect_success 'whitespace-only changes not reported (diff)' ' git reset --hard && echo >x "hello world" && git add x && @@ -799,10 +799,44 @@ test_expect_success 'whitespace-only changes not reported' ' test_must_be_empty actual ' -test_expect_success 'whitespace-only changes reported across renames' ' +test_expect_success 'whitespace-only changes not reported (diffstat)' ' + # reuse state from previous test + git diff --stat -b >actual && + test_must_be_empty actual +' + +test_expect_success 'whitespace changes with modification reported (diffstat)' ' + git reset --hard && + echo >x "hello world" && + git update-index --chmod=+x x && + git diff --stat --cached -b >actual && + cat <<-EOF >expect && + x | 0 + 1 file changed, 0 insertions(+), 0 deletions(-) + EOF + test_cmp expect actual +' + +test_expect_success 'whitespace-only changes reported across renames (diffstat)' ' git reset --hard && for i in 1 2 3 4 5 6 7 8 9; do echo "$i$i$i$i$i$i"; done >x && git add x && + git commit -m "base" && + sed -e "5s/^/ /" x >z && + git rm x && + git add z && + git diff -w -M --cached --stat >actual && + cat <<-EOF >expect && + x => z | 0 + 1 file changed, 0 insertions(+), 0 deletions(-) + EOF + test_cmp expect actual +' + +test_expect_success 'whitespace-only changes reported across renames' ' + git reset --hard HEAD~1 && + for i in 1 2 3 4 5 6 7 8 9; do echo "$i$i$i$i$i$i"; done >x && + git add x && hash_x=$(git hash-object x) && before=$(git rev-parse --short "$hash_x") && git commit -m "base" && base-commit: 878e727637ec5815ccb3301eb994a54df95b21b8 -- gitgitgadget ^ permalink raw reply related [flat|nested] 9+ messages in thread
* Re: [PATCH v3] diff: teach --stat to ignore uninteresting modifications 2020-08-20 0:41 ` [PATCH v3] " Matthew Rogers via GitGitGadget @ 2020-08-20 0:56 ` Junio C Hamano 0 siblings, 0 replies; 9+ messages in thread From: Junio C Hamano @ 2020-08-20 0:56 UTC (permalink / raw) To: Matthew Rogers via GitGitGadget; +Cc: git, peff, Matthew Rogers "Matthew Rogers via GitGitGadget" <gitgitgadget@gmail.com> writes: > From: Matthew Rogers <mattr94@gmail.com> > > When options such as --ignore-space-change are in use, files with > modifications can have no interesting textual changes worth showing. In > such cases, "git diff --stat" shows 0 lines of additions and deletions. > Teach "git diff --stat" not to show such a path in its output, which > would be more natural. > > However, we don't want to prevent the display of all files that have 0 > effective diffs since they could be the result of a rename, permission > change, or other similar operation that may still be of interest so we > special case additions and deletions as they are always interesting. > > Signed-off-by: Matthew Rogers <mattr94@gmail.com> > --- Looks good, thanks. Will queue. By the way, before making your commits, please make sure you do not have whitespace errors. I've let my "git am" to fix them, so no need to resend, but for future reference... .git/rebase-apply/patch:116: trailing whitespace. struct diffstat_file *file = .git/rebase-apply/patch:119: trailing whitespace. * Omit diffstats of modified files where nothing changed. .git/rebase-apply/patch:122: trailing whitespace. * .git/rebase-apply/patch:124: trailing whitespace. * renames, and mode changes as adding an empty file, .git/rebase-apply/patch:127: trailing whitespace. if ((p->status == DIFF_STATUS_MODIFIED) warning: 5 lines applied after fixing whitespace errors. Applying: diff: teach --stat to ignore uninteresting modifications ^ permalink raw reply [flat|nested] 9+ messages in thread
end of thread, other threads:[~2020-08-20 0:56 UTC | newest] Thread overview: 9+ messages (download: mbox.gz / follow: Atom feed) -- links below jump to the message on this page -- 2020-08-01 23:33 [PATCH] diff: teach --stat to ignore uninteresting modifications Matthew Rogers via GitGitGadget 2020-08-02 0:31 ` Junio C Hamano 2020-08-02 1:51 ` Matt Rogers 2020-08-14 22:45 ` [PATCH v2] " Matthew Rogers via GitGitGadget 2020-08-17 21:39 ` Junio C Hamano 2020-08-17 22:37 ` Matt Rogers 2020-08-17 22:37 ` Matt Rogers 2020-08-20 0:41 ` [PATCH v3] " Matthew Rogers via GitGitGadget 2020-08-20 0:56 ` Junio C Hamano
This is a public inbox, see mirroring instructions for how to clone and mirror all data and code used for this inbox; as well as URLs for NNTP newsgroup(s).