* [PATCH v2] checkpatch: Warn if missing author Signed-off-by
@ 2018-07-11 15:10 Geert Uytterhoeven
2018-07-11 16:20 ` Joe Perches
0 siblings, 1 reply; 5+ messages in thread
From: Geert Uytterhoeven @ 2018-07-11 15:10 UTC (permalink / raw)
To: Andy Whitcroft, Joe Perches
Cc: Jason Gunthorpe, Stephen Rothwell, Linus Walleij,
Yoshihiro Shimoda, Pavel Machek, linux-kernel,
Geert Uytterhoeven
Print a warning if none of the Signed-off-by lines cover the patch
author.
Non-ASCII quoted printable encoding in From: headers and (lack of)
double quotes are handled.
Split From: headers are not fully handled: only the first part is
compared.
Signed-off-by: Geert Uytterhoeven <geert+renesas@glider.be>
---
Tested using a set of ca. 4000 real world commits.
Most common offenders are people using:
- different email addresses for author and Sob,
- different firstname/lastname order, or other different name
spelling,
- suse.de vs. suse.com.
v2:
- Use "Encode" instead of "MIME::Words", as the former is a Perl core
module,
- Reduce level from error to warning.
---
scripts/checkpatch.pl | 27 ++++++++++++++++++++++++---
1 file changed, 24 insertions(+), 3 deletions(-)
diff --git a/scripts/checkpatch.pl b/scripts/checkpatch.pl
index 8d0bad190c25e2e0..9ad674516502c7e8 100755
--- a/scripts/checkpatch.pl
+++ b/scripts/checkpatch.pl
@@ -13,6 +13,7 @@ use POSIX;
use File::Basename;
use Cwd 'abs_path';
use Term::ANSIColor qw(:constants);
+use Encode qw(decode encode);
my $P = $0;
my $D = dirname(abs_path($P));
@@ -2236,6 +2237,8 @@ sub process {
our $clean = 1;
my $signoff = 0;
+ my $author = '';
+ my $authorsignoff = 0;
my $is_patch = 0;
my $in_header_lines = $file ? 0 : 1;
my $in_commit_log = 0; #Scanning lines before patch
@@ -2518,10 +2521,23 @@ sub process {
}
}
+# Check the patch for a From:
+ if (decode("MIME-Header", $line) =~ /^From:\s*(.*)/) {
+ $author = encode("utf8", $1);
+ $author =~ s/"//g;
+ }
+
# Check the patch for a signoff:
if ($line =~ /^\s*signed-off-by:/i) {
$signoff++;
$in_commit_log = 0;
+ if ($author ne '') {
+ my $l = $line;
+ $l =~ s/"//g;
+ if ($l =~ /^\s*signed-off-by: \Q$author\E/i) {
+ $authorsignoff = 1;
+ }
+ }
}
# Check if MAINTAINERS is being updated. If so, there's probably no need to
@@ -6487,9 +6503,14 @@ sub process {
ERROR("NOT_UNIFIED_DIFF",
"Does not appear to be a unified-diff format patch\n");
}
- if ($is_patch && $has_commit_log && $chk_signoff && $signoff == 0) {
- ERROR("MISSING_SIGN_OFF",
- "Missing Signed-off-by: line(s)\n");
+ if ($is_patch && $has_commit_log && $chk_signoff) {
+ if ($signoff == 0) {
+ ERROR("MISSING_SIGN_OFF",
+ "Missing Signed-off-by: line(s)\n");
+ } elsif (!$authorsignoff) {
+ WARN("NO_AUTHOR_SIGN_OFF",
+ "Missing Signed-off-by: line by patch author\n");
+ }
}
print report_dump();
--
2.17.1
^ permalink raw reply related [flat|nested] 5+ messages in thread
* Re: [PATCH v2] checkpatch: Warn if missing author Signed-off-by
2018-07-11 15:10 [PATCH v2] checkpatch: Warn if missing author Signed-off-by Geert Uytterhoeven
@ 2018-07-11 16:20 ` Joe Perches
2018-07-11 17:07 ` Geert Uytterhoeven
0 siblings, 1 reply; 5+ messages in thread
From: Joe Perches @ 2018-07-11 16:20 UTC (permalink / raw)
To: Geert Uytterhoeven, Andy Whitcroft
Cc: Jason Gunthorpe, Stephen Rothwell, Linus Walleij,
Yoshihiro Shimoda, Pavel Machek, linux-kernel
On Wed, 2018-07-11 at 17:10 +0200, Geert Uytterhoeven wrote:
> Print a warning if none of the Signed-off-by lines cover the patch
> author.
>
> Non-ASCII quoted printable encoding in From: headers and (lack of)
> double quotes are handled.
> Split From: headers are not fully handled: only the first part is
> compared.
[]
> diff --git a/scripts/checkpatch.pl b/scripts/checkpatch.pl
[]
> +# Check the patch for a From:
> + if (decode("MIME-Header", $line) =~ /^From:\s*(.*)/) {
> + $author = encode("utf8", $1);
> + $author =~ s/"//g;
> + }
> +
> # Check the patch for a signoff:
> if ($line =~ /^\s*signed-off-by:/i) {
> $signoff++;
> $in_commit_log = 0;
> + if ($author ne '') {
> + my $l = $line;
> + $l =~ s/"//g;
> + if ($l =~ /^\s*signed-off-by: \Q$author\E/i) {
> + $authorsignoff = 1;
> + }
> + }
> }
I don't see the point of removing the quotes.
If the name and email address don't exactly match,
why shouldn't it be reported?
> # Check if MAINTAINERS is being updated. If so, there's probably no need to
> @@ -6487,9 +6503,14 @@ sub process {
> ERROR("NOT_UNIFIED_DIFF",
> "Does not appear to be a unified-diff format patch\n");
> }
> - if ($is_patch && $has_commit_log && $chk_signoff && $signoff == 0) {
> - ERROR("MISSING_SIGN_OFF",
> - "Missing Signed-off-by: line(s)\n");
> + if ($is_patch && $has_commit_log && $chk_signoff) {
> + if ($signoff == 0) {
> + ERROR("MISSING_SIGN_OFF",
> + "Missing Signed-off-by: line(s)\n");
> + } elsif (!$authorsignoff) {
> + WARN("NO_AUTHOR_SIGN_OFF",
> + "Missing Signed-off-by: line by patch author\n");
Perhaps better to show the From: line author
"Missing 'Signed-off-by:' from nomimal patch author '$author'\n");
Another somewhat frequent Signed-off-by: defect pattern
is to have a lower-case name or no name at all used.
Perhaps it'd be useful to add some checks like
a single word for a name and all lower-case names:
e.g.:
Signed-off-by: joe perches <joe@perches.com>
emits
Unusual lower case name: 'joe perches <joe@perches.com>'
and
Signed-off-by: root <root@mydomain.net>
and
Signed-off-by: Root <root@mydomain.net>
and
Signed-off-by: root@mydomain.net
all emit that a single or missing name is unusual.
^ permalink raw reply [flat|nested] 5+ messages in thread
* Re: [PATCH v2] checkpatch: Warn if missing author Signed-off-by
2018-07-11 16:20 ` Joe Perches
@ 2018-07-11 17:07 ` Geert Uytterhoeven
2018-07-11 17:24 ` Joe Perches
0 siblings, 1 reply; 5+ messages in thread
From: Geert Uytterhoeven @ 2018-07-11 17:07 UTC (permalink / raw)
To: Joe Perches
Cc: Geert Uytterhoeven, Andy Whitcroft, jgg, Stephen Rothwell,
Linus Walleij, Yoshihiro Shimoda, Pavel Machek,
Linux Kernel Mailing List
Hi Joe,
On Wed, Jul 11, 2018 at 6:20 PM Joe Perches <joe@perches.com> wrote:
> On Wed, 2018-07-11 at 17:10 +0200, Geert Uytterhoeven wrote:
> > Print a warning if none of the Signed-off-by lines cover the patch
> > author.
> >
> > Non-ASCII quoted printable encoding in From: headers and (lack of)
> > double quotes are handled.
> > Split From: headers are not fully handled: only the first part is
> > compared.
> []
> > diff --git a/scripts/checkpatch.pl b/scripts/checkpatch.pl
> []
> > +# Check the patch for a From:
> > + if (decode("MIME-Header", $line) =~ /^From:\s*(.*)/) {
> > + $author = encode("utf8", $1);
> > + $author =~ s/"//g;
> > + }
> > +
> > # Check the patch for a signoff:
> > if ($line =~ /^\s*signed-off-by:/i) {
> > $signoff++;
> > $in_commit_log = 0;
> > + if ($author ne '') {
> > + my $l = $line;
> > + $l =~ s/"//g;
> > + if ($l =~ /^\s*signed-off-by: \Q$author\E/i) {
> > + $authorsignoff = 1;
> > + }
> > + }
> > }
>
> I don't see the point of removing the quotes.
If the name contains a dot (e.g. "David S. Miller"), it must be protected
by double quotes in the From: email header, but that is usually not the case
in the Sob line.
> If the name and email address don't exactly match,
> why shouldn't it be reported?
It's still the same email address.
> > # Check if MAINTAINERS is being updated. If so, there's probably no need to
> > @@ -6487,9 +6503,14 @@ sub process {
> > ERROR("NOT_UNIFIED_DIFF",
> > "Does not appear to be a unified-diff format patch\n");
> > }
> > - if ($is_patch && $has_commit_log && $chk_signoff && $signoff == 0) {
> > - ERROR("MISSING_SIGN_OFF",
> > - "Missing Signed-off-by: line(s)\n");
> > + if ($is_patch && $has_commit_log && $chk_signoff) {
> > + if ($signoff == 0) {
> > + ERROR("MISSING_SIGN_OFF",
> > + "Missing Signed-off-by: line(s)\n");
> > + } elsif (!$authorsignoff) {
> > + WARN("NO_AUTHOR_SIGN_OFF",
> > + "Missing Signed-off-by: line by patch author\n");
>
> Perhaps better to show the From: line author
>
> "Missing 'Signed-off-by:' from nomimal patch author '$author'\n");
OK.
> Another somewhat frequent Signed-off-by: defect pattern
> is to have a lower-case name or no name at all used.
>
> Perhaps it'd be useful to add some checks like
> a single word for a name and all lower-case names:
>
> e.g.:
> Signed-off-by: joe perches <joe@perches.com>
>
> emits
>
> Unusual lower case name: 'joe perches <joe@perches.com>'
>
> and
> Signed-off-by: root <root@mydomain.net>
> and
> Signed-off-by: Root <root@mydomain.net>
> and
> Signed-off-by: root@mydomain.net
>
> all emit that a single or missing name is unusual.
Perhaps. But adding email addresses is not the topic of this patch.
Gr{oetje,eeting}s,
Geert
--
Geert Uytterhoeven -- There's lots of Linux beyond ia32 -- geert@linux-m68k.org
In personal conversations with technical people, I call myself a hacker. But
when I'm talking to journalists I just say "programmer" or something like that.
-- Linus Torvalds
^ permalink raw reply [flat|nested] 5+ messages in thread
* Re: [PATCH v2] checkpatch: Warn if missing author Signed-off-by
2018-07-11 17:07 ` Geert Uytterhoeven
@ 2018-07-11 17:24 ` Joe Perches
2018-07-12 10:05 ` Geert Uytterhoeven
0 siblings, 1 reply; 5+ messages in thread
From: Joe Perches @ 2018-07-11 17:24 UTC (permalink / raw)
To: Geert Uytterhoeven
Cc: Geert Uytterhoeven, Andy Whitcroft, jgg, Stephen Rothwell,
Linus Walleij, Yoshihiro Shimoda, Pavel Machek,
Linux Kernel Mailing List
On Wed, 2018-07-11 at 19:07 +0200, Geert Uytterhoeven wrote:
> Hi Joe,
Rehi Geert.
> On Wed, Jul 11, 2018 at 6:20 PM Joe Perches <joe@perches.com> wrote:
> > On Wed, 2018-07-11 at 17:10 +0200, Geert Uytterhoeven wrote:
> > > Print a warning if none of the Signed-off-by lines cover the patch
> > > author.
> > >
> > > Non-ASCII quoted printable encoding in From: headers and (lack of)
> > > double quotes are handled.
> > > Split From: headers are not fully handled: only the first part is
> > > compared.
> >
> > []
> > > diff --git a/scripts/checkpatch.pl b/scripts/checkpatch.pl
> >
> > []
> > > +# Check the patch for a From:
> > > + if (decode("MIME-Header", $line) =~ /^From:\s*(.*)/) {
> > > + $author = encode("utf8", $1);
> > > + $author =~ s/"//g;
> > > + }
> > > +
> > > # Check the patch for a signoff:
> > > if ($line =~ /^\s*signed-off-by:/i) {
> > > $signoff++;
> > > $in_commit_log = 0;
> > > + if ($author ne '') {
> > > + my $l = $line;
> > > + $l =~ s/"//g;
> > > + if ($l =~ /^\s*signed-off-by: \Q$author\E/i) {
if (/^\s*signed-off-by:\s*\Q$author\E/i)
> > > + $authorsignoff = 1;
> > > + }
> > > + }
> > > }
> >
> > I don't see the point of removing the quotes.
>
> If the name contains a dot (e.g. "David S. Miller"), it must be protected
> by double quotes in the From: email header, but that is usually not the case
> in the Sob line.
Right.
It seems it's unusual for SOBs to use quotes around names.
> > Another somewhat frequent Signed-off-by: defect pattern
> > is to have a lower-case name or no name at all used.
> >
> > Perhaps it'd be useful to add some checks like
> > a single word for a name and all lower-case names:
> >
> > e.g.:
> > Signed-off-by: joe perches <joe@perches.com>
> >
> > emits
> >
> > Unusual lower case name: 'joe perches <joe@perches.com>'
> >
> > and
> > Signed-off-by: root <root@mydomain.net>
> > and
> > Signed-off-by: Root <root@mydomain.net>
> > and
> > Signed-off-by: root@mydomain.net
> >
> > all emit that a single or missing name is unusual.
>
> Perhaps. But adding email addresses is not the topic of this patch.
Just another possible improvement if you're feeling energetic.
cheers, Joe
^ permalink raw reply [flat|nested] 5+ messages in thread
* Re: [PATCH v2] checkpatch: Warn if missing author Signed-off-by
2018-07-11 17:24 ` Joe Perches
@ 2018-07-12 10:05 ` Geert Uytterhoeven
0 siblings, 0 replies; 5+ messages in thread
From: Geert Uytterhoeven @ 2018-07-12 10:05 UTC (permalink / raw)
To: Joe Perches
Cc: Geert Uytterhoeven, Andy Whitcroft, jgg, Stephen Rothwell,
Linus Walleij, Yoshihiro Shimoda, Pavel Machek,
Linux Kernel Mailing List
Hi Joe,
On Wed, Jul 11, 2018 at 7:24 PM Joe Perches <joe@perches.com> wrote:
> On Wed, 2018-07-11 at 19:07 +0200, Geert Uytterhoeven wrote:
> > On Wed, Jul 11, 2018 at 6:20 PM Joe Perches <joe@perches.com> wrote:
> > > On Wed, 2018-07-11 at 17:10 +0200, Geert Uytterhoeven wrote:
> > > > Print a warning if none of the Signed-off-by lines cover the patch
> > > > author.
> > > >
> > > > Non-ASCII quoted printable encoding in From: headers and (lack of)
> > > > double quotes are handled.
> > > > Split From: headers are not fully handled: only the first part is
> > > > compared.
> > >
> > > []
> > > > diff --git a/scripts/checkpatch.pl b/scripts/checkpatch.pl
> > >
> > > []
> > > > +# Check the patch for a From:
> > > > + if (decode("MIME-Header", $line) =~ /^From:\s*(.*)/) {
> > > > + $author = encode("utf8", $1);
> > > > + $author =~ s/"//g;
> > > > + }
> > > > +
> > > > # Check the patch for a signoff:
> > > > if ($line =~ /^\s*signed-off-by:/i) {
> > > > $signoff++;
> > > > $in_commit_log = 0;
> > > > + if ($author ne '') {
> > > > + my $l = $line;
> > > > + $l =~ s/"//g;
> > > > + if ($l =~ /^\s*signed-off-by: \Q$author\E/i) {
>
> if (/^\s*signed-off-by:\s*\Q$author\E/i)
Dropping the "$l =~" doesn't fly.
The "\s*" part does work.
> > > > + $authorsignoff = 1;
> > > > + }
> > > > + }
> > > > }
> > >
> > > I don't see the point of removing the quotes.
> >
> > If the name contains a dot (e.g. "David S. Miller"), it must be protected
> > by double quotes in the From: email header, but that is usually not the case
> > in the Sob line.
>
> Right.
> It seems it's unusual for SOBs to use quotes around names.
But it does happen from time to time.
Gr{oetje,eeting}s,
Geert
--
Geert Uytterhoeven -- There's lots of Linux beyond ia32 -- geert@linux-m68k.org
In personal conversations with technical people, I call myself a hacker. But
when I'm talking to journalists I just say "programmer" or something like that.
-- Linus Torvalds
^ permalink raw reply [flat|nested] 5+ messages in thread
end of thread, other threads:[~2018-07-12 10:05 UTC | newest]
Thread overview: 5+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2018-07-11 15:10 [PATCH v2] checkpatch: Warn if missing author Signed-off-by Geert Uytterhoeven
2018-07-11 16:20 ` Joe Perches
2018-07-11 17:07 ` Geert Uytterhoeven
2018-07-11 17:24 ` Joe Perches
2018-07-12 10:05 ` Geert Uytterhoeven
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).