linux-kernel-mentees.lists.linuxfoundation.org archive mirror
 help / color / mirror / Atom feed
* [Linux-kernel-mentees] checkpatch.pl improvement: NO_AUTHOR_SIGN_OFF warnings for users with multiple emails
@ 2020-09-21 12:11 Dwaipayan Ray
  2020-09-21 12:24 ` Joe Perches
  2020-09-21 13:23 ` Lukas Bulwahn
  0 siblings, 2 replies; 5+ messages in thread
From: Dwaipayan Ray @ 2020-09-21 12:11 UTC (permalink / raw)
  To: Lukas Bulwahn, Joe Perches; +Cc: linux-kernel-mentees

Hi Joe and Lukas and others,
I would like to elaborate a bit on the issue and the solution I
have thought of for fixing Missing Author Signed-off-by
warning for regular committers who use multiple email
addresses.

The problem:
While running checkpatch on previous commits to the kernel,
there were multiple such instances where the author had
signed off using a different email address rather than the one
which he used to mail the patch.

From Lukas's data:

$ grep "NO_AUTHOR_SIGN_OFF" v5.4..v5.8.tsv  | cut -f 7 | sort  | uniq -c |
sort -nr | head -n 8
    175 Missing Signed-off-by: line by nominal patch author 'Daniel
Vetter <daniel.vetter@ffwll.ch>'
     68 Missing Signed-off-by: line by nominal patch author 'Trond
Myklebust <trondmy@gmail.com>'
     43 Missing Signed-off-by: line by nominal patch author 'Thinh
Nguyen <Thinh.Nguyen@synopsys.com>'
     40 Missing Signed-off-by: line by nominal patch author 'Pascal
van Leeuwen <pascalvanl@gmail.com>'
     36 Missing Signed-off-by: line by nominal patch author 'Alex
Maftei <amaftei@solarflare.com>'
     31 Missing Signed-off-by: line by nominal patch author 'Valdis
Kletnieks <valdis.kletnieks@vt.edu>'
     24 Missing Signed-off-by: line by nominal patch author 'Luke
Nelson <lukenels@cs.washington.edu>'

So most of them belong to the case where they have signed off
using a different mail address. I believe these can be handled
better.

Proposed Solution:
The .mailmap file contains mappings of the following types:
  name1 <mail1>
  <mail1> <mail2>
  name1 <mail1> <mail2>
  name1 <mail1> name2 <mail2>

Thus loading .mailmap data and matching email addresses for
the same author would resolve many of these warnings.

Now the remaining problem at hand is to have a data structure
by which this query can be handled easily without much extra
overhead.

One possible solution is while parsing the author, also load mailmap
and load associated email addresses to the author and load it
into a hash. Next when a signed-off-by line is encountered and
the email is found in our hash ( or maybe some other ds ), then
the signed-off-by match should be positive.

Is this feasible? I would be looking at other possibilities too. But it
would be great to have your view on it!

Thanks,
Dwaipayan.
_______________________________________________
Linux-kernel-mentees mailing list
Linux-kernel-mentees@lists.linuxfoundation.org
https://lists.linuxfoundation.org/mailman/listinfo/linux-kernel-mentees

^ permalink raw reply	[flat|nested] 5+ messages in thread

end of thread, other threads:[~2020-09-21 14:00 UTC | newest]

Thread overview: 5+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2020-09-21 12:11 [Linux-kernel-mentees] checkpatch.pl improvement: NO_AUTHOR_SIGN_OFF warnings for users with multiple emails Dwaipayan Ray
2020-09-21 12:24 ` Joe Perches
2020-09-21 13:27   ` Lukas Bulwahn
2020-09-21 13:23 ` Lukas Bulwahn
2020-09-21 14:00   ` Dwaipayan Ray

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).