All of lore.kernel.org
 help / color / mirror / Atom feed
From: Joe Perches <joe@perches.com>
To: Masanari Iida <standby24x7@gmail.com>
Cc: Kees Cook <keescook@chromium.org>,
	"linux-kernel@vger.kernel.org" <linux-kernel@vger.kernel.org>,
	Andy Whitcroft <apw@canonical.com>,
	Geert Uytterhoeven <geert@linux-m68k.org>,
	linux-doc <linux-doc@vger.kernel.org>
Subject: Re: [PATCH v2] checkpatch: look for common misspellings
Date: Wed, 10 Sep 2014 00:00:46 -0700	[thread overview]
Message-ID: <1410332446.24028.26.camel@joe-AO725> (raw)
In-Reply-To: <CALLJCT3syNafyj-y+1osLCMabszJbyZhmmKWLO8k8-y4Nkmpzg@mail.gmail.com>

On Wed, 2014-09-10 at 13:37 +0900, Masanari Iida wrote:
> Hello Joe, Kees,

Hello Masanari-san.

> Sorry for late reply.
> I was on holiday when the version 1 patch discussions were posted.

No worries, holidays are far more important
than patches like this...

These patches are simple niceties, not fixes
for bugs, so review and acceptance timing is
not urgent.

> I am using codespell ( https://github.com/lucasdemarchi/codespell/ ).
> The codespell has its own typo dictionary.
> The dictionary format is
> 
> typo->good   (1 candidate)
> typo->good1,good2,  (multiple candidates)
> typo->good, comment  (1 candidate with special remark)
> 
> Its similar to your  typo||good  format.
> 
> The license of the codespell is GPLv2 according to COPYING file in tar ball.
> 
> Compare number of typo samples in dictionary.
> Your dictionary :  1033
> codespell-1.4 :     4261
> codespell-1.4 + my adding 5245
> Your dictionary + codespell-1.4 + my adding - remove duplicate:  5742
> 
> Latest version of codespell is 1.7.
> My dictionary is based on codespell-1.4. So I use the number as of 1.4.
> 
> I can provide my typo samples under GPLv2 license.

Thanks.

Any additions you have to the dictionary would be
gladly welcomed.

Using a common format for the dictionary and any
suggested corrections would be good too.

Maybe the dictionary and code should be changed to
use the codespell format.  It seems a bit more
flexible than the lintian form.

I do not know if one project is more active than
the other, but perhaps that should be the deciding
factor.  Or maybe just Kees' preference...

Merging all these together might not be a good
solution though.

Right now, the checkpatch spelling code uses word
boundaries that include an underscore.

checkpatch spelling tests are done on 4 segments of
a #define like "PREFIX_PREFERED_SEG_ABC" finding the
misspelling of PREFERED.

Some sifting of the dictionary is still necessary to
eliminate some common prefixes to avoid too many false
positives.

For example, "ths" was dropped because it's a prefix
used by several modules even though it's a somewhat
frequent typo.



  reply	other threads:[~2014-09-10  7:00 UTC|newest]

Thread overview: 12+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2014-09-08 18:15 [PATCH v2] checkpatch: look for common misspellings Kees Cook
2014-09-08 18:48 ` Joe Perches
2014-09-10  4:37   ` Masanari Iida
2014-09-10  7:00     ` Joe Perches [this message]
2014-09-10 22:52 ` Andrew Morton
2014-09-11  2:10   ` Joe Perches
2014-09-11  7:19   ` Geert Uytterhoeven
2014-09-11 14:12     ` Kees Cook
2014-09-11 14:15     ` Joe Perches
2014-09-12  4:09       ` Masanari Iida
2014-09-12  4:45         ` Joe Perches
2014-09-12 10:30           ` Masanari Iida

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=1410332446.24028.26.camel@joe-AO725 \
    --to=joe@perches.com \
    --cc=apw@canonical.com \
    --cc=geert@linux-m68k.org \
    --cc=keescook@chromium.org \
    --cc=linux-doc@vger.kernel.org \
    --cc=linux-kernel@vger.kernel.org \
    --cc=standby24x7@gmail.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.