From: David Eger <eger@havoc.gtf.org>
To: linux-kernel@vger.kernel.org
Subject: [PATCH] UTF-8ifying the kernel source
Date: Thu, 4 Mar 2004 05:05:03 -0500 [thread overview]
Message-ID: <20040304100503.GA13970@havoc.gtf.org> (raw)
http://www.yak.net/random/linux-2.6.3-utf8-cleanup-auto.diff.bz2
Here you find the first of several patches to convert the kernel
source from ISO Latin-1 to UTF-8. I'm working on the files that didn't
auto-convert easily; comments welcome ;-)
First, some statistics!
In Linux 2.6.3, there are:
15860 clean 7-bit ASCII files
274 text files are not 7-bit clean
38 of these 274 files are not auto-convertible -- either they are not ISO
Latin-1 or the high octets appear within the actual code (not comments).
This first patch applies to help files, documentation, and comments which
are trivially correct ISO Latin-1 => UTF-8 conversions. The work I have
left to do is summarized below.
--dte
Un-needed/wrong non-ASCII characters (these fixes will form patch 2)
====================================================================
drivers/video/amifb.c - +- sign?
Documentation/i2c/i2c-protocol - NBSP, but why?
arch/i386/kernel/cpu/cyrix.c - NBSP, but why?
arch/v850/kernel/as85ep1.ld - WTF? comments in some random charset...
drivers/char/ftape/lowlevel/fdc-isr.c - WTF? shit in the comments
include/asm-m68k/atarihw.h - 0x94 - "cancel character"?
include/asm-m68k/atariints.h - 0x94 - "cancel character"?
include/linux/802_11.h - why the non-standard dash?
scripts/docproc.c - why the bizarre spelling for specific?
fs/ext2/xattr.c - bad ASCII art
fs/ext3/xattr.c - bad ASCII art
fs/afs/vlclient.h - a degrees sign, but why?
Box-drawing ASCII art (these fixes will form patch 3)
=====================================================
Documentation/networking/tms380tr.txt - DOS-style ASCII art
arch/arm/nwfpe/fpopcode.h - line-drawing characters
C strings - (what to do?)
=========================
arch/ppc/platforms/proc_rtas.c - a C string containing "degrees"
arch/ppc64/kernel/rtas-proc.c - a C string containing "degrees"
drivers/macintosh/therm_adt7467.c - degrees, MODULE_PARAM_DESC(),
and a C string
drivers/mtd/chips/cfi_probe.c - C strings
drivers/net/wireless/netwave_cs.c - C strings
drivers/scsi/dc395x.c - C strings
Other - (i'd convert it, but...)
================================
drivers/pci/pci.ids - I don't know what program processes this...
drivers/ieee1394/oui.db - I don't know what program processes this...
Machine / charset specific shite - (does anything need to be done?)
===================================================================
arch/m68k/hp300/hp300map.map - maps to "char"s.. grr
drivers/char/defkeymap.map - a map file... maps to "char"s.. grr
drivers/char/qtronixmap.c_shipped - maps to "char"s.. grr
drivers/char/qtronixmap.map - maps to "char"s.. grr
drivers/tc/lk201-map.c_shipped - maps to "char"s.. grr
drivers/tc/lk201-map.map - maps to "char"s.. grr
drivers/acorn/char/defkeymap-l7200.c - maps to "char"s.. grr
arch/s390/kernel/ebcdic.c - comments on a keymap table
drivers/video/console/font_8x16.c - comments on a keymap table
drivers/video/console/font_8x8.c - comments on a keymap table
drivers/video/console/font_pearl_8x8.c - comments on a keymap table
drivers/s390/ebcdic.c - comments on a keymap table
Noise from userland (this I won't be touching)
==============================================
Documentation/networking/ethertap.txt - random crap cat'd from /dev/tap0
Documentation/s390/Debugging390.txt - weird gdb output
next reply other threads:[~2004-03-04 10:05 UTC|newest]
Thread overview: 22+ messages / expand[flat|nested] mbox.gz Atom feed top
2004-03-04 10:05 David Eger [this message]
2004-03-04 10:19 ` [PATCH] UTF-8ifying the kernel source Meelis Roos
2004-03-04 10:32 ` Måns Rullgård
2004-03-04 21:51 ` Alex Belits
2004-03-05 8:26 ` Miles Bader
2004-03-05 20:01 ` H. Peter Anvin
2004-03-05 21:00 ` Mike Fedyk
2004-03-05 21:02 ` H. Peter Anvin
2004-03-05 21:17 ` Måns Rullgård
2004-03-05 21:26 ` Charles Cazabon
2004-03-05 21:20 ` David Eger
2004-03-05 23:24 ` David Eger
2004-03-05 23:33 ` H. Peter Anvin
2004-03-06 11:08 ` Xavier Bestel
2004-03-06 11:14 ` Måns Rullgård
2004-03-09 0:30 ` H. Peter Anvin
2004-03-09 9:49 ` Xavier Bestel
2004-03-06 13:33 ` Other bizarre thing... backspaces? David Eger
2004-03-06 14:04 ` Måns Rullgård
2004-03-14 16:25 ` Petr Baudis
2004-03-09 12:19 ` [PATCH] UTF-8ifying the kernel source Geert Uytterhoeven
2004-03-05 13:21 paolo ciarrocchi
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20040304100503.GA13970@havoc.gtf.org \
--to=eger@havoc.gtf.org \
--cc=linux-kernel@vger.kernel.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).