linux-kernel.vger.kernel.org archive mirror
 help / color / mirror / Atom feed
* [PATCH] UTF-8ifying the kernel source
@ 2004-03-04 10:05 David Eger
  2004-03-04 10:19 ` Meelis Roos
                   ` (3 more replies)
  0 siblings, 4 replies; 22+ messages in thread
From: David Eger @ 2004-03-04 10:05 UTC (permalink / raw)
  To: linux-kernel



http://www.yak.net/random/linux-2.6.3-utf8-cleanup-auto.diff.bz2

Here you find the first of several patches to convert the kernel
source from ISO Latin-1 to UTF-8.  I'm working on the files that didn't
auto-convert easily; comments welcome ;-)

First, some statistics!

In Linux 2.6.3, there are:
15860 clean 7-bit ASCII files
274 text files are not 7-bit clean

38 of these 274 files are not auto-convertible -- either they are not ISO
Latin-1 or the high octets appear within the actual code (not comments).

This first patch applies to help files, documentation, and comments which
are trivially correct ISO Latin-1 => UTF-8 conversions.  The work I have
left to do is summarized below.

--dte


Un-needed/wrong non-ASCII characters (these fixes will form patch 2)
====================================================================
drivers/video/amifb.c	- +- sign?
Documentation/i2c/i2c-protocol	- NBSP, but why?
arch/i386/kernel/cpu/cyrix.c	- NBSP, but why?
arch/v850/kernel/as85ep1.ld	- WTF? comments in some random charset...
drivers/char/ftape/lowlevel/fdc-isr.c	- WTF? shit in the comments
include/asm-m68k/atarihw.h	- 0x94 - "cancel character"?
include/asm-m68k/atariints.h	- 0x94 - "cancel character"?
include/linux/802_11.h - why the non-standard dash?
scripts/docproc.c	- why the bizarre spelling for specific?
fs/ext2/xattr.c	- bad ASCII art
fs/ext3/xattr.c	- bad ASCII art
fs/afs/vlclient.h	- a degrees sign, but why?

Box-drawing ASCII art (these fixes will form patch 3)
=====================================================
Documentation/networking/tms380tr.txt	- DOS-style ASCII art
arch/arm/nwfpe/fpopcode.h	- line-drawing characters

C strings - (what to do?)
=========================
arch/ppc/platforms/proc_rtas.c	-  a C string containing "degrees"
arch/ppc64/kernel/rtas-proc.c	-  a C string containing "degrees"
drivers/macintosh/therm_adt7467.c	- degrees, MODULE_PARAM_DESC(), 
					  and a C string
drivers/mtd/chips/cfi_probe.c	- C strings
drivers/net/wireless/netwave_cs.c	- C strings	
drivers/scsi/dc395x.c	- C strings

Other - (i'd convert it, but...)
================================
drivers/pci/pci.ids	- I don't know what program processes this...
drivers/ieee1394/oui.db	- I don't know what program processes this...

Machine / charset specific shite - (does anything need to be done?)
===================================================================
arch/m68k/hp300/hp300map.map	- maps to "char"s.. grr
drivers/char/defkeymap.map	- a map file... maps to "char"s.. grr
drivers/char/qtronixmap.c_shipped	- maps to "char"s.. grr
drivers/char/qtronixmap.map	- maps to "char"s.. grr
drivers/tc/lk201-map.c_shipped	- maps to "char"s.. grr
drivers/tc/lk201-map.map	- maps to "char"s.. grr
drivers/acorn/char/defkeymap-l7200.c	- maps to "char"s.. grr
arch/s390/kernel/ebcdic.c	- comments on a keymap table
drivers/video/console/font_8x16.c	- comments on a keymap table 
drivers/video/console/font_8x8.c	- comments on a keymap table 
drivers/video/console/font_pearl_8x8.c	- comments on a keymap table 
drivers/s390/ebcdic.c	- comments on a keymap table

Noise from userland (this I won't be touching)
==============================================
Documentation/networking/ethertap.txt	- random crap cat'd from /dev/tap0
Documentation/s390/Debugging390.txt	- weird gdb output


^ permalink raw reply	[flat|nested] 22+ messages in thread
* Re: [PATCH] UTF-8ifying the kernel source
@ 2004-03-05 13:21 paolo ciarrocchi
  0 siblings, 0 replies; 22+ messages in thread
From: paolo ciarrocchi @ 2004-03-05 13:21 UTC (permalink / raw)
  To: linux-kernel

Sorry to jump in to this thread without providing any useful information...

I'm looking for doc and/or links to info regardign UTF8 and iso-*.

Any hints ?

Thanks in advance.

Ciao,
                Paolo

_________________________________________________________________
Filtri antispamming e antivirus per la tua casella di posta 
http://www.msn.it/msn/hotmail


^ permalink raw reply	[flat|nested] 22+ messages in thread

end of thread, other threads:[~2004-03-14 23:32 UTC | newest]

Thread overview: 22+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2004-03-04 10:05 [PATCH] UTF-8ifying the kernel source David Eger
2004-03-04 10:19 ` Meelis Roos
2004-03-04 10:32   ` Måns Rullgård
2004-03-04 21:51 ` Alex Belits
2004-03-05  8:26 ` Miles Bader
2004-03-05 20:01   ` H. Peter Anvin
2004-03-05 21:00     ` Mike Fedyk
2004-03-05 21:02       ` H. Peter Anvin
2004-03-05 21:17         ` Måns Rullgård
2004-03-05 21:26           ` Charles Cazabon
2004-03-05 21:20       ` David Eger
2004-03-05 23:24 ` David Eger
2004-03-05 23:33   ` H. Peter Anvin
2004-03-06 11:08     ` Xavier Bestel
2004-03-06 11:14       ` Måns Rullgård
2004-03-09  0:30       ` H. Peter Anvin
2004-03-09  9:49         ` Xavier Bestel
2004-03-06 13:33   ` Other bizarre thing... backspaces? David Eger
2004-03-06 14:04     ` Måns Rullgård
2004-03-14 16:25       ` Petr Baudis
2004-03-09 12:19   ` [PATCH] UTF-8ifying the kernel source Geert Uytterhoeven
2004-03-05 13:21 paolo ciarrocchi

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).