linux-kernel.vger.kernel.org archive mirror
From: Matthew Wilcox <willy@infradead.org>
To: Carlos Bilbao <carlos.bilbao@amd.com>
Cc: Akira Yokosawa <akiyks@gmail.com>,
	bilbao@vt.edu, corbet@lwn.net, linux-doc@vger.kernel.org,
	linux-kernel@vger.kernel.org, miguel.ojeda.sandonis@gmail.com
Subject: Re: [PATCH v2 1/2] Documentation: Start translations to Spanish
Date: Mon, 24 Oct 2022 16:01:18 +0100
Message-ID: <Y1aovvn7dAyo1nuW@casper.infradead.org>
In-Reply-To: <9e65ba16-68d3-8d11-c6ac-c35c29026688@amd.com>

On Mon, Oct 24, 2022 at 08:40:42AM -0500, Carlos Bilbao wrote:
> > > I don't know what standard we're actually following.  RFC5646 suggests
> > > simply using "es", with "es-419" for Latin America specialisation or
> > > "es-ES" for Spain.  I don't know how much variation there is between
> > > different Spanish dialects for technical documents; as I understand it,
> > > it's worth supporting two dialects of Chinese, but we merrily mix &
> > > match en_US and en_GB spellings.  Similarly, I wouldn't suggest that we
> > > have separate translations for fr_CA, fr_CH, fr_FR, just a single 'fr'
> > > would be fine.
> > > 
> > > We do need to be careful here; people are rightfully sensitive about
> > > being incorrectly grouped together.  If possible we should find a
> > > standard to follow that's been defined by experts in these matters.
> > > https://en.wikipedia.org/wiki/IETF_language_tag may be a good place to
> > > start looking.
> > I think generic "es" is OK, especially if "es_ES" can have such a
> > negative connotation to some. I just wanted to point out "sp_SP"
> > looks wrong.
> > 
> > Carlos, if you go the "es" way, it would be better to mention the
> > reason for the choice in the Changelog for future reference.
> > 
> > Subdirectories "ja_JP", "ko_KR", and "zh_CN" were added under
> > Documentation/ way back in 2007 (v2.6.23).
> > 
> > As you might see, two of the three language codes needed region
> > distinction and they were reasonable choices at the time.
> > 
> >          Thanks, Akira
> 
> Replying to Akira and Matthew below. Thanks to both for valuable feedback.
> 
> I made the conscious choice of not using es_ES, because as mentioned, it
> references a standard that I don’t intend to follow myself or enforce on
> Spanish translations. es_ES is a standard that comes from “Esp”-aña (Spain,
> the country) whereas “sp_SP” is as in "Sp"-anish, the language, not the
> country. Regarding this, I took the liberty of adding an extra paragraph to
> index.rst. I would translate it into English as:
> 
> "Many countries speak Spanish, each one with its own culture, expressions,
> and sometimes significant grammatical differences. The translators are free
> to use the version of Spanish which they are most comfortable with. In
> principle, these small differences should not pose a great barrier for
> speakers of different versions of Spanish, albeit in case of doubt, you can
> ask the maintainers."
> 
> I also opted for not using es_ES due to its geographical connotations. If
> someone from Peru, Mexico, Argentina, … submits a translation tomorrow, I
> would review it and we would understand each other just fine. Even within
> “Spain” there are many dialects and things change within regions. I
> reiterate that all dialects should be allowed in this directory.
> 
> Fortunately for us, versions of Spanish differ much more in spoken form
> than they do when written. This does not happen between traditional and
> simplified Chinese.
> 
> On top of everything else, using the locale es_ES may imply that spell checks
> on that directory with the es_ES locale would come back clean, but this is
> very far from reality, among other things because of all the English
> computing terms we inherit. As Miguel Ojeda pointed out somewhere in this
> thread, there are terms that are better left untranslated, to favor
> understanding of code and other documents.
> 
> I will update the corresponding commit message to clarify why we are using
> es_ES format in this particular case.

I think we're better off following BCP 47:
https://www.rfc-editor.org/info/bcp47 rather than the libc locale format.
That will imply renaming it_IT to simply "it", ja_JP to "ja" and
ko_KR to "ko".  The two Chinese translations we have might be called
"zh-Hant" and "zh-Hans", if the distinction is purely Traditional vs
Simplified script.  If they really are region based, then they'd be
zh-CN and zh-TW.
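
To make the proposed renaming concrete, here is a minimal sketch in Python.
It is only an illustration: the function name to_bcp47, the
chinese_by_script switch and the region-to-script table are assumptions made
for this example, not anything that exists in the kernel tree, and the
zh_TW directory name is assumed from the "zh-TW" tag mentioned above.

```python
# Hypothetical helper: map a libc-style directory name ("ja_JP") to the
# BCP 47-style tag suggested in this message ("ja").

# Assumed region-to-script table for Chinese; only relevant if the two
# translations are distinguished by script rather than by region.
SCRIPT_BY_REGION = {"CN": "Hans", "TW": "Hant"}


def to_bcp47(name: str, chinese_by_script: bool = True) -> str:
    """Return a BCP 47-style tag for a libc-style locale directory name."""
    language, _, region = name.partition("_")
    language = language.lower()
    region = region.upper()

    if language == "zh" and region:
        script = SCRIPT_BY_REGION.get(region)
        if chinese_by_script and script:
            # Script-based distinction: zh-Hans / zh-Hant.
            return f"zh-{script}"
        # Region-based distinction: zh-CN / zh-TW.
        return f"zh-{region}"

    # Every other language drops the mandatory geography tag.
    return language


if __name__ == "__main__":
    for old in ("it_IT", "ja_JP", "ko_KR", "zh_CN", "zh_TW"):
        print(old, "->", to_bcp47(old))
```

Passing chinese_by_script=False gives the region-based tags instead, which is
the fallback described above for the case where the two Chinese translations
really are region based.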

I think you're right to conflate all dialects of Spanish together, just
as we do all dialects of English.

Jon, this feels like policy you should be setting.  Are you on board
with this, or do you want to retain the mandatory geography tag that
we've been using up to now?
