From: "Michal Suchánek" <msuchanek@suse.de>
To: Mauro Carvalho Chehab <mchehab@kernel.org>
Cc: Markus Heiser <markus.heiser@darmarit.de>,
linux-doc@vger.kernel.org, Jonathan Corbet <corbet@lwn.net>
Subject: Re: Sphinx parallel build error: UnicodeEncodeError: 'latin-1' codec can't encode characters in position 18-20: ordinal not in range(256)
Date: Mon, 17 May 2021 15:10:07 +0200 [thread overview]
Message-ID: <20210517131007.GL8544@kitsune.suse.cz> (raw)
In-Reply-To: <20210512095931.65ece006@coco.lan>
On Wed, May 12, 2021 at 09:59:31AM +0200, Mauro Carvalho Chehab wrote:
> Em Wed, 12 May 2021 09:01:57 +0200
> Michal Suchánek <msuchanek@suse.de> escreveu:
>
> > On Wed, May 12, 2021 at 08:22:38AM +0200, Mauro Carvalho Chehab wrote:
> > > Hi Michal,
> > >
> > > Em Thu, 6 May 2021 19:48:49 +0200
> > > Michal Suchánek <msuchanek@suse.de> escreveu:
> > >
> > > > [ 127s] + :
> > > > [ 127s] + locale
> > > > [ 128s] LANG=en_US
> > > > [ 128s] LC_CTYPE="en_US"
> > > > [ 128s] LC_NUMERIC="en_US"
> > > > [ 128s] LC_TIME="en_US"
> > > > [ 128s] LC_COLLATE="en_US"
> > > > [ 128s] LC_MONETARY="en_US"
> > > > [ 128s] LC_MESSAGES="en_US"
> > > > [ 128s] LC_PAPER="en_US"
> > > > [ 128s] LC_NAME="en_US"
> > > > [ 128s] LC_ADDRESS="en_US"
> > > > [ 128s] LC_TELEPHONE="en_US"
> > > > [ 128s] LC_MEASUREMENT="en_US"
> > > > [ 128s] LC_IDENTIFICATION="en_US"
> > > > [ 128s] LC_ALL=
> > > > [ 128s] + echo LC_ALL=
> > > > [ 128s] LC_ALL=
> > > > [ 128s] + echo LANG=en_US
> > > > [ 128s] LANG=en_US
> > >
> > > Where those the locale settings that you used when the build
> > > failed?
> > >
> > > I tried to reproduce the bug here with, disabling the parallel run (as
> > > it masks the real error) with both:
> > >
> > > $ for i in LANG LC_ALL LC_ADDRESS LC_IDENTIFICATION LC_MEASUREMENT LC_MONETARY LC_NAME LC_NUMERIC LC_PAPER LC_TELEPHONE LC_TIME; do echo $i=en_US; done
> > > $ make cleandocs && make SPHINXOPTS=-j1 htmldocs
> > >
> > > (this one caused lots of warnings on Debian, due to the
> > > settings at /etc/locale.gen)
> > >
> > > and:
> > >
> > > $ for i in LANG LC_ALL LC_ADDRESS LC_IDENTIFICATION LC_MEASUREMENT LC_MONETARY LC_NAME LC_NUMERIC LC_PAPER LC_TELEPHONE LC_TIME; do echo $i=en_US.ISO-8859-1; done
> > > $ make cleandocs && make SPHINXOPTS=-j1 htmldocs
> > >
> > > Without any success.
> > >
> > > Could you please provide more details about the build VM and the git
> > > changeset that caused the issue?
> >
> > It depends on what character set your en_US locale implements.
> >
> > ~> cat test.py
> > print("↑ᛏ个")
> > ~> locale
> > LANG=en_US.utf8
> > LC_CTYPE="en_US.utf8"
> > LC_NUMERIC="en_US.utf8"
> > LC_TIME="en_US.utf8"
> > LC_COLLATE="en_US.utf8"
> > LC_MONETARY="en_US.utf8"
> > LC_MESSAGES="en_US.utf8"
> > LC_PAPER="en_US.utf8"
> > LC_NAME="en_US.utf8"
> > LC_ADDRESS="en_US.utf8"
> > LC_TELEPHONE="en_US.utf8"
> > LC_MEASUREMENT="en_US.utf8"
> > LC_IDENTIFICATION="en_US.utf8"
> > LC_ALL=
> > ~> python3 test.py
> > ↑ᛏ个
> > ~> LANG=en_US python3 test.py
> > Traceback (most recent call last):
> > File "test.py", line 1, in <module>
> > print("\u2191\u16cf\u4e2a\uf8f9")
> > UnicodeEncodeError: 'latin-1' codec can't encode characters in position 0-3: ordinal not in range(256)
> > ~> LANG=C python3 test.py
> > ↑ᛏ个
> >
>
> This is working as expected on my test machine:
>
> $ LANG=en_US.utf8 python3 test.py
> ↑ᛏ个
> $ LANG=en_US python3 test.py
> Traceback (most recent call last):
> File "test.py", line 1, in <module>
> print("\u2191\u16cf\u4e2a\uf8f9")
> UnicodeEncodeError: 'latin-1' codec can't encode characters in position 0-3: ordinal not in range(256)
>
> Yet, running:
>
> $ . /devel/v4l/docs/sphinx_3.3.1/bin/activate
> make cleandocs && LANG=en_US make SPHINXOPTS=-j1 htmldocs
>
> Doesn't produce any UnicodeEncodeError errors.
>
> See, here I'm testing it with Sphinx version 3.3.1, on Ubuntu 20.04,
> using changeset 9f4ad9e425a1 Linux 5.12. Also, both UTF8 and iso8859-1
> are on this machine's locale:
>
> $ more /etc/locale.gen |grep -v ^#
> de_DE.UTF-8 UTF-8
> en_US ISO-8859-1
> en_US.UTF-8 UTF-8
>
> (On Debian/Ubuntu, python and other tools complain a lot if the used
> locale is not at /etc/locale.gen)
>
> Maybe you're using a different Sphinx version, or maybe the distro
> on your VM is using has different locales installed on it or some
> other different packages.
I am using these:
[ 14s] [287/464] cumulate python38-sphinxcontrib-websupport-1.2.4-1.3
[ 14s] [323/464] cumulate python38-Sphinx2-2.3.1-4.1
[ 14s] [324/464] cumulate python38-sphinx_rtd_theme-0.5.2-1.1
[ 14s] [325/464] cumulate python38-sphinxcontrib-applehelp-1.0.2-1.4
[ 14s] [326/464] cumulate python38-sphinxcontrib-devhelp-1.0.2-1.4
[ 14s] [327/464] cumulate python38-sphinxcontrib-htmlhelp-1.0.3-1.4
[ 14s] [328/464] cumulate python38-sphinxcontrib-jsmath-1.0.1-2.5
[ 14s] [329/464] cumulate python38-sphinxcontrib-qthelp-1.0.3-1.4
[ 14s] [330/464] cumulate python38-sphinxcontrib-serializinghtml-1.1.4-1.4
[ 455s] Sphinx parallel build error:
[ 455s] UnicodeEncodeError: 'latin-1' codec can't encode characters in
position 18-20: ordinal not in range(256)
[ 467s] make[2]: *** [../Documentation/Makefile:91: htmldocs] Error 2
[ 467s] make[1]: ***
[/home/abuild/rpmbuild/BUILD/kernel-docs-5.13~rc1.next.20210514/linux-5.13-rc1-next-20210514/Makefile:1784:
htmldocs] Error 2
[ 467s] make[1]: Leaving directory
'/home/abuild/rpmbuild/BUILD/kernel-docs-5.13~rc1.next.20210514/linux-5.13-rc1-next-20210514/html'
[ 467s] make: *** [Makefile:222: __sub-make] Error 2
Thanks
Michal
prev parent reply other threads:[~2021-05-17 13:10 UTC|newest]
Thread overview: 41+ messages / expand[flat|nested] mbox.gz Atom feed top
2021-05-06 10:39 Sphinx parallel build error: UnicodeEncodeError: 'latin-1' codec can't encode characters in position 18-20: ordinal not in range(256) Michal Suchánek
2021-05-06 11:20 ` Mauro Carvalho Chehab
2021-05-06 13:32 ` Michal Suchánek
2021-05-06 14:24 ` Mauro Carvalho Chehab
2021-05-06 14:35 ` Michal Suchánek
2021-05-06 15:57 ` Markus Heiser
2021-05-06 16:46 ` Mauro Carvalho Chehab
2021-05-06 17:04 ` Markus Heiser
2021-05-06 17:27 ` Mauro Carvalho Chehab
2021-05-06 17:53 ` Markus Heiser
2021-05-06 18:06 ` Michal Suchánek
2021-05-07 8:52 ` Mauro Carvalho Chehab
2021-05-06 17:57 ` Randy Dunlap
2021-05-06 18:08 ` Matthew Wilcox
2021-05-06 21:21 ` Randy Dunlap
2021-05-07 6:39 ` Mauro Carvalho Chehab
2021-05-07 6:49 ` Randy Dunlap
2021-05-07 8:04 ` Mauro Carvalho Chehab
2021-05-07 8:35 ` Michal Suchánek
2021-05-07 8:56 ` Markus Heiser
2021-05-07 9:14 ` Mauro Carvalho Chehab
2021-05-07 9:51 ` Markus Heiser
2021-05-07 10:29 ` Michal Suchánek
2021-05-07 9:02 ` Mauro Carvalho Chehab
2021-05-08 9:22 ` Mauro Carvalho Chehab
2021-05-08 10:41 ` Michal Suchánek
2021-05-08 14:41 ` Mauro Carvalho Chehab
2021-05-08 15:55 ` Randy Dunlap
2021-05-08 17:09 ` Michal Suchánek
2021-05-08 17:46 ` Randy Dunlap
2021-05-10 6:22 ` Mauro Carvalho Chehab
2021-05-10 8:17 ` Mauro Carvalho Chehab
2021-05-06 17:48 ` Michal Suchánek
2021-05-06 17:59 ` Markus Heiser
2021-05-06 18:16 ` Michal Suchánek
2021-05-12 6:22 ` Mauro Carvalho Chehab
2021-05-12 7:01 ` Michal Suchánek
2021-05-12 7:18 ` Markus Heiser
2021-05-12 7:37 ` Markus Heiser
2021-05-12 7:59 ` Mauro Carvalho Chehab
2021-05-17 13:10 ` Michal Suchánek [this message]
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20210517131007.GL8544@kitsune.suse.cz \
--to=msuchanek@suse.de \
--cc=corbet@lwn.net \
--cc=linux-doc@vger.kernel.org \
--cc=markus.heiser@darmarit.de \
--cc=mchehab@kernel.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).