* Re: b4: UTF-8 in replies
[not found] <20201001171405.GA17873@qmqm.qmqm.pl>
@ 2020-10-01 18:05 ` Konstantin Ryabitsev
2020-10-01 18:09 ` [tools] " Mark Brown
2020-10-01 18:36 ` Michał Mirosław
0 siblings, 2 replies; 9+ messages in thread
From: Konstantin Ryabitsev @ 2020-10-01 18:05 UTC (permalink / raw)
To: Michał Mirosław; +Cc: tools
On Thu, Oct 01, 2020 at 07:14:05PM +0200, Michał Mirosław wrote:
> I'm writing to you because I can't find a bug tracker for b4 script.
There isn't one. Just email tools@linux.kernel.org (cc'd).
> The bug is that when generating the reply, script doesn't include MIME
> headers but is using UTF-8 in the body (eg. when replying to an author
> with non-US name, like me).
Hm... is that causing you any problems? We make sure to set
"Content-Transfer-Encoding: 8bit" in the headers, so unless you're using
some extremely old SMTP software, UTF-8 in the body or headers shouldn't
matter.
Regards,
-K
^ permalink raw reply [flat|nested] 9+ messages in thread
* Re: [tools] b4: UTF-8 in replies
2020-10-01 18:05 ` b4: UTF-8 in replies Konstantin Ryabitsev
@ 2020-10-01 18:09 ` Mark Brown
2020-10-01 18:36 ` Michał Mirosław
1 sibling, 0 replies; 9+ messages in thread
From: Mark Brown @ 2020-10-01 18:09 UTC (permalink / raw)
To: tools, konstantin; +Cc: Michał Mirosław
[-- Attachment #1: Type: text/plain, Size: 610 bytes --]
On Thu, Oct 01, 2020 at 02:05:41PM -0400, Konstantin Ryabitsev wrote:
> On Thu, Oct 01, 2020 at 07:14:05PM +0200, Michał Mirosław wrote:
> > The bug is that when generating the reply, script doesn't include MIME
> > headers but is using UTF-8 in the body (eg. when replying to an author
> > with non-US name, like me).
> Hm... is that causing you any problems? We make sure to set
> "Content-Transfer-Encoding: 8bit" in the headers, so unless you're using
> some extremely old SMTP software, UTF-8 in the body or headers shouldn't
> matter.
vger doesn't like 8 bit content in the headers.
[-- Attachment #2: signature.asc --]
[-- Type: application/pgp-signature, Size: 499 bytes --]
^ permalink raw reply [flat|nested] 9+ messages in thread
* Re: b4: UTF-8 in replies
2020-10-01 18:05 ` b4: UTF-8 in replies Konstantin Ryabitsev
2020-10-01 18:09 ` [tools] " Mark Brown
@ 2020-10-01 18:36 ` Michał Mirosław
2020-10-01 18:41 ` Michał Mirosław
1 sibling, 1 reply; 9+ messages in thread
From: Michał Mirosław @ 2020-10-01 18:36 UTC (permalink / raw)
To: Konstantin Ryabitsev; +Cc: tools
On Thu, Oct 01, 2020 at 02:05:41PM -0400, Konstantin Ryabitsev wrote:
> On Thu, Oct 01, 2020 at 07:14:05PM +0200, Michał Mirosław wrote:
> > I'm writing to you because I can't find a bug tracker for b4 script.
>
> There isn't one. Just email tools@linux.kernel.org (cc'd).
>
> > The bug is that when generating the reply, script doesn't include MIME
> > headers but is using UTF-8 in the body (eg. when replying to an author
> > with non-US name, like me).
>
> Hm... is that causing you any problems? We make sure to set
> "Content-Transfer-Encoding: 8bit" in the headers, so unless you're using
> some extremely old SMTP software, UTF-8 in the body or headers shouldn't
> matter.
At least K9 Mail on Android requires MIME headers to properly decode it.
The additional headers are actually required by RFC 5322 to include
8-bit data in the message.
I guess the fix would be to just include Content-Type (and others) with
charset="utf8".
Best Regards,
Michał Mirosław
^ permalink raw reply [flat|nested] 9+ messages in thread
* Re: b4: UTF-8 in replies
2020-10-01 18:36 ` Michał Mirosław
@ 2020-10-01 18:41 ` Michał Mirosław
2020-10-01 19:26 ` Konstantin Ryabitsev
0 siblings, 1 reply; 9+ messages in thread
From: Michał Mirosław @ 2020-10-01 18:41 UTC (permalink / raw)
To: Konstantin Ryabitsev; +Cc: tools
On Thu, Oct 01, 2020 at 08:36:47PM +0200, Michał Mirosław wrote:
> On Thu, Oct 01, 2020 at 02:05:41PM -0400, Konstantin Ryabitsev wrote:
> > On Thu, Oct 01, 2020 at 07:14:05PM +0200, Michał Mirosław wrote:
> > > I'm writing to you because I can't find a bug tracker for b4 script.
> >
> > There isn't one. Just email tools@linux.kernel.org (cc'd).
> >
> > > The bug is that when generating the reply, script doesn't include MIME
> > > headers but is using UTF-8 in the body (eg. when replying to an author
> > > with non-US name, like me).
> >
> > Hm... is that causing you any problems? We make sure to set
> > "Content-Transfer-Encoding: 8bit" in the headers, so unless you're using
> > some extremely old SMTP software, UTF-8 in the body or headers shouldn't
> > matter.
>
> At least K9 Mail on Android requires MIME headers to properly decode it.
> The additional headers are actually required by RFC 5322 to include
> 8-bit data in the message.
>
> I guess the fix would be to just include Content-Type (and others) with
> charset="utf8".
According to RFC2045 those should be enough:
MIME-Version: 1.0
Content-Type: text/plain; charset=utf-8
Content-Transfer-Encoding: 8bit
Best Regards,
Michał Mirosław
^ permalink raw reply [flat|nested] 9+ messages in thread
* Re: b4: UTF-8 in replies
2020-10-01 18:41 ` Michał Mirosław
@ 2020-10-01 19:26 ` Konstantin Ryabitsev
2020-10-02 21:09 ` [tools] " Mark Brown
0 siblings, 1 reply; 9+ messages in thread
From: Konstantin Ryabitsev @ 2020-10-01 19:26 UTC (permalink / raw)
To: Michał Mirosław; +Cc: tools
On Thu, Oct 01, 2020 at 08:41:09PM +0200, Michał Mirosław wrote:
> > At least K9 Mail on Android requires MIME headers to properly decode
> > it.
> > The additional headers are actually required by RFC 5322 to include
> > 8-bit data in the message.
> >
> > I guess the fix would be to just include Content-Type (and others) with
> > charset="utf8".
>
> According to RFC2045 those should be enough:
>
> MIME-Version: 1.0
> Content-Type: text/plain; charset=utf-8
> Content-Transfer-Encoding: 8bit
Hrmph, apparently my assumption on what headers will be generated by
policy=8bit was pretty wrong. I pushed a commit to master and
stable-0.5.y that should properly set the charset and generate the
necessary headers.
I intend to cut 0.6.0 tomorrow that will incorporate this fix, among
several others.
Thanks!
-K
^ permalink raw reply [flat|nested] 9+ messages in thread
* Re: [tools] b4: UTF-8 in replies
2020-10-01 19:26 ` Konstantin Ryabitsev
@ 2020-10-02 21:09 ` Mark Brown
2020-10-03 10:13 ` Michał Mirosław
0 siblings, 1 reply; 9+ messages in thread
From: Mark Brown @ 2020-10-02 21:09 UTC (permalink / raw)
To: tools, konstantin; +Cc: Michał Mirosław
[-- Attachment #1: Type: text/plain, Size: 1502 bytes --]
On Thu, Oct 01, 2020 at 03:26:00PM -0400, Konstantin Ryabitsev wrote:
> Hrmph, apparently my assumption on what headers will be generated by
> policy=8bit was pretty wrong. I pushed a commit to master and
> stable-0.5.y that should properly set the charset and generate the
> necessary headers.
This fix seems to have caused some issues for me - using msmtp with
kernel.org on 1594818110-786-1-git-send-email-brent.lu@intel.com I'm
getting headers with:
To: alsa-devel@alsa-project.org, Brent Lu <brent.lu@intel.com>
Cc: =?utf-8?q?linux-kernel=40vger=2Ekernel=2Eorg=2C_Kai_Vehmanen_=3Ckai=2Eve?=
=?utf-8?q?hmanen=40linux=2Eintel=2Ecom=3E=2C_Pierre-Louis_Bossart_=3Cpierre?=
=?utf-8?q?-louis=2Ebossart=40linux=2Eintel=2Ecom=3E=2C_Takashi_Iwai_=3Ctiwa?=
=?utf-8?q?i=40suse=2Ecom=3E=2C_Guennadi_Liakhovetski_=3Cguennadi=2Eliakhove?=
=?utf-8?q?tski=40linux=2Eintel=2Ecom=3E=2C_Liam_Girdwood_=3Clgirdwood=40gma?=
=?utf-8?q?il=2Ecom=3E=2C_Amadeusz_S=C5=82awi=C5=84ski_=3Camadeuszx=2Eslawin?=
=?utf-8?q?ski=40linux=2Eintel=2Ecom=3E?=
(in message ID 160167211500.21762.13369928255363041989.b4-ty@kernel.org
which I'll manage to send) which /usr/sbin/sendmail -t from both msmtp
and exim has a lot of trouble parsing and since b4 has generated the
thanks I now can't ask the database (though I don't immediately see a
way to extract it anyway). I think we need a sidecar file with the
unencoded e-mail addresses in it?
ask b4 what addresses are buried in there even if
[-- Attachment #2: signature.asc --]
[-- Type: application/pgp-signature, Size: 499 bytes --]
^ permalink raw reply [flat|nested] 9+ messages in thread
* Re: [tools] b4: UTF-8 in replies
2020-10-02 21:09 ` [tools] " Mark Brown
@ 2020-10-03 10:13 ` Michał Mirosław
2020-10-03 10:18 ` Michał Mirosław
0 siblings, 1 reply; 9+ messages in thread
From: Michał Mirosław @ 2020-10-03 10:13 UTC (permalink / raw)
To: Mark Brown; +Cc: tools, konstantin
On Fri, Oct 02, 2020 at 10:09:50PM +0100, Mark Brown wrote:
> On Thu, Oct 01, 2020 at 03:26:00PM -0400, Konstantin Ryabitsev wrote:
>
> > Hrmph, apparently my assumption on what headers will be generated by
> > policy=8bit was pretty wrong. I pushed a commit to master and
> > stable-0.5.y that should properly set the charset and generate the
> > necessary headers.
>
> This fix seems to have caused some issues for me - using msmtp with
> kernel.org on 1594818110-786-1-git-send-email-brent.lu@intel.com I'm
> getting headers with:
>
> To: alsa-devel@alsa-project.org, Brent Lu <brent.lu@intel.com>
> Cc: =?utf-8?q?linux-kernel=40vger=2Ekernel=2Eorg=2C_Kai_Vehmanen_=3Ckai=2Eve?=
> =?utf-8?q?hmanen=40linux=2Eintel=2Ecom=3E=2C_Pierre-Louis_Bossart_=3Cpierre?=
> =?utf-8?q?-louis=2Ebossart=40linux=2Eintel=2Ecom=3E=2C_Takashi_Iwai_=3Ctiwa?=
> =?utf-8?q?i=40suse=2Ecom=3E=2C_Guennadi_Liakhovetski_=3Cguennadi=2Eliakhove?=
> =?utf-8?q?tski=40linux=2Eintel=2Ecom=3E=2C_Liam_Girdwood_=3Clgirdwood=40gma?=
> =?utf-8?q?il=2Ecom=3E=2C_Amadeusz_S=C5=82awi=C5=84ski_=3Camadeuszx=2Eslawin?=
> =?utf-8?q?ski=40linux=2Eintel=2Ecom=3E?=
RFC 2047 section 6.2. suggests that this encoding is invalid, because it
encodes structural characters (commas in this case) -- the MTA/MUA
might treat the whole 'Cc:' as one entry. Also, section 5 point 3 of the
RFC forbids encoding the addr-spec part (mail address if I understand
correctly).
Best Regards,
Michał Mirosław
^ permalink raw reply [flat|nested] 9+ messages in thread
* Re: [tools] b4: UTF-8 in replies
2020-10-03 10:13 ` Michał Mirosław
@ 2020-10-03 10:18 ` Michał Mirosław
2020-10-03 14:40 ` Konstantin Ryabitsev
0 siblings, 1 reply; 9+ messages in thread
From: Michał Mirosław @ 2020-10-03 10:18 UTC (permalink / raw)
To: Mark Brown; +Cc: tools, konstantin
On Sat, Oct 03, 2020 at 12:13:45PM +0200, Michał Mirosław wrote:
> On Fri, Oct 02, 2020 at 10:09:50PM +0100, Mark Brown wrote:
> > On Thu, Oct 01, 2020 at 03:26:00PM -0400, Konstantin Ryabitsev wrote:
> >
> > > Hrmph, apparently my assumption on what headers will be generated by
> > > policy=8bit was pretty wrong. I pushed a commit to master and
> > > stable-0.5.y that should properly set the charset and generate the
> > > necessary headers.
> >
> > This fix seems to have caused some issues for me - using msmtp with
> > kernel.org on 1594818110-786-1-git-send-email-brent.lu@intel.com I'm
> > getting headers with:
> >
> > To: alsa-devel@alsa-project.org, Brent Lu <brent.lu@intel.com>
> > Cc: =?utf-8?q?linux-kernel=40vger=2Ekernel=2Eorg=2C_Kai_Vehmanen_=3Ckai=2Eve?=
> > =?utf-8?q?hmanen=40linux=2Eintel=2Ecom=3E=2C_Pierre-Louis_Bossart_=3Cpierre?=
> > =?utf-8?q?-louis=2Ebossart=40linux=2Eintel=2Ecom=3E=2C_Takashi_Iwai_=3Ctiwa?=
> > =?utf-8?q?i=40suse=2Ecom=3E=2C_Guennadi_Liakhovetski_=3Cguennadi=2Eliakhove?=
> > =?utf-8?q?tski=40linux=2Eintel=2Ecom=3E=2C_Liam_Girdwood_=3Clgirdwood=40gma?=
> > =?utf-8?q?il=2Ecom=3E=2C_Amadeusz_S=C5=82awi=C5=84ski_=3Camadeuszx=2Eslawin?=
> > =?utf-8?q?ski=40linux=2Eintel=2Ecom=3E?=
>
> RFC 2047 section 6.2. suggests that this encoding is invalid, because it
> encodes structural characters (commas in this case) -- the MTA/MUA
> might treat the whole 'Cc:' as one entry. Also, section 5 point 3 of the
> RFC forbids encoding the addr-spec part (mail address if I understand
> correctly).
IOW, correct encoding might be at most:
Cc: linux-kernel@vger.kernel.org, Kai Vehmanen
<kai.vehmanen@linux.intel.com>, Pierre-Louis Bossart
<pierre-louis.bossart@linux.intel.com>, Takashi Iwai
<tiwai@suse.com>, Guennadi Liakhovetski
<guennadi.liakhovetski@linux.intel.com>, Liam Girdwood
<lgirdwood@gmail.com>,
=?utf-8?q?Amadeusz_S=C5=82awi=C5=84ski?=
<amadeuszx.slawinski@linux.intel.com>
Best Regards,
Michał Mirosław
^ permalink raw reply [flat|nested] 9+ messages in thread
* Re: [tools] b4: UTF-8 in replies
2020-10-03 10:18 ` Michał Mirosław
@ 2020-10-03 14:40 ` Konstantin Ryabitsev
0 siblings, 0 replies; 9+ messages in thread
From: Konstantin Ryabitsev @ 2020-10-03 14:40 UTC (permalink / raw)
To: Michał Mirosław; +Cc: Mark Brown, tools
On Sat, Oct 03, 2020 at 12:18:04PM +0200, Michał Mirosław wrote:
> > RFC 2047 section 6.2. suggests that this encoding is invalid,
> > because it
> > encodes structural characters (commas in this case) -- the MTA/MUA
> > might treat the whole 'Cc:' as one entry. Also, section 5 point 3 of the
> > RFC forbids encoding the addr-spec part (mail address if I understand
> > correctly).
>
> IOW, correct encoding might be at most:
>
> Cc: linux-kernel@vger.kernel.org, Kai Vehmanen
> <kai.vehmanen@linux.intel.com>, Pierre-Louis Bossart
> <pierre-louis.bossart@linux.intel.com>, Takashi Iwai
> <tiwai@suse.com>, Guennadi Liakhovetski
> <guennadi.liakhovetski@linux.intel.com>, Liam Girdwood
> <lgirdwood@gmail.com>,
> =?utf-8?q?Amadeusz_S=C5=82awi=C5=84ski?=
> <amadeuszx.slawinski@linux.intel.com>
Please feel free to report it to Python upstream, as we're just
deferring to whatever decisions they make. Anyway, we're back to 8bit
headers with the latest change.
-K
^ permalink raw reply [flat|nested] 9+ messages in thread
end of thread, other threads:[~2020-10-03 14:40 UTC | newest]
Thread overview: 9+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
[not found] <20201001171405.GA17873@qmqm.qmqm.pl>
2020-10-01 18:05 ` b4: UTF-8 in replies Konstantin Ryabitsev
2020-10-01 18:09 ` [tools] " Mark Brown
2020-10-01 18:36 ` Michał Mirosław
2020-10-01 18:41 ` Michał Mirosław
2020-10-01 19:26 ` Konstantin Ryabitsev
2020-10-02 21:09 ` [tools] " Mark Brown
2020-10-03 10:13 ` Michał Mirosław
2020-10-03 10:18 ` Michał Mirosław
2020-10-03 14:40 ` Konstantin Ryabitsev
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).