tools.linux.kernel.org archive mirror
 help / color / mirror / Atom feed
* Re: b4: UTF-8 in replies
       [not found] <20201001171405.GA17873@qmqm.qmqm.pl>
@ 2020-10-01 18:05 ` Konstantin Ryabitsev
  2020-10-01 18:09   ` [tools] " Mark Brown
  2020-10-01 18:36   ` Michał Mirosław
  0 siblings, 2 replies; 9+ messages in thread
From: Konstantin Ryabitsev @ 2020-10-01 18:05 UTC (permalink / raw)
  To: Michał Mirosław; +Cc: tools

On Thu, Oct 01, 2020 at 07:14:05PM +0200, Michał Mirosław wrote:
> I'm writing to you because I can't find a bug tracker for b4 script.

There isn't one. Just email tools@linux.kernel.org (cc'd).

> The bug is that when generating the reply, script doesn't include MIME
> headers but is using UTF-8 in the body (eg. when replying to an author
> with non-US name, like me).

Hm... is that causing you any problems? We make sure to set 
"Content-Transfer-Encoding: 8bit" in the headers, so unless you're using 
some extremely old SMTP software, UTF-8 in the body or headers shouldn't 
matter.

Regards,
-K

^ permalink raw reply	[flat|nested] 9+ messages in thread

* Re: [tools] b4: UTF-8 in replies
  2020-10-01 18:05 ` b4: UTF-8 in replies Konstantin Ryabitsev
@ 2020-10-01 18:09   ` Mark Brown
  2020-10-01 18:36   ` Michał Mirosław
  1 sibling, 0 replies; 9+ messages in thread
From: Mark Brown @ 2020-10-01 18:09 UTC (permalink / raw)
  To: tools, konstantin; +Cc: Michał Mirosław

[-- Attachment #1: Type: text/plain, Size: 610 bytes --]

On Thu, Oct 01, 2020 at 02:05:41PM -0400, Konstantin Ryabitsev wrote:
> On Thu, Oct 01, 2020 at 07:14:05PM +0200, Michał Mirosław wrote:

> > The bug is that when generating the reply, script doesn't include MIME
> > headers but is using UTF-8 in the body (eg. when replying to an author
> > with non-US name, like me).

> Hm... is that causing you any problems? We make sure to set 
> "Content-Transfer-Encoding: 8bit" in the headers, so unless you're using 
> some extremely old SMTP software, UTF-8 in the body or headers shouldn't 
> matter.

vger doesn't like 8 bit content in the headers.

[-- Attachment #2: signature.asc --]
[-- Type: application/pgp-signature, Size: 499 bytes --]

^ permalink raw reply	[flat|nested] 9+ messages in thread

* Re: b4: UTF-8 in replies
  2020-10-01 18:05 ` b4: UTF-8 in replies Konstantin Ryabitsev
  2020-10-01 18:09   ` [tools] " Mark Brown
@ 2020-10-01 18:36   ` Michał Mirosław
  2020-10-01 18:41     ` Michał Mirosław
  1 sibling, 1 reply; 9+ messages in thread
From: Michał Mirosław @ 2020-10-01 18:36 UTC (permalink / raw)
  To: Konstantin Ryabitsev; +Cc: tools

On Thu, Oct 01, 2020 at 02:05:41PM -0400, Konstantin Ryabitsev wrote:
> On Thu, Oct 01, 2020 at 07:14:05PM +0200, Michał Mirosław wrote:
> > I'm writing to you because I can't find a bug tracker for b4 script.
> 
> There isn't one. Just email tools@linux.kernel.org (cc'd).
> 
> > The bug is that when generating the reply, script doesn't include MIME
> > headers but is using UTF-8 in the body (eg. when replying to an author
> > with non-US name, like me).
> 
> Hm... is that causing you any problems? We make sure to set 
> "Content-Transfer-Encoding: 8bit" in the headers, so unless you're using 
> some extremely old SMTP software, UTF-8 in the body or headers shouldn't 
> matter.

At least K9 Mail on Android requires MIME headers to properly decode it.
The additional headers are actually required by RFC 5322 to include
8-bit data in the message.

I guess the fix would be to just include Content-Type (and others) with
charset="utf8".

Best Regards,
Michał Mirosław

^ permalink raw reply	[flat|nested] 9+ messages in thread

* Re: b4: UTF-8 in replies
  2020-10-01 18:36   ` Michał Mirosław
@ 2020-10-01 18:41     ` Michał Mirosław
  2020-10-01 19:26       ` Konstantin Ryabitsev
  0 siblings, 1 reply; 9+ messages in thread
From: Michał Mirosław @ 2020-10-01 18:41 UTC (permalink / raw)
  To: Konstantin Ryabitsev; +Cc: tools

On Thu, Oct 01, 2020 at 08:36:47PM +0200, Michał Mirosław wrote:
> On Thu, Oct 01, 2020 at 02:05:41PM -0400, Konstantin Ryabitsev wrote:
> > On Thu, Oct 01, 2020 at 07:14:05PM +0200, Michał Mirosław wrote:
> > > I'm writing to you because I can't find a bug tracker for b4 script.
> > 
> > There isn't one. Just email tools@linux.kernel.org (cc'd).
> > 
> > > The bug is that when generating the reply, script doesn't include MIME
> > > headers but is using UTF-8 in the body (eg. when replying to an author
> > > with non-US name, like me).
> > 
> > Hm... is that causing you any problems? We make sure to set 
> > "Content-Transfer-Encoding: 8bit" in the headers, so unless you're using 
> > some extremely old SMTP software, UTF-8 in the body or headers shouldn't 
> > matter.
> 
> At least K9 Mail on Android requires MIME headers to properly decode it.
> The additional headers are actually required by RFC 5322 to include
> 8-bit data in the message.
> 
> I guess the fix would be to just include Content-Type (and others) with
> charset="utf8".

According to RFC2045 those should be enough:

MIME-Version: 1.0
Content-Type: text/plain; charset=utf-8
Content-Transfer-Encoding: 8bit

Best Regards,
Michał Mirosław

^ permalink raw reply	[flat|nested] 9+ messages in thread

* Re: b4: UTF-8 in replies
  2020-10-01 18:41     ` Michał Mirosław
@ 2020-10-01 19:26       ` Konstantin Ryabitsev
  2020-10-02 21:09         ` [tools] " Mark Brown
  0 siblings, 1 reply; 9+ messages in thread
From: Konstantin Ryabitsev @ 2020-10-01 19:26 UTC (permalink / raw)
  To: Michał Mirosław; +Cc: tools

On Thu, Oct 01, 2020 at 08:41:09PM +0200, Michał Mirosław wrote:
> > At least K9 Mail on Android requires MIME headers to properly decode 
> > it.
> > The additional headers are actually required by RFC 5322 to include
> > 8-bit data in the message.
> > 
> > I guess the fix would be to just include Content-Type (and others) with
> > charset="utf8".
> 
> According to RFC2045 those should be enough:
> 
> MIME-Version: 1.0
> Content-Type: text/plain; charset=utf-8
> Content-Transfer-Encoding: 8bit

Hrmph, apparently my assumption on what headers will be generated by 
policy=8bit was pretty wrong. I pushed a commit to master and 
stable-0.5.y that should properly set the charset and generate the 
necessary headers.

I intend to cut 0.6.0 tomorrow that will incorporate this fix, among 
several others.

Thanks!
-K

^ permalink raw reply	[flat|nested] 9+ messages in thread

* Re: [tools] b4: UTF-8 in replies
  2020-10-01 19:26       ` Konstantin Ryabitsev
@ 2020-10-02 21:09         ` Mark Brown
  2020-10-03 10:13           ` Michał Mirosław
  0 siblings, 1 reply; 9+ messages in thread
From: Mark Brown @ 2020-10-02 21:09 UTC (permalink / raw)
  To: tools, konstantin; +Cc: Michał Mirosław

[-- Attachment #1: Type: text/plain, Size: 1502 bytes --]

On Thu, Oct 01, 2020 at 03:26:00PM -0400, Konstantin Ryabitsev wrote:

> Hrmph, apparently my assumption on what headers will be generated by 
> policy=8bit was pretty wrong. I pushed a commit to master and 
> stable-0.5.y that should properly set the charset and generate the 
> necessary headers.

This fix seems to have caused some issues for me - using msmtp with
kernel.org on 1594818110-786-1-git-send-email-brent.lu@intel.com I'm
getting headers with:

To: alsa-devel@alsa-project.org, Brent Lu <brent.lu@intel.com>
Cc: =?utf-8?q?linux-kernel=40vger=2Ekernel=2Eorg=2C_Kai_Vehmanen_=3Ckai=2Eve?=
 =?utf-8?q?hmanen=40linux=2Eintel=2Ecom=3E=2C_Pierre-Louis_Bossart_=3Cpierre?=
 =?utf-8?q?-louis=2Ebossart=40linux=2Eintel=2Ecom=3E=2C_Takashi_Iwai_=3Ctiwa?=
 =?utf-8?q?i=40suse=2Ecom=3E=2C_Guennadi_Liakhovetski_=3Cguennadi=2Eliakhove?=
 =?utf-8?q?tski=40linux=2Eintel=2Ecom=3E=2C_Liam_Girdwood_=3Clgirdwood=40gma?=
 =?utf-8?q?il=2Ecom=3E=2C_Amadeusz_S=C5=82awi=C5=84ski_=3Camadeuszx=2Eslawin?=
 =?utf-8?q?ski=40linux=2Eintel=2Ecom=3E?=

(in message ID 160167211500.21762.13369928255363041989.b4-ty@kernel.org
which I'll manage to send) which /usr/sbin/sendmail -t from both msmtp
and exim has a lot of trouble parsing and since b4 has generated the
thanks I now can't ask the database (though I don't immediately see a
way to extract it anyway).  I think we need a sidecar file with the
unencoded e-mail addresses in it?
ask b4 what addresses are buried in there even if 

[-- Attachment #2: signature.asc --]
[-- Type: application/pgp-signature, Size: 499 bytes --]

^ permalink raw reply	[flat|nested] 9+ messages in thread

* Re: [tools] b4: UTF-8 in replies
  2020-10-02 21:09         ` [tools] " Mark Brown
@ 2020-10-03 10:13           ` Michał Mirosław
  2020-10-03 10:18             ` Michał Mirosław
  0 siblings, 1 reply; 9+ messages in thread
From: Michał Mirosław @ 2020-10-03 10:13 UTC (permalink / raw)
  To: Mark Brown; +Cc: tools, konstantin

On Fri, Oct 02, 2020 at 10:09:50PM +0100, Mark Brown wrote:
> On Thu, Oct 01, 2020 at 03:26:00PM -0400, Konstantin Ryabitsev wrote:
> 
> > Hrmph, apparently my assumption on what headers will be generated by 
> > policy=8bit was pretty wrong. I pushed a commit to master and 
> > stable-0.5.y that should properly set the charset and generate the 
> > necessary headers.
> 
> This fix seems to have caused some issues for me - using msmtp with
> kernel.org on 1594818110-786-1-git-send-email-brent.lu@intel.com I'm
> getting headers with:
> 
> To: alsa-devel@alsa-project.org, Brent Lu <brent.lu@intel.com>
> Cc: =?utf-8?q?linux-kernel=40vger=2Ekernel=2Eorg=2C_Kai_Vehmanen_=3Ckai=2Eve?=
>  =?utf-8?q?hmanen=40linux=2Eintel=2Ecom=3E=2C_Pierre-Louis_Bossart_=3Cpierre?=
>  =?utf-8?q?-louis=2Ebossart=40linux=2Eintel=2Ecom=3E=2C_Takashi_Iwai_=3Ctiwa?=
>  =?utf-8?q?i=40suse=2Ecom=3E=2C_Guennadi_Liakhovetski_=3Cguennadi=2Eliakhove?=
>  =?utf-8?q?tski=40linux=2Eintel=2Ecom=3E=2C_Liam_Girdwood_=3Clgirdwood=40gma?=
>  =?utf-8?q?il=2Ecom=3E=2C_Amadeusz_S=C5=82awi=C5=84ski_=3Camadeuszx=2Eslawin?=
>  =?utf-8?q?ski=40linux=2Eintel=2Ecom=3E?=

RFC 2047 section 6.2. suggests that this encoding is invalid, because it
encodes structural characters (commas in this case) -- the MTA/MUA
might treat the whole 'Cc:' as one entry. Also, section 5 point 3 of the
RFC forbids encoding the addr-spec part (mail address if I understand
correctly).

Best Regards,
Michał Mirosław

^ permalink raw reply	[flat|nested] 9+ messages in thread

* Re: [tools] b4: UTF-8 in replies
  2020-10-03 10:13           ` Michał Mirosław
@ 2020-10-03 10:18             ` Michał Mirosław
  2020-10-03 14:40               ` Konstantin Ryabitsev
  0 siblings, 1 reply; 9+ messages in thread
From: Michał Mirosław @ 2020-10-03 10:18 UTC (permalink / raw)
  To: Mark Brown; +Cc: tools, konstantin

On Sat, Oct 03, 2020 at 12:13:45PM +0200, Michał Mirosław wrote:
> On Fri, Oct 02, 2020 at 10:09:50PM +0100, Mark Brown wrote:
> > On Thu, Oct 01, 2020 at 03:26:00PM -0400, Konstantin Ryabitsev wrote:
> > 
> > > Hrmph, apparently my assumption on what headers will be generated by 
> > > policy=8bit was pretty wrong. I pushed a commit to master and 
> > > stable-0.5.y that should properly set the charset and generate the 
> > > necessary headers.
> > 
> > This fix seems to have caused some issues for me - using msmtp with
> > kernel.org on 1594818110-786-1-git-send-email-brent.lu@intel.com I'm
> > getting headers with:
> > 
> > To: alsa-devel@alsa-project.org, Brent Lu <brent.lu@intel.com>
> > Cc: =?utf-8?q?linux-kernel=40vger=2Ekernel=2Eorg=2C_Kai_Vehmanen_=3Ckai=2Eve?=
> >  =?utf-8?q?hmanen=40linux=2Eintel=2Ecom=3E=2C_Pierre-Louis_Bossart_=3Cpierre?=
> >  =?utf-8?q?-louis=2Ebossart=40linux=2Eintel=2Ecom=3E=2C_Takashi_Iwai_=3Ctiwa?=
> >  =?utf-8?q?i=40suse=2Ecom=3E=2C_Guennadi_Liakhovetski_=3Cguennadi=2Eliakhove?=
> >  =?utf-8?q?tski=40linux=2Eintel=2Ecom=3E=2C_Liam_Girdwood_=3Clgirdwood=40gma?=
> >  =?utf-8?q?il=2Ecom=3E=2C_Amadeusz_S=C5=82awi=C5=84ski_=3Camadeuszx=2Eslawin?=
> >  =?utf-8?q?ski=40linux=2Eintel=2Ecom=3E?=
> 
> RFC 2047 section 6.2. suggests that this encoding is invalid, because it
> encodes structural characters (commas in this case) -- the MTA/MUA
> might treat the whole 'Cc:' as one entry. Also, section 5 point 3 of the
> RFC forbids encoding the addr-spec part (mail address if I understand
> correctly).

IOW, correct encoding might be at most:

Cc: linux-kernel@vger.kernel.org, Kai Vehmanen
	<kai.vehmanen@linux.intel.com>, Pierre-Louis Bossart
	<pierre-louis.bossart@linux.intel.com>, Takashi Iwai
	<tiwai@suse.com>, Guennadi Liakhovetski
	<guennadi.liakhovetski@linux.intel.com>, Liam Girdwood
	<lgirdwood@gmail.com>,
	=?utf-8?q?Amadeusz_S=C5=82awi=C5=84ski?=
	<amadeuszx.slawinski@linux.intel.com>

Best Regards,
Michał Mirosław

^ permalink raw reply	[flat|nested] 9+ messages in thread

* Re: [tools] b4: UTF-8 in replies
  2020-10-03 10:18             ` Michał Mirosław
@ 2020-10-03 14:40               ` Konstantin Ryabitsev
  0 siblings, 0 replies; 9+ messages in thread
From: Konstantin Ryabitsev @ 2020-10-03 14:40 UTC (permalink / raw)
  To: Michał Mirosław; +Cc: Mark Brown, tools

On Sat, Oct 03, 2020 at 12:18:04PM +0200, Michał Mirosław wrote:
> > RFC 2047 section 6.2. suggests that this encoding is invalid, 
> > because it
> > encodes structural characters (commas in this case) -- the MTA/MUA
> > might treat the whole 'Cc:' as one entry. Also, section 5 point 3 of the
> > RFC forbids encoding the addr-spec part (mail address if I understand
> > correctly).
> 
> IOW, correct encoding might be at most:
> 
> Cc: linux-kernel@vger.kernel.org, Kai Vehmanen
> 	<kai.vehmanen@linux.intel.com>, Pierre-Louis Bossart
> 	<pierre-louis.bossart@linux.intel.com>, Takashi Iwai
> 	<tiwai@suse.com>, Guennadi Liakhovetski
> 	<guennadi.liakhovetski@linux.intel.com>, Liam Girdwood
> 	<lgirdwood@gmail.com>,
> 	=?utf-8?q?Amadeusz_S=C5=82awi=C5=84ski?=
> 	<amadeuszx.slawinski@linux.intel.com>

Please feel free to report it to Python upstream, as we're just 
deferring to whatever decisions they make. Anyway, we're back to 8bit 
headers with the latest change.

-K

^ permalink raw reply	[flat|nested] 9+ messages in thread

end of thread, other threads:[~2020-10-03 14:40 UTC | newest]

Thread overview: 9+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
     [not found] <20201001171405.GA17873@qmqm.qmqm.pl>
2020-10-01 18:05 ` b4: UTF-8 in replies Konstantin Ryabitsev
2020-10-01 18:09   ` [tools] " Mark Brown
2020-10-01 18:36   ` Michał Mirosław
2020-10-01 18:41     ` Michał Mirosław
2020-10-01 19:26       ` Konstantin Ryabitsev
2020-10-02 21:09         ` [tools] " Mark Brown
2020-10-03 10:13           ` Michał Mirosław
2020-10-03 10:18             ` Michał Mirosław
2020-10-03 14:40               ` Konstantin Ryabitsev

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).