From: Benjamin Herrenschmidt <benh@kernel.crashing.org>
To: Lorenzo Pieralisi <lorenzo.pieralisi@arm.com>,
	Catalin Marinas <catalin.marinas@arm.com>
Cc: Clint Sbisa <csbisa@amazon.com>,
	linux-pci@vger.kernel.org, Bjorn Helgaas <helgaas@kernel.org>,
	linux-arm-kernel@lists.infradead.org, will@kernel.org
Subject: Re: [PATCH] arm64: Enable PCI write-combine resources under sysfs
Date: Thu, 03 Sep 2020 09:03:32 +1000	[thread overview]
Message-ID: <1fa18cf358eb6570f919c2b03c193e856c72705b.camel@kernel.crashing.org> (raw)
In-Reply-To: <20200902175445.GA31706@e121166-lin.cambridge.arm.com>

On Wed, 2020-09-02 at 18:54 +0100, Lorenzo Pieralisi wrote:
> > > > If that driver is fixed to check what it actually wants to check, would that
> > > > address your concern about the blanket enable? I don't see any other references
> > > > to this in kernel drivers and I think the documentation at
> > > > `filesystems/sysfs-pci.rst` outlines it pretty explicitly:
> > > > 
> > > >    Platforms which support write-combining maps of PCI resources must define
> > > >    arch_can_pci_mmap_wc() which shall evaluate to non-zero at runtime when
> > > >    write-combining is permitted.
> > > 
> > > That's exactly the problem. I am asking you: what does "write-combining
> > > maps of PCI resources" mean ?
> > > 
> > > I understand we do want weak ordering for prefetchable BAR mappings
> > > but my worry is that by exposing the resources as WC to user space
> > > we are giving user space the impression that those mappings mirror
> > > x86 WC mappings behaviour that is not true on ARM64.
> > 
> > Would Device_GRE be close to the x86 WC better? It won't allow unaligned
> > accesses and that can be problematic for the user. OTOH, it doesn't
> > speculate reads, so it's safer from the hardware perspective.
> 
> Thanks Catalin for chiming in, it may yes but I need to figure out
> the precise semantics of WC on x86 first.

We never got to the bottom of that with powerpc... the semantics of "WC"
are subtly different across the archs. The key idea, I think, is for
us to state that a WC mapping drops all ordering guarantees :-)

That said, the goal here is to expose the sysfs _wc files; without
them, mapping "no-side-effect" memory such as frame buffers
produces something very, very slow.
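
For readers unfamiliar with the sysfs _wc files, here is a minimal user-space
sketch of how such a file is typically mapped. The helper names
pci_wc_resource_path() and pci_map_wc() are hypothetical, made up for this
illustration; only the resourceN_wc file naming comes from sysfs-pci.rst.
The file exists only for prefetchable BARs on platforms where
arch_can_pci_mmap_wc() evaluates to non-zero, so a caller should expect the
open to fail elsewhere.

```c
#include <fcntl.h>
#include <stddef.h>
#include <stdio.h>
#include <sys/mman.h>
#include <unistd.h>

/*
 * Hypothetical helper: build "/sys/bus/pci/devices/<bdf>/resource<bar>_wc"
 * into buf. Returns 0 on success, -1 if buf is too small.
 */
int pci_wc_resource_path(char *buf, size_t size, const char *bdf, int bar)
{
    int n = snprintf(buf, size, "/sys/bus/pci/devices/%s/resource%d_wc",
                     bdf, bar);
    return (n < 0 || (size_t)n >= size) ? -1 : 0;
}

/*
 * Hypothetical helper: map the first len bytes of the BAR through its _wc
 * file. Returns MAP_FAILED if the file cannot be opened or mapped. Accesses
 * through the returned pointer carry no ordering guarantees, so the caller
 * must add whatever barriers the device protocol requires.
 */
void *pci_map_wc(const char *path, size_t len)
{
    int fd = open(path, O_RDWR);
    if (fd < 0)
        return MAP_FAILED;

    void *p = mmap(NULL, len, PROT_READ | PROT_WRITE, MAP_SHARED, fd, 0);
    close(fd); /* the mapping stays valid after the descriptor is closed */
    return p;
}
```

A caller would build the path for a given device address and BAR, call
pci_map_wc() with the BAR length, and fall back to the plain resourceN file
when the result is MAP_FAILED.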

> Actually *if* I read x86 specs correctly WC mappings allow speculative
> reads, which then would shift the issue on the PCI specs that allow
> marking read side effects BARs as prefetchable;

Yes.

>  in other words if
> an endpoint is designed with a prefetchable BAR that has read side
> effects this is already an issue on x86 in the current kernel.

And powerpc. We remove the "G" (guarded) bit. Same deal.

> There is that, plus the usage of arch_can_pci_mmap_wc() in mellanox
> drivers which I suspect it is yet another interpretation of x86 write
> combine - I don't know what happens if we let arch_can_pci_mmap_wc() == 1
> on both normalNC or deviceGRE mappings for pgprot_writecombine.
> 
> I think it is worth getting to the bottom of this before applying
> this patch.

I think it basically boils down to mapping things without side effects
and without ordering guarantees, but that still cannot be cached.

Cheers,
Ben.




