linux-kernel.vger.kernel.org archive mirror
 help / color / mirror / Atom feed
* possible dmar_init_reserved_ranges() error
@ 2016-12-19 21:20 Bjorn Helgaas
  2016-12-22 16:27 ` Joerg Roedel
  2016-12-27 23:44 ` Bjorn Helgaas
  0 siblings, 2 replies; 10+ messages in thread
From: Bjorn Helgaas @ 2016-12-19 21:20 UTC (permalink / raw)
  To: David Woodhouse, Joerg Roedel; +Cc: rwright, iommu, linux-pci, linux-kernel

Hi guys,

I have some questions about dmar_init_reserved_ranges().  On systems
where CPU physical address space is not identity-mapped to PCI bus
address space, e.g., where the PCI host bridge windows have _TRA
offsets, I'm not sure we're doing the right thing.

Assume we have a PCI host bridge with _TRA that maps CPU addresses
0x80000000-0x9fffffff to PCI bus addresses 0x00000000-0x1fffffff, with
two PCI devices below it:

  PCI host bridge domain 0000 [bus 00-3f]
  PCI host bridge window [mem 0x80000000-0x9fffffff] (bus 0x00000000-0x1fffffff]
  00:00.0: BAR 0 [mem 0x80000000-0x8ffffffff] (0x00000000-0x0fffffff on bus)
  00:01.0: BAR 0 [mem 0x90000000-0x9ffffffff] (0x10000000-0x1fffffff on bus)

The IOMMU init code in dmar_init_reserved_ranges() reserves the PCI
MMIO space for all devices:

  pci_iommu_init()
    intel_iommu_init()
      dmar_init_reserved_ranges()
        reserve_iova(0x80000000-0x8ffffffff)
        reserve_iova(0x90000000-0x9ffffffff)

This looks odd because we're reserving CPU physical addresses, but
the IOVA space contains *PCI bus* addresses.  On most x86 systems they
would be the same, but not on all.

Assume the driver for 00:00.0 maps a page of main memory for DMA.  It
may receive a dma_addr_t of 0x10000000:

  00:00.0: intel_map_page() returns dma_addr_t 0x10000000
  00:00.0: issues DMA to 0x10000000

What happens here?  The DMA access should go to main memory.  In
conventional PCI it would be a peer-to-peer access to device 00:01.0.
Is there enough PCIe smarts (ACS or something?) to do otherwise?

The dmar_init_reserved_ranges() comment says "Reserve all PCI MMIO to
avoid peer-to-peer access."  Without _TRA, CPU addresses and PCI bus
addresses would be identical, and I think these reserve_iova() calls
*would* prevent this situation.  So maybe we're just missing a
pcibios_resource_to_bus() here?

Bjorn

^ permalink raw reply	[flat|nested] 10+ messages in thread

end of thread, other threads:[~2017-01-04 14:39 UTC | newest]

Thread overview: 10+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2016-12-19 21:20 possible dmar_init_reserved_ranges() error Bjorn Helgaas
2016-12-22 16:27 ` Joerg Roedel
2016-12-22 20:28   ` Bjorn Helgaas
2016-12-22 23:32     ` Raj, Ashok
2016-12-22 23:45       ` Raj, Ashok
2016-12-23  0:48         ` Bjorn Helgaas
2016-12-23 10:35           ` Joerg Roedel
2016-12-27 23:44 ` Bjorn Helgaas
2016-12-28  3:21   ` Raj, Ashok
2017-01-04 14:39     ` Joerg Roedel

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).