All of lore.kernel.org
 help / color / mirror / Atom feed
* DMAR:[fault reason 02] Present bit in context entry is clear (firewire-ohci)
@ 2010-05-22 10:26 Stefan Richter
  2010-05-22 12:12 ` Matthew Wilcox
  0 siblings, 1 reply; 8+ messages in thread
From: Stefan Richter @ 2010-05-22 10:26 UTC (permalink / raw)
  To: linux-pci; +Cc: linux1394-devel, linux-kernel

Hi,

I came across a report of the subject message.  What does it mean; what
type of bug should be look for?

https://bugzilla.redhat.com/show_bug.cgi?id=587178
>>>
firewire_ohci 0000:04:00.4: PCI INT C -> GSI 16 (level, low) -> IRQ 16
firewire_ohci 0000:04:00.4: setting latency timer to 64
firewire_ohci: Added fw-ohci device 0000:04:00.4, OHCI version 1.0
DRHD: handling fault status reg 2
DMAR:[DMA Read] Request device [04:00.0] fault addr fffff000
DMAR:[fault reason 02] Present bit in context entry is clear
DMAR:[DMA Read] Request device [04:00.0] fault addr fffff000
DMAR:[fault reason 02] Present bit in context entry is clear
DMAR:[DMA Read] Request device [04:00.0] fault addr fffff000
DMAR:[fault reason 02] Present bit in context entry is clear
<<<

The log sounds as if this happens during from-device DMA of 04:00.4 into
a consistent buffer, yet the fault is logged for device 04:00.0.  The
log is from a Dell M4500.  The devices are, according to a web search,
04:00.0 CardBus bridge [0607]: Ricoh Co Ltd Device [1180:e476] (rev 02)
04:00.4 FireWire (IEEE 1394) [0c00]: Ricoh Co Ltd Device [1180:e832]
(rev 03) (prog-if 10)

Thanks for any hint,
-- 
Stefan Richter
-=====-==-=- -=-= =-==-
http://arcgraph.de/sr/

^ permalink raw reply	[flat|nested] 8+ messages in thread

* Re: DMAR:[fault reason 02] Present bit in context entry is clear (firewire-ohci)
  2010-05-22 10:26 DMAR:[fault reason 02] Present bit in context entry is clear (firewire-ohci) Stefan Richter
@ 2010-05-22 12:12 ` Matthew Wilcox
  2010-05-22 18:40   ` David Woodhouse
  0 siblings, 1 reply; 8+ messages in thread
From: Matthew Wilcox @ 2010-05-22 12:12 UTC (permalink / raw)
  To: Stefan Richter; +Cc: linux-pci, linux1394-devel, linux-kernel, David Woodhouse

On Sat, May 22, 2010 at 12:26:15PM +0200, Stefan Richter wrote:
> Hi,
> 
> I came across a report of the subject message.  What does it mean; what
> type of bug should be look for?
> 
> https://bugzilla.redhat.com/show_bug.cgi?id=587178
> >>>
> firewire_ohci 0000:04:00.4: PCI INT C -> GSI 16 (level, low) -> IRQ 16
> firewire_ohci 0000:04:00.4: setting latency timer to 64
> firewire_ohci: Added fw-ohci device 0000:04:00.4, OHCI version 1.0
> DRHD: handling fault status reg 2
> DMAR:[DMA Read] Request device [04:00.0] fault addr fffff000
> DMAR:[fault reason 02] Present bit in context entry is clear
> DMAR:[DMA Read] Request device [04:00.0] fault addr fffff000
> DMAR:[fault reason 02] Present bit in context entry is clear
> DMAR:[DMA Read] Request device [04:00.0] fault addr fffff000
> DMAR:[fault reason 02] Present bit in context entry is clear
> <<<
> 
> The log sounds as if this happens during from-device DMA of 04:00.4 into
> a consistent buffer, yet the fault is logged for device 04:00.0.  The
> log is from a Dell M4500.  The devices are, according to a web search,
> 04:00.0 CardBus bridge [0607]: Ricoh Co Ltd Device [1180:e476] (rev 02)
> 04:00.4 FireWire (IEEE 1394) [0c00]: Ricoh Co Ltd Device [1180:e832]
> (rev 03) (prog-if 10)

I would guess that the DMAR can't distinguish between the different
devices behind the Cardbus bridge.  Adding Dave Woodhouse.

-- 
Matthew Wilcox				Intel Open Source Technology Centre
"Bill, look, we understand that you're interested in selling us this
operating system, but compare it to ours.  We can't possibly take such
a retrograde step."

^ permalink raw reply	[flat|nested] 8+ messages in thread

* Re: DMAR:[fault reason 02] Present bit in context entry is clear (firewire-ohci)
  2010-05-22 12:12 ` Matthew Wilcox
@ 2010-05-22 18:40   ` David Woodhouse
  2010-05-22 19:09     ` Jesse Barnes
  0 siblings, 1 reply; 8+ messages in thread
From: David Woodhouse @ 2010-05-22 18:40 UTC (permalink / raw)
  To: Matthew Wilcox, jbarnes
  Cc: Stefan Richter, linux-pci, linux1394-devel, linux-kernel

On Sat, 2010-05-22 at 06:12 -0600, Matthew Wilcox wrote:
> On Sat, May 22, 2010 at 12:26:15PM +0200, Stefan Richter wrote:
> > Hi,
> > 
> > I came across a report of the subject message.  What does it mean; what
> > type of bug should be look for?
> > 
> > https://bugzilla.redhat.com/show_bug.cgi?id=587178
> > >>>
> > firewire_ohci 0000:04:00.4: PCI INT C -> GSI 16 (level, low) -> IRQ 16
> > firewire_ohci 0000:04:00.4: setting latency timer to 64
> > firewire_ohci: Added fw-ohci device 0000:04:00.4, OHCI version 1.0
> > DRHD: handling fault status reg 2
> > DMAR:[DMA Read] Request device [04:00.0] fault addr fffff000
> > DMAR:[fault reason 02] Present bit in context entry is clear
> > DMAR:[DMA Read] Request device [04:00.0] fault addr fffff000
> > DMAR:[fault reason 02] Present bit in context entry is clear
> > DMAR:[DMA Read] Request device [04:00.0] fault addr fffff000
> > DMAR:[fault reason 02] Present bit in context entry is clear
> > <<<
> > 
> > The log sounds as if this happens during from-device DMA of 04:00.4 into
> > a consistent buffer, yet the fault is logged for device 04:00.0.  The
> > log is from a Dell M4500.  The devices are, according to a web search,
> > 04:00.0 CardBus bridge [0607]: Ricoh Co Ltd Device [1180:e476] (rev 02)
> > 04:00.4 FireWire (IEEE 1394) [0c00]: Ricoh Co Ltd Device [1180:e832]
> > (rev 03) (prog-if 10)
> 
> I would guess that the DMAR can't distinguish between the different
> devices behind the Cardbus bridge.  Adding Dave Woodhouse.

Yeah, the DMAR looks at the source-id in the PCIe transactions, and it
sounds like those are all 04:00.0. But we've probably set up the DMAR to
allow the transactions from 04:00.4, and then it naturally faults when a
"different" device actually ends up doing the transaction.

If you make the pci_find_upstream_pcie_bridge() function do the
appropriate thing for this device, does it then work as expected?

Whether that's a _sane_ thing to do or not is possibly more of a Jesse
question...

-- 
David Woodhouse                            Open Source Technology Centre
David.Woodhouse@intel.com                              Intel Corporation


^ permalink raw reply	[flat|nested] 8+ messages in thread

* Re: DMAR:[fault reason 02] Present bit in context entry is clear (firewire-ohci)
  2010-05-22 18:40   ` David Woodhouse
@ 2010-05-22 19:09     ` Jesse Barnes
  2010-05-30 14:18       ` Stefan Richter
  2011-11-19 17:21       ` Tomáš Janoušek
  0 siblings, 2 replies; 8+ messages in thread
From: Jesse Barnes @ 2010-05-22 19:09 UTC (permalink / raw)
  To: David Woodhouse
  Cc: Matthew Wilcox, Stefan Richter, linux-pci, linux1394-devel,
	linux-kernel, Dominik Brodowski

On Sat, 22 May 2010 19:40:43 +0100
David Woodhouse <dwmw2@infradead.org> wrote:

> On Sat, 2010-05-22 at 06:12 -0600, Matthew Wilcox wrote:
> > On Sat, May 22, 2010 at 12:26:15PM +0200, Stefan Richter wrote:
> > > Hi,
> > > 
> > > I came across a report of the subject message.  What does it mean; what
> > > type of bug should be look for?
> > > 
> > > https://bugzilla.redhat.com/show_bug.cgi?id=587178
> > > >>>
> > > firewire_ohci 0000:04:00.4: PCI INT C -> GSI 16 (level, low) -> IRQ 16
> > > firewire_ohci 0000:04:00.4: setting latency timer to 64
> > > firewire_ohci: Added fw-ohci device 0000:04:00.4, OHCI version 1.0
> > > DRHD: handling fault status reg 2
> > > DMAR:[DMA Read] Request device [04:00.0] fault addr fffff000
> > > DMAR:[fault reason 02] Present bit in context entry is clear
> > > DMAR:[DMA Read] Request device [04:00.0] fault addr fffff000
> > > DMAR:[fault reason 02] Present bit in context entry is clear
> > > DMAR:[DMA Read] Request device [04:00.0] fault addr fffff000
> > > DMAR:[fault reason 02] Present bit in context entry is clear
> > > <<<
> > > 
> > > The log sounds as if this happens during from-device DMA of 04:00.4 into
> > > a consistent buffer, yet the fault is logged for device 04:00.0.  The
> > > log is from a Dell M4500.  The devices are, according to a web search,
> > > 04:00.0 CardBus bridge [0607]: Ricoh Co Ltd Device [1180:e476] (rev 02)
> > > 04:00.4 FireWire (IEEE 1394) [0c00]: Ricoh Co Ltd Device [1180:e832]
> > > (rev 03) (prog-if 10)
> > 
> > I would guess that the DMAR can't distinguish between the different
> > devices behind the Cardbus bridge.  Adding Dave Woodhouse.
> 
> Yeah, the DMAR looks at the source-id in the PCIe transactions, and it
> sounds like those are all 04:00.0. But we've probably set up the DMAR to
> allow the transactions from 04:00.4, and then it naturally faults when a
> "different" device actually ends up doing the transaction.
> 
> If you make the pci_find_upstream_pcie_bridge() function do the
> appropriate thing for this device, does it then work as expected?
> 
> Whether that's a _sane_ thing to do or not is possibly more of a Jesse
> question...

Well, if cardbus bridges tend to behave this way in general it would
make sense to simply use the cardbus bride id everywhere, rather than
the specific functions of the device.  Cc'ing Dominik.

Jesse

^ permalink raw reply	[flat|nested] 8+ messages in thread

* Re: DMAR:[fault reason 02] Present bit in context entry is clear (firewire-ohci)
  2010-05-22 19:09     ` Jesse Barnes
@ 2010-05-30 14:18       ` Stefan Richter
  2011-11-19 17:21       ` Tomáš Janoušek
  1 sibling, 0 replies; 8+ messages in thread
From: Stefan Richter @ 2010-05-30 14:18 UTC (permalink / raw)
  To: Jesse Barnes
  Cc: David Woodhouse, Matthew Wilcox, linux-pci, linux1394-devel,
	linux-kernel, Dominik Brodowski

>>> On Sat, May 22, 2010 at 12:26:15PM +0200, Stefan Richter wrote:
>>>> https://bugzilla.redhat.com/show_bug.cgi?id=587178

Thanks to those who responded.  Alas I cannot follow up on this myself
with tests since I don't have affected hardware.
-- 
Stefan Richter
-=====-==-=- -=-= ====-
http://arcgraph.de/sr/

^ permalink raw reply	[flat|nested] 8+ messages in thread

* Re: DMAR:[fault reason 02] Present bit in context entry is clear (firewire-ohci)
  2010-05-22 19:09     ` Jesse Barnes
  2010-05-30 14:18       ` Stefan Richter
@ 2011-11-19 17:21       ` Tomáš Janoušek
  2011-11-30 16:16         ` Clemens Ladisch
  1 sibling, 1 reply; 8+ messages in thread
From: Tomáš Janoušek @ 2011-11-19 17:21 UTC (permalink / raw)
  To: Jesse Barnes
  Cc: David Woodhouse, Matthew Wilcox, Stefan Richter, linux-pci,
	linux1394-devel, linux-kernel, Dominik Brodowski

Hello guys,

On Sat, May 22, 2010 at 12:09:05PM -0700, Jesse Barnes wrote:
> On Sat, 22 May 2010 19:40:43 +0100
> David Woodhouse <dwmw2@infradead.org> wrote:
> > [...]
> > Yeah, the DMAR looks at the source-id in the PCIe transactions, and it
> > sounds like those are all 04:00.0. But we've probably set up the DMAR to
> > allow the transactions from 04:00.4, and then it naturally faults when a
> > "different" device actually ends up doing the transaction.
> > 
> > If you make the pci_find_upstream_pcie_bridge() function do the
> > appropriate thing for this device, does it then work as expected?
> > 
> > Whether that's a _sane_ thing to do or not is possibly more of a Jesse
> > question...
> 
> Well, if cardbus bridges tend to behave this way in general it would
> make sense to simply use the cardbus bride id everywhere, rather than
> the specific functions of the device.  Cc'ing Dominik.

Any news on this one? It is still an issue with current kernel on current
hardware (like my ThinkPad T420). :-(

Regards,
-- 
Tomáš Janoušek, a.k.a. Liskni_si, http://work.lisk.in/

^ permalink raw reply	[flat|nested] 8+ messages in thread

* Re: DMAR:[fault reason 02] Present bit in context entry is clear (firewire-ohci)
  2011-11-19 17:21       ` Tomáš Janoušek
@ 2011-11-30 16:16         ` Clemens Ladisch
  2011-12-01  9:14           ` Tomáš Janoušek
  0 siblings, 1 reply; 8+ messages in thread
From: Clemens Ladisch @ 2011-11-30 16:16 UTC (permalink / raw)
  To: Tomáš Janoušek
  Cc: Jesse Barnes, Matthew Wilcox, linux-pci, linux-kernel,
	Dominik Brodowski, Stefan Richter, linux1394-devel,
	David Woodhouse

Tomáš Janoušek wrote:
> On Sat, May 22, 2010 at 12:09:05PM -0700, Jesse Barnes wrote:
>> On Sat, 22 May 2010 19:40:43 +0100
>> David Woodhouse <dwmw2@infradead.org> wrote:
>>> [...]
>>> Yeah, the DMAR looks at the source-id in the PCIe transactions, and it
>>> sounds like those are all 04:00.0. But we've probably set up the DMAR to
>>> allow the transactions from 04:00.4, and then it naturally faults when a
>>> "different" device actually ends up doing the transaction.
>>>
>>> If you make the pci_find_upstream_pcie_bridge() function do the
>>> appropriate thing for this device, does it then work as expected?
>>>
>>> Whether that's a _sane_ thing to do or not is possibly more of a Jesse
>>> question...
>>
>> Well, if cardbus bridges tend to behave this way in general it would
>> make sense to simply use the cardbus bride id everywhere, rather than
>> the specific functions of the device.  Cc'ing Dominik.
>
> Any news on this one?

Apparently not.

Try the patch in <https://bugzilla.redhat.com/show_bug.cgi?id=605888>
that is intended to work around the bug in your Ricoh chip.  Alternatively,
disable VT-d or FireWire in the BIOS, or add the kernel parameter
"intel_iommu=off".


Regards,
Clemens

^ permalink raw reply	[flat|nested] 8+ messages in thread

* Re: DMAR:[fault reason 02] Present bit in context entry is clear (firewire-ohci)
  2011-11-30 16:16         ` Clemens Ladisch
@ 2011-12-01  9:14           ` Tomáš Janoušek
  0 siblings, 0 replies; 8+ messages in thread
From: Tomáš Janoušek @ 2011-12-01  9:14 UTC (permalink / raw)
  To: Clemens Ladisch
  Cc: Jesse Barnes, Matthew Wilcox, linux-pci, linux-kernel,
	Dominik Brodowski, Stefan Richter, linux1394-devel,
	David Woodhouse

Hi,

On Wed, Nov 30, 2011 at 05:16:54PM +0100, Clemens Ladisch wrote:
> Try the patch in <https://bugzilla.redhat.com/show_bug.cgi?id=605888>
> that is intended to work around the bug in your Ricoh chip.  Alternatively,
> disable VT-d or FireWire in the BIOS, or add the kernel parameter
> "intel_iommu=off".

Yeah, I guess that would work, I was just concerned with a fix not being in
the mainline. Surely we don't expect every ThinkPad user who wants to use
firewire and VT-d to apply the patch himself (the Fedora fix was to forcefully
disable IOMMU in presence of that Ricoh device). :-)

Regards,
-- 
Tomáš Janoušek, a.k.a. Liskni_si, http://work.lisk.in/

^ permalink raw reply	[flat|nested] 8+ messages in thread

end of thread, other threads:[~2011-12-01  9:14 UTC | newest]

Thread overview: 8+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2010-05-22 10:26 DMAR:[fault reason 02] Present bit in context entry is clear (firewire-ohci) Stefan Richter
2010-05-22 12:12 ` Matthew Wilcox
2010-05-22 18:40   ` David Woodhouse
2010-05-22 19:09     ` Jesse Barnes
2010-05-30 14:18       ` Stefan Richter
2011-11-19 17:21       ` Tomáš Janoušek
2011-11-30 16:16         ` Clemens Ladisch
2011-12-01  9:14           ` Tomáš Janoušek

This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.