linux-mm.kvack.org archive mirror
 help / color / mirror / Atom feed
* Re: [RFC PATCH 0/7] Emulated coherent graphics memory take 2
@ 2019-09-15  3:34 Hillf Danton
  0 siblings, 0 replies; 2+ messages in thread
From: Hillf Danton @ 2019-09-15  3:34 UTC (permalink / raw)
  To: Thomas Hellstrom; +Cc: linux-kernel, dri-devel, linux-mm


On Fri, 13 Sep 2019 11:32:06 +0200
> 
> The mm patch page walk interface has been reworked to be similar to the
> reworked page-walk code (mm/pagewalk.c). There have been two other solutions
> to consider:
> 1) Using the page-walk code. That is currently not possible since it requires
> the mmap-sem to be held for the struct vm_area_struct vm_flags and for huge
> page splitting. The pagewalk code in this patchset can't hold the mmap sems
> since it will lead to locking inversion.

Specify the locking scenario, if non-rfc is planned, to help understand
the new wheel this patchset looks to create, as two days of finding it in
the works after ba4e7d973dd0 failed.



^ permalink raw reply	[flat|nested] 2+ messages in thread

* [RFC PATCH 0/7] Emulated coherent graphics memory take 2
@ 2019-09-13  9:32 Thomas Hellström (VMware)
  0 siblings, 0 replies; 2+ messages in thread
From: Thomas Hellström (VMware) @ 2019-09-13  9:32 UTC (permalink / raw)
  To: linux-kernel, dri-devel, linux-mm
  Cc: pv-drivers, linux-graphics-maintainer, Thomas Hellström,
	Andrew Morton, Matthew Wilcox, Will Deacon, Peter Zijlstra,
	Rik van Riel, Minchan Kim, Michal Hocko, Huang Ying,
	Souptick Joarder, Jérôme Glisse, Christian König,
	Christoph Hellwig

From: Thomas Hellström <thellstrom@vmware.com>

Graphics APIs like OpenGL 4.4 and Vulkan require the graphics driver
to provide coherent graphics memory, meaning that the GPU sees any
content written to the coherent memory on the next GPU operation that
touches that memory, and the CPU sees any content written by the GPU
to that memory immediately after any fence object trailing the GPU
operation has signaled.

Paravirtual drivers that otherwise require explicit synchronization
needs to do this by hooking up dirty tracking to pagefault handlers
and buffer object validation.

The mm patch page walk interface has been reworked to be similar to the
reworked page-walk code (mm/pagewalk.c). There have been two other solutions
to consider:
1) Using the page-walk code. That is currently not possible since it requires
the mmap-sem to be held for the struct vm_area_struct vm_flags and for huge
page splitting. The pagewalk code in this patchset can't hold the mmap sems
since it will lead to locking inversion. Instead it uses an operation mode
similar to unmap_mapping_range where the i_mmap_lock is held.
2) Using apply_to_page_range(). The primary use of this code is to fill
page tables. The operation modes are IMO sufficiently different to motivate
re-implementing the page-walk.

For the TTM changes they are hopefully in line with the long-term
strategy of making helpers out of what's left of TTM.

The code has been tested and exercised by a tailored version of mesa
where we disable all explicit synchronization and assume graphics memory
is coherent. The performance loss varies of course; a typical number is
around 5%.

I would like to merge this code through the DRM tree, so an ack to do that
from an mm maintainer would be greatly appreciated.

Cc: Andrew Morton <akpm@linux-foundation.org>
Cc: Matthew Wilcox <willy@infradead.org>
Cc: Will Deacon <will.deacon@arm.com>
Cc: Peter Zijlstra <peterz@infradead.org>
Cc: Rik van Riel <riel@surriel.com>
Cc: Minchan Kim <minchan@kernel.org>
Cc: Michal Hocko <mhocko@suse.com>
Cc: Huang Ying <ying.huang@intel.com>
Cc: Souptick Joarder <jrdr.linux@gmail.com>
Cc: "Jérôme Glisse" <jglisse@redhat.com>
Cc: "Christian König" <christian.koenig@amd.com>
Cc: Christoph Hellwig <hch@infradead.org>


^ permalink raw reply	[flat|nested] 2+ messages in thread

end of thread, other threads:[~2019-09-15  3:35 UTC | newest]

Thread overview: 2+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2019-09-15  3:34 [RFC PATCH 0/7] Emulated coherent graphics memory take 2 Hillf Danton
  -- strict thread matches above, loose matches on Subject: below --
2019-09-13  9:32 Thomas Hellström (VMware)

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).