From: Felix Kuehling <felix.kuehling@amd.com>
To: Alex Sierra <alex.sierra@amd.com>,
akpm@linux-foundation.org, linux-mm@kvack.org,
rcampbell@nvidia.com, linux-ext4@vger.kernel.org,
linux-xfs@vger.kernel.org
Cc: amd-gfx@lists.freedesktop.org, dri-devel@lists.freedesktop.org,
hch@lst.de, jgg@nvidia.com, jglisse@redhat.com
Subject: Re: [PATCH v1 06/14] drm/amdkfd: add SPM support for SVM
Date: Wed, 25 Aug 2021 10:45:45 -0400 [thread overview]
Message-ID: <c16b9cdc-76a3-896f-3723-c8128aed1e70@amd.com> (raw)
In-Reply-To: <20210825034828.12927-7-alex.sierra@amd.com>
Am 2021-08-24 um 11:48 p.m. schrieb Alex Sierra:
> When CPU is connected throug XGMI, it has coherent
> access to VRAM resource. In this case that resource
> is taken from a table in the device gmc aperture base.
> This resource is used along with the device type, which could
> be DEVICE_PRIVATE or DEVICE_PUBLIC to create the device
> page map region.
>
> Signed-off-by: Alex Sierra <alex.sierra@amd.com>
> Reviewed-by: Felix Kuehling <Felix.Kuehling@amd.com>
> ---
> v7:
> Remove lookup_resource call, so export symbol for this function
> is not longer required. Patch dropped "kernel: resource:
> lookup_resource as exported symbol"
> ---
> drivers/gpu/drm/amd/amdkfd/kfd_migrate.c | 29 +++++++++++++++---------
> 1 file changed, 18 insertions(+), 11 deletions(-)
>
> diff --git a/drivers/gpu/drm/amd/amdkfd/kfd_migrate.c b/drivers/gpu/drm/amd/amdkfd/kfd_migrate.c
> index 47ee9a895cd2..dd245699479f 100644
> --- a/drivers/gpu/drm/amd/amdkfd/kfd_migrate.c
> +++ b/drivers/gpu/drm/amd/amdkfd/kfd_migrate.c
> @@ -865,7 +865,7 @@ int svm_migrate_init(struct amdgpu_device *adev)
> {
> struct kfd_dev *kfddev = adev->kfd.dev;
> struct dev_pagemap *pgmap;
> - struct resource *res;
> + struct resource *res = NULL;
> unsigned long size;
> void *r;
>
> @@ -880,19 +880,25 @@ int svm_migrate_init(struct amdgpu_device *adev)
> * should remove reserved size
> */
> size = ALIGN(adev->gmc.real_vram_size, 2ULL << 20);
> - res = devm_request_free_mem_region(adev->dev, &iomem_resource, size);
> - if (IS_ERR(res))
> - return -ENOMEM;
> + if (adev->gmc.xgmi.connected_to_cpu) {
> + pgmap->range.start = adev->gmc.aper_base;
> + pgmap->range.end = adev->gmc.aper_base + adev->gmc.aper_size - 1;
> + pgmap->type = MEMORY_DEVICE_PUBLIC;
> + } else {
> + res = devm_request_free_mem_region(adev->dev, &iomem_resource, size);
> + if (IS_ERR(res))
> + return -ENOMEM;
> + pgmap->range.start = res->start;
> + pgmap->range.end = res->end;
> + pgmap->type = MEMORY_DEVICE_PRIVATE;
> + }
>
> - pgmap->type = MEMORY_DEVICE_PRIVATE;
> pgmap->nr_range = 1;
> - pgmap->range.start = res->start;
> - pgmap->range.end = res->end;
> pgmap->ops = &svm_migrate_pgmap_ops;
> pgmap->owner = SVM_ADEV_PGMAP_OWNER(adev);
> - pgmap->flags = MIGRATE_VMA_SELECT_DEVICE_PRIVATE;
> + pgmap->flags = 0;
> r = devm_memremap_pages(adev->dev, pgmap);
> - if (IS_ERR(r)) {
> + if (res && IS_ERR(r)) {
I think the (res && ...) condition means you only detect failures for
DEVICE_PRIVATE memory. Why are you ignoring failures for DEVICE_PUBLIC?
For DEVICE_PUBLIC you can skip devm_release_mem_region, but you still
need to detect and return the error. Also, using res as an indicator is
a bit obscure. I'd put an if (pgmap->type == MEMORY_DEVICE_PRIVATE)
before the devm_release_mem_region call.
Regards,
Felix
> pr_err("failed to register HMM device memory\n");
> devm_release_mem_region(adev->dev, res->start,
> res->end - res->start + 1);
> @@ -914,6 +920,7 @@ void svm_migrate_fini(struct amdgpu_device *adev)
> struct dev_pagemap *pgmap = &adev->kfd.dev->pgmap;
>
> devm_memunmap_pages(adev->dev, pgmap);
> - devm_release_mem_region(adev->dev, pgmap->range.start,
> - pgmap->range.end - pgmap->range.start + 1);
> + if (pgmap->type == MEMORY_DEVICE_PRIVATE)
> + devm_release_mem_region(adev->dev, pgmap->range.start,
> + pgmap->range.end - pgmap->range.start + 1);
> }
next prev parent reply other threads:[~2021-08-25 14:45 UTC|newest]
Thread overview: 42+ messages / expand[flat|nested] mbox.gz Atom feed top
2021-08-25 3:48 [PATCH v1 00/14] Add MEMORY_DEVICE_PUBLIC for CPU-accessible coherent device memory Alex Sierra
2021-08-25 3:48 ` [PATCH v1 01/14] ext4/xfs: add page refcount helper Alex Sierra
2021-08-25 7:35 ` Christoph Hellwig
2021-08-25 15:09 ` Theodore Ts'o
2021-08-25 15:49 ` Darrick J. Wong
2021-08-25 3:48 ` [PATCH v1 02/14] mm: remove extra ZONE_DEVICE struct page refcount Alex Sierra
2021-08-25 7:35 ` Christoph Hellwig
2021-08-25 11:15 ` Vlastimil Babka
2021-08-25 17:49 ` Ralph Campbell
2021-08-27 11:26 ` Vlastimil Babka
2021-08-25 3:48 ` [PATCH v1 03/14] mm: add iomem vma selection for memory migration Alex Sierra
2021-08-25 7:40 ` Christoph Hellwig
2021-08-25 7:46 ` Christoph Hellwig
2021-08-25 18:24 ` Sierra Guiza, Alejandro (Alex)
2021-08-26 22:27 ` Felix Kuehling
2021-08-30 8:28 ` Christoph Hellwig
2021-08-30 17:04 ` Felix Kuehling
2021-09-01 8:29 ` Christoph Hellwig
2021-09-01 15:40 ` Felix Kuehling
2021-09-01 22:03 ` Dave Chinner
2021-09-01 23:07 ` Felix Kuehling
2021-09-02 1:14 ` Dave Chinner
2021-09-09 4:55 ` Felix Kuehling
2021-09-02 8:18 ` Christoph Hellwig
2021-09-02 18:07 ` Dan Williams
2021-09-09 4:02 ` Felix Kuehling
2021-08-25 14:24 ` Felix Kuehling
2021-08-25 3:48 ` [PATCH v1 04/14] mm: add zone device public type memory support Alex Sierra
2021-08-25 3:48 ` [PATCH v1 05/14] drm/amdkfd: ref count init for device pages Alex Sierra
2021-08-25 14:34 ` Felix Kuehling
2021-08-25 3:48 ` [PATCH v1 06/14] drm/amdkfd: add SPM support for SVM Alex Sierra
2021-08-25 14:45 ` Felix Kuehling [this message]
2021-08-25 3:48 ` [PATCH v1 07/14] drm/amdkfd: public type as sys mem on migration to ram Alex Sierra
2021-08-25 3:48 ` [PATCH v1 08/14] mm: add public type support to migrate_vma helpers Alex Sierra
2021-08-25 7:47 ` Christoph Hellwig
2021-08-25 3:48 ` [PATCH v1 09/14] mm: call pgmap->ops->page_free for DEVICE_PUBLIC pages Alex Sierra
2021-08-25 7:46 ` Christoph Hellwig
2021-08-25 3:48 ` [PATCH v1 10/14] lib: test_hmm add ioctl to get zone device type Alex Sierra
2021-08-25 3:48 ` [PATCH v1 11/14] lib: test_hmm add module param for " Alex Sierra
2021-08-25 3:48 ` [PATCH v1 12/14] lib: add support for device public type in test_hmm Alex Sierra
2021-08-25 3:48 ` [PATCH v1 13/14] tools: update hmm-test to support device public type Alex Sierra
2021-08-25 3:48 ` [PATCH v1 14/14] tools: update test_hmm script to support SP config Alex Sierra
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=c16b9cdc-76a3-896f-3723-c8128aed1e70@amd.com \
--to=felix.kuehling@amd.com \
--cc=akpm@linux-foundation.org \
--cc=alex.sierra@amd.com \
--cc=amd-gfx@lists.freedesktop.org \
--cc=dri-devel@lists.freedesktop.org \
--cc=hch@lst.de \
--cc=jgg@nvidia.com \
--cc=jglisse@redhat.com \
--cc=linux-ext4@vger.kernel.org \
--cc=linux-mm@kvack.org \
--cc=linux-xfs@vger.kernel.org \
--cc=rcampbell@nvidia.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).