linuxppc-dev.lists.ozlabs.org archive mirror
 help / color / mirror / Atom feed
From: David Gibson <david@gibson.dropbear.id.au>
To: Alexey Kardashevskiy <aik@ozlabs.ru>
Cc: Gavin Shan <gwshan@linux.vnet.ibm.com>,
	linux-kernel@vger.kernel.org,
	Alex Williamson <alex.williamson@redhat.com>,
	Paul Mackerras <paulus@samba.org>,
	linuxppc-dev@lists.ozlabs.org
Subject: Re: [PATCH kernel v9 25/32] vfio: powerpc/spapr: powerpc/powernv/ioda2: Rework ownership
Date: Wed, 29 Apr 2015 15:39:28 +1000	[thread overview]
Message-ID: <20150429053928.GU32589@voom.redhat.com> (raw)
In-Reply-To: <1429964096-11524-26-git-send-email-aik@ozlabs.ru>

[-- Attachment #1: Type: text/plain, Size: 6893 bytes --]

On Sat, Apr 25, 2015 at 10:14:49PM +1000, Alexey Kardashevskiy wrote:
> Before the IOMMU user (VFIO) would take control over the IOMMU table
> belonging to a specific IOMMU group. This approach did not allow sharing
> tables between IOMMU groups attached to the same container.
> 
> This introduces a new IOMMU ownership flavour when the user can not
> just control the existing IOMMU table but remove/create tables on demand.
> If an IOMMU implements take/release_ownership() callbacks, this lets
> the user have full control over the IOMMU group. When the ownership is taken,
> the platform code removes all the windows so the caller must create them.
> Before returning the ownership back to the platform code, VFIO
> unprograms and removes all the tables it created.
> 
> This changes IODA2's onwership handler to remove the existing table

"onwership"

> rather than manipulating with the existing one. From now on,
> iommu_take_ownership() and iommu_release_ownership() are only called
> from the vfio_iommu_spapr_tce driver.
> 
> In tce_iommu_detach_group(), this copies a iommu_table descriptor on stack
> as IODA2's unset_window() will clear the descriptor embedded into PE
> and we will not be able to free the table afterwards.
> This is a transitional hack and following patches will replace this code
> anyway.
> 
> Old-style ownership is still supported allowing VFIO to run on older
> P5IOC2 and IODA IO controllers.
> 
> No change in userspace-visible behaviour is expected. Since it recreates
> TCE tables on each ownership change, related kernel traces will appear
> more often.
> 
> Signed-off-by: Alexey Kardashevskiy <aik@ozlabs.ru>
> [aw: for the vfio related changes]
> Acked-by: Alex Williamson <alex.williamson@redhat.com>
> ---
> Changes:
> v9:
> * fixed crash in tce_iommu_detach_group() on tbl->it_ops->free as
> tce_iommu_attach_group() used to initialize the table from a descriptor
> on stack (it does not matter for the series as this bit is changed later anyway
> but it ruing bisectability)
> 
> v6:
> * fixed commit log that VFIO removes tables before passing ownership
> back to the platform code, not userspace
> 
> 1
> ---
>  arch/powerpc/platforms/powernv/pci-ioda.c | 27 +++++++++++++++++++++++--
>  drivers/vfio/vfio_iommu_spapr_tce.c       | 33 +++++++++++++++++++++++++++++--
>  2 files changed, 56 insertions(+), 4 deletions(-)
> 
> diff --git a/arch/powerpc/platforms/powernv/pci-ioda.c b/arch/powerpc/platforms/powernv/pci-ioda.c
> index 2a4b2b2..45bc131 100644
> --- a/arch/powerpc/platforms/powernv/pci-ioda.c
> +++ b/arch/powerpc/platforms/powernv/pci-ioda.c
> @@ -2105,16 +2105,39 @@ static void pnv_ioda2_take_ownership(struct iommu_table_group *table_group)
>  	struct pnv_ioda_pe *pe = container_of(table_group, struct pnv_ioda_pe,
>  						table_group);
>  
> -	iommu_take_ownership(&table_group->tables[0]);
>  	pnv_pci_ioda2_set_bypass(pe, false);
> +	pnv_pci_ioda2_unset_window(&pe->table_group, 0);
> +	pnv_pci_free_table(&pe->table_group.tables[0]);
>  }
>  
>  static void pnv_ioda2_release_ownership(struct iommu_table_group *table_group)
>  {
>  	struct pnv_ioda_pe *pe = container_of(table_group, struct pnv_ioda_pe,
>  						table_group);
> +	struct iommu_table *tbl = &pe->table_group.tables[0];
> +	int64_t rc;
> +
> +	rc = pnv_pci_ioda2_create_table(&pe->table_group, 0,
> +			IOMMU_PAGE_SHIFT_4K,
> +			pe->phb->ioda.m32_pci_base,
> +			POWERNV_IOMMU_DEFAULT_LEVELS, tbl);
> +	if (rc) {
> +		pe_err(pe, "Failed to create 32-bit TCE table, err %ld",
> +				rc);
> +		return;
> +	}
> +
> +	tbl->it_table_group = &pe->table_group;
> +	iommu_init_table(tbl, pe->phb->hose->node);
> +
> +	rc = pnv_pci_ioda2_set_window(&pe->table_group, 0, tbl);
> +	if (rc) {
> +		pe_err(pe, "Failed to configure 32-bit TCE table, err %ld\n",
> +				rc);
> +		pnv_pci_free_table(tbl);
> +		return;
> +	}

It seems like you want a helper function called both here and in the
initial PE setup.  Otherwise you encourage future bugs where the
initial PE setup changes, but taking and releasing IOMMU ownership
from VFIO no longer sets up exactly the same thing again.

> -	iommu_release_ownership(&table_group->tables[0]);
>  	pnv_pci_ioda2_set_bypass(pe, true);
>  }
>  
> diff --git a/drivers/vfio/vfio_iommu_spapr_tce.c b/drivers/vfio/vfio_iommu_spapr_tce.c
> index 2d51bbf..892a584 100644
> --- a/drivers/vfio/vfio_iommu_spapr_tce.c
> +++ b/drivers/vfio/vfio_iommu_spapr_tce.c
> @@ -569,6 +569,10 @@ static int tce_iommu_attach_group(void *iommu_data,
>  	if (!table_group->ops || !table_group->ops->take_ownership ||
>  			!table_group->ops->release_ownership) {
>  		ret = tce_iommu_take_ownership(table_group);
> +	} else if (!table_group->ops->create_table ||
> +			!table_group->ops->set_window) {
> +		WARN_ON_ONCE(1);
> +		ret = -EFAULT;
>  	} else {
>  		/*
>  		 * Disable iommu bypass, otherwise the user can DMA to all of
> @@ -576,7 +580,15 @@ static int tce_iommu_attach_group(void *iommu_data,
>  		 * the pages that has been explicitly mapped into the iommu
>  		 */
>  		table_group->ops->take_ownership(table_group);
> -		ret = 0;
> +		ret = table_group->ops->create_table(table_group,
> +				0, /* window number */
> +				IOMMU_PAGE_SHIFT_4K,
> +				table_group->tce32_size,
> +				1, /* default levels */
> +				&table_group->tables[0]);
> +		if (!ret)
> +			ret = table_group->ops->set_window(table_group, 0,
> +					&table_group->tables[0]);
>  	}
>  
>  	if (ret)
> @@ -595,6 +607,7 @@ static void tce_iommu_detach_group(void *iommu_data,
>  {
>  	struct tce_container *container = iommu_data;
>  	struct iommu_table_group *table_group;
> +	long i;
>  
>  	mutex_lock(&container->lock);
>  	if (iommu_group != container->grp) {
> @@ -620,8 +633,24 @@ static void tce_iommu_detach_group(void *iommu_data,
>  	/* Kernel owns the device now, we can restore bypass */
>  	if (!table_group->ops || !table_group->ops->release_ownership)
>  		tce_iommu_release_ownership(container, table_group);
> -	else
> +	else if (!table_group->ops->unset_window)
> +		WARN_ON_ONCE(1);
> +	else {
> +		for (i = 0; i < IOMMU_TABLE_GROUP_MAX_TABLES; ++i) {
> +			struct iommu_table tbl = table_group->tables[i];
> +
> +			if (!tbl.it_size)
> +				continue;
> +
> +			table_group->ops->unset_window(table_group, i);
> +			tce_iommu_clear(container, &tbl,
> +					tbl.it_offset, tbl.it_size);
> +			if (tbl.it_ops->free)
> +				tbl.it_ops->free(&tbl);
> +		}
> +
>  		table_group->ops->release_ownership(table_group);
> +	}
>  
>  unlock_exit:
>  	mutex_unlock(&container->lock);

-- 
David Gibson			| I'll have my music baroque, and my code
david AT gibson.dropbear.id.au	| minimalist, thank you.  NOT _the_ _other_
				| _way_ _around_!
http://www.ozlabs.org/~dgibson

[-- Attachment #2: Type: application/pgp-signature, Size: 819 bytes --]

  reply	other threads:[~2015-04-29  5:50 UTC|newest]

Thread overview: 110+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2015-04-25 12:14 [PATCH kernel v9 00/32] powerpc/iommu/vfio: Enable Dynamic DMA windows Alexey Kardashevskiy
2015-04-25 12:14 ` [PATCH kernel v9 01/32] powerpc/iommu: Split iommu_free_table into 2 helpers Alexey Kardashevskiy
2015-04-29  2:03   ` David Gibson
2015-04-25 12:14 ` [PATCH kernel v9 02/32] Revert "powerpc/powernv: Allocate struct pnv_ioda_pe iommu_table dynamically" Alexey Kardashevskiy
2015-04-27 21:05   ` Alex Williamson
2015-04-29  2:05   ` David Gibson
2015-04-25 12:14 ` [PATCH kernel v9 03/32] vfio: powerpc/spapr: Move page pinning from arch code to VFIO IOMMU driver Alexey Kardashevskiy
2015-04-25 12:14 ` [PATCH kernel v9 04/32] vfio: powerpc/spapr: Check that IOMMU page is fully contained by system page Alexey Kardashevskiy
2015-04-25 12:14 ` [PATCH kernel v9 05/32] vfio: powerpc/spapr: Use it_page_size Alexey Kardashevskiy
2015-04-25 12:14 ` [PATCH kernel v9 06/32] vfio: powerpc/spapr: Move locked_vm accounting to helpers Alexey Kardashevskiy
2015-04-25 12:14 ` [PATCH kernel v9 07/32] vfio: powerpc/spapr: Disable DMA mappings on disabled container Alexey Kardashevskiy
2015-04-25 12:14 ` [PATCH kernel v9 08/32] vfio: powerpc/spapr: Moving pinning/unpinning to helpers Alexey Kardashevskiy
2015-04-29  2:14   ` David Gibson
2015-04-25 12:14 ` [PATCH kernel v9 09/32] vfio: powerpc/spapr: Rework groups attaching Alexey Kardashevskiy
2015-04-29  2:16   ` David Gibson
2015-04-30  2:29     ` Alexey Kardashevskiy
2015-04-30  4:05       ` David Gibson
2015-04-25 12:14 ` [PATCH kernel v9 10/32] powerpc/powernv: Do not set "read" flag if direction==DMA_NONE Alexey Kardashevskiy
2015-04-25 12:14 ` [PATCH kernel v9 11/32] powerpc/iommu: Move tce_xxx callbacks from ppc_md to iommu_table Alexey Kardashevskiy
2015-04-25 12:14 ` [PATCH kernel v9 12/32] powerpc/spapr: vfio: Switch from iommu_table to new iommu_table_group Alexey Kardashevskiy
2015-04-29  2:49   ` David Gibson
2015-04-30  2:30     ` Alexey Kardashevskiy
2015-04-25 12:14 ` [PATCH kernel v9 13/32] vfio: powerpc/spapr/iommu/powernv/ioda2: Rework IOMMU ownership control Alexey Kardashevskiy
2015-04-29  3:02   ` David Gibson
2015-04-29  9:19     ` Alexey Kardashevskiy
2015-04-30  4:08       ` David Gibson
2015-04-25 12:14 ` [PATCH kernel v9 14/32] powerpc/iommu: Fix IOMMU ownership control functions Alexey Kardashevskiy
2015-04-29  3:08   ` David Gibson
2015-04-25 12:14 ` [PATCH kernel v9 15/32] powerpc/powernv/ioda/ioda2: Rework TCE invalidation in tce_build()/tce_free() Alexey Kardashevskiy
2015-04-29  3:18   ` David Gibson
2015-04-30  2:58     ` Alexey Kardashevskiy
2015-04-30  4:16       ` David Gibson
2015-04-25 12:14 ` [PATCH kernel v9 16/32] powerpc/powernv/ioda: Move TCE kill register address to PE Alexey Kardashevskiy
2015-04-27 21:05   ` Alex Williamson
2015-04-29  3:25   ` David Gibson
2015-04-29  9:00     ` Alexey Kardashevskiy
2015-04-30  4:18       ` David Gibson
2015-04-25 12:14 ` [PATCH kernel v9 17/32] powerpc/powernv: Implement accessor to TCE entry Alexey Kardashevskiy
2015-04-29  4:04   ` David Gibson
2015-04-29  9:02     ` Alexey Kardashevskiy
2015-04-30  0:13       ` David Gibson
2015-04-25 12:14 ` [PATCH kernel v9 18/32] powerpc/iommu/powernv: Release replaced TCE Alexey Kardashevskiy
2015-04-29  4:18   ` David Gibson
2015-04-29  9:51     ` Alexey Kardashevskiy
2015-04-30  4:21       ` David Gibson
2015-04-25 12:14 ` [PATCH kernel v9 19/32] powerpc/powernv/ioda2: Rework iommu_table creation Alexey Kardashevskiy
2015-04-29  4:27   ` David Gibson
2015-04-25 12:14 ` [PATCH kernel v9 20/32] powerpc/powernv/ioda2: Introduce pnv_pci_create_table/pnv_pci_free_table Alexey Kardashevskiy
2015-04-29  4:39   ` David Gibson
2015-04-29  9:12     ` Alexey Kardashevskiy
2015-04-30  4:24       ` David Gibson
2015-05-01 10:13     ` Alexey Kardashevskiy
2015-04-25 12:14 ` [PATCH kernel v9 21/32] powerpc/powernv/ioda2: Introduce pnv_pci_ioda2_set_window Alexey Kardashevskiy
2015-04-29  4:45   ` David Gibson
2015-04-29  9:26     ` Alexey Kardashevskiy
2015-04-30  4:32       ` David Gibson
2015-04-25 12:14 ` [PATCH kernel v9 22/32] powerpc/powernv: Implement multilevel TCE tables Alexey Kardashevskiy
2015-04-29  5:04   ` David Gibson
2015-05-01  9:48     ` Alexey Kardashevskiy
2015-05-05 12:05       ` David Gibson
2015-04-25 12:14 ` [PATCH kernel v9 23/32] powerpc/powernv/ioda: Define and implement DMA table/window management callbacks Alexey Kardashevskiy
2015-04-29  5:30   ` David Gibson
2015-04-29  9:44     ` Alexey Kardashevskiy
2015-04-30  4:37       ` David Gibson
2015-04-30  9:56         ` Alexey Kardashevskiy
2015-05-01  3:36           ` David Gibson
2015-04-25 12:14 ` [PATCH kernel v9 24/32] powerpc/powernv/ioda2: Use new helpers to do proper cleanup on PE release Alexey Kardashevskiy
2015-04-25 12:14 ` [PATCH kernel v9 25/32] vfio: powerpc/spapr: powerpc/powernv/ioda2: Rework ownership Alexey Kardashevskiy
2015-04-29  5:39   ` David Gibson [this message]
2015-04-25 12:14 ` [PATCH kernel v9 26/32] powerpc/iommu: Add userspace view of TCE table Alexey Kardashevskiy
2015-04-29  6:31   ` David Gibson
2015-05-01  4:01     ` Alexey Kardashevskiy
2015-05-01  4:23       ` David Gibson
2015-05-01  7:12         ` Alexey Kardashevskiy
2015-05-05 12:02           ` David Gibson
2015-05-11  2:11             ` Alexey Kardashevskiy
2015-05-11  4:52               ` Alexey Kardashevskiy
2015-04-25 12:14 ` [PATCH kernel v9 27/32] powerpc/iommu/ioda2: Add get_table_size() to calculate the size of future table Alexey Kardashevskiy
2015-04-29  6:40   ` David Gibson
2015-05-01  4:10     ` Alexey Kardashevskiy
2015-05-01  5:12       ` David Gibson
2015-05-01  6:53         ` Alexey Kardashevskiy
2015-05-05 11:58           ` David Gibson
2015-05-11  2:24             ` Alexey Kardashevskiy
2015-04-25 12:14 ` [PATCH kernel v9 28/32] powerpc/mmu: Add userspace-to-physical addresses translation cache Alexey Kardashevskiy
2015-04-29  7:01   ` David Gibson
2015-05-01 11:26     ` Alexey Kardashevskiy
2015-05-05 12:12       ` David Gibson
2015-04-30  6:34   ` David Gibson
2015-04-30  8:25     ` Paul Mackerras
2015-05-01  3:39       ` David Gibson
2015-04-25 12:14 ` [PATCH kernel v9 29/32] vfio: powerpc/spapr: Register memory and define IOMMU v2 Alexey Kardashevskiy
2015-04-30  6:55   ` David Gibson
2015-05-01  4:35     ` Alexey Kardashevskiy
2015-05-01  5:23       ` David Gibson
2015-05-01  6:27         ` Alexey Kardashevskiy
2015-05-05 11:53           ` David Gibson
2015-04-25 12:14 ` [PATCH kernel v9 30/32] vfio: powerpc/spapr: Use 32bit DMA window properties from table_group Alexey Kardashevskiy
2015-04-27 22:18   ` Alex Williamson
2015-04-30  6:58   ` David Gibson
2015-04-25 12:14 ` [PATCH kernel v9 31/32] vfio: powerpc/spapr: Support multiple groups in one container if possible Alexey Kardashevskiy
2015-04-30  7:22   ` David Gibson
2015-04-30  9:33     ` Alexey Kardashevskiy
2015-05-01  0:46       ` Benjamin Herrenschmidt
2015-05-01  4:44         ` David Gibson
2015-05-01  4:33       ` David Gibson
2015-05-01  6:05         ` Alexey Kardashevskiy
2015-05-05 11:50           ` David Gibson
2015-05-11  2:26             ` Alexey Kardashevskiy
2015-04-25 12:14 ` [PATCH kernel v9 32/32] vfio: powerpc/spapr: Support Dynamic DMA windows Alexey Kardashevskiy

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=20150429053928.GU32589@voom.redhat.com \
    --to=david@gibson.dropbear.id.au \
    --cc=aik@ozlabs.ru \
    --cc=alex.williamson@redhat.com \
    --cc=gwshan@linux.vnet.ibm.com \
    --cc=linux-kernel@vger.kernel.org \
    --cc=linuxppc-dev@lists.ozlabs.org \
    --cc=paulus@samba.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).