From: Wei Yang <weiyang@linux.vnet.ibm.com> To: Bjorn Helgaas <bhelgaas@google.com> Cc: Wei Yang <weiyang@linux.vnet.ibm.com>, benh@au1.ibm.com, gwshan@linux.vnet.ibm.com, linux-pci@vger.kernel.org, linuxppc-dev@lists.ozlabs.org Subject: Re: [PATCH v12 10/21] PCI: Consider additional PF's IOV BAR alignment in sizing and assigning Date: Wed, 11 Mar 2015 17:17:35 +0800 [thread overview] Message-ID: <20150311091735.GE5826@richard> (raw) In-Reply-To: <20150311023658.GA10994@google.com> On Tue, Mar 10, 2015 at 09:36:58PM -0500, Bjorn Helgaas wrote: >On Mon, Mar 02, 2015 at 03:32:47PM +0800, Wei Yang wrote: >> On Tue, Feb 24, 2015 at 02:41:52AM -0600, Bjorn Helgaas wrote: >> >On Tue, Feb 24, 2015 at 02:34:06AM -0600, Bjorn Helgaas wrote: >> >> From: Wei Yang <weiyang@linux.vnet.ibm.com> >> >> >> >> When sizing and assigning resources, we divide the resources into two >> >> lists: the requested list and the additional list. We don't consider the >> >> alignment of additional VF(n) BAR space. >> >> >> >> This is reasonable because the alignment required for the VF(n) BAR space >> >> is the size of an individual VF BAR, not the size of the space for *all* >> >> VFs. But some platforms, e.g., PowerNV, require additional alignment. >> >> >> >> Consider the additional IOV BAR alignment when sizing and assigning >> >> resources. When there is not enough system MMIO space, the PF's IOV BAR >> >> alignment will not contribute to the bridge. When there is enough system >> >> MMIO space, the additional alignment will contribute to the bridge. >> > >> >I don't understand the ""when there is not enough system MMIO space" part. >> >How do we tell if there's enough MMIO space? >> > >> >> In __assign_resources_sorted(), it has two resources list, one for requested >> (head) and one for additional (realloc_head). This function will first try to >> combine them and assign. If failed, this means we don't have enough MMIO >> space. > >How about this text: > > This is because the alignment required for the VF(n) BAR space is the size > of an individual VF BAR, not the size of the space for *all* VFs. But we > want additional alignment to support partitioning on PowerNV. > > Consider the additional IOV BAR alignment when sizing and assigning > resources. When there is not enough system MMIO space to accomodate both > the requested list and the additional list, the PF's IOV BAR alignment will > not contribute to the bridge. When there is enough system MMIO space for > both lists, the additional alignment will contribute to the bridge. > >We're doing something specifically for PowerNV. I would really like to be >able to read this patch and say "Oh, here's the hook where we get the >PowerNV behavior, and it's obvious that other platforms are unaffected." >But I don't see a pcibios or similar hook, so I don't know where that >PowerNV behavior is. > >Is it something to do with get_res_add_align()? That uses min_align, but I >don't know how that's connected ... ah, I see, "add_align" is computed >from pci_resource_alignment(), which has this path: > > pci_resource_alignment > pci_sriov_resource_alignment > pcibios_iov_resource_alignment > >and powerpc has a special pcibios_iov_resource_alignment() for PowerNV. > Thanks for the text. I have added these in the change log and some description about how it give arch a chance to be involved. >> >> Also, take advantage of pci_dev_resource::min_align to store this >> >> additional alignment. >> > >> >This comment doesn't seem to make sense; this patch doesn't save anything >> >in min_align. >> >> At the end of this patch: >> >> add_to_list(realloc_head, bus->self, b_res, size1-size0, add_align); >> >> The add_align is stored in pci_dev_resource::min_align in add_to_list(). And >> retrieved by get_res_add_align() in below code. This field is not used >> previously, so I took advantage of this field to store the alignment of the >> additional resources. > >Hmm. pci_dev_resource::min_align *is* already used in >reassign_resources_sorted(). Maybe there's no overlap; I gave up the >analysis before I could convince myself. > Bjorn, I know you may have some concern on this, let me try to explain how I understand the code. If my understanding is not correct, please let me know. In __assign_resources_sorted(), we pass two resources list, one is required and the other is the additional. First, we try our best to assigned both of them by merge them together. If this fails, we will assign the required list first and then take care of the additional list. There is one interesting thing in the first step. We merge these two list to the required list and in this patch I fix the alignment in required list. (Which is the "head" list in code.) And before doing so, we save the original information in "save_head". When we fail to assign the merged list, we will restore the required list, this mean we clean the alignment done in this patch and make sure we assign the required resource just with basic alignment. The usage of the min_align in reassign_resources_sorted() happens in the second part to assign the additional list individually. In the realloc_head list, those resources still have the add_align which is calculated in pbus_size_mem(). And we try to allocate it with this alignment, which is exactly what we want. BTW, by reading the code again, it looks I missed to change one place in reassign_resources_sorted(). In condition when (!resource_size(res)) is true, we rely on the res->start to be the alignment. Since the alignment is no longer the start address, we need to fix this part too. I may miss some background of the code, if my understanding is not correct, glad to hear from you. >The changelog needs to mention the add_to_list() connection. > Added in the change log. >> >> + /* >> >> + * There are two kinds of additional resources in the list: >> >> + * 1. bridge resource -- IORESOURCE_STARTALIGN >> >> + * 2. SR-IOV resource -- IORESOURCE_SIZEALIGN >> >> + * Here just fix the additional alignment for bridge >> >> + */ >> >> + if (!(dev_res->res->flags & IORESOURCE_STARTALIGN)) >> >> + continue; >> >> + >> >> + add_align = get_res_add_align(realloc_head, dev_res->res); >> >> + >> >> + /* Reorder the list by their alignment */ >> > >> >Why do we need to reorder the list by alignment? >> >> Resource list "head" is sorted by the alignment, while the alignment would be >> changed after we considering the additional resource. >> >> Take powernv platform as an example. The IOV BAR is expanded and need to be >> aligned with its total size instead of the individual VF BAR size. If we don't >> reorder it, the IOV BAR would be assigned after some other resources, which >> may cause the real assignment fail even the total size is enough. > >This is worthy of a comment in the code. > >Bjorn >-- >To unsubscribe from this list: send the line "unsubscribe linux-pci" in >the body of a message to majordomo@vger.kernel.org >More majordomo info at http://vger.kernel.org/majordomo-info.html -- Richard Yang Help you, Help me
WARNING: multiple messages have this Message-ID (diff)
From: Wei Yang <weiyang@linux.vnet.ibm.com> To: Bjorn Helgaas <bhelgaas@google.com> Cc: linux-pci@vger.kernel.org, Wei Yang <weiyang@linux.vnet.ibm.com>, benh@au1.ibm.com, linuxppc-dev@lists.ozlabs.org, gwshan@linux.vnet.ibm.com Subject: Re: [PATCH v12 10/21] PCI: Consider additional PF's IOV BAR alignment in sizing and assigning Date: Wed, 11 Mar 2015 17:17:35 +0800 [thread overview] Message-ID: <20150311091735.GE5826@richard> (raw) In-Reply-To: <20150311023658.GA10994@google.com> On Tue, Mar 10, 2015 at 09:36:58PM -0500, Bjorn Helgaas wrote: >On Mon, Mar 02, 2015 at 03:32:47PM +0800, Wei Yang wrote: >> On Tue, Feb 24, 2015 at 02:41:52AM -0600, Bjorn Helgaas wrote: >> >On Tue, Feb 24, 2015 at 02:34:06AM -0600, Bjorn Helgaas wrote: >> >> From: Wei Yang <weiyang@linux.vnet.ibm.com> >> >> >> >> When sizing and assigning resources, we divide the resources into two >> >> lists: the requested list and the additional list. We don't consider the >> >> alignment of additional VF(n) BAR space. >> >> >> >> This is reasonable because the alignment required for the VF(n) BAR space >> >> is the size of an individual VF BAR, not the size of the space for *all* >> >> VFs. But some platforms, e.g., PowerNV, require additional alignment. >> >> >> >> Consider the additional IOV BAR alignment when sizing and assigning >> >> resources. When there is not enough system MMIO space, the PF's IOV BAR >> >> alignment will not contribute to the bridge. When there is enough system >> >> MMIO space, the additional alignment will contribute to the bridge. >> > >> >I don't understand the ""when there is not enough system MMIO space" part. >> >How do we tell if there's enough MMIO space? >> > >> >> In __assign_resources_sorted(), it has two resources list, one for requested >> (head) and one for additional (realloc_head). This function will first try to >> combine them and assign. If failed, this means we don't have enough MMIO >> space. > >How about this text: > > This is because the alignment required for the VF(n) BAR space is the size > of an individual VF BAR, not the size of the space for *all* VFs. But we > want additional alignment to support partitioning on PowerNV. > > Consider the additional IOV BAR alignment when sizing and assigning > resources. When there is not enough system MMIO space to accomodate both > the requested list and the additional list, the PF's IOV BAR alignment will > not contribute to the bridge. When there is enough system MMIO space for > both lists, the additional alignment will contribute to the bridge. > >We're doing something specifically for PowerNV. I would really like to be >able to read this patch and say "Oh, here's the hook where we get the >PowerNV behavior, and it's obvious that other platforms are unaffected." >But I don't see a pcibios or similar hook, so I don't know where that >PowerNV behavior is. > >Is it something to do with get_res_add_align()? That uses min_align, but I >don't know how that's connected ... ah, I see, "add_align" is computed >from pci_resource_alignment(), which has this path: > > pci_resource_alignment > pci_sriov_resource_alignment > pcibios_iov_resource_alignment > >and powerpc has a special pcibios_iov_resource_alignment() for PowerNV. > Thanks for the text. I have added these in the change log and some description about how it give arch a chance to be involved. >> >> Also, take advantage of pci_dev_resource::min_align to store this >> >> additional alignment. >> > >> >This comment doesn't seem to make sense; this patch doesn't save anything >> >in min_align. >> >> At the end of this patch: >> >> add_to_list(realloc_head, bus->self, b_res, size1-size0, add_align); >> >> The add_align is stored in pci_dev_resource::min_align in add_to_list(). And >> retrieved by get_res_add_align() in below code. This field is not used >> previously, so I took advantage of this field to store the alignment of the >> additional resources. > >Hmm. pci_dev_resource::min_align *is* already used in >reassign_resources_sorted(). Maybe there's no overlap; I gave up the >analysis before I could convince myself. > Bjorn, I know you may have some concern on this, let me try to explain how I understand the code. If my understanding is not correct, please let me know. In __assign_resources_sorted(), we pass two resources list, one is required and the other is the additional. First, we try our best to assigned both of them by merge them together. If this fails, we will assign the required list first and then take care of the additional list. There is one interesting thing in the first step. We merge these two list to the required list and in this patch I fix the alignment in required list. (Which is the "head" list in code.) And before doing so, we save the original information in "save_head". When we fail to assign the merged list, we will restore the required list, this mean we clean the alignment done in this patch and make sure we assign the required resource just with basic alignment. The usage of the min_align in reassign_resources_sorted() happens in the second part to assign the additional list individually. In the realloc_head list, those resources still have the add_align which is calculated in pbus_size_mem(). And we try to allocate it with this alignment, which is exactly what we want. BTW, by reading the code again, it looks I missed to change one place in reassign_resources_sorted(). In condition when (!resource_size(res)) is true, we rely on the res->start to be the alignment. Since the alignment is no longer the start address, we need to fix this part too. I may miss some background of the code, if my understanding is not correct, glad to hear from you. >The changelog needs to mention the add_to_list() connection. > Added in the change log. >> >> + /* >> >> + * There are two kinds of additional resources in the list: >> >> + * 1. bridge resource -- IORESOURCE_STARTALIGN >> >> + * 2. SR-IOV resource -- IORESOURCE_SIZEALIGN >> >> + * Here just fix the additional alignment for bridge >> >> + */ >> >> + if (!(dev_res->res->flags & IORESOURCE_STARTALIGN)) >> >> + continue; >> >> + >> >> + add_align = get_res_add_align(realloc_head, dev_res->res); >> >> + >> >> + /* Reorder the list by their alignment */ >> > >> >Why do we need to reorder the list by alignment? >> >> Resource list "head" is sorted by the alignment, while the alignment would be >> changed after we considering the additional resource. >> >> Take powernv platform as an example. The IOV BAR is expanded and need to be >> aligned with its total size instead of the individual VF BAR size. If we don't >> reorder it, the IOV BAR would be assigned after some other resources, which >> may cause the real assignment fail even the total size is enough. > >This is worthy of a comment in the code. > >Bjorn >-- >To unsubscribe from this list: send the line "unsubscribe linux-pci" in >the body of a message to majordomo@vger.kernel.org >More majordomo info at http://vger.kernel.org/majordomo-info.html -- Richard Yang Help you, Help me
next prev parent reply other threads:[~2015-03-11 9:18 UTC|newest] Thread overview: 69+ messages / expand[flat|nested] mbox.gz Atom feed top 2015-02-24 8:32 [PATCH v12 00/21] Enable SRIOV on Power8 Bjorn Helgaas 2015-02-24 8:33 ` [PATCH v12 01/21] PCI: Print more info in sriov_enable() error message Bjorn Helgaas 2015-02-24 8:33 ` [PATCH v12 02/21] PCI: Print PF SR-IOV resource that contains all VF(n) BAR space Bjorn Helgaas 2015-02-24 8:33 ` [PATCH v12 03/21] PCI: Keep individual VF BAR size in struct pci_sriov Bjorn Helgaas 2015-02-24 8:33 ` [PATCH v12 04/21] PCI: Index IOV resources in the conventional style Bjorn Helgaas 2015-02-24 8:33 ` [PATCH v12 05/21] PCI: Refresh First VF Offset and VF Stride when updating NumVFs Bjorn Helgaas 2015-02-24 8:33 ` [PATCH v12 06/21] PCI: Calculate maximum number of buses required for VFs Bjorn Helgaas 2015-02-24 8:33 ` [PATCH v12 07/21] PCI: Export pci_iov_virtfn_bus() and pci_iov_virtfn_devfn() Bjorn Helgaas 2015-02-24 8:33 ` [PATCH v12 08/21] PCI: Add pcibios_sriov_enable() and pcibios_sriov_disable() Bjorn Helgaas 2015-02-24 8:39 ` Bjorn Helgaas 2015-03-02 6:53 ` Wei Yang 2015-03-02 6:53 ` Wei Yang 2015-02-24 8:33 ` [PATCH v12 09/21] PCI: Add pcibios_iov_resource_alignment() interface Bjorn Helgaas 2015-02-24 8:34 ` [PATCH v12 10/21] PCI: Consider additional PF's IOV BAR alignment in sizing and assigning Bjorn Helgaas 2015-02-24 8:41 ` Bjorn Helgaas 2015-03-02 7:32 ` Wei Yang 2015-03-02 7:32 ` Wei Yang 2015-03-11 2:36 ` Bjorn Helgaas 2015-03-11 2:36 ` Bjorn Helgaas 2015-03-11 9:17 ` Wei Yang [this message] 2015-03-11 9:17 ` Wei Yang 2015-02-24 8:34 ` [PATCH v12 11/21] powerpc/pci: Don't unset PCI resources for VFs Bjorn Helgaas 2015-02-24 8:44 ` Bjorn Helgaas 2015-03-02 7:34 ` Wei Yang 2015-03-02 7:34 ` Wei Yang 2015-02-24 8:34 ` [PATCH v12 12/21] powerpc/pci: Refactor pci_dn Bjorn Helgaas 2015-02-24 8:34 ` [PATCH v12 13/21] powerpc/powernv: Use pci_dn, not device_node, in PCI config accessor Bjorn Helgaas 2015-02-24 8:34 ` [PATCH v12 14/21] powerpc/powernv: Allocate struct pnv_ioda_pe iommu_table dynamically Bjorn Helgaas 2015-02-24 8:46 ` Bjorn Helgaas 2015-03-02 7:50 ` Wei Yang 2015-03-02 7:50 ` Wei Yang 2015-03-02 7:56 ` Benjamin Herrenschmidt 2015-03-02 7:56 ` Benjamin Herrenschmidt 2015-03-02 8:02 ` Wei Yang 2015-03-02 8:02 ` Wei Yang 2015-03-11 2:47 ` Bjorn Helgaas 2015-03-11 2:47 ` Bjorn Helgaas 2015-03-11 6:13 ` Wei Yang 2015-03-11 6:13 ` Wei Yang 2015-02-24 8:34 ` [PATCH v12 15/21] powerpc/powernv: Reserve additional space for IOV BAR according to the number of total_pe Bjorn Helgaas 2015-02-24 8:52 ` Bjorn Helgaas 2015-03-02 7:41 ` Wei Yang 2015-03-02 7:41 ` Wei Yang 2015-03-11 2:51 ` Bjorn Helgaas 2015-03-11 2:51 ` Bjorn Helgaas 2015-03-11 6:22 ` Wei Yang 2015-03-11 6:22 ` Wei Yang 2015-03-11 13:40 ` Bjorn Helgaas 2015-03-11 13:40 ` Bjorn Helgaas 2015-02-24 8:34 ` [PATCH v12 16/21] powerpc/powernv: Implement pcibios_iov_resource_alignment() on powernv Bjorn Helgaas 2015-02-24 8:34 ` [PATCH v12 17/21] powerpc/powernv: Shift VF resource with an offset Bjorn Helgaas 2015-02-24 9:00 ` Bjorn Helgaas 2015-02-24 17:10 ` Bjorn Helgaas 2015-03-02 7:58 ` Wei Yang 2015-03-02 7:58 ` Wei Yang 2015-03-04 3:01 ` Wei Yang 2015-03-04 3:01 ` Wei Yang 2015-03-11 2:55 ` Bjorn Helgaas 2015-03-11 2:55 ` Bjorn Helgaas 2015-03-11 6:42 ` Wei Yang 2015-03-11 6:42 ` Wei Yang 2015-02-24 9:03 ` Bjorn Helgaas 2015-02-24 8:35 ` [PATCH v12 18/21] powerpc/powernv: Reserve additional space for IOV BAR, with m64_per_iov supported Bjorn Helgaas 2015-02-24 9:06 ` Bjorn Helgaas 2015-03-02 7:55 ` Wei Yang 2015-03-02 7:55 ` Wei Yang 2015-02-24 8:35 ` [PATCH v12 19/21] powerpc/powernv: Group VF PE when IOV BAR is big on PHB3 Bjorn Helgaas 2015-02-24 8:35 ` [PATCH v12 20/21] powerpc/pci: Remove unused struct pci_dn.pcidev field Bjorn Helgaas 2015-02-24 8:35 ` [PATCH v12 21/21] powerpc/pci: Add PCI resource alignment documentation Bjorn Helgaas
Reply instructions: You may reply publicly to this message via plain-text email using any one of the following methods: * Save the following mbox file, import it into your mail client, and reply-to-all from there: mbox Avoid top-posting and favor interleaved quoting: https://en.wikipedia.org/wiki/Posting_style#Interleaved_style * Reply using the --to, --cc, and --in-reply-to switches of git-send-email(1): git send-email \ --in-reply-to=20150311091735.GE5826@richard \ --to=weiyang@linux.vnet.ibm.com \ --cc=benh@au1.ibm.com \ --cc=bhelgaas@google.com \ --cc=gwshan@linux.vnet.ibm.com \ --cc=linux-pci@vger.kernel.org \ --cc=linuxppc-dev@lists.ozlabs.org \ /path/to/YOUR_REPLY https://kernel.org/pub/software/scm/git/docs/git-send-email.html * If your mail client supports setting the In-Reply-To header via mailto: links, try the mailto: linkBe sure your reply has a Subject: header at the top and a blank line before the message body.
This is an external index of several public inboxes, see mirroring instructions on how to clone and mirror all data and code used by this external index.