xen-devel.lists.xenproject.org archive mirror
 help / color / mirror / Atom feed
From: Julien Grall <julien.grall@arm.com>
To: Stefano Stabellini <sstabellini@kernel.org>
Cc: proskurin@sec.in.tum.de, steve.capper@arm.com,
	wei.chen@linaro.org, xen-devel@lists.xen.org
Subject: Re: [PATCH 18/22] xen/arm: p2m: Rework the context switch to another VTTBR in flush_tlb_domain
Date: Wed, 27 Jul 2016 11:22:39 +0100	[thread overview]
Message-ID: <4d38135a-4567-34e0-3580-b951f6f695a6@arm.com> (raw)
In-Reply-To: <alpine.DEB.2.10.1607261809270.12319@sstabellini-ThinkPad-X260>

Hi Stefano,

On 27/07/16 02:12, Stefano Stabellini wrote:
> On Wed, 20 Jul 2016, Julien Grall wrote:
>> The current implementation of flush_tlb_domain is relying on the domain
>> to have a single p2m. With the upcoming feature altp2m, a single domain
>> may have different p2m. So we would need to switch to the correct p2m in
>> order to flush the TLBs.
>>
>> Rather than checking whether the domain is not the current domain, check
>> whether the VTTBR is different. The resulting assembly code is much
>> smaller: from 38 instructions (+ 2 functions call) to 22 instructions.
>
> That's true but SYSREG reads are more expensive than regular
> instructions.

This argument is not really true. The ARM ARM (D7-1879 in ARM DDI 
0487A.j) says: "Reads of the System registers can occur out of order 
with respect to earlier instructions executed on the same PE, provided 
that any data dependencies between the instructions are respected". So 
It will depend on how the micro-architecture implemented access to SYSREG.

However, the current code already contains plenty of SYSREG read access 
(via the macro current using TPIDR_EL2). So the number of SYSREG 
accesses stay exactly the same.

I also forgot to mention that the number of instructions in the function 
call (10 instructions). So we are down from 58 instructions to 22 
instructions.

Therefore, smaller code and likely better performance.

>
>
>> Signed-off-by: Julien Grall <julien.grall@arm.com>
>> ---
>>  xen/arch/arm/p2m.c | 18 +++++++++++-------
>>  1 file changed, 11 insertions(+), 7 deletions(-)
>>
>> diff --git a/xen/arch/arm/p2m.c b/xen/arch/arm/p2m.c
>> index d1b6009..015c1e8 100644
>> --- a/xen/arch/arm/p2m.c
>> +++ b/xen/arch/arm/p2m.c
>> @@ -151,24 +151,28 @@ void p2m_restore_state(struct vcpu *n)
>>
>>  void flush_tlb_domain(struct domain *d)
>>  {
>> +    struct p2m_domain *p2m = &d->arch.p2m;
>>      unsigned long flags = 0;
>> +    uint64_t ovttbr;
>>
>>      /*
>> -     * Update the VTTBR if necessary with the domain d. In this case,
>> -     * it's only necessary to flush TLBs on every CPUs with the current VMID
>> -     * (our domain).
>> +     * ARM only provides an instruction to flush TLBs for the current
>> +     * VMID. So switch to the VTTBR of a given P2M if different.
>>       */
>> -    if ( d != current->domain )
>> +    ovttbr = READ_SYSREG64(VTTBR_EL2);
>> +    if ( ovttbr != p2m->vttbr )
>>      {
>>          local_irq_save(flags);
>> -        p2m_load_VTTBR(d);
>> +        WRITE_SYSREG64(p2m->vttbr, VTTBR_EL2);
>> +        isb();
>>      }
>>
>>      flush_tlb();
>>
>> -    if ( d != current->domain )
>> +    if ( ovttbr != READ_SYSREG64(VTTBR_EL2) )
>
> You should be able to remove this second SYSREG read and optimize the
> code further.

I should be able, however I think it will not bring much more 
optimization here but obfuscating a bit more the code.

Regards,

-- 
Julien Grall

_______________________________________________
Xen-devel mailing list
Xen-devel@lists.xen.org
https://lists.xen.org/xen-devel

  reply	other threads:[~2016-07-27 10:22 UTC|newest]

Thread overview: 80+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2016-07-20 16:10 [PATCH 00/22] xen/arm: P2M clean-up and fixes Julien Grall
2016-07-20 16:10 ` [PATCH 01/22] xen/arm: system: Use the correct parameter name in local_irq_restore Julien Grall
2016-07-22  1:19   ` Stefano Stabellini
2016-07-20 16:10 ` [PATCH 02/22] xen/arm: p2m: Pass the vCPU in parameter to get_page_from_gva Julien Grall
2016-07-22  1:22   ` Stefano Stabellini
2016-07-20 16:10 ` [PATCH 03/22] xen/arm: p2m: Restrict usage of get_page_from_gva to the current vCPU Julien Grall
2016-07-22  1:25   ` Stefano Stabellini
2016-07-20 16:10 ` [PATCH 04/22] xen/arm: p2m: Fix multi-lines coding style comments Julien Grall
2016-07-22  1:26   ` Stefano Stabellini
2016-07-20 16:10 ` [PATCH 05/22] xen/arm: p2m: Clean-up mfn_to_p2m_entry Julien Grall
2016-07-26 22:24   ` Stefano Stabellini
2016-07-20 16:10 ` [PATCH 06/22] xen/arm: p2m: Use the typesafe MFN in mfn_to_p2m_entry Julien Grall
2016-07-26 22:28   ` Stefano Stabellini
2016-07-27  9:54     ` Julien Grall
2016-07-27 18:25       ` Stefano Stabellini
2016-07-27 20:14         ` Julien Grall
2016-07-20 16:10 ` [PATCH 07/22] xen/arm: p2m: Use p2m_is_foreign in get_page_from_gfn to avoid open coding Julien Grall
2016-07-26 22:33   ` Stefano Stabellini
2016-07-20 16:10 ` [PATCH 08/22] xen/arm: p2m: Simplify p2m type check by using bitmask Julien Grall
2016-07-26 22:36   ` Stefano Stabellini
2016-07-20 16:10 ` [PATCH 09/22] xen/arm: p2m: Use a whitelist rather than blacklist in get_page_from_gfn Julien Grall
2016-07-26 22:44   ` Stefano Stabellini
2016-07-27  9:59     ` Julien Grall
2016-07-27 17:56       ` Stefano Stabellini
2016-07-27 17:57         ` Julien Grall
2016-07-20 16:10 ` [PATCH 10/22] xen/arm: p2m: Differentiate cacheable vs non-cacheable MMIO Julien Grall
2016-07-26 22:47   ` Stefano Stabellini
2016-07-20 16:10 ` [PATCH 11/22] xen/arm: p2m: Find the memory attributes based on the p2m type Julien Grall
2016-07-27  0:41   ` Stefano Stabellini
2016-07-27 17:15   ` Julien Grall
2016-07-27 17:55     ` Stefano Stabellini
2016-07-27 20:15       ` Julien Grall
2016-07-20 16:10 ` [PATCH 12/22] xen/arm: p2m: Remove unnecessary locking Julien Grall
2016-07-27  0:47   ` Stefano Stabellini
2016-07-20 16:10 ` [PATCH 13/22] xen/arm: p2m: Introduce p2m_{read, write}_{, un}lock helpers Julien Grall
2016-07-27  0:50   ` Stefano Stabellini
2016-07-20 16:10 ` [PATCH 14/22] xen/arm: p2m: Switch the p2m lock from spinlock to rwlock Julien Grall
2016-07-27  0:51   ` Stefano Stabellini
2016-07-20 16:10 ` [PATCH 15/22] xen/arm: Don't call p2m_alloc_table from arch_domain_create Julien Grall
2016-07-22  8:32   ` Sergej Proskurin
2016-07-22  9:18     ` Julien Grall
2016-07-22 10:16       ` Sergej Proskurin
2016-07-22 10:26         ` Julien Grall
2016-07-22 10:39           ` Sergej Proskurin
2016-07-22 10:38             ` Julien Grall
2016-07-22 11:05               ` Sergej Proskurin
2016-07-22 13:00                 ` Julien Grall
2016-07-23 17:59                   ` Sergej Proskurin
2016-07-27  0:54   ` Stefano Stabellini
2016-07-20 16:10 ` [PATCH 16/22] xen/arm: p2m: Move the vttbr field from arch_domain to p2m_domain Julien Grall
2016-07-22  7:46   ` Sergej Proskurin
2016-07-22  9:23     ` Julien Grall
2016-07-27  0:57   ` Stefano Stabellini
2016-07-27 10:00     ` Julien Grall
2016-07-27 17:19   ` Julien Grall
2016-07-20 16:10 ` [PATCH 17/22] xen/arm: p2m: Don't need to restore the state for an idle vCPU Julien Grall
2016-07-22  7:37   ` Sergej Proskurin
2016-07-27  1:05   ` Stefano Stabellini
2016-07-20 16:11 ` [PATCH 18/22] xen/arm: p2m: Rework the context switch to another VTTBR in flush_tlb_domain Julien Grall
2016-07-22  7:51   ` Sergej Proskurin
2016-07-27  1:12   ` Stefano Stabellini
2016-07-27 10:22     ` Julien Grall [this message]
2016-07-20 16:11 ` [PATCH 19/22] xen/arm: p2m: Inline p2m_load_VTTBR into p2m_restore_state Julien Grall
2016-07-22  8:07   ` Sergej Proskurin
2016-07-22  9:29     ` Julien Grall
2016-07-27  1:13   ` Stefano Stabellini
2016-07-20 16:11 ` [PATCH 20/22] xen/arm: Don't export flush_tlb_domain Julien Grall
2016-07-22  8:54   ` Sergej Proskurin
2016-07-22  9:30     ` Julien Grall
2016-07-22 10:25       ` Sergej Proskurin
2016-07-22 10:34         ` Julien Grall
2016-07-22 10:46           ` Sergej Proskurin
2016-07-22 10:57             ` Julien Grall
2016-07-22 11:22               ` Sergej Proskurin
2016-07-27  1:14   ` Stefano Stabellini
2016-07-20 16:11 ` [PATCH 21/22] xen/arm: p2m: Replace flush_tlb_domain by p2m_flush_tlb Julien Grall
2016-07-27  1:15   ` Stefano Stabellini
2016-07-20 16:11 ` [PATCH 22/22] xen/arm: p2m: Pass the p2m in parameter rather the domain when it is possible Julien Grall
2016-07-27  1:15   ` Stefano Stabellini
2016-07-22  1:31 ` [PATCH 00/22] xen/arm: P2M clean-up and fixes Stefano Stabellini

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=4d38135a-4567-34e0-3580-b951f6f695a6@arm.com \
    --to=julien.grall@arm.com \
    --cc=proskurin@sec.in.tum.de \
    --cc=sstabellini@kernel.org \
    --cc=steve.capper@arm.com \
    --cc=wei.chen@linaro.org \
    --cc=xen-devel@lists.xen.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).