All of lore.kernel.org
 help / color / mirror / Atom feed
* [PATCH v3] x86/vpmu: add cpu hot unplug notifier for vpmu
@ 2017-05-21 13:09 Luwei Kang
  2017-05-22 13:29 ` Jan Beulich
  0 siblings, 1 reply; 4+ messages in thread
From: Luwei Kang @ 2017-05-21 13:09 UTC (permalink / raw)
  To: xen-devel; +Cc: andrew.cooper3, boris.ostrovsky, Luwei Kang, jbeulich

Currently, Hot unplug a physical CPU with vpmu enabled may cause
system hang due to send a remote call to an offlined pCPU. This
patch add a cpu hot unplug notifer to save vpmu context before
cpu offline.

Consider one scenario, hot unplug pCPU N with vpmu enabled.
The vcpu which running on this pCPU will be switch to other
online cpu. A remote call will be send to pCPU N to save the
vpmu context before loading the vpmu context on this pCPU.
System will hang in function on_select_cpus() because of that pCPU
is offlined and can not do any respond.

Signed-off-by: Luwei Kang <luwei.kang@intel.com>
---
v3:
 1.add cpu_online() check in vpm_load() and vpmu_arch_destroy();
 2.add vpmu_ prefix. rename cpu_callback() to vpmu_cpu_callback();
v2:
 1.fix some typo and coding style;
 2.change "swith" to "if" in cpu_callback() because of there just have one case;
 3.add VPMU_CONTEX_LOADED check before send remote call in vpmu_arch_destroy();
---
 xen/arch/x86/cpu/vpmu.c | 52 +++++++++++++++++++++++++++++++++++++++++++------
 1 file changed, 46 insertions(+), 6 deletions(-)

diff --git a/xen/arch/x86/cpu/vpmu.c b/xen/arch/x86/cpu/vpmu.c
index 03401fd..486af12 100644
--- a/xen/arch/x86/cpu/vpmu.c
+++ b/xen/arch/x86/cpu/vpmu.c
@@ -21,6 +21,7 @@
 #include <xen/xenoprof.h>
 #include <xen/event.h>
 #include <xen/guest_access.h>
+#include <xen/cpu.h>
 #include <asm/regs.h>
 #include <asm/types.h>
 #include <asm/msr.h>
@@ -394,8 +395,11 @@ int vpmu_load(struct vcpu *v, bool_t from_guest)
     if ( !vpmu_is_set(vpmu, VPMU_CONTEXT_ALLOCATED) )
         return 0;
 
-    /* First time this VCPU is running here */
-    if ( vpmu->last_pcpu != pcpu )
+    /*
+     * The last pCPU is still online and this is the first time this vCPU
+     * running here.
+     */
+    if ( cpu_online(vpmu->last_pcpu) && vpmu->last_pcpu != pcpu )
     {
         /*
          * Get the context from last pcpu that we ran on. Note that if another
@@ -575,15 +579,21 @@ static void vpmu_arch_destroy(struct vcpu *v)
      * We will test it again in vpmu_clear_last() with interrupts
      * disabled to make sure we don't clear someone else.
      */
-    if ( per_cpu(last_vcpu, vpmu->last_pcpu) == v )
+    if ( cpu_online(vpmu->last_pcpu) &&
+            per_cpu(last_vcpu, vpmu->last_pcpu) == v )
         on_selected_cpus(cpumask_of(vpmu->last_pcpu),
                          vpmu_clear_last, v, 1);
 
     if ( vpmu->arch_vpmu_ops && vpmu->arch_vpmu_ops->arch_vpmu_destroy )
     {
-        /* Unload VPMU first. This will stop counters */
-        on_selected_cpus(cpumask_of(vcpu_vpmu(v)->last_pcpu),
-                         vpmu_save_force, v, 1);
+        /*
+         * Unload VPMU first if VPMU_CONTEXT_LOADED being set.
+         * This will stop counters.
+         */
+        if ( vpmu_is_set(vpmu, VPMU_CONTEXT_LOADED) )
+            on_selected_cpus(cpumask_of(vcpu_vpmu(v)->last_pcpu),
+                             vpmu_save_force, v, 1);
+
          vpmu->arch_vpmu_ops->arch_vpmu_destroy(v);
     }
 }
@@ -835,6 +845,33 @@ long do_xenpmu_op(unsigned int op, XEN_GUEST_HANDLE_PARAM(xen_pmu_params_t) arg)
     return ret;
 }
 
+static int vpmu_cpu_callback(
+    struct notifier_block *nfb, unsigned long action, void *hcpu)
+{
+    unsigned int cpu = (unsigned long)hcpu;
+    struct vcpu *vcpu = per_cpu(last_vcpu, cpu);
+    struct vpmu_struct *vpmu;
+
+    if ( !vcpu )
+        return NOTIFY_DONE;
+
+    vpmu = vcpu_vpmu(vcpu);
+    if ( !vpmu_is_set(vpmu, VPMU_CONTEXT_ALLOCATED) )
+        return NOTIFY_DONE;
+
+    if ( action == CPU_DYING )
+    {
+        vpmu_save_force(vcpu);
+        vpmu_reset(vpmu, VPMU_CONTEXT_LOADED);
+    }
+
+    return NOTIFY_DONE;
+}
+
+static struct notifier_block vpmu_cpu_nfb = {
+    .notifier_call = vpmu_cpu_callback
+};
+
 static int __init vpmu_init(void)
 {
     int vendor = current_cpu_data.x86_vendor;
@@ -872,8 +909,11 @@ static int __init vpmu_init(void)
     }
 
     if ( vpmu_mode != XENPMU_MODE_OFF )
+    {
+        register_cpu_notifier(&vpmu_cpu_nfb);
         printk(XENLOG_INFO "VPMU: version " __stringify(XENPMU_VER_MAJ) "."
                __stringify(XENPMU_VER_MIN) "\n");
+    }
     else
         opt_vpmu_enabled = 0;
 
-- 
1.8.3.1


_______________________________________________
Xen-devel mailing list
Xen-devel@lists.xen.org
https://lists.xen.org/xen-devel

^ permalink raw reply related	[flat|nested] 4+ messages in thread

* Re: [PATCH v3] x86/vpmu: add cpu hot unplug notifier for vpmu
  2017-05-21 13:09 [PATCH v3] x86/vpmu: add cpu hot unplug notifier for vpmu Luwei Kang
@ 2017-05-22 13:29 ` Jan Beulich
  2017-05-23  1:47   ` Kang, Luwei
  0 siblings, 1 reply; 4+ messages in thread
From: Jan Beulich @ 2017-05-22 13:29 UTC (permalink / raw)
  To: Luwei Kang; +Cc: andrew.cooper3, boris.ostrovsky, xen-devel

>>> On 21.05.17 at 15:09, <luwei.kang@intel.com> wrote:
> v3:
>  1.add cpu_online() check in vpm_load() and vpmu_arch_destroy();
>  2.add vpmu_ prefix. rename cpu_callback() to vpmu_cpu_callback();

I had specifically objected to the latter.

> @@ -394,8 +395,11 @@ int vpmu_load(struct vcpu *v, bool_t from_guest)
>      if ( !vpmu_is_set(vpmu, VPMU_CONTEXT_ALLOCATED) )
>          return 0;
>  
> -    /* First time this VCPU is running here */
> -    if ( vpmu->last_pcpu != pcpu )
> +    /*
> +     * The last pCPU is still online and this is the first time this vCPU
> +     * running here.
> +     */
> +    if ( cpu_online(vpmu->last_pcpu) && vpmu->last_pcpu != pcpu )

Adding a cpu_online() check here is unlikely to be helpful. Actually I
may have misguided you with prior comments (and if so, I'm sorry) -
the LOADED check following this one makes sure on_selected_cpus()
won't be called with an offline CPU here. IOW I think the code can
be left untouched, but the reason why should be spelled out in the
commit message (matching the reasoning why adding the LOADED
check to vpmu_arch_destroy() is sufficient for the second use of
last_pcpu there).

> @@ -575,15 +579,21 @@ static void vpmu_arch_destroy(struct vcpu *v)
>       * We will test it again in vpmu_clear_last() with interrupts
>       * disabled to make sure we don't clear someone else.
>       */
> -    if ( per_cpu(last_vcpu, vpmu->last_pcpu) == v )
> +    if ( cpu_online(vpmu->last_pcpu) &&
> +            per_cpu(last_vcpu, vpmu->last_pcpu) == v )

Indentation.

Jan


_______________________________________________
Xen-devel mailing list
Xen-devel@lists.xen.org
https://lists.xen.org/xen-devel

^ permalink raw reply	[flat|nested] 4+ messages in thread

* Re: [PATCH v3] x86/vpmu: add cpu hot unplug notifier for vpmu
  2017-05-22 13:29 ` Jan Beulich
@ 2017-05-23  1:47   ` Kang, Luwei
  2017-05-23  7:31     ` Jan Beulich
  0 siblings, 1 reply; 4+ messages in thread
From: Kang, Luwei @ 2017-05-23  1:47 UTC (permalink / raw)
  To: Jan Beulich; +Cc: andrew.cooper3, boris.ostrovsky, xen-devel

> >>> On 21.05.17 at 15:09, <luwei.kang@intel.com> wrote:
> > v3:
> >  1.add cpu_online() check in vpm_load() and vpmu_arch_destroy();
> > 2.add vpmu_ prefix. rename cpu_callback() to vpmu_cpu_callback();
> 
> I had specifically objected to the latter.

Sorry, will rollback it.

> 
> > @@ -394,8 +395,11 @@ int vpmu_load(struct vcpu *v, bool_t from_guest)
> >      if ( !vpmu_is_set(vpmu, VPMU_CONTEXT_ALLOCATED) )
> >          return 0;
> >
> > -    /* First time this VCPU is running here */
> > -    if ( vpmu->last_pcpu != pcpu )
> > +    /*
> > +     * The last pCPU is still online and this is the first time this vCPU
> > +     * running here.
> > +     */
> > +    if ( cpu_online(vpmu->last_pcpu) && vpmu->last_pcpu != pcpu )
> 
> Adding a cpu_online() check here is unlikely to be helpful. Actually I may have misguided you with prior comments (and if so, I'm
> sorry) - the LOADED check following this one makes sure on_selected_cpus() won't be called with an offline CPU here. IOW I think
> the code can be left untouched, but the reason why should be spelled out in the commit message (matching the reasoning why
> adding the LOADED check to vpmu_arch_destroy() is sufficient for the second use of last_pcpu there).
> 

So, remove cpu_online() check here, because of LOADED check can make sure don't send remote call to an offline cpu (cpu_callback() will
reset this flag).  
The cpu_online() check in vpmu_arch_destroy() should be reserved due to per_cpu(last_vcpu, vpmu->last_pcpu) has become an invalid value(Not NULL).
Is that right?

Thanks,
Luwei Kang

> > @@ -575,15 +579,21 @@ static void vpmu_arch_destroy(struct vcpu *v)
> >       * We will test it again in vpmu_clear_last() with interrupts
> >       * disabled to make sure we don't clear someone else.
> >       */
> > -    if ( per_cpu(last_vcpu, vpmu->last_pcpu) == v )
> > +    if ( cpu_online(vpmu->last_pcpu) &&
> > +            per_cpu(last_vcpu, vpmu->last_pcpu) == v )
> 
> Indentation.
> 
> Jan


_______________________________________________
Xen-devel mailing list
Xen-devel@lists.xen.org
https://lists.xen.org/xen-devel

^ permalink raw reply	[flat|nested] 4+ messages in thread

* Re: [PATCH v3] x86/vpmu: add cpu hot unplug notifier for vpmu
  2017-05-23  1:47   ` Kang, Luwei
@ 2017-05-23  7:31     ` Jan Beulich
  0 siblings, 0 replies; 4+ messages in thread
From: Jan Beulich @ 2017-05-23  7:31 UTC (permalink / raw)
  To: Luwei Kang; +Cc: andrew.cooper3, boris.ostrovsky, xen-devel

>>> On 23.05.17 at 03:47, <luwei.kang@intel.com> wrote:
>> >>> On 21.05.17 at 15:09, <luwei.kang@intel.com> wrote:
>> > @@ -394,8 +395,11 @@ int vpmu_load(struct vcpu *v, bool_t from_guest)
>> >      if ( !vpmu_is_set(vpmu, VPMU_CONTEXT_ALLOCATED) )
>> >          return 0;
>> >
>> > -    /* First time this VCPU is running here */
>> > -    if ( vpmu->last_pcpu != pcpu )
>> > +    /*
>> > +     * The last pCPU is still online and this is the first time this vCPU
>> > +     * running here.
>> > +     */
>> > +    if ( cpu_online(vpmu->last_pcpu) && vpmu->last_pcpu != pcpu )
>> 
>> Adding a cpu_online() check here is unlikely to be helpful. Actually I may 
> have misguided you with prior comments (and if so, I'm
>> sorry) - the LOADED check following this one makes sure on_selected_cpus() 
> won't be called with an offline CPU here. IOW I think
>> the code can be left untouched, but the reason why should be spelled out in 
> the commit message (matching the reasoning why
>> adding the LOADED check to vpmu_arch_destroy() is sufficient for the second 
> use of last_pcpu there).
>> 
> 
> So, remove cpu_online() check here, because of LOADED check can make sure 
> don't send remote call to an offline cpu (cpu_callback() will
> reset this flag).  
> The cpu_online() check in vpmu_arch_destroy() should be reserved due to 
> per_cpu(last_vcpu, vpmu->last_pcpu) has become an invalid value(Not NULL).
> Is that right?

Yes.

Jan


_______________________________________________
Xen-devel mailing list
Xen-devel@lists.xen.org
https://lists.xen.org/xen-devel

^ permalink raw reply	[flat|nested] 4+ messages in thread

end of thread, other threads:[~2017-05-23  7:31 UTC | newest]

Thread overview: 4+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2017-05-21 13:09 [PATCH v3] x86/vpmu: add cpu hot unplug notifier for vpmu Luwei Kang
2017-05-22 13:29 ` Jan Beulich
2017-05-23  1:47   ` Kang, Luwei
2017-05-23  7:31     ` Jan Beulich

This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.