All of lore.kernel.org
 help / color / mirror / Atom feed
From: Raghavendra K T <raghavendra.kt@linux.vnet.ibm.com>
To: Gleb Natapov <gleb@redhat.com>
Cc: mingo@redhat.com, jeremy@goop.org, x86@kernel.org,
	konrad.wilk@oracle.com, hpa@zytor.com, pbonzini@redhat.com,
	linux-doc@vger.kernel.org, habanero@linux.vnet.ibm.com,
	xen-devel@lists.xensource.com, peterz@infradead.org,
	mtosatti@redhat.com, stefano.stabellini@eu.citrix.com,
	andi@firstfloor.org, attilio.rao@citrix.com, ouyang@cs.pitt.edu,
	gregkh@suse.de, agraf@suse.de, chegu_vinod@hp.com,
	torvalds@linux-foundation.org, avi.kivity@gmail.com,
	tglx@linutronix.de, kvm@vger.kernel.org,
	linux-kernel@vger.kernel.org, riel@redhat.com,
	drjones@redhat.com, virtualization@lists.linux-foundation.org,
	srivatsa.vaddagiri@gmail.com
Subject: Re: [PATCH RFC V11 15/18] kvm : Paravirtual ticketlocks support for linux guests running on KVM hypervisor
Date: Thu, 25 Jul 2013 14:47:37 +0530	[thread overview]
Message-ID: <51F0ED31.3040200@linux.vnet.ibm.com> (raw)
In-Reply-To: <51EFCA42.5020009@linux.vnet.ibm.com>

On 07/24/2013 06:06 PM, Raghavendra K T wrote:
> On 07/24/2013 05:36 PM, Gleb Natapov wrote:
>> On Wed, Jul 24, 2013 at 05:30:20PM +0530, Raghavendra K T wrote:
>>> On 07/24/2013 04:09 PM, Gleb Natapov wrote:
>>>> On Wed, Jul 24, 2013 at 03:15:50PM +0530, Raghavendra K T wrote:
>>>>> On 07/23/2013 08:37 PM, Gleb Natapov wrote:
>>>>>> On Mon, Jul 22, 2013 at 11:50:16AM +0530, Raghavendra K T wrote:
>>>>>>> +static void kvm_lock_spinning(struct arch_spinlock *lock,
>>>>>>> __ticket_t want)
>>>>> [...]
>>>>>>> +
>>>>>>> +    /*
>>>>>>> +     * halt until it's our turn and kicked. Note that we do safe
>>>>>>> halt
>>>>>>> +     * for irq enabled case to avoid hang when lock info is
>>>>>>> overwritten
>>>>>>> +     * in irq spinlock slowpath and no spurious interrupt occur
>>>>>>> to save us.
>>>>>>> +     */
>>>>>>> +    if (arch_irqs_disabled_flags(flags))
>>>>>>> +        halt();
>>>>>>> +    else
>>>>>>> +        safe_halt();
>>>>>>> +
>>>>>>> +out:
>>>>>> So here now interrupts can be either disabled or enabled. Previous
>>>>>> version disabled interrupts here, so are we sure it is safe to
>>>>>> have them
>>>>>> enabled at this point? I do not see any problem yet, will keep
>>>>>> thinking.
>>>>>
>>>>> If we enable interrupt here, then
>>>>>
>>>>>
>>>>>>> +    cpumask_clear_cpu(cpu, &waiting_cpus);
>>>>>
>>>>> and if we start serving lock for an interrupt that came here,
>>>>> cpumask clear and w->lock=null may not happen atomically.
>>>>> if irq spinlock does not take slow path we would have non null value
>>>>> for lock, but with no information in waitingcpu.
>>>>>
>>>>> I am still thinking what would be problem with that.
>>>>>
>>>> Exactly, for kicker waiting_cpus and w->lock updates are
>>>> non atomic anyway.
>>>>
>>>>>>> +    w->lock = NULL;
>>>>>>> +    local_irq_restore(flags);
>>>>>>> +    spin_time_accum_blocked(start);
>>>>>>> +}
>>>>>>> +PV_CALLEE_SAVE_REGS_THUNK(kvm_lock_spinning);
>>>>>>> +
>>>>>>> +/* Kick vcpu waiting on @lock->head to reach value @ticket */
>>>>>>> +static void kvm_unlock_kick(struct arch_spinlock *lock,
>>>>>>> __ticket_t ticket)
>>>>>>> +{
>>>>>>> +    int cpu;
>>>>>>> +
>>>>>>> +    add_stats(RELEASED_SLOW, 1);
>>>>>>> +    for_each_cpu(cpu, &waiting_cpus) {
>>>>>>> +        const struct kvm_lock_waiting *w =
>>>>>>> &per_cpu(lock_waiting, cpu);
>>>>>>> +        if (ACCESS_ONCE(w->lock) == lock &&
>>>>>>> +            ACCESS_ONCE(w->want) == ticket) {
>>>>>>> +            add_stats(RELEASED_SLOW_KICKED, 1);
>>>>>>> +            kvm_kick_cpu(cpu);
>>>>>> What about using NMI to wake sleepers? I think it was discussed, but
>>>>>> forgot why it was dismissed.
>>>>>
>>>>> I think I have missed that discussion. 'll go back and check. so
>>>>> what is the idea here? we can easily wake up the halted vcpus that
>>>>> have interrupt disabled?
>>>> We can of course. IIRC the objection was that NMI handling path is very
>>>> fragile and handling NMI on each wakeup will be more expensive then
>>>> waking up a guest without injecting an event, but it is still
>>>> interesting
>>>> to see the numbers.
>>>>
>>>
>>> Haam, now I remember, We had tried request based mechanism. (new
>>> request like REQ_UNHALT) and process that. It had worked, but had some
>>> complex hacks in vcpu_enter_guest to avoid guest hang in case of
>>> request cleared.  So had left it there..
>>>
>>> https://lkml.org/lkml/2012/4/30/67
>>>
>>> But I do not remember performance impact though.
>> No, this is something different. Wakeup with NMI does not need KVM
>> changes at
>> all. Instead of kvm_kick_cpu(cpu) in kvm_unlock_kick you send NMI IPI.
>>
>
> True. It was not NMI.
> just to confirm, are you talking about something like this to be tried ?
>
> apic->send_IPI_mask(cpumask_of(cpu), APIC_DM_NMI);

When I started benchmark, I started seeing
"Dazed and confused, but trying to continue" from unknown nmi error
handling.
Did I miss anything (because we did not register any NMI handler)? or
is it that spurious NMIs are trouble because we could get spurious NMIs
if next waiter already acquired the lock.

(note: I tried sending APIC_DM_REMRD IPI directly, which worked fine
but hypercall way of handling still performed well from the results I
saw).






WARNING: multiple messages have this Message-ID (diff)
From: Raghavendra K T <raghavendra.kt@linux.vnet.ibm.com>
To: Gleb Natapov <gleb@redhat.com>
Cc: jeremy@goop.org, gregkh@suse.de, kvm@vger.kernel.org,
	linux-doc@vger.kernel.org, peterz@infradead.org,
	drjones@redhat.com, virtualization@lists.linux-foundation.org,
	andi@firstfloor.org, hpa@zytor.com,
	stefano.stabellini@eu.citrix.com, xen-devel@lists.xensource.com,
	x86@kernel.org, mingo@redhat.com, habanero@linux.vnet.ibm.com,
	riel@redhat.com, konrad.wilk@oracle.com, ouyang@cs.pitt.edu,
	avi.kivity@gmail.com, tglx@linutronix.de, chegu_vinod@hp.com,
	linux-kernel@vger.kernel.org, srivatsa.vaddagiri@gmail.com,
	attilio.rao@citrix.com, pbonzini@redhat.com,
	torvalds@linux-foundation.org
Subject: Re: [PATCH RFC V11 15/18] kvm : Paravirtual ticketlocks support for linux guests running on KVM hypervisor
Date: Thu, 25 Jul 2013 14:47:37 +0530	[thread overview]
Message-ID: <51F0ED31.3040200@linux.vnet.ibm.com> (raw)
In-Reply-To: <51EFCA42.5020009@linux.vnet.ibm.com>

On 07/24/2013 06:06 PM, Raghavendra K T wrote:
> On 07/24/2013 05:36 PM, Gleb Natapov wrote:
>> On Wed, Jul 24, 2013 at 05:30:20PM +0530, Raghavendra K T wrote:
>>> On 07/24/2013 04:09 PM, Gleb Natapov wrote:
>>>> On Wed, Jul 24, 2013 at 03:15:50PM +0530, Raghavendra K T wrote:
>>>>> On 07/23/2013 08:37 PM, Gleb Natapov wrote:
>>>>>> On Mon, Jul 22, 2013 at 11:50:16AM +0530, Raghavendra K T wrote:
>>>>>>> +static void kvm_lock_spinning(struct arch_spinlock *lock,
>>>>>>> __ticket_t want)
>>>>> [...]
>>>>>>> +
>>>>>>> +    /*
>>>>>>> +     * halt until it's our turn and kicked. Note that we do safe
>>>>>>> halt
>>>>>>> +     * for irq enabled case to avoid hang when lock info is
>>>>>>> overwritten
>>>>>>> +     * in irq spinlock slowpath and no spurious interrupt occur
>>>>>>> to save us.
>>>>>>> +     */
>>>>>>> +    if (arch_irqs_disabled_flags(flags))
>>>>>>> +        halt();
>>>>>>> +    else
>>>>>>> +        safe_halt();
>>>>>>> +
>>>>>>> +out:
>>>>>> So here now interrupts can be either disabled or enabled. Previous
>>>>>> version disabled interrupts here, so are we sure it is safe to
>>>>>> have them
>>>>>> enabled at this point? I do not see any problem yet, will keep
>>>>>> thinking.
>>>>>
>>>>> If we enable interrupt here, then
>>>>>
>>>>>
>>>>>>> +    cpumask_clear_cpu(cpu, &waiting_cpus);
>>>>>
>>>>> and if we start serving lock for an interrupt that came here,
>>>>> cpumask clear and w->lock=null may not happen atomically.
>>>>> if irq spinlock does not take slow path we would have non null value
>>>>> for lock, but with no information in waitingcpu.
>>>>>
>>>>> I am still thinking what would be problem with that.
>>>>>
>>>> Exactly, for kicker waiting_cpus and w->lock updates are
>>>> non atomic anyway.
>>>>
>>>>>>> +    w->lock = NULL;
>>>>>>> +    local_irq_restore(flags);
>>>>>>> +    spin_time_accum_blocked(start);
>>>>>>> +}
>>>>>>> +PV_CALLEE_SAVE_REGS_THUNK(kvm_lock_spinning);
>>>>>>> +
>>>>>>> +/* Kick vcpu waiting on @lock->head to reach value @ticket */
>>>>>>> +static void kvm_unlock_kick(struct arch_spinlock *lock,
>>>>>>> __ticket_t ticket)
>>>>>>> +{
>>>>>>> +    int cpu;
>>>>>>> +
>>>>>>> +    add_stats(RELEASED_SLOW, 1);
>>>>>>> +    for_each_cpu(cpu, &waiting_cpus) {
>>>>>>> +        const struct kvm_lock_waiting *w =
>>>>>>> &per_cpu(lock_waiting, cpu);
>>>>>>> +        if (ACCESS_ONCE(w->lock) == lock &&
>>>>>>> +            ACCESS_ONCE(w->want) == ticket) {
>>>>>>> +            add_stats(RELEASED_SLOW_KICKED, 1);
>>>>>>> +            kvm_kick_cpu(cpu);
>>>>>> What about using NMI to wake sleepers? I think it was discussed, but
>>>>>> forgot why it was dismissed.
>>>>>
>>>>> I think I have missed that discussion. 'll go back and check. so
>>>>> what is the idea here? we can easily wake up the halted vcpus that
>>>>> have interrupt disabled?
>>>> We can of course. IIRC the objection was that NMI handling path is very
>>>> fragile and handling NMI on each wakeup will be more expensive then
>>>> waking up a guest without injecting an event, but it is still
>>>> interesting
>>>> to see the numbers.
>>>>
>>>
>>> Haam, now I remember, We had tried request based mechanism. (new
>>> request like REQ_UNHALT) and process that. It had worked, but had some
>>> complex hacks in vcpu_enter_guest to avoid guest hang in case of
>>> request cleared.  So had left it there..
>>>
>>> https://lkml.org/lkml/2012/4/30/67
>>>
>>> But I do not remember performance impact though.
>> No, this is something different. Wakeup with NMI does not need KVM
>> changes at
>> all. Instead of kvm_kick_cpu(cpu) in kvm_unlock_kick you send NMI IPI.
>>
>
> True. It was not NMI.
> just to confirm, are you talking about something like this to be tried ?
>
> apic->send_IPI_mask(cpumask_of(cpu), APIC_DM_NMI);

When I started benchmark, I started seeing
"Dazed and confused, but trying to continue" from unknown nmi error
handling.
Did I miss anything (because we did not register any NMI handler)? or
is it that spurious NMIs are trouble because we could get spurious NMIs
if next waiter already acquired the lock.

(note: I tried sending APIC_DM_REMRD IPI directly, which worked fine
but hypercall way of handling still performed well from the results I
saw).

  reply	other threads:[~2013-07-25  9:10 UTC|newest]

Thread overview: 120+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2013-07-22  6:16 [PATCH RFC V11 0/18] Paravirtualized ticket spinlocks Raghavendra K T
2013-07-22  6:16 ` Raghavendra K T
2013-07-22  6:16 ` [PATCH RFC V11 1/18] x86/spinlock: Replace pv spinlocks with pv ticketlocks Raghavendra K T
2013-07-22  6:16   ` Raghavendra K T
2013-07-22  6:16   ` Raghavendra K T
2013-07-22  6:17 ` [PATCH RFC V11 2/18] x86/ticketlock: Don't inline _spin_unlock when using paravirt spinlocks Raghavendra K T
2013-07-22  6:17   ` Raghavendra K T
2013-07-22  6:17 ` Raghavendra K T
2013-07-22  6:17 ` [PATCH RFC V11 3/18] x86/ticketlock: Collapse a layer of functions Raghavendra K T
2013-07-22  6:17   ` Raghavendra K T
2013-07-22  6:17   ` Raghavendra K T
2013-07-22  6:17 ` [PATCH RFC V11 4/18] xen: Defer spinlock setup until boot CPU setup Raghavendra K T
2013-07-22  6:17   ` Raghavendra K T
2013-07-22  6:17   ` Raghavendra K T
2013-07-22  6:17 ` [PATCH RFC V11 5/18] xen/pvticketlock: Xen implementation for PV ticket locks Raghavendra K T
2013-07-22  6:17   ` Raghavendra K T
2013-07-22  6:17   ` Raghavendra K T
2013-07-22  6:17 ` [PATCH RFC V11 6/18] xen/pvticketlocks: Add xen_nopvspin parameter to disable xen pv ticketlocks Raghavendra K T
2013-07-22  6:17   ` Raghavendra K T
2013-07-22  6:17   ` Raghavendra K T
2013-07-22  6:18 ` [PATCH RFC V11 7/18] x86/pvticketlock: Use callee-save for lock_spinning Raghavendra K T
2013-07-22  6:18   ` Raghavendra K T
2013-07-22  6:18   ` Raghavendra K T
2013-07-22  6:18 ` [PATCH RFC V11 8/18] x86/pvticketlock: When paravirtualizing ticket locks, increment by 2 Raghavendra K T
2013-07-22  6:18   ` Raghavendra K T
2013-07-22  6:18   ` Raghavendra K T
2013-07-22  6:18 ` [PATCH RFC V11 9/18] jump_label: Split out rate limiting from jump_label.h Raghavendra K T
2013-07-22  6:18   ` Raghavendra K T
2013-07-22  6:18   ` Raghavendra K T
2013-07-22  6:18 ` [PATCH RFC V11 10/18] x86/ticketlock: Add slowpath logic Raghavendra K T
2013-07-22  6:18   ` Raghavendra K T
2013-07-22  6:18   ` Raghavendra K T
2013-07-22  6:19 ` [PATCH RFC V11 11/18] xen/pvticketlock: Allow interrupts to be enabled while blocking Raghavendra K T
2013-07-22  6:19 ` Raghavendra K T
2013-07-22  6:19   ` Raghavendra K T
2013-07-22  6:19 ` [PATCH RFC V11 12/18] kvm hypervisor : Add a hypercall to KVM hypervisor to support pv-ticketlocks Raghavendra K T
2013-07-22  6:19   ` Raghavendra K T
2013-07-22  6:19   ` Raghavendra K T
2013-07-22  6:19 ` [PATCH RFC V11 13/18] kvm : Fold pv_unhalt flag into GET_MP_STATE ioctl to aid migration Raghavendra K T
2013-07-22  6:19   ` Raghavendra K T
2013-07-22  6:19   ` Raghavendra K T
2013-07-22  6:20 ` [PATCH RFC V11 14/18] kvm guest : Add configuration support to enable debug information for KVM Guests Raghavendra K T
2013-07-22  6:20   ` Raghavendra K T
2013-07-22  6:20   ` Raghavendra K T
2013-07-22  6:20 ` [PATCH RFC V11 15/18] kvm : Paravirtual ticketlocks support for linux guests running on KVM hypervisor Raghavendra K T
2013-07-22  6:20   ` Raghavendra K T
2013-07-22  6:20   ` Raghavendra K T
2013-07-23 15:07   ` Gleb Natapov
2013-07-23 15:07     ` Gleb Natapov
2013-07-24  9:24     ` [PATCH RESEND " Raghavendra K T
2013-07-24  9:24       ` Raghavendra K T
2013-07-24  9:45     ` [PATCH " Raghavendra K T
2013-07-24  9:45       ` Raghavendra K T
2013-07-24 10:39       ` Gleb Natapov
2013-07-24 10:39         ` Gleb Natapov
2013-07-24 12:00         ` Raghavendra K T
2013-07-24 12:00         ` Raghavendra K T
2013-07-24 12:06           ` Gleb Natapov
2013-07-24 12:06           ` Gleb Natapov
2013-07-24 12:36             ` Raghavendra K T
2013-07-24 12:36             ` Raghavendra K T
2013-07-25  9:17               ` Raghavendra K T [this message]
2013-07-25  9:17                 ` Raghavendra K T
2013-07-25  9:15                 ` Gleb Natapov
2013-07-25  9:15                   ` Gleb Natapov
2013-07-25  9:38                   ` Raghavendra K T
2013-07-25  9:38                     ` Raghavendra K T
2013-07-30 16:43                     ` Raghavendra K T
2013-07-30 16:43                       ` Raghavendra K T
2013-07-31  6:24                       ` Gleb Natapov
2013-07-31  6:24                         ` Gleb Natapov
2013-08-01  7:38                         ` Raghavendra K T
2013-08-01  7:38                           ` Raghavendra K T
2013-08-01  7:45                           ` Gleb Natapov
2013-08-01  7:45                             ` Gleb Natapov
2013-08-01  9:04                             ` Raghavendra K T
2013-08-02  3:22                               ` Raghavendra K T
2013-08-02  3:22                                 ` Raghavendra K T
2013-08-02  9:23                                 ` Ingo Molnar
2013-08-02  9:23                                   ` Ingo Molnar
2013-08-02  9:44                                   ` Raghavendra K T
2013-08-02  9:44                                     ` Raghavendra K T
2013-08-01  9:04                             ` Raghavendra K T
2013-08-02  9:25                           ` Ingo Molnar
2013-08-02  9:25                             ` Ingo Molnar
2013-08-02  9:54                             ` Gleb Natapov
2013-08-02  9:54                               ` Gleb Natapov
2013-08-02 10:57                               ` Raghavendra K T
2013-08-02 10:57                                 ` Raghavendra K T
2013-08-05  9:46                               ` Ingo Molnar
2013-08-05  9:46                                 ` Ingo Molnar
2013-08-05 10:42                                 ` Raghavendra K T
2013-08-05 10:42                                   ` Raghavendra K T
     [not found]                                 ` <20130805095901.GL2258@redhat.com>
2013-08-05 13:52                                   ` Ingo Molnar
2013-08-05 14:05                                     ` Paolo Bonzini
2013-08-05 14:05                                       ` Paolo Bonzini
2013-08-05 14:39                                       ` Raghavendra K T
2013-08-05 14:45                                         ` Paolo Bonzini
2013-08-05 14:45                                         ` Paolo Bonzini
2013-08-05 14:39                                       ` Raghavendra K T
2013-08-05 13:52                                   ` Ingo Molnar
2013-08-05 15:37                                 ` Konrad Rzeszutek Wilk
2013-08-05 15:37                                   ` Konrad Rzeszutek Wilk
2013-07-22  6:20 ` [PATCH RFC V11 16/18] kvm hypervisor : Simplify kvm_for_each_vcpu with kvm_irq_delivery_to_apic Raghavendra K T
2013-07-22  6:20   ` Raghavendra K T
2013-07-22  6:20   ` Raghavendra K T
2013-07-22  6:20 ` [PATCH RFC V11 17/18] Documentation/kvm : Add documentation on Hypercalls and features used for PV spinlock Raghavendra K T
2013-07-22  6:20   ` Raghavendra K T
2013-07-22  6:20   ` Raghavendra K T
2013-07-22  6:20 ` [PATCH RFC V11 18/18] kvm hypervisor: Add directed yield in vcpu block path Raghavendra K T
2013-07-22  6:20   ` Raghavendra K T
2013-07-22  6:20   ` Raghavendra K T
2013-07-22 19:36 ` [PATCH RFC V11 0/18] Paravirtualized ticket spinlocks Konrad Rzeszutek Wilk
2013-07-22 19:36   ` Konrad Rzeszutek Wilk
2013-07-23  2:50   ` Raghavendra K T
2013-07-23  2:50     ` Raghavendra K T
2013-08-05 22:50 ` H. Peter Anvin
2013-08-05 22:50   ` H. Peter Anvin
2013-08-06  2:50   ` Raghavendra K T
2013-08-06  2:50     ` Raghavendra K T

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=51F0ED31.3040200@linux.vnet.ibm.com \
    --to=raghavendra.kt@linux.vnet.ibm.com \
    --cc=agraf@suse.de \
    --cc=andi@firstfloor.org \
    --cc=attilio.rao@citrix.com \
    --cc=avi.kivity@gmail.com \
    --cc=chegu_vinod@hp.com \
    --cc=drjones@redhat.com \
    --cc=gleb@redhat.com \
    --cc=gregkh@suse.de \
    --cc=habanero@linux.vnet.ibm.com \
    --cc=hpa@zytor.com \
    --cc=jeremy@goop.org \
    --cc=konrad.wilk@oracle.com \
    --cc=kvm@vger.kernel.org \
    --cc=linux-doc@vger.kernel.org \
    --cc=linux-kernel@vger.kernel.org \
    --cc=mingo@redhat.com \
    --cc=mtosatti@redhat.com \
    --cc=ouyang@cs.pitt.edu \
    --cc=pbonzini@redhat.com \
    --cc=peterz@infradead.org \
    --cc=riel@redhat.com \
    --cc=srivatsa.vaddagiri@gmail.com \
    --cc=stefano.stabellini@eu.citrix.com \
    --cc=tglx@linutronix.de \
    --cc=torvalds@linux-foundation.org \
    --cc=virtualization@lists.linux-foundation.org \
    --cc=x86@kernel.org \
    --cc=xen-devel@lists.xensource.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.