All of lore.kernel.org
 help / color / mirror / Atom feed
From: Paolo Bonzini <pbonzini@redhat.com>
To: Marcelo Tosatti <mtosatti@redhat.com>
Cc: Eduardo Habkost <ehabkost@redhat.com>,
	kvm@vger.kernel.org, Radim Krcmar <rkrcmar@redhat.com>,
	Juan Quintela <quintela@redhat.com>,
	qemu-devel@nongnu.org,
	"Dr. David Alan Gilbert" <dgilbert@redhat.com>
Subject: Re: [qemu patch 2/2] kvmclock: reduce kvmclock difference on migration
Date: Thu, 17 Nov 2016 14:03:36 +0100	[thread overview]
Message-ID: <07f60f05-4e4a-7217-23dc-d687423736b3@redhat.com> (raw)
In-Reply-To: <20161117121637.GA13404@amt.cnet>



On 17/11/2016 13:16, Marcelo Tosatti wrote:
> On Mon, Nov 14, 2016 at 04:15:18PM -0200, Marcelo Tosatti wrote:
>> On Mon, Nov 14, 2016 at 06:20:29PM +0100, Paolo Bonzini wrote:
>>>
>>>
>>> On 14/11/2016 18:13, Marcelo Tosatti wrote:
>>>> On Mon, Nov 14, 2016 at 05:43:33PM +0100, Paolo Bonzini wrote:
>>>>>
>>>>>
>>>>> On 14/11/2016 16:40, Marcelo Tosatti wrote:
>>>>>> static bool kvmclock_src_use_reliable_get_clock(void *opaque)
>>>>>> {
>>>>>>     KVMClockState *s = opaque;
>>>>>>
>>>>>>     /*
>>>>>>      * On machine types that support reliable KVM_GET_CLOCK,
>>>>>>      * if host kernel does provide reliable KVM_GET_CLOCK,
>>>>>>      * set src_use_reliable_get_clock=true so that destination
>>>>>>      * avoids reading kvmclock from memory.
>>>>>>      */
>>>>>>     if (s->mach_use_reliable_get_clock && kvm_has_adjust_clock_stable())
>>>>>>     {
>>>>>>         s->src_use_reliable_get_clock = true;
>>>>>>     }
>>>>>>
>>>>>>     return s->mach_use_reliable_get_clock;
>>>>>> }
>>>>>>
>>>>>>
>>>>>> Ah, OK, done.
>>>>>
>>>>> s->src_use_reliable_get_clock should not be set with
>>>>> KVM_CHECK_EXTENSION, but rather from the flags returned by KVM_GET_CLOCK.
>>>>
>>>> Well, thats not right: What matters is the presence of get_kvmclock_ns 
>>>> which returns a value that the guest sees. 
>>>>
>>>>                 get_kernel_monotonic_clock() + kvmclock_offset +
>>>>                 (rdtsc() - tsc_timestamp)
>>>>
>>>> IOW what the guest sees. And you changed that in 
>>>>
>>>> commit 108b249c453dd7132599ab6dc7e435a7036c193f
>>>> Author: Paolo Bonzini <pbonzini@redhat.com>
>>>> Date:   Thu Sep 1 14:21:03 2016 +0200
>>>>
>>>>     KVM: x86: introduce get_kvmclock_ns
>>>>
>>>> And the correct behaviour (once KVM_GET_CLOCK is fixed per 
>>>> previous message to return rdtsc - tsc_timestamp for the 
>>>> non masterclock case) depends on this commit above, 
>>>> not on masterclock.
>>>
>>> This commit in turn only gets the correct behavior if 
>>> "vcpu->arch.hv_clock.flags & PVCLOCK_TSC_STABLE_BIT" (and it will be 
>>> changed soon to ka->use_masterclock).  KVM_CHECK_EXTENSION can still 
>>> return KVM_CLOCK_TSC_STABLE even if the masterclock is disabled, 
>>> because KVM_CHECK_EXTENSION only tells you which flags are known for
>>> this version of the KVM module.
>>
>> What QEMU wants is to use KVM_GET_CLOCK at pre_save independently
>> of whether masterclock is enabled or not... it just depends
>> on KVM_GET_CLOCK being correct for the masterclock case
>> (108b249c453dd7132599ab6dc7e435a7036c193f).
>>
>> So a "reliable KVM_GET_CLOCK" (that does not timebackward
>> when masterclock is enabled) is much simpler to userspace
>> than "whether masterclock is enabled or not".
>>
>> If you have a reason why that should not be the case,
>> let me know.
>>
>>> To see if the masterclock is enabled _now_, you need to check what
>>> KVM_GET_CLOCK sets in the flags.  From the KVM_CLOCK_TSC_STABLE patch:
>>>
>>> 		user_ns.flags = kvm->arch.use_master_clock ? KVM_CLOCK_TSC_STABLE : 0;
>>
>> Again, whether masterclock is enable is independent of 
>> being able to use KVM_GET_CLOCK at pre_save.
> 
> Is this point OK ?
> 
> Using 
> 
>                 break;
> +   case KVM_CAP_ADJUST_CLOCK:
> +               r = KVM_CLOCK_TSC_STABLE;
> +               break;
> 
> To infer whether KVM_GET_CLOCK is fixed for the monotonic case.

Yes, I still haven't digested why it is correct (I need to read again
what you wrote), but it is indeed correct to use KVM_CAP_ADJUST_CLOCK
this way.

Paolo

WARNING: multiple messages have this Message-ID (diff)
From: Paolo Bonzini <pbonzini@redhat.com>
To: Marcelo Tosatti <mtosatti@redhat.com>
Cc: Eduardo Habkost <ehabkost@redhat.com>,
	kvm@vger.kernel.org, Juan Quintela <quintela@redhat.com>,
	Radim Krcmar <rkrcmar@redhat.com>,
	qemu-devel@nongnu.org,
	"Dr. David Alan Gilbert" <dgilbert@redhat.com>
Subject: Re: [Qemu-devel] [qemu patch 2/2] kvmclock: reduce kvmclock difference on migration
Date: Thu, 17 Nov 2016 14:03:36 +0100	[thread overview]
Message-ID: <07f60f05-4e4a-7217-23dc-d687423736b3@redhat.com> (raw)
In-Reply-To: <20161117121637.GA13404@amt.cnet>



On 17/11/2016 13:16, Marcelo Tosatti wrote:
> On Mon, Nov 14, 2016 at 04:15:18PM -0200, Marcelo Tosatti wrote:
>> On Mon, Nov 14, 2016 at 06:20:29PM +0100, Paolo Bonzini wrote:
>>>
>>>
>>> On 14/11/2016 18:13, Marcelo Tosatti wrote:
>>>> On Mon, Nov 14, 2016 at 05:43:33PM +0100, Paolo Bonzini wrote:
>>>>>
>>>>>
>>>>> On 14/11/2016 16:40, Marcelo Tosatti wrote:
>>>>>> static bool kvmclock_src_use_reliable_get_clock(void *opaque)
>>>>>> {
>>>>>>     KVMClockState *s = opaque;
>>>>>>
>>>>>>     /*
>>>>>>      * On machine types that support reliable KVM_GET_CLOCK,
>>>>>>      * if host kernel does provide reliable KVM_GET_CLOCK,
>>>>>>      * set src_use_reliable_get_clock=true so that destination
>>>>>>      * avoids reading kvmclock from memory.
>>>>>>      */
>>>>>>     if (s->mach_use_reliable_get_clock && kvm_has_adjust_clock_stable())
>>>>>>     {
>>>>>>         s->src_use_reliable_get_clock = true;
>>>>>>     }
>>>>>>
>>>>>>     return s->mach_use_reliable_get_clock;
>>>>>> }
>>>>>>
>>>>>>
>>>>>> Ah, OK, done.
>>>>>
>>>>> s->src_use_reliable_get_clock should not be set with
>>>>> KVM_CHECK_EXTENSION, but rather from the flags returned by KVM_GET_CLOCK.
>>>>
>>>> Well, thats not right: What matters is the presence of get_kvmclock_ns 
>>>> which returns a value that the guest sees. 
>>>>
>>>>                 get_kernel_monotonic_clock() + kvmclock_offset +
>>>>                 (rdtsc() - tsc_timestamp)
>>>>
>>>> IOW what the guest sees. And you changed that in 
>>>>
>>>> commit 108b249c453dd7132599ab6dc7e435a7036c193f
>>>> Author: Paolo Bonzini <pbonzini@redhat.com>
>>>> Date:   Thu Sep 1 14:21:03 2016 +0200
>>>>
>>>>     KVM: x86: introduce get_kvmclock_ns
>>>>
>>>> And the correct behaviour (once KVM_GET_CLOCK is fixed per 
>>>> previous message to return rdtsc - tsc_timestamp for the 
>>>> non masterclock case) depends on this commit above, 
>>>> not on masterclock.
>>>
>>> This commit in turn only gets the correct behavior if 
>>> "vcpu->arch.hv_clock.flags & PVCLOCK_TSC_STABLE_BIT" (and it will be 
>>> changed soon to ka->use_masterclock).  KVM_CHECK_EXTENSION can still 
>>> return KVM_CLOCK_TSC_STABLE even if the masterclock is disabled, 
>>> because KVM_CHECK_EXTENSION only tells you which flags are known for
>>> this version of the KVM module.
>>
>> What QEMU wants is to use KVM_GET_CLOCK at pre_save independently
>> of whether masterclock is enabled or not... it just depends
>> on KVM_GET_CLOCK being correct for the masterclock case
>> (108b249c453dd7132599ab6dc7e435a7036c193f).
>>
>> So a "reliable KVM_GET_CLOCK" (that does not timebackward
>> when masterclock is enabled) is much simpler to userspace
>> than "whether masterclock is enabled or not".
>>
>> If you have a reason why that should not be the case,
>> let me know.
>>
>>> To see if the masterclock is enabled _now_, you need to check what
>>> KVM_GET_CLOCK sets in the flags.  From the KVM_CLOCK_TSC_STABLE patch:
>>>
>>> 		user_ns.flags = kvm->arch.use_master_clock ? KVM_CLOCK_TSC_STABLE : 0;
>>
>> Again, whether masterclock is enable is independent of 
>> being able to use KVM_GET_CLOCK at pre_save.
> 
> Is this point OK ?
> 
> Using 
> 
>                 break;
> +   case KVM_CAP_ADJUST_CLOCK:
> +               r = KVM_CLOCK_TSC_STABLE;
> +               break;
> 
> To infer whether KVM_GET_CLOCK is fixed for the monotonic case.

Yes, I still haven't digested why it is correct (I need to read again
what you wrote), but it is indeed correct to use KVM_CAP_ADJUST_CLOCK
this way.

Paolo

  reply	other threads:[~2016-11-17 13:03 UTC|newest]

Thread overview: 46+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2016-11-14 12:36 [qemu patch 0/2] improve kvmclock difference on migration Marcelo Tosatti
2016-11-14 12:36 ` [Qemu-devel] " Marcelo Tosatti
2016-11-14 12:36 ` [qemu patch 1/2] kvm: sync linux headers Marcelo Tosatti
2016-11-14 12:36   ` [Qemu-devel] " Marcelo Tosatti
2016-11-14 12:36 ` [qemu patch 2/2] kvmclock: reduce kvmclock difference on migration Marcelo Tosatti
2016-11-14 12:36   ` [Qemu-devel] " Marcelo Tosatti
2016-11-14 13:54   ` Paolo Bonzini
2016-11-14 13:54     ` [Qemu-devel] " Paolo Bonzini
2016-11-14 14:00     ` Marcelo Tosatti
2016-11-14 14:00       ` [Qemu-devel] " Marcelo Tosatti
2016-11-14 14:22       ` Paolo Bonzini
2016-11-14 14:22         ` [Qemu-devel] " Paolo Bonzini
2016-11-14 14:50         ` Marcelo Tosatti
2016-11-14 14:50           ` [Qemu-devel] " Marcelo Tosatti
2016-11-14 15:00           ` Paolo Bonzini
2016-11-14 15:00             ` [Qemu-devel] " Paolo Bonzini
2016-11-14 15:40             ` Marcelo Tosatti
2016-11-14 15:40               ` [Qemu-devel] " Marcelo Tosatti
2016-11-14 16:43               ` Paolo Bonzini
2016-11-14 16:43                 ` [Qemu-devel] " Paolo Bonzini
2016-11-14 17:13                 ` Marcelo Tosatti
2016-11-14 17:13                   ` [Qemu-devel] " Marcelo Tosatti
2016-11-14 17:20                   ` Paolo Bonzini
2016-11-14 17:20                     ` [Qemu-devel] " Paolo Bonzini
2016-11-14 18:15                     ` Marcelo Tosatti
2016-11-14 18:15                       ` [Qemu-devel] " Marcelo Tosatti
2016-11-17 12:16                       ` Marcelo Tosatti
2016-11-17 12:16                         ` [Qemu-devel] " Marcelo Tosatti
2016-11-17 13:03                         ` Paolo Bonzini [this message]
2016-11-17 13:03                           ` Paolo Bonzini
2016-11-28 13:47                         ` Paolo Bonzini
2016-11-28 13:47                           ` [Qemu-devel] " Paolo Bonzini
2016-11-28 14:28                           ` Eduardo Habkost
2016-11-28 14:28                             ` [Qemu-devel] " Eduardo Habkost
2016-11-28 15:12                             ` Paolo Bonzini
2016-11-28 15:12                               ` [Qemu-devel] " Paolo Bonzini
2016-11-28 16:36                           ` Marcelo Tosatti
2016-11-28 16:36                             ` [Qemu-devel] " Marcelo Tosatti
2016-11-28 17:30                             ` Paolo Bonzini
2016-11-28 17:30                               ` [Qemu-devel] " Paolo Bonzini
2016-11-14 14:11     ` Juan Quintela
2016-11-14 14:11       ` [Qemu-devel] " Juan Quintela
2016-11-14 14:09   ` Juan Quintela
2016-11-14 14:09     ` [Qemu-devel] " Juan Quintela
2016-11-14 15:37     ` Marcelo Tosatti
2016-11-14 15:37       ` [Qemu-devel] " Marcelo Tosatti

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=07f60f05-4e4a-7217-23dc-d687423736b3@redhat.com \
    --to=pbonzini@redhat.com \
    --cc=dgilbert@redhat.com \
    --cc=ehabkost@redhat.com \
    --cc=kvm@vger.kernel.org \
    --cc=mtosatti@redhat.com \
    --cc=qemu-devel@nongnu.org \
    --cc=quintela@redhat.com \
    --cc=rkrcmar@redhat.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.