From mboxrd@z Thu Jan  1 00:00:00 1970
From: Boris Ostrovsky <boris.ostrovsky@oracle.com>
Subject: Re: [PATCH v10 11/20] x86/VPMU: Interface for setting
 PMU mode and flags
Date: Fri, 12 Sep 2014 11:03:15 -0400
Message-ID: <54130B33.3020907@oracle.com>
References: <1409802080-6160-1-git-send-email-boris.ostrovsky@oracle.com>
	<1409802080-6160-12-git-send-email-boris.ostrovsky@oracle.com>
	<541084C8020000780003366A@mail.emea.novell.com>
	<54108C50.7030500@oracle.com>
	<541160E90200007800033A7B@mail.emea.novell.com>
	<5411ADDD.2040003@oracle.com>
	<5411D4DF0200007800034005@mail.emea.novell.com>
	<5411C963.2000002@oracle.com>
	<5412B383020000780003449D@mail.emea.novell.com>
	<5412FF55.1090605@oracle.com>
	<541321CD02000078000348EB@mail.emea.novell.com>
Mime-Version: 1.0
Content-Type: text/plain; charset="us-ascii"; Format="flowed"
Content-Transfer-Encoding: 7bit
Return-path: <xen-devel-bounces@lists.xen.org>
In-Reply-To: <541321CD02000078000348EB@mail.emea.novell.com>
List-Unsubscribe: <http://lists.xen.org/cgi-bin/mailman/options/xen-devel>,
	<mailto:xen-devel-request@lists.xen.org?subject=unsubscribe>
List-Post: <mailto:xen-devel@lists.xen.org>
List-Help: <mailto:xen-devel-request@lists.xen.org?subject=help>
List-Subscribe: <http://lists.xen.org/cgi-bin/mailman/listinfo/xen-devel>,
	<mailto:xen-devel-request@lists.xen.org?subject=subscribe>
Sender: xen-devel-bounces@lists.xen.org
Errors-To: xen-devel-bounces@lists.xen.org
To: Jan Beulich <JBeulich@suse.com>
Cc: tim@xen.org, kevin.tian@intel.com, keir@xen.org, suravee.suthikulpanit@amd.com, andrew.cooper3@citrix.com, eddie.dong@intel.com, xen-devel@lists.xen.org, Aravind.Gopalakrishnan@amd.com, jun.nakajima@intel.com
List-Id: xen-devel@lists.xenproject.org

On 09/12/2014 10:39 AM, Jan Beulich wrote:
>>>> On 12.09.14 at 16:12, <boris.ostrovsky@oracle.com> wrote:
>> On 09/12/2014 02:49 AM, Jan Beulich wrote:
>>>>>> On 11.09.14 at 18:10, <boris.ostrovsky@oracle.com> wrote:
>>>> On 09/11/2014 10:59 AM, Jan Beulich wrote:
>>>>>>>> On 11.09.14 at 16:12, <boris.ostrovsky@oracle.com> wrote:
>>>>>> On 09/11/2014 02:44 AM, Jan Beulich wrote:
>>>>>>>>>> On 10.09.14 at 19:37, <boris.ostrovsky@oracle.com> wrote:
>>>>>>>> On 09/10/2014 11:05 AM, Jan Beulich wrote:
>>>>>>>>>>>> On 04.09.14 at 05:41, <boris.ostrovsky@oracle.com> wrote:
>>>>>>     > > +long do_xenpmu_op(int op, XEN_GUEST_HANDLE_PARAM(xen_pmu_params_t) arg)
>>>>>>     > > +{
>>>>>>     > > +    int ret = -EINVAL;
>>>>>>     > > +    xen_pmu_params_t pmu_params;
>>>>>>     > > +
>>>>>>     > > +    switch ( op )
>>>>>>     > > +    {
>>>>>>     > > +    case XENPMU_mode_set:
>>>>>>     > > +    {
>>>>>>     > > +        static DEFINE_SPINLOCK(xenpmu_mode_lock);
>>>>>>     > > +        uint32_t current_mode;
>>>>>>     > > +
>>>>>>     > > +        if ( !is_control_domain(current->domain) )
>>>>>>     > > +            return -EPERM;
>>>>>>     > > +
>>>>>>     > > +        if ( copy_from_guest(&pmu_params, arg, 1) )
>>>>>>     > > +            return -EFAULT;
>>>>>>     > > +
>>>>>>     > > +        if ( pmu_params.val & ~XENPMU_MODE_SELF )
>>>>>>     > > +            return -EINVAL;
>>>>>>     > > +
>>>>>>     > > +        /*
>>>>>>     > > +         * Return error is someone else is in the middle of changing
>>>> mode ---
>>>>>>     > > +         * this is most likely indication of two system administrators
>>>>>>     > > +         * working against each other
>>>>>>     > > +         */
>>>>>>     > > +        if ( !spin_trylock(&xenpmu_mode_lock) )
>>>>>>     > > +            return -EAGAIN;
>>>>>>     >
>>>>>>     > So what happens if you can't take the lock in a continuation? If
>>>>>>     > returning -EAGAIN in that case is not a problem, what do you
>>>>>>     > need the continuation for in the first place?
>>>>>>
>>>>>> EAGAIN this case means that the caller was not able to initiate the
>>>>>> operation. Continuation will allow the caller to finish operation in
>>>>>> progress.
>>>>> But that's only what you want, not what the code does. Also now
>>>>> that I look again I don't think the comment really applies to this if().
>>>> Oh, I see. Then both first and second will fail.
>>>>
>>>> I can make the second caller reset everything so that when continuation
>>>> gets to run it will start anew. And if it (i.e. the first caller) did
>>>> get -EAGAIN while trying to get the lock then it's just as well --- the
>>>> state will be clean when user tries this again.
>>>>
>>>> As for the question why continuation is needed in the firs place ---
>>>> it's to make sure this hypercall doesn't prevent other unrelated
>>>> operations from executing. Not to manage simultaneous execution of this
>>>> hypercall from multiple VCPUs (if this is what you were asking).
>>> No, that's not what I was asking. The point I'm trying to make is - if
>>> the caller is in need of dealing with -EAGAIN anyway (i.e. you
>>> require it to retry), why can't you simply return -EAGAIN also for
>>> the case where you currently use a continuation?
>> You mean
>>
>>    while ( atomic_read(&vpmu_sched_counter) != allbutself_num )
>> {
>>           /* Give up after 5 seconds */
>>           if ( NOW() > start + SECONDS(5) )
>> {
>>               printk(XENLOG_WARNING "vpmu_force_context_switch: failed to
>> sync\n");
>>               ret = -EBUSY;
>> break;
>> }
>> cpu_relax();
>>           if ( hypercall_preempt_check() ) {
>>               //return hypercall_create_continuation(
>>               //    __HYPERVISOR_xenpmu_op, "ih", XENPMU_mode_set, arg);
>>               return -EAGAIN; // plus cleanup
>> }
>>       }
>>
>> ?
> Yes.

My concern is that on a very large system we may be getting lots of 
preemptions and thus this will be failing for the sysadmin somewhat 
often and without clear reason. (I am basing this assumption on the fact 
that I am currently looking at a ~200CPU box with gobs of memory and 
some operations that use contuations get preempted very often. Hundreds 
of times. I don't remember seeing this on "regular" systems).

-boris