From mboxrd@z Thu Jan  1 00:00:00 1970
Return-Path: <linux-kernel-owner+willy=40w.ods.org-S1751052AbWAKWFQ@vger.kernel.org>
Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand
	id S1751052AbWAKWFQ (ORCPT <rfc822;willy@w.ods.org>);
	Wed, 11 Jan 2006 17:05:16 -0500
Received: (majordomo@vger.kernel.org) by vger.kernel.org id S1751104AbWAKWFQ
	(ORCPT <rfc822;linux-kernel-outgoing>);
	Wed, 11 Jan 2006 17:05:16 -0500
Received: from omta03ps.mx.bigpond.com ([144.140.82.155]:57733 "EHLO
	omta03ps.mx.bigpond.com") by vger.kernel.org with ESMTP
	id S1751052AbWAKWFO (ORCPT <rfc822;linux-kernel@vger.kernel.org>);
	Wed, 11 Jan 2006 17:05:14 -0500
Message-ID: <43C58117.9080706@bigpond.net.au>
Date: Thu, 12 Jan 2006 09:05:11 +1100
From: Peter Williams <pwil3058@bigpond.net.au>
User-Agent: Mozilla Thunderbird 1.0.7-1.1.fc4 (X11/20050929)
X-Accept-Language: en-us, en
MIME-Version: 1.0
To: Con Kolivas <kernel@kolivas.org>
CC: "Martin J. Bligh" <mbligh@google.com>, Andrew Morton <akpm@osdl.org>,
       linux-kernel@vger.kernel.org, Ingo Molnar <mingo@elte.hu>
Subject: Re: -mm seems significanty slower than mainline on kernbench
References: <43C45BDC.1050402@google.com> <43C4A3E9.1040301@google.com> <43C4F8EE.50208@bigpond.net.au> <200601120129.16315.kernel@kolivas.org>
In-Reply-To: <200601120129.16315.kernel@kolivas.org>
Content-Type: text/plain; charset=ISO-8859-1; format=flowed
Content-Transfer-Encoding: 7bit
X-Authentication-Info: Submitted using SMTP AUTH PLAIN at omta03ps.mx.bigpond.com from [147.10.133.38] using ID pwil3058@bigpond.net.au at Wed, 11 Jan 2006 22:05:12 +0000
Sender: linux-kernel-owner@vger.kernel.org
X-Mailing-List: linux-kernel@vger.kernel.org

Con Kolivas wrote:
> On Wednesday 11 January 2006 23:24, Peter Williams wrote:
> 
>>Martin J. Bligh wrote:
>>
>>>That seems broken to me ?
>>
>>But, yes, given that the problem goes away when the patch is removed
>>(which we're still waiting to see) it's broken.  I think the problem is
>>probably due to the changed metric (i.e. biased load instead of simple
>>load) causing idle_balance() to fail more often (i.e. it decides to not
>>bother moving any tasks more often than it otherwise would) which would
>>explain the increased idle time being seen.  This means that the fix
>>would be to review the criteria for deciding whether to move tasks in
>>idle_balance().
> 
> 
> Look back on my implementation. The problem as I saw it was that one task 
> alone with a biased load would suddenly make a runqueue look much busier than 
> it was supposed to so I special cased the runqueue that had precisely one 
> task.

OK.  I'll look at that.

But I was thinking more about the code that (in the original) handled 
the case where the number of tasks to be moved was less than 1 but more 
than 0 (i.e. the cases where "imbalance" would have been reduced to zero 
when divided by SCHED_LOAD_SCALE).  I think that I got that part wrong 
and you can end up with a bias load to be moved which is less than any 
of the bias_prio values for any queued tasks (in circumstances where the 
original code would have rounded up to 1 and caused a move).  I think 
that the way to handle this problem is to replace 1 with "average bias 
prio" within that logic.  This would guarantee at least one task with a 
bias_prio small enough to be moved.

I think that this analysis is a strong argument for my original patch 
being the cause of the problem so I'll go ahead and generate a fix. 
I'll try to have a patch available later this morning.

Peter
PS
-- 
Peter Williams                                   pwil3058@bigpond.net.au

"Learning, n. The kind of ignorance distinguishing the studious."
  -- Ambrose Bierce