From mboxrd@z Thu Jan  1 00:00:00 1970
Return-Path:
Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand
	id S1755575AbbFCRnm (ORCPT ); Wed, 3 Jun 2015 13:43:42 -0400
Received: from mail-wi0-f177.google.com ([209.85.212.177]:36592 "EHLO
	mail-wi0-f177.google.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org
	with ESMTP id S1753886AbbFCRne (ORCPT ); Wed, 3 Jun 2015 13:43:34 -0400
Message-ID: <1433353411.3407.15.camel@gmail.com>
Subject: Re: [PATCH RESEND] sched: prefer an idle cpu vs an idle sibling for BALANCE_WAKE
From: Mike Galbraith
To: Josef Bacik
Cc: Peter Zijlstra, Rik van Riel, mingo@redhat.com,
	linux-kernel@vger.kernel.org, morten.rasmussen@arm.com, kernel-team
Date: Wed, 03 Jun 2015 19:43:31 +0200
In-Reply-To: <556F3677.2090206@fb.com>
References: <1432761736-22093-1-git-send-email-jbacik@fb.com>
	<20150528102127.GD3644@twins.programming.kicks-ass.net>
	<20150528110514.GR18673@twins.programming.kicks-ass.net>
	<5568D43D.20703@fb.com>
	<556CB4A8.1050509@fb.com>
	<1433191354.11346.22.camel@twins>
	<556DE3FB.9020400@fb.com>
	<556F0B5E.6030805@redhat.com>
	<1433341448.1495.4.camel@twins>
	<1433345444.3343.21.camel@gmail.com>
	<556F23E5.5020107@fb.com>
	<1433350386.3996.15.camel@gmail.com>
	<556F3677.2090206@fb.com>
Content-Type: text/plain; charset="UTF-8"
X-Mailer: Evolution 3.12.11
Mime-Version: 1.0
Content-Transfer-Encoding: 7bit
Sender: linux-kernel-owner@vger.kernel.org
List-ID:
X-Mailing-List: linux-kernel@vger.kernel.org

On Wed, 2015-06-03 at 13:16 -0400, Josef Bacik wrote:

> Eesh ok, do you happen to remember how you ran tbench so I can add it to
> my tests here?  In addition to fixing this problem we're also interested
> in tracking performance of new kernels so we don't have to do this "what
> the hell went wrong in the last 6 releases" dance every year, so I'm
> throwing every performance thing we find useful in our test
> infrastructure.  Thanks,
>
> Josef

Start a tbench server, then tbench -t 30 1 localhost.

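For a test harness, that run could be scripted roughly as below. This is only a sketch: it assumes the "tbench server" is the tbench_srv binary shipped with the dbench package, and the helper names (tbench_cmd, run_tbench), the variables and the one-second startup wait are made up for illustration.

```shell
#!/bin/sh
# Sketch of the tbench run described above.
# Assumption: tbench and tbench_srv are installed from the dbench package.

DURATION=30      # seconds, passed to tbench as -t
CLIENTS=1        # number of tbench client processes
HOST=localhost   # host where the tbench server listens

# Print the client command line so it can be logged or inspected.
tbench_cmd() {
    echo "tbench -t $DURATION $CLIENTS $HOST"
}

# Start the server, run the client against it, then stop the server.
run_tbench() {
    tbench_srv &
    srv_pid=$!
    sleep 1                          # crude wait for the server to start listening
    tbench -t "$DURATION" "$CLIENTS" "$HOST"
    status=$?
    kill "$srv_pid" 2>/dev/null      # stop the background server
    return "$status"
}

# Only run the benchmark when asked to, e.g. "sh tbench-run.sh run".
if [ "${1:-}" = "run" ]; then
    run_tbench
fi
```

Invoked without arguments the script does nothing; with "run" it performs the single-client, 30 second run against localhost. A real harness would want a better readiness check than the sleep.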
You're unlikely to find anything as painful as that bouncing cow bug was, but you won't have to look hard at all to find bounce pain.

There are also other loads like your server where waking to an idle cpu dominates all else; pgbench is one of those.  In that case, you've got a 1:N waker/wakee relationship, and what matters above ALL else is that when the mother of all work (the single server thread) wants a CPU, it had better get it NOW, else the load stalls.  Likewise, 'mom' being preempted hurts truckloads.  Perhaps your server has a similar thing going on, where keeping wakees the hell away from the waker rules all.

	-Mike