From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1753484AbaKLGLL (ORCPT ); Wed, 12 Nov 2014 01:11:11 -0500 Received: from mail-oi0-f50.google.com ([209.85.218.50]:51730 "EHLO mail-oi0-f50.google.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1751182AbaKLGLJ (ORCPT ); Wed, 12 Nov 2014 01:11:09 -0500 MIME-Version: 1.0 In-Reply-To: <20141111171526.GC3216@lerouge> References: <20141110153147.GK4901@linux.vnet.ibm.com> <20141111171526.GC3216@lerouge> Date: Wed, 12 Nov 2014 11:41:09 +0530 Message-ID: Subject: Re: Future of NOHZ full/isolation development (was Re: [NOHZ] Remove scheduler_tick_max_deferment) From: Viresh Kumar To: Frederic Weisbecker Cc: Christoph Lameter , "Paul E. McKenney" , Thomas Gleixner , Linux Kernel Mailing List , Gilad Ben-Yossef , Tejun Heo , John Stultz , Mike Frysinger , Minchan Kim , Hakan Akkan , Max Krasnyansky , Hugh Dickins , "H. Peter Anvin" , Ingo Molnar , Peter Zijlstra , Kevin Hilman Content-Type: text/plain; charset=UTF-8 Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On 11 November 2014 22:45, Frederic Weisbecker wrote: > Here is a summarized list: > > * Unbound workqueues affinity (to housekeeper) > * Unbound timers affinity (to housekeeper) > * 1 Hz residual scheduler tick offlining to housekeeper > * Fix some scheduler accounting that don't even work with 1 Hz: cpu load > accounting, rt_scale, load balancing, etc... > * Lighten the syscall path and get rid of cputime accounting + RCU hooks > for people who want isolation + fast syscalls and faults. > * Work on non-affinable workqueues > * Work on non-affinable timers > * ... + spurious interrupts with NOHZ_FULL on all architectures which break isolation but doesn't get caught with traces. Can be observed with this: diff --git a/kernel/time/hrtimer.c b/kernel/time/hrtimer.c index 481fa54..91d490d 100644 --- a/kernel/time/hrtimer.c +++ b/kernel/time/hrtimer.c @@ -1244,7 +1244,8 @@ void hrtimer_interrupt(struct clock_event_device *dev) { struct hrtimer_cpu_base *cpu_base = this_cpu_ptr(&hrtimer_bases); ktime_t expires_next, now, entry_time, delta; - int i, retries = 0; + int i, retries = 0, count = 0; + static int total_spurious; BUG_ON(!cpu_base->hres_active); cpu_base->nr_events++; @@ -1304,10 +1305,14 @@ void hrtimer_interrupt(struct clock_event_device *dev) break; } + count++; __run_hrtimer(timer, &basenow); } } + if (!count) + pr_err("____%s: Totalspurious: %d\n", __func__, ++total_spurious); + /* * Store the new expiry value so the migration code can verify * against it.