From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1755439AbbDIOFS (ORCPT ); Thu, 9 Apr 2015 10:05:18 -0400 Received: from www.linutronix.de ([62.245.132.108]:47600 "EHLO Galois.linutronix.de" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1753451AbbDIOFO (ORCPT ); Thu, 9 Apr 2015 10:05:14 -0400 Date: Thu, 9 Apr 2015 16:05:12 +0200 From: Sebastian Andrzej Siewior To: Mike Galbraith Cc: LKML , linux-rt-users , Steven Rostedt Subject: Re: [patch] rt, hotplug: Use set_cpus_allowed_ptr() in sync_unplug_thread() Message-ID: <20150409140512.GB2416@linutronix.de> References: <1427181289.3316.27.camel@gmail.com> MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Disposition: inline In-Reply-To: <1427181289.3316.27.camel@gmail.com> X-Key-Id: 2A8CF5D1 X-Key-Fingerprint: 6425 4695 FFF0 AA44 66CC 19E6 7B96 E816 2A8C F5D1 User-Agent: Mutt/1.5.21 (2010-09-15) Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org * Mike Galbraith | 2015-03-24 08:14:49 [+0100]: >do_set_cpus_allowed() is not safe vs ->sched_class change. > >crash> bt >PID: 11676 TASK: ffff88026f979da0 CPU: 22 COMMAND: "sync_unplug/22" > #0 [ffff880274d25bc8] machine_kexec at ffffffff8103b41c > #1 [ffff880274d25c18] crash_kexec at ffffffff810d881a > #2 [ffff880274d25cd8] oops_end at ffffffff81525818 > #3 [ffff880274d25cf8] do_invalid_op at ffffffff81003096 > #4 [ffff880274d25d90] invalid_op at ffffffff8152d3de > [exception RIP: set_cpus_allowed_rt+18] > RIP: ffffffff8109e012 RSP: ffff880274d25e48 RFLAGS: 00010202 > RAX: ffffffff8109e000 RBX: ffff88026f979da0 RCX: ffff8802770cb6e8 > RDX: 0000000000000000 RSI: ffffffff81add700 RDI: ffff88026f979da0 > RBP: ffff880274d25e78 R8: ffffffff816112e0 R9: 0000000000000001 > R10: 0000000000000001 R11: 0000000000011940 R12: ffff88026f979da0 > R13: ffff8802770cb6d0 R14: ffff880274d25fd8 R15: 0000000000000000 > ORIG_RAX: ffffffffffffffff CS: 0010 SS: 0018 > #5 [ffff880274d25e60] do_set_cpus_allowed at ffffffff8108e65f > #6 [ffff880274d25e80] sync_unplug_thread at ffffffff81058c08 > #7 [ffff880274d25ed8] kthread at ffffffff8107cad6 > #8 [ffff880274d25f50] ret_from_fork at ffffffff8152bbbc >crash> task_struct ffff88026f979da0 | grep class > sched_class = 0xffffffff816111e0 , Is this a one-time thing or can you reproduce this? What happen here? I doubt p vanished. +18 is mostlikely the "migrate_disabled_updated()" check. I doubt p->sched_class->set_cpus_allowed or p->sched_class vanish between testing for it and invoking it, or did it? >Signed-off-by: Mike Galbraith >Cc: stable-rt@vger.kernel.org >--- > kernel/cpu.c | 2 +- > 1 file changed, 1 insertion(+), 1 deletion(-) > >--- a/kernel/cpu.c >+++ b/kernel/cpu.c >@@ -265,7 +265,7 @@ static int sync_unplug_thread(void *data > * we don't want any more work on this CPU. > */ > current->flags &= ~PF_NO_SETAFFINITY; >- do_set_cpus_allowed(current, cpu_present_mask); >+ set_cpus_allowed_ptr(current, cpu_present_mask); > migrate_me(); > return 0; > } Sebastian