From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1161120Ab2JXUYU (ORCPT ); Wed, 24 Oct 2012 16:24:20 -0400 Received: from mx1.redhat.com ([209.132.183.28]:34122 "EHLO mx1.redhat.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S933652Ab2JXUYS (ORCPT ); Wed, 24 Oct 2012 16:24:18 -0400 Date: Wed, 24 Oct 2012 16:24:11 -0400 From: Dave Jones To: Linux Kernel Cc: Darren Hart Subject: pi futex oops in __lock_acquire Message-ID: <20121024202411.GA20597@redhat.com> Mail-Followup-To: Dave Jones , Linux Kernel , Darren Hart References: <20121018015312.GA29865@redhat.com> <507F9944.50505@kernel.dk> <20121019052136.GA30852@redhat.com> <1350650972.30157.34.camel@twins> <20121023195026.GA2872@redhat.com> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <20121023195026.GA2872@redhat.com> User-Agent: Mutt/1.5.21 (2010-09-15) Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org I've been able to trigger this for the last week or so. Unclear whether this is a new bug, or my fuzzer got smarter, but I see the pi-futex code hasn't changed since the last time it found something.. > BUG: unable to handle kernel NULL pointer dereference at 0000000000000018 > IP: [] __lock_acquire+0x5e/0x1ba0 > PGD 8e72c067 PUD 34f07067 PMD 0 > Oops: 0000 [#1] PREEMPT SMP > CPU 7 > Pid: 27513, comm: trinity-child0 Not tainted 3.7.0-rc2+ #43 > RIP: 0010:[] [] __lock_acquire+0x5e/0x1ba0 > RSP: 0018:ffff8800803f7b28 EFLAGS: 00010046 > RAX: 0000000000000086 RBX: 0000000000000000 RCX: 0000000000000000 > RDX: 0000000000000000 RSI: 0000000000000000 RDI: 0000000000000018 > RBP: ffff8800803f7c18 R08: 0000000000000002 R09: 0000000000000000 > R10: 0000000000000000 R11: 2222222222222222 R12: 0000000000000002 > R13: ffff880051dd8000 R14: 0000000000000002 R15: 0000000000000018 > FS: 00007f9fc6ccb740(0000) GS:ffff880148a00000(0000) knlGS:0000000000000000 > CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 > CR2: 0000000000000018 CR3: 000000008e6fb000 CR4: 00000000001407e0 > DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000 > DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400 > Process trinity-child0 (pid: 27513, threadinfo ffff8800803f6000, task ffff880051dd8000) > Stack: > ffff8800803f7b48 ffffffff816c5c59 ffff8800803f7b48 ffff88014840ebc0 > ffff8800803f7b68 ffffffff816c18e3 ffff8800803f7d10 0000000000000001 > ffff8800803f7ba8 ffffffff810a1e62 ffff8800803f7d10 0000000000000282 > Call Trace: > [] ? sub_preempt_count+0x79/0xd0 > [] ? _raw_spin_unlock_irqrestore+0x73/0xa0 > [] ? hrtimer_try_to_cancel+0x52/0x210 > [] ? debug_rt_mutex_free_waiter+0x15/0x180 > [] ? rt_mutex_slowlock+0x127/0x1b0 > [] ? local_clock+0x89/0xa0 > [] lock_acquire+0xa2/0x220 > [] ? futex_lock_pi.isra.18+0x1cc/0x390 > [] _raw_spin_lock+0x40/0x80 > [] ? futex_lock_pi.isra.18+0x1cc/0x390 > [] futex_lock_pi.isra.18+0x1cc/0x390 > [] ? update_rmtp+0x70/0x70 > [] do_futex+0x394/0xa50 > [] ? might_fault+0x53/0xb0 > [] sys_futex+0x8d/0x190 > [] tracesys+0xe1/0xe6 > Code: d8 45 0f 45 e0 4c 89 75 f0 4c 89 7d f8 85 c0 0f 84 f8 00 00 00 8b 05 22 fe f3 00 49 89 ff 89 f3 41 89 d2 85 c0 0f 84 02 01 00 00 <49> 8b 07 ba 01 00 00 00 48 3d c0 81 06 82 44 0f 44 e2 83 fb 01 > RIP [] __lock_acquire+0x5e/0x1ba0 > RSP > CR2: 0000000000000018 It looks like we got all the way to lock_acquire with a NULL 'lock' somehow. Darren, any idea how this could happen ? Dave