From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1757858Ab1IATir (ORCPT ); Thu, 1 Sep 2011 15:38:47 -0400 Received: from www.linutronix.de ([62.245.132.108]:48653 "EHLO Galois.linutronix.de" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1757812Ab1IATiq (ORCPT ); Thu, 1 Sep 2011 15:38:46 -0400 Date: Thu, 1 Sep 2011 21:38:35 +0200 (CEST) From: Thomas Gleixner To: Fernando Lopez-Lezcano cc: linux-rt-users , LKML , "Paul E. McKenney" , efault@gmx.de, Peter Zijlstra , LKML Subject: Re: 3.0.4 + rt12: deadlock In-Reply-To: <4E5E6A75.5030404@localhost> Message-ID: References: <4E5E6A75.5030404@localhost> User-Agent: Alpine 2.02 (LFD 1266 2009-07-14) MIME-Version: 1.0 Content-Type: TEXT/PLAIN; charset=US-ASCII X-Linutronix-Spam-Score: -1.0 X-Linutronix-Spam-Level: - X-Linutronix-Spam-Status: No , -1.0 points, 5.0 required, ALL_TRUSTED=-1,SHORTCIRCUIT=-0.0001 Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Wed, 31 Aug 2011, Fernando Lopez-Lezcano wrote: > Booting 3.0.4 + rt12 on a quadcore workstation (running fc14) gave me this: > > ---- > ============================================= > [ INFO: possible recursive locking detected ] > 3.0.4-1.rt12.1.fc14.ccrma.i686.rtPAE #1 > --------------------------------------------- > swapper/0 is trying to acquire lock: > (&parent->list_lock){+.+...}, at: [] > __cache_free.clone.27+0x45/0xc4 > > but task is already holding lock: > (&parent->list_lock){+.+...}, at: [] do_tune_cpucache+0xf0/0x2b0 > > other info that might help us debug this: > Possible unsafe locking scenario: > > CPU0 > ---- > lock(&parent->list_lock); > lock(&parent->list_lock); That's something which has to do with debugging options (debugobjects IIRC). There was some attempt to fix that, but that might have gone lost in my vacation and the following futile attempt to take care of the resulting backlog. Peter ??? Thanks, tglx > *** DEADLOCK *** > > May be due to missing lock nesting notation > > 3 locks held by swapper/0: > #0: (cache_chain_mutex){+.+...}, at: [] > kmem_cache_init_late+0x15/0x61 > #1: (&per_cpu(slab_lock, __cpu).lock){+.+...}, at: [] > __local_lock_irq+0x1e/0x5b > #2: (&parent->list_lock){+.+...}, at: [] > do_tune_cpucache+0xf0/0x2b0 > > stack backtrace: > Pid: 0, comm: swapper Not tainted 3.0.4-1.rt12.1.fc14.ccrma.i686.rtPAE #1 > Call Trace: > [] ? printk+0x2d/0x2f > [] __lock_acquire+0x8b3/0xc2f > [] ? rt_spin_lock_slowlock+0x67/0x170 > [] ? mark_lock+0x26/0x1bb > [] ? __cache_free.clone.27+0x45/0xc4 > [] lock_acquire+0xde/0x11d > [] ? __cache_free.clone.27+0x45/0xc4 > [] rt_spin_lock+0x3d/0x43 > [] ? __cache_free.clone.27+0x45/0xc4 > [] __cache_free.clone.27+0x45/0xc4 > [] ? test_ti_thread_flag+0x8/0x10 > [] kmem_cache_free+0x73/0xe1 > [] slab_destroy+0x4f/0x53 > [] free_block+0x94/0xc5 > [] do_tune_cpucache+0x109/0x2b0 > [] enable_cpucache+0x7b/0xa7 > [] kmem_cache_init_late+0x26/0x61 > [] start_kernel+0x24f/0x367 > [] ? loglevel+0x1a/0x1a > [] ? reserve_ebda_region+0x70/0x72 > [] i386_start_kernel+0xb2/0xba > Console: colour VGA+ 80x25 > console [tty0] enabled > Lock dependency validator: Copyright (c) 2006 Red Hat, Inc., Ingo Molnar > ... MAX_LOCKDEP_SUBCLASSES: 8 > ... MAX_LOCK_DEPTH: 48 > ... MAX_LOCKDEP_KEYS: 8191 > ... CLASSHASH_SIZE: 4096 > ... MAX_LOCKDEP_ENTRIES: 32768 > ... MAX_LOCKDEP_CHAINS: 65536 > ... CHAINHASH_SIZE: 32768 > ---- > > I started working and a little while later the machine froze (jack + heavy > prioritized udp traffic in eth1 - with the r8169 driver). It recognized > alt-sysrq boot so it was not completely dead. Nothing left on the logs to see. > > Full config attached. > -- Fernando >