From mboxrd@z Thu Jan  1 00:00:00 1970
Return-Path:
Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand
	id S1753078Ab2GPCWx (ORCPT ); Sun, 15 Jul 2012 22:22:53 -0400
Received: from charybdis-ext.suse.de ([195.135.221.2]:50695 "EHLO
	nat.nue.novell.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org
	with ESMTP id S1751760Ab2GPCWv (ORCPT );
	Sun, 15 Jul 2012 22:22:51 -0400
Message-ID: <1342405366.7659.35.camel@marge.simpson.net>
Subject: Re: Deadlocks due to per-process plugging
From: Mike Galbraith
To: Thomas Gleixner
Cc: Jan Kara, Jeff Moyer, LKML, linux-fsdevel@vger.kernel.org,
	Tejun Heo, Jens Axboe, mgalbraith@suse.com
Date: Mon, 16 Jul 2012 04:22:46 +0200
In-Reply-To: <1342343673.28142.2.camel@marge.simpson.net>
References: <20120711133735.GA8122@quack.suse.cz>
	<20120711201601.GB9779@quack.suse.cz>
	<20120713123318.GB20361@quack.suse.cz>
	<20120713144622.GB28715@quack.suse.cz>
	<1342343673.28142.2.camel@marge.simpson.net>
Content-Type: text/plain; charset="UTF-8"
X-Mailer: Evolution 3.2.3
Content-Transfer-Encoding: 7bit
Mime-Version: 1.0
Sender: linux-kernel-owner@vger.kernel.org
List-ID:
X-Mailing-List: linux-kernel@vger.kernel.org

On Sun, 2012-07-15 at 11:14 +0200, Mike Galbraith wrote:
> On Sun, 2012-07-15 at 10:59 +0200, Thomas Gleixner wrote:

> > Can you figure out on which lock the stuck thread which did not unplug
> > due to tsk_is_pi_blocked was blocked?
>
> I'll take a peek.

Sorry for late reply, took a half day away from box.

Jan had already done the full ext3 IO deadlock analysis:

Again kjournald is waiting for buffer IO on block 4367635 (sector
78364838) to finish. Now it is dbench thread 0xffff88026f330e70 which
has submitted this buffer for IO and is still holding this buffer behind
its plug (request for sector 78364822..78364846). The dbench thread is
waiting on j_checkpoint mutex (apparently it has successfully got the
mutex in the past, checkpointed some buffers, released the mutex and
hung when trying to acquire it again in the next loop of
__log_wait_for_space()).

-Mike
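
For anyone following along, the dependency described above can be modelled in a few lines of userspace C. This is only a sketch, not kernel code: struct task_model and model_sched_submit_work() are made-up names, and the early return on pi_blocked is an assumption based on the tsk_is_pi_blocked check Thomas asked about. It shows that a task going to sleep while marked pi-blocked keeps its plugged requests, so whoever waits on that IO (kjournald in Jan's analysis) never sees it submitted.

```c
#include <stdbool.h>

/* Hypothetical stand-in for the few pieces of task state that matter here. */
struct task_model {
	bool going_to_sleep;   /* task is about to schedule out */
	bool pi_blocked;       /* what a tsk_is_pi_blocked()-style check would report */
	int  plugged_requests; /* requests still held behind the per-task plug */
};

/*
 * Model of the decision made when a task schedules out (assumed logic,
 * not the actual kernel source): flush the plug unless the task is
 * still runnable or is considered pi-blocked.
 * Returns the number of requests that were flushed.
 */
int model_sched_submit_work(struct task_model *t)
{
	int flushed;

	if (!t->going_to_sleep || t->pi_blocked)
		return 0;              /* plug left untouched, IO stays queued */

	flushed = t->plugged_requests;
	t->plugged_requests = 0;       /* requests handed to the block layer */
	return flushed;
}
```

With pi_blocked set, the model returns 0 and the request stays behind the plug, which matches the dbench thread above: it sleeps on j_checkpoint while still holding the buffer kjournald is waiting for.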