From mboxrd@z Thu Jan 1 00:00:00 1970
From: Faidon Liambotis
Subject: Re: Bug#557262: 2.6.31+2.6.31.4: XFS - All I/O locks up to D-state after 24-48 hours (sysrq-t+w available) - root cause found = asterisk
Date: Sat, 21 Nov 2009 01:44:24 +0200
Message-ID: <4B0729D8.3000105@debian.org>
References: <20091019030456.GS9464@discord.disaster> <20091020003358.GW9464@discord.disaster>
Reply-To: Asterisk Users Mailing List - Non-Commercial Discussion
Mime-Version: 1.0
Content-Type: text/plain; charset="us-ascii"
Content-Transfer-Encoding: 7bit
Sender: asterisk-users-bounces@lists.digium.com
Errors-To: asterisk-users-bounces@lists.digium.com
To: Justin Piszcz, 557262@bugs.debian.org
Cc: linux-raid@vger.kernel.org, Dave Chinner, linux-kernel@vger.kernel.org, xfs@oss.sgi.com, submit@bugs.debian.org, asterisk-users@lists.digium.com, Alan Piszcz
List-Id: linux-raid.ids

Justin Piszcz wrote:
> Found root cause-- root cause is asterisk PBX software. I use an SPA3102.
> When someone called me, they accidentally dropped the connection, I called
> them back in a short period. It is during this time (and the last time)
> this happened that the box froze under multiple(!) kernels, always when
> someone was calling.
> I don't know what asterisk is doing but top did run before the crash
> and asterisk was using 100% CPU and as I noted before all other processes
> were in D-state.
>
> When this bug occurs, it freezes I/O to all devices and the only way to
> recover is to reboot the system.

That's obviously *not* the root cause. It's not normal for an application
that isn't even privileged to hang all I/O and, subsequently, everything
on a system. This is most probably a kernel issue, and asterisk just does
something that triggers this bug.
Regards,
Faidon