From mboxrd@z Thu Jan 1 00:00:00 1970 Received: from eggs.gnu.org ([208.118.235.92]:33003) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1Uel5N-0007To-0I for qemu-devel@nongnu.org; Tue, 21 May 2013 07:50:37 -0400 Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1Uel5I-0007q7-7J for qemu-devel@nongnu.org; Tue, 21 May 2013 07:50:32 -0400 Received: from cantor2.suse.de ([195.135.220.15]:54430 helo=mx2.suse.de) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1Uel5H-0007q2-N0 for qemu-devel@nongnu.org; Tue, 21 May 2013 07:50:27 -0400 Message-ID: <519B5F80.1080302@suse.de> Date: Tue, 21 May 2013 13:50:24 +0200 From: =?UTF-8?B?QW5kcmVhcyBGw6RyYmVy?= MIME-Version: 1.0 References: <33183CC9F5247A488A2544077AF19020697A3B72@szxeml538-mbx.china.huawei.com> In-Reply-To: <33183CC9F5247A488A2544077AF19020697A3B72@szxeml538-mbx.china.huawei.com> Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: quoted-printable Subject: Re: [Qemu-devel] IDE disk FLUSH take more than 30 secs, the SUSE guest reports "lost interrupt and the file system becomes read-only" List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , To: "Gonglei (Arei)" Cc: "kwolf@redhat.com" , Luonengjun , "qemu-devel@nongnu.org" , Wangzhenguo , Bo Yang , "Huangweidong (Hardware)" Hi, Am 21.05.2013 09:12, schrieb Gonglei (Arei): > In the case of physical hard disk's speed which processing IO (when gro= uping RAID) is very slow, I encountered a problem. > I dd big file in SUSE virtual machine, the command is > linux:/ # dd if=3D/dev/zero of=3D./info bs=3D1M count=3D5000;sync >=20 > but finally I get those message: > linux:~ # dmesg > [ 174.804114] ata1: lost interrupt (Status 0x50) > [ 174.812305] end_request: I/O error, dev sda, sector 12085270 > [ 174.812309] Buffer I/O error on device sda2, logical block 984530 > [ 174.812310] lost page write due to I/O error on sda2 > [ 174.813268] Aborting journal on device sda2. > [ 174.828330] journal commit I/O error > [ 174.828373] ext3_abort called. > [ 174.828375] EXT3-fs error (device sda2): ext3_journal_start_sb: Dete= cted aborted journal > [ 174.828377] Remounting filesystem read-only > [ 182.286424] __journal_remove_journal_head: freeing b_committed_data > [ 182.286434] __journal_remove_journal_head: freeing b_committed_data > [ 182.286442] __journal_remove_journal_head: freeing b_committed_data > [ 182.286452] __journal_remove_journal_head: freeing b_committed_data > [ 182.286472] __journal_remove_journal_head: freeing b_committed_data >=20 >=20 > Through analysis, I found that because the system call the fdatasync co= mmand in the Qemu over 30s, Could you share your QEMU command line being used on the host? In particular I'm wondering about -drive's cache option used - I've only seen issues with cache=3Dunsafe so far. Is it an upstream qemu-system-x86_64 or a SLES qemu-kvm? What version? Regards, Andreas > after the Guest's kernel thread detects the io transferation is timeout= , went to check IDE disk state.=20 > But the IDE disk status is 0x50, rather than the BSY status, and then d= eparted error process... >=20 > the path of kernel's action is : > scsi_softirq_done > scsi_eh_scmd_add > scsi_error_handler > shost->transportt->eh_strategy_handler=20 > ata_scsi_error=20 > ap->ops->lost_interrupt > ata_sff_lost_interrupt > Finally, the file system becomes read-only. >=20 > Why not set the IDE disk for the BSY status When 0xe7 command is execut= ed in the Qemu? > Anyone know it? thanks! >=20 > Best Regards! > -Arei --=20 SUSE LINUX Products GmbH, Maxfeldstr. 5, 90409 N=C3=BCrnberg, Germany GF: Jeff Hawn, Jennifer Guild, Felix Imend=C3=B6rffer; HRB 16746 AG N=C3=BC= rnberg