From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on aws-us-west-2-korg-lkml-1.web.codeaurora.org X-Spam-Level: X-Spam-Status: No, score=-5.3 required=3.0 tests=BAYES_00, HEADER_FROM_DIFFERENT_DOMAINS,MAILING_LIST_MULTI,NICE_REPLY_A,SPF_HELO_NONE, SPF_PASS,USER_AGENT_SANE_1 autolearn=no autolearn_force=no version=3.4.0 Received: from mail.kernel.org (mail.kernel.org [198.145.29.99]) by smtp.lore.kernel.org (Postfix) with ESMTP id 84D0DC433ED for ; Fri, 23 Apr 2021 02:18:22 +0000 (UTC) Received: from vger.kernel.org (vger.kernel.org [23.128.96.18]) by mail.kernel.org (Postfix) with ESMTP id 589F86141C for ; Fri, 23 Apr 2021 02:18:22 +0000 (UTC) Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S236688AbhDWCS5 (ORCPT ); Thu, 22 Apr 2021 22:18:57 -0400 Received: from szxga05-in.huawei.com ([45.249.212.191]:17028 "EHLO szxga05-in.huawei.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S236041AbhDWCS4 (ORCPT ); Thu, 22 Apr 2021 22:18:56 -0400 Received: from DGGEMS413-HUB.china.huawei.com (unknown [172.30.72.60]) by szxga05-in.huawei.com (SkyGuard) with ESMTP id 4FRHtr4FhkzPtX3; Fri, 23 Apr 2021 10:15:16 +0800 (CST) Received: from [127.0.0.1] (10.174.176.216) by DGGEMS413-HUB.china.huawei.com (10.3.19.213) with Microsoft SMTP Server id 14.3.498.0; Fri, 23 Apr 2021 10:18:10 +0800 Subject: Re: [PATCH] e2fsprogs: Try again to solve unreliable io case To: Theodore Ts'o , Haotian Li CC: Ext4 Developers List , "harshad shirwadkar," , linfeilong References: From: Zhiqiang Liu Message-ID: Date: Fri, 23 Apr 2021 10:18:09 +0800 User-Agent: Mozilla/5.0 (Windows NT 10.0; WOW64; rv:68.0) Gecko/20100101 Thunderbird/68.2.2 MIME-Version: 1.0 In-Reply-To: Content-Type: text/plain; charset="utf-8" Content-Language: en-US Content-Transfer-Encoding: 7bit X-Originating-IP: [10.174.176.216] X-CFilter-Loop: Reflected Precedence: bulk List-ID: X-Mailing-List: linux-ext4@vger.kernel.org On 2021/4/21 0:19, Theodore Ts'o wrote: > On Tue, Apr 20, 2021 at 03:18:05PM +0800, Haotian Li wrote: >> If some I/O error occured during e2fsck, for example the >> fibre channel connections are flasky, the e2fsck may exit. >> Try again in these I/O error cases may help e2fsck >> successfully execute and fix the disk correctly. > > Why not fix this by retrying in the device driver instead? If the > Fibre Channel is that flaky, then it's going to be a problem when the > file system is mounted, so it would seem to me that fixing this in the > kernel makes a lot more sense. > > - Ted > Thanks for your reply. Actually, we have met the problem in ipsan situation. When exec 'fsck -a ', short-term fluctuations or abnormalities may occur on the network. Despite the driver has do the best effort, some IO errors may occur. So add retrying in e2fsprogs can further improve the reliability of the repair process. Regards Zhiqiang Liu > . >