From mboxrd@z Thu Jan 1 00:00:00 1970 Received: from wp530.webpack.hosteurope.de (wp530.webpack.hosteurope.de [80.237.130.52]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by smtp.subspace.kernel.org (Postfix) with ESMTPS id 4868D944C for ; Tue, 11 Apr 2023 18:15:23 +0000 (UTC) Received: from [2a02:8108:8980:2478:8cde:aa2c:f324:937e]; authenticated by wp530.webpack.hosteurope.de running ExIM with esmtpsa (TLS1.3:ECDHE_RSA_AES_128_GCM_SHA256:128) id 1pmIWS-0008I4-H9; Tue, 11 Apr 2023 20:15:16 +0200 Message-ID: <7e2ec2c1-e962-603c-17e7-a3c11285bf63@leemhuis.info> Date: Tue, 11 Apr 2023 20:15:15 +0200 Precedence: bulk X-Mailing-List: regressions@lists.linux.dev List-Id: List-Subscribe: List-Unsubscribe: MIME-Version: 1.0 User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:102.0) Gecko/20100101 Thunderbird/102.9.1 Subject: Re: [6.2 regression][bisected]discard storm on idle since v6.1-rc8-59-g63a7cb130718 discard=async Content-Language: en-US, de-DE To: dsterba@suse.cz, Michael Bromilow Cc: Boris Burkov , Chris Mason , Christoph Hellwig , Linux regressions mailing list , Sergei Trofimovich , Josef Bacik , Christopher Price , anand.jain@oracle.com, clm@fb.com, dsterba@suse.com, linux-btrfs@vger.kernel.org References: <20230323222606.20d10de2@nz> <20d85dc4-b6c2-dac1-fdc6-94e44b43692a@leemhuis.info> <41141706-2685-1b32-8624-c895a3b219ea@meta.com> <20230404185138.GB344341@zen> <6a54fa77-9a0c-8844-2eb0-b65591e97a16@gmail.com> <20230411175239.GA19619@twin.jikos.cz> From: "Linux regression tracking (Thorsten Leemhuis)" Reply-To: Linux regressions mailing list In-Reply-To: <20230411175239.GA19619@twin.jikos.cz> Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: 7bit X-bounce-key: webpack.hosteurope.de;regressions@leemhuis.info;1681236923;51142a00; X-HE-SMSGID: 1pmIWS-0008I4-H9 On 11.04.23 19:52, David Sterba wrote: > On Mon, Apr 10, 2023 at 03:03:37AM +0100, Michael Bromilow wrote: >> On 04/04/2023 19:51, Boris Burkov wrote: >>> On Tue, Apr 04, 2023 at 02:15:38PM -0400, Chris Mason wrote: >>>> So, honestly from my POV the async discard is best suited to consumer >>>> devices. Our defaults are probably wrong because no matter what you >>>> choose there's a drive out there that makes it look bad. Also, laptops >>>> probably don't want the slow dribble. >>> >>> Our reasonable options, as I see them: >>> - back to nodiscard, rely on periodic trims from the OS. >>> - leave low iops_limit, drives stay busy unexpectedly long, conclude that >>> that's OK, and communicate the tuning/measurement options better. >>> - set a high iops_limit (e.g. 1000) drives will get back to idle faster. >>> - change an unset iops_limit to mean truly unlimited async discard, set >>> that as the default, and anyone who cares to meter it can set an >>> iops_limit. >>> >>> The regression here is in drive idle time due to modest discard getting >>> metered out over minutes rather than dealt with relatively quickly. So >>> I would favor the unlimited async discard mode and will send a patch to >>> that effect which we can discuss. >> >> Will power usage be taken into consideration? I only noticed this regression >> myself when my laptop started to draw a constant extra ~10W from the constant >> drive activity, so I imagine other laptop users would also prefer a default >> which avoids this if possible. If "relatively quickly" means anything more >> than a few seconds I could see that causing rather significant battery life >> reductions. > > This may need to be configured from userspace, e.g. from > laptop-mode-tools, I'm not sure if any of the power usage info is > available from kernel. Well, from the point of the regression tracker this looks a bit different: if a system after a kernel updates draw a constant extra ~10W, it's a regression that needs to be fixed. But in the end it's of course Linus option that matters. But whatever, Boris afaics is looking for an improved solution that hopefully avoids the extra power consumption and at the same time does the right thing on modern enterprise storage devices, so that the defaults work for the majority of the users. Hopefully this will work out. :-D Ciao, Thorsten