From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on aws-us-west-2-korg-lkml-1.web.codeaurora.org Received: from vger.kernel.org (vger.kernel.org [23.128.96.18]) by smtp.lore.kernel.org (Postfix) with ESMTP id 5671DC43334 for ; Mon, 27 Jun 2022 16:25:38 +0000 (UTC) Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S236240AbiF0QZf (ORCPT ); Mon, 27 Jun 2022 12:25:35 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:56048 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S234972AbiF0QZd (ORCPT ); Mon, 27 Jun 2022 12:25:33 -0400 Received: from mail-yw1-x112c.google.com (mail-yw1-x112c.google.com [IPv6:2607:f8b0:4864:20::112c]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 16D35E00F for ; Mon, 27 Jun 2022 09:25:32 -0700 (PDT) Received: by mail-yw1-x112c.google.com with SMTP id 00721157ae682-31780ad7535so90815507b3.8 for ; Mon, 27 Jun 2022 09:25:32 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=20210112; h=mime-version:references:in-reply-to:from:date:message-id:subject:to :cc; bh=iAKbjN/cfvJPKH/ugt3ybbCNo8kM3ZVPMiqHDruEifg=; b=RbOhXfsOqEDMRENO38mrpSrXxxDYigIIL0iGHtE8xCdU+DUoAXKdt3N63SVhhS9bj+ vE2UFnU2Zf22D0Ic4nq6JinMhf0Rjku6ueYypwbmUvx4P4JuaSisipOX5K8x7shK1He1 Sk/Gw+rOyqohify/KsKnjlLAviBaC9Vo8OtebbrJ5fNeCpt6gqGuma5db02XQdqbiN/9 1Ag08g5+15n3bnIIqqmQnHSAO7DPvRcyWTP8d5sJSPb1SSC2xw97478ugS1gXgiYTaO6 gI/ywCGguebiSzo0OFIw5YW2u7W3NCNfDYEl+SNNPS0N8N6VKJwV+q1Cupj+aXeASM/M Lang== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20210112; h=x-gm-message-state:mime-version:references:in-reply-to:from:date :message-id:subject:to:cc; bh=iAKbjN/cfvJPKH/ugt3ybbCNo8kM3ZVPMiqHDruEifg=; b=uDmhgZT7sEMwD0vdEwdMT4ckVbR6eLl2Ofc7pQ/Cq/grUs9oYyMEIOjtYMwcGdkm+N 0YwcP/OJaPH7HH/R6HMv3GaK+d2CzU7m/t27axLPU7djkiPmxyb1QwKCZ+VvrHvQsaPy vXOsVJadQNLVU0Rc2JUNZXTb4H5d/O+m1xmPoNaQfLw+mQfWqQAFTfnzCAGPxfO1o+lA GmfBie/QA9O0dA5BIoz4yIobNympBu9mvKLTeD4s/MH078L8MfJ/DCH/svM5Xp9z17o7 dZKvlo+fs9U4QE0oUjSCNbKtJ42DDQy8IC0E7Z/H4vk/2NzT9uJSrqogO+g7T/2P4SAF lYqA== X-Gm-Message-State: AJIora8cascnEbQ/9qrqnDM4ihYmWe8stBkAbZErsVMV4IbhxqJNURbU Vih1eLWF+BD0vRP0AP+RRZvuUqSXKw635Ud6PO5ouQ== X-Google-Smtp-Source: AGRyM1vH55ursM9GjUkBbRtmBX4QDoHDlEEpNJpUs5A6vX12LOpEiXhq4y8+47q5BzoyFRCpabSxvoZUSxVB0BpYoPY= X-Received: by 2002:a81:bd51:0:b0:31b:db72:88a1 with SMTP id n17-20020a81bd51000000b0031bdb7288a1mr3087923ywk.208.1656347131154; Mon, 27 Jun 2022 09:25:31 -0700 (PDT) MIME-Version: 1.0 References: <20220623185730.25b88096@kernel.org> <20220624070656.GE79500@shbuild999.sh.intel.com> <20220624144358.lqt2ffjdry6p5u4d@google.com> <20220625023642.GA40868@shbuild999.sh.intel.com> <20220627023812.GA29314@shbuild999.sh.intel.com> <20220627123415.GA32052@shbuild999.sh.intel.com> <20220627151258.GB20878@shbuild999.sh.intel.com> In-Reply-To: <20220627151258.GB20878@shbuild999.sh.intel.com> From: Shakeel Butt Date: Mon, 27 Jun 2022 09:25:20 -0700 Message-ID: Subject: Re: [net] 4890b686f4: netperf.Throughput_Mbps -69.4% regression To: Feng Tang Cc: Eric Dumazet , Linux MM , Andrew Morton , Roman Gushchin , Michal Hocko , Johannes Weiner , Muchun Song , Jakub Kicinski , Xin Long , Marcelo Ricardo Leitner , kernel test robot , Soheil Hassas Yeganeh , LKML , network dev , linux-s390@vger.kernel.org, MPTCP Upstream , "linux-sctp @ vger . kernel . org" , lkp@lists.01.org, kbuild test robot , Huang Ying , Xing Zhengjun , Yin Fengwei , Ying Xu Content-Type: text/plain; charset="UTF-8" Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Mon, Jun 27, 2022 at 8:25 AM Feng Tang wrote: > > On Mon, Jun 27, 2022 at 07:52:55AM -0700, Shakeel Butt wrote: > > On Mon, Jun 27, 2022 at 5:34 AM Feng Tang wrote: > > > Yes, 1% is just around noise level for a microbenchmark. > > > > > > I went check the original test data of Oliver's report, the tests was > > > run 6 rounds and the performance data is pretty stable (0Day's report > > > will show any std deviation bigger than 2%) > > > > > > The test platform is a 4 sockets 72C/144T machine, and I run the > > > same job (nr_tasks = 25% * nr_cpus) on one CascadeLake AP (4 nodes) > > > and one Icelake 2 sockets platform, and saw 75% and 53% regresson on > > > them. > > > > > > In the first email, there is a file named 'reproduce', it shows the > > > basic test process: > > > > > > " > > > use 'performane' cpufre governor for all CPUs > > > > > > netserver -4 -D > > > modprobe sctp > > > netperf -4 -H 127.0.0.1 -t SCTP_STREAM_MANY -c -C -l 300 -- -m 10K & > > > netperf -4 -H 127.0.0.1 -t SCTP_STREAM_MANY -c -C -l 300 -- -m 10K & > > > netperf -4 -H 127.0.0.1 -t SCTP_STREAM_MANY -c -C -l 300 -- -m 10K & > > > (repeat 36 times in total) > > > ... > > > > > > " > > > > > > Which starts 36 (25% of nr_cpus) netperf clients. And the clients number > > > also matters, I tried to increase the client number from 36 to 72(50%), > > > and the regression is changed from 69.4% to 73.7% > > > > > > > Am I understanding correctly that this 69.4% (or 73.7%) regression is > > with cgroup v2? > > Yes. > > > Eric did the experiments on v2 but on real hardware where the > > performance impact was negligible. > > > > BTW do you see similar regression for tcp as well or just sctp? > > Yes, I run TCP_SENDFILE case with 'send_size'==10K, it hits a > 70%+ regressioin. > Thanks Feng. I think we should start with squeezing whatever we can from layout changes and then try other approaches like increasing batch size or something else. I can take a stab at this next week. thanks, Shakeel