From mboxrd@z Thu Jan 1 00:00:00 1970 From: Jesper Dangaard Brouer Subject: Re: [net-next PATCH 1/1 V4] qdisc: bulk dequeue support for qdiscs with TCQ_F_ONETXQUEUE Date: Thu, 25 Sep 2014 17:23:29 +0200 Message-ID: <20140925172329.7460f787@redhat.com> References: <20140924160932.9721.56450.stgit@localhost> <20140924161047.9721.43080.stgit@localhost> <1411579395.15395.41.camel@edumazet-glaptop2.roam.corp.google.com> <20140924195831.6fb91051@redhat.com> <54234225.5000503@mojatatu.com> <20140925102505.494acab1@redhat.com> <54240F34.1050707@mojatatu.com> <20140925165738.646d0783@redhat.com> Mime-Version: 1.0 Content-Type: text/plain; charset=US-ASCII Content-Transfer-Encoding: 7bit Cc: Jamal Hadi Salim , Eric Dumazet , Linux Netdev List , "David S. Miller" , Alexander Duyck , Toke =?UTF-8?B?SMO4aWxhbmQtSsO4cmdlbnNl?= =?UTF-8?B?bg==?= , Florian Westphal , Dave Taht , John Fastabend , Daniel Borkmann , Hannes Frederic Sowa , brouer@redhat.com To: Tom Herbert Return-path: Received: from mx1.redhat.com ([209.132.183.28]:45235 "EHLO mx1.redhat.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1753728AbaIYPYZ (ORCPT ); Thu, 25 Sep 2014 11:24:25 -0400 In-Reply-To: Sender: netdev-owner@vger.kernel.org List-ID: On Thu, 25 Sep 2014 08:05:38 -0700 Tom Herbert wrote: > On Thu, Sep 25, 2014 at 7:57 AM, Jesper Dangaard Brouer > wrote: > > On Thu, 25 Sep 2014 07:40:33 -0700 > > Tom Herbert wrote: > > > >> A few test results in patch 0 are good. I like to have results for > >> with and without patch. These should two things: 1) Any regressions > >> caused by the patch 2) Performance gains (in that order of importance > >> :-) ). There doesn't need to be a lot here, just something reasonably > >> representative, simple, and should be easily reproducible. My > >> expectation in bulk dequeue is that we should see no obvious > >> regression and hopefully an improvement in CPU utilization-- are you > >> able to verify this? > > > > We are saving 3% CPU, as I described in my post with subject: > > "qdisc/UDP_STREAM: measuring effect of qdisc bulk dequeue": > > http://thread.gmane.org/gmane.linux.network/331152/focus=331154 > > > > Using UDP_STREAM on 1Gbit/s driver igb, I can show that the > > _raw_spin_lock calls are reduced with approx 3%, when enabling > > bulking of just 2 packets. > > > > That's great. In commit log, would be good to have results with > TCP_STREAM also and please report aggregate CPU utilization changes > (like from mpstat). The TCP_STREAM is not a good test for this, because unless disabling both TSO and GSO the packets will not hit the code path (that this patch changes). When we later add support for TSO and GSO bulking, then it will make sense to include TCP_STREAM testing, not before. I will redo the tests, once I get home to my testlab, as the remote lab I'm using now is annoyingly slow rebooting machines, as we not longer have a runtime option for enable/disable (I'm currently in Switzerland). -- Best regards, Jesper Dangaard Brouer MSc.CS, Sr. Network Kernel Developer at Red Hat Author of http://www.iptv-analyzer.org LinkedIn: http://www.linkedin.com/in/brouer