From mboxrd@z Thu Jan 1 00:00:00 1970 Received: from eggs.gnu.org ([2001:4830:134:3::10]:49065) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1ZDTiu-0003Ij-O8 for qemu-devel@nongnu.org; Fri, 10 Jul 2015 04:31:57 -0400 Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1ZDTiq-000425-Lm for qemu-devel@nongnu.org; Fri, 10 Jul 2015 04:31:56 -0400 Received: from greensocs.com ([193.104.36.180]:54225) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1ZDTiq-0003zz-DE for qemu-devel@nongnu.org; Fri, 10 Jul 2015 04:31:52 -0400 Content-Type: text/plain; charset=utf-8 Mime-Version: 1.0 (Mac OS X Mail 8.2 \(2102\)) From: Mark Burton In-Reply-To: <1436516626-8322-1-git-send-email-a.rigo@virtualopensystems.com> Date: Fri, 10 Jul 2015 10:31:46 +0200 Content-Transfer-Encoding: quoted-printable Message-Id: References: <1436516626-8322-1-git-send-email-a.rigo@virtualopensystems.com> Subject: Re: [Qemu-devel] [RFC v3 00/13] Slow-path for atomic instruction translation List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , To: Alvise Rigo Cc: mttcg@listserver.greensocs.com, claudio.fontana@huawei.com, qemu-devel@nongnu.org, pbonzini@redhat.com, jani.kokkonen@huawei.com, tech@virtualopensystems.com, alex.bennee@linaro.org To be clear, for a normal user (e.g. they boot linux, they run some = apps, etc)..., if they use only one core, is it true that they will see = no difference in performance? For a =E2=80=98normal user=E2=80=99 who does use multi-core, are you = saying that a typical boot is slower? Cheers Mark. > On 10 Jul 2015, at 10:23, Alvise Rigo = wrote: >=20 > * Performance considerations > This implementation shows good results while booting a Linux kernel, > where tons of flushes affect the overall performance. A complete ARM > Linux boot, without any filesystem, requires 30% longer if compared to > the mttcg implementation, benefiting however of being capable to offer > the infrastructure to handle atomic instructions on any architecture. > Instead compared to the current TCG upstream, it is 40% faster with = four > vCPUs and 2.1 times faster with 8 vCPUs. > In addition, there is still margin to improve such performance, since = at > the moment TLB is flushed quite often, probably more than the = required. +44 (0)20 7100 3485 x 210 +33 (0)5 33 52 01 77x 210 +33 (0)603762104 mark.burton