From mboxrd@z Thu Jan 1 00:00:00 1970 Received: from eggs.gnu.org ([2001:4830:134:3::10]:54088) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1c7Sq4-0003CB-0X for qemu-devel@nongnu.org; Thu, 17 Nov 2016 14:59:16 -0500 Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1c7Spz-0007gb-6C for qemu-devel@nongnu.org; Thu, 17 Nov 2016 14:59:16 -0500 Received: from mail-wm0-x241.google.com ([2a00:1450:400c:c09::241]:36692) by eggs.gnu.org with esmtps (TLS1.0:RSA_AES_128_CBC_SHA1:16) (Exim 4.71) (envelope-from ) id 1c7Spy-0007gR-Ua for qemu-devel@nongnu.org; Thu, 17 Nov 2016 14:59:11 -0500 Received: by mail-wm0-x241.google.com with SMTP id m203so24463162wma.3 for ; Thu, 17 Nov 2016 11:59:10 -0800 (PST) Sender: Richard Henderson References: <1479324335-2074-1-git-send-email-rth@twiddle.net> <1479324335-2074-17-git-send-email-rth@twiddle.net> <1e2e6eab-921a-f30b-93d4-66bd62f68dd1@mail.uni-paderborn.de> From: Richard Henderson Message-ID: Date: Thu, 17 Nov 2016 20:59:06 +0100 MIME-Version: 1.0 In-Reply-To: Content-Type: text/plain; charset=utf-8; format=flowed Content-Transfer-Encoding: 7bit Subject: Re: [Qemu-devel] [PATCH 16/25] tcg/i386: Handle ctz and clz opcodes List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , To: Bastian Koppelmann , qemu-devel@nongnu.org On 11/17/2016 08:53 PM, Richard Henderson wrote: > On 11/17/2016 05:50 PM, Bastian Koppelmann wrote: >> On 11/16/2016 08:25 PM, Richard Henderson wrote: >>> + >>> + OP_32_64(clz): >>> + if (const_args[2]) { >>> + tcg_debug_assert(have_bmi1); >>> + tcg_debug_assert(args[2] == (rexw ? 64 : 32)); >>> + tcg_out_modrm(s, OPC_LZCNT + rexw, args[0], args[1]); >>> + } else { >>> + /* ??? See above. */ >>> + tcg_out_modrm(s, OPC_BSR + rexw, args[0], args[1]); >> >> The Intel ISA manual states that it find the bit index of the most >> significant bit, where the least significant bit is index 0. So for the >> input 0x2 this should return 1. However this is not the number of >> leading zeros. > > Oh, of course you're right. I thought I was testing this, but while alpha does > have this operation, it turns out it isn't used much. Alternately, what I tested was on a haswell machine, which takes the LZCNT path, which *does* produce the intended results. Just the BSR path doesn't. r~