From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on aws-us-west-2-korg-lkml-1.web.codeaurora.org X-Spam-Level: X-Spam-Status: No, score=-6.0 required=3.0 tests=BAYES_00,DKIMWL_WL_HIGH, DKIM_SIGNED,DKIM_VALID,DKIM_VALID_AU,MAILING_LIST_MULTI,SPF_HELO_NONE, SPF_PASS autolearn=no autolearn_force=no version=3.4.0 Received: from mail.kernel.org (mail.kernel.org [198.145.29.99]) by smtp.lore.kernel.org (Postfix) with ESMTP id 1D055C433B4 for ; Fri, 16 Apr 2021 22:14:40 +0000 (UTC) Received: from vger.kernel.org (vger.kernel.org [23.128.96.18]) by mail.kernel.org (Postfix) with ESMTP id E44C2613CD for ; Fri, 16 Apr 2021 22:14:39 +0000 (UTC) Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S235703AbhDPWPE (ORCPT ); Fri, 16 Apr 2021 18:15:04 -0400 Received: from mail.kernel.org ([198.145.29.99]:45138 "EHLO mail.kernel.org" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S235192AbhDPWPD (ORCPT ); Fri, 16 Apr 2021 18:15:03 -0400 Received: by mail.kernel.org (Postfix) with ESMTPSA id CB4AE613D2 for ; Fri, 16 Apr 2021 22:14:37 +0000 (UTC) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=kernel.org; s=k20201202; t=1618611277; bh=YAW6WRjJB1b/E7DTA4ByXcdOWn3VcOBIWw3+GxAvbQc=; h=References:In-Reply-To:From:Date:Subject:To:Cc:From; b=aJBN/Wk7BwKQoiVOxWKTh/K9RVR0H0dodvRB4YL8hknd2xTOYQ91NIvnra67OQVfg +j3BTm+WoDPjekRNVeYXwPwd0V7NhPYFuazThT8H1bE32Yppusm83My1H7i9bSpW/l O6XXqEmmqpxdDb4HZ+ohMHyntX8LDCu3seIRrGMBl+0HTgKNk7klSFhqyAUjfbaEG4 mlA/bjpoFpQ7DEDw3qvw+fssBrAab7ygjP12HdbJXTYn1+NiKLoqrrePlQD+wuekUb K6sHrzYFF1Xkb1P28CeNwyZ8RjIfnDFPpCd8v7zW8TGy74cvg7kQlOS/ge+/RpdEAG k4qorR0iC2eLw== Received: by mail-ed1-f46.google.com with SMTP id i3so8322851edt.1 for ; Fri, 16 Apr 2021 15:14:37 -0700 (PDT) X-Gm-Message-State: AOAM531nMOvXYCg0KwqRPOwSP6cu9bm9lve/2Q9n9FV/CzviWC/v9Fg2 Zjih7mSifFfsYob9ZwS2C75/OMdCEWyfqx0E/y8V/Q== X-Google-Smtp-Source: ABdhPJw2QIR+p11NKImTuFQOteRkZF+prwoeWnHRZjm/v8Q+9ztKDMxAW4YT6fYEWJhDSYhkEXfz8jXZHa3dPkqAvMo= X-Received: by 2002:aa7:d7d1:: with SMTP id e17mr12795520eds.84.1618611276328; Fri, 16 Apr 2021 15:14:36 -0700 (PDT) MIME-Version: 1.0 References: <87lf9nk2ku.fsf@oldenburg.str.redhat.com> In-Reply-To: From: Andy Lutomirski Date: Fri, 16 Apr 2021 15:14:24 -0700 X-Gmail-Original-Message-ID: Message-ID: Subject: Re: Candidate Linux ABI for Intel AMX and hypothetical new related features To: Len Brown Cc: Andy Lutomirski , Willy Tarreau , Florian Weimer , "Bae, Chang Seok" , Dave Hansen , X86 ML , LKML , linux-abi@vger.kernel.org, "libc-alpha@sourceware.org" , Rich Felker , Kyle Huey , Keno Fischer Content-Type: text/plain; charset="UTF-8" Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Fri, Apr 16, 2021 at 3:11 PM Len Brown wrote: > > > I get it. That does not explain why LDMXCSR and VLDMXCSR cause > > pipelines stalls. > > Sorry, I thought this thread was about AMX. > I don't know the answer to your LDMXCSR and VLDMXCSR question. My point is that every single major math extension since the original XMM extensions (SSE, etc) has come with performance gotchas. Given Intel's general unwillingness to document the gotchas in hardware that is actually shipping, I'm sceptical that AMX is as delightfully gotcha-free as you are making it out to be. Is there any authoritative guidance at all on what actually happens, performance-wise, when someone does AMX math?