From mboxrd@z Thu Jan  1 00:00:00 1970
Received: from eggs.gnu.org ([2001:4830:134:3::10]:42195)
	by lists.gnu.org with esmtp (Exim 4.71)
	(envelope-from <greg.bellows@linaro.org>) id 1YHCRt-0004BD-VA
	for qemu-devel@nongnu.org; Fri, 30 Jan 2015 09:21:34 -0500
Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71)
	(envelope-from <greg.bellows@linaro.org>) id 1YHCRo-0007uN-0r
	for qemu-devel@nongnu.org; Fri, 30 Jan 2015 09:21:29 -0500
Received: from mail-qa0-f43.google.com ([209.85.216.43]:36847)
	by eggs.gnu.org with esmtp (Exim 4.71)
	(envelope-from <greg.bellows@linaro.org>) id 1YHCRn-0007uD-SA
	for qemu-devel@nongnu.org; Fri, 30 Jan 2015 09:21:23 -0500
Received: by mail-qa0-f43.google.com with SMTP id v10so20032912qac.2
	for <qemu-devel@nongnu.org>; Fri, 30 Jan 2015 06:21:23 -0800 (PST)
MIME-Version: 1.0
In-Reply-To: <CABoDooOEGKuB1+iVHGxpWuV7gSjd2ge5LW022vPHGqVApG1gqA@mail.gmail.com>
References: <1422559909-19377-1-git-send-email-peter.maydell@linaro.org>
	<CABoDooOEGKuB1+iVHGxpWuV7gSjd2ge5LW022vPHGqVApG1gqA@mail.gmail.com>
Date: Fri, 30 Jan 2015 08:21:23 -0600
Message-ID: <CAOgzsHXu9K+eCdXPv_kzaj=49UhTsGdCCg9uB1jTdjopJjnY+Q@mail.gmail.com>
From: Greg Bellows <greg.bellows@linaro.org>
Content-Type: multipart/alternative; boundary=001a11c12eeab33243050ddf5178
Subject: Re: [Qemu-devel] [PATCH] target-arm: Squash input denormals in
 FRECPS and FRSQRTS
List-Id: <qemu-devel.nongnu.org>
List-Unsubscribe: <https://lists.nongnu.org/mailman/options/qemu-devel>,
	<mailto:qemu-devel-request@nongnu.org?subject=unsubscribe>
List-Archive: <http://lists.nongnu.org/archive/html/qemu-devel>
List-Post: <mailto:qemu-devel@nongnu.org>
List-Help: <mailto:qemu-devel-request@nongnu.org?subject=help>
List-Subscribe: <https://lists.nongnu.org/mailman/listinfo/qemu-devel>,
	<mailto:qemu-devel-request@nongnu.org?subject=subscribe>
To: Laurent Desnogues <laurent.desnogues@gmail.com>
Cc: Peter Maydell <peter.maydell@linaro.org>, Patch Tracking <patches@linaro.org>, =?UTF-8?B?QWxleCBCZW5uw6ll?= <alex.bennee@linaro.org>, "qemu-devel@nongnu.org" <qemu-devel@nongnu.org>, Xiangyu Hu <libhu.so@gmail.com>

--001a11c12eeab33243050ddf5178
Content-Type: text/plain; charset=UTF-8
Content-Transfer-Encoding: quoted-printable

On Fri, Jan 30, 2015 at 3:41 AM, Laurent Desnogues <
laurent.desnogues@gmail.com> wrote:

> On Thu, Jan 29, 2015 at 8:31 PM, Peter Maydell <peter.maydell@linaro.org>
> wrote:
> > The helper functions for FRECPS and FRSQRTS have special case
> > handling that includes checks for zero inputs, so squash input
> > denormals if necessary before those checks. This fixes incorrect
> > output when the FPCR DZ bit is set to enable squashing of input
> > denormals.
> >
> > Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
>
> Tested-by: Laurent Desnogues <laurent.desnogues@gmail.com>
>
> Thanks,
>
> Laurent
>
> > ---
> > A quick eyeball of helper-a64.c suggests that these are the only
> > other insns we needed to fix, and a risu test of these insns
> > confirms that (a) they're buggy and (b) this patch fixes them.
> > I haven't done an exhaustive coverage test of the whole instruction
> > set with the DZ bit set, though...
> >
> >  target-arm/helper-a64.c | 12 ++++++++++++
> >  1 file changed, 12 insertions(+)
> >
> > diff --git a/target-arm/helper-a64.c b/target-arm/helper-a64.c
> > index ebd9247..8aa40e9 100644
> > --- a/target-arm/helper-a64.c
> > +++ b/target-arm/helper-a64.c
> > @@ -229,6 +229,9 @@ float32 HELPER(recpsf_f32)(float32 a, float32 b,
> void *fpstp)
> >  {
> >      float_status *fpst =3D fpstp;
> >
> > +    a =3D float32_squash_input_denormal(a, fpst);
> > +    b =3D float32_squash_input_denormal(b, fpst);
> > +
> >      a =3D float32_chs(a);
> >      if ((float32_is_infinity(a) && float32_is_zero(b)) ||
> >          (float32_is_infinity(b) && float32_is_zero(a))) {
> > @@ -241,6 +244,9 @@ float64 HELPER(recpsf_f64)(float64 a, float64 b,
> void *fpstp)
> >  {
> >      float_status *fpst =3D fpstp;
> >
> > +    a =3D float64_squash_input_denormal(a, fpst);
> > +    b =3D float64_squash_input_denormal(b, fpst);
> > +
> >      a =3D float64_chs(a);
> >      if ((float64_is_infinity(a) && float64_is_zero(b)) ||
> >          (float64_is_infinity(b) && float64_is_zero(a))) {
> > @@ -253,6 +259,9 @@ float32 HELPER(rsqrtsf_f32)(float32 a, float32 b,
> void *fpstp)
> >  {
> >      float_status *fpst =3D fpstp;
> >
> > +    a =3D float32_squash_input_denormal(a, fpst);
> > +    b =3D float32_squash_input_denormal(b, fpst);
> > +
> >      a =3D float32_chs(a);
> >      if ((float32_is_infinity(a) && float32_is_zero(b)) ||
> >          (float32_is_infinity(b) && float32_is_zero(a))) {
> > @@ -265,6 +274,9 @@ float64 HELPER(rsqrtsf_f64)(float64 a, float64 b,
> void *fpstp)
> >  {
> >      float_status *fpst =3D fpstp;
> >
> > +    a =3D float64_squash_input_denormal(a, fpst);
> > +    b =3D float64_squash_input_denormal(b, fpst);
> > +
> >      a =3D float64_chs(a);
> >      if ((float64_is_infinity(a) && float64_is_zero(b)) ||
> >          (float64_is_infinity(b) && float64_is_zero(a))) {
> > --
> > 1.9.1
> >
> >
>
>
=E2=80=8BReviewed-by: Greg Bellows <greg.bellows@linaro.org>=E2=80=8B

--001a11c12eeab33243050ddf5178
Content-Type: text/html; charset=UTF-8
Content-Transfer-Encoding: quoted-printable

<div dir=3D"ltr"><div class=3D"gmail_default" style=3D"font-family:arial,he=
lvetica,sans-serif"><br></div><div class=3D"gmail_extra"><br><div class=3D"=
gmail_quote">On Fri, Jan 30, 2015 at 3:41 AM, Laurent Desnogues <span dir=
=3D"ltr">&lt;<a href=3D"mailto:laurent.desnogues@gmail.com" target=3D"_blan=
k">laurent.desnogues@gmail.com</a>&gt;</span> wrote:<br><blockquote class=
=3D"gmail_quote" style=3D"margin:0 0 0 .8ex;border-left:1px #ccc solid;padd=
ing-left:1ex"><span class=3D"">On Thu, Jan 29, 2015 at 8:31 PM, Peter Mayde=
ll &lt;<a href=3D"mailto:peter.maydell@linaro.org">peter.maydell@linaro.org=
</a>&gt; wrote:<br>
&gt; The helper functions for FRECPS and FRSQRTS have special case<br>
&gt; handling that includes checks for zero inputs, so squash input<br>
&gt; denormals if necessary before those checks. This fixes incorrect<br>
&gt; output when the FPCR DZ bit is set to enable squashing of input<br>
&gt; denormals.<br>
&gt;<br>
&gt; Signed-off-by: Peter Maydell &lt;<a href=3D"mailto:peter.maydell@linar=
o.org">peter.maydell@linaro.org</a>&gt;<br>
<br>
</span>Tested-by: Laurent Desnogues &lt;<a href=3D"mailto:laurent.desnogues=
@gmail.com">laurent.desnogues@gmail.com</a>&gt;<br>
<br>
Thanks,<br>
<br>
Laurent<br>
<div class=3D"HOEnZb"><div class=3D"h5"><br>
&gt; ---<br>
&gt; A quick eyeball of helper-a64.c suggests that these are the only<br>
&gt; other insns we needed to fix, and a risu test of these insns<br>
&gt; confirms that (a) they&#39;re buggy and (b) this patch fixes them.<br>
&gt; I haven&#39;t done an exhaustive coverage test of the whole instructio=
n<br>
&gt; set with the DZ bit set, though...<br>
&gt;<br>
&gt;=C2=A0 target-arm/helper-a64.c | 12 ++++++++++++<br>
&gt;=C2=A0 1 file changed, 12 insertions(+)<br>
&gt;<br>
&gt; diff --git a/target-arm/helper-a64.c b/target-arm/helper-a64.c<br>
&gt; index ebd9247..8aa40e9 100644<br>
&gt; --- a/target-arm/helper-a64.c<br>
&gt; +++ b/target-arm/helper-a64.c<br>
&gt; @@ -229,6 +229,9 @@ float32 HELPER(recpsf_f32)(float32 a, float32 b, v=
oid *fpstp)<br>
&gt;=C2=A0 {<br>
&gt;=C2=A0 =C2=A0 =C2=A0 float_status *fpst =3D fpstp;<br>
&gt;<br>
&gt; +=C2=A0 =C2=A0 a =3D float32_squash_input_denormal(a, fpst);<br>
&gt; +=C2=A0 =C2=A0 b =3D float32_squash_input_denormal(b, fpst);<br>
&gt; +<br>
&gt;=C2=A0 =C2=A0 =C2=A0 a =3D float32_chs(a);<br>
&gt;=C2=A0 =C2=A0 =C2=A0 if ((float32_is_infinity(a) &amp;&amp; float32_is_=
zero(b)) ||<br>
&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 (float32_is_infinity(b) &amp;&amp; f=
loat32_is_zero(a))) {<br>
&gt; @@ -241,6 +244,9 @@ float64 HELPER(recpsf_f64)(float64 a, float64 b, v=
oid *fpstp)<br>
&gt;=C2=A0 {<br>
&gt;=C2=A0 =C2=A0 =C2=A0 float_status *fpst =3D fpstp;<br>
&gt;<br>
&gt; +=C2=A0 =C2=A0 a =3D float64_squash_input_denormal(a, fpst);<br>
&gt; +=C2=A0 =C2=A0 b =3D float64_squash_input_denormal(b, fpst);<br>
&gt; +<br>
&gt;=C2=A0 =C2=A0 =C2=A0 a =3D float64_chs(a);<br>
&gt;=C2=A0 =C2=A0 =C2=A0 if ((float64_is_infinity(a) &amp;&amp; float64_is_=
zero(b)) ||<br>
&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 (float64_is_infinity(b) &amp;&amp; f=
loat64_is_zero(a))) {<br>
&gt; @@ -253,6 +259,9 @@ float32 HELPER(rsqrtsf_f32)(float32 a, float32 b, =
void *fpstp)<br>
&gt;=C2=A0 {<br>
&gt;=C2=A0 =C2=A0 =C2=A0 float_status *fpst =3D fpstp;<br>
&gt;<br>
&gt; +=C2=A0 =C2=A0 a =3D float32_squash_input_denormal(a, fpst);<br>
&gt; +=C2=A0 =C2=A0 b =3D float32_squash_input_denormal(b, fpst);<br>
&gt; +<br>
&gt;=C2=A0 =C2=A0 =C2=A0 a =3D float32_chs(a);<br>
&gt;=C2=A0 =C2=A0 =C2=A0 if ((float32_is_infinity(a) &amp;&amp; float32_is_=
zero(b)) ||<br>
&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 (float32_is_infinity(b) &amp;&amp; f=
loat32_is_zero(a))) {<br>
&gt; @@ -265,6 +274,9 @@ float64 HELPER(rsqrtsf_f64)(float64 a, float64 b, =
void *fpstp)<br>
&gt;=C2=A0 {<br>
&gt;=C2=A0 =C2=A0 =C2=A0 float_status *fpst =3D fpstp;<br>
&gt;<br>
&gt; +=C2=A0 =C2=A0 a =3D float64_squash_input_denormal(a, fpst);<br>
&gt; +=C2=A0 =C2=A0 b =3D float64_squash_input_denormal(b, fpst);<br>
&gt; +<br>
&gt;=C2=A0 =C2=A0 =C2=A0 a =3D float64_chs(a);<br>
&gt;=C2=A0 =C2=A0 =C2=A0 if ((float64_is_infinity(a) &amp;&amp; float64_is_=
zero(b)) ||<br>
&gt;=C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 (float64_is_infinity(b) &amp;&amp; f=
loat64_is_zero(a))) {<br>
&gt; --<br>
&gt; 1.9.1<br>
&gt;<br>
&gt;<br>
<br>
</div></div></blockquote></div><br></div><div class=3D"gmail_extra"><div cl=
ass=3D"gmail_default" style=3D"font-family:arial,helvetica,sans-serif">=E2=
=80=8BReviewed-by: Greg Bellows &lt;<a href=3D"mailto:greg.bellows@linaro.o=
rg">greg.bellows@linaro.org</a>&gt;=E2=80=8B</div><br></div></div>

--001a11c12eeab33243050ddf5178--