From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on aws-us-west-2-korg-lkml-1.web.codeaurora.org X-Spam-Level: X-Spam-Status: No, score=-1.0 required=3.0 tests=HEADER_FROM_DIFFERENT_DOMAINS, MAILING_LIST_MULTI,SPF_PASS,URIBL_BLOCKED autolearn=ham autolearn_force=no version=3.4.0 Received: from mail.kernel.org (mail.kernel.org [198.145.29.99]) by smtp.lore.kernel.org (Postfix) with ESMTP id 55D94C4321D for ; Wed, 22 Aug 2018 14:30:34 +0000 (UTC) Received: from vger.kernel.org (vger.kernel.org [209.132.180.67]) by mail.kernel.org (Postfix) with ESMTP id D15DC2147C for ; Wed, 22 Aug 2018 14:30:33 +0000 (UTC) DMARC-Filter: OpenDMARC Filter v1.3.2 mail.kernel.org D15DC2147C Authentication-Results: mail.kernel.org; dmarc=none (p=none dis=none) header.from=bootlin.com Authentication-Results: mail.kernel.org; spf=none smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1729014AbeHVRzj (ORCPT ); Wed, 22 Aug 2018 13:55:39 -0400 Received: from mail.bootlin.com ([62.4.15.54]:43002 "EHLO mail.bootlin.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1728517AbeHVRzj (ORCPT ); Wed, 22 Aug 2018 13:55:39 -0400 Received: by mail.bootlin.com (Postfix, from userid 110) id 88CC420720; Wed, 22 Aug 2018 16:30:28 +0200 (CEST) Received: from aptenodytes (AAubervilliers-681-1-85-9.w90-88.abo.wanadoo.fr [90.88.27.9]) by mail.bootlin.com (Postfix) with ESMTPSA id 1ED37203EC; Wed, 22 Aug 2018 16:30:28 +0200 (CEST) Message-ID: Subject: Re: [PATCH 1/9] CHROMIUM: v4l: Add H264 low-level decoder API compound controls. From: Paul Kocialkowski To: Tomasz Figa Cc: Maxime Ripard , Ezequiel Garcia , Pawel Osciak , Hans Verkuil , Alexandre Courbot , Sakari Ailus , Laurent Pinchart , Chen-Yu Tsai , Linux Kernel Mailing List , "list@263.net:IOMMU DRIVERS , Joerg " "Roedel ," , Linux Media Mailing List , Nicolas Dufresne , jenskuske@gmail.com, linux-sunxi@googlegroups.com, thomas.petazzoni@bootlin.com, groeck@chromium.org Date: Wed, 22 Aug 2018 16:30:28 +0200 In-Reply-To: References: <20180613140714.1686-1-maxime.ripard@bootlin.com> <20180613140714.1686-2-maxime.ripard@bootlin.com> <80e1d9cb49c6df06843e49332685f2b401023292.camel@collabora.com> <20180822091557.gtnlgoebyv6yttzf@flea> Organization: Bootlin Content-Type: multipart/signed; micalg="pgp-sha256"; protocol="application/pgp-signature"; boundary="=-3aAsz1HYbIFK9eXBX5+l" X-Mailer: Evolution 3.28.4 Mime-Version: 1.0 Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org --=-3aAsz1HYbIFK9eXBX5+l Content-Type: text/plain; charset="UTF-8" Content-Transfer-Encoding: quoted-printable Hi, On Wed, 2018-08-22 at 22:24 +0900, Tomasz Figa wrote: > On Wed, Aug 22, 2018 at 10:03 PM Paul Kocialkowski > wrote: > >=20 > > Hi, > >=20 > > On Wed, 2018-08-22 at 18:54 +0900, Tomasz Figa wrote: > > > On Wed, Aug 22, 2018 at 6:16 PM Maxime Ripard wrote: > > > >=20 > > > > Hi, > > > >=20 > > > > On Tue, Aug 21, 2018 at 01:58:38PM -0300, Ezequiel Garcia wrote: > > > > > On Wed, 2018-06-13 at 16:07 +0200, Maxime Ripard wrote: > > > > > > From: Pawel Osciak > > > > > >=20 > > > > > > Signed-off-by: Pawel Osciak > > > > > > Reviewed-by: Wu-cheng Li > > > > > > Tested-by: Tomasz Figa > > > > > > [rebase44(groeck): include linux/types.h in v4l2-controls.h] > > > > > > Signed-off-by: Guenter Roeck > > > > > > Signed-off-by: Maxime Ripard > > > > > > --- > > > > > >=20 > > > > >=20 > > > > > [..] > > > > > > diff --git a/include/uapi/linux/videodev2.h b/include/uapi/linu= x/videodev2.h > > > > > > index 242a6bfa1440..4b4a1b25a0db 100644 > > > > > > --- a/include/uapi/linux/videodev2.h > > > > > > +++ b/include/uapi/linux/videodev2.h > > > > > > @@ -626,6 +626,7 @@ struct v4l2_pix_format { > > > > > > #define V4L2_PIX_FMT_H264 v4l2_fourcc('H', '2', '6', '4') = /* H264 with start codes */ > > > > > > #define V4L2_PIX_FMT_H264_NO_SC v4l2_fourcc('A', 'V', 'C', '1'= ) /* H264 without start codes */ > > > > > > #define V4L2_PIX_FMT_H264_MVC v4l2_fourcc('M', '2', '6', '4') = /* H264 MVC */ > > > > > > +#define V4L2_PIX_FMT_H264_SLICE v4l2_fourcc('S', '2', '6', '4'= ) /* H264 parsed slices */ > > > > >=20 > > > > > As pointed out by Tomasz, the Rockchip VPU driver expects start c= odes [1], so the userspace > > > > > should be aware of it. Perhaps we could document this pixel forma= t better as: > > > > >=20 > > > > > #define V4L2_PIX_FMT_H264_SLICE v4l2_fourcc('S', '2', '6', '4') /= * H264 parsed slices with start codes */ > > > >=20 > > > > I'm not sure this is something we want to do at that point. libva > > > > doesn't give the start code, so this is only going to make the life= of > > > > the sane controllers more difficult. And if you need to have the st= art > > > > code and parse it, then you're not so stateless anymore. > > >=20 > > > I might not remember correctly, but Rockchip decoder does some slice > > > parsing on its own (despite not doing any higher level parsing). > > > Probably that's why it needs those start codes. > >=20 > > The VPU found on Allwinner platforms also provides a mechanism to parse > > the bitstream data via a dedicated interface through the VPU registers. > > It is used in libvdpau-sunxi but not in our driver, because we don't > > want to be doing bitstream parsing in the kernel. > >=20 > > It would be good to know if this is just a feature of the Rockchip VPU > > hardware that can be skipped (like on Allwinner) or if it's a hard > > requirement in its decoding pipeline. >=20 > It's a hard requirement for its decoding pipeline, but... >=20 > > Also, maybe it only concerns the > > slice header? It is already part of the slice data (provided by VAAPI) > > for H.264/H.265 and an offset is provided to the beginning of the coded > > video data. >=20 > Yes, it seems to be only the slice header. Sounds good, then I don't have any problem with that. > >=20 > > > I wonder if libva is the best reference here. It's been designed > > > almost entirely by Intel for Intel video hardware. We want something > > > that could work with a wide range of devices and avoid something like > > > a need to create a semi-stateless API few months later. In fact, > > > hardware from another vendor, we're working with, also does parsing o= f > > > slice headers internally. Moreover, we have some weird > > > kind-of-stateful decoders, which cannot fully deal with bitstream on > > > its own, e.g. cannot parse formats, cannot handle resolution changes, > > > need H264 bitstream NALUs split into separate buffers, etc. > > >=20 > > > As I suggested some time ago, having the full bitstream in the buffer= , > > > with offsets of particular units included in respective controls, > > > would be the most scalable thing. If really needed, we could add flag= s > > > telling the driver that particular units are present, so one's > > > implementation of libva could put only raw slice data in the buffers. > > > But perhaps it's libva which needs some amendment? > >=20 > > If the raw bitstream is needed, I think it would make more sense to use > > the already-existing formats for stateful VPUs along with the controls > > for stateless ones instead of having the full bitstream in the > > V4L2_PIX_FMT_*_SLICE formats. >=20 > It may indeed make sense to separate this based on pixel format. > However, how do we tell the client that it needs to provide those > controls? Current concept was based entirely on pixel format, so I > guess that would mean creating something like > V4L2_PIX_FMT_*_NOT_REALLY_SLICE (_PARSED, _STATELESS?). Might be okay, > though... How about declaring support for the request API (through the associated CAPs) and only having the non-_SLICE formats listed in ENUM_FMT? > > I would also be tempted to say that reconstructing the needed parts of > > the bitstream in-driver for these half-way VPUs would be a better > > approach than blurrying the line between how (and what) data should be > > passed for stateful and stateless VPUs at the API level. Stateless > > should only cover what's in the slice NAL unit RBSP, which excludes the > > start code detection bytes. It is no longer parsed data otherwise. >=20 > I'm not sure where such decision comes from. In particular, Chromium, > from which this code originates, includes start codes in > V4L2_PIX_FMT_H264_SLICE. As I mentioned earlier, we can't design this > API based only on 1 type of hardware semantics. The stateless API > should cover any kind of codec that needs user space assistance in > processing the stream, which in practice would be almost everything > for which stateful API doesn't work. Maybe we need to formalize what the stateless API aims to support and what the formats really entail. I was under the impression that it was synonymouse with providing parsed bitstream to the kernel. The way I understand "parsed bitstream" means the slice NALU RBSP as raw data and the metadata from other NALUs as controls (with the overlap of the slice header in recent formats, that is in both). I don't see any other sane boundary that could be conceptually attached to something like "providing parsed bitstream". Covering all that might be required for !=3D stateful seems hard to formalize conceptually and to delimit in general. > That said, since pixel format essentially specifies the buffer > contents, having such cases differentiated based on the pixel format > doesn't sound insane. Great! --=20 Paul Kocialkowski, Bootlin (formerly Free Electrons) Embedded Linux and kernel engineering https://bootlin.com --=-3aAsz1HYbIFK9eXBX5+l Content-Type: application/pgp-signature; name="signature.asc" Content-Description: This is a digitally signed message part Content-Transfer-Encoding: 7bit -----BEGIN PGP SIGNATURE----- iQEzBAABCAAdFiEEJZpWjZeIetVBefti3cLmz3+fv9EFAlt9c4QACgkQ3cLmz3+f v9GVPAf/QtvvtBKBM3MAU7tEHGmfMj7cEY824F/9v/3eECN9T39p0HyvsLupLInA g9L9SjVfVmKImhuarT3lB47XIt9KArBtomim1w2msne98PCXSnNHqs1RU81z3ufz HSBPE0mMQVNpXlUCk6nshaNPn7x7u2tW8816YH2RgFOiKtVt7YIZlddi3TrxyYXF lY4hWHk9gihzSuF45PC4QqRnWr+oA9IMKhZJ2OyFFrfKnX6eI1XmL3DAtN2sCdh1 bkFIF5ISiYuAzwHZaXWbZKcbnVsF/C01fgmVAVkTjSfbm0PLmseclvO3chaFiwnh Dfy/A8wBcWVqD7X60LHKbMtJPBNbew== =ElKj -----END PGP SIGNATURE----- --=-3aAsz1HYbIFK9eXBX5+l-- From mboxrd@z Thu Jan 1 00:00:00 1970 From: paul.kocialkowski@bootlin.com (Paul Kocialkowski) Date: Wed, 22 Aug 2018 16:30:28 +0200 Subject: [PATCH 1/9] CHROMIUM: v4l: Add H264 low-level decoder API compound controls. In-Reply-To: References: <20180613140714.1686-1-maxime.ripard@bootlin.com> <20180613140714.1686-2-maxime.ripard@bootlin.com> <80e1d9cb49c6df06843e49332685f2b401023292.camel@collabora.com> <20180822091557.gtnlgoebyv6yttzf@flea> Message-ID: To: linux-arm-kernel@lists.infradead.org List-Id: linux-arm-kernel.lists.infradead.org Hi, On Wed, 2018-08-22 at 22:24 +0900, Tomasz Figa wrote: > On Wed, Aug 22, 2018 at 10:03 PM Paul Kocialkowski > wrote: > > > > Hi, > > > > On Wed, 2018-08-22 at 18:54 +0900, Tomasz Figa wrote: > > > On Wed, Aug 22, 2018 at 6:16 PM Maxime Ripard wrote: > > > > > > > > Hi, > > > > > > > > On Tue, Aug 21, 2018 at 01:58:38PM -0300, Ezequiel Garcia wrote: > > > > > On Wed, 2018-06-13 at 16:07 +0200, Maxime Ripard wrote: > > > > > > From: Pawel Osciak > > > > > > > > > > > > Signed-off-by: Pawel Osciak > > > > > > Reviewed-by: Wu-cheng Li > > > > > > Tested-by: Tomasz Figa > > > > > > [rebase44(groeck): include linux/types.h in v4l2-controls.h] > > > > > > Signed-off-by: Guenter Roeck > > > > > > Signed-off-by: Maxime Ripard > > > > > > --- > > > > > > > > > > > > > > > > [..] > > > > > > diff --git a/include/uapi/linux/videodev2.h b/include/uapi/linux/videodev2.h > > > > > > index 242a6bfa1440..4b4a1b25a0db 100644 > > > > > > --- a/include/uapi/linux/videodev2.h > > > > > > +++ b/include/uapi/linux/videodev2.h > > > > > > @@ -626,6 +626,7 @@ struct v4l2_pix_format { > > > > > > #define V4L2_PIX_FMT_H264 v4l2_fourcc('H', '2', '6', '4') /* H264 with start codes */ > > > > > > #define V4L2_PIX_FMT_H264_NO_SC v4l2_fourcc('A', 'V', 'C', '1') /* H264 without start codes */ > > > > > > #define V4L2_PIX_FMT_H264_MVC v4l2_fourcc('M', '2', '6', '4') /* H264 MVC */ > > > > > > +#define V4L2_PIX_FMT_H264_SLICE v4l2_fourcc('S', '2', '6', '4') /* H264 parsed slices */ > > > > > > > > > > As pointed out by Tomasz, the Rockchip VPU driver expects start codes [1], so the userspace > > > > > should be aware of it. Perhaps we could document this pixel format better as: > > > > > > > > > > #define V4L2_PIX_FMT_H264_SLICE v4l2_fourcc('S', '2', '6', '4') /* H264 parsed slices with start codes */ > > > > > > > > I'm not sure this is something we want to do at that point. libva > > > > doesn't give the start code, so this is only going to make the life of > > > > the sane controllers more difficult. And if you need to have the start > > > > code and parse it, then you're not so stateless anymore. > > > > > > I might not remember correctly, but Rockchip decoder does some slice > > > parsing on its own (despite not doing any higher level parsing). > > > Probably that's why it needs those start codes. > > > > The VPU found on Allwinner platforms also provides a mechanism to parse > > the bitstream data via a dedicated interface through the VPU registers. > > It is used in libvdpau-sunxi but not in our driver, because we don't > > want to be doing bitstream parsing in the kernel. > > > > It would be good to know if this is just a feature of the Rockchip VPU > > hardware that can be skipped (like on Allwinner) or if it's a hard > > requirement in its decoding pipeline. > > It's a hard requirement for its decoding pipeline, but... > > > Also, maybe it only concerns the > > slice header? It is already part of the slice data (provided by VAAPI) > > for H.264/H.265 and an offset is provided to the beginning of the coded > > video data. > > Yes, it seems to be only the slice header. Sounds good, then I don't have any problem with that. > > > > > I wonder if libva is the best reference here. It's been designed > > > almost entirely by Intel for Intel video hardware. We want something > > > that could work with a wide range of devices and avoid something like > > > a need to create a semi-stateless API few months later. In fact, > > > hardware from another vendor, we're working with, also does parsing of > > > slice headers internally. Moreover, we have some weird > > > kind-of-stateful decoders, which cannot fully deal with bitstream on > > > its own, e.g. cannot parse formats, cannot handle resolution changes, > > > need H264 bitstream NALUs split into separate buffers, etc. > > > > > > As I suggested some time ago, having the full bitstream in the buffer, > > > with offsets of particular units included in respective controls, > > > would be the most scalable thing. If really needed, we could add flags > > > telling the driver that particular units are present, so one's > > > implementation of libva could put only raw slice data in the buffers. > > > But perhaps it's libva which needs some amendment? > > > > If the raw bitstream is needed, I think it would make more sense to use > > the already-existing formats for stateful VPUs along with the controls > > for stateless ones instead of having the full bitstream in the > > V4L2_PIX_FMT_*_SLICE formats. > > It may indeed make sense to separate this based on pixel format. > However, how do we tell the client that it needs to provide those > controls? Current concept was based entirely on pixel format, so I > guess that would mean creating something like > V4L2_PIX_FMT_*_NOT_REALLY_SLICE (_PARSED, _STATELESS?). Might be okay, > though... How about declaring support for the request API (through the associated CAPs) and only having the non-_SLICE formats listed in ENUM_FMT? > > I would also be tempted to say that reconstructing the needed parts of > > the bitstream in-driver for these half-way VPUs would be a better > > approach than blurrying the line between how (and what) data should be > > passed for stateful and stateless VPUs at the API level. Stateless > > should only cover what's in the slice NAL unit RBSP, which excludes the > > start code detection bytes. It is no longer parsed data otherwise. > > I'm not sure where such decision comes from. In particular, Chromium, > from which this code originates, includes start codes in > V4L2_PIX_FMT_H264_SLICE. As I mentioned earlier, we can't design this > API based only on 1 type of hardware semantics. The stateless API > should cover any kind of codec that needs user space assistance in > processing the stream, which in practice would be almost everything > for which stateful API doesn't work. Maybe we need to formalize what the stateless API aims to support and what the formats really entail. I was under the impression that it was synonymouse with providing parsed bitstream to the kernel. The way I understand "parsed bitstream" means the slice NALU RBSP as raw data and the metadata from other NALUs as controls (with the overlap of the slice header in recent formats, that is in both). I don't see any other sane boundary that could be conceptually attached to something like "providing parsed bitstream". Covering all that might be required for != stateful seems hard to formalize conceptually and to delimit in general. > That said, since pixel format essentially specifies the buffer > contents, having such cases differentiated based on the pixel format > doesn't sound insane. Great! -- Paul Kocialkowski, Bootlin (formerly Free Electrons) Embedded Linux and kernel engineering https://bootlin.com -------------- next part -------------- A non-text attachment was scrubbed... Name: signature.asc Type: application/pgp-signature Size: 488 bytes Desc: This is a digitally signed message part URL: