From mboxrd@z Thu Jan 1 00:00:00 1970
From: Tomasz Figa
Subject: Re: [PATCH 10/10] venus: dec: make decoder compliant with stateful codec API
Date: Tue, 9 Apr 2019 18:59:47 +0900
Message-ID:
References: <20190117162008.25217-1-stanimir.varbanov@linaro.org> <20190117162008.25217-11-stanimir.varbanov@linaro.org> <28069a44-b188-6b89-2687-542fa762c00e@linaro.org> <57419418d377f32d0e6978f4e4171c0da7357cbb.camel@ndufresne.ca> <1548938556.4585.1.camel@pengutronix.de> <1f8485785a21c0b0e071a3a766ed2cbc727e47f6.camel@ndufresne.ca>
Mime-Version: 1.0
Content-Type: text/plain; charset="UTF-8"
Content-Transfer-Encoding: quoted-printable
Return-path:
In-Reply-To:
Sender: linux-kernel-owner@vger.kernel.org
To: Hans Verkuil
Cc: Nicolas Dufresne, Philipp Zabel, Stanimir Varbanov, Linux Media Mailing List, Mauro Carvalho Chehab, Linux Kernel Mailing List, linux-arm-msm, Vikash Garodia, Alexandre Courbot, Malathi Gottam
List-Id: linux-arm-msm@vger.kernel.org

On Thu, Feb 7, 2019 at 4:33 PM Tomasz Figa wrote:
>
> On Tue, Feb 5, 2019 at 7:35 PM Hans Verkuil wrote:
> >
> > On 2/5/19 10:31 AM, Tomasz Figa wrote:
> > > On Tue, Feb 5, 2019 at 6:00 PM Hans Verkuil wrote:
> > >>
> > >> On 2/5/19 7:26 AM, Tomasz Figa wrote:
> > >>> On Fri, Feb 1, 2019 at 12:18 AM Nicolas Dufresne wrote:
> > >>>>
> > >>>> On Thursday, January 31, 2019 at 22:34 +0900, Tomasz Figa wrote:
> > >>>>> On Thu, Jan 31, 2019 at 9:42 PM Philipp Zabel wrote:
> > >>>>>> Hi Nicolas,
> > >>>>>>
> > >>>>>> On Wed, 2019-01-30 at 10:32 -0500, Nicolas Dufresne wrote:
> > >>>>>>> On Wednesday, January 30, 2019 at 15:17 +0900, Tomasz Figa wrote:
> > >>>>>>>>> I don't remember saying that, maybe I meant to say there might be a
> > >>>>>>>>> workaround ?
> > >>>>>>>>>
> > >>>>>>>>> For the fact, here we queue the headers (or first frame):
> > >>>>>>>>>
> > >>>>>>>>> https://gitlab.freedesktop.org/gstreamer/gst-plugins-good/blob/master/sys/v4l2/gstv4l2videodec.c#L624
> > >>>>>>>>>
> > >>>>>>>>> Then few line below this helper does G_FMT internally:
> > >>>>>>>>>
> > >>>>>>>>> https://gitlab.freedesktop.org/gstreamer/gst-plugins-good/blob/master/sys/v4l2/gstv4l2videodec.c#L634
> > >>>>>>>>> https://gitlab.freedesktop.org/gstreamer/gst-plugins-good/blob/master/sys/v4l2/gstv4l2object.c#L3907
> > >>>>>>>>>
> > >>>>>>>>> And just plainly fails if G_FMT returns an error of any type. This was
> > >>>>>>>>> how Kamil designed it initially for MFC driver. There was no other
> > >>>>>>>>> alternative back then (no EAGAIN yet either).
> > >>>>>>>>
> > >>>>>>>> Hmm, was that ffmpeg then?
> > >>>>>>>>
> > >>>>>>>> So would it just set the OUTPUT width and height to 0? Does it mean
> > >>>>>>>> that gstreamer doesn't work with coda and mtk-vcodec, which don't have
> > >>>>>>>> such wait in their g_fmt implementations?
> > >>>>>>>
> > >>>>>>> I don't know for MTK, I don't have the hardware and didn't integrate
> > >>>>>>> their vendor pixel format. For the CODA, I know it works, if there is
> > >>>>>>> no wait in the G_FMT, then I suppose we are being really lucky with the
> > >>>>>>> timing (it would be that the drivers process the SPS/PPS synchronously,
> > >>>>>>> and a simple lock in the G_FMT call is enough to wait). Adding Philipp
> > >>>>>>> in CC, he could explain how this works, I know they use GStreamer in
> > >>>>>>> production, and he would have fixed GStreamer already if that was
> > >>>>>>> causing important issue.
> > >>>>>>
> > >>>>>> CODA predates the width/height=0 rule on the coded/OUTPUT queue.
> > >>>>>> It currently behaves more like a traditional mem2mem device.
> > >>>>>
> > >>>>> The rule in the latest spec is that if width/height is 0 then CAPTURE
> > >>>>> format is determined only after the stream is parsed. Otherwise it's
> > >>>>> instantly deduced from the OUTPUT resolution.
> > >>>>>
> > >>>>>> When width/height is set via S_FMT(OUT) or output crop selection, the
> > >>>>>> driver will believe it and set the same (rounded up to macroblock
> > >>>>>> alignment) on the capture queue without ever having seen the SPS.
> > >>>>>
> > >>>>> That's why I asked whether gstreamer sets width and height of OUTPUT
> > >>>>> to non-zero values. If so, there is no regression, as the specs mimic
> > >>>>> the coda behavior.
> > >>>>
> > >>>> I see, with Philipp's answer it explains why it works. Note that
> > >>>> GStreamer sets the display size on the OUTPUT format (in fact we pass
> > >>>> as much information as we have, because a) it's generic code and b) it
> > >>>> will be needed someday when we enable pre-allocation (REQBUFS before
> > >>>> SPS/PPS is passed, to avoid the setup delay introduce by allocation,
> > >>>> mostly seen with CMA base decoder). In any case, the driver reported
> > >>>> display size should always be ignored in GStreamer, the only
> > >>>> information we look at is the G_SELECTION for the case the x/y or the
> > >>>> cropping rectangle is non-zero.
> > >>>>
> > >>>> Note this can only work if the capture queue is not affected by the
> > >>>> coded size, or if the round-up made by the driver is bigger or equal to
> > >>>> that coded size. I believe CODA falls into the first category, since
> > >>>> the decoding happens in a separate set of buffers and are then de-tiled
> > >>>> into the capture buffers (if understood correctly).
> > >>>
> > >>> Sounds like it would work only if coded size is equal to the visible
> > >>> size (that GStreamer sets) rounded up to full macroblocks. Non-zero x
> > >>> or y in the crop could be problematic too.
> > >>>
> > >>> Hans, what's your view on this? Should we require G_FMT(CAPTURE) to
> > >>> wait until a format becomes available or the OUTPUT queue runs out of
> > >>
> > >> You mean CAPTURE queue? If not, then I don't understand that part.
> > >
> > > No, I exactly meant the OUTPUT queue. The behavior of s5p-mfc in case
> > > of the format not being detected yet is to waits for any pending
> > > bitstream buffers to be processed by the decoder before returning an
> > > error.
> > >
> > > See https://elixir.bootlin.com/linux/v5.0-rc5/source/drivers/media/platform/s5p-mfc/s5p_mfc_dec.c#L329
> >
> > It blocks?! That shouldn't happen. Totally against the spec.
> >
>
> Yeah and that's what this patch tries to implement in venus as well
> and is seemingly required for compatibility with gstreamer...
>
> > > .
> > >
> > >>
> > >>> buffers?
> > >>
> > >> First see my comment here regarding G_FMT returning an error:
> > >>
> > >> https://www.spinics.net/lists/linux-media/msg146505.html
> > >>
> > >> In my view that is a bad idea.
> > >
> > > I don't like it either, but it seemed to be the most consistent and
> > > compatible behavior, but I'm not sure anymore.
> > >
> > >>
> > >> What G_FMT should return between the time a resolution change was
> > >> detected and the CAPTURE queue being drained (i.e. the old or the new
> > >> resolution?) is something I am not sure about.
> > >
> > > Note that we're talking here about the initial stream information
> > > detection, when the driver doesn't have any information needed to
> > > determine the CAPTURE format yet.
> >
> > IMHO the driver should just start off with some default format, it
> > really doesn't matter what that is.
> >
>
> I guess that's fine indeed.
>
> > This initial situation is really just a Seek operation: you have a format,
> > you seek to a new position and when you find the resolution of the
> > first frame in the bitstream it triggers a SOURCE_CHANGE event.
> > Actually, to be really consistent with the Seek: you only need to
> > trigger this event if 1) the new resolution is different from the
> > current format, or 2) the capture queue is empty. 2) will never happen
> > during a normal Seek, so that's a little bit special to this initial
> > situation.
>
> Having the error returned allowed the applications to handle the
> initial parsing without the event, though. It could have waited for
> all the OUTPUT buffers to be dequeued and then call G_FMT to check if
> that was enough data to obtain the format.
>

Actually, I'm not sure whether triggering the event only if the
resolution changed is a good idea. It makes the application have no
idea when it can actually start preparing the CAPTURE queue. Any
thoughts? Should it just try to allocate instantly and then
reallocate? That would be a waste of time, though, especially since
allocations are not cheap.

Best regards,
Tomasz
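
For reference, the event rule quoted above (signal SOURCE_CHANGE if the
new resolution differs from the current format, or if the capture queue
is empty) could be modelled roughly as below. This is only a sketch:
struct dec_state, its fields and needs_source_change() are hypothetical
names invented for illustration, not code from venus or any other
driver, and "capture queue is empty" is assumed here to mean that no
CAPTURE buffers are queued.

```c
#include <stdbool.h>

/*
 * Hypothetical snapshot of decoder state. The field names are
 * illustrative only and are not taken from any real driver.
 */
struct dec_state {
	unsigned int width;          /* width of the current CAPTURE format */
	unsigned int height;         /* height of the current CAPTURE format */
	unsigned int capture_queued; /* buffers currently queued on CAPTURE */
};

/*
 * Rule from the quoted text: signal SOURCE_CHANGE if 1) the parsed
 * resolution differs from the current format, or 2) the CAPTURE queue
 * is empty (assumed to mean "no buffers queued").
 */
static bool needs_source_change(const struct dec_state *s,
				unsigned int new_w, unsigned int new_h)
{
	bool res_changed = new_w != s->width || new_h != s->height;
	bool capture_empty = s->capture_queued == 0;

	return res_changed || capture_empty;
}
```

Under these assumptions, a freshly opened decoder with a default format
and nothing queued on CAPTURE gets the event even when the parsed
resolution happens to match the default, while a seek within a stream
of unchanged resolution with buffers still queued does not.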