linux-kernel.vger.kernel.org archive mirror
 help / color / mirror / Atom feed
From: Hans Verkuil <hverkuil-cisco@xs4all.nl>
To: dikshita@codeaurora.org
Cc: Nicolas Dufresne <nicolas@ndufresne.ca>,
	mchehab@kernel.org, ezequiel@collabora.com,
	boris.brezillon@collabora.com, ribalda@kernel.org,
	paul.kocialkowski@bootlin.com, posciak@chromium.org,
	linux-media@vger.kernel.org, stanimir.varbanov@linaro.org,
	linux-kernel@vger.kernel.org, linux-arm-msm@vger.kernel.org,
	vgarodia@codeaurora.org, majja@codeaurora.org,
	linux-media-owner@vger.kernel.org
Subject: Re: [RFC PATCH 0/1] Add LTR controls
Date: Thu, 16 Jul 2020 10:55:49 +0200	[thread overview]
Message-ID: <d37e4e83-b7ae-7d44-c75f-2055f11ae898@xs4all.nl> (raw)
In-Reply-To: <40040141fc3027c3eb1fdebc1a0e8ade@codeaurora.org>

On 16/06/2020 20:41, dikshita@codeaurora.org wrote:
> Hi Hans, Nicolas,
> 
> Thanks for your comments.
> 
> On 2020-06-12 14:41, Hans Verkuil wrote:
>> Hi Dikshita, Nicolas,
>>
>> On 11/06/2020 16:22, Nicolas Dufresne wrote:
>>> Le jeudi 11 juin 2020 à 15:55 +0530, Dikshita Agarwal a écrit :
>>>> LTR (Long Term Reference) frames are the frames that are encoded 
>>>> sometime in the past
>>>> and stored in the DPB buffer list to be used as reference to encode 
>>>> future frames.
>>>> One usage of LTR encoding is to reduce error propagation for video 
>>>> transmission
>>>> in packet lossy networks.  For example, encoder may want to specify 
>>>> some key frames as
>>>> LTR pictures and use them as reference frames for encoding. With 
>>>> extra protection
>>>> selectively on these LTR frames or synchronization with the receiver 
>>>> of reception of
>>>> the LTR frames during transmission, decoder can receive reference 
>>>> frames more reliably
>>>> than other non-reference frames. As a result, transmission error can 
>>>> be effectively
>>>> restricted within certain frames rather than propagated to future 
>>>> frames.
>>>>
>>>> We are introducing below V4l2 Controls for this feature
>>>> 1. V4L2_CID_MPEG_VIDEO_LTRCOUNT
>>>>     a. This is used to query or configure the number of LTR frames.
>>>>        This is a static control and is controlled by the client.
>>>>     b. The LTR index varies from 0 to the max LTR-1.
>>>>     c. If LTR Count is more than max supported LTR count (max LTR) by 
>>>> driver, it will be rejected.
>>>>     d. Auto Marking : If LTR count is non zero,
>>>>         1) first LTR count frames would be mark as LTR automatically 
>>>> after
>>>>       	   every IDR frame (inclusive).
>>>>         2) For multilayer encoding: first LTR count base layer 
>>>> reference frames starting after
>>>>            every IDR frame (inclusive) in encoding order would be 
>>>> marked as LTR frames by the encoder.
>>>>         3) Auto marking of LTR due to IDR should consider following 
>>>> conditions:
>>>>             1. The frame is not already set to be marked as LTR.
>>>>             2. The frame is part of the base layer in the 
>>>> hierarchical layer case.
>>>>             3. The number of frames currently marked as LTR is less 
>>>> than the maximum LTR frame index plus 1.
>>>>     e. Encoder needs to handle explicit Mark/Use command when encoder 
>>>> is still doing "auto" marking
>>
>> I don't follow this, quite possibly due to lack of experience with 
>> encoders.
>>
>> I kind of would expect to see two modes: either automatic where 
>> encoders can
>> mark up to LTR_COUNT frames as long term reference, and userspace just 
>> sets
>> LTR_COUNT and doesn't have to do anything else.
>>
>> Or it is manual mode where userspace explicitly marks long term 
>> reference
>> frames.
>>
>> From the proposal above it looks like you can mix auto and manual 
>> modes.
>>
>> BTW, how do you 'unmark' long term reference frames?
>>
>> This feature is for stateful encoders, right?
>>
>>>
>>> Perhaps we are missing a LONG_TERM_REFERENCE_MODE ? I bet some encoder
>>> can select by themself long term references and even some encoders may
>>> not let the user decide.
>>>
>>> (not huge han of LTR acronyme, but that could be fine too, assuming 
>>> you
>>> add more _).
>>>
> 
> Userspace sets LTR count which signifies the number of LTR frames 
> encoder needs to generate or keep.
> The encoder has to build-up its internal buffer reference list (aka DBP 
> list or recon buffer list).
> In order to achieve that encoder will fill It's LTR (long term 
> references) list and STR (short term references) list
> by auto marking n frames as LTR frames(n is equal to LTR count) based on 
> auto-marking dictated by the encoder spec.
> The client then can replace those automatically marked frames with new 
> frames using V4L2_CID_MPEG_VIDEO_MARKLTRFRAME and can ask
> encoder to refer the newly marked frame for encoding the next frame 
> using V4L2_CID_MPEG_VIDEO_USELTRFRAME.
> 
>>>>
>>>> 2. V4L2_CID_MPEG_VIDEO_MARKLTRFRAME :
>>>>     a. This signals to mark the current frame as LTR frame. It is a 
>>>> dynamic control and also provide the LTR index to be used.
>>>>     b. the LTR index provided by this control should never exceed the 
>>>> max LTR-1. Else it will be rejected.
>>>
>>> The "current" frame seems a bit loose. Perhaps you wanted to use 
>>> buffer
>>> flags ? A bit like what we have to signal TOP/BOTTOM fields in
>>> alternate interlacing.
>>
>> I was thinking the same thing. Using a control for this doesn't seem 
>> right.
>>
> 
> the client sets this to replace automatically marked frames by the 
> encoder with a particular frame.
> this provides an index that ranges from 0 to LTR count-1 and then the 
> particular frame will be marked with that index.
> this can be achieved through request by associating this control with a 
> specific buffer to make it synchronized.
> 
>>>
>>>>
>>>> 3. V4L2_CID_MPEG_VIDEO_USELTRFRAME :
>>>>     a. This specifies the LTR frame(s) to be used for encoding the 
>>>> current frame. This is a dynamic control.
>>>>     b. LTR Use Bitmap : this consists of bits [0, 15]. A total of N 
>>>> LSB bits of this field are valid,
>>>>        where N is the maximum number of LTRs supported. All the other 
>>>> bits are invalid and should be rejected.
>>>>        The LSB corresponds to the LTR index 0. Bit N-1 from the LSB 
>>>> corresponds to the LTR index max LTR-1.
>>
>> How would userspace know this? Esp. with auto marking since userspace 
>> would have
>> to predict how auto marking works (if I understand this correctly).
>>
> 
> Client sets LTR count which tells about the number of LTR frames 
> automatically marked by the encoder.
> so client can use LTR index (0 to LTR count -1) to ask encoder to refer 
> any particular
> frame (marked automatically by driver or marked by client with 
> V4L2_CID_MPEG_VIDEO_MARKLTRFRAME) as a reference to encode the next 
> frame.
> 
>> For which HW encoder is this meant?
>>
> This is primarily meant for H264 and HEVC.

The venus encoder?

Some more questions:

1) How many LTR frames do h.264 and hevc allow?
2) Given N LTR frames, is there a ordering of those frames? E.g.
   the LTR frame with index 0 is processed/used differently from
   LTR frame with index 1? Or are they all equal in that it is just a pool
   of LTR frames that the encoder can use as it wishes?

Regards,

	Hans

> 
> Thanks,
> Dikshita
> 
>>>
>>> Note, I haven't captured very well the userspace control flow, perhaps
>>> this could be enhanced through writing some documentation.
>>>
>>> As per all other generic encoder controls, we need to make sure it 
>>> will
>>> be usable and flexible enough for multiple HW blocks, as it can be
>>> tedious to extend later otherwise. It is important that along with 
>>> this
>>> RFC you provide some comparisons with with other HW / SW APIs in order
>>> to help justify the design decisions. I also think there should be
>>> link made V4L2_CID_MPEG_VIDEO_GOP_* , number of B-Frames etc.
>>
>> I agree with Nicolas.
>>
>> Regards,
>>
>> 	Hans
>>
>>>
>>> regards,
>>> Nicolas
>>>
>>>>
>>>> Dikshita Agarwal (1):
>>>>   media: v4l2-ctrls:  add control for ltr
>>>>
>>>>  drivers/media/v4l2-core/v4l2-ctrls.c | 6 ++++++
>>>>  include/uapi/linux/v4l2-controls.h   | 4 ++++
>>>>  2 files changed, 10 insertions(+)
>>>>
>>>


  parent reply	other threads:[~2020-07-16  8:56 UTC|newest]

Thread overview: 11+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2020-06-11 10:25 [RFC PATCH 0/1] Add LTR controls Dikshita Agarwal
2020-06-11 10:25 ` [RFC PATCH 1/1] media: v4l2-ctrls: add control for ltr Dikshita Agarwal
2020-06-11 14:22 ` [RFC PATCH 0/1] Add LTR controls Nicolas Dufresne
2020-06-12  9:11   ` Hans Verkuil
2020-06-16 18:41     ` dikshita
2020-07-14  7:21       ` dikshita
2020-07-16  8:55       ` Hans Verkuil [this message]
2020-07-20 14:33         ` dikshita
2020-07-22 14:26           ` Hans Verkuil
2020-08-06 11:05             ` dikshita
2020-08-07  8:39               ` Hans Verkuil

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=d37e4e83-b7ae-7d44-c75f-2055f11ae898@xs4all.nl \
    --to=hverkuil-cisco@xs4all.nl \
    --cc=boris.brezillon@collabora.com \
    --cc=dikshita@codeaurora.org \
    --cc=ezequiel@collabora.com \
    --cc=linux-arm-msm@vger.kernel.org \
    --cc=linux-kernel@vger.kernel.org \
    --cc=linux-media-owner@vger.kernel.org \
    --cc=linux-media@vger.kernel.org \
    --cc=majja@codeaurora.org \
    --cc=mchehab@kernel.org \
    --cc=nicolas@ndufresne.ca \
    --cc=paul.kocialkowski@bootlin.com \
    --cc=posciak@chromium.org \
    --cc=ribalda@kernel.org \
    --cc=stanimir.varbanov@linaro.org \
    --cc=vgarodia@codeaurora.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).