From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on aws-us-west-2-korg-lkml-1.web.codeaurora.org X-Spam-Level: X-Spam-Status: No, score=-17.3 required=3.0 tests=BAYES_00,DKIM_SIGNED, DKIM_VALID,DKIM_VALID_AU,HEADER_FROM_DIFFERENT_DOMAINS,INCLUDES_CR_TRAILER, INCLUDES_PATCH,MAILING_LIST_MULTI,NICE_REPLY_A,SPF_HELO_NONE,SPF_PASS, USER_AGENT_SANE_1 autolearn=ham autolearn_force=no version=3.4.0 Received: from mail.kernel.org (mail.kernel.org [198.145.29.99]) by smtp.lore.kernel.org (Postfix) with ESMTP id CEB75C48BCD for ; Wed, 9 Jun 2021 14:02:39 +0000 (UTC) Received: from vger.kernel.org (vger.kernel.org [23.128.96.18]) by mail.kernel.org (Postfix) with ESMTP id B4E3D61364 for ; Wed, 9 Jun 2021 14:02:39 +0000 (UTC) Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S237210AbhFIOEc (ORCPT ); Wed, 9 Jun 2021 10:04:32 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:60854 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S232724AbhFIOEa (ORCPT ); Wed, 9 Jun 2021 10:04:30 -0400 Received: from perceval.ideasonboard.com (perceval.ideasonboard.com [IPv6:2001:4b98:dc2:55:216:3eff:fef7:d647]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 94521C061574 for ; Wed, 9 Jun 2021 07:02:35 -0700 (PDT) Received: from [192.168.1.111] (91-158-153-130.elisa-laajakaista.fi [91.158.153.130]) by perceval.ideasonboard.com (Postfix) with ESMTPSA id 5A45C46E; Wed, 9 Jun 2021 16:02:33 +0200 (CEST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=ideasonboard.com; s=mail; t=1623247353; bh=i47mpQKDFYs2oAr4/8Dvzle+RuqBCo4D+rdp20igkPQ=; h=To:Cc:References:From:Subject:Date:In-Reply-To:From; b=wCWhx7iW7S7YoffBDGBOxBze3JboKulSAP0aQkHAyTlHulTJz2ch4fsZ1uhJDjmQw aXEbQZrA8peFNGifhXOWbhArR0TW2CMbLyHrt6Qj5QazJ9zg/WTF+BP/p8M0gcHav7 In/yGC3ALxZSkr361FcN7CuSrYi8z51KEsT9yVMY= To: Laurent Pinchart Cc: Pratyush Yadav , Lokesh Vutla , linux-media@vger.kernel.org References: <20210524110909.672432-1-tomi.valkeinen@ideasonboard.com> <20210524110909.672432-33-tomi.valkeinen@ideasonboard.com> <3721c31f-d2a4-6800-cb27-4bd47971d401@ideasonboard.com> From: Tomi Valkeinen Subject: Re: [PATCH v3 32/38] media: ti-vpe: cal: use CSI-2 frame number Message-ID: <130b25f1-46e2-3d29-8d85-9ecb56eec0cd@ideasonboard.com> Date: Wed, 9 Jun 2021 17:02:32 +0300 User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:78.0) Gecko/20100101 Thunderbird/78.8.1 MIME-Version: 1.0 In-Reply-To: Content-Type: text/plain; charset=utf-8; format=flowed Content-Language: en-US Content-Transfer-Encoding: 8bit Precedence: bulk List-ID: X-Mailing-List: linux-media@vger.kernel.org On 09/06/2021 15:47, Laurent Pinchart wrote: > Hi Tomi, > > On Tue, Jun 08, 2021 at 03:46:21PM +0300, Tomi Valkeinen wrote: >> On 08/06/2021 10:38, Tomi Valkeinen wrote: >>> On 07/06/2021 19:51, Laurent Pinchart wrote: >>>> On Mon, Jun 07, 2021 at 05:55:05PM +0300, Tomi Valkeinen wrote: >>>>> On 07/06/2021 16:42, Laurent Pinchart wrote: >>>>>> On Mon, Jun 07, 2021 at 03:39:45PM +0300, Tomi Valkeinen wrote: >>>>>>> On 04/06/2021 17:04, Laurent Pinchart wrote: >>>>>>>> On Mon, May 24, 2021 at 02:09:03PM +0300, Tomi Valkeinen wrote: >>>>>>>>> The driver fills buf->vb.sequence with an increasing number which is >>>>>>>>> incremented by the driver. This feels a bit pointless, as the userspace >>>>>>>>> could as well track that kind of number itself. Instead, lets use the >>>>>>>> >>>>>>>> s/lets/let's/ >>>>>>>> >>>>>>>>> frame number provided in the CSI-2 data from the sensor. >>>>>>>>> >>>>>>>>> Signed-off-by: Tomi Valkeinen >>>>>>>>> --- >>>>>>>>>     drivers/media/platform/ti-vpe/cal.c | 7 +++++-- >>>>>>>>>     drivers/media/platform/ti-vpe/cal.h | 1 - >>>>>>>>>     2 files changed, 5 insertions(+), 3 deletions(-) >>>>>>>>> >>>>>>>>> diff --git a/drivers/media/platform/ti-vpe/cal.c >>>>>>>>> b/drivers/media/platform/ti-vpe/cal.c >>>>>>>>> index 888706187fd1..62c45add4efe 100644 >>>>>>>>> --- a/drivers/media/platform/ti-vpe/cal.c >>>>>>>>> +++ b/drivers/media/platform/ti-vpe/cal.c >>>>>>>>> @@ -493,7 +493,6 @@ void cal_ctx_unprepare(struct cal_ctx *ctx) >>>>>>>>>     void cal_ctx_start(struct cal_ctx *ctx) >>>>>>>>>     { >>>>>>>>> -    ctx->sequence = 0; >>>>>>>>>         ctx->dma.state = CAL_DMA_RUNNING; >>>>>>>>>         /* Configure the CSI-2, pixel processing and write DMA >>>>>>>>> contexts. */ >>>>>>>>> @@ -586,6 +585,10 @@ static inline void cal_irq_wdma_start(struct >>>>>>>>> cal_ctx *ctx) >>>>>>>>>     static inline void cal_irq_wdma_end(struct cal_ctx *ctx) >>>>>>>>>     { >>>>>>>>>         struct cal_buffer *buf = NULL; >>>>>>>>> +    u32 frame_num; >>>>>>>>> + >>>>>>>>> +    frame_num = cal_read(ctx->cal, >>>>>>>>> CAL_CSI2_STATUS(ctx->phy->instance, >>>>>>>>> +                               ctx->csi2_ctx)) & 0xffff; >>>>>>>>>         spin_lock(&ctx->dma.lock); >>>>>>>>> @@ -607,7 +610,7 @@ static inline void cal_irq_wdma_end(struct >>>>>>>>> cal_ctx *ctx) >>>>>>>>>         if (buf) { >>>>>>>>>             buf->vb.vb2_buf.timestamp = ktime_get_ns(); >>>>>>>>>             buf->vb.field = ctx->v_fmt.fmt.pix.field; >>>>>>>>> -        buf->vb.sequence = ctx->sequence++; >>>>>>>>> +        buf->vb.sequence = frame_num; >>>>>>>> >>>>>>>> We'll need something a bit more complicated. The CSI-2 frame >>>>>>>> number is >>>>>>>> not mandatory, and when used, it is a 16-bit number starting at 1 and >>>>>>>> counting to an unspecified value larger than one, resetting to 1 >>>>>>>> at the >>>>>>>> end of the cycle. The V4L2 sequence number, on the other hand, is a >>>>>>>> monotonic counter starting at 0 and wrapping only at 2^32-1. We >>>>>>>> should >>>>>>>> thus keep a software sequence counter and >>>>>>>> >>>>>>>> - increase it by 1 if the frame number is zero >>>>>>>> - increase it by frame_num - last_frame_num (with wrap-around of >>>>>>>>      frame_num handled) otherwise >>>>>>> >>>>>>> Ok... I wonder if we need a new field for this, though. The problem I >>>>>>> was solving when I changed this to use the CSI-2 frame-number was >>>>>>> how to >>>>>>> associate a pixel frame and a metadata frame. >>>>>>> >>>>>>> Their CSI-2 frame-numbers match (as they are from the same original >>>>>>> CSI-2 frame), so the userspace can use that to figure the matching >>>>>>> frames. While the above method you suggest should give us identical >>>>>>> sequence numbers for pixel and metadata, I think it's going a bit away >>>>>>> from my intended purpose, and possibly risks ending up with different >>>>>>> sequences for pixel and metadata. >>>>>> >>>>>> Why do you think they could get out of sync (assuming the sensor >>>>>> supports frame numbers of course, if it always returns 0, that's not >>>>>> usable for the purpose of synchronization). >>>>> >>>>> If there's a requirement that the sequence starts from 0, it doesn't >>>>> work, as the pixel and metadata video capture may be started at >>>>> different times. When pixel capture starts, the frame number could be 10 >>>>> and pixel sequence would be 0, but when metadata capture starts, the >>>>> frame number could be 12, and pixel seq would thus be 2 and meta seq 0. >>>>> >>>>> But even if we allow the seq to start from the current frame number, >>>> >>>> Good point. I think we can allow starting at a non-zero value to handle >>>> this. >>>> >>>>> this doesn't work if the frame number has wrapped between starting the >>>>> pixel capture and starting the meta capture. >>>> >>>> The timestamp should be enough to handle this, the timestamp difference >>>> between two wraparounds should be large enough to sync the two streams >>>> without any risk. >>> >>> Well, this still won't work, as CAL doesn't know when the sensor's frame >>> counter wraps. CAL can detect that the counter has wrapped, but it >>> doesn't know if some frames were missed. This leads to the two streams >>> getting out of sync. >>> >>> I'll try to figure out if I can somehow handle the frame counter in a >>> shared manner, so that multiple streams that originate from the same >>> frame would always use the same sequence numbers for same frame numbers. >> >> How about something like this: >> >> commit 22fb554bd2b71fa6d245e2595424639bee0ed604 >> Author: Tomi Valkeinen >> Date: Tue Apr 13 11:20:41 2021 +0300 >> >> media: ti-vpe: cal: use CSI-2 frame number >> >> The userspace needs a way to match received metadata buffers to pixel >> data buffers. The obvious way to do this is to use the CSI-2 frame >> number, as both the metadata and the pixel data have the same frame >> number as they come from the same frame. >> >> However, we don't have means to convey the frame number to userspace. We >> do have the 'sequence' field, which with a few tricks can be used for >> this purpose. >> >> To achieve this, track the frame number for each virtual channel and >> increase the sequence for each virtual channel by frame-number - >> previous-frame-number, also taking into account the eventual wrap of the >> CSI-2 frame number. >> >> This way we get a monotonically increasing sequence number which is >> common to all streams using the same virtual channel. >> >> Signed-off-by: Tomi Valkeinen >> >> diff --git a/drivers/media/platform/ti-vpe/cal-camerarx.c b/drivers/media/platform/ti-vpe/cal-camerarx.c >> index 82392499e663..3ca629278fd4 100644 >> --- a/drivers/media/platform/ti-vpe/cal-camerarx.c >> +++ b/drivers/media/platform/ti-vpe/cal-camerarx.c >> @@ -801,6 +801,8 @@ struct cal_camerarx *cal_camerarx_create(struct cal_dev *cal, >> phy->cal = cal; >> phy->instance = instance; >> >> + spin_lock_init(&phy->vc_lock); >> + >> phy->res = platform_get_resource_byname(pdev, IORESOURCE_MEM, >> (instance == 0) ? >> "cal_rx_core0" : >> diff --git a/drivers/media/platform/ti-vpe/cal.c b/drivers/media/platform/ti-vpe/cal.c >> index 7a92bb9429d4..0594e47a2c88 100644 >> --- a/drivers/media/platform/ti-vpe/cal.c >> +++ b/drivers/media/platform/ti-vpe/cal.c >> @@ -493,7 +493,22 @@ void cal_ctx_unprepare(struct cal_ctx *ctx) >> >> void cal_ctx_start(struct cal_ctx *ctx) >> { >> - ctx->sequence = 0; >> + struct cal_camerarx *phy = ctx->phy; >> + >> + /* >> + * Reset the frame number & sequence number, but only if the >> + * virtual channel is not already in use. >> + */ >> + >> + spin_lock(&phy->vc_lock); >> + >> + if (phy->vc_enable_count[ctx->vc]++ == 0) { >> + phy->vc_frame_number[ctx->vc] = 0; >> + phy->vc_sequence[ctx->vc] = 0; >> + } >> + >> + spin_unlock(&phy->vc_lock); >> + >> ctx->dma.state = CAL_DMA_RUNNING; >> >> /* Configure the CSI-2, pixel processing and write DMA contexts. */ >> @@ -513,8 +528,15 @@ void cal_ctx_start(struct cal_ctx *ctx) >> >> void cal_ctx_stop(struct cal_ctx *ctx) >> { >> + struct cal_camerarx *phy = ctx->phy; >> long timeout; >> >> + WARN_ON(phy->vc_enable_count[ctx->vc] == 0); >> + >> + spin_lock(&phy->vc_lock); >> + phy->vc_enable_count[ctx->vc]--; >> + spin_unlock(&phy->vc_lock); > > Can cal_ctx_start() and cal_ctx_stop() be called concurrently, aren't > they serialized at higher levels ? You could drop the spinlock in that > case. No, I don't there's anything serializing these calls. >> + >> /* >> * Request DMA stop and wait until it completes. If completion times >> * out, forcefully disable the DMA. >> @@ -586,7 +608,6 @@ static inline void cal_irq_wdma_start(struct cal_ctx *ctx) >> static inline void cal_irq_wdma_end(struct cal_ctx *ctx) >> { >> struct cal_buffer *buf = NULL; >> - > > Doesn't checkpatch warn about this ? Oops, I somehow missed that. >> spin_lock(&ctx->dma.lock); >> >> /* If the DMA context was stopping, it is now stopped. */ >> @@ -605,9 +626,28 @@ static inline void cal_irq_wdma_end(struct cal_ctx *ctx) >> spin_unlock(&ctx->dma.lock); >> >> if (buf) { >> + struct cal_dev *cal = ctx->cal; >> + struct cal_camerarx *phy = ctx->phy; >> + u32 prev_frame_num, frame_num; >> + u8 vc = ctx->vc; >> + >> + frame_num = cal_read(cal, CAL_CSI2_STATUS(ctx->phy->instance, >> + ctx->csi2_ctx)) & 0xffff; > > I'm worried about race conditions here, as this is the DMA end IRQ > handler. When it the frame number updated in this register ? Afaik, it's updated when the CSI-2 RX gets the frame-start packet. So, while we'll always be racy, I guess this could use some improvement. If we record the frame_num at wdma start irq, we have more time to avoid the race. Of course, the DMA itself is racy with CAL, and if we get multiple interrupts coalesced to one, I think we can't expect the frame number handling to work flawlessly either. >> + >> + if (phy->vc_frame_number[vc] != frame_num) { >> + prev_frame_num = phy->vc_frame_number[vc]; >> + >> + if (prev_frame_num > frame_num) >> + prev_frame_num = 0; > > If we happen to miss a frame when wrapping around, I don't think this > will produce the right result. We don't know when the counter wraps > around, so there's not much we can do, except perhaps keeping track of > the maximum frame number to figure out when it wraps around. It may be > overkill, I'm not sure. What do you think happens? Afaics, we'll just skip some sequence numbers. Tomi