From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on aws-us-west-2-korg-lkml-1.web.codeaurora.org X-Spam-Level: X-Spam-Status: No, score=-0.8 required=3.0 tests=HEADER_FROM_DIFFERENT_DOMAINS, MAILING_LIST_MULTI,SPF_HELO_NONE,SPF_PASS autolearn=no autolearn_force=no version=3.4.0 Received: from mail.kernel.org (mail.kernel.org [198.145.29.99]) by smtp.lore.kernel.org (Postfix) with ESMTP id 1E2F6C43331 for ; Tue, 24 Mar 2020 10:12:57 +0000 (UTC) Received: from vger.kernel.org (vger.kernel.org [209.132.180.67]) by mail.kernel.org (Postfix) with ESMTP id EF7922080C for ; Tue, 24 Mar 2020 10:12:56 +0000 (UTC) Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1726818AbgCXKM4 (ORCPT ); Tue, 24 Mar 2020 06:12:56 -0400 Received: from mga12.intel.com ([192.55.52.136]:51859 "EHLO mga12.intel.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1726563AbgCXKM4 (ORCPT ); Tue, 24 Mar 2020 06:12:56 -0400 IronPort-SDR: ONy+KHuQnvhIFMBxhlt4bGuEMyvF3G5xn1lhLM4Qmj/Zr7YikQP71n3jidde7+Bjxid2g/2ffs b1ev65+LXj/Q== X-Amp-Result: SKIPPED(no attachment in message) X-Amp-File-Uploaded: False Received: from orsmga006.jf.intel.com ([10.7.209.51]) by fmsmga106.fm.intel.com with ESMTP/TLS/ECDHE-RSA-AES256-GCM-SHA384; 24 Mar 2020 03:12:55 -0700 IronPort-SDR: H1G/Zvt1jaTSGbi+Yl2JDyCmzkOd2vLqEQ+npdJn5EvBgeUUvy7jdmCw8AJIOFp+tVH24c7yXm 69TYpwV1Ya4g== X-ExtLoop1: 1 X-IronPort-AV: E=Sophos;i="5.72,300,1580803200"; d="scan'208";a="249996635" Received: from smile.fi.intel.com (HELO smile) ([10.237.68.40]) by orsmga006.jf.intel.com with ESMTP; 24 Mar 2020 03:12:42 -0700 Received: from andy by smile with local (Exim 4.93) (envelope-from ) id 1jGgY7-00CUuU-F9; Tue, 24 Mar 2020 12:12:43 +0200 Date: Tue, 24 Mar 2020 12:12:43 +0200 From: Andy Shevchenko To: Sergey Semin Cc: Maxime Ripard , Greg Kroah-Hartman , Jiri Slaby , Alexey Malahov , Maxim Kaurkin , Pavel Parkhomenko , Ramil Zaripov , Ekaterina Skachko , Vadim Vlasov , Thomas Bogendoerfer , Paul Burton , Ralf Baechle , Chen-Yu Tsai , Ray Jui , Scott Branden , Florian Fainelli , Wei Xu , Jason Cooper , Andrew Lunn , Gregory Clement , Sebastian Hesselbarth , Jisheng Zhang , Heiko Stuebner , Catalin Marinas , Will Deacon , Russell King , linux-arm-kernel@lists.infradead.org, Michael Turquette , Stephen Boyd , linux-clk@vger.kernel.org, Heikki Krogerus , Kefeng Wang , linux-serial@vger.kernel.org, linux-kernel@vger.kernel.org Subject: Re: [PATCH v2] serial: 8250_dw: Fix common clocks usage race condition Message-ID: <20200324101243.GG1922688@smile.fi.intel.com> References: <20200306130231.05BBC8030795@mail.baikalelectronics.ru> <20200323024611.16039-1-Sergey.Semin@baikalelectronics.ru> <20200323100109.k2gckdyneyzo23fb@gilmour.lan> <20200323135017.4vi5nwam2rlpepgn@ubsrv2.baikal.int> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <20200323135017.4vi5nwam2rlpepgn@ubsrv2.baikal.int> Organization: Intel Finland Oy - BIC 0357606-4 - Westendinkatu 7, 02160 Espoo Sender: linux-serial-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-serial@vger.kernel.org On Mon, Mar 23, 2020 at 04:50:17PM +0300, Sergey Semin wrote: > On Mon, Mar 23, 2020 at 11:01:09AM +0100, Maxime Ripard wrote: > > On Mon, Mar 23, 2020 at 05:46:09AM +0300, Sergey.Semin@baikalelectronics.ru wrote: > > > From: Serge Semin > > > > > > There are races possible in the dw8250_set_termios() callback method > > > and while the device is in PM suspend state. A race condition may > > > happen if the baudrate clock source device is shared with some other > > > device (in our machine it's another DW UART port). In this case if that > > > device changes the clock rate while serial console is using it the > > > DW 8250 UART port might not only end up with an invalid uartclk value > > > saved, but may also experience a distorted output data since baud-clock > > > could have been changed. In order to fix this lets enable an exclusive > > > reference clock rate access in case if "baudclk" device is specified. > > > > > > So if some other device also acquires the rate exclusivity during the > > > time of a DW UART 8250 port being opened, then DW UART 8250 driver > > > won't be able to alter the baud-clock. It shall just use the available > > > clock rate. Similarly another device also won't manage to change the > > > rate at that time. If nothing else have the exclusive rate access > > > acquired except DW UART 8250 driver, then the driver will be able to > > > alter the rate as much as it needs to in accordance with the currently > > > implemented logic. > > clk_rate_exclusive_get is pretty intrusive, and due to the usual > > topology of clock trees, this will lock down 3-4 parent clocks to > > their current rate as well. In the Allwinner SoCs case for example, > > this will lock down the same PLL than the one used by the CPU, > > preventing cpufreq from running. > > Speaking about weak design of a SoC' clock tree. Our problems are nothing > with respect to the Allwinner SoC, in which case of changing the > CPU-frequency may cause the UART glitches subsequently causing data > transfer artefacts.) Moreover as I can see the same issue may raise for > I2C, QSPI, PWM devices there. > > Anyway your concern does make sense. > > > However, the 8250 has a pretty wide range of dividers and can adapt to > > any reasonable parent clock rate, so we don't really need to lock the > > rate either, we can simply react to a parent clock rate change using > > the clock notifiers, just like the SiFive UART is doing. > > > > I tried to do that, but given that I don't really have an extensive > > knowledge of the 8250, I couldn't find a way to stop the TX of chars > > while we change the clock rate. I'm not sure if this is a big deal or > > not, the SiFive UART doesn't seem to care. > > Yes, your solution is also possible, but even in case of stopping Tx/Rx it > doesn't lack drawbacks. First of all AFAIK there is no easy way to just > pause the transfers. We'd have to first wait for the current transfers > to be completed, then somehow lock the port usage (both Tx and Rx > traffic), permit the reference clock rate change, accordingly adjust the > UART clock divider, and finally unlock the port. While if we don't mind > to occasionally have UART data glitches, we can just adjust the UART ref > divider synchronously with ref clock rate change as you and SiFive UART > driver suggest. > > So we are now at a zugzwang - a fork to three not that good solutions: > 1) lock the whole clock branch and provide a glitchless interfaces. But > by doing so we may (in case of Allwinner SoCs we will) lockup some very > important functionality like CPU-frequency change while the UART port is > started up. In this case we won't have the data glitches. > 2) just adjust the UART clock divider in case of reference clock rate > change (use the SiFive UART driver approach). In this case we may have the > data corruption. > 3) somehow implement the algo: wait for the transfers to be completed, > lock UART interface (it's possible for Tx, but for Rx in case of no handshake > enabled it's simply impossible), permit the ref clock rate change, > adjust the UART divider, then unlock the UART interface. In this case the data > glitches still may happen (if no modem control is available or > handshakes are disabled). > > As for the cases of Baikal-T1 UARTs the first solutions is the most suitable. > We don't lock anything valuable, since a base PLL output isn't directly > connected to any device and it's rate once setup isn't changed during the > system running. On the other hand I don't mind to implement the second > solution, even though it's prone to data glitches. Regarding the solution > 3) I won't even try. It's too complicated, I don't have time and > test-infrastructure for this. > > So Andy what do you think? >From Intel HW perspective the first two are okay, but since Maxime is against first, you have the only option from your list. Perhaps somebody may give option 4) here... -- With Best Regards, Andy Shevchenko