From: John Stultz
Date: Wed, 20 Jul 2016 22:22:02 -0700
Subject: Re: [PATCH 6/7] k3dma: Fix occasional DMA ERR issue by using proper dma api
To: zhangfei
Cc: lkml, Jingoo Han, Krzysztof Kozlowski, Maxime Ripard, Vinod Koul, Dan Williams, Mark Brown, Andy Green
In-Reply-To: <57904F08.7020109@linaro.org>
References: <1469073189-9167-1-git-send-email-john.stultz@linaro.org> <1469073189-9167-7-git-send-email-john.stultz@linaro.org> <57904F08.7020109@linaro.org>

On Wed, Jul 20, 2016 at 9:26 PM, zhangfei wrote:
>
> On 07/21/2016 11:53 AM, John Stultz wrote:
>>
>> After lots of debugging on an occasional DMA ERR issue, I realized
>> that the desc structures which we point the dma hardware are being
>> allocated out of regular memory. This means when we fill the desc
>> structures, that data doesn't always get flushed out to memory by
>> the time we start the dma transfer, resulting in the dma engine getting
>> some null values, resulting in a DMA ERR on the first irq.
>
> How about using wmb() flush before start dma to sync desc?

So I'm not going to pretend to be an expert here, but my understanding
is that wmb() synchronizes cpu write ordering operations across cpus,
so the cpus see all the changes before the wmb() before they see any
changes after. But I'm not sure what effect wmb() has across cpu cache
to device ordering. I don't think it works as a cache flush to memory.
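To make the distinction above concrete, here is a toy userspace model in C. It is only a sketch under stated assumptions: it is not kernel code and not taken from the k3dma patch, every name in it (toy_system, cpu_write_cached, toy_write_barrier and so on) is hypothetical, and it simply encodes the suspicion above that an ordering barrier moves no data out of the cpu cache, while the device only ever reads memory.

```c
#include <stdbool.h>
#include <stdint.h>
#include <string.h>

/* Toy model only: none of these names are kernel APIs. */
#define DESC_WORDS 4

struct toy_system {
	uint32_t ram[DESC_WORDS];   /* memory as the DMA engine sees it */
	uint32_t cache[DESC_WORDS]; /* contents of the cpu write-back cache */
	bool dirty[DESC_WORDS];     /* cached words not yet written back */
};

void toy_init(struct toy_system *s)
{
	memset(s, 0, sizeof(*s));
}

/* Store to ordinary cacheable memory: the value lands in the cache only. */
void cpu_write_cached(struct toy_system *s, int i, uint32_t v)
{
	s->cache[i] = v;
	s->dirty[i] = true;
}

/* Store through an uncached (coherent) mapping: goes straight to memory. */
void cpu_write_coherent(struct toy_system *s, int i, uint32_t v)
{
	s->ram[i] = v;
}

/*
 * Ordering-only barrier. Assumption of this model: it constrains the
 * order in which writes become visible but writes nothing back.
 */
void toy_write_barrier(struct toy_system *s)
{
	(void)s;
}

/* Explicit cache clean: write every dirty word back to memory. */
void toy_cache_clean(struct toy_system *s)
{
	for (int i = 0; i < DESC_WORDS; i++) {
		if (s->dirty[i]) {
			s->ram[i] = s->cache[i];
			s->dirty[i] = false;
		}
	}
}

/* Device read: bypasses the cpu cache entirely. */
uint32_t dma_read(const struct toy_system *s, int i)
{
	return s->ram[i];
}
```

In this model, a cpu_write_cached() followed by toy_write_barrier() still leaves dma_read() returning the stale zero, which is the same kind of null value in the desc as described in the patch. Only toy_cache_clean(), or writing through cpu_write_coherent() in the first place, makes the data visible to the device. Real hardware is more subtle than this, so treat it as an illustration of the argument rather than a description of any particular SoC.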
Andy's patch introducing the cyclic support actually had a wmb() in it
that I removed as I couldn't understand clearly why it was there (and
there wasn't a comment explaining, as required by checkpatch :). But
even with that wmb(), the DMA ERR was still seen. Only with these two
new changes have I gotten to the point where I can't seem to trigger
the DMA error.

thanks
-john