From mboxrd@z Thu Jan 1 00:00:00 1970
Return-Path:
Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand
	id S1755842AbaAFUqq (ORCPT ); Mon, 6 Jan 2014 15:46:46 -0500
Received: from mail-pb0-f48.google.com ([209.85.160.48]:63041 "EHLO
	mail-pb0-f48.google.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org
	with ESMTP id S1753702AbaAFUqo (ORCPT );
	Mon, 6 Jan 2014 15:46:44 -0500
Date: Mon, 6 Jan 2014 12:46:41 -0800
From: Kent Overstreet
To: Shaohua Li
Cc: linux-kernel@vger.kernel.org, axboe@kernel.dk, hch@infradead.org
Subject: Re: [patch 1/2]percpu_ida: fix a live lock
Message-ID: <20140106204641.GB9037@kmo>
References: <20131231033827.GA31994@kernel.org>
	<20140104210804.GA24199@kmo-pixel>
	<20140105131300.GB4186@kernel.org>
MIME-Version: 1.0
Content-Type: text/plain; charset=us-ascii
Content-Disposition: inline
In-Reply-To: <20140105131300.GB4186@kernel.org>
User-Agent: Mutt/1.5.21 (2010-09-15)
Sender: linux-kernel-owner@vger.kernel.org
List-ID:
X-Mailing-List: linux-kernel@vger.kernel.org

On Sun, Jan 05, 2014 at 09:13:00PM +0800, Shaohua Li wrote:
> On Sat, Jan 04, 2014 at 01:08:04PM -0800, Kent Overstreet wrote:
> > On Tue, Dec 31, 2013 at 11:38:27AM +0800, Shaohua Li wrote:
> > >
> > > steal_tags only happens when free tags is more than half of the total tags.
> > > This is too restrict and can cause live lock. I found one cpu has free tags,
> > > but other cpu can't steal (thread is bound to specific cpus), threads which
> > > wants to allocate tags are always sleeping. I found this when I run next patch,
> > > but this could happen without it I think.
> > >
> > > I did performance test too with null_blk. Two cases (each cpu has enough percpu
> > > tags, or total tags are limited) haven't performance changes.
> >
> > This doesn't appear to me to fix anything wrong with the current code
> > (and it'll hurt performance)
>
> I suspects this hurts performance too, but it doesn't actually in my test.
>
> > - we explicitly don't guarantee that all
> > the tags will be available for allocation at any given time, only half
> > of them.
>
> only half of the tags can be used? this is scaring. Of course we hope all tags
> are available.

No: that behaviour is explicitly documented and it's the behaviour we want.

There is an inherent performance tradeoff here between internal fragmentation
and performance: your patch tries to drive internal fragmentation to 0, but
under workloads that hit this, running on enough cores, this will have a VERY
high performance cost.

> > Can you explain more how this is being used where you're seeing
> > the issue? And I don't see the other patch in your patch series.
>
> tasks are bound to specific cpus. And I saw one cpu has a lot of free (but less
> than half of total tags), but other cpus can't allocate tags, so I saw a live
> lock.

You're using it wrong - don't assume all nr_tags can be allocated. If you need
to guarantee that you'll always be able to allocate a certain number of tags,
you pass twice that to percpu_ida_init() instead.

Nacking this patch (and if you want me to look at it more you still need to
send me the other patch in the series and show me your code that's using it).
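
The sizing rule above (request twice the number of tags you need guaranteed)
and the stealing threshold described earlier in the thread can be sketched in
plain userspace C. This is only an illustration under stated assumptions, not
the lib/percpu_ida.c code: tags_to_request() and may_steal() are hypothetical
names that do not exist in the kernel, and may_steal() models the "more than
half of the total tags" condition only as it is worded in this thread.

```c
#include <assert.h>
#include <limits.h>

/*
 * Hypothetical helper (not a kernel function): how many tags to ask
 * percpu_ida_init() for so that `needed` tags can always be allocated,
 * given that only half of the pool is guaranteed to be available.
 */
static unsigned long tags_to_request(unsigned long needed)
{
	assert(needed <= ULONG_MAX / 2);	/* doubling must not overflow */
	return 2 * needed;
}

/*
 * Simplified model of the rule as described in this thread: free tags
 * sitting on other cpus' freelists are only stolen once they add up to
 * more than half of the pool. Returns 1 if stealing would be attempted.
 */
static int may_steal(unsigned long free_on_other_cpus, unsigned long nr_tags)
{
	return free_on_other_cpus > nr_tags / 2;
}
```

Under this model, with a 128-tag pool, 40 free tags stranded on one cpu are not
stolen (40 is not more than 64), which matches the situation reported above. A
caller that needs 64 tags to be allocatable at all times would, following the
advice in this mail, size the pool at tags_to_request(64), i.e. 128.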