From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1752937AbcHQRas (ORCPT ); Wed, 17 Aug 2016 13:30:48 -0400 Received: from mx1.redhat.com ([209.132.183.28]:46068 "EHLO mx1.redhat.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1752145AbcHQRaq (ORCPT ); Wed, 17 Aug 2016 13:30:46 -0400 Date: Wed, 17 Aug 2016 19:30:27 +0200 From: Oleg Nesterov To: Bart Van Assche Cc: Peter Zijlstra , "mingo@kernel.org" , Andrew Morton , Johannes Weiner , Neil Brown , Michael Shaver , "linux-kernel@vger.kernel.org" Subject: Re: [PATCH] sched: Avoid that __wait_on_bit_lock() hangs Message-ID: <20160817173027.GA13187@redhat.com> References: <3cec7657-caa9-92ca-9f0e-34f073a6ed8c@sandisk.com> <20160810104555.GA3333@redhat.com> <4d2e02f8-c7da-ee1a-1068-25492cbffebe@sandisk.com> <20160811173651.GA31803@redhat.com> <20160812161611.GC30930@redhat.com> <20160813163208.GA938@redhat.com> <20160816130559.GA14022@redhat.com> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: User-Agent: Mutt/1.5.18 (2008-05-17) X-Greylist: Sender IP whitelisted, not delayed by milter-greylist-4.5.16 (mx1.redhat.com [10.5.110.38]); Wed, 17 Aug 2016 17:30:45 +0000 (UTC) Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On 08/16, Bart Van Assche wrote: > > On 08/16/2016 06:06 AM, Oleg Nesterov wrote: >> If only I could reproduce. Or at least understand what are you doing to >> hit this bug ;) > > Hello Oleg, > > What I'm doing to hit this bug is to run the test script that is > available at https://github.com/bvanassche/srp-test on a setup that is > equipped with at least one InfiniBand adapter. I see the following > possibilities for you to reproduce this: > * Ask a colleague for access to an IB setup. > * Add RoCE support to the srp-test script and run that script against a > v4.8 kernel + ib_srp-backport + SCST ib_srpt drivers. These last two > (out-of-tree) drivers namely support SRP over RoCE. The upstream > drivers not yet. The SRP-over-RoCE functionality will be sent > upstream as soon as standardization of this protocol by the T10 > committee has finished (this work has already been started and will > probably be finished later this year). > > Please let me know if you need more information. Heh ;) I can't understand any single word above. So I'll give up. Previously you reported that this patch http://marc.info/?l=linux-kernel&m=147085570503588 the problem goes away. In this case the next one http://marc.info/?l=linux-kernel&m=147101858416463 could give us more info but you didn't try it so far (without other changes). It seems you find the root of this problem somewhere else, hopefully you will resolve it soon. Oleg.