All of lore.kernel.org
 help / color / mirror / Atom feed
From: Tetsuo Handa <penguin-kernel@I-love.SAKURA.ne.jp>
To: peterz@infradead.org, mhocko@kernel.org
Cc: linux-kernel@vger.kernel.org, mingo@redhat.com,
	tglx@linutronix.de, hpa@zytor.com, davem@davemloft.net,
	tony.luck@intel.com, akpm@linux-foundation.org, chris@zankel.net,
	jcmvbkbc@gmail.com, dave@stgolabs.net, Waiman.Long@hpe.com
Subject: Re: [PATCH 03/11] locking, rwsem: introduce basis for down_write_killable
Date: Tue, 10 May 2016 22:57:34 +0900	[thread overview]
Message-ID: <201605102257.HBE12981.LOOJFQtOSMFFHV@I-love.SAKURA.ne.jp> (raw)
In-Reply-To: <20160510123806.GB3193@twins.programming.kicks-ass.net>

Michal Hocko wrote:
> On Tue 10-05-16 19:43:20, Tetsuo Handa wrote:
> > I hit "allowing the OOM killer to select the same thread again" problem
> > ( http://lkml.kernel.org/r/20160408113425.GF29820@dhcp22.suse.cz ), but
> > I think that there is a bug in down_write_killable() series (at least
> > "locking, rwsem: introduce basis for down_write_killable" patch).
> > 
> > Complete log is at http://I-love.SAKURA.ne.jp/tmp/serial-20160510-sem.txt.xz .
> [...]
> > 2 threads (PID: 1314 and 1443) are sleeping at rwsem_down_read_failed()
> > but no thread is sleeping at rwsem_down_write_failed_killable().
> > If there is no thread waiting for write lock, threads waiting for read
> > lock must be able to run. This suggests that one of threads which was
> > waiting for write lock forgot to wake up reader threads.
> 
> Or that the write lock holder is still keeping the lock held. I do not
> see such a process in your list though. Is it possible that the
> debug_show_all_locks would just miss it as it is not sleeping?

I don't think it is possible. This reproducer
( http://lkml.kernel.org/r/201605061958.HHG48967.JVFtSLFQOFOOMH@I-love.SAKURA.ne.jp )
creates a thread group with two threads, and two of these two threads are
sleeping at rwsem_down_read_failed() waiting for mmap_sem.
SysRq-t suggests that PID 1443 called rwsem_down_write_failed_killable()
before calling rwsem_down_read_failed().

By the way, I suggested you to show traces of threads which are using the OOM victim's mm
( http://lkml.kernel.org/r/201603172200.CIE52148.QOVSOHJFMLOFtF@I-love.SAKURA.ne.jp ),
but you said that showing all locks held by !TASK_RUNNING threads would be useful
( http://lkml.kernel.org/r/20160323120716.GE7059@dhcp22.suse.cz ).
Do you admit that debug_show_all_locks() is not always useful by suspecting
the possibility of debug_show_all_locks() failing to report a thread which
held mmap_sem for write? (This is a kmallocwd topic, so I stop here.)

> 
> > Looking at rwsem_down_read_failed(), reader threads waiting for the
> > writer thread to release the lock are waiting on sem->wait_list list.
> > Looking at __rwsem_down_write_failed_common(), when the writer thread
> > escaped the
> > 
> >                  /* Block until there are no active lockers. */
> >                  do {
> >                          if (signal_pending_state(state, current)) {
> >                                  raw_spin_lock_irq(&sem->wait_lock);
> >                                  ret = ERR_PTR(-EINTR);
> >                                  goto out;
> >                          }
> >                          schedule();
> >                          set_current_state(state);
> >                  } while ((count = sem->count) & RWSEM_ACTIVE_MASK);
> > 
> > loop due to SIGKILL, I think that the writer thread needs to check for
> > remaining threads on sem->wait_list list and wake up reader threads
> > before rwsem_down_write_failed_killable() returns -EINTR.
> 
> I am not sure I understand. The rwsem counter is not write locked while
> the thread is sleeping and when we fail on the signal pending so readers
> should be able to proceed, no?
> 
I guess __rwsem_do_wake() is needed for waking up the readers because
I guess the sequence occurred was

  (1) PID 1314 requested down_read() and succeeded.
  (2) PID 1443 requested down_write_killable() and blocked.
  (3) The OOM killer sent SIGKILL to PID 1314 and PID 1443.
  (4) PID 1443 left down_write_killable() with -EINTR.
  (5) PID 1314 called up_read() and down_read()
      while PID 1443 called down_read().

.

> Or are you suggesting that the failure path should call rwsem_wake?

I don't know how rwsem works. Please consult maintainers.

Peter Zijlstra wrote:
> Mutex is much simpler; it doesn't have to do the reader-vs-writer
> fairness thing.
> 
> However, at the time I was thinking that if we have:
> 
> 	reader (owner)
> 	writer (pending)
> 	reader (blocked on writer)
> 
> and writer would get cancelled, the up_read() would do a wakeup and kick
> the blocked reader.
> 
> But yes, immediately kicking further pending waiters might be better.
> 
> Also, looking at it again; I think we're forgetting to re-adjust the
> BIAS for the cancelled writer.

Yes, I think so.
> 
> Davidlohr, Waiman, can you look at this?
> 

  reply	other threads:[~2016-05-10 13:58 UTC|newest]

Thread overview: 226+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2016-02-29 12:58 [PATCH 0/11] introduce down_write_killable for rw_semaphore Michal Hocko
2016-02-29 12:58 ` Michal Hocko
2016-02-29 12:58 ` [PATCH 01/11] locking, rwsem: get rid of __down_write_nested Michal Hocko
2016-02-29 12:58   ` Michal Hocko
2016-02-29 12:58 ` [PATCH 02/11] locking, rwsem: drop explicit memory barriers Michal Hocko
2016-02-29 12:58   ` Michal Hocko
2016-02-29 12:58 ` [PATCH 03/11] locking, rwsem: introduce basis for down_write_killable Michal Hocko
2016-02-29 12:58   ` Michal Hocko
2016-03-30 13:25   ` Peter Zijlstra
2016-03-30 13:25     ` Peter Zijlstra
2016-03-31  8:33     ` Michal Hocko
2016-03-31  8:33       ` Michal Hocko
2016-03-31  8:44       ` Peter Zijlstra
2016-03-31  8:44         ` Peter Zijlstra
2016-03-31  8:55   ` [PATCH] " Michal Hocko
2016-03-31  8:55     ` Michal Hocko
2016-02-29 12:58 ` [PATCH 04/11] alpha, rwsem: provide __down_write_killable Michal Hocko
2016-02-29 12:58   ` Michal Hocko
2016-02-29 12:58 ` [PATCH 05/11] ia64, " Michal Hocko
2016-02-29 12:58   ` Michal Hocko
2016-02-29 12:58 ` [PATCH 06/11] s390, " Michal Hocko
2016-02-29 12:58   ` Michal Hocko
2016-02-29 12:58 ` [PATCH 07/11] sh, " Michal Hocko
2016-02-29 12:58   ` Michal Hocko
2016-02-29 12:58 ` [PATCH 08/11] sparc, " Michal Hocko
2016-02-29 12:58   ` Michal Hocko
2016-02-29 12:58 ` [PATCH 09/11] xtensa, " Michal Hocko
2016-02-29 12:58   ` Michal Hocko
2016-02-29 12:58 ` [PATCH 10/11] x86, " Michal Hocko
2016-02-29 12:58   ` Michal Hocko
2016-02-29 12:58 ` [PATCH 11/11] locking, rwsem: provide down_write_killable Michal Hocko
2016-02-29 12:58   ` Michal Hocko
2016-03-30 13:32 ` [PATCH 0/11] introduce down_write_killable for rw_semaphore Peter Zijlstra
2016-03-30 13:32   ` Peter Zijlstra
2016-03-31  8:59   ` Michal Hocko
2016-03-31  8:59     ` Michal Hocko
2016-03-31  9:20     ` Ingo Molnar
2016-03-31  9:20       ` Ingo Molnar
2016-03-31 10:58       ` Michal Hocko
2016-03-31 10:58         ` Michal Hocko
2016-03-31 17:03       ` Andrew Morton
2016-03-31 17:03         ` Andrew Morton
2016-04-01  6:33         ` Ingo Molnar
2016-04-01  6:33           ` Ingo Molnar
2016-04-01  9:21           ` Michal Hocko
2016-04-01  9:21             ` Michal Hocko
2016-04-01  9:50             ` Ingo Molnar
2016-04-01  9:50               ` Ingo Molnar
2016-04-01 10:52               ` Michal Hocko
2016-04-01 10:52                 ` Michal Hocko
2016-04-01  7:26         ` Michal Hocko
2016-04-01  7:26           ` Michal Hocko
2016-04-01  9:11           ` Andrew Morton
2016-04-01  9:11             ` Andrew Morton
2016-04-01 11:04 ` [PATCH 0/11] introduce down_write_killable for rw_semaphore v2 Michal Hocko
2016-04-01 11:04   ` Michal Hocko
2016-04-01 11:04   ` [PATCH 01/11] locking, rwsem: get rid of __down_write_nested Michal Hocko
2016-04-01 11:04     ` Michal Hocko
2016-04-02  0:28     ` Davidlohr Bueso
2016-04-02  0:28       ` Davidlohr Bueso
2016-04-01 11:04   ` [PATCH 02/11] locking, rwsem: drop explicit memory barriers Michal Hocko
2016-04-01 11:04     ` Michal Hocko
2016-04-02  1:17     ` Davidlohr Bueso
2016-04-02  1:17       ` Davidlohr Bueso
2016-04-04  9:03       ` Michal Hocko
2016-04-04  9:03         ` Michal Hocko
2016-04-04  9:06         ` [PATCH 1/2] xtensa, rwsem: drop superfluous arch specific implementation Michal Hocko
2016-04-04  9:06           ` Michal Hocko
2016-04-04  9:06           ` [PATCH 2/2] sh, " Michal Hocko
2016-04-04  9:06             ` Michal Hocko
2016-04-06  9:26             ` Peter Zijlstra
2016-04-06  9:26               ` Peter Zijlstra
2016-04-06  9:50               ` Geert Uytterhoeven
2016-04-06  9:50                 ` Geert Uytterhoeven
2016-04-06 10:27                 ` Peter Zijlstra
2016-04-06 10:27                   ` Peter Zijlstra
2016-04-04 10:23           ` [PATCH 1/2] xtensa, " Max Filippov
2016-04-04 10:23             ` Max Filippov
2016-04-06  9:06       ` [PATCH] sparc, " Michal Hocko
2016-04-06  9:06         ` Michal Hocko
2016-04-06  9:06         ` Michal Hocko
2016-04-06  9:06         ` Michal Hocko
2016-04-01 11:04   ` [PATCH 03/11] locking, rwsem: introduce basis for down_write_killable Michal Hocko
2016-04-01 11:04     ` Michal Hocko
2016-04-02  4:41     ` Davidlohr Bueso
2016-04-02  4:41       ` Davidlohr Bueso
2016-04-04  9:17       ` Michal Hocko
2016-04-04  9:17         ` Michal Hocko
2016-04-04  9:21         ` Peter Zijlstra
2016-04-04  9:21           ` Peter Zijlstra
2016-04-07  6:58         ` Davidlohr Bueso
2016-04-07  6:58           ` Davidlohr Bueso
2016-04-07  7:38           ` Michal Hocko
2016-04-07  7:38             ` Michal Hocko
2016-05-10 10:43     ` Tetsuo Handa
2016-05-10 11:53       ` Michal Hocko
2016-05-10 12:38         ` Peter Zijlstra
2016-05-10 13:57           ` Tetsuo Handa [this message]
2016-05-11  7:23           ` Michal Hocko
2016-05-11  8:28             ` Michal Hocko
2016-05-11  8:44               ` Peter Zijlstra
2016-05-11  9:04                 ` Michal Hocko
2016-05-11  9:17                   ` Peter Zijlstra
2016-05-11  9:31                     ` Michal Hocko
2016-05-11  9:41                       ` Peter Zijlstra
2016-05-11 13:59                         ` Michal Hocko
2016-05-11 18:03                           ` Michal Hocko
2016-05-12 11:57                             ` [PATCH] locking, rwsem: Fix down_write_killable() Peter Zijlstra
2016-05-12 12:15                               ` [tip:locking/rwsem] locking/rwsem: " tip-bot for Peter Zijlstra
2016-05-12 16:59                               ` [PATCH] locking, rwsem: " Michal Hocko
2016-05-15 20:57                               ` [tip:locking/rwsem] locking/rwsem: " tip-bot for Peter Zijlstra
2016-05-12 12:12                             ` [PATCH 03/11] locking, rwsem: introduce basis for down_write_killable Peter Zijlstra
2016-05-12 12:19                               ` Michal Hocko
2016-05-12 13:58                                 ` Peter Zijlstra
2016-05-12 19:42                                 ` Waiman Long
2016-05-11  8:35             ` Peter Zijlstra
2016-05-11  9:02               ` Michal Hocko
2016-04-01 11:04   ` [PATCH 04/11] alpha, rwsem: provide __down_write_killable Michal Hocko
2016-04-01 11:04     ` Michal Hocko
2016-04-01 11:04   ` [PATCH 05/11] ia64, " Michal Hocko
2016-04-01 11:04     ` Michal Hocko
2016-04-01 11:04   ` [PATCH 06/11] s390, " Michal Hocko
2016-04-01 11:04     ` Michal Hocko
2016-04-01 11:04   ` [PATCH 07/11] sh, " Michal Hocko
2016-04-01 11:04     ` Michal Hocko
2016-04-01 11:04   ` [PATCH 08/11] sparc, " Michal Hocko
2016-04-01 11:04     ` Michal Hocko
2016-04-01 11:04   ` [PATCH 09/11] xtensa, " Michal Hocko
2016-04-01 11:04     ` Michal Hocko
2016-04-01 11:04   ` [PATCH 10/11] x86, " Michal Hocko
2016-04-01 11:04     ` Michal Hocko
2016-04-06 18:31     ` Peter Zijlstra
2016-04-06 18:31       ` Peter Zijlstra
2016-04-01 11:04   ` [PATCH 11/11] locking, rwsem: provide down_write_killable Michal Hocko
2016-04-01 11:04     ` Michal Hocko
2016-04-07 15:12 ` [PATCH 0/11] introduce down_write_killable for rw_semaphore v3 Michal Hocko
2016-04-07 15:12   ` Michal Hocko
2016-04-07 15:12   ` [PATCH 01/11] locking, rwsem: get rid of __down_write_nested Michal Hocko
2016-04-07 15:12     ` Michal Hocko
2016-04-13 11:32     ` [tip:locking/rwsem] locking/rwsem: Get rid of __down_write_nested() tip-bot for Michal Hocko
2016-04-07 15:12   ` [PATCH 02/11] locking, rwsem: drop explicit memory barriers Michal Hocko
2016-04-07 15:12     ` Michal Hocko
2016-04-13 11:32     ` [tip:locking/rwsem] locking/rwsem: Drop " tip-bot for Michal Hocko
2016-04-07 15:12   ` [PATCH 03/11] xtensa, rwsem: drop superfluous arch specific implementation Michal Hocko
2016-04-07 15:12     ` Michal Hocko
2016-04-13 11:33     ` [tip:locking/rwsem] locking/rwsem, xtensa: Drop " tip-bot for Michal Hocko
2016-04-07 15:12   ` [PATCH 04/11] sh, rwsem: drop " Michal Hocko
2016-04-07 15:12     ` Michal Hocko
2016-04-13 11:33     ` [tip:locking/rwsem] locking/rwsem, sh: Drop " tip-bot for Michal Hocko
2016-04-07 15:12   ` [PATCH 05/11] sparc, rwsem: drop " Michal Hocko
2016-04-07 15:12     ` Michal Hocko
2016-04-13 11:33     ` [tip:locking/rwsem] locking/rwsem, sparc: Drop " tip-bot for Michal Hocko
2016-04-07 15:12   ` [PATCH 06/11] locking, rwsem: introduce basis for down_write_killable Michal Hocko
2016-04-07 15:12     ` Michal Hocko
2016-04-13 11:34     ` [tip:locking/rwsem] locking/rwsem: Introduce basis for down_write_killable() tip-bot for Michal Hocko
2016-04-07 15:12   ` [PATCH 07/11] alpha, rwsem: provide __down_write_killable Michal Hocko
2016-04-07 15:12     ` Michal Hocko
2016-04-22  9:42     ` [tip:locking/rwsem] locking/rwsem, alpha: Provide __down_write_killable() tip-bot for Michal Hocko
2016-04-07 15:12   ` [PATCH 08/11] ia64, rwsem: provide __down_write_killable Michal Hocko
2016-04-07 15:12     ` Michal Hocko
2016-04-07 17:28     ` Sergei Shtylyov
2016-04-07 17:28       ` Sergei Shtylyov
2016-04-08  6:22       ` Michal Hocko
2016-04-08  6:22         ` Michal Hocko
2016-04-22  9:43     ` [tip:locking/rwsem] locking/rwsem, ia64: Provide __down_write_killable() tip-bot for Michal Hocko
2016-04-07 15:12   ` [PATCH 09/11] s390, rwsem: provide __down_write_killable Michal Hocko
2016-04-07 15:12     ` Michal Hocko
2016-04-22  9:43     ` [tip:locking/rwsem] locking/rwsem, s390: Provide __down_write_killable() tip-bot for Michal Hocko
2016-04-07 15:12   ` [PATCH 10/11] x86, rwsem: provide __down_write_killable Michal Hocko
2016-04-07 15:12     ` Michal Hocko
2016-04-13  9:08     ` Ingo Molnar
2016-04-13  9:08       ` Ingo Molnar
2016-04-13  9:16       ` Michal Hocko
2016-04-13  9:16         ` Michal Hocko
2016-04-13  9:19         ` Ingo Molnar
2016-04-13  9:19           ` Ingo Molnar
2016-04-13 10:27           ` Ingo Molnar
2016-04-13 10:27             ` Ingo Molnar
2016-04-13 12:49             ` Michal Hocko
2016-04-13 12:49               ` Michal Hocko
2016-04-13 12:49               ` Michal Hocko
2016-04-13 12:49               ` Michal Hocko
2016-04-17 16:59               ` Michal Hocko
2016-04-17 16:59                 ` Michal Hocko
2016-04-17 16:59                 ` Michal Hocko
2016-04-20 13:40               ` Peter Zijlstra
2016-04-20 13:40                 ` Peter Zijlstra
2016-04-20 13:40                 ` Peter Zijlstra
2016-04-20 18:04                 ` H. Peter Anvin
2016-04-20 18:04                   ` H. Peter Anvin
2016-04-20 20:45                   ` Borislav Petkov
2016-04-20 20:45                     ` Borislav Petkov
2016-04-20 20:58                     ` Michal Hocko
2016-04-20 20:58                       ` Michal Hocko
2016-04-20 21:06                     ` H. Peter Anvin
2016-04-20 21:06                       ` H. Peter Anvin
2016-04-20 21:36                       ` Borislav Petkov
2016-04-20 21:36                         ` Borislav Petkov
2016-04-20 22:29                         ` H. Peter Anvin
2016-04-20 22:29                           ` H. Peter Anvin
2016-04-21 11:35                           ` Borislav Petkov
2016-04-21 11:35                             ` Borislav Petkov
2016-04-21 13:09                             ` Michal Hocko
2016-04-21 13:09                               ` Michal Hocko
2016-04-21 13:21                               ` Borislav Petkov
2016-04-21 13:21                                 ` Borislav Petkov
2016-04-27 12:02                           ` [PATCH] x86/locking/rwsem: Cleanup ____down_write() Borislav Petkov
2016-04-27 12:02                             ` Borislav Petkov
2016-04-28 10:27                             ` [tip:locking/rwsem] locking/rwsem, x86: Clean up ____down_write() tip-bot for Borislav Petkov
2016-04-22  6:53                 ` [PATCH 10/11] x86, rwsem: provide __down_write_killable Ingo Molnar
2016-04-22  6:53                   ` Ingo Molnar
2016-04-22  6:53                   ` Ingo Molnar
2016-04-13  9:57     ` [PATCH] x86: add frame annotation for call_rwsem_down_write_failed_killable Michal Hocko
2016-04-13  9:57       ` Michal Hocko
2016-04-13  9:57       ` Michal Hocko
2016-04-22  9:44       ` [tip:locking/rwsem] locking/rwsem, x86: Add frame annotation for call_rwsem_down_write_failed_killable() tip-bot for Michal Hocko
2016-04-22  9:43     ` [tip:locking/rwsem] locking/rwsem, x86: Provide __down_write_killable() tip-bot for Michal Hocko
2016-04-07 15:12   ` [PATCH 11/11] locking, rwsem: provide down_write_killable Michal Hocko
2016-04-07 15:12     ` Michal Hocko
2016-04-22  9:44     ` [tip:locking/rwsem] locking/rwsem: Provide down_write_killable() tip-bot for Michal Hocko
2016-04-12  9:37   ` [PATCH 0/11] introduce down_write_killable for rw_semaphore v3 Michal Hocko
2016-04-12  9:37     ` Michal Hocko
2016-04-12 15:40     ` Peter Zijlstra
2016-04-12 15:40       ` Peter Zijlstra
2016-04-12 18:01       ` Michal Hocko
2016-04-12 18:01         ` Michal Hocko

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=201605102257.HBE12981.LOOJFQtOSMFFHV@I-love.SAKURA.ne.jp \
    --to=penguin-kernel@i-love.sakura.ne.jp \
    --cc=Waiman.Long@hpe.com \
    --cc=akpm@linux-foundation.org \
    --cc=chris@zankel.net \
    --cc=dave@stgolabs.net \
    --cc=davem@davemloft.net \
    --cc=hpa@zytor.com \
    --cc=jcmvbkbc@gmail.com \
    --cc=linux-kernel@vger.kernel.org \
    --cc=mhocko@kernel.org \
    --cc=mingo@redhat.com \
    --cc=peterz@infradead.org \
    --cc=tglx@linutronix.de \
    --cc=tony.luck@intel.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.