linux-kernel.vger.kernel.org archive mirror
 help / color / mirror / Atom feed
From: Michal Hocko <mhocko@suse.cz>
To: azurIt <azurit@pobox.sk>
Cc: linux-kernel@vger.kernel.org, linux-mm@kvack.org,
	cgroups mailinglist <cgroups@vger.kernel.org>,
	KAMEZAWA Hiroyuki <kamezawa.hiroyu@jp.fujitsu.com>
Subject: Re: memory-cgroup bug
Date: Sun, 25 Nov 2012 13:05:24 +0100	[thread overview]
Message-ID: <20121125120524.GB10623@dhcp22.suse.cz> (raw)
In-Reply-To: <20121125011047.7477BB5E@pobox.sk>

[Adding Kamezawa into CC]

On Sun 25-11-12 01:10:47, azurIt wrote:
> >Could you take few snapshots over time?
> 
> 
> Here it is, now from different server, snapshot was taken every second
> for 10 minutes (hope it's enough):
> www.watchdog.sk/lkml/memcg-bug-2.tar.gz

Hmm, interesting:
$ grep . */memory.failcnt | cut -d: -f2 | awk 'BEGIN{min=666666}{if (prev>0) {diff=$1-prev; if (diff>max) max=diff; if (diff<min) min=diff; sum+=diff; n++} prev=$1}END{printf "min:%d max:%d avg:%f\n", min, max, sum/n}'
min:16281 max:224048 avg:18818.943119

So there is a lot of attempts to allocate which fail, every second!
Will get to that later.

The number of tasks in the group is stable (20):
$ for i in *; do ls -d1 $i/[0-9]* | wc -l; done | sort | uniq -c
    546 20

And no task has been killed or spawned:
$ for i in *; do ls -d1 $i/[0-9]* | cut -d/ -f2; done | sort | uniq
24495
24762
24774
24796
24798
24805
24813
24827
24831
24841
24842
24863
24892
24924
24931
25130
25131
25192
25193
25243

$ for stack in [0-9]*/[0-9]*
do 
	head -n1 $stack/stack
done | sort | uniq -c
   9841 [<ffffffff8110a9c1>] mem_cgroup_handle_oom+0x241/0x3b0
    546 [<ffffffff811109b8>] do_truncate+0x58/0xa0
    533 [<ffffffffffffffff>] 0xffffffffffffffff

Tells us that the stacks are pretty much stable.
$ grep do_truncate -r [0-9]* | cut -d/ -f2 | sort | uniq -c
    546 24495

So 24495 is stuck in do_truncate
[<ffffffff811109b8>] do_truncate+0x58/0xa0
[<ffffffff81121c90>] do_last+0x250/0xa30
[<ffffffff81122547>] path_openat+0xd7/0x440
[<ffffffff811229c9>] do_filp_open+0x49/0xa0
[<ffffffff8110f7d6>] do_sys_open+0x106/0x240
[<ffffffff8110f950>] sys_open+0x20/0x30
[<ffffffff815b5926>] system_call_fastpath+0x18/0x1d
[<ffffffffffffffff>] 0xffffffffffffffff

I suspect it is waiting for i_mutex. Who is holding that lock?
Other tasks are blocked on the mem_cgroup_handle_oom either coming from
the page fault path so i_mutex can be exluded or vfs_write (24796) and
that one is interesting:
[<ffffffff8110a9c1>] mem_cgroup_handle_oom+0x241/0x3b0
[<ffffffff8110b5ab>] T.1146+0x5ab/0x5c0
[<ffffffff8110c22e>] mem_cgroup_cache_charge+0xbe/0xe0
[<ffffffff810ca28c>] add_to_page_cache_locked+0x4c/0x140
[<ffffffff810ca3a2>] add_to_page_cache_lru+0x22/0x50
[<ffffffff810ca45b>] grab_cache_page_write_begin+0x8b/0xe0
[<ffffffff81193a18>] ext3_write_begin+0x88/0x270
[<ffffffff810c8fc6>] generic_file_buffered_write+0x116/0x290
[<ffffffff810cb3cc>] __generic_file_aio_write+0x27c/0x480
[<ffffffff810cb646>] generic_file_aio_write+0x76/0xf0		# takes &inode->i_mutex
[<ffffffff8111156a>] do_sync_write+0xea/0x130
[<ffffffff81112183>] vfs_write+0xf3/0x1f0
[<ffffffff81112381>] sys_write+0x51/0x90
[<ffffffff815b5926>] system_call_fastpath+0x18/0x1d
[<ffffffffffffffff>] 0xffffffffffffffff

This smells like a deadlock. But kind strange one. The rapidly
increasing failcnt suggests that somebody still tries to allocate but
who when all of them hung in the mem_cgroup_handle_oom. This can be
explained though.
Memcg OOM killer let's only one process (which is able to lock the
hierarchy by mem_cgroup_oom_lock) call mem_cgroup_out_of_memory and kill
a process, while others are waiting on the wait queue. Once the killer
is done it calls memcg_wakeup_oom which wakes up other tasks waiting on
the queue. Those retry the charge, in a hope there is some memory freed
in the meantime which hasn't happened so they get into OOM again (and
again and again).
This all usually works out except in this particular case I would bet
my hat that the OOM selected task is pid 24495 which is blocked on the
mutex which is held by one of the oom killer task so it cannot finish -
thus free a memory.

It seems that the current Linus' tree is affected as well.

I will have to think about a solution but it sounds really tricky. It is
not just ext3 that is affected.

I guess we need to tell mem_cgroup_cache_charge that it should never
reach OOM from add_to_page_cache_locked. This sounds quite intrusive to
me. On the other hand it is really weird that an excessive writer might
trigger a memcg OOM killer.
-- 
Michal Hocko
SUSE Labs

  reply	other threads:[~2012-11-25 12:05 UTC|newest]

Thread overview: 172+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2012-11-21 19:02 memory-cgroup bug azurIt
2012-11-22  0:26 ` Kamezawa Hiroyuki
2012-11-22  9:36   ` azurIt
2012-11-22 21:45     ` Michal Hocko
2012-11-22 15:24 ` Michal Hocko
2012-11-22 18:05   ` azurIt
2012-11-22 21:42     ` Michal Hocko
2012-11-22 22:34       ` azurIt
2012-11-23  7:40         ` Michal Hocko
2012-11-23  9:21           ` azurIt
2012-11-23  9:28             ` Michal Hocko
2012-11-23  9:44               ` azurIt
2012-11-23 10:10                 ` Michal Hocko
2012-11-23  9:34             ` Glauber Costa
2012-11-23 10:04             ` Michal Hocko
2012-11-23 14:59               ` azurIt
2012-11-25 10:17                 ` Michal Hocko
2012-11-25 12:39                   ` azurIt
2012-11-25 13:02                     ` Michal Hocko
2012-11-25 13:27                       ` azurIt
2012-11-25 13:44                         ` Michal Hocko
2012-11-25  0:10               ` azurIt
2012-11-25 12:05                 ` Michal Hocko [this message]
2012-11-25 12:36                   ` azurIt
2012-11-25 13:55                   ` Michal Hocko
2012-11-26  0:38                     ` azurIt
2012-11-26  7:57                       ` Michal Hocko
2012-11-26 13:18                       ` [PATCH -mm] memcg: do not trigger OOM from add_to_page_cache_locked Michal Hocko
2012-11-26 13:21                         ` [PATCH for 3.2.34] " Michal Hocko
2012-11-26 21:28                           ` azurIt
2012-11-30  1:45                           ` azurIt
2012-11-30  2:29                           ` azurIt
2012-11-30 12:45                             ` Michal Hocko
2012-11-30 12:53                               ` azurIt
2012-11-30 13:44                               ` azurIt
2012-11-30 14:44                                 ` Michal Hocko
2012-11-30 15:03                                   ` Michal Hocko
2012-11-30 15:37                                     ` Michal Hocko
2012-11-30 15:08                                   ` azurIt
2012-11-30 15:39                                     ` Michal Hocko
2012-11-30 15:59                                       ` azurIt
2012-11-30 16:19                                         ` Michal Hocko
2012-11-30 16:26                                           ` azurIt
2012-11-30 16:53                                             ` Michal Hocko
2012-11-30 20:43                                               ` azurIt
2012-12-03 15:16                                           ` Michal Hocko
2012-12-05  1:36                                             ` azurIt
2012-12-05 14:17                                               ` Michal Hocko
2012-12-06  0:29                                                 ` azurIt
2012-12-06  9:54                                                   ` Michal Hocko
2012-12-06 10:12                                                     ` azurIt
2012-12-06 17:06                                                       ` Michal Hocko
2012-12-10  1:20                                                     ` azurIt
2012-12-10  9:43                                                       ` Michal Hocko
2012-12-10 10:18                                                         ` azurIt
2012-12-10 15:52                                                           ` Michal Hocko
2012-12-10 17:18                                                             ` azurIt
2012-12-17  1:34                                                             ` azurIt
2012-12-17 16:32                                                               ` Michal Hocko
2012-12-17 18:23                                                                 ` azurIt
2012-12-17 19:55                                                                   ` Michal Hocko
2012-12-18 14:22                                                                     ` azurIt
2012-12-18 15:20                                                                       ` Michal Hocko
2012-12-24 13:25                                                                         ` azurIt
2012-12-28 16:22                                                                           ` Michal Hocko
2012-12-30  1:09                                                                             ` azurIt
2012-12-30 11:08                                                                               ` Michal Hocko
2013-01-25 15:07                                                                                 ` azurIt
2013-01-25 16:31                                                                                   ` Michal Hocko
2013-02-05 13:49                                                                                     ` Michal Hocko
2013-02-05 14:49                                                                                       ` azurIt
2013-02-05 16:09                                                                                         ` Michal Hocko
2013-02-05 16:46                                                                                           ` azurIt
2013-02-05 16:48                                                                                           ` Greg Thelen
2013-02-05 17:46                                                                                             ` Michal Hocko
2013-02-05 18:09                                                                                               ` Greg Thelen
2013-02-05 18:59                                                                                                 ` Michal Hocko
2013-02-08  4:27                                                                                                   ` Greg Thelen
2013-02-08 16:29                                                                                                     ` Michal Hocko
2013-02-08 16:40                                                                                                       ` Michal Hocko
2013-02-06  1:17                                                                                           ` azurIt
2013-02-06 14:01                                                                                             ` Michal Hocko
2013-02-06 14:22                                                                                               ` Michal Hocko
2013-02-06 16:00                                                                                                 ` [PATCH for 3.2.34] memcg: do not trigger OOM if PF_NO_MEMCG_OOM is set Michal Hocko
2013-02-08  5:03                                                                                                   ` azurIt
2013-02-08  9:44                                                                                                     ` Michal Hocko
2013-02-08 11:02                                                                                                       ` azurIt
2013-02-08 12:38                                                                                                         ` Michal Hocko
2013-02-08 13:56                                                                                                           ` azurIt
2013-02-08 14:47                                                                                                             ` Michal Hocko
2013-02-08 15:24                                                                                                             ` Michal Hocko
2013-02-08 15:58                                                                                                               ` azurIt
2013-02-08 17:10                                                                                                                 ` Michal Hocko
2013-02-08 21:02                                                                                                                   ` azurIt
2013-02-10 15:03                                                                                                                     ` Michal Hocko
2013-02-10 16:46                                                                                                                       ` azurIt
2013-02-11 11:22                                                                                                                         ` Michal Hocko
2013-02-22  8:23                                                                                                                           ` azurIt
2013-02-22 12:52                                                                                                                             ` Michal Hocko
2013-02-22 12:54                                                                                                                               ` azurIt
2013-02-22 13:00                                                                                                                                 ` Michal Hocko
2013-06-06 16:04                                                                                                                             ` Michal Hocko
2013-06-06 16:16                                                                                                                               ` azurIt
2013-06-07 13:11                                                                                                                                 ` [PATCH for 3.2] memcg: do not trap chargers with full callstack on OOM Michal Hocko
2013-06-17 10:21                                                                                                                                   ` azurIt
2013-06-19 13:26                                                                                                                                     ` Michal Hocko
2013-06-22 20:09                                                                                                                                       ` azurIt
2013-06-24 20:13                                                                                                                                         ` Johannes Weiner
2013-06-28 10:06                                                                                                                                           ` azurIt
2013-07-05 18:17                                                                                                                                             ` Johannes Weiner
2013-07-05 19:02                                                                                                                                               ` azurIt
2013-07-05 19:18                                                                                                                                                 ` Johannes Weiner
2013-07-07 23:42                                                                                                                                                   ` azurIt
2013-07-09 13:10                                                                                                                                                     ` Michal Hocko
2013-07-09 13:19                                                                                                                                                       ` azurIt
2013-07-09 13:54                                                                                                                                                         ` Michal Hocko
2013-07-10 16:25                                                                                                                                                           ` azurIt
2013-07-11  7:25                                                                                                                                                             ` Michal Hocko
2013-07-13 23:26                                                                                                                                                               ` azurIt
2013-07-13 23:51                                                                                                                                                                 ` azurIt
2013-07-15 15:41                                                                                                                                                                   ` Michal Hocko
2013-07-15 16:00                                                                                                                                                                     ` Michal Hocko
2013-07-16 15:35                                                                                                                                                                       ` Johannes Weiner
2013-07-16 16:09                                                                                                                                                                         ` Michal Hocko
2013-07-16 16:48                                                                                                                                                                           ` Johannes Weiner
2013-07-19  4:21                                                                                                                                                                             ` Johannes Weiner
2013-07-19  4:22                                                                                                                                                                               ` [patch 1/5] mm: invoke oom-killer from remaining unconverted page fault handlers Johannes Weiner
2013-07-19  4:24                                                                                                                                                                               ` [patch 2/5] mm: pass userspace fault flag to generic fault handler Johannes Weiner
2013-07-19  4:25                                                                                                                                                                               ` [patch 3/5] x86: finish fault error path with fatal signal Johannes Weiner
2013-07-24 20:32                                                                                                                                                                                 ` Johannes Weiner
2013-07-25 20:29                                                                                                                                                                                   ` KOSAKI Motohiro
2013-07-25 21:50                                                                                                                                                                                     ` Johannes Weiner
2013-07-19  4:25                                                                                                                                                                               ` [patch 4/5] memcg: do not trap chargers with full callstack on OOM Johannes Weiner
2013-07-19  4:26                                                                                                                                                                               ` [patch 5/5] mm: memcontrol: sanity check memcg OOM context unwind Johannes Weiner
2013-07-19  8:23                                                                                                                                                                               ` [PATCH for 3.2] memcg: do not trap chargers with full callstack on OOM azurIt
2013-07-14 17:07                                                                                                                                                   ` azurIt
2013-07-09 13:00                                                                                                                                           ` Michal Hocko
2013-07-09 13:08                                                                                                                                             ` Michal Hocko
2013-07-09 13:10                                                                                                                                               ` Michal Hocko
2013-06-24 16:48                                                                                                                                       ` azurIt
2013-02-22 12:00                                                                                                                           ` [PATCH for 3.2.34] memcg: do not trigger OOM if PF_NO_MEMCG_OOM is set azurIt
2013-02-07 11:01                                                                                               ` [PATCH for 3.2.34] memcg: do not trigger OOM from add_to_page_cache_locked Kamezawa Hiroyuki
2013-02-07 12:31                                                                                                 ` Michal Hocko
2013-02-08  4:16                                                                                                   ` Kamezawa Hiroyuki
2013-02-08  1:40                                                                                                 ` Kamezawa Hiroyuki
2013-02-08 16:01                                                                                                   ` Michal Hocko
2013-02-05 16:31                                                                                         ` Michal Hocko
2012-12-24 13:38                                                                         ` azurIt
2012-12-28 16:35                                                                           ` Michal Hocko
2012-11-26 17:46                         ` [PATCH -mm] " Johannes Weiner
2012-11-26 18:04                           ` Michal Hocko
2012-11-26 18:24                             ` Johannes Weiner
2012-11-26 19:03                               ` Michal Hocko
2012-11-26 19:29                                 ` Johannes Weiner
2012-11-26 20:08                                   ` Michal Hocko
2012-11-26 20:19                                     ` Johannes Weiner
2012-11-26 20:46                                       ` azurIt
2012-11-26 20:53                                         ` Johannes Weiner
2012-11-26 22:06                                       ` Michal Hocko
2012-11-27  0:05                         ` Kamezawa Hiroyuki
2012-11-27  9:54                           ` Michal Hocko
2012-11-27 19:48                           ` Johannes Weiner
2012-11-27 20:54                             ` [PATCH -v2 " Michal Hocko
2012-11-27 20:59                               ` Michal Hocko
2012-11-28 15:26                                 ` Johannes Weiner
2012-11-28 16:04                                   ` Michal Hocko
2012-11-28 16:37                                     ` Johannes Weiner
2012-11-28 16:46                                       ` Michal Hocko
2012-11-28 16:48                                         ` Michal Hocko
2012-11-28 18:44                                           ` Johannes Weiner
2012-11-28 20:20                                           ` Hugh Dickins
2012-11-29 14:05                                             ` Michal Hocko

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=20121125120524.GB10623@dhcp22.suse.cz \
    --to=mhocko@suse.cz \
    --cc=azurit@pobox.sk \
    --cc=cgroups@vger.kernel.org \
    --cc=kamezawa.hiroyu@jp.fujitsu.com \
    --cc=linux-kernel@vger.kernel.org \
    --cc=linux-mm@kvack.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).