From mboxrd@z Thu Jan 1 00:00:00 1970
Return-Path:
Received: from mail138.messagelabs.com (mail138.messagelabs.com [216.82.249.35])
	by kanga.kvack.org (Postfix) with SMTP id 157068D0039
	for ; Fri, 28 Jan 2011 01:44:02 -0500 (EST)
Date: Fri, 28 Jan 2011 01:44:01 -0500 (EST)
From: CAI Qian
Message-ID: <77942321.201910.1296197041743.JavaMail.root@zmail06.collab.prod.int.phx2.redhat.com>
In-Reply-To: <20110126135252.GQ926@random.random>
Subject: Re: mmotm 2011-01-25-15-47 uploaded
MIME-Version: 1.0
Content-Type: text/plain; charset=utf-8
Content-Transfer-Encoding: 7bit
Sender: owner-linux-mm@kvack.org
To: Andrea Arcangeli
Cc: linux-mm@kvack.org
List-ID:

----- Original Message -----
> Hi Qian,
>
> On Wed, Jan 26, 2011 at 03:45:55AM -0500, CAI Qian wrote:
> > Andrea,
> >
> > khugepaged hung during swapping there.
> >
> > INFO: task khugepaged:276 blocked for more than 120 seconds.
> > "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this
> > message.
> > khugepaged D ffff880fa0fd4610 0 276 2 0x00000000
> > ffff880fa07cfcc0 0000000000000046 ffff88201ffdac00 0000000000000000
> > 0000000000014d40 ffff880fa0fd4080 ffff880fa0fd4610 ffff880fa07cffd8
> > ffff880fa0fd4618 0000000000014d40 ffff880fa07ce010 0000000000014d40
> > Call Trace:
> > [] rwsem_down_failed_common+0xb5/0x140
> > [] rwsem_down_write_failed+0x13/0x20
> > [] call_rwsem_down_write_failed+0x13/0x20
> > [] ? down_write+0x32/0x40
> > [] khugepaged+0x8ad/0x1300
> > [] ? __switch_to+0xd0/0x320
> > [] ? khugepaged+0x0/0x1300
> > [] ? autoremove_wake_function+0x0/0x40
> > [] ? khugepaged+0x0/0x1300
> > [] kthread+0x96/0xa0
> > [] kernel_thread_helper+0x4/0x10
> > [] ? kthread+0x0/0xa0
> > [] ? kernel_thread_helper+0x0/0x10
> > INFO: task pgrep:6039 blocked for more than 120 seconds.
> > "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this
> > message.
> > pgrep D ffff887f606f1ab0 0 6039 6038 0x00000080
> > ffff8821e39c1ce0 0000000000000082 0000000000000246 0000000000000000
> > 0000000000014d40 ffff887f606f1520 ffff887f606f1ab0 ffff8821e39c1fd8
> > ffff887f606f1ab8 0000000000014d40 ffff8821e39c0010 0000000000014d40
> > Call Trace:
> > [] rwsem_down_failed_common+0xb5/0x140
> > [] rwsem_down_read_failed+0x15/0x17
> > [] call_rwsem_down_read_failed+0x14/0x30
> > [] ? down_read+0x24/0x30
> > [] access_process_vm+0x4c/0x200
> > [] ? fallback_alloc+0x14e/0x270
> > [] proc_pid_cmdline+0x6d/0x120
> > [] ? alloc_pages_current+0x9a/0x100
> > [] proc_info_read+0xad/0xf0
> > [] vfs_read+0xc5/0x190
> > [] sys_read+0x51/0x90
> > [] system_call_fastpath+0x16/0x1b
>
> pgrep hung too, it's not just khugepaged hanging and it's not obvious
> for now that khugepaged was guilty of forgetting an unlock, could be
> the process deadlocked somewhere with the mmap_sem held. Can you press
> SYSRQ+T? Hopefully that will show the holder. Also is CONFIG_NUMA=y/n?

Unfortunately, SYSRQ+T was not working. CONFIG_NUMA=y, and this is a NUMA
system as well.

CAI Qian