All of lore.kernel.org
 help / color / mirror / Atom feed
From: Dave Jones <davej@redhat.com>
To: Oleg Nesterov <oleg@redhat.com>
Cc: "Paul E. McKenney" <paulmck@linux.vnet.ibm.com>,
	Linux Kernel <linux-kernel@vger.kernel.org>,
	Linus Torvalds <torvalds@linux-foundation.org>,
	"Eric W. Biederman" <ebiederm@xmission.com>,
	Andrey Vagin <avagin@openvz.org>
Subject: Re: frequent softlockups with 3.10rc6.
Date: Sat, 22 Jun 2013 17:59:05 -0400	[thread overview]
Message-ID: <20130622215905.GA28238@redhat.com> (raw)
In-Reply-To: <20130622173129.GA29375@redhat.com>

On Sat, Jun 22, 2013 at 07:31:29PM +0200, Oleg Nesterov wrote:
 
 > > [ 7485.261299] WARNING: at include/linux/nsproxy.h:63 get_proc_task_net+0x1c8/0x1d0()
 > > [ 7485.262021] Modules linked in: 8021q garp stp tun fuse rfcomm bnep hidp snd_seq_dummy nfnetlink scsi_transport_iscsi can_bcm ipt_ULOG can_raw rds af_802154 nfc can rose caif_socket caif llc2 af_rxrpc phonet ipx p8023 p8022 pppoe pppox ppp_generic netrom slhc ax25 x25 af_key appletalk atm psnap llc irda crc_ccitt bluetooth rfkill coretemp hwmon kvm_intel snd_hda_codec_realtek kvm snd_hda_codec_hdmi crc32c_intel ghash_clmulni_intel microcode snd_hda_intel snd_hda_codec snd_hwdep snd_seq snd_seq_device pcspkr snd_pcm snd_page_alloc e1000e snd_timer ptp snd pps_core soundcore xfs libcrc32c
 > > [ 7485.265434] CPU: 2 PID: 5623 Comm: trinity-child3 Not tainted 3.10.0-rc6+ #28 
 > > [ 7485.267158]  ffffffff81a1529c ffff8801c8eafd30 ffffffff816e432d ffff8801c8eafd68
 > > [ 7485.268045]  ffffffff8104a0c1 0000000000000000 ffff880225e9bd18 ffff8801bc6e4de0
 > > [ 7485.268932]  0000000000000000 00000000000000dd ffff8801c8eafd78 ffffffff8104a19a
 > > [ 7485.270463] Call Trace:
 > > [ 7485.271338]  [<ffffffff816e432d>] dump_stack+0x19/0x1b
 > > [ 7485.272207]  [<ffffffff8104a0c1>] warn_slowpath_common+0x61/0x80
 > > [ 7485.273092]  [<ffffffff8104a19a>] warn_slowpath_null+0x1a/0x20
 > > [ 7485.273942]  [<ffffffff81229f58>] get_proc_task_net+0x1c8/0x1d0
 > > [ 7485.274793]  [<ffffffff81229d95>] ? get_proc_task_net+0x5/0x1d0
 > > [ 7485.275659]  [<ffffffff8122a0bd>] proc_tgid_net_lookup+0x1d/0x80
 > > [ 7485.276531]  [<ffffffff811b778d>] lookup_real+0x1d/0x50
 > > [ 7485.277646]  [<ffffffff811b7d83>] __lookup_hash+0x33/0x40
 > > [ 7485.278477]  [<ffffffff811bb143>] kern_path_create+0xb3/0x190
 > > [ 7485.279345]  [<ffffffff811b93d5>] ? getname_flags+0xb5/0x190
 > > [ 7485.280292]  [<ffffffff811bb261>] user_path_create+0x41/0x60
 > > [ 7485.281233]  [<ffffffff811be6bb>] SyS_symlinkat+0x4b/0xd0
 > > [ 7485.282072]  [<ffffffff816f5a54>] tracesys+0xdd/0xe2
 > > [ 7485.282973] ---[ end trace 2204b7c65d6c5519 ]---
 > 
 > Hmm. The test case tries to create the symlink in /proc/*/net/ ?

hit it with symlink, but also some other syscalls. eg:

WARNING: at include/linux/nsproxy.h:63 get_proc_task_net+0x1c8/0x1d0()
 Modules linked in: 8021q garp stp tun fuse rfcomm bnep hidp snd_seq_dummy nfnetlink scsi_transport_iscsi can_bcm ipt_ULOG can_raw rds af_802154 nfc can rose caif_socket caif llc2 af_rxrpc phonet ipx p8023 p8022 pppoe pppox ppp_generic netrom slhc ax25 x25 af_key appletalk atm psnap llc irda crc_ccitt bluetooth rfkill coretemp hwmon kvm_intel snd_hda_codec_realtek kvm snd_hda_codec_hdmi crc32c_intel ghash_clmulni_intel microcode snd_hda_intel snd_hda_codec snd_hwdep snd_seq snd_seq_device pcspkr snd_pcm snd_page_alloc e1000e snd_timer ptp snd pps_core soundcore xfs libcrc32c
 CPU: 2 PID: 12821 Comm: trinity-child2 Tainted: G        W    3.10.0-rc6+ #28
  ffffffff81a1529c ffff8801bcbdbc70 ffffffff816e432d ffff8801bcbdbca8
  ffffffff8104a0c1 0000000000000000 ffff880226704828 ffff8801b95a7090
  00000000ffffff9c 0000000000000008 ffff8801bcbdbcb8 ffffffff8104a19a
 Call Trace:
  [<ffffffff816e432d>] dump_stack+0x19/0x1b
  [<ffffffff8104a0c1>] warn_slowpath_common+0x61/0x80
  [<ffffffff8104a19a>] warn_slowpath_null+0x1a/0x20
  [<ffffffff81229f58>] get_proc_task_net+0x1c8/0x1d0
  [<ffffffff81229d95>] ? get_proc_task_net+0x5/0x1d0
  [<ffffffff8122a0bd>] proc_tgid_net_lookup+0x1d/0x80
  [<ffffffff811b778d>] lookup_real+0x1d/0x50
  [<ffffffff811b7d83>] __lookup_hash+0x33/0x40
  [<ffffffff816e3635>] lookup_slow+0x44/0xa9
  [<ffffffff811bae88>] path_lookupat+0x7b8/0x810
  [<ffffffff8119d3d2>] ? kmem_cache_alloc+0x142/0x320
  [<ffffffff811b936f>] ? getname_flags+0x4f/0x190
  [<ffffffff811b936f>] ? getname_flags+0x4f/0x190
  [<ffffffff811baf0b>] filename_lookup+0x2b/0xc0
  [<ffffffff811be144>] user_path_at_empty+0x54/0x90
  [<ffffffff81145965>] ? user_exit+0x45/0x90
  [<ffffffff810b773d>] ? trace_hardirqs_on_caller+0xfd/0x1c0
  [<ffffffff811be191>] user_path_at+0x11/0x20
  [<ffffffff811d5338>] SyS_lgetxattr+0x38/0x90
  [<ffffffff816f5a54>] tracesys+0xdd/0xe2
 ---[ end trace 2204b7c65d6c551a ]---
 
WARNING: at include/linux/nsproxy.h:63 get_proc_task_net+0x1c8/0x1d0()
Modules linked in: 8021q garp stp tun fuse rfcomm bnep hidp snd_seq_dummy nfnetlink scsi_transport_iscsi can_bcm ipt_ULOG can_raw rds af_802154 nfc can rose caif_socket caif llc2 af_rxrpc phonet ipx p8023 p8022 pppoe pppox ppp_generic netrom slhc ax25 x25 af_key appletalk atm psnap llc irda crc_ccitt bluetooth rfkill coretemp hwmon kvm_intel snd_hda_codec_realtek kvm snd_hda_codec_hdmi crc32c_intel ghash_clmulni_intel microcode snd_hda_intel snd_hda_codec snd_hwdep snd_seq snd_seq_device pcspkr snd_pcm snd_page_alloc e1000e snd_timer ptp snd pps_core soundcore xfs libcrc32c
CPU: 2 PID: 13142 Comm: trinity-child1 Tainted: G        W    3.10.0-rc6+ #28
 ffffffff81a1529c ffff880228f3dc30 ffffffff816e432d ffff880228f3dc68
 ffffffff8104a0c1 0000000000000000 ffff8802239f3790 ffff880224021030
 00000000ffffff9c 0000000000000001 ffff880228f3dc78 ffffffff8104a19a
Call Trace:
 [<ffffffff816e432d>] dump_stack+0x19/0x1b
 [<ffffffff8104a0c1>] warn_slowpath_common+0x61/0x80
 [<ffffffff8104a19a>] warn_slowpath_null+0x1a/0x20
 [<ffffffff81229f58>] get_proc_task_net+0x1c8/0x1d0
 [<ffffffff81229d95>] ? get_proc_task_net+0x5/0x1d0
 [<ffffffff8122a0bd>] proc_tgid_net_lookup+0x1d/0x80
 [<ffffffff811b778d>] lookup_real+0x1d/0x50
 [<ffffffff811b7d83>] __lookup_hash+0x33/0x40
 [<ffffffff816e3635>] lookup_slow+0x44/0xa9
 [<ffffffff811bae88>] path_lookupat+0x7b8/0x810
 [<ffffffff8119d3d2>] ? kmem_cache_alloc+0x142/0x320
 [<ffffffff811b936f>] ? getname_flags+0x4f/0x190
 [<ffffffff811b936f>] ? getname_flags+0x4f/0x190
 [<ffffffff811baf0b>] filename_lookup+0x2b/0xc0
 [<ffffffff811be144>] user_path_at_empty+0x54/0x90
 [<ffffffff810ba1f8>] ? lock_release_non_nested+0x308/0x350
 [<ffffffff810b4dae>] ? lock_release_holdtime.part.30+0xee/0x170
 [<ffffffff811be191>] user_path_at+0x11/0x20
 [<ffffffff811e1069>] do_utimes+0xa9/0x160
 [<ffffffff811e118f>] SyS_utime+0x6f/0xa0
 [<ffffffff816f5a54>] tracesys+0xdd/0xe2
---[ end trace 2204b7c65d6c551e ]---

WARNING: at include/linux/nsproxy.h:63 get_proc_task_net+0x1c8/0x1d0()
Modules linked in: 8021q garp stp tun fuse rfcomm bnep hidp snd_seq_dummy nfnetlink scsi_transport_iscsi can_bcm ipt_ULOG can_raw rds af_802154 nfc can rose caif_socket caif llc2 af_rxrpc phonet ipx p8023 p8022 pppoe pppox ppp_generic netrom slhc ax25 x25 af_key appletalk atm psnap llc irda crc_ccitt bluetooth rfkill coretemp hwmon kvm_intel snd_hda_codec_realtek kvm snd_hda_codec_hdmi crc32c_intel ghash_clmulni_intel microcode snd_hda_intel snd_hda_codec snd_hwdep snd_seq snd_seq_device pcspkr snd_pcm snd_page_alloc e1000e snd_timer ptp snd pps_core soundcore xfs libcrc32c
CPU: 0 PID: 13692 Comm: trinity-child0 Tainted: G        W    3.10.0-rc6+ #28
 ffffffff81a1529c ffff88021dea7c90 ffffffff816e432d ffff88021dea7cc8
 ffffffff8104a0c1 0000000000000000 ffff880226704828 ffff88022404ae40
 00000000ffffff9c 0000000000000000 ffff88021dea7cd8 ffffffff8104a19a
Call Trace:
 [<ffffffff816e432d>] dump_stack+0x19/0x1b
 [<ffffffff8104a0c1>] warn_slowpath_common+0x61/0x80
 [<ffffffff8104a19a>] warn_slowpath_null+0x1a/0x20
 [<ffffffff81229f58>] get_proc_task_net+0x1c8/0x1d0
 [<ffffffff81229d95>] ? get_proc_task_net+0x5/0x1d0
 [<ffffffff8122a0bd>] proc_tgid_net_lookup+0x1d/0x80
 [<ffffffff811b778d>] lookup_real+0x1d/0x50
 [<ffffffff811b7d83>] __lookup_hash+0x33/0x40
 [<ffffffff816e3635>] lookup_slow+0x44/0xa9
 [<ffffffff811bae88>] path_lookupat+0x7b8/0x810
 [<ffffffff8119d3d2>] ? kmem_cache_alloc+0x142/0x320
 [<ffffffff811b936f>] ? getname_flags+0x4f/0x190
 [<ffffffff811b936f>] ? getname_flags+0x4f/0x190
 [<ffffffff811baf0b>] filename_lookup+0x2b/0xc0
 [<ffffffff811be144>] user_path_at_empty+0x54/0x90
 [<ffffffff810b773d>] ? trace_hardirqs_on_caller+0xfd/0x1c0
 [<ffffffff810b780d>] ? trace_hardirqs_on+0xd/0x10
 [<ffffffff811be191>] user_path_at+0x11/0x20
 [<ffffffff811abe7f>] SyS_chdir+0x2f/0xc0
 [<ffffffff816f59f5>] ? tracesys+0x7e/0xe2
 [<ffffffff816f5a54>] tracesys+0xdd/0xe2
---[ end trace 2204b7c65d6c5521 ]---

and a bunch of other similar VFS related calls.

 > >  > +		pr_info("YESTHISHAPPENS new=%p\n", new);
 > >
 > > This didn't trigger. (yet?)
 > 
 > This should only trigger if the test-case plays with the namespaces...
 > But once again, the warnings are fine. I hoped that they can provide
 > more info when/if you reproduce the lockup.
 > 
 > But it seems you can't ?
 >
 > Dave, I am sorry but all I can do is to ask you to do more testing.
 > Could you please reproduce the lockup again on the clean Linus's
 > current ? (and _without_ reverting 8aac6270, of course).
 
I'll give it a shot.  Just rebuilt clean tree, and restarted the tests.

 > If watchdog will blame __do_softirq() again I can try to make a
 > better debugging patch.
 > 
 > Perhaps it makes sense to decrease /proc/sys/kernel/watchdog_thresh
 > to detect the possible lockups earlier. 2 * 10 is probably too much.

I can try that too if it doesn't show up.

 > And who knows, perhaps you pulled some fix (say 34376a50fb1 looks
 > promising) after you finished bisecting and then pulled Linus
 > current.

maybe, though I'm doubtful. I'm sure I already saw it yesterday
on this tree without the revert, but I'll confirm for sure of course..

	Dave


  reply	other threads:[~2013-06-22 21:59 UTC|newest]

Thread overview: 93+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2013-06-19 16:45 frequent softlockups with 3.10rc6 Dave Jones
2013-06-19 17:53 ` Dave Jones
2013-06-19 18:13   ` Paul E. McKenney
2013-06-19 18:42     ` Dave Jones
2013-06-20  0:12     ` Dave Jones
2013-06-20 16:16       ` Paul E. McKenney
2013-06-20 16:27         ` Dave Jones
2013-06-21 15:11         ` Dave Jones
2013-06-21 19:59           ` Oleg Nesterov
2013-06-22  1:37             ` Dave Jones
2013-06-22 17:31               ` Oleg Nesterov
2013-06-22 21:59                 ` Dave Jones [this message]
2013-06-23  5:00                   ` Andrew Vagin
2013-06-23 14:36                   ` Oleg Nesterov
2013-06-23 15:06                     ` Dave Jones
2013-06-23 16:04                       ` Oleg Nesterov
2013-06-24  0:21                         ` Dave Jones
2013-06-24  2:00                         ` Dave Jones
2013-06-24 14:39                           ` Oleg Nesterov
2013-06-24 14:52                             ` Steven Rostedt
2013-06-24 16:00                               ` Dave Jones
2013-06-24 16:24                                 ` Steven Rostedt
2013-06-24 16:51                                   ` Dave Jones
2013-06-24 17:04                                     ` Steven Rostedt
2013-06-25 16:55                                       ` Dave Jones
2013-06-25 17:21                                         ` Steven Rostedt
2013-06-25 17:23                                           ` Steven Rostedt
2013-06-25 17:26                                           ` Dave Jones
2013-06-25 17:31                                             ` Steven Rostedt
2013-06-25 17:32                                             ` Steven Rostedt
2013-06-25 17:29                                           ` Steven Rostedt
2013-06-25 17:34                                             ` Dave Jones
2013-06-24 16:37                                 ` Oleg Nesterov
2013-06-24 16:49                                   ` Dave Jones
2013-06-24 15:57                         ` Dave Jones
2013-06-24 17:35                           ` Oleg Nesterov
2013-06-24 17:44                             ` Dave Jones
2013-06-24 17:53                             ` Steven Rostedt
2013-06-24 18:00                               ` Dave Jones
2013-06-25 15:35                             ` Dave Jones
2013-06-25 16:23                               ` Steven Rostedt
2013-06-26  5:23                                 ` Dave Jones
2013-06-26 19:52                                   ` Steven Rostedt
2013-06-26 20:00                                     ` Dave Jones
2013-06-27  3:01                                       ` Steven Rostedt
2013-06-26  5:48                                 ` Dave Jones
2013-06-26 19:18                               ` Oleg Nesterov
2013-06-26 19:40                                 ` Dave Jones
2013-06-27  0:22                                 ` Dave Jones
2013-06-27  1:06                                   ` Eric W. Biederman
2013-06-27  2:32                                     ` Tejun Heo
2013-06-27  7:55                                   ` Dave Chinner
2013-06-27 10:06                                     ` Dave Chinner
2013-06-27 12:52                                       ` Dave Chinner
2013-06-27 15:21                                         ` Dave Jones
2013-06-28  1:13                                           ` Dave Chinner
2013-06-28  3:58                                             ` Dave Chinner
2013-06-28 10:28                                               ` Jan Kara
2013-06-29  3:39                                                 ` Dave Chinner
2013-07-01 12:00                                                   ` Jan Kara
2013-07-02  6:29                                                     ` Dave Chinner
2013-07-02  8:19                                                       ` Jan Kara
2013-07-02 12:38                                                         ` Dave Chinner
2013-07-02 14:05                                                           ` Jan Kara
2013-07-02 16:13                                                             ` Linus Torvalds
2013-07-02 16:57                                                               ` Jan Kara
2013-07-02 17:38                                                                 ` Linus Torvalds
2013-07-03  3:07                                                                   ` Dave Chinner
2013-07-03  3:28                                                                     ` Linus Torvalds
2013-07-03  4:49                                                                       ` Dave Chinner
2013-07-04  7:19                                                                         ` Andrew Morton
2013-06-29 20:13                                               ` Dave Jones
2013-06-29 22:23                                                 ` Linus Torvalds
2013-06-29 23:44                                                   ` Dave Jones
2013-06-30  0:21                                                     ` Steven Rostedt
2013-07-01 12:49                                                     ` Pavel Machek
2013-06-30  0:17                                                   ` Steven Rostedt
2013-06-30  2:05                                                   ` Dave Chinner
2013-06-30  2:34                                                     ` Dave Chinner
2013-06-27 14:30                                     ` Dave Jones
2013-06-28  1:18                                       ` Dave Chinner
2013-06-28  2:54                                         ` Linus Torvalds
2013-06-28  3:54                                           ` Dave Chinner
2013-06-28  5:59                                             ` Linus Torvalds
2013-06-28  7:21                                               ` Dave Chinner
2013-06-28  8:22                                                 ` Linus Torvalds
2013-06-28  8:32                                                   ` Al Viro
2013-06-28  8:22                                               ` Al Viro
2013-06-28  9:49                                               ` Jan Kara
2013-07-01 17:57                                             ` block layer softlockup Dave Jones
2013-07-02  2:07                                               ` Dave Chinner
2013-07-02  6:01                                                 ` Dave Jones
2013-07-02  7:30                                                   ` Dave Chinner

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=20130622215905.GA28238@redhat.com \
    --to=davej@redhat.com \
    --cc=avagin@openvz.org \
    --cc=ebiederm@xmission.com \
    --cc=linux-kernel@vger.kernel.org \
    --cc=oleg@redhat.com \
    --cc=paulmck@linux.vnet.ibm.com \
    --cc=torvalds@linux-foundation.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.