From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1754195Ab2AZWyb (ORCPT ); Thu, 26 Jan 2012 17:54:31 -0500 Received: from out03.mta.xmission.com ([166.70.13.233]:57782 "EHLO out03.mta.xmission.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1754157Ab2AZWy3 convert rfc822-to-8bit (ORCPT ); Thu, 26 Jan 2012 17:54:29 -0500 From: ebiederm@xmission.com (Eric W. Biederman) To: Eric Dumazet Cc: Pavel Emelyanov , Sjur =?utf-8?Q?Br=C3=A6ndeland?= , "levinsasha928\@gmail.com" , "netdev\@vger.kernel.org" , "davem\@davemloft.net" , "linux-kernel\@vger.kernel.org" , "davej\@redhat.com" , "sjurbren\@gmail.com" Subject: Re: [PATCH] netns: fix net_alloc_generic() References: <1327523631-3480-1-git-send-email-sjur.brandeland@stericsson.com> <1327574498.2500.22.camel@edumazet-HP-Compaq-6005-Pro-SFF-PC> <4F212E7E.2040801@parallels.com> <1327575108.2500.24.camel@edumazet-HP-Compaq-6005-Pro-SFF-PC> Date: Thu, 26 Jan 2012 14:57:02 -0800 In-Reply-To: <1327575108.2500.24.camel@edumazet-HP-Compaq-6005-Pro-SFF-PC> (Eric Dumazet's message of "Thu, 26 Jan 2012 11:51:48 +0100") Message-ID: User-Agent: Gnus/5.13 (Gnus v5.13) Emacs/23.2 (gnu/linux) MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 8BIT X-XM-SPF: eid=;;;mid=;;;hst=in02.mta.xmission.com;;;ip=98.207.153.68;;;frm=ebiederm@xmission.com;;;spf=neutral X-XM-AID: U2FsdGVkX1+69FPQhyA4vS5OaWO/M1U6qlh1XRF2u+k= X-SA-Exim-Connect-IP: 98.207.153.68 X-SA-Exim-Mail-From: ebiederm@xmission.com X-SA-Exim-Scanned: No (on in02.mta.xmission.com); SAEximRunCond expanded to false Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Eric Dumazet writes: > Le jeudi 26 janvier 2012 à 14:44 +0400, Pavel Emelyanov a écrit : >> > I believe the problem is in net_namespace infrastructure, not in CAIF. >> > >> > Could you test following patch instead ? >> > >> > [PATCH] netns: fix net_alloc_generic() >> > >> > When a new net namespace is created, we should attach to it a "struct >> > net_generic" with enough slots (even empty), or we can hit the following >> > BUG_ON() : >> > >> > [ 200.752016] kernel BUG at include/net/netns/generic.h:40! >> > ... >> > [ 200.752016] [] ? get_cfcnfg+0x3a/0x180 >> > [ 200.752016] [] ? lockdep_rtnl_is_held+0x10/0x20 >> > [ 200.752016] [] caif_device_notify+0x2e/0x530 >> > [ 200.752016] [] notifier_call_chain+0x67/0x110 >> > [ 200.752016] [] raw_notifier_call_chain+0x11/0x20 >> > [ 200.752016] [] call_netdevice_notifiers+0x32/0x60 >> > [ 200.752016] [] register_netdevice+0x196/0x300 >> > [ 200.752016] [] register_netdev+0x19/0x30 >> > [ 200.752016] [] loopback_net_init+0x4a/0xa0 >> > [ 200.752016] [] ops_init+0x42/0x180 >> > [ 200.752016] [] setup_net+0x6b/0x100 >> > [ 200.752016] [] copy_net_ns+0x86/0x110 >> > [ 200.752016] [] create_new_namespaces+0xd9/0x190 >> > >> > net_alloc_generic() should take into account the maximum index into the >> > ptr array, as a subsystem might use net_generic() anytime. >> >> I'm not sure I understand it correctly, but subsystem can only use the >> net_generic() only (!) after the net_assign_generic() is performed. > > Yes, but here, loopback_net_init() calls register_netdev() > > So every subsystems _notify are called, even if subsystem _init_net() > was not yet called. > > Its a chicken and egg problem. It is not a chicken and egg problem. It is a bug in caif. caif is claiming to be a network device when it is acting as a subsytem. That means it is being initialized too late. Untested but this should trivially fix the problem, and a bunch of others of the same ilk. It is not safe to shutdown subsystems until all of the devices are gone, otherwise there will be problems with packets in flight. diff --git a/net/caif/caif_dev.c b/net/caif/caif_dev.c index 673728a..cf5bdd3 100644 --- a/net/caif/caif_dev.c +++ b/net/caif/caif_dev.c @@ -569,7 +569,7 @@ static int __init caif_device_init(void) { int result; - result = register_pernet_device(&caif_net_ops); + result = register_pernet_subsys(&caif_net_ops); if (result) return result; @@ -582,7 +582,7 @@ static int __init caif_device_init(void) static void __exit caif_device_exit(void) { - unregister_pernet_device(&caif_net_ops); + unregister_pernet_subsys(&caif_net_ops); unregister_netdevice_notifier(&caif_device_notifier); dev_remove_pack(&caif_packet_type); } Eric