From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1756559AbdESD0K (ORCPT ); Thu, 18 May 2017 23:26:10 -0400 Received: from mail.kernel.org ([198.145.29.99]:47584 "EHLO mail.kernel.org" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1755890AbdESDYt (ORCPT ); Thu, 18 May 2017 23:24:49 -0400 DMARC-Filter: OpenDMARC Filter v1.3.2 mail.kernel.org 84279239E3 Authentication-Results: mail.kernel.org; dmarc=none (p=none dis=none) header.from=kernel.org Authentication-Results: mail.kernel.org; spf=none smtp.mailfrom=mcgrof@kernel.org From: "Luis R. Rodriguez" To: shuah@kernel.org, jeyu@redhat.com, rusty@rustcorp.com.au, ebiederm@xmission.com, dmitry.torokhov@gmail.com, acme@redhat.com, corbet@lwn.net Cc: martin.wilck@suse.com, mmarek@suse.com, pmladek@suse.com, hare@suse.com, rwright@hpe.com, jeffm@suse.com, DSterba@suse.com, fdmanana@suse.com, neilb@suse.com, linux@roeck-us.net, rgoldwyn@suse.com, subashab@codeaurora.org, xypron.glpk@gmx.de, keescook@chromium.org, atomlin@redhat.com, mbenes@suse.cz, paulmck@linux.vnet.ibm.com, dan.j.williams@intel.com, jpoimboe@redhat.com, davem@davemloft.net, mingo@redhat.com, akpm@linux-foundation.org, torvalds@linux-foundation.org, gregkh@linuxfoundation.org, linux-kselftest@vger.kernel.org, linux-doc@vger.kernel.org, linux-kernel@vger.kernel.org, "Luis R. Rodriguez" Subject: [PATCH 1/6] kmod: add dynamic max concurrent thread count Date: Thu, 18 May 2017 20:24:39 -0700 Message-Id: <20170519032444.18416-2-mcgrof@kernel.org> X-Mailer: git-send-email 2.11.0 In-Reply-To: <20170519032444.18416-1-mcgrof@kernel.org> References: <20170519032444.18416-1-mcgrof@kernel.org> Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org We currently statically limit the number of modprobe threads which we allow to run concurrently to 50. As per Keith Owens, this was a completely arbitrary value, and it was set in the 2.3.38 days [0] over 16 years ago in year 2000. Although we haven't yet hit our lower limits, experimentation [1] shows that when and if we hit this limit in the worst case, will be fatal -- consider get_fs_type() failures upon mount on a system which has many partitions, some of which might even be with the same filesystem. Its best to be prudent and increase and set this value to something more sensible which ensures we're far from hitting the limit and also allows default build/user run time override. The worst case is fatal given that once a module fails to load there is a period of time during which subsequent request for the same module will fail, so in the case of partitions its not just one request that could fail, but whole series of partitions. This later issue of a module request failure domino effect can be addressed later, but increasing the limit to something more meaninful should at least give us enough cushion to avoid this for a while. Set this value up with a bit more meaninful modern limits: Bump this up to 64 max for small systems (CONFIG_BASE_SMALL) Bump this up to 128 max for larger systems (!CONFIG_BASE_SMALL) Also allow the default max limit to be further fine tuned at compile time and at initialization at run time at boot up using the kernel parameter: max_modprobes. [0] https://git.kernel.org/cgit/linux/kernel/git/history/history.git/commit/?id=ab1c4ec7410f6ec64e1511d1a7d850fc99c09b44 [1] https://github.com/mcgrof/test_request_module Signed-off-by: Luis R. Rodriguez --- Documentation/admin-guide/kernel-parameters.txt | 7 ++++ include/linux/kmod.h | 3 +- init/Kconfig | 23 +++++++++++++ init/main.c | 1 + kernel/kmod.c | 43 ++++++++++++++++--------- 5 files changed, 61 insertions(+), 16 deletions(-) diff --git a/Documentation/admin-guide/kernel-parameters.txt b/Documentation/admin-guide/kernel-parameters.txt index 15f79c27748d..1314a85c10c9 100644 --- a/Documentation/admin-guide/kernel-parameters.txt +++ b/Documentation/admin-guide/kernel-parameters.txt @@ -1748,6 +1748,13 @@ keepinitrd [HW,ARM] + kmod.max_modprobes [KNL] + This lets you set the max allowed of concurrent + modprobes threads possible on a system overriding the + default heuristic of: + + min(max_threads/2, 1 << CONFIG_MAX_KMOD_CONCURRENT) + kernelcore= [KNL,X86,IA-64,PPC] Format: nn[KMGTPE] | "mirror" This parameter diff --git a/include/linux/kmod.h b/include/linux/kmod.h index c4e441e00db5..302c9dfdcda9 100644 --- a/include/linux/kmod.h +++ b/include/linux/kmod.h @@ -38,13 +38,14 @@ int __request_module(bool wait, const char *name, ...); #define request_module_nowait(mod...) __request_module(false, mod) #define try_then_request_module(x, mod...) \ ((x) ?: (__request_module(true, mod), (x))) +void init_kmod_umh(void); #else static inline int request_module(const char *name, ...) { return -ENOSYS; } static inline int request_module_nowait(const char *name, ...) { return -ENOSYS; } +static inline void init_kmod_umh(void) { } #define try_then_request_module(x, mod...) (x) #endif - struct cred; struct file; diff --git a/init/Kconfig b/init/Kconfig index 1d3475fc9496..5974ad7c9d0a 100644 --- a/init/Kconfig +++ b/init/Kconfig @@ -2191,6 +2191,29 @@ config TRIM_UNUSED_KSYMS If unsure, or if you need to build out-of-tree modules, say N. +config MAX_KMOD_CONCURRENT + int "Max allowed concurrent request_module() calls (6=>64, 10=>1024)" + range 6 14 + default 7 if !BASE_SMALL + default 6 if BASE_SMALL + help + The kernel restricts the number of possible concurrent calls to + request_module() to help avoid a recursive loop possible with + modules. The default maximum number of concurrent threads allowed + to run request_module() will be: + + max_modprobes = min(max_threads/2, 1 << CONFIG_MAX_KMOD_CONCURRENT); + + The value set in CONFIG_MAX_KMOD_CONCURRENT represents then the power + of 2 value used at boot time for the above computation. You can + override the default built value using the kernel parameter: + + kmod.max_modprobes=4096 + + We set this to default to 64 (2^6) concurrent modprobe threads for + small systems, for larger systems this defaults to 128 (2^7) + concurrent modprobe threads. + endif # MODULES config MODULES_TREE_LOOKUP diff --git a/init/main.c b/init/main.c index f866510472d7..25669dfded42 100644 --- a/init/main.c +++ b/init/main.c @@ -650,6 +650,7 @@ asmlinkage __visible void __init start_kernel(void) thread_stack_cache_init(); cred_init(); fork_init(); + init_kmod_umh(); proc_caches_init(); buffer_init(); key_init(); diff --git a/kernel/kmod.c b/kernel/kmod.c index 563f97e2be36..6fe6848787b2 100644 --- a/kernel/kmod.c +++ b/kernel/kmod.c @@ -46,6 +46,9 @@ #include extern int max_threads; +unsigned int max_modprobes; +module_param(max_modprobes, uint, 0644); +MODULE_PARM_DESC(max_modprobes, "Max number of allowed concurrent modprobes"); #define CAP_BSET (void *)1 #define CAP_PI (void *)2 @@ -127,10 +130,8 @@ int __request_module(bool wait, const char *fmt, ...) { va_list args; char module_name[MODULE_NAME_LEN]; - unsigned int max_modprobes; int ret; static atomic_t kmod_concurrent = ATOMIC_INIT(0); -#define MAX_KMOD_CONCURRENT 50 /* Completely arbitrary value - KAO */ static int kmod_loop_msg; /* @@ -154,19 +155,6 @@ int __request_module(bool wait, const char *fmt, ...) if (ret) return ret; - /* If modprobe needs a service that is in a module, we get a recursive - * loop. Limit the number of running kmod threads to max_threads/2 or - * MAX_KMOD_CONCURRENT, whichever is the smaller. A cleaner method - * would be to run the parents of this process, counting how many times - * kmod was invoked. That would mean accessing the internals of the - * process tables to get the command line, proc_pid_cmdline is static - * and it is not worth changing the proc code just to handle this case. - * KAO. - * - * "trace the ppid" is simple, but will fail if someone's - * parent exits. I think this is as good as it gets. --RR - */ - max_modprobes = min(max_threads/2, MAX_KMOD_CONCURRENT); atomic_inc(&kmod_concurrent); if (atomic_read(&kmod_concurrent) > max_modprobes) { /* We may be blaming an innocent here, but unlikely */ @@ -188,6 +176,31 @@ int __request_module(bool wait, const char *fmt, ...) return ret; } EXPORT_SYMBOL(__request_module); + +/* + * If modprobe needs a service that is in a module, we get a recursive + * loop. Limit the number of running kmod threads to max_threads/2 or + * CONFIG_MAX_KMOD_CONCURRENT, whichever is the smaller. A cleaner method + * would be to run the parents of this process, counting how many times + * kmod was invoked. That would mean accessing the internals of the + * process tables to get the command line, proc_pid_cmdline is static + * and it is not worth changing the proc code just to handle this case. + * + * "trace the ppid" is simple, but will fail if someone's + * parent exits. I think this is as good as it gets. + * + * You can override with with a kernel parameter, for instance to allow + * 4096 concurrent modprobe instances: + * + * kmod.max_modprobes=4096 + */ +void __init init_kmod_umh(void) +{ + if (!max_modprobes) + max_modprobes = min(max_threads/2, + 1 << CONFIG_MAX_KMOD_CONCURRENT); +} + #endif /* CONFIG_MODULES */ static void call_usermodehelper_freeinfo(struct subprocess_info *info) -- 2.11.0