From mboxrd@z Thu Jan 1 00:00:00 1970 From: Dmitry Osipenko Subject: Re: [PATCH V7 5/7] cpufreq: Register notifiers with the PM QoS framework Date: Tue, 15 Oct 2019 16:45:46 +0300 Message-ID: <0d0b050a-4d79-2e65-5d71-dfd662310e1f@gmail.com> References: <5ad2624194baa2f53acc1f1e627eb7684c577a19.1562210705.git.viresh.kumar@linaro.org> <2c7a751a58adb4ce6f345dab9714b924504009b6.1562583394.git.viresh.kumar@linaro.org> <20191015114637.pcdbs2ctxl4xoxdo@vireshk-i7> Mime-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 8bit Return-path: In-Reply-To: <20191015114637.pcdbs2ctxl4xoxdo@vireshk-i7> Content-Language: en-US Sender: linux-kernel-owner@vger.kernel.org To: Viresh Kumar Cc: Rafael Wysocki , linux-pm@vger.kernel.org, Vincent Guittot , mka@chromium.org, ulf.hansson@linaro.org, sfr@canb.auug.org.au, pavel@ucw.cz, "Rafael J . Wysocki" , linux-kernel@vger.kernel.org, linux-tegra@vger.kernel.org List-Id: linux-tegra@vger.kernel.org 15.10.2019 14:46, Viresh Kumar пишет: > On 22-09-19, 23:12, Dmitry Osipenko wrote: >> Hello Viresh, >> >> This patch causes use-after-free on a cpufreq driver module reload. Please take a look, thanks in advance. >> >> >> [ 87.952369] ================================================================== >> [ 87.953259] BUG: KASAN: use-after-free in notifier_chain_register+0x4f/0x9c >> [ 87.954031] Read of size 4 at addr e6abbd0c by task modprobe/243 >> >> [ 87.954901] CPU: 1 PID: 243 Comm: modprobe Tainted: G W >> 5.3.0-next-20190920-00185-gf61698eab956-dirty #2408 >> [ 87.956077] Hardware name: NVIDIA Tegra SoC (Flattened Device Tree) >> [ 87.956807] [] (unwind_backtrace) from [] (show_stack+0x11/0x14) >> [ 87.957709] [] (show_stack) from [] (dump_stack+0x89/0x98) >> [ 87.958616] [] (dump_stack) from [] >> (print_address_description.constprop.0+0x3d/0x340) >> [ 87.959785] [] (print_address_description.constprop.0) from [] >> (__kasan_report+0xe3/0x12c) >> [ 87.960907] [] (__kasan_report) from [] (notifier_chain_register+0x4f/0x9c) >> [ 87.962001] [] (notifier_chain_register) from [] >> (blocking_notifier_chain_register+0x29/0x3c) >> [ 87.963180] [] (blocking_notifier_chain_register) from [] >> (dev_pm_qos_add_notifier+0x79/0xf8) >> [ 87.964339] [] (dev_pm_qos_add_notifier) from [] (cpufreq_online+0x5e1/0x8a4) >> [ 87.965351] [] (cpufreq_online) from [] (cpufreq_add_dev+0x79/0x80) >> [ 87.966247] [] (cpufreq_add_dev) from [] (subsys_interface_register+0xc3/0x100) >> [ 87.967297] [] (subsys_interface_register) from [] >> (cpufreq_register_driver+0x13b/0x1ec) >> [ 87.968476] [] (cpufreq_register_driver) from [] >> (tegra20_cpufreq_probe+0x165/0x1a8 [tegra20_cpufreq]) > > Hi Dmitry, > > Thanks for the bug report and I was finally able to reproduce it at my end and > this was quite an interesting debugging exercise :) > > When a cpufreq driver gets registered, we register with the subsys interface and > it calls cpufreq_add_dev() for each CPU, starting from CPU0. And so the QoS > notifiers get added to the first CPU of the policy, i.e. CPU0 in common cases. > > When the cpufreq driver gets unregistered, we unregister with the subsys > interface and it calls cpufreq_remove_dev() for each CPU, starting from CPU0 > (should have been in reverse order I feel). We remove the QoS notifier only when > cpufreq_remove_dev() gets called for the last CPU of the policy, lets call it > CPUx. Now this has a different notifier list as compared to CPU0. > > In short, we are adding the cpufreq notifiers to CPU0 and removing them from > CPUx. When we try to add it again by inserting the module for second time, we > find a node in the notifier list which is already freed but still in the list as > we removed it from CPUx's list (which doesn't do anything as the node wasn't > there in the first place). > > @Rafael: How do you see we solve this problem ? Here are the options I could > think of: > > - Update subsys layer to reverse the order of devices while unregistering (this > will fix the current problem, but we will still have corner cases hanging > around, like if the CPU0 is hotplugged out, etc). > > - Update QoS framework with the knowledge of related CPUs, this has been pending > until now from my side. And this is the thing we really need to do. Eventually > we shall have only a single notifier list for all CPUs of a policy, at least > for MIN/MAX frequencies. > > - ?? > Viresh, thank you very much! Looking forward to a fix :)