Greeting,
FYI, we noticed the following commit (built with gcc-11):
commit: 78ffa3e58d93bb43654788a857835bbe7afe366b ("thermal/core: Add a generic thermal_zone_get_trip() function")
https://git.kernel.org/cgit/linux/kernel/git/next/linux-next.git master
in testcase: ltp
version: ltp-x86_64-14c1f76-1_20220829
with following parameters:
disk: 1HDD
fs: ext4
test: fsx
test-description: The LTP testsuite contains a collection of tools for testing the Linux kernel and related features.
test-url: http://linux-test-project.github.io/
on test machine: 8 threads 1 sockets Intel(R) Core(TM) i7-7700 CPU @ 3.60GHz (Kaby Lake) with 32G memory
caused below changes (please refer to attached dmesg/kmsg for entire log/backtrace):
[ 991.357420][ T67] INFO: task swapper/0:1 blocked for more than 491 seconds.
[ 991.364720][ T67] Tainted: G I 6.0.0-rc1-00053-g78ffa3e58d93 #1
[ 991.372893][ T67] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
[ 991.381586][ T67] task:swapper/0 state:D stack: 0 pid: 1 ppid: 0 flags:0x00004000
[ 991.390807][ T67] Call Trace:
[ 991.394088][ T67]
[ 991.397020][ T67] __schedule (kernel/sched/core.c:5182 kernel/sched/core.c:6494)
[ 991.401437][ T67] ? io_schedule_timeout (kernel/sched/core.c:6376)
[ 991.406721][ T67] schedule (include/linux/instrumented.h:71 (discriminator 1) include/asm-generic/bitops/instrumented-non-atomic.h:141 (discriminator 1) include/linux/thread_info.h:118 (discriminator 1) include/linux/sched.h:2208 (discriminator 1) kernel/sched/core.c:6572 (discriminator 1))
[ 991.410871][ T67] schedule_preempt_disabled (arch/x86/include/asm/preempt.h:80 kernel/sched/core.c:6630)
[ 991.416328][ T67] __mutex_lock+0x725/0x1040
[ 991.421961][ T67] ? acpi_device_probe (drivers/acpi/bus.c:1034)
[ 991.426985][ T67] ? acpi_evaluate_integer (drivers/acpi/utils.c:260)
[ 991.432354][ T67] ? ww_mutex_lock (kernel/locking/mutex.c:744)
[ 991.437116][ T67] ? ret_from_fork (arch/x86/entry/entry_64.S:306)
[ 991.441701][ T67] ? _raw_write_lock_irq (kernel/locking/spinlock.c:153)
[ 991.446985][ T67] ? kmem_cache_alloc (mm/slub.c:3245 mm/slub.c:3251 mm/slub.c:3258 mm/slub.c:3268)
[ 991.452009][ T67] mutex_lock (kernel/locking/mutex.c:286)
[ 991.456248][ T67] ? __mutex_lock_slowpath (kernel/locking/mutex.c:282)
[ 991.461545][ T67] ? acpi_thermal_zone_device_hot (drivers/acpi/thermal.c:183)
[ 991.467615][ T67] ? __thermal_zone_get_temp (drivers/thermal/thermal_helpers.c:125)
[ 991.473074][ T67] ? __mutex_lock_slowpath (kernel/locking/mutex.c:282)
[ 991.478357][ T67] ? down_write_killable (kernel/locking/rwsem.c:1550)
[ 991.483639][ T67] thermal_zone_get_trip (drivers/thermal/thermal_core.c:1152 drivers/thermal/thermal_core.c:1179)
[ 991.488836][ T67] thermal_zone_device_update (drivers/thermal/thermal_core.c:398 drivers/thermal/thermal_core.c:1482 drivers/thermal/thermal_core.c:1466)
[ 991.495170][ T67] ? perf_trace_cdev_update (drivers/thermal/thermal_core.c:453)
[ 991.500729][ T67] ? mutex_unlock (arch/x86/include/asm/atomic64_64.h:190 include/linux/atomic/atomic-long.h:449 include/linux/atomic/atomic-instrumented.h:1790 kernel/locking/mutex.c:181 kernel/locking/mutex.c:540)
[ 991.505316][ T67] ? __mutex_unlock_slowpath+0x2c0/0x2c0
[ 991.511999][ T67] ? acpi_ns_attach_data (drivers/acpi/acpica/nsobject.c:336)
[ 991.519512][ T67] thermal_zone_device_set_mode (drivers/thermal/thermal_core.c:432)
[ 991.525340][ T67] acpi_thermal_register_thermal_zone (drivers/acpi/thermal.c:839)
[ 991.531766][ T67] acpi_thermal_add (drivers/acpi/thermal.c:1046)
[ 991.536621][ T67] acpi_device_probe (drivers/acpi/bus.c:1034)
[ 991.541477][ T67] really_probe (drivers/base/dd.c:530 drivers/base/dd.c:609)
[ 991.545978][ T67] __driver_probe_device (drivers/base/dd.c:748)
[ 991.551264][ T67] driver_probe_device (drivers/base/dd.c:778)
[ 991.556300][ T67] __driver_attach (drivers/base/dd.c:1151)
[ 991.561061][ T67] ? __device_attach_driver (drivers/base/dd.c:1100)
[ 991.566608][ T67] bus_for_each_dev (drivers/base/bus.c:301)
[ 991.571455][ T67] ? subsys_dev_iter_exit (drivers/base/bus.c:290)
[ 991.576651][ T67] ? klist_add_tail (include/linux/list.h:69 include/linux/list.h:102 lib/klist.c:104 lib/klist.c:137)
[ 991.581501][ T67] bus_add_driver (drivers/base/bus.c:618)
[ 991.586176][ T67] driver_register (drivers/base/driver.c:240)
[ 991.590950][ T67] acpi_thermal_init (drivers/acpi/thermal.c:1213)
[ 991.595714][ T67] ? acpi_container_init (drivers/acpi/thermal.c:1197)
[ 991.600822][ T67] do_one_initcall (init/main.c:1296)
[ 991.605495][ T67] ? trace_event_raw_event_initcall_level (init/main.c:1287)
[ 991.612262][ T67] ? parse_one (kernel/params.c:170)
[ 991.616691][ T67] ? sysvec_call_function (arch/x86/kernel/smp.c:243)
[ 991.621887][ T67] do_initcalls (init/main.c:1368 init/main.c:1385)
[ 991.626386][ T67] kernel_init_freeable (init/main.c:1615)
[ 991.631583][ T67] ? console_on_rootfs (init/main.c:1582)
[ 991.636517][ T67] ? usleep_range_state (kernel/time/timer.c:1897)
[ 991.641715][ T67] ? _raw_spin_lock_bh (kernel/locking/spinlock.c:169)
[ 991.646827][ T67] ? rest_init (init/main.c:1492)
[ 991.651242][ T67] ? rest_init (init/main.c:1492)
[ 991.655671][ T67] kernel_init (init/main.c:1502)
[ 991.659997][ T67] ret_from_fork (arch/x86/entry/entry_64.S:306)
[ 991.664411][ T67]
[ 1482.877487][ T67] INFO: task swapper/0:1 blocked for more than 983 seconds.
[ 1482.884782][ T67] Tainted: G I 6.0.0-rc1-00053-g78ffa3e58d93 #1
[ 1482.892948][ T67] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
[ 1482.901632][ T67] task:swapper/0 state:D stack: 0 pid: 1 ppid: 0 flags:0x00004000
[ 1482.910820][ T67] Call Trace:
[ 1482.914074][ T67]
[ 1482.916980][ T67] __schedule (kernel/sched/core.c:5182 kernel/sched/core.c:6494)
[ 1482.921368][ T67] ? io_schedule_timeout (kernel/sched/core.c:6376)
[ 1482.926656][ T67] schedule (include/linux/instrumented.h:71 (discriminator 1) include/asm-generic/bitops/instrumented-non-atomic.h:141 (discriminator 1) include/linux/thread_info.h:118 (discriminator 1) include/linux/sched.h:2208 (discriminator 1) kernel/sched/core.c:6572 (discriminator 1))
[ 1482.930809][ T67] schedule_preempt_disabled (arch/x86/include/asm/preempt.h:80 kernel/sched/core.c:6630)
[ 1482.936280][ T67] __mutex_lock+0x725/0x1040
[ 1482.941894][ T67] ? acpi_device_probe (drivers/acpi/bus.c:1034)
[ 1482.946921][ T67] ? acpi_evaluate_integer (drivers/acpi/utils.c:260)
[ 1482.952313][ T67] ? ww_mutex_lock (kernel/locking/mutex.c:744)
[ 1482.957071][ T67] ? ret_from_fork (arch/x86/entry/entry_64.S:306)
[ 1482.961655][ T67] ? _raw_write_lock_irq (kernel/locking/spinlock.c:153)
[ 1482.966934][ T67] ? kmem_cache_alloc (mm/slub.c:3245 mm/slub.c:3251 mm/slub.c:3258 mm/slub.c:3268)
[ 1482.971952][ T67] mutex_lock (kernel/locking/mutex.c:286)
[ 1482.976208][ T67] ? __mutex_lock_slowpath (kernel/locking/mutex.c:282)
[ 1482.981502][ T67] ? acpi_thermal_zone_device_hot (drivers/acpi/thermal.c:183)
[ 1482.987539][ T67] ? __thermal_zone_get_temp (drivers/thermal/thermal_helpers.c:125)
[ 1482.992969][ T67] ? __mutex_lock_slowpath (kernel/locking/mutex.c:282)
[ 1482.998281][ T67] ? down_write_killable (kernel/locking/rwsem.c:1550)
[ 1483.003576][ T67] thermal_zone_get_trip (drivers/thermal/thermal_core.c:1152 drivers/thermal/thermal_core.c:1179)
[ 1483.008770][ T67] thermal_zone_device_update (drivers/thermal/thermal_core.c:398 drivers/thermal/thermal_core.c:1482 drivers/thermal/thermal_core.c:1466)
[ 1483.015070][ T67] ? perf_trace_cdev_update (drivers/thermal/thermal_core.c:453)
[ 1483.020585][ T67] ? mutex_unlock (arch/x86/include/asm/atomic64_64.h:190 include/linux/atomic/atomic-long.h:449 include/linux/atomic/atomic-instrumented.h:1790 kernel/locking/mutex.c:181 kernel/locking/mutex.c:540)
[ 1483.025146][ T67] ? __mutex_unlock_slowpath+0x2c0/0x2c0
[ 1483.031813][ T67] ? acpi_ns_attach_data (drivers/acpi/acpica/nsobject.c:336)
[ 1483.036984][ T67] thermal_zone_device_set_mode (drivers/thermal/thermal_core.c:432)
[ 1483.042762][ T67] acpi_thermal_register_thermal_zone (drivers/acpi/thermal.c:839)
[ 1483.049181][ T67] acpi_thermal_add (drivers/acpi/thermal.c:1046)
[ 1483.054040][ T67] acpi_device_probe (drivers/acpi/bus.c:1034)
[ 1483.058860][ T67] really_probe (drivers/base/dd.c:530 drivers/base/dd.c:609)
[ 1483.063341][ T67] __driver_probe_device (drivers/base/dd.c:748)
[ 1483.068597][ T67] driver_probe_device (drivers/base/dd.c:778)
[ 1483.073593][ T67] __driver_attach (drivers/base/dd.c:1151)
[ 1483.078366][ T67] ? __device_attach_driver (drivers/base/dd.c:1100)
[ 1483.083908][ T67] bus_for_each_dev (drivers/base/bus.c:301)
[ 1483.088727][ T67] ? subsys_dev_iter_exit (drivers/base/bus.c:290)
[ 1483.093896][ T67] ? klist_add_tail (include/linux/list.h:69 include/linux/list.h:102 lib/klist.c:104 lib/klist.c:137)
[ 1483.098719][ T67] bus_add_driver (drivers/base/bus.c:618)
[ 1483.103365][ T67] driver_register (drivers/base/driver.c:240)
[ 1483.108100][ T67] acpi_thermal_init (drivers/acpi/thermal.c:1213)
[ 1483.112836][ T67] ? acpi_container_init (drivers/acpi/thermal.c:1197)
[ 1483.117918][ T67] do_one_initcall (init/main.c:1296)
[ 1483.122563][ T67] ? trace_event_raw_event_initcall_level (init/main.c:1287)
[ 1483.129317][ T67] ? parse_one (kernel/params.c:170)
[ 1483.133705][ T67] ? sysvec_call_function (arch/x86/kernel/smp.c:243)
[ 1483.138876][ T67] do_initcalls (init/main.c:1368 init/main.c:1385)
[ 1483.143349][ T67] kernel_init_freeable (init/main.c:1615)
[ 1483.148540][ T67] ? console_on_rootfs (init/main.c:1582)
[ 1483.153469][ T67] ? usleep_range_state (kernel/time/timer.c:1897)
[ 1483.158638][ T67] ? _raw_spin_lock_bh (kernel/locking/spinlock.c:169)
[ 1483.163720][ T67] ? rest_init (init/main.c:1492)
[ 1483.168125][ T67] ? rest_init (init/main.c:1492)
[ 1483.172541][ T67] kernel_init (init/main.c:1502)
[ 1483.176870][ T67] ret_from_fork (arch/x86/entry/entry_64.S:306)
[ 1483.181307][ T67]
If you fix the issue, kindly add following tag
| Reported-by: kernel test robot
| Link: https://lore.kernel.org/r/202209270841.afe9a25b-yujie.liu@intel.com
To reproduce:
git clone https://github.com/intel/lkp-tests.git
cd lkp-tests
sudo bin/lkp install job.yaml # job file is attached in this email
bin/lkp split-job --compatible job.yaml # generate the yaml file for lkp run
sudo bin/lkp run generated-yaml-file
# if come across any failure that blocks the test,
# please remove ~/.lkp and /lkp dir to run from a clean state.
--
0-DAY CI Kernel Test Service
https://01.org/lkp