Greeting, FYI, we noticed a -50.0% regression of filebench.sum_operations/s due to commit: commit: 5efdd9122eff772eae2feae9f0fc0ec02d4846a3 ("smb3: allow deferred close timeout to be configurable") https://git.kernel.org/cgit/linux/kernel/git/torvalds/linux.git master in testcase: filebench on test machine: 88 threads 2 sockets Intel(R) Xeon(R) Gold 6238M CPU @ 2.10GHz (Cascade Lake) with 128G memory with following parameters: disk: 1HDD fs: ext4 fs2: cifs test: filemicro_delete.f cpufreq_governor: performance ucode: 0x5003302 ========================================================================================= compiler/cpufreq_governor/disk/fs2/fs/kconfig/rootfs/tbox_group/test/testcase/ucode: gcc-11/performance/1HDD/cifs/ext4/x86_64-rhel-8.3/debian-11.1-x86_64-20220510.cgz/lkp-csl-2sp9/filemicro_delete.f/filebench/0x5003302 commit: dcb45fd7f5 ("cifs: Do not use tcon->cfid directly, use the cfid we get from open_cached_dir") 5efdd9122e ("smb3: allow deferred close timeout to be configurable") dcb45fd7f501f864 5efdd9122eff772eae2feae9f0f ---------------- --------------------------- %stddev %change %stddev \ | \ 515.95 -50.0% 257.98 filebench.sum_operations/s 4.81 ± 2% +1038.4% 54.78 ± 6% filebench.sum_time_ms/op 29.00 ± 8% +212.1% 90.50 ± 3% filebench.time.percent_of_cpu_this_job_got 24629 +2.7% 25297 filebench.time.voluntary_context_switches 7.685e+08 +19.3% 9.169e+08 ± 4% cpuidle..time 2.53 ± 6% -20.6% 2.00 ± 3% iostat.cpu.iowait 1506141 ± 8% +22.4% 1843256 ± 3% turbostat.IRQ 2.00 -50.0% 1.00 vmstat.procs.b 21969 ± 2% -9.5% 19885 ± 2% vmstat.system.cs 3.06 ± 7% -0.7 2.35 ± 4% mpstat.cpu.all.iowait% 0.79 ± 5% +0.5 1.27 ± 2% mpstat.cpu.all.sys% 0.89 ± 3% -0.1 0.79 ± 3% mpstat.cpu.all.usr% 34.55 ± 14% -34.8% 22.51 ± 27% sched_debug.cfs_rq:/.removed.runnable_avg.avg 119.64 ± 3% -20.0% 95.69 ± 17% sched_debug.cfs_rq:/.removed.runnable_avg.stddev 34.55 ± 14% -34.8% 22.51 ± 27% sched_debug.cfs_rq:/.removed.util_avg.avg 119.64 ± 3% -20.0% 95.69 ± 17% sched_debug.cfs_rq:/.removed.util_avg.stddev 5249 +15.8% 6076 meminfo.Active 3866 ± 2% +17.7% 4552 meminfo.Active(anon) 1382 ± 4% +10.3% 1524 ± 4% meminfo.Active(file) 69791 ± 14% +39.8% 97553 ± 6% meminfo.AnonHugePages 72709 ± 2% +12.5% 81779 ± 3% meminfo.Inactive(file) 23219 +13.5% 26352 ± 3% meminfo.KernelStack 966.50 ± 2% +17.7% 1137 proc-vmstat.nr_active_anon 74302 +6.3% 78977 ± 2% proc-vmstat.nr_anon_pages 81133 +6.0% 85973 proc-vmstat.nr_inactive_anon 18172 ± 2% +12.5% 20442 ± 3% proc-vmstat.nr_inactive_file 23213 +13.5% 26348 ± 3% proc-vmstat.nr_kernel_stack 17983 +2.3% 18400 proc-vmstat.nr_mapped 7446 ± 2% +5.5% 7853 ± 3% proc-vmstat.nr_shmem 26888 +1.6% 27306 proc-vmstat.nr_slab_reclaimable 47220 +3.4% 48803 proc-vmstat.nr_slab_unreclaimable 966.50 ± 2% +17.7% 1137 proc-vmstat.nr_zone_active_anon 81133 +6.0% 85973 proc-vmstat.nr_zone_inactive_anon 18172 ± 2% +12.5% 20442 ± 3% proc-vmstat.nr_zone_inactive_file 361460 +2.5% 370454 proc-vmstat.numa_hit 946.67 +18.6% 1122 proc-vmstat.pgactivate 361562 +2.5% 370553 proc-vmstat.pgalloc_normal 187906 +4.7% 196761 proc-vmstat.pgfault 8189 +2.5% 8395 proc-vmstat.pgreuse 1.097e+09 +15.5% 1.267e+09 ± 7% perf-stat.i.branch-instructions 39079265 ± 6% -20.9% 30915354 ± 4% perf-stat.i.branch-misses 5093263 ± 4% -23.7% 3884752 ± 9% perf-stat.i.cache-misses 29213 -18.7% 23764 ± 5% perf-stat.i.context-switches 7.666e+09 ± 4% +5.7% 8.106e+09 ± 2% perf-stat.i.cpu-cycles 1877 ± 15% +75.1% 3287 ± 12% perf-stat.i.cycles-between-cache-misses 1735450 ± 3% -12.9% 1512060 ± 3% perf-stat.i.iTLB-load-misses 2898 ± 3% +34.4% 3895 ± 7% perf-stat.i.instructions-per-iTLB-miss 1493 -20.3% 1190 ± 7% perf-stat.i.major-faults 0.09 ± 3% +5.8% 0.09 ± 2% perf-stat.i.metric.GHz 48.47 ± 11% +8.4 56.83 ± 7% perf-stat.i.node-store-miss-rate% 283426 ± 4% -21.6% 222190 ± 10% perf-stat.i.node-stores 3.57 ± 7% -1.1 2.44 ± 6% perf-stat.overall.branch-miss-rate% 1508 ± 3% +39.8% 2108 ± 9% perf-stat.overall.cycles-between-cache-misses 3022 ± 3% +23.6% 3736 ± 5% perf-stat.overall.instructions-per-iTLB-miss 9.585e+08 +18.8% 1.138e+09 ± 6% perf-stat.ps.branch-instructions 34151514 ± 6% -18.8% 27725316 ± 4% perf-stat.ps.branch-misses 4450329 ± 5% -21.7% 3486409 ± 9% perf-stat.ps.cache-misses 25524 -16.4% 21333 ± 4% perf-stat.ps.context-switches 77139 +2.5% 79105 perf-stat.ps.cpu-clock 6.704e+09 ± 4% +8.7% 7.287e+09 perf-stat.ps.cpu-cycles 1.06e+09 +11.3% 1.18e+09 ± 5% perf-stat.ps.dTLB-loads 1517349 ± 3% -10.5% 1357716 ± 2% perf-stat.ps.iTLB-load-misses 4.582e+09 +10.8% 5.075e+09 ± 6% perf-stat.ps.instructions 1296 -18.1% 1061 ± 6% perf-stat.ps.major-faults 247613 ± 4% -19.5% 199283 ± 9% perf-stat.ps.node-stores 77139 +2.5% 79105 perf-stat.ps.task-clock 3.697e+10 +35.3% 5.003e+10 perf-stat.total.instructions 8.51 ± 91% -6.9 1.59 ±223% perf-profile.calltrace.cycles-pp.getdents64 8.34 ± 83% -6.7 1.67 ±141% perf-profile.calltrace.cycles-pp.exit_to_user_mode_loop.exit_to_user_mode_prepare.syscall_exit_to_user_mode.do_syscall_64.entry_SYSCALL_64_after_hwframe 6.25 ±107% -6.2 0.00 perf-profile.calltrace.cycles-pp.entry_SYSCALL_64_after_hwframe.open64 6.25 ±107% -6.2 0.00 perf-profile.calltrace.cycles-pp.do_syscall_64.entry_SYSCALL_64_after_hwframe.open64 6.25 ±107% -6.2 0.00 perf-profile.calltrace.cycles-pp.open64 7.63 ± 84% -6.0 1.59 ±223% perf-profile.calltrace.cycles-pp.entry_SYSCALL_64_after_hwframe.getdents64 7.63 ± 84% -6.0 1.59 ±223% perf-profile.calltrace.cycles-pp.do_syscall_64.entry_SYSCALL_64_after_hwframe.getdents64 7.63 ± 84% -6.0 1.59 ±223% perf-profile.calltrace.cycles-pp.__x64_sys_getdents64.do_syscall_64.entry_SYSCALL_64_after_hwframe.getdents64 7.63 ± 84% -6.0 1.59 ±223% perf-profile.calltrace.cycles-pp.iterate_dir.__x64_sys_getdents64.do_syscall_64.entry_SYSCALL_64_after_hwframe.getdents64 6.26 ±115% -5.4 0.88 ±223% perf-profile.calltrace.cycles-pp.syscall_exit_to_user_mode.do_syscall_64.entry_SYSCALL_64_after_hwframe 6.26 ±115% -5.4 0.88 ±223% perf-profile.calltrace.cycles-pp.exit_to_user_mode_prepare.syscall_exit_to_user_mode.do_syscall_64.entry_SYSCALL_64_after_hwframe 6.26 ±115% -4.6 1.67 ±141% perf-profile.calltrace.cycles-pp.do_group_exit.get_signal.arch_do_signal_or_restart.exit_to_user_mode_loop.exit_to_user_mode_prepare 6.26 ±115% -4.6 1.67 ±141% perf-profile.calltrace.cycles-pp.do_exit.do_group_exit.get_signal.arch_do_signal_or_restart.exit_to_user_mode_loop 6.26 ±115% -4.6 1.67 ±141% perf-profile.calltrace.cycles-pp.arch_do_signal_or_restart.exit_to_user_mode_loop.exit_to_user_mode_prepare.syscall_exit_to_user_mode.do_syscall_64 6.26 ±115% -4.6 1.67 ±141% perf-profile.calltrace.cycles-pp.get_signal.arch_do_signal_or_restart.exit_to_user_mode_loop.exit_to_user_mode_prepare.syscall_exit_to_user_mode 4.57 ± 73% -3.8 0.76 ±223% perf-profile.calltrace.cycles-pp.asm_exc_page_fault.perf_mmap__read_head.perf_mmap__push.record__mmap_read_evlist.__cmd_record 4.57 ± 73% -3.8 0.76 ±223% perf-profile.calltrace.cycles-pp.exc_page_fault.asm_exc_page_fault.perf_mmap__read_head.perf_mmap__push.record__mmap_read_evlist 4.57 ± 73% -3.8 0.76 ±223% perf-profile.calltrace.cycles-pp.do_user_addr_fault.exc_page_fault.asm_exc_page_fault.perf_mmap__read_head.perf_mmap__push 4.57 ± 73% -3.8 0.76 ±223% perf-profile.calltrace.cycles-pp.perf_mmap__read_head.perf_mmap__push.record__mmap_read_evlist.__cmd_record.cmd_record 3.38 ±103% -3.4 0.00 perf-profile.calltrace.cycles-pp.unmap_vmas.exit_mmap.mmput.exit_mm.do_exit 3.38 ±103% -3.4 0.00 perf-profile.calltrace.cycles-pp.unmap_page_range.unmap_vmas.exit_mmap.mmput.exit_mm 3.38 ±103% -3.4 0.00 perf-profile.calltrace.cycles-pp.zap_pmd_range.unmap_page_range.unmap_vmas.exit_mmap.mmput 3.38 ±103% -3.4 0.00 perf-profile.calltrace.cycles-pp.zap_pte_range.zap_pmd_range.unmap_page_range.unmap_vmas.exit_mmap 8.51 ± 91% -6.9 1.59 ±223% perf-profile.children.cycles-pp.getdents64 8.34 ± 83% -6.7 1.67 ±141% perf-profile.children.cycles-pp.exit_to_user_mode_prepare 8.34 ± 83% -6.7 1.67 ±141% perf-profile.children.cycles-pp.exit_to_user_mode_loop 8.34 ± 83% -6.7 1.67 ±141% perf-profile.children.cycles-pp.syscall_exit_to_user_mode 6.25 ±107% -6.2 0.00 perf-profile.children.cycles-pp.open64 7.63 ± 84% -6.0 1.59 ±223% perf-profile.children.cycles-pp.__x64_sys_getdents64 7.63 ± 84% -6.0 1.59 ±223% perf-profile.children.cycles-pp.iterate_dir 6.26 ±115% -4.6 1.67 ±141% perf-profile.children.cycles-pp.arch_do_signal_or_restart 6.26 ±115% -4.6 1.67 ±141% perf-profile.children.cycles-pp.get_signal 4.57 ± 73% -3.8 0.76 ±223% perf-profile.children.cycles-pp.perf_mmap__read_head 3.38 ±103% -3.4 0.00 perf-profile.children.cycles-pp.unmap_vmas 3.38 ±103% -3.4 0.00 perf-profile.children.cycles-pp.unmap_page_range 3.38 ±103% -3.4 0.00 perf-profile.children.cycles-pp.zap_pmd_range 3.38 ±103% -3.4 0.00 perf-profile.children.cycles-pp.zap_pte_range 8.54 ± 43% +8.6 17.19 ± 38% perf-profile.children.cycles-pp.asm_exc_page_fault If you fix the issue, kindly add following tag Reported-by: kernel test robot To reproduce: git clone https://github.com/intel/lkp-tests.git cd lkp-tests sudo bin/lkp install job.yaml # job file is attached in this email bin/lkp split-job --compatible job.yaml # generate the yaml file for lkp run sudo bin/lkp run generated-yaml-file # if come across any failure that blocks the test, # please remove ~/.lkp and /lkp dir to run from a clean state. Disclaimer: Results have been estimated based on internal Intel analysis and are provided for informational purposes only. Any difference in system hardware or software design or configuration may affect actual performance. #regzbot introduced: 5efdd9122e -- 0-DAY CI Kernel Test Service https://01.org/lkp