Greeting, FYI, we noticed a 80.1% improvement of stress-ng.dir.ops_per_sec due to commit: commit: 80d287de7b49ed673418f4fb91a523c58c5bebf2 ("[PATCH 07/39] xfs: journal IO cache flush reductions") url: https://github.com/0day-ci/linux/commits/Dave-Chinner/xfs-CIL-and-log-optimisations/20210603-134113 base: https://git.kernel.org/cgit/fs/xfs/xfs-linux.git for-next in testcase: stress-ng on test machine: 96 threads 2 sockets Intel(R) Xeon(R) Gold 6252 CPU @ 2.10GHz with 512G memory with following parameters: nr_threads: 10% disk: 1HDD testtime: 60s fs: xfs class: filesystem test: dir cpufreq_governor: performance ucode: 0x5003006 Details are as below: --------------------------------------------------------------------------------------------------> To reproduce: git clone https://github.com/intel/lkp-tests.git cd lkp-tests bin/lkp install job.yaml # job file is attached in this email bin/lkp split-job --compatible job.yaml # generate the yaml file for lkp run bin/lkp run generated-yaml-file ========================================================================================= class/compiler/cpufreq_governor/disk/fs/kconfig/nr_threads/rootfs/tbox_group/test/testcase/testtime/ucode: filesystem/gcc-9/performance/1HDD/xfs/x86_64-rhel-8.3/10%/debian-10.4-x86_64-20200603.cgz/lkp-csl-2sp7/dir/stress-ng/60s/0x5003006 commit: d5405b4dd5 ("xfs: remove need_start_rec parameter from xlog_write()") 80d287de7b ("xfs: journal IO cache flush reductions") d5405b4dd577ba9a 80d287de7b49ed673418f4fb91a ---------------- --------------------------- %stddev %change %stddev \ | \ 1482543 +80.1% 2669796 ± 6% stress-ng.dir.ops 24569 +80.1% 44248 ± 6% stress-ng.dir.ops_per_sec 713.83 ± 10% +74.0% 1241 ± 6% stress-ng.time.involuntary_context_switches 204.83 +84.8% 378.50 stress-ng.time.percent_of_cpu_this_job_got 124.81 +84.5% 230.26 stress-ng.time.system_time 2619366 +82.1% 4770408 ± 4% stress-ng.time.voluntary_context_switches 1.72 ± 14% +1.6 3.35 ± 5% mpstat.cpu.all.sys% 236.19 +15.9% 273.81 ± 2% pmeter.Average_Active_Power 96.31 -1.4% 94.95 iostat.cpu.idle 3.51 +37.5% 4.82 iostat.cpu.system 4568 ± 9% +22.0% 5572 ± 6% meminfo.Active 3959 ± 3% +27.8% 5061 ± 3% meminfo.Active(anon) 380393 ± 10% +49.6% 569186 ± 14% numa-numastat.node0.local_node 457743 ± 5% +39.2% 637259 ± 11% numa-numastat.node0.numa_hit 10063 ± 2% +33.4% 13426 ± 5% softirqs.BLOCK 8625 ± 19% +18.3% 10204 ± 4% softirqs.CPU14.SCHED 5410 ± 55% +89.4% 10248 ± 3% softirqs.CPU65.SCHED 41612495 +79.6% 74744322 ± 5% cpuidle.C1.time 1407022 +80.2% 2535388 ± 9% cpuidle.C1.usage 322643 ± 8% +52.7% 492813 ± 9% cpuidle.POLL.time 28413 ± 4% +39.4% 39600 ± 8% cpuidle.POLL.usage 96.00 -1.7% 94.33 vmstat.cpu.id 6494 ± 3% +74.4% 11328 ± 6% vmstat.io.bo 2.00 +75.0% 3.50 ± 14% vmstat.procs.r 81184 +81.5% 147330 ± 4% vmstat.system.cs 97.17 +49.2% 145.00 turbostat.Avg_MHz 4.87 +1.2 6.06 ± 2% turbostat.Busy% 2003 +19.9% 2402 ± 3% turbostat.Bzy_MHz 1402579 +80.4% 2530574 ± 9% turbostat.C1 0.67 +0.5 1.20 ± 5% turbostat.C1% 94.08 -2.0 92.07 turbostat.C1E% 46.17 +10.5% 51.00 turbostat.PkgTmp 112.71 +29.0% 145.38 ± 3% turbostat.PkgWatt 987.67 ± 3% +27.7% 1261 ± 3% proc-vmstat.nr_active_anon 3894 +7.3% 4179 proc-vmstat.nr_shmem 987.67 ± 3% +27.7% 1261 ± 3% proc-vmstat.nr_zone_active_anon 657336 +29.0% 847751 proc-vmstat.numa_hit 570584 +33.4% 761004 ± 2% proc-vmstat.numa_local 1419 ± 13% +56.8% 2226 ± 12% proc-vmstat.pgactivate 914482 ± 3% +31.0% 1197750 ± 2% proc-vmstat.pgalloc_normal 738528 ± 4% +37.0% 1011485 ± 3% proc-vmstat.pgfree 429578 ± 2% +74.9% 751447 ± 6% proc-vmstat.pgpgout 1282 ± 6% +25.4% 1608 ± 12% slabinfo.buffer_head.active_objs 1282 ± 6% +25.4% 1608 ± 12% slabinfo.buffer_head.num_objs 10382 ± 4% -12.9% 9043 ± 3% slabinfo.kmalloc-128.active_objs 10533 ± 3% -11.9% 9284 ± 3% slabinfo.kmalloc-128.num_objs 126328 ± 3% -14.4% 108194 ± 3% slabinfo.kmalloc-512.active_objs 257974 ± 5% +21.6% 313574 ± 9% slabinfo.kmalloc-8.active_objs 506.17 ± 5% +21.4% 614.33 ± 9% slabinfo.kmalloc-8.active_slabs 259338 ± 5% +21.4% 314774 ± 9% slabinfo.kmalloc-8.num_objs 506.17 ± 5% +21.4% 614.33 ± 9% slabinfo.kmalloc-8.num_slabs 17113 ± 2% +39.3% 23841 ± 6% interrupts.315:PCI-MSI.376832-edge.ahci[0000:00:17.0] 95405 ± 3% +27.9% 122055 ± 6% interrupts.CAL:Function_call_interrupts 1021 ± 25% +64.5% 1680 ± 22% interrupts.CPU14.CAL:Function_call_interrupts 81.83 ± 44% +100.8% 164.33 ± 17% interrupts.CPU14.RES:Rescheduling_interrupts 734.33 ± 21% +85.7% 1363 ± 47% interrupts.CPU16.CAL:Function_call_interrupts 936.83 ± 34% +67.6% 1570 ± 21% interrupts.CPU21.CAL:Function_call_interrupts 74.67 ± 46% +87.5% 140.00 ± 34% interrupts.CPU21.RES:Rescheduling_interrupts 990.50 ± 12% +28.8% 1275 ± 10% interrupts.CPU56.CAL:Function_call_interrupts 1000 ± 25% +88.2% 1882 ± 30% interrupts.CPU65.CAL:Function_call_interrupts 75.50 ± 34% +171.3% 204.83 ± 58% interrupts.CPU65.RES:Rescheduling_interrupts 1180 ± 23% +63.4% 1928 ± 31% interrupts.CPU8.CAL:Function_call_interrupts 1174 ± 27% +96.3% 2305 ± 38% interrupts.CPU9.CAL:Function_call_interrupts 31436 ± 8% +67.5% 52645 ± 7% interrupts.NMI:Non-maskable_interrupts 31436 ± 8% +67.5% 52645 ± 7% interrupts.PMI:Performance_monitoring_interrupts 4741 ± 4% +45.0% 6874 ± 13% interrupts.RES:Rescheduling_interrupts 4.00 -17.7% 3.29 ± 2% perf-stat.i.MPKI 2.268e+09 +70.7% 3.871e+09 ± 5% perf-stat.i.branch-instructions 0.84 ± 7% -0.3 0.59 ± 3% perf-stat.i.branch-miss-rate% 13994599 ± 5% +42.4% 19932596 ± 4% perf-stat.i.branch-misses 3478750 +137.0% 8243762 ± 67% perf-stat.i.cache-misses 40667842 +69.3% 68868679 ± 7% perf-stat.i.cache-references 84844 +81.4% 153896 ± 5% perf-stat.i.context-switches 1.24 ± 9% -46.8% 0.66 ± 7% perf-stat.i.cpi 8.853e+09 +52.6% 1.351e+10 perf-stat.i.cpu-cycles 111.43 +6.6% 118.78 perf-stat.i.cpu-migrations 2619 ± 5% -19.1% 2118 ± 27% perf-stat.i.cycles-between-cache-misses 0.01 ± 10% -0.0 0.01 ± 4% perf-stat.i.dTLB-load-miss-rate% 177121 ± 6% +93.5% 342763 ± 11% perf-stat.i.dTLB-load-misses 3.145e+09 +74.6% 5.492e+09 ± 6% perf-stat.i.dTLB-loads 0.00 ± 14% -0.0 0.00 ± 5% perf-stat.i.dTLB-store-miss-rate% 20981 ± 14% +41.5% 29678 ± 6% perf-stat.i.dTLB-store-misses 1.626e+09 +72.4% 2.804e+09 ± 5% perf-stat.i.dTLB-stores 4231356 +62.4% 6872280 ± 4% perf-stat.i.iTLB-load-misses 6324364 +60.3% 10139019 ± 4% perf-stat.i.iTLB-loads 1.222e+10 +71.5% 2.096e+10 ± 5% perf-stat.i.instructions 2568 +23.5% 3171 perf-stat.i.instructions-per-iTLB-miss 1.15 +33.4% 1.54 ± 6% perf-stat.i.ipc 0.09 +52.6% 0.14 perf-stat.i.metric.GHz 509.54 +70.8% 870.20 ± 3% perf-stat.i.metric.K/sec 73.36 +72.8% 126.75 ± 6% perf-stat.i.metric.M/sec 993104 +77.1% 1758681 ± 8% perf-stat.i.node-loads 0.62 ± 4% -0.1 0.52 ± 4% perf-stat.overall.branch-miss-rate% 2545 ± 2% -19.2% 2056 ± 30% perf-stat.overall.cycles-between-cache-misses 0.00 ± 14% -0.0 0.00 ± 6% perf-stat.overall.dTLB-store-miss-rate% 2888 +5.5% 3048 perf-stat.overall.instructions-per-iTLB-miss 2.233e+09 +70.6% 3.811e+09 ± 5% perf-stat.ps.branch-instructions 13776498 ± 5% +42.4% 19620370 ± 4% perf-stat.ps.branch-misses 3424430 +136.9% 8113289 ± 67% perf-stat.ps.cache-misses 40038403 +69.3% 67785317 ± 7% perf-stat.ps.cache-references 83516 +81.4% 151470 ± 5% perf-stat.ps.context-switches 8.716e+09 +52.6% 1.33e+10 perf-stat.ps.cpu-cycles 109.67 +6.6% 116.90 perf-stat.ps.cpu-migrations 174387 ± 6% +93.4% 337330 ± 11% perf-stat.ps.dTLB-load-misses 3.096e+09 +74.6% 5.406e+09 ± 6% perf-stat.ps.dTLB-loads 20649 ± 14% +41.4% 29202 ± 6% perf-stat.ps.dTLB-store-misses 1.601e+09 +72.4% 2.76e+09 ± 5% perf-stat.ps.dTLB-stores 4165626 +62.4% 6764240 ± 4% perf-stat.ps.iTLB-load-misses 6225700 +60.3% 9979567 ± 4% perf-stat.ps.iTLB-loads 1.203e+10 +71.5% 2.063e+10 ± 5% perf-stat.ps.instructions 977553 +77.0% 1730599 ± 8% perf-stat.ps.node-loads 7.679e+11 +71.5% 1.317e+12 ± 5% perf-stat.total.instructions 55.09 -5.3 49.84 perf-profile.calltrace.cycles-pp.secondary_startup_64_no_verify 8.68 ± 4% -4.7 4.02 ± 17% perf-profile.calltrace.cycles-pp.asm_sysvec_apic_timer_interrupt.cpuidle_enter_state.cpuidle_enter.do_idle.cpu_startup_entry 53.42 ± 2% -3.9 49.47 perf-profile.calltrace.cycles-pp.start_secondary.secondary_startup_64_no_verify 53.41 ± 2% -3.9 49.47 perf-profile.calltrace.cycles-pp.cpu_startup_entry.start_secondary.secondary_startup_64_no_verify 53.38 ± 2% -3.9 49.45 perf-profile.calltrace.cycles-pp.do_idle.cpu_startup_entry.start_secondary.secondary_startup_64_no_verify 7.08 ± 4% -3.7 3.33 ± 14% perf-profile.calltrace.cycles-pp.sysvec_apic_timer_interrupt.asm_sysvec_apic_timer_interrupt.cpuidle_enter_state.cpuidle_enter.do_idle 49.39 ± 2% -2.8 46.62 perf-profile.calltrace.cycles-pp.cpuidle_enter.do_idle.cpu_startup_entry.start_secondary.secondary_startup_64_no_verify 4.86 ± 6% -2.6 2.23 ± 13% perf-profile.calltrace.cycles-pp.__sysvec_apic_timer_interrupt.sysvec_apic_timer_interrupt.asm_sysvec_apic_timer_interrupt.cpuidle_enter_state.cpuidle_enter 4.81 ± 6% -2.6 2.20 ± 13% perf-profile.calltrace.cycles-pp.hrtimer_interrupt.__sysvec_apic_timer_interrupt.sysvec_apic_timer_interrupt.asm_sysvec_apic_timer_interrupt.cpuidle_enter_state 48.61 ± 2% -2.5 46.14 perf-profile.calltrace.cycles-pp.cpuidle_enter_state.cpuidle_enter.do_idle.cpu_startup_entry.start_secondary 2.65 ± 13% -1.1 1.57 ± 18% perf-profile.calltrace.cycles-pp.menu_select.do_idle.cpu_startup_entry.start_secondary.secondary_startup_64_no_verify 2.23 ± 5% -1.1 1.16 ± 11% perf-profile.calltrace.cycles-pp.__hrtimer_run_queues.hrtimer_interrupt.__sysvec_apic_timer_interrupt.sysvec_apic_timer_interrupt.asm_sysvec_apic_timer_interrupt 1.77 ± 11% -1.1 0.71 ± 22% perf-profile.calltrace.cycles-pp.clockevents_program_event.hrtimer_interrupt.__sysvec_apic_timer_interrupt.sysvec_apic_timer_interrupt.asm_sysvec_apic_timer_interrupt 1.59 ± 5% -0.8 0.77 ± 15% perf-profile.calltrace.cycles-pp.tick_sched_timer.__hrtimer_run_queues.hrtimer_interrupt.__sysvec_apic_timer_interrupt.sysvec_apic_timer_interrupt 1.21 ± 23% -0.7 0.47 ± 79% perf-profile.calltrace.cycles-pp.tick_nohz_next_event.tick_nohz_get_sleep_length.menu_select.do_idle.cpu_startup_entry 1.37 ± 5% -0.7 0.68 ± 17% perf-profile.calltrace.cycles-pp.tick_sched_handle.tick_sched_timer.__hrtimer_run_queues.hrtimer_interrupt.__sysvec_apic_timer_interrupt 1.48 ± 18% -0.7 0.81 ± 25% perf-profile.calltrace.cycles-pp.tick_nohz_get_sleep_length.menu_select.do_idle.cpu_startup_entry.start_secondary 1.33 ± 6% -0.7 0.65 ± 18% perf-profile.calltrace.cycles-pp.update_process_times.tick_sched_handle.tick_sched_timer.__hrtimer_run_queues.hrtimer_interrupt 0.97 ± 6% -0.5 0.46 ± 44% perf-profile.calltrace.cycles-pp.irq_exit_rcu.sysvec_apic_timer_interrupt.asm_sysvec_apic_timer_interrupt.cpuidle_enter_state.cpuidle_enter 0.85 ± 6% -0.3 0.51 ± 70% perf-profile.calltrace.cycles-pp.worker_thread.kthread.ret_from_fork 0.85 ± 6% -0.3 0.50 ± 70% perf-profile.calltrace.cycles-pp.process_one_work.worker_thread.kthread.ret_from_fork 0.91 ± 5% -0.2 0.73 ± 18% perf-profile.calltrace.cycles-pp.ret_from_fork 0.91 ± 5% -0.2 0.73 ± 18% perf-profile.calltrace.cycles-pp.kthread.ret_from_fork 0.64 ± 7% +0.1 0.76 ± 5% perf-profile.calltrace.cycles-pp.xfs_btree_lookup.xfs_check_agi_freecount.xfs_dialloc_ag.xfs_dir_ialloc.xfs_create 0.84 ± 6% +0.1 0.96 ± 4% perf-profile.calltrace.cycles-pp.xlog_cil_insert_items.xfs_log_commit_cil.__xfs_trans_commit.xfs_remove.xfs_vn_unlink 0.64 ± 4% +0.1 0.77 ± 7% perf-profile.calltrace.cycles-pp.xfs_trans_read_buf_map.xfs_read_agi.xfs_iunlink.xfs_remove.xfs_vn_unlink 0.72 ± 11% +0.1 0.85 ± 4% perf-profile.calltrace.cycles-pp.xfs_dir2_leafn_lookup_for_entry.xfs_da3_node_lookup_int.xfs_dir2_node_removename.xfs_dir_removename.xfs_remove 0.67 ± 3% +0.1 0.80 ± 7% perf-profile.calltrace.cycles-pp.xfs_read_agi.xfs_iunlink.xfs_remove.xfs_vn_unlink.vfs_rmdir 0.74 ± 3% +0.1 0.88 ± 8% perf-profile.calltrace.cycles-pp.xfs_dir2_leafn_remove.xfs_dir2_node_removename.xfs_dir_removename.xfs_remove.xfs_vn_unlink 0.77 ± 3% +0.1 0.91 ± 7% perf-profile.calltrace.cycles-pp.xfs_iunlink.xfs_remove.xfs_vn_unlink.vfs_rmdir.do_rmdir 1.15 ± 3% +0.2 1.32 ± 6% perf-profile.calltrace.cycles-pp.xlog_cil_insert_items.xfs_log_commit_cil.__xfs_trans_commit.xfs_create.xfs_generic_create 1.49 ± 4% +0.2 1.66 ± 3% perf-profile.calltrace.cycles-pp.xfs_btree_check_sblock.xfs_btree_get_rec.xfs_inobt_get_rec.xfs_check_agi_freecount.xfs_dialloc_ag 0.44 ± 44% +0.2 0.63 ± 7% perf-profile.calltrace.cycles-pp.xfs_buf_get_map.xfs_buf_read_map.xfs_trans_read_buf_map.xfs_read_agi.xfs_iunlink 0.94 ± 8% +0.2 1.13 ± 5% perf-profile.calltrace.cycles-pp.xfs_iunlink_remove.xfs_ifree.xfs_inactive_ifree.xfs_inactive.xfs_fs_destroy_inode 0.46 ± 45% +0.2 0.65 ± 7% perf-profile.calltrace.cycles-pp.xfs_buf_read_map.xfs_trans_read_buf_map.xfs_read_agi.xfs_iunlink.xfs_remove 1.74 ± 3% +0.2 1.94 ± 3% perf-profile.calltrace.cycles-pp.__xfs_btree_check_sblock.xfs_btree_check_sblock.xfs_btree_get_rec.xfs_inobt_get_rec.xfs_check_agi_freecount 1.15 ± 7% +0.2 1.35 perf-profile.calltrace.cycles-pp.xfs_da3_node_lookup_int.xfs_dir2_node_removename.xfs_dir_removename.xfs_remove.xfs_vn_unlink 0.51 ± 46% +0.2 0.71 ± 5% perf-profile.calltrace.cycles-pp.xfs_read_agi.xfs_iunlink_remove.xfs_ifree.xfs_inactive_ifree.xfs_inactive 0.95 ± 8% +0.2 1.16 ± 7% perf-profile.calltrace.cycles-pp.xfs_dialloc_select_ag.xfs_dir_ialloc.xfs_create.xfs_generic_create.vfs_mkdir 1.64 ± 4% +0.2 1.86 ± 3% perf-profile.calltrace.cycles-pp.xfs_btree_check_sblock.xfs_btree_increment.xfs_check_agi_freecount.xfs_dialloc_ag.xfs_dir_ialloc 2.67 ± 3% +0.2 2.90 ± 2% perf-profile.calltrace.cycles-pp.xfs_btree_get_rec.xfs_inobt_get_rec.xfs_check_agi_freecount.xfs_dialloc_ag.xfs_dir_ialloc 2.40 ± 4% +0.3 2.66 ± 5% perf-profile.calltrace.cycles-pp.__xfs_trans_commit.xfs_create.xfs_generic_create.vfs_mkdir.do_mkdirat 1.88 ± 4% +0.3 2.14 ± 3% perf-profile.calltrace.cycles-pp.xfs_log_commit_cil.__xfs_trans_commit.xfs_remove.xfs_vn_unlink.vfs_rmdir 2.36 ± 4% +0.3 2.62 ± 5% perf-profile.calltrace.cycles-pp.xfs_log_commit_cil.__xfs_trans_commit.xfs_create.xfs_generic_create.vfs_mkdir 1.96 ± 4% +0.3 2.23 ± 3% perf-profile.calltrace.cycles-pp.__xfs_trans_commit.xfs_remove.xfs_vn_unlink.vfs_rmdir.do_rmdir 1.56 ± 2% +0.3 1.86 ± 10% perf-profile.calltrace.cycles-pp.xfs_log_commit_cil.__xfs_trans_commit.xfs_inactive_ifree.xfs_inactive.xfs_fs_destroy_inode 1.64 ± 2% +0.3 1.94 ± 9% perf-profile.calltrace.cycles-pp.__xfs_trans_commit.xfs_inactive_ifree.xfs_inactive.xfs_fs_destroy_inode.destroy_inode 0.27 ±100% +0.3 0.59 ± 2% perf-profile.calltrace.cycles-pp.xfs_dir2_node_addname_int.xfs_dir2_node_addname.xfs_dir_createname.xfs_create.xfs_generic_create 2.87 ± 5% +0.4 3.23 ± 3% perf-profile.calltrace.cycles-pp.xfs_btree_increment.xfs_check_agi_freecount.xfs_dialloc_ag.xfs_dir_ialloc.xfs_create 2.05 ± 4% +0.4 2.41 ± 2% perf-profile.calltrace.cycles-pp.xfs_dir2_node_removename.xfs_dir_removename.xfs_remove.xfs_vn_unlink.vfs_rmdir 2.27 ± 4% +0.4 2.63 ± 3% perf-profile.calltrace.cycles-pp.xfs_dir_removename.xfs_remove.xfs_vn_unlink.vfs_rmdir.do_rmdir 8.99 ± 3% +0.8 9.77 ± 2% perf-profile.calltrace.cycles-pp.xfs_check_agi_freecount.xfs_dialloc_ag.xfs_dir_ialloc.xfs_create.xfs_generic_create 0.43 ± 80% +0.8 1.23 ± 18% perf-profile.calltrace.cycles-pp.xfs_buf_find.xfs_buf_get_map.xfs_buf_read_map.xfs_trans_read_buf_map.xfs_read_agi 9.71 ± 3% +0.8 10.53 ± 2% perf-profile.calltrace.cycles-pp.xfs_dialloc_ag.xfs_dir_ialloc.xfs_create.xfs_generic_create.vfs_mkdir 5.58 ± 3% +0.9 6.47 ± 2% perf-profile.calltrace.cycles-pp.xfs_vn_unlink.vfs_rmdir.do_rmdir.do_syscall_64.entry_SYSCALL_64_after_hwframe 5.56 ± 3% +0.9 6.46 ± 2% perf-profile.calltrace.cycles-pp.xfs_remove.xfs_vn_unlink.vfs_rmdir.do_rmdir.do_syscall_64 5.13 ± 4% +0.9 6.02 ± 8% perf-profile.calltrace.cycles-pp.__percpu_counter_sum.xfs_fs_statfs.statfs_by_dentry.vfs_statfs.user_statfs 5.41 ± 4% +0.9 6.36 ± 9% perf-profile.calltrace.cycles-pp.vfs_statfs.user_statfs.__do_sys_statfs.do_syscall_64.entry_SYSCALL_64_after_hwframe 5.39 ± 4% +1.0 6.34 ± 9% perf-profile.calltrace.cycles-pp.statfs_by_dentry.vfs_statfs.user_statfs.__do_sys_statfs.do_syscall_64 5.35 ± 4% +1.0 6.31 ± 9% perf-profile.calltrace.cycles-pp.xfs_fs_statfs.statfs_by_dentry.vfs_statfs.user_statfs.__do_sys_statfs 6.25 ± 3% +1.0 7.30 ± 7% perf-profile.calltrace.cycles-pp.user_statfs.__do_sys_statfs.do_syscall_64.entry_SYSCALL_64_after_hwframe 6.36 ± 3% +1.1 7.42 ± 7% perf-profile.calltrace.cycles-pp.__do_sys_statfs.do_syscall_64.entry_SYSCALL_64_after_hwframe 11.53 ± 3% +1.2 12.69 perf-profile.calltrace.cycles-pp.xfs_dir_ialloc.xfs_create.xfs_generic_create.vfs_mkdir.do_mkdirat 10.20 ± 3% +1.2 11.36 ± 4% perf-profile.calltrace.cycles-pp.xfs_difree.xfs_ifree.xfs_inactive_ifree.xfs_inactive.xfs_fs_destroy_inode 11.37 ± 3% +1.4 12.73 ± 3% perf-profile.calltrace.cycles-pp.xfs_ifree.xfs_inactive_ifree.xfs_inactive.xfs_fs_destroy_inode.destroy_inode 15.83 ± 3% +1.7 17.50 perf-profile.calltrace.cycles-pp.xfs_create.xfs_generic_create.vfs_mkdir.do_mkdirat.do_syscall_64 13.23 ± 3% +1.7 14.91 ± 2% perf-profile.calltrace.cycles-pp.xfs_inactive.xfs_fs_destroy_inode.destroy_inode.vfs_rmdir.do_rmdir 13.28 ± 3% +1.7 14.96 ± 2% perf-profile.calltrace.cycles-pp.xfs_fs_destroy_inode.destroy_inode.vfs_rmdir.do_rmdir.do_syscall_64 13.22 ± 3% +1.7 14.90 ± 2% perf-profile.calltrace.cycles-pp.xfs_inactive_ifree.xfs_inactive.xfs_fs_destroy_inode.destroy_inode.vfs_rmdir 13.44 ± 3% +1.7 15.14 ± 2% perf-profile.calltrace.cycles-pp.destroy_inode.vfs_rmdir.do_rmdir.do_syscall_64.entry_SYSCALL_64_after_hwframe 15.95 ± 3% +1.7 17.64 perf-profile.calltrace.cycles-pp.xfs_generic_create.vfs_mkdir.do_mkdirat.do_syscall_64.entry_SYSCALL_64_after_hwframe 15.99 ± 3% +1.7 17.70 perf-profile.calltrace.cycles-pp.vfs_mkdir.do_mkdirat.do_syscall_64.entry_SYSCALL_64_after_hwframe 16.34 ± 3% +1.8 18.12 perf-profile.calltrace.cycles-pp.do_mkdirat.do_syscall_64.entry_SYSCALL_64_after_hwframe 19.38 ± 3% +2.6 21.99 perf-profile.calltrace.cycles-pp.vfs_rmdir.do_rmdir.do_syscall_64.entry_SYSCALL_64_after_hwframe 19.73 ± 3% +2.7 22.41 perf-profile.calltrace.cycles-pp.do_rmdir.do_syscall_64.entry_SYSCALL_64_after_hwframe 43.26 +5.5 48.78 perf-profile.calltrace.cycles-pp.do_syscall_64.entry_SYSCALL_64_after_hwframe 43.31 +5.5 48.83 perf-profile.calltrace.cycles-pp.entry_SYSCALL_64_after_hwframe 55.08 -5.3 49.83 perf-profile.children.cycles-pp.do_idle 55.09 -5.3 49.84 perf-profile.children.cycles-pp.secondary_startup_64_no_verify 55.09 -5.3 49.84 perf-profile.children.cycles-pp.cpu_startup_entry 8.30 ± 4% -4.2 4.14 ± 12% perf-profile.children.cycles-pp.asm_sysvec_apic_timer_interrupt 51.02 -4.1 46.94 ± 2% perf-profile.children.cycles-pp.cpuidle_enter 51.00 -4.1 46.93 ± 2% perf-profile.children.cycles-pp.cpuidle_enter_state 53.42 ± 2% -3.9 49.47 perf-profile.children.cycles-pp.start_secondary 7.39 ± 4% -3.8 3.59 ± 13% perf-profile.children.cycles-pp.sysvec_apic_timer_interrupt 5.09 ± 6% -2.7 2.42 ± 13% perf-profile.children.cycles-pp.__sysvec_apic_timer_interrupt 5.05 ± 6% -2.7 2.38 ± 13% perf-profile.children.cycles-pp.hrtimer_interrupt 2.50 ± 7% -1.6 0.93 ± 37% perf-profile.children.cycles-pp.ktime_get 1.85 ± 12% -1.1 0.74 ± 23% perf-profile.children.cycles-pp.clockevents_program_event 2.69 ± 13% -1.1 1.60 ± 19% perf-profile.children.cycles-pp.menu_select 2.38 ± 4% -1.1 1.29 ± 11% perf-profile.children.cycles-pp.__hrtimer_run_queues 1.71 ± 5% -0.8 0.86 ± 14% perf-profile.children.cycles-pp.tick_sched_timer 1.47 ± 5% -0.7 0.76 ± 16% perf-profile.children.cycles-pp.tick_sched_handle 1.43 ± 5% -0.7 0.74 ± 16% perf-profile.children.cycles-pp.update_process_times 1.50 ± 18% -0.7 0.83 ± 25% perf-profile.children.cycles-pp.tick_nohz_get_sleep_length 1.23 ± 23% -0.6 0.65 ± 31% perf-profile.children.cycles-pp.tick_nohz_next_event 1.05 ± 6% -0.4 0.61 ± 6% perf-profile.children.cycles-pp.irq_exit_rcu 0.68 ± 30% -0.4 0.29 ± 56% perf-profile.children.cycles-pp.timekeeping_max_deferment 0.50 ± 11% -0.4 0.14 ± 51% perf-profile.children.cycles-pp.ktime_get_update_offsets_now 0.78 ± 11% -0.3 0.44 ± 17% perf-profile.children.cycles-pp.scheduler_tick 0.85 ± 6% -0.3 0.51 ± 6% perf-profile.children.cycles-pp.__softirqentry_text_start 0.50 ± 15% -0.3 0.22 ± 25% perf-profile.children.cycles-pp.tick_nohz_irq_exit 0.34 ± 15% -0.2 0.13 ± 48% perf-profile.children.cycles-pp.calc_global_load_tick 0.45 ± 9% -0.2 0.26 ± 6% perf-profile.children.cycles-pp.rebalance_domains 0.46 ± 9% -0.2 0.27 ± 8% perf-profile.children.cycles-pp.lapic_next_deadline 0.85 ± 6% -0.2 0.66 ± 19% perf-profile.children.cycles-pp.process_one_work 0.46 ± 5% -0.2 0.28 ± 11% perf-profile.children.cycles-pp.native_irq_return_iret 0.92 ± 5% -0.2 0.73 ± 18% perf-profile.children.cycles-pp.ret_from_fork 0.85 ± 6% -0.2 0.67 ± 19% perf-profile.children.cycles-pp.worker_thread 0.91 ± 5% -0.2 0.73 ± 18% perf-profile.children.cycles-pp.kthread 0.48 ± 4% -0.1 0.33 ± 9% perf-profile.children.cycles-pp.sched_clock_cpu 0.37 ± 11% -0.1 0.23 ± 13% perf-profile.children.cycles-pp.read_tsc 0.42 ± 5% -0.1 0.29 ± 9% perf-profile.children.cycles-pp.native_sched_clock 0.34 ± 12% -0.1 0.22 ± 9% perf-profile.children.cycles-pp.perf_mux_hrtimer_handler 0.29 ± 9% -0.1 0.18 ± 7% perf-profile.children.cycles-pp.irqtime_account_irq 0.17 ± 12% -0.1 0.06 ± 54% perf-profile.children.cycles-pp._raw_spin_trylock 0.28 ± 7% -0.1 0.17 ± 7% perf-profile.children.cycles-pp.irq_work_run_list 0.27 ± 7% -0.1 0.17 ± 6% perf-profile.children.cycles-pp.asm_sysvec_irq_work 0.27 ± 7% -0.1 0.17 ± 6% perf-profile.children.cycles-pp.sysvec_irq_work 0.27 ± 7% -0.1 0.17 ± 6% perf-profile.children.cycles-pp.__sysvec_irq_work 0.27 ± 7% -0.1 0.17 ± 6% perf-profile.children.cycles-pp.irq_work_run 0.27 ± 7% -0.1 0.17 ± 6% perf-profile.children.cycles-pp.irq_work_single 0.27 ± 7% -0.1 0.17 ± 6% perf-profile.children.cycles-pp.printk 0.27 ± 7% -0.1 0.17 ± 6% perf-profile.children.cycles-pp.vprintk_emit 0.27 ± 7% -0.1 0.17 ± 6% perf-profile.children.cycles-pp.console_unlock 0.26 ± 8% -0.1 0.16 ± 8% perf-profile.children.cycles-pp.serial8250_console_write 0.25 ± 7% -0.1 0.15 ± 7% perf-profile.children.cycles-pp.uart_console_write 0.18 ± 25% -0.1 0.07 ± 37% perf-profile.children.cycles-pp.tsc_verify_tsc_adjust 0.25 ± 7% -0.1 0.15 ± 8% perf-profile.children.cycles-pp.wait_for_xmitr 0.18 ± 25% -0.1 0.08 ± 34% perf-profile.children.cycles-pp.arch_cpu_idle_enter 0.27 ± 11% -0.1 0.18 ± 12% perf-profile.children.cycles-pp.load_balance 0.24 ± 7% -0.1 0.15 ± 9% perf-profile.children.cycles-pp.serial8250_console_putchar 0.32 ± 4% -0.1 0.24 ± 13% perf-profile.children.cycles-pp.update_rq_clock 0.21 ± 13% -0.1 0.13 ± 10% perf-profile.children.cycles-pp.io_serial_in 0.21 ± 12% -0.1 0.14 ± 8% perf-profile.children.cycles-pp.hrtimer_next_event_without 0.20 ± 13% -0.1 0.14 ± 11% perf-profile.children.cycles-pp.find_busiest_group 0.09 ± 11% -0.1 0.03 ± 99% perf-profile.children.cycles-pp.timerqueue_del 0.15 ± 14% -0.1 0.09 ± 30% perf-profile.children.cycles-pp.rcu_sched_clock_irq 0.18 ± 7% -0.1 0.11 ± 17% perf-profile.children.cycles-pp.__hrtimer_next_event_base 0.19 ± 14% -0.1 0.13 ± 12% perf-profile.children.cycles-pp.update_sd_lb_stats 0.12 ± 13% -0.1 0.07 ± 11% perf-profile.children.cycles-pp.__remove_hrtimer 0.14 ± 11% -0.1 0.09 ± 11% perf-profile.children.cycles-pp.run_rebalance_domains 0.14 ± 13% -0.0 0.09 ± 11% perf-profile.children.cycles-pp.hrtimer_update_next_event 0.12 ± 16% -0.0 0.07 ± 19% perf-profile.children.cycles-pp.update_irq_load_avg 0.14 ± 13% -0.0 0.09 ± 20% perf-profile.children.cycles-pp.update_blocked_averages 0.12 ± 10% +0.0 0.14 ± 5% perf-profile.children.cycles-pp.update_curr 0.07 ± 7% +0.0 0.09 ± 7% perf-profile.children.cycles-pp.tick_nohz_idle_exit 0.08 ± 30% +0.0 0.11 ± 18% perf-profile.children.cycles-pp.map_id_range_down 0.04 ± 73% +0.0 0.08 ± 13% perf-profile.children.cycles-pp.xfs_verify_agbno 0.20 ± 15% +0.0 0.25 ± 7% perf-profile.children.cycles-pp.filename_parentat 0.25 ± 14% +0.0 0.30 ± 9% perf-profile.children.cycles-pp.xfs_lookup_get_search_key 0.37 ± 9% +0.1 0.43 ± 7% perf-profile.children.cycles-pp.kmem_cache_alloc 0.32 ± 7% +0.1 0.39 ± 5% perf-profile.children.cycles-pp.__xfs_dir3_free_read 0.64 ± 6% +0.1 0.72 ± 6% perf-profile.children.cycles-pp.filename_lookup 0.57 ± 7% +0.1 0.66 ± 4% perf-profile.children.cycles-pp.xfs_inobt_get_maxrecs 0.50 ± 7% +0.1 0.59 ± 2% perf-profile.children.cycles-pp.xfs_dir2_node_addname_int 0.36 ± 6% +0.1 0.46 ± 9% perf-profile.children.cycles-pp.xfs_imap_to_bp 0.33 ± 2% +0.1 0.43 ± 11% perf-profile.children.cycles-pp.xfs_dir3_data_read 0.45 ± 6% +0.1 0.54 ± 6% perf-profile.children.cycles-pp.xfs_buf_rele 0.54 ± 7% +0.1 0.64 ± 8% perf-profile.children.cycles-pp.xfs_inode_item_format 0.60 ± 4% +0.1 0.71 ± 8% perf-profile.children.cycles-pp.xfs_trans_log_inode 1.21 ± 4% +0.1 1.34 ± 3% perf-profile.children.cycles-pp.xfs_buf_unlock 0.75 ± 3% +0.1 0.89 ± 8% perf-profile.children.cycles-pp.xfs_dir2_leafn_remove 0.70 ± 6% +0.1 0.84 ± 10% perf-profile.children.cycles-pp.xfs_perag_get 0.77 ± 3% +0.1 0.91 ± 7% perf-profile.children.cycles-pp.xfs_iunlink 1.15 ± 2% +0.1 1.29 ± 3% perf-profile.children.cycles-pp.up 1.08 ± 3% +0.1 1.23 ± 6% perf-profile.children.cycles-pp.xfs_btree_read_buf_block 1.29 ± 4% +0.2 1.46 ± 2% perf-profile.children.cycles-pp.xfs_buf_item_release 0.83 ± 3% +0.2 1.00 ± 7% perf-profile.children.cycles-pp.down 0.80 ± 3% +0.2 0.98 ± 7% perf-profile.children.cycles-pp.__down 0.86 ± 3% +0.2 1.04 ± 6% perf-profile.children.cycles-pp.xfs_buf_lock 1.22 ± 4% +0.2 1.40 ± 6% perf-profile.children.cycles-pp.xfs_btree_lookup_get_block 1.19 ± 3% +0.2 1.38 ± 5% perf-profile.children.cycles-pp.memcpy_erms 0.94 ± 8% +0.2 1.13 ± 5% perf-profile.children.cycles-pp.xfs_iunlink_remove 1.39 ± 2% +0.2 1.58 ± 4% perf-profile.children.cycles-pp.xfs_buf_item_format_segment 1.47 ± 2% +0.2 1.67 ± 4% perf-profile.children.cycles-pp.xfs_buf_item_format 0.95 ± 8% +0.2 1.16 ± 7% perf-profile.children.cycles-pp.xfs_dialloc_select_ag 1.47 ± 2% +0.2 1.70 ± 2% perf-profile.children.cycles-pp.xfs_errortag_test 1.61 ± 5% +0.3 1.87 perf-profile.children.cycles-pp.xfs_da3_node_lookup_int 1.41 ± 7% +0.3 1.69 ± 2% perf-profile.children.cycles-pp.xfs_da_read_buf 1.89 ± 3% +0.3 2.19 ± 4% perf-profile.children.cycles-pp.xfs_btree_lookup 1.82 ± 4% +0.3 2.16 ± 5% perf-profile.children.cycles-pp.xfs_read_agi 2.05 ± 4% +0.4 2.41 ± 2% perf-profile.children.cycles-pp.xfs_dir2_node_removename 2.27 ± 4% +0.4 2.63 ± 3% perf-profile.children.cycles-pp.xfs_dir_removename 3.93 ± 4% +0.4 4.35 ± 3% perf-profile.children.cycles-pp.__xfs_btree_check_sblock 2.70 +0.4 3.14 ± 5% perf-profile.children.cycles-pp.xlog_cil_insert_items 2.71 ± 2% +0.5 3.19 ± 3% perf-profile.children.cycles-pp.xfs_buf_find 2.93 ± 3% +0.5 3.45 ± 3% perf-profile.children.cycles-pp.xfs_buf_get_map 2.96 ± 2% +0.5 3.51 ± 3% perf-profile.children.cycles-pp.xfs_buf_read_map 4.06 ± 3% +0.7 4.77 ± 4% perf-profile.children.cycles-pp.xfs_trans_read_buf_map 6.62 +0.8 7.39 ± 3% perf-profile.children.cycles-pp.xfs_btree_check_sblock 9.72 ± 3% +0.8 10.54 ± 2% perf-profile.children.cycles-pp.xfs_dialloc_ag 5.92 ± 2% +0.8 6.75 ± 4% perf-profile.children.cycles-pp.xfs_log_commit_cil 6.11 ± 2% +0.9 6.97 ± 4% perf-profile.children.cycles-pp.__xfs_trans_commit 5.58 ± 3% +0.9 6.47 ± 2% perf-profile.children.cycles-pp.xfs_vn_unlink 5.56 ± 3% +0.9 6.46 ± 2% perf-profile.children.cycles-pp.xfs_remove 5.15 ± 4% +0.9 6.07 ± 8% perf-profile.children.cycles-pp.__percpu_counter_sum 10.88 ± 2% +0.9 11.80 ± 3% perf-profile.children.cycles-pp.xfs_inobt_get_rec 5.41 ± 4% +0.9 6.36 ± 9% perf-profile.children.cycles-pp.vfs_statfs 5.36 ± 4% +1.0 6.31 ± 9% perf-profile.children.cycles-pp.xfs_fs_statfs 5.39 ± 4% +1.0 6.34 ± 9% perf-profile.children.cycles-pp.statfs_by_dentry 6.25 ± 3% +1.0 7.30 ± 7% perf-profile.children.cycles-pp.user_statfs 6.36 ± 3% +1.1 7.42 ± 7% perf-profile.children.cycles-pp.__do_sys_statfs 11.53 ± 3% +1.2 12.69 perf-profile.children.cycles-pp.xfs_dir_ialloc 10.20 ± 3% +1.2 11.36 ± 4% perf-profile.children.cycles-pp.xfs_difree 11.37 ± 3% +1.4 12.74 ± 3% perf-profile.children.cycles-pp.xfs_ifree 15.83 ± 3% +1.7 17.50 perf-profile.children.cycles-pp.xfs_create 13.28 ± 3% +1.7 14.96 ± 2% perf-profile.children.cycles-pp.xfs_fs_destroy_inode 13.22 ± 3% +1.7 14.90 ± 2% perf-profile.children.cycles-pp.xfs_inactive_ifree 13.23 ± 3% +1.7 14.92 ± 2% perf-profile.children.cycles-pp.xfs_inactive 13.44 ± 3% +1.7 15.14 ± 2% perf-profile.children.cycles-pp.destroy_inode 15.95 ± 3% +1.7 17.65 perf-profile.children.cycles-pp.xfs_generic_create 15.99 ± 3% +1.7 17.70 perf-profile.children.cycles-pp.vfs_mkdir 16.34 ± 3% +1.8 18.12 perf-profile.children.cycles-pp.do_mkdirat 18.25 ± 2% +1.8 20.07 ± 3% perf-profile.children.cycles-pp.xfs_check_agi_freecount 19.38 ± 3% +2.6 22.00 perf-profile.children.cycles-pp.vfs_rmdir 19.73 ± 3% +2.7 22.42 perf-profile.children.cycles-pp.do_rmdir 43.47 +5.4 48.90 perf-profile.children.cycles-pp.do_syscall_64 43.51 +5.4 48.95 perf-profile.children.cycles-pp.entry_SYSCALL_64_after_hwframe 2.19 ± 7% -1.5 0.73 ± 46% perf-profile.self.cycles-pp.ktime_get 1.09 ± 5% -0.4 0.67 ± 7% perf-profile.self.cycles-pp.cpuidle_enter_state 0.68 ± 30% -0.4 0.28 ± 57% perf-profile.self.cycles-pp.timekeeping_max_deferment 0.45 ± 11% -0.3 0.11 ± 65% perf-profile.self.cycles-pp.ktime_get_update_offsets_now 0.37 ± 31% -0.3 0.12 ± 20% perf-profile.self.cycles-pp.update_process_times 0.34 ± 15% -0.2 0.13 ± 48% perf-profile.self.cycles-pp.calc_global_load_tick 0.46 ± 9% -0.2 0.27 ± 8% perf-profile.self.cycles-pp.lapic_next_deadline 0.46 ± 5% -0.2 0.28 ± 11% perf-profile.self.cycles-pp.native_irq_return_iret 0.36 ± 11% -0.1 0.22 ± 12% perf-profile.self.cycles-pp.read_tsc 0.41 ± 6% -0.1 0.29 ± 10% perf-profile.self.cycles-pp.native_sched_clock 0.17 ± 12% -0.1 0.06 ± 54% perf-profile.self.cycles-pp._raw_spin_trylock 0.17 ± 25% -0.1 0.07 ± 39% perf-profile.self.cycles-pp.tsc_verify_tsc_adjust 0.21 ± 13% -0.1 0.13 ± 10% perf-profile.self.cycles-pp.io_serial_in 0.13 ± 11% -0.1 0.07 ± 52% perf-profile.self.cycles-pp.rcu_sched_clock_irq 0.10 ± 25% -0.1 0.04 ± 75% perf-profile.self.cycles-pp.asm_sysvec_apic_timer_interrupt 0.15 ± 9% -0.0 0.10 ± 20% perf-profile.self.cycles-pp.__hrtimer_next_event_base 0.12 ± 14% -0.0 0.07 ± 19% perf-profile.self.cycles-pp.update_irq_load_avg 0.10 ± 16% -0.0 0.06 ± 9% perf-profile.self.cycles-pp.irqtime_account_irq 0.13 ± 15% -0.0 0.09 ± 16% perf-profile.self.cycles-pp.update_sd_lb_stats 0.05 ± 47% +0.0 0.08 ± 18% perf-profile.self.cycles-pp.schedule_timeout 0.14 ± 11% +0.0 0.17 ± 4% perf-profile.self.cycles-pp.xfs_buf_get_map 0.10 ± 17% +0.0 0.13 ± 8% perf-profile.self.cycles-pp.xfs_bmapi_read 0.04 ± 72% +0.0 0.08 ± 13% perf-profile.self.cycles-pp.xfs_verify_agbno 0.32 ± 8% +0.1 0.38 ± 8% perf-profile.self.cycles-pp.xlog_cil_insert_items 0.47 ± 9% +0.1 0.54 ± 6% perf-profile.self.cycles-pp.xfs_verify_dir_ino 0.45 ± 6% +0.1 0.52 ± 4% perf-profile.self.cycles-pp.xfs_check_agi_freecount 0.47 ± 6% +0.1 0.56 ± 5% perf-profile.self.cycles-pp.xfs_inobt_get_maxrecs 0.58 ± 6% +0.1 0.70 ± 8% perf-profile.self.cycles-pp.xfs_perag_get 1.17 ± 3% +0.2 1.36 ± 6% perf-profile.self.cycles-pp.memcpy_erms 1.43 +0.2 1.66 ± 2% perf-profile.self.cycles-pp.xfs_errortag_test 3.34 ± 5% +0.4 3.71 ± 3% perf-profile.self.cycles-pp.__xfs_btree_check_sblock 2.62 ± 5% +0.5 3.16 ± 11% perf-profile.self.cycles-pp.__percpu_counter_sum stress-ng.dir.ops 3e+06 +-----------------------------------------------------------------+ | OO O O | 2.8e+06 |-+ O OO O O OO O O O OO OO O OO OO O OO OO OO O O | 2.6e+06 |-+ O | | O O | 2.4e+06 |-+ O O O | 2.2e+06 |-+ O | | | 2e+06 |-+ | 1.8e+06 |-+ | | | 1.6e+06 |.+ .+ .+ +.+ .+ .+ .+.++.+.++.+.+ .+. .+ | 1.4e+06 |-++.+ +.+ :+ + +.+ +.+.++ + ++ +.+.++.+.++ | | + | 1.2e+06 +-----------------------------------------------------------------+ stress-ng.dir.ops_per_sec 50000 +-------------------------------------------------------------------+ | OO O O | 45000 |-+ O O O O O O O O O OO OO O O O O OO O OO O OO O O | | O O | | O | 40000 |-+ O O O O | | O | 35000 |-+ | | | 30000 |-+ | | | |.+ .+. + .+ +. .+ +. +.+.+. +.+. +. | 25000 |-++.+ + + + + + : +.+ +.+.+ + + + ++.+.+ +.+.++.+.++ | | + + + + | 20000 +-------------------------------------------------------------------+ [*] bisect-good sample [O] bisect-bad sample Disclaimer: Results have been estimated based on internal Intel analysis and are provided for informational purposes only. Any difference in system hardware or software design or configuration may affect actual performance. --- 0DAY/LKP+ Test Infrastructure Open Source Technology Center https://lists.01.org/hyperkitty/list/lkp@lists.01.org Intel Corporation Thanks, Oliver Sang