Detecting whether hug pages are working with fsdax

* Detecting whether hug pages are working with fsdax
@ 2022-11-09 21:59 Eliot Moss
  2022-11-14 15:09 ` Detecting whether huge pages are working with fsdax - only partial success Eliot Moss
  0 siblings, 1 reply; 4+ messages in thread
From: Eliot Moss @ 2022-11-09 21:59 UTC (permalink / raw)
  To: nvdimm

Dear nvdimmers -

I tried following Darrick Wong's advice from this page:

https://nvdimm.wiki.kernel.org/2mib_fs_dax

In particular, these instructions:
================================================================================
The way that I normally do this is by looking at the filesystem DAX tracepoints:

# cd /sys/kernel/debug/tracing
# echo 1 > events/fs_dax/dax_pmd_fault_done/enable
<run test which faults in filesystem DAX mappings>
We can then look at the dax_pmd_fault_done events in

/sys/kernel/debug/tracing/trace
and see whether they were successful. An event that successfully faulted in a filesystem DAX PMD
looks like this:

big-1434  [008] ....  1502.341229: dax_pmd_fault_done: dev 259:0 ino 0xc shared
WRITE|ALLOW_RETRY|KILLABLE|USER address 0x10505000 vm_start 0x10200000 vm_end
0x10700000 pgoff 0x305 max_pgoff 0x1400 NOPAGE
The first thing to look at is the NOPAGE return value at the end of the line. This means that the
fault succeeded and didn't return a page cache page, which is expected for DAX. A 2 MiB fault that
failed and fell back to 4 KiB DAX faults will instead look like this:

small-1431  [008] ....  1499.402672: dax_pmd_fault_done: dev 259:0 ino 0xc shared
WRITE|ALLOW_RETRY|KILLABLE|USER address 0x10420000 vm_start 0x10200000 vm_end
0x10500000 pgoff 0x220 max_pgoff 0x3ffff FALLBACK
You can see that this fault resulted in a fallback to 4 KiB faults via the FALLBACK return code at
the end of the line. The rest of the data in this line can help you determine why the fallback
happened. In this case it was because I intentionally created an mmap() area that was smaller than 2
MiB.
================================================================================

I get no trace output whatsoever, whether I am using 2Mb huge pages or 1Gb
huge pages.  My mmap calls are successful but I get no trace output at all,
only this:

================================================================================
# tracer: nop
#
# entries-in-buffer/entries-written: 0/0   #P:63
#
#                                _-----=> irqs-off
#                               / _----=> need-resched
#                              | / _---=> hardirq/softirq
#                              || / _--=> preempt-depth
#                              ||| / _-=> migrate-disable
#                              |||| /     delay
#           TASK-PID     CPU#  |||||  TIMESTAMP  FUNCTION
#              | |         |   |||||     |         |
================================================================================

Any suggestions about what may be different in my system?  It is clear that we
are mapping files created in an fdax file system, and that the contents of the
files are changing.

Regards - Eliot Moss

^ permalink raw reply	[flat|nested] 4+ messages in thread