[RFC] nvfs: a filesystem for persistent memory

* [RFC] nvfs: a filesystem for persistent memory
@ 2020-09-15 12:34 Mikulas Patocka
  2020-09-15 13:00 ` Matthew Wilcox
  2020-09-15 15:16 ` Dan Williams
  0 siblings, 2 replies; 38+ messages in thread
From: Mikulas Patocka @ 2020-09-15 12:34 UTC (permalink / raw)
  To: Linus Torvalds, Alexander Viro, Andrew Morton, Dan Williams,
	Vishal Verma, Dave Jiang, Ira Weiny, Matthew Wilcox, Jan Kara,
	Eric Sandeen, Dave Chinner, Kani, Toshi, Norton, Scott J,
	Tadakamadla, Rajesh (DCIG/CDI/HPS Perf)
  Cc: linux-kernel, linux-fsdevel, linux-nvdimm

Hi

I am developing a new filesystem suitable for persistent memory - nvfs. 
The goal is to have a small and fast filesystem that can be used on 
DAX-based devices. Nvfs maps the whole device into linear address space 
and it completely bypasses the overhead of the block layer and buffer 
cache.

In the past, there was nova filesystem for pmem, but it was abandoned a 
year ago (the last version is for the kernel 5.1 - 
https://github.com/NVSL/linux-nova ). Nvfs is smaller and performs better.

The design of nvfs is similar to ext2/ext4, so that it fits into the VFS 
layer naturally, without too much glue code.

I'd like to ask you to review it.

tarballs:
	http://people.redhat.com/~mpatocka/nvfs/
git:
	git://leontynka.twibright.com/nvfs.git
the description of filesystem internals:
	http://people.redhat.com/~mpatocka/nvfs/INTERNALS
benchmarks:
	http://people.redhat.com/~mpatocka/nvfs/BENCHMARKS

TODO:

- programs run approximately 4% slower when running from Optane-based 
persistent memory. Therefore, programs and libraries should use page cache 
and not DAX mapping.

- when the fsck.nvfs tool mmaps the device /dev/pmem0, the kernel uses 
buffer cache for the mapping. The buffer cache slows does fsck by a factor 
of 5 to 10. Could it be possible to change the kernel so that it maps DAX 
based block devices directly?

- __copy_from_user_inatomic_nocache doesn't flush cache for leading and 
trailing bytes.

Mikulas

^ permalink raw reply	[flat|nested] 38+ messages in thread