linux-fsdevel.vger.kernel.org archive mirror
 help / color / mirror / Atom feed
* [PATCHSET v8] Support for polled and buffered aio (and more)
@ 2018-12-13 17:56 Jens Axboe
  2018-12-13 17:56 ` [PATCH 01/26] fs: add an iopoll method to struct file_operations Jens Axboe
                   ` (26 more replies)
  0 siblings, 27 replies; 44+ messages in thread
From: Jens Axboe @ 2018-12-13 17:56 UTC (permalink / raw)
  To: linux-block, linux-fsdevel, linux-aio; +Cc: hch, jmoyer, clm

We're getting closer... Some good bug fixes in this release, fully
stable for me.

I decided to make the SQ/CQ ring interface the primary innovation
here, and build the new features on top of that. This means that
the io_submit/io_getevents interface doesn't get any of the advanced
features, outside of polled IO. If you want any of the new fancy,
you really sould use the ring interface, it's more efficient.

As before, there's a sample application as an introduction to how
to use the ring interface:

http://git.kernel.dk/cgit/fio/plain/t/aio-ring.c

Outside of that, I also built a fio engine for it yesterday. If you
update your fio to current -git, you'll have an aio-ring engine. As
an example, if you want to run buffered async IO, you would do:

[aio-buffered]
ioengine=aio-ring
sqwq=1
filename=/dev/nvme1n1
direct=0
ramp_time=2
runtime=10s
iodepth=64
rw=randread
norandommap
bs=4k

which enables the SQ/CQ workqueue offload. As before, cached data is
served without punting to a workqueue.

aio-ring also supports the sqthread interface, and it supports fixed
buffers, and (of course) polled IO. For a speed run, you'd do:

[aio-polled]
ioengine=aio-ring
hipri=1
fixedbufs=1
filename=/dev/nvme1n1
direct=1
ramp_time=2
runtime=10s
iodepth=64
iodepth_batch_complete_max=64
iodepth_batch=32
rw=randread
norandommap
bs=4k
cpus_allowed=0

which enables polling (hipri=1) and fixed buffers (fixedbufs=1).

Interface wise, I made an important change to how the SQ ring works.
Previously the ring was just an index into an array of iocbs. This
was somewhat inflexible, as I found out when I converted applications
to use it, as it requires an ordered relationship between application
IO units and iocbs. Now the ring is just indexes into the iocb array,
so you can submit multiple iocbs in one go without having to rely
on iocbs being ordered in the array.

As before, find this in my aio-poll branch:

http://git.kernel.dk/cgit/linux-block/log/?h=aio-poll

or clone it here:

git://git.kernel.dk/linux-block aio-poll

and the code is based on the pending 4.21 block changes, for-4.21/block.

Since v7:

- Fix fput error handling in aio_prep_rw()
- Dropped USERIOCB for non-ring interface
- Fix CQ ring sizing
- Change sq_ring to be ring of indexes, not iocbs
- Rename aio_iocb_ring -> aio_sq_ring, aio_io_event_ring -> aio_cq_ring
- Fix not passing submission state always, if setup
- Fix double mmget/use_mm for SQTHREAD for rare conditions
- Fix __kthread_bind_mask() warning for SQTHREAD exit
- Comments
- Various bug fixes

 Documentation/filesystems/vfs.txt      |    3 +
 Documentation/sysctl/fs.txt            |    8 +-
 arch/x86/entry/syscalls/syscall_64.tbl |    2 +
 block/bio.c                            |   33 +-
 fs/aio.c                               | 1891 ++++++++++++++++++++++--
 fs/block_dev.c                         |   34 +-
 fs/file.c                              |   15 +-
 fs/file_table.c                        |   10 +-
 fs/gfs2/file.c                         |    2 +
 fs/iomap.c                             |   57 +-
 fs/xfs/xfs_file.c                      |    1 +
 include/linux/bio.h                    |    1 +
 include/linux/blk_types.h              |    2 +
 include/linux/file.h                   |    2 +
 include/linux/fs.h                     |    5 +-
 include/linux/iomap.h                  |    1 +
 include/linux/syscalls.h               |    4 +
 include/uapi/asm-generic/unistd.h      |    4 +-
 include/uapi/linux/aio_abi.h           |   37 +
 kernel/sys_ni.c                        |    2 +
 20 files changed, 1916 insertions(+), 198 deletions(-)

-- 
Jens Axboe

^ permalink raw reply	[flat|nested] 44+ messages in thread

end of thread, other threads:[~2018-12-18 15:07 UTC | newest]

Thread overview: 44+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2018-12-13 17:56 [PATCHSET v8] Support for polled and buffered aio (and more) Jens Axboe
2018-12-13 17:56 ` [PATCH 01/26] fs: add an iopoll method to struct file_operations Jens Axboe
2018-12-13 17:56 ` [PATCH 02/26] block: add REQ_HIPRI_ASYNC Jens Axboe
2018-12-13 17:56 ` [PATCH 03/26] block: wire up block device iopoll method Jens Axboe
2018-12-13 17:56 ` [PATCH 04/26] block: use REQ_HIPRI_ASYNC for non-sync polled IO Jens Axboe
2018-12-13 17:56 ` [PATCH 05/26] iomap: wire up the iopoll method Jens Axboe
2018-12-13 17:56 ` [PATCH 06/26] aio: use assigned completion handler Jens Axboe
2018-12-13 17:56 ` [PATCH 07/26] aio: separate out ring reservation from req allocation Jens Axboe
2018-12-13 17:56 ` [PATCH 08/26] aio: don't zero entire aio_kiocb aio_get_req() Jens Axboe
2018-12-13 17:56 ` [PATCH 09/26] aio: only use blk plugs for > 2 depth submissions Jens Axboe
2018-12-13 17:56 ` [PATCH 10/26] aio: use iocb_put() instead of open coding it Jens Axboe
2018-12-13 17:56 ` [PATCH 11/26] aio: split out iocb copy from io_submit_one() Jens Axboe
2018-12-16 16:46   ` Christoph Hellwig
2018-12-13 17:56 ` [PATCH 12/26] aio: abstract out io_event filler helper Jens Axboe
2018-12-16 16:46   ` Christoph Hellwig
2018-12-13 17:56 ` [PATCH 13/26] aio: add io_setup2() system call Jens Axboe
2018-12-13 17:56 ` [PATCH 14/26] aio: support for IO polling Jens Axboe
2018-12-18 10:32   ` Benny Halevy
2018-12-18 13:27     ` Jens Axboe
2018-12-18 14:54       ` Benny Halevy
2018-12-18 15:07         ` Jens Axboe
2018-12-13 17:56 ` [PATCH 15/26] aio: add submission side request cache Jens Axboe
2018-12-13 17:56 ` [PATCH 16/26] fs: add fget_many() and fput_many() Jens Axboe
2018-12-16 16:37   ` Christoph Hellwig
2018-12-16 16:57     ` Jens Axboe
2018-12-13 17:56 ` [PATCH 17/26] aio: use fget/fput_many() for file references Jens Axboe
2018-12-13 17:56 ` [PATCH 18/26] aio: split iocb init from allocation Jens Axboe
2018-12-13 17:56 ` [PATCH 19/26] aio: batch aio_kiocb allocation Jens Axboe
2018-12-13 17:56 ` [PATCH 20/26] aio: split old ring complete out from aio_complete() Jens Axboe
2018-12-13 17:56 ` [PATCH 21/26] aio: add support for submission/completion rings Jens Axboe
2018-12-13 17:56 ` [PATCH 22/26] block: add BIO_HOLD_PAGES flag Jens Axboe
2018-12-13 17:56 ` [PATCH 23/26] block: implement bio helper to add iter bvec pages to bio Jens Axboe
2018-12-16 16:40   ` Christoph Hellwig
2018-12-16 16:43     ` Jens Axboe
2018-12-13 17:56 ` [PATCH 24/26] fs: add support for mapping an ITER_BVEC for O_DIRECT Jens Axboe
2018-12-16 16:41   ` Christoph Hellwig
2018-12-16 16:45     ` Jens Axboe
2018-12-13 17:56 ` [PATCH 25/26] aio: add support for pre-mapped user IO buffers Jens Axboe
2018-12-13 17:56 ` [PATCH 26/26] aio: support kernel side submission for aio with SCQRING Jens Axboe
2018-12-16 16:34 ` [PATCHSET v8] Support for polled and buffered aio (and more) Christoph Hellwig
2018-12-16 16:37   ` Jens Axboe
2018-12-16 16:42     ` Christoph Hellwig
2018-12-16 16:43       ` Jens Axboe
2018-12-16 16:44         ` Christoph Hellwig

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).