linux-fsdevel.vger.kernel.org archive mirror
 help / color / mirror / Atom feed
* [PATCHSET v6] Support for polled aio (and more)
@ 2018-12-07 22:19 Jens Axboe
  2018-12-07 22:19 ` [PATCH 01/26] fs: add an iopoll method to struct file_operations Jens Axboe
                   ` (25 more replies)
  0 siblings, 26 replies; 35+ messages in thread
From: Jens Axboe @ 2018-12-07 22:19 UTC (permalink / raw)
  To: linux-block, linux-fsdevel, linux-aio; +Cc: hch, jmoyer, clm

For the grand introduction to this feature, see my original posting
here:

https://lore.kernel.org/linux-block/20181117235317.7366-1-axboe@kernel.dk/

and refer to the previous postings of this patchset for whatever
features were added there. Particularly v4 has some performance results:

https://lore.kernel.org/linux-block/20181130165646.27341-1-axboe@kernel.dk/

No new features in this version, I think we've got what we need. Just
a collection of fixes. Especially interesting is probably that XFS is
tested in all configurations now, it works with both IO polling and
fixed user buffers.

Find it here for your cgit browsing:

http://git.kernel.dk/cgit/linux-block/log/?h=aio-poll

or clone/pull from this branch:

git://git.kernel.dk/linux-block aio-poll


Since v5

- Correct spin/no-spin condition for polling
- Add some comments
- Fix missing io_event_ring(2) if !CONFIG_AIO
- Make sure XFS works for all cases
- Fix cq/sq ring sizing
- Make cq/sq ring work with non-polled IO
- Make cq/sq ring work without fixed buffers
- Ensure that io_ring_enter() doesn't race with kill_ioctx()
- Rebase on top of for-4.21/block


 Documentation/filesystems/vfs.txt      |    3 +
 arch/x86/entry/syscalls/syscall_64.tbl |    2 +
 block/bio.c                            |   33 +-
 fs/aio.c                               | 1538 +++++++++++++++++++++---
 fs/block_dev.c                         |   34 +-
 fs/file.c                              |   15 +-
 fs/file_table.c                        |   10 +-
 fs/gfs2/file.c                         |    2 +
 fs/iomap.c                             |   57 +-
 fs/xfs/xfs_file.c                      |    1 +
 include/linux/bio.h                    |    1 +
 include/linux/blk_types.h              |    2 +
 include/linux/file.h                   |    2 +
 include/linux/fs.h                     |    5 +-
 include/linux/iomap.h                  |    1 +
 include/linux/syscalls.h               |    5 +
 include/uapi/asm-generic/unistd.h      |    4 +-
 include/uapi/linux/aio_abi.h           |   32 +
 kernel/sys_ni.c                        |    2 +
 19 files changed, 1552 insertions(+), 197 deletions(-)

-- 
Jens Axboe

^ permalink raw reply	[flat|nested] 35+ messages in thread
* [PATCHSET v8] Support for polled and buffered aio (and more)
@ 2018-12-13 17:56 Jens Axboe
  2018-12-13 17:56 ` [PATCH 12/26] aio: abstract out io_event filler helper Jens Axboe
  0 siblings, 1 reply; 35+ messages in thread
From: Jens Axboe @ 2018-12-13 17:56 UTC (permalink / raw)
  To: linux-block, linux-fsdevel, linux-aio; +Cc: hch, jmoyer, clm

We're getting closer... Some good bug fixes in this release, fully
stable for me.

I decided to make the SQ/CQ ring interface the primary innovation
here, and build the new features on top of that. This means that
the io_submit/io_getevents interface doesn't get any of the advanced
features, outside of polled IO. If you want any of the new fancy,
you really sould use the ring interface, it's more efficient.

As before, there's a sample application as an introduction to how
to use the ring interface:

http://git.kernel.dk/cgit/fio/plain/t/aio-ring.c

Outside of that, I also built a fio engine for it yesterday. If you
update your fio to current -git, you'll have an aio-ring engine. As
an example, if you want to run buffered async IO, you would do:

[aio-buffered]
ioengine=aio-ring
sqwq=1
filename=/dev/nvme1n1
direct=0
ramp_time=2
runtime=10s
iodepth=64
rw=randread
norandommap
bs=4k

which enables the SQ/CQ workqueue offload. As before, cached data is
served without punting to a workqueue.

aio-ring also supports the sqthread interface, and it supports fixed
buffers, and (of course) polled IO. For a speed run, you'd do:

[aio-polled]
ioengine=aio-ring
hipri=1
fixedbufs=1
filename=/dev/nvme1n1
direct=1
ramp_time=2
runtime=10s
iodepth=64
iodepth_batch_complete_max=64
iodepth_batch=32
rw=randread
norandommap
bs=4k
cpus_allowed=0

which enables polling (hipri=1) and fixed buffers (fixedbufs=1).

Interface wise, I made an important change to how the SQ ring works.
Previously the ring was just an index into an array of iocbs. This
was somewhat inflexible, as I found out when I converted applications
to use it, as it requires an ordered relationship between application
IO units and iocbs. Now the ring is just indexes into the iocb array,
so you can submit multiple iocbs in one go without having to rely
on iocbs being ordered in the array.

As before, find this in my aio-poll branch:

http://git.kernel.dk/cgit/linux-block/log/?h=aio-poll

or clone it here:

git://git.kernel.dk/linux-block aio-poll

and the code is based on the pending 4.21 block changes, for-4.21/block.

Since v7:

- Fix fput error handling in aio_prep_rw()
- Dropped USERIOCB for non-ring interface
- Fix CQ ring sizing
- Change sq_ring to be ring of indexes, not iocbs
- Rename aio_iocb_ring -> aio_sq_ring, aio_io_event_ring -> aio_cq_ring
- Fix not passing submission state always, if setup
- Fix double mmget/use_mm for SQTHREAD for rare conditions
- Fix __kthread_bind_mask() warning for SQTHREAD exit
- Comments
- Various bug fixes

 Documentation/filesystems/vfs.txt      |    3 +
 Documentation/sysctl/fs.txt            |    8 +-
 arch/x86/entry/syscalls/syscall_64.tbl |    2 +
 block/bio.c                            |   33 +-
 fs/aio.c                               | 1891 ++++++++++++++++++++++--
 fs/block_dev.c                         |   34 +-
 fs/file.c                              |   15 +-
 fs/file_table.c                        |   10 +-
 fs/gfs2/file.c                         |    2 +
 fs/iomap.c                             |   57 +-
 fs/xfs/xfs_file.c                      |    1 +
 include/linux/bio.h                    |    1 +
 include/linux/blk_types.h              |    2 +
 include/linux/file.h                   |    2 +
 include/linux/fs.h                     |    5 +-
 include/linux/iomap.h                  |    1 +
 include/linux/syscalls.h               |    4 +
 include/uapi/asm-generic/unistd.h      |    4 +-
 include/uapi/linux/aio_abi.h           |   37 +
 kernel/sys_ni.c                        |    2 +
 20 files changed, 1916 insertions(+), 198 deletions(-)

-- 
Jens Axboe

^ permalink raw reply	[flat|nested] 35+ messages in thread
* [PATCHSET v5] Support for polled aio
@ 2018-12-04 23:37 Jens Axboe
  2018-12-04 23:37 ` [PATCH 12/26] aio: abstract out io_event filler helper Jens Axboe
  0 siblings, 1 reply; 35+ messages in thread
From: Jens Axboe @ 2018-12-04 23:37 UTC (permalink / raw)
  To: linux-block, linux-fsdevel, linux-aio; +Cc: hch, jmoyer

For the grand introduction to this feature, see my original posting
here:

https://lore.kernel.org/linux-block/20181117235317.7366-1-axboe@kernel.dk/

and refer to the previous postings of this patchset for whatever
features were added there. Particularly v4 has some performance results:

https://lore.kernel.org/linux-block/20181130165646.27341-1-axboe@kernel.dk/

New in this version is io_ring_enter(2) and a ring based SQ/CQ
interface. The rings are mapped from user space, an application can
submit IO by writing to its own SQ ring of struct iocbs, and get
completions by reading the CQ ring of io_events. This eliminates the
need to copy iocbs and io_events completely. It also opens up the
possibility of doing polled IO without any system calls at all, if we
add a thread on the kernel side... I've done experiments with that, but
not ready yet. Doesn't require any changes to the API. I can trivially
do 1.2M IOPS with a single thread through this, and 173K IOPS with
QD=1/sync.

You can also find the patches in my aio-poll branch:

http://git.kernel.dk/cgit/linux-block/log/?h=aio-poll

or by cloning:

git://git.kernel.dk/linux-block aio-poll

Patches are against for-4.21/block.


Since v4

- Switch to using ITER_BVEC for user mapped buffers.
- Drop unneeded import_kvec().
- Use RLIMIT_MEMLOCK as a cap on total memory pinned.
- Fix poll check with min_events == 0 actually checking for events
- Add REQ_HIPRI_ASYNC
- Add ring interface and io_ring_enter(2)
- Change io_setup2() system call to accommodate the new ring interface

 Documentation/filesystems/vfs.txt      |    3 +
 arch/x86/entry/syscalls/syscall_64.tbl |    2 +
 block/bio.c                            |   33 +-
 fs/aio.c                               | 1440 +++++++++++++++++++++---
 fs/block_dev.c                         |   34 +-
 fs/file.c                              |   15 +-
 fs/file_table.c                        |   10 +-
 fs/gfs2/file.c                         |    2 +
 fs/iomap.c                             |   52 +-
 fs/xfs/xfs_file.c                      |    1 +
 include/linux/bio.h                    |    1 +
 include/linux/blk_types.h              |    2 +
 include/linux/file.h                   |    2 +
 include/linux/fs.h                     |    5 +-
 include/linux/iomap.h                  |    1 +
 include/linux/syscalls.h               |    5 +
 include/uapi/asm-generic/unistd.h      |    4 +-
 include/uapi/linux/aio_abi.h           |   32 +
 kernel/sys_ni.c                        |    1 +
 19 files changed, 1450 insertions(+), 195 deletions(-)

-- 
Jens Axboe

^ permalink raw reply	[flat|nested] 35+ messages in thread

end of thread, other threads:[~2018-12-16 16:46 UTC | newest]

Thread overview: 35+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2018-12-07 22:19 [PATCHSET v6] Support for polled aio (and more) Jens Axboe
2018-12-07 22:19 ` [PATCH 01/26] fs: add an iopoll method to struct file_operations Jens Axboe
2018-12-07 22:19 ` [PATCH 02/26] block: add REQ_HIPRI_ASYNC Jens Axboe
2018-12-07 22:19 ` [PATCH 03/26] block: wire up block device iopoll method Jens Axboe
2018-12-07 22:19 ` [PATCH 04/26] block: use REQ_HIPRI_ASYNC for non-sync polled IO Jens Axboe
2018-12-07 22:19 ` [PATCH 05/26] iomap: wire up the iopoll method Jens Axboe
2018-12-07 22:19 ` [PATCH 06/26] aio: use assigned completion handler Jens Axboe
2018-12-07 22:19 ` [PATCH 07/26] aio: separate out ring reservation from req allocation Jens Axboe
2018-12-07 22:19 ` [PATCH 08/26] aio: don't zero entire aio_kiocb aio_get_req() Jens Axboe
2018-12-07 22:19 ` [PATCH 09/26] aio: only use blk plugs for > 2 depth submissions Jens Axboe
2018-12-07 22:20 ` [PATCH 10/26] aio: use iocb_put() instead of open coding it Jens Axboe
2018-12-07 22:20 ` [PATCH 11/26] aio: split out iocb copy from io_submit_one() Jens Axboe
2018-12-07 22:20 ` [PATCH 12/26] aio: abstract out io_event filler helper Jens Axboe
2018-12-07 22:20 ` [PATCH 13/26] aio: add io_setup2() system call Jens Axboe
2018-12-09 10:12   ` Benny Halevy
2018-12-09 14:37     ` Jens Axboe
2018-12-07 22:20 ` [PATCH 14/26] aio: add support for having user mapped iocbs Jens Axboe
2018-12-07 22:20 ` [PATCH 15/26] aio: support for IO polling Jens Axboe
2018-12-09 10:02   ` Benny Halevy
2018-12-09 14:36     ` Jens Axboe
2018-12-10  8:10       ` Benny Halevy
2018-12-07 22:20 ` [PATCH 16/26] aio: add submission side request cache Jens Axboe
2018-12-07 22:20 ` [PATCH 17/26] fs: add fget_many() and fput_many() Jens Axboe
2018-12-07 22:20 ` [PATCH 18/26] aio: use fget/fput_many() for file references Jens Axboe
2018-12-07 22:20 ` [PATCH 19/26] aio: split iocb init from allocation Jens Axboe
2018-12-07 22:20 ` [PATCH 20/26] aio: batch aio_kiocb allocation Jens Axboe
2018-12-07 22:20 ` [PATCH 21/26] block: add BIO_HOLD_PAGES flag Jens Axboe
2018-12-07 22:20 ` [PATCH 22/26] block: implement bio helper to add iter bvec pages to bio Jens Axboe
2018-12-07 22:20 ` [PATCH 23/26] fs: add support for mapping an ITER_BVEC for O_DIRECT Jens Axboe
2018-12-07 22:20 ` [PATCH 24/26] aio: add support for pre-mapped user IO buffers Jens Axboe
2018-12-07 22:20 ` [PATCH 25/26] aio: split old ring complete out from aio_complete() Jens Axboe
2018-12-07 22:20 ` [PATCH 26/26] aio: add support for submission/completion rings Jens Axboe
  -- strict thread matches above, loose matches on Subject: below --
2018-12-13 17:56 [PATCHSET v8] Support for polled and buffered aio (and more) Jens Axboe
2018-12-13 17:56 ` [PATCH 12/26] aio: abstract out io_event filler helper Jens Axboe
2018-12-16 16:46   ` Christoph Hellwig
2018-12-04 23:37 [PATCHSET v5] Support for polled aio Jens Axboe
2018-12-04 23:37 ` [PATCH 12/26] aio: abstract out io_event filler helper Jens Axboe

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).