Fwd: FIO 3.11

* Fwd: FIO 3.11
       [not found] <BN7PR08MB409869032E10FEC58C9CBE08D9FF0@BN7PR08MB4098.namprd08.prod.outlook.com>
@ 2018-10-17 19:03 ` Jens Axboe
  2018-10-17 21:41   ` Elliott, Robert (Persistent Memory)
  0 siblings, 1 reply; 5+ messages in thread
From: Jens Axboe @ 2018-10-17 19:03 UTC (permalink / raw)
  To: fio; +Cc: Etienne-Hugues Fortin

-------- Forwarded Message --------
Subject: 	FIO 3.11
Date: 	Wed, 17 Oct 2018 18:45:03 +0000
From: 	Etienne-Hugues Fortin <efortin@cyberspicace.com>
To: 	axboe@kernel.dk <axboe@kernel.dk>

Hi,

ï¿½

Sorry to bother you with those questions but Iï¿½ve been unable to find the answers by myself. Iï¿½ve been experimenting with FIO for a little over a month but Iï¿½m still unsure about those aspects.

ï¿½

First, when we create a profile, my assumption is that if I have two jobs without using ï¿½stonewallï¿½, each job will run at the same time and will have the same priority. So, if I want to do a job with a 50% read/write but with 25% random and 75% sequential, I would need a job that look like this:

ï¿½

[128k_ random]

rw=randread

[128k_ seq_1]

numjobs=3

rw=read

[128k_ random_w]

rw=randwrite

[128k_ seq_1_w]

numjobs=3

rw=write

ï¿½

Here,ï¿½ I have exactly 6 jobs on 8 (75%) sequential and of this, 50% are read. Iï¿½ve run this job over a standard NFS server that has SSD as the backend. As I have 4 clients running FIO simultaneous, Iï¿½m getting 4 results which are 65/113, 68/108, 67/115, 66/111 MiB. Thatï¿½s about 37% read, 63% write. What would explain the fact that Iï¿½m getting more write then read with this scenario? Iï¿½ve also run the job on an all-flash array that do NAS and I got similar results. So, I think this may have with the speed at which the storage is answering. On standard storage unity, write are usually cached in memory so we get acknowledgement faster. I could make the hypothesis that as soon as FIO is getting an answer, it will resend a new IO right away which would then skew the results the way Iï¿½m seeing it. Is that what is happening? If so, outside of trying to work the numjobs required to get about 50% R/W, is there a better way of doing this?

ï¿½

Related to the same simulation, I saw that we can also use rw and randrw to combine read/write in sequential/random workload. Then, we can set the percentage of the mixed with rwmixread and/or rwmixwrite. What Iï¿½ve not been able to find is if the read/write are made on each file or not. With a single jobs, it has to be. However, if I put numjobs=10, will all of the 10 files got read and write or I can have 40% of the file to get the write and 60% of them to get read? My current thinking is that the mixed workload is executed on each file. If thatï¿½s the case, thatï¿½s not what I want to simulate. Is there another way of doing this?

ï¿½

I also did some testing on raw device (block storage) and I tried the latency target profile. I can see that if I give it the latency target needed (0.75 ms) with a 75 percentile, it will get to about the level of IOPS that the unit is able to sustain while being under the target, 75% of the time. However, Iï¿½m not seeing the iothread gradually increasing as the test is progressing. As such, creating the final job file is still not as simple as looking at the result and taking the iothread that make sense. Did I missed something in the results or the latency profile job? My job file is the following:

ï¿½

[global]

ioengine=libaio

invalidate=1

iodepth=32

direct=1

latency_target=750

latency_window=30s

latency_percentile=75

per_job_logs=0

group_reporting

bs=32k

rw=randrw

rwmixread=60

write_bw_log=blocks

write_lat_log=blocks

write_iops_log=blocks

ï¿½

[sdb-32k]

filename=/dev/sdb

[sdc-32k]

filename=/dev/sdc

[sdd-32k]

filename=/dev/sdd

[sde-32k]

filename=/dev/sde

ï¿½

The last thing is about the bw/iops/lat logs. Iï¿½ve used them in the latency test and in some other jobfile and the filename format Iï¿½m getting is always filename_[bw|clat|iops|lat|slat].log.server_name

ï¿½

With this format, the fio_generate_plots and fio2gnuplot canï¿½t find any files. It seems those program expect to find files in the format *_bw.log so with no servername at the end. Is there a way to get the servername at the beginning of the filename? The documentation seems to indicate that it should end in .log but thatï¿½s not what Iï¿½m getting up to now.

ï¿½

Thank you for your assistance and keep up the good work on such a great tool.

ï¿½

Etienne Fortin

^ permalink raw reply	[flat|nested] 5+ messages in thread