[SPDK] Long test times and Python interpreter

* [SPDK] Long test times and Python interpreter
@ 2019-05-01  0:44 Harris, James R
  0 siblings, 0 replies; 3+ messages in thread
From: Harris, James R @ 2019-05-01  0:44 UTC (permalink / raw)
  To: spdk

[-- Attachment #1: Type: text/plain, Size: 2010 bytes --]

I’ve been digging into the NVMe-oF target tests today – starting with looking at how long it takes to run all of the per-patch tests.  I’d like to consider a new strategy for using rpc.py from our test scripts to reduce the number of times we start the Python interpreter.  There’s a non-trivial amount of overhead just related to starting the interpreter – and tests like nvmf/shutdown do 30 RPCs (10 subsystems * 3 RPCs to set up each subsystem).  If it’s only 100ms overhead per RPC, that’s 3 seconds wasted*.

There are two approaches I can see:

  1.  Save the configuration in JSON RPC format, and check in the file into the test directory.  Then we just have a single RPC to call (load_config).
  2.  Add a new top level function to rpc.py called “run_from_file”.  This is not an RPC – it just loads a file, and runs each line as if it is was its own rpc.py invocation.
     *   A variation could also be piping this file through stdin

I think #2 is probably better.  At least for nvmf/shutdown, we use a different number of subsystems based on whether we’re testing with a real RDMA NIC or with soft-roce.  In those cases, we’d need two separate JSON RPC format files.

This would also be helpful for adding more stressful tests – for example, create 1000 subsystems or 1000 logical volumes.  The SPDK application itself can do this pretty quickly, but I think our current strategy for actually building that config via RPC would force it be run only nightly due to length of time.

I’ll probably try putting something together tomorrow, but wanted to see if anyone else had any additional thoughts or comments.

Thanks,

-Jim

* I know 3 seconds doesn’t sound like much.  It’s probably more like 5 or 6 seconds in this case.  We probably run about 8 patches per hour through the test pool – so every 7-8 seconds saved is a 1% improvement in test pool throughput.  There are several other places we could use this – especially vhost and iscsi.

^ permalink raw reply	[flat|nested] 3+ messages in thread