crypto: caam from tasklet to threadirq

* crypto: caam from tasklet to threadirq
       [not found] <DB5PR04MB130229BBD433C8FF7BDD975FEEF90@DB5PR04MB1302.eurprd04.prod.outlook.com>
@ 2016-09-16 14:01 ` Cata Vasile
  2016-09-16 16:53   ` Russell King - ARM Linux
  2016-09-20 16:12   ` Russell King - ARM Linux
  0 siblings, 2 replies; 6+ messages in thread
From: Cata Vasile @ 2016-09-16 14:01 UTC (permalink / raw)
  To: rmk+kernel; +Cc: Horia Geanta Neag, linux-crypto

Hi,

We've tried to test and benchmark your submitted work[1].

Cryptographic offloading is also used in IPsec in the Linux Kernel. In heavy traffic scenarios, the NIC driver competes with the crypto device driver. Most NICs use the NAPI context, which is one of the most prioritized context types. In IPsec scenarios  the performance is trashed because, although raw data gets in to device, the data is encrypted/decrypted and the dequeue code in CAAM driver has a hard time being scheduled to actually call the callback to notify the networking stack it can continue working with  that data.

Being this scenario, at heavy load, the Kernel warns on rcu stalls and the forwarding path has a lot of latency.
Have you tried benchmarking the board you used for testing?

I have ran some on our other platforms. The after benchmark fails to run at the top level of the before results. The rcu stall does not always stall in the same place. The after ping latency is greater, and oscillates a lot.

It might be a good idea for the codebase to change to a threadirq, but from a pragmatic perspective, the whole system has to suffer. That is one the reasons most crypto accelerators try to run dequeue primitives in high priority contexts.

Regards,
Catalin Vasile

[1] https://git.kernel.org/cgit/linux/kernel/git/herbert/cryptodev-2.6.git/commit/?id=66d2e2028091a074aa1290d2eeda5ddb1a6c329c

^ permalink raw reply	[flat|nested] 6+ messages in thread