All of lore.kernel.org
 help / color / mirror / Atom feed
* Mellanox ConnectX-3 VF driver can't work with 16 CPUs?
@ 2017-02-09 10:03 Dexuan Cui
  2017-02-09 16:47 ` Tariq Toukan
  2017-02-16 14:57 ` jackm
  0 siblings, 2 replies; 3+ messages in thread
From: Dexuan Cui @ 2017-02-09 10:03 UTC (permalink / raw)
  To: Yishai Hadas, Tariq Toukan; +Cc: netdev

Hi, 
While trying SR-IOV with a Linux guest running on Hyper-V, I found this issue:
the VF driver can't work if the guest has 16 virtual CPUs (less vCPUs, e.g. 8,  can work fine):

[    9.927820] mlx4_core: Mellanox ConnectX core driver v2.2-1 (Feb, 2014)
[    9.927882] mlx4_core: Initializing b961:00:02.0
[    9.970994] mlx4_core b961:00:02.0: Detected virtual function - running in slave mode
[    9.976783] mlx4_core b961:00:02.0: Sending reset
[    9.985858] mlx4_core b961:00:02.0: Sending vhcr0
[   10.004855] mlx4_core b961:00:02.0: HCA minimum page size:512
[   10.010465] mlx4_core b961:00:02.0: Timestamping is not supported in slave mode
[   10.203065] mlx4_core b961:00:02.0: Failed to initialize event queue table, aborting
[   10.226728] mlx4_core: probe of b961:00:02.0 failed with error -12

I'm using the mainline kernel (4.10.0-rc4).

Any idea?

Thanks,
-- Dexuan

^ permalink raw reply	[flat|nested] 3+ messages in thread

* Re: Mellanox ConnectX-3 VF driver can't work with 16 CPUs?
  2017-02-09 10:03 Mellanox ConnectX-3 VF driver can't work with 16 CPUs? Dexuan Cui
@ 2017-02-09 16:47 ` Tariq Toukan
  2017-02-16 14:57 ` jackm
  1 sibling, 0 replies; 3+ messages in thread
From: Tariq Toukan @ 2017-02-09 16:47 UTC (permalink / raw)
  To: Dexuan Cui, Yishai Hadas, Tariq Toukan; +Cc: netdev

Hi Dexuan,


On 09/02/2017 12:03 PM, Dexuan Cui wrote:
> Hi,
> While trying SR-IOV with a Linux guest running on Hyper-V, I found this issue:
> the VF driver can't work if the guest has 16 virtual CPUs (less vCPUs, e.g. 8,  can work fine):
>
> [    9.927820] mlx4_core: Mellanox ConnectX core driver v2.2-1 (Feb, 2014)
> [    9.927882] mlx4_core: Initializing b961:00:02.0
> [    9.970994] mlx4_core b961:00:02.0: Detected virtual function - running in slave mode
> [    9.976783] mlx4_core b961:00:02.0: Sending reset
> [    9.985858] mlx4_core b961:00:02.0: Sending vhcr0
> [   10.004855] mlx4_core b961:00:02.0: HCA minimum page size:512
> [   10.010465] mlx4_core b961:00:02.0: Timestamping is not supported in slave mode
> [   10.203065] mlx4_core b961:00:02.0: Failed to initialize event queue table, aborting
> [   10.226728] mlx4_core: probe of b961:00:02.0 failed with error -12
>
> I'm using the mainline kernel (4.10.0-rc4).
>
> Any idea?
Thanks for your report.
We'll check this internally and get back to you.

Regards,
Tariq
>
> Thanks,
> -- Dexuan
>

^ permalink raw reply	[flat|nested] 3+ messages in thread

* Re: Mellanox ConnectX-3 VF driver can't work with 16 CPUs?
  2017-02-09 10:03 Mellanox ConnectX-3 VF driver can't work with 16 CPUs? Dexuan Cui
  2017-02-09 16:47 ` Tariq Toukan
@ 2017-02-16 14:57 ` jackm
  1 sibling, 0 replies; 3+ messages in thread
From: jackm @ 2017-02-16 14:57 UTC (permalink / raw)
  To: Dexuan Cui; +Cc: Yishai Hadas, Tariq Toukan, netdev

On Thu, 9 Feb 2017 10:03:20 +0000
Dexuan Cui <decui@microsoft.com> wrote:

> Hi, 
> While trying SR-IOV with a Linux guest running on Hyper-V, I found
> this issue: the VF driver can't work if the guest has 16 virtual CPUs
> (less vCPUs, e.g. 8,  can work fine):
> 
> [    9.927820] mlx4_core: Mellanox ConnectX core driver v2.2-1 (Feb,
> 2014) [    9.927882] mlx4_core: Initializing b961:00:02.0
> [    9.970994] mlx4_core b961:00:02.0: Detected virtual function -
> running in slave mode [    9.976783] mlx4_core b961:00:02.0: Sending
> reset [    9.985858] mlx4_core b961:00:02.0: Sending vhcr0
> [   10.004855] mlx4_core b961:00:02.0: HCA minimum page size:512
> [   10.010465] mlx4_core b961:00:02.0: Timestamping is not supported
> in slave mode [   10.203065] mlx4_core b961:00:02.0: Failed to
> initialize event queue table, aborting [   10.226728] mlx4_core:
> probe of b961:00:02.0 failed with error -12
> 
> I'm using the mainline kernel (4.10.0-rc4).
> 
> Any idea?
> 
Thanks for reporting this, Dexuan.  We indeed found a bug in the Linux
driver, which requested too many EQEs per EQ created in SRIOV mode.
This resulted in the VF exhausting its supply of MTT entries.

We will submit an upstream fix for this soon (after thorough testing).

-Jack

> Thanks,
> -- Dexuan
> 

^ permalink raw reply	[flat|nested] 3+ messages in thread

end of thread, other threads:[~2017-02-16 14:57 UTC | newest]

Thread overview: 3+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2017-02-09 10:03 Mellanox ConnectX-3 VF driver can't work with 16 CPUs? Dexuan Cui
2017-02-09 16:47 ` Tariq Toukan
2017-02-16 14:57 ` jackm

This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.