* mlx5_core irisc not responding
@ 2020-04-21 4:23 Alexey Kardashevskiy
0 siblings, 0 replies; only message in thread
From: Alexey Kardashevskiy @ 2020-04-21 4:23 UTC (permalink / raw)
To: netdev; +Cc: Leon Romanovsky, Saeed Mahameed
Hi!
I got a Mellanox CX4 card constantly complaining about "irisc not
responding" (below). Is there a way to get a better idea what it is
unhappy about? It is plugged to an experimental POWER9 box which might
have PCI problems. The kernel is v5.6.0.
I thought I try updating the firmware first but mlxup refuses to update
the firmware as it is an OEM adapter (below); and there is no way to
find out which Mellanox PSID corresponds to what I got, any hints? Thanks,
The device is:
root@ltcssss2:~# mstflint -d 0001:19:00.0 q
Image type: FS3
FW Version: 14.26.0226
FW Release Date: 4.8.2019
Product Version: 6.0226
Description: UID GuidsNumber
Base GUID: 0894ef030080a89f 8
Base MAC: 00000894ef80a89f 8
Image VSD: N/A
Device VSD: N/A
PSID: IBM0000000034
Security Attributes: N/A
root@ltcswift2:~# ./mlxup
Querying Mellanox devices firmware ...
Device #1:
----------
Device Type: ConnectX4LX
Part Number: IBM_CX4LX_2p_10GE_x4_Ax
Description: ConnectX-4 LX 10 and 1 G-BaseT dual-port BP; PCIe3.0 x4;
PSID: IBM0000000034
PCI Device Name: 0001:19:00.0
Base MAC: 0894ef80a89f
Versions: Current Available
FW 14.26.0226 N/A Status: No
matching image found
dmesg (the same for :0001:19:00.1):
[ 13.283418] mlx5_core 0001:19:00.0: print_health_info:374:(pid 0):
assert_var[0] 0x00000001
[ 13.283447] mlx5_core 0001:19:00.0: print_health_info:374:(pid 0):
assert_var[1] 0x0087f14c
[ 13.283481] mlx5_core 0001:19:00.0: print_health_info:374:(pid 0):
assert_var[2] 0x00000000
[ 13.283535] mlx5_core 0001:19:00.0: print_health_info:374:(pid 0):
assert_var[3] 0x01020000
[ 13.283588] mlx5_core 0001:19:00.0: print_health_info:374:(pid 0):
assert_var[4] 0x00000000
[ 13.283631] mlx5_core 0001:19:00.0: print_health_info:377:(pid 0):
assert_exit_ptr 0x0080e428
[ 13.283667] mlx5_core 0001:19:00.0: print_health_info:379:(pid 0):
assert_callra 0x0080e070
[ 13.283726] mlx5_core 0001:19:00.0: print_health_info:381:(pid 0):
fw_ver 14.26.226
[ 13.283786] mlx5_core 0001:19:00.0: print_health_info:382:(pid 0):
hw_id 0x0000020b
[ 13.283840] mlx5_core 0001:19:00.0: print_health_info:383:(pid 0):
irisc_index 2
[ 13.283908] mlx5_core 0001:19:00.0: print_health_info:385:(pid 0):
synd 0x7: irisc not responding
[ 13.283949] mlx5_core 0001:19:00.0: print_health_info:386:(pid 0):
ext_synd 0x00c0
[ 13.284014] mlx5_core 0001:19:00.0: print_health_info:388:(pid 0):
raw fw_ver 0xe01a00e2
--
Alexey
^ permalink raw reply [flat|nested] only message in thread
only message in thread, other threads:[~2020-04-21 4:23 UTC | newest]
Thread overview: (only message) (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2020-04-21 4:23 mlx5_core irisc not responding Alexey Kardashevskiy
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).