All of lore.kernel.org
 help / color / mirror / Atom feed
* QLE2432 initiator fails to see any LUN's on one of servers while using 5QLE2464 as a target.
@ 2021-04-10 15:27 TomK
  2021-04-14  2:11 ` Roman Bolshakov
  2021-04-14  8:12 ` Saurav Kashyap
  0 siblings, 2 replies; 4+ messages in thread
From: TomK @ 2021-04-10 15:27 UTC (permalink / raw)
  To: linux-scsi

Hey Everyone,

In a basic SAN config using a few QLE2432 adapters connected to a target 
using a QLE2464 adapter via a EMC Brocade 5000B switch, one of the 
initiators fails to present a LUN after being online for sometime.  Over 
a period of time, few months, the initiator on one of the hosts stops 
showing available LUN's to the underlying VMware clients.

All the other hosts are fine, except for this specific one.  Unless that 
is, I reboot the target completely, affecting all the other working 
hosts in the process.

Digging a bit closer, I notice that the issue seems to strike an uncanny 
similarity to the following one:

ht tps://www.spinics.net/lists/linux-scsi/msg136622.html

However, I'm wondering why only one of the servers is affected and not 
the others?  Seems it is a card issue with the first host (please see 
image) however I'm not familiar with all the messages printed so can't 
be sure of the reason nor link things as effectively. Neither of the two 
ports of the HBA on Server 1 work, when in this disconnected state, 
despite switching SFP's on the Brocade switch, switching cables etc. 
That is, again, until I reboot the target entirely.

Initiator:
                  50:01:43:80:16:77:99:38; 50:01:43:80:16:77:99:3a;
                  50:01:43:80:16:77:99:3b; 50:01:43:80:16:77:99:39

Target:
		(see below)

This also started to happen a few months ago.  Everything was fine for a 
few years before that.


Log snippet:

Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-e818: 
is_send_status=1, cmd->bufflen=73728, cmd->sg_cnt=0, 
cmd->dma_data_direction=2 se_cmd[00000000cc1dc466] qp 0
Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-e874:2: 
qlt_free_cmd: se_cmd[00000000cc1dc466] ox_id 00e1
Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-e872:2: 
qlt_24xx_atio_pkt_all_vps: qla_target(0): type 6 ox_id 0110
Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-e818: 
is_send_status=1, cmd->bufflen=10240, cmd->sg_cnt=0, 
cmd->dma_data_direction=2 se_cmd[00000000cc1dc466] qp 0
Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-e874:2: 
qlt_free_cmd: se_cmd[00000000cc1dc466] ox_id 0110
Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-e872:2: 
qlt_24xx_atio_pkt_all_vps: qla_target(0): type d ox_id 0000
Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-e82e:2: 
IMMED_NOTIFY ATIO
Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-f826:2: 
qla_target(0): Port ID: 01:09:00 ELS opcode: 0x03 lid 7 
50:01:43:80:16:77:99:3a
Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-f897:2: Linking 
sess 00000000088a6dfa [0] wwn 50:01:43:80:16:77:99:3a with PLOGI ACK to 
wwn 50:01:43:80:16:77:99:3a s_id 01:09:00, ref=1 pla 000000004bdf1d76 link 0
Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-28f9:2: 
qlt_handle_login 4772 50:01:43:80:16:77:99:3a  DS 8
Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-28f9:2: 
qlt_handle_login 4803 50:01:43:80:16:77:99:3a post del sess
Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-e801:2: 
Scheduling sess 00000000088a6dfa for deletion
Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-f826:2: 
qla_target(0): Exit ELS opcode: 0x03 res 0
Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-290a:2: 
qlt_unreg_sess sess 00000000088a6dfa for deletion 50:01:43:80:16:77:99:3a
Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-287d:2: FCPort 
50:01:43:80:16:77:99:3a state transitioned from ONLINE to LOST - 
portid=010900.
Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-28a3:2: Port 
login retry 500143801677993a, lid 0x0007 retry cnt=45.
Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-f884:2: 
qlt_free_session_done: se_sess 000000001f2eac78 / sess 00000000088a6dfa 
from port 50:01:43:80:16:77:99:3a loop_id 0x07 s_id 01:09:00 logout 1 
keep 1 els_logo 0
Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-f886:2: 
qlt_free_session_done: waiting for sess 00000000088a6dfa logout
Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-2870:2: 
Async-logout - hdl=172 loop-id=7 portid=010900 50:01:43:80:16:77:99:3a.
Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-5836:2: 
Async-logout complete - 50:01:43:80:16:77:99:3a hdl=172 portid=010900 
iop0=0.
Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-f893:2: 
qlt_logo_completion_handler: se_sess 000000001f2eac78 / sess 
00000000088a6dfa from port 50:01:43:80:16:77:99:3a loop_id 0x07 s_id 
01:09:00 LOGO failed: 0x0
Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-e872:2: 
qlt_24xx_atio_pkt_all_vps: qla_target(0): type 6 ox_id 008a
Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-e818: 
is_send_status=1, cmd->bufflen=4096, cmd->sg_cnt=0, 
cmd->dma_data_direction=2 se_cmd[00000000cc1dc466] qp 0
Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-e874:2: 
qlt_free_cmd: se_cmd[00000000cc1dc466] ox_id 008a
Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-680b:2: 
isp_abort_needed=0 loop_resync_needed=0 fcport_update_needed=0 
start_dpc=0 reset_marker_needed=0
Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-680c:2: 
beacon_blink_needed=0 isp_unrecoverable=0 fcoe_ctx_reset_needed=0 
vp_dpc_needed=0 relogin_needed=1.
Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-4801:2: DPC 
handler waking up, dpc_flags=0x100.
Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-480d:2: Relogin 
scheduled.
Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-4800:2: DPC 
handler sleeping.
Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-2908:2: 
qla2x00_relogin 21:01:00:1b:32:a1:81:21 DS 0 LS 7
Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-2902:2: 
qla24xx_handle_relogin_event 21:01:00:1b:32:a1:81:21 DS 0 LS 7 P 0 del 2 
cnfl           (null) rscn 0|0 login 0|0 fl 3
Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-28d8:2: 
qla24xx_fcport_handle_login 21:01:00:1b:32:a1:81:21 DS 0 LS 7 P 0 fl 3 
confl           (null) rscn 0|0 login 0 retry 45 lid 4096 scan 2
Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-2908:2: 
qla2x00_relogin 50:01:43:80:16:77:99:38 DS 3 LS 4
Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-2902:2: 
qla24xx_handle_relogin_event 50:01:43:80:16:77:99:38 DS 3 LS 4 P 0 del 1 
cnfl           (null) rscn 0|0 login 4|18 fl 1
Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-28d8:2: 
qla24xx_fcport_handle_login 50:01:43:80:16:77:99:38 DS 3 LS 4 P 0 fl 1 
confl           (null) rscn 0|0 login 18 retry 45 lid 4 scan 2
Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-2908:2: 
qla2x00_relogin 50:01:43:80:16:77:99:3a DS 10 LS 3
Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-2902:2: 
qla24xx_handle_relogin_event 50:01:43:80:16:77:99:3a DS 10 LS 3 P 0 del 
1 cnfl           (null) rscn 0|0 login 8|9 fl 1
Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-480e:2: Relogin end.
Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-f887:2: 
qlt_free_session_done: sess 00000000088a6dfa logout completed
Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-f89a:2: se_sess 
          (null) / sess 00000000088a6dfa port 50:01:43:80:16:77:99:3a is 
gone, releasing own PLOGI (ref=1)
Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-5889:2: Sending 
PLOGI ACK to wwn 50:01:43:80:16:77:99:3a s_id 01:09:00 loop_id 0x07 exch 
0x11223c ox_id 0xae
Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-f801:2: 
Unregistration of sess 00000000088a6dfa 50:01:43:80:16:77:99:3a finished 
fcp_cnt 4
Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-28f4:2: 
Async-nack 50:01:43:80:16:77:99:3a hndl 175 PLOGI
Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-28f2:2: Async 
done-nack res 0 50:01:43:80:16:77:99:3a  type 16
Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-5811:2: 
Asynchronous PORT UPDATE ignored 0007/0004/0600.
Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-f83d:2: 
qla_target(0): Port update async event 0x8014 occurred: updating the 
ports database (m[0]=8014, m[1]=7, m[2]=4, m[3]=600)
Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-f83e:2: Async MB 
2: Got PLOGI Complete





Thanks,




Seems similar to this issue:
ht tps://www.spinics.net/lists/linux-scsi/msg136470.html


Log link:

ht 
tps://www.microdevsys.com/WordPressDownloads/qla2xxx-hba.log-recent.start.end.event.txt

Infra Setup (image):

ht tps://www.microdevsys.com/WordPressImages/brocade-san-issue.png



Adapter details on the target.

# QLogic Corp. ISP2432-based 4Gb Fibre Channel to PCI Express HBA ^C
You have mail in /var/spool/mail/root



[root@mbpc-pc 1]# lspci -vvv -s 03:00.0
03:00.0 Fibre Channel: QLogic Corp. ISP2432-based 4Gb Fibre Channel to 
PCI Express HBA (rev 03)
          Subsystem: QLogic Corp. Device 0146
          Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- 
ParErr+ Stepping- SERR+ FastB2B- DisINTx-
          Status: Cap+ 66MHz- UDF- FastB2B- ParErr- DEVSEL=fast >TAbort- 
<TAbort- <MAbort- >SERR- <PERR- INTx-
          Latency: 0, Cache Line Size: 64 bytes
          Interrupt: pin A routed to IRQ 18
          Region 0: I/O ports at ee00 [size=256]
          Region 1: Memory at fdffc000 (64-bit, non-prefetchable) [size=16K]
          [virtual] Expansion ROM at fdf00000 [disabled] [size=256K]
          Capabilities: [44] Power Management version 2
                  Flags: PMEClk- DSI- D1- D2- AuxCurrent=0mA 
PME(D0-,D1-,D2-,D3hot-,D3cold-)
                  Status: D0 NoSoftRst- PME-Enable- DSel=0 DScale=0 PME-
          Capabilities: [4c] Express (v1) Endpoint, MSI 00
                  DevCap: MaxPayload 1024 bytes, PhantFunc 0, Latency 
L0s <4us, L1 <1us
                          ExtTag- AttnBtn+ AttnInd+ PwrInd+ RBE- FLReset-
                  DevCtl: Report errors: Correctable+ Non-Fatal+ Fatal+ 
Unsupported+
                          RlxdOrd+ ExtTag- PhantFunc- AuxPwr- NoSnoop+
                          MaxPayload 128 bytes, MaxReadReq 4096 bytes
                  DevSta: CorrErr- UncorrErr+ FatalErr- UnsuppReq+ 
AuxPwr- TransPend-
                  LnkCap: Port #0, Speed 2.5GT/s, Width x4, ASPM L0s, 
Latency L0 <4us, L1 unlimited
                          ClockPM- Surprise- LLActRep- BwNot-
                  LnkCtl: ASPM Disabled; RCB 64 bytes Disabled- Retrain- 
CommClk-
                          ExtSynch- ClockPM- AutWidDis- BWInt- AutBWInt-
                  LnkSta: Speed 2.5GT/s, Width x4, TrErr- Train- 
SlotClk+ DLActive- BWMgmt- ABWMgmt-
          Capabilities: [64] MSI: Enable- Count=1/16 Maskable- 64bit+
                  Address: 0000000000000000  Data: 0000
          Capabilities: [74] Vital Product Data
                  Product Name: PCI-Express Quad Channel 4Gb Fibre 
Channel HBA
                  Read-only fields:
                          [PN] Part number: QLE2464
                          [SN] Serial number: GFC0840A74113
                          [V0] Vendor specific: PW=15W
                          [MN] Manufacture ID: 50 58 32 36 31 30 34 30 
31 2d 31 31 20 20 41
                          [V1] Vendor specific: 06.12
                          [V3] Vendor specific: 08.01.02
                          [V4] Vendor specific: 03.29
                          [V5] Vendor specific: 03.23
                          [YA] Asset tag:
                          [RV] Reserved: checksum good, 0 byte(s) reserved
                  End
          Capabilities: [7c] MSI-X: Enable- Count=16 Masked-
                  Vector table: BAR=1 offset=00002000
                  PBA: BAR=1 offset=00003000
          Capabilities: [100 v1] Advanced Error Reporting
                  UESta:  DLP- SDES- TLP- FCP- CmpltTO- CmpltAbrt- 
UnxCmplt- RxOF- MalfTLP- ECRC- UnsupReq+ ACSViol-
                  UEMsk:  DLP- SDES- TLP- FCP- CmpltTO- CmpltAbrt- 
UnxCmplt- RxOF- MalfTLP- ECRC- UnsupReq- ACSViol-
                  UESvrt: DLP+ SDES- TLP- FCP+ CmpltTO- CmpltAbrt- 
UnxCmplt- RxOF+ MalfTLP+ ECRC- UnsupReq- ACSViol-
                  CESta:  RxErr- BadTLP- BadDLLP- Rollover- Timeout- 
NonFatalErr-
                  CEMsk:  RxErr- BadTLP- BadDLLP- Rollover- Timeout- 
NonFatalErr-
                  AERCap: First Error Pointer: 14, GenCap+ CGenEn- 
ChkCap+ ChkEn-
          Capabilities: [138 v1] Power Budgeting <?>
          Kernel driver in use: qla2xxx



Target details:


[root@mbpc-pc 1]# systool -c fc_host -v
Class = "fc_host"

    Class Device = "host0"
    Class Device path = 
"/sys/devices/pci0000:00/0000:00:02.0/0000:01:00.0/0000:02:00.0/0000:03:00.0/host0/fc_host/host0"
      dev_loss_tmo        = "45"
      fabric_name         = "0xffffffffffffffff"
      issue_lip           = <store method only>
      max_npiv_vports     = "127"
      node_name           = "0x2002001b32c18121"
      npiv_vports_inuse   = "0"
      port_id             = "0x000000"
      port_name           = "0x2102001b32c18121"
      port_state          = "Linkdown"
      port_type           = "Unknown"
      speed               = "unknown"
      supported_classes   = "Class 3"
      supported_speeds    = "1 Gbit, 2 Gbit, 4 Gbit"
      symbolic_name       = "QLE2464 FW:v8.06.02 DVR:v10.00.00.07-k-debug"
      system_hostname     = ""
      tgtid_bind_type     = "wwpn (World Wide Port Name)"
      uevent              =
      vport_create        = <store method only>
      vport_delete        = <store method only>

      Device = "host0"
      Device path = 
"/sys/devices/pci0000:00/0000:00:02.0/0000:01:00.0/0000:02:00.0/0000:03:00.0/host0"
        fw_dump             =
        issue_logo          = <store method only>
        nvram               = "ISP "
        optrom_ctl          = <store method only>
        optrom              =
        reset               = <store method only>
        sfp                 = ""
        uevent              = "DEVTYPE=scsi_host"
        vpd                 = "▒."


    Class Device = "host1"
    Class Device path = 
"/sys/devices/pci0000:00/0000:00:02.0/0000:01:00.0/0000:02:00.0/0000:03:00.1/host1/fc_host/host1"
      dev_loss_tmo        = "45"
      fabric_name         = "0xffffffffffffffff"
      issue_lip           = <store method only>
      max_npiv_vports     = "127"
      node_name           = "0x2003001b32e18121"
      npiv_vports_inuse   = "0"
      port_id             = "0x000000"
      port_name           = "0x2103001b32e18121"
      port_state          = "Linkdown"
      port_type           = "Unknown"
      speed               = "unknown"
      supported_classes   = "Class 3"
      supported_speeds    = "1 Gbit, 2 Gbit, 4 Gbit"
      symbolic_name       = "QLE2464 FW:v8.06.02 DVR:v10.00.00.07-k-debug"
      system_hostname     = ""
      tgtid_bind_type     = "wwpn (World Wide Port Name)"
      uevent              =
      vport_create        = <store method only>
      vport_delete        = <store method only>

      Device = "host1"
      Device path = 
"/sys/devices/pci0000:00/0000:00:02.0/0000:01:00.0/0000:02:00.0/0000:03:00.1/host1"
        fw_dump             =
        issue_logo          = <store method only>
        nvram               = "ISP "
        optrom_ctl          = <store method only>
        optrom              =
        reset               = <store method only>
        sfp                 = ""
        uevent              = "DEVTYPE=scsi_host"
        vpd                 = "▒."


    Class Device = "host2"
    Class Device path = 
"/sys/devices/pci0000:00/0000:00:02.0/0000:01:00.0/0000:02:01.0/0000:04:00.0/host2/fc_host/host2"
      dev_loss_tmo        = "45"
      fabric_name         = "0x100000051e903dab"
      issue_lip           = <store method only>
      max_npiv_vports     = "127"
      node_name           = "0x2000001b32818121"
      npiv_vports_inuse   = "0"
      port_id             = "0x011300"
      port_name           = "0x2100001b32818121"
      port_state          = "Online"
      port_type           = "NPort (fabric via point-to-point)"
      speed               = "4 Gbit"
      supported_classes   = "Class 3"
      supported_speeds    = "1 Gbit, 2 Gbit, 4 Gbit"
      symbolic_name       = "QLE2464 FW:v8.06.02 DVR:v10.00.00.07-k-debug"
      system_hostname     = ""
      tgtid_bind_type     = "wwpn (World Wide Port Name)"
      uevent              =
      vport_create        = <store method only>
      vport_delete        = <store method only>

      Device = "host2"
      Device path = 
"/sys/devices/pci0000:00/0000:00:02.0/0000:01:00.0/0000:02:01.0/0000:04:00.0/host2"
        fw_dump             =
        issue_logo          = <store method only>
        nvram               = "ISP "
        optrom_ctl          = <store method only>
        optrom              =
        reset               = <store method only>
        sfp                 = ""
        uevent              = "DEVTYPE=scsi_host"
        vpd                 = "▒."


    Class Device = "host3"
    Class Device path = 
"/sys/devices/pci0000:00/0000:00:02.0/0000:01:00.0/0000:02:01.0/0000:04:00.1/host3/fc_host/host3"
      dev_loss_tmo        = "45"
      fabric_name         = "0x100000051e903dab"
      issue_lip           = <store method only>
      max_npiv_vports     = "127"
      node_name           = "0x2001001b32a18121"
      npiv_vports_inuse   = "0"
      port_id             = "0x011700"
      port_name           = "0x2101001b32a18121"
      port_state          = "Online"
      port_type           = "NPort (fabric via point-to-point)"
      speed               = "4 Gbit"
      supported_classes   = "Class 3"
      supported_speeds    = "1 Gbit, 2 Gbit, 4 Gbit"
      symbolic_name       = "QLE2464 FW:v8.06.02 DVR:v10.00.00.07-k-debug"
      system_hostname     = ""
      tgtid_bind_type     = "wwpn (World Wide Port Name)"
      uevent              =
      vport_create        = <store method only>
      vport_delete        = <store method only>

      Device = "host3"
      Device path = 
"/sys/devices/pci0000:00/0000:00:02.0/0000:01:00.0/0000:02:01.0/0000:04:00.1/host3"
        fw_dump             =
        issue_logo          = <store method only>
        nvram               = "ISP "
        optrom_ctl          = <store method only>
        optrom              =
        reset               = <store method only>
        sfp                 = ""
        uevent              = "DEVTYPE=scsi_host"
        vpd                 = "▒."


[root@mbpc-pc 1]#




-- 
Thx,
TK.


^ permalink raw reply	[flat|nested] 4+ messages in thread

* Re: QLE2432 initiator fails to see any LUN's on one of servers while using 5QLE2464 as a target.
  2021-04-10 15:27 QLE2432 initiator fails to see any LUN's on one of servers while using 5QLE2464 as a target TomK
@ 2021-04-14  2:11 ` Roman Bolshakov
  2021-04-14  8:12 ` Saurav Kashyap
  1 sibling, 0 replies; 4+ messages in thread
From: Roman Bolshakov @ 2021-04-14  2:11 UTC (permalink / raw)
  To: TomK; +Cc: linux-scsi

On Sat, Apr 10, 2021 at 11:27:12AM -0400, TomK wrote:
> Hey Everyone,
> 
> In a basic SAN config using a few QLE2432 adapters connected to a target
> using a QLE2464 adapter via a EMC Brocade 5000B switch, one of the
> initiators fails to present a LUN after being online for sometime.  Over a
> period of time, few months, the initiator on one of the hosts stops showing
> available LUN's to the underlying VMware clients.
> 
> All the other hosts are fine, except for this specific one.  Unless that is,
> I reboot the target completely, affecting all the other working hosts in the
> process.
> 
> Digging a bit closer, I notice that the issue seems to strike an uncanny
> similarity to the following one:
> 
> https://www.spinics.net/lists/linux-scsi/msg136622.html

Hi Tom,

The discussion was related to P2P mode and that didn't work reliably
back in 2019. Your case is different because of the fabric switch
involved.

You're running driver v10.00.00.07-k, included in Linux 4.18.  I guess
that's Debian 9 (stretch) or Ubuntu 18.04 (Bionic).

What distribution are you running and what kernel version is installed
on the target?

> 
> However, I'm wondering why only one of the servers is affected and not the
> others?  Seems it is a card issue with the first host (please see image)
> however I'm not familiar with all the messages printed so can't be sure of
> the reason nor link things as effectively. Neither of the two ports of the
> HBA on Server 1 work, when in this disconnected state, despite switching
> SFP's on the Brocade switch, switching cables etc. That is, again, until I
> reboot the target entirely.
> 
> Initiator:
>                  50:01:43:80:16:77:99:38; 50:01:43:80:16:77:99:3a;
>                  50:01:43:80:16:77:99:3b; 50:01:43:80:16:77:99:39
> 
> Target:
> 		(see below)
> 
> This also started to happen a few months ago.  Everything was fine for a few
> years before that.
> 

I wonder if the kernel on the target was updated or may be ESXi itself
was updated?

The log below is helpful but not complete, it's not clear what happened
early after the first link init. To collect full logs you might need to
boot the kernel with qla2xxx.logging=0x7fffffff and then dump everything
related a port of qla2xxx, (e.g. 0000:04.00.0):

journalctl -k | grep 'qla2xxx \[0000:04:00.0\]'

> 
> Log snippet:
> 
> Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-e818:
> is_send_status=1, cmd->bufflen=73728, cmd->sg_cnt=0,
> cmd->dma_data_direction=2 se_cmd[00000000cc1dc466] qp 0
> Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-e874:2: qlt_free_cmd:
> se_cmd[00000000cc1dc466] ox_id 00e1
> Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-e872:2:
> qlt_24xx_atio_pkt_all_vps: qla_target(0): type 6 ox_id 0110
> Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-e818:
> is_send_status=1, cmd->bufflen=10240, cmd->sg_cnt=0,
> cmd->dma_data_direction=2 se_cmd[00000000cc1dc466] qp 0
> Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-e874:2: qlt_free_cmd:
> se_cmd[00000000cc1dc466] ox_id 0110
> Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-e872:2:
> qlt_24xx_atio_pkt_all_vps: qla_target(0): type d ox_id 0000
> Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-e82e:2: IMMED_NOTIFY
> ATIO
> Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-f826:2:
> qla_target(0): Port ID: 01:09:00 ELS opcode: 0x03 lid 7
> 50:01:43:80:16:77:99:3a


Initiator port 3a performs PLOGI.

> Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-f897:2: Linking sess
> 00000000088a6dfa [0] wwn 50:01:43:80:16:77:99:3a with PLOGI ACK to wwn
> 50:01:43:80:16:77:99:3a s_id 01:09:00, ref=1 pla 000000004bdf1d76 link 0
> Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-28f9:2:
> qlt_handle_login 4772 50:01:43:80:16:77:99:3a  DS 8
> Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-28f9:2:
> qlt_handle_login 4803 50:01:43:80:16:77:99:3a post del sess
> Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-e801:2: Scheduling
> sess 00000000088a6dfa for deletion
> Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-f826:2:
> qla_target(0): Exit ELS opcode: 0x03 res 0
> Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-290a:2:
> qlt_unreg_sess sess 00000000088a6dfa for deletion 50:01:43:80:16:77:99:3a

PLOGI replaces existing and session and it's scheduled for deletion and
transitioned from ONLINE to LOST.

> Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-287d:2: FCPort
> 50:01:43:80:16:77:99:3a state transitioned from ONLINE to LOST -
> portid=010900.
> Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-28a3:2: Port login
> retry 500143801677993a, lid 0x0007 retry cnt=45.
> Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-f884:2:
> qlt_free_session_done: se_sess 000000001f2eac78 / sess 00000000088a6dfa from
> port 50:01:43:80:16:77:99:3a loop_id 0x07 s_id 01:09:00 logout 1 keep 1
> els_logo 0
> Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-f886:2:
> qlt_free_session_done: waiting for sess 00000000088a6dfa logout
> Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-2870:2: Async-logout
> - hdl=172 loop-id=7 portid=010900 50:01:43:80:16:77:99:3a.
> Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-5836:2: Async-logout
> complete - 50:01:43:80:16:77:99:3a hdl=172 portid=010900 iop0=0.
> Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-f893:2:
> qlt_logo_completion_handler: se_sess 000000001f2eac78 / sess
> 00000000088a6dfa from port 50:01:43:80:16:77:99:3a loop_id 0x07 s_id
> 01:09:00 LOGO failed: 0x0

Then some interesting things happens:
(https://www.microdevsys.com/WordPressDownloads/qla2xxx-hba.log-recent.start.end.event.txt):

Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-680b:2: isp_abort_needed=0 loop_resync_needed=0 fcport_update_needed=0 start_dpc=0 reset_marker_needed=0
Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-680c:2: beacon_blink_needed=0 isp_unrecoverable=0 fcoe_ctx_reset_needed=0 vp_dpc_needed=0 relogin_needed=1.
Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-4801:2: DPC handler waking up, dpc_flags=0x100.
Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-480d:2: Relogin scheduled.
Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-4800:2: DPC handler sleeping.

Target then tries to relogin into some ports.

Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-2908:2: qla2x00_relogin 21:01:00:1b:32:a1:81:21 DS 0 LS 7
Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-2902:2: qla24xx_handle_relogin_event 21:01:00:1b:32:a1:81:21 DS 0 LS 7 P 0 del 2 cnfl           (null) rscn 0|0 login 0|0 fl 3
Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-28d8:2: qla24xx_fcport_handle_login 21:01:00:1b:32:a1:81:21 DS 0 LS 7 P 0 fl 3 confl           (null) rscn 0|0 login 0 retry 45 lid 4096 scan 2

This is a nitpick but it seems you have both target ports in the same
zone: 21:01:00:1b:32:a1:81:21 and 21:00:00:1b:32:81:81:21. It's
recommended to use pairs Single Initiator Port / Single Target Port zones.

Typically there's not much sense to have zoning between pure target
ports.

Please see:
 https://www.brighttalk.com/webcast/14967/355259
 https://fibrechannel.org/wp-content/uploads/2019/06/FCIA-FC-Zoning-Basics-Final.pdf
 https://fibrechannel.org/fibre-channel-zoning-fundamentals-and-fundamental-questions/

[... log message related to port 38 ...]

The target considers relogin into port 3a:

Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-2908:2: qla2x00_relogin 50:01:43:80:16:77:99:3a DS 10 LS 3

DS = 10 means DSC_DELETE_PEND, LS 3 = DSC_LS_PLOGI_PEND

Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-2902:2: qla24xx_handle_relogin_event 50:01:43:80:16:77:99:3a DS 10 LS 3 P 0 del 1 cnfl           (null) rscn 0|0 login 8|9 fl 1
Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-480e:2: Relogin end.

But relogin from target side is skipped because the session is pending PLOGI:

        if ((fcport->fw_login_state == DSC_LS_PLOGI_PEND) ||
            (fcport->fw_login_state == DSC_LS_PRLI_PEND))
                return;

The session is then getting fully deleted:

Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-f887:2: qlt_free_session_done: sess 00000000088a6dfa logout completed
Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-f89a:2: se_sess           (null) / sess 00000000088a6dfa port 50:01:43:80:16:77:99:3a is gone, releasing own PLOGI (ref=1)
Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-5889:2: Sending PLOGI ACK to wwn 50:01:43:80:16:77:99:3a s_id 01:09:00 loop_id 0x07 exch 0x11223c ox_id 0xae
Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-f801:2: Unregistration of sess 00000000088a6dfa 50:01:43:80:16:77:99:3a finished fcp_cnt 4
Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-28f4:2: Async-nack 50:01:43:80:16:77:99:3a hndl 175 PLOGI
Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-28f2:2: Async done-nack res 0 50:01:43:80:16:77:99:3a  type 16
Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-5811:2: Asynchronous PORT UPDATE ignored 0007/0004/0600.
Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-f83d:2: qla_target(0): Port update async event 0x8014 occurred: updating the ports database (m[0]=8014, m[1]=7, m[2]=4, m[3]=600)
Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-f83e:2: Async MB 2: Got PLOGI Complete
Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-f83d:2: qla_target(0): Port update async event 0x8014 occurred: updating the ports database (m[0]=8014, m[1]=7, m[2]=4, m[3]=600)
Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-f83e:2: Async MB 2: Got PLOGI Complete

New PLOGI from 3a is accepted.

Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-e872:2: qlt_24xx_atio_pkt_all_vps: qla_target(0): type d ox_id 5001
Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-e82e:2: IMMED_NOTIFY ATIO
Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-f826:2: qla_target(0): Port ID: 01:09:00 ELS opcode: 0x20 lid 7 50:01:43:80:16:77:99:3a
Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-f896:2: PRLI (loop_id 0x07) for existing sess 00000000088a6dfa (loop_id 0x07)
Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-28fb:2: qlt_24xx_handle_els 4986 50:01:43:80:16:77:99:3a post nack
Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-f826:2: qla_target(0): Exit ELS opcode: 0x20 res 0
Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-f806:2: Adding sess 00000000088a6dfa se_sess 000000001f2eac78  to tgt 00000000be50a70a sess_count 6
Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-f84b:2: qla_target(0): session for wwn 50:01:43:80:16:77:99:3a (loop_id 7, s_id 1:9:0, confirmed completion not supported) added
Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-d034:2: qla24xx_do_nack_work create sess success 00000000088a6dfa
Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-28f4:2: Async-nack 50:01:43:80:16:77:99:3a hndl 176 PRLI
Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-28f2:2: Async done-nack res 0 50:01:43:80:16:77:99:3a  type 17
Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-28f3:2: qla2x00_async_nack_sp_done 606 50:01:43:80:16:77:99:3a post upd_fcport fcp_cnt 5
Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-28ef:2: qla2x00_update_fcport 50:01:43:80:16:77:99:3a
Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-287d:2: FCPort 50:01:43:80:16:77:99:3a state transitioned from LOST to ONLINE - portid=010900.

Initiator successfully performed PRLI, so FCP session from port 3a should
be established... but then the session is not added to the target.
The line provide a hint about what's going on:

Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-2100:2: qla_nvme_register_remote: Not registering target since Host NVME is not enabled

The line is printed from:

int qla_nvme_register_remote(struct scsi_qla_host *vha, struct fc_port *fcport)
{
[...]
        if (!vha->flags.nvme_enabled) {
                ql_log(ql_log_info, vha, 0x2100,
                    "%s: Not registering target since Host NVME is not enabled\n",
                    __func__);
                return 0;
        }
[...]
}

And qla2x00_update_fcport() quits before the session is added to target:

void
qla2x00_update_fcport(scsi_qla_host_t *vha, fc_port_t *fcport)
{
[...]
        qla2x00_set_fcport_state(fcport, FCS_ONLINE);
        qla2x00_iidma_fcport(vha, fcport);

        if (fcport->fc4f_nvme) {
                qla_nvme_register_remote(vha, fcport);
                return;
        }

        qla24xx_update_fcport_fcp_prio(vha, fcport);

	switch (vha->host->active_mode) {
        case MODE_INITIATOR:
                qla2x00_reg_remote_port(vha, fcport);
                break;
        case MODE_TARGET:
                if (!vha->vha_tgt.qla_tgt->tgt_stop &&
                        !vha->vha_tgt.qla_tgt->tgt_stopped)
                        qlt_fc_port_added(vha, fcport);
                break;
        case MODE_DUAL:
                qla2x00_reg_remote_port(vha, fcport);
                if (!vha->vha_tgt.qla_tgt->tgt_stop &&
                        !vha->vha_tgt.qla_tgt->tgt_stopped)
                        qlt_fc_port_added(vha, fcport);
                break;
        default:
                break;
        }
[...]
}

Initiator thinks the session is alive because PRLI was accepted but the
session doesn't exist from the target perspective and all I/O from the
initiator port is going to be discarded.

I guess NVMe support introduced in e84067d743010 ("scsi: qla2xxx: Add
FC-NVMe F/W initialization and transport registration") broke target in
v4.18

On the second target port of the same HBA (21:01:00:1b:32:a1:81:21) I
can see that target is blindly rejecting PLOGI from port 3a:

Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.1]-e872:3: qlt_24xx_atio_pkt_all_vps: qla_target(0): type d ox_id 0000
Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.1]-e82e:3: IMMED_NOTIFY ATIO
Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.1]-f826:3: qla_target(0): Port ID: 01:09:00 ELS opcode: 0x03 lid 7 50:01:43:80:16:77:99:3a
Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.1]-e81c:3: Sending TERM ELS CTIO (ha=0000000000841139)
Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.1]-f897:3: Linking sess 0000000050a6ebea [0] wwn 50:01:43:80:16:77:99:3a with PLOGI ACK to wwn 50:01:43:80:16:77:99:3a s_id 01:09:00, ref=2 pla 0000000091a7edfc link 0
Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.1]-28f9:3: qlt_handle_login 4772 50:01:43:80:16:77:99:3a  DS 3
Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.1]-28f9:3: qlt_handle_login 4803 50:01:43:80:16:77:99:3a post del sess
Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.1]-f826:3: qla_target(0): Exit ELS opcode: 0x03 res 0
Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.1]-e862:3: qla_target(0): Unexpected NOTIFY_ACK received

[...]

Mar 28 02:40:45 mbpc-pc kernel: qla2xxx [0000:04:00.1]-e872:3: qlt_24xx_atio_pkt_all_vps: qla_target(0): type d ox_id 0000
Mar 28 02:40:45 mbpc-pc kernel: qla2xxx [0000:04:00.1]-e82e:3: IMMED_NOTIFY ATIO
Mar 28 02:40:45 mbpc-pc kernel: qla2xxx [0000:04:00.1]-f826:3: qla_target(0): Port ID: 01:09:00 ELS opcode: 0x03 lid 7 50:01:43:80:16:77:99:3a
Mar 28 02:40:45 mbpc-pc kernel: qla2xxx [0000:04:00.1]-e81c:3: Sending TERM ELS CTIO (ha=0000000000841139)
Mar 28 02:40:45 mbpc-pc kernel: qla2xxx [0000:04:00.1]-f897:3: Linking sess 0000000050a6ebea [0] wwn 50:01:43:80:16:77:99:3a with PLOGI ACK to wwn 50:01:43:80:16:77:99:3a s_id 01:09:00, ref=2 pla 0000000091a7edfc link 0
Mar 28 02:40:45 mbpc-pc kernel: qla2xxx [0000:04:00.1]-28f9:3: qlt_handle_login 4772 50:01:43:80:16:77:99:3a  DS 3
Mar 28 02:40:45 mbpc-pc kernel: qla2xxx [0000:04:00.1]-28f9:3: qlt_handle_login 4803 50:01:43:80:16:77:99:3a post del sess
Mar 28 02:40:45 mbpc-pc kernel: qla2xxx [0000:04:00.1]-f826:3: qla_target(0): Exit ELS opcode: 0x03 res 0
Mar 28 02:40:45 mbpc-pc kernel: qla2xxx [0000:04:00.1]-e862:3: qla_target(0): Unexpected NOTIFY_ACK received

I couldn't find anything relevant in the provided logs about the port
but I guess it might be because of session not added to the target.

As a solution I recommend you to change kernel source on the target
from:

        if (fcport->fc4f_nvme) {
                qla_nvme_register_remote(vha, fcport);
                return;
        }

to:

        if (vha->host->active_mode != MODE_TARGET && fcport->fc4f_nvme) {
                qla_nvme_register_remote(vha, fcport);
                return;
        }


Hope that helps,
Roman

^ permalink raw reply	[flat|nested] 4+ messages in thread

* RE: QLE2432 initiator fails to see any LUN's on one of servers while using 5QLE2464 as a target.
  2021-04-10 15:27 QLE2432 initiator fails to see any LUN's on one of servers while using 5QLE2464 as a target TomK
  2021-04-14  2:11 ` Roman Bolshakov
@ 2021-04-14  8:12 ` Saurav Kashyap
  1 sibling, 0 replies; 4+ messages in thread
From: Saurav Kashyap @ 2021-04-14  8:12 UTC (permalink / raw)
  To: tomkcpr, linux-scsi

Hi Tom,
Can you also share the logs of Server1(inititator) with ql2xextended_error_logging=1? It will be great if logs are from load of the driver.

Thanks,
~Saurav

> -----Original Message-----
> From: TomK <tomkcpr@mdevsys.com>
> Sent: Saturday, April 10, 2021 8:57 PM
> To: linux-scsi@vger.kernel.org
> Subject: QLE2432 initiator fails to see any LUN's on one of servers while using
> 5QLE2464 as a target.
> 
> Hey Everyone,
> 
> In a basic SAN config using a few QLE2432 adapters connected to a target
> using a QLE2464 adapter via a EMC Brocade 5000B switch, one of the
> initiators fails to present a LUN after being online for sometime.  Over
> a period of time, few months, the initiator on one of the hosts stops
> showing available LUN's to the underlying VMware clients.
> 
> All the other hosts are fine, except for this specific one.  Unless that
> is, I reboot the target completely, affecting all the other working
> hosts in the process.
> 
> Digging a bit closer, I notice that the issue seems to strike an uncanny
> similarity to the following one:
> 
> ht tps://www.spinics.net/lists/linux-scsi/msg136622.html
> 
> However, I'm wondering why only one of the servers is affected and not
> the others?  Seems it is a card issue with the first host (please see
> image) however I'm not familiar with all the messages printed so can't
> be sure of the reason nor link things as effectively. Neither of the two
> ports of the HBA on Server 1 work, when in this disconnected state,
> despite switching SFP's on the Brocade switch, switching cables etc.
> That is, again, until I reboot the target entirely.
> 
> Initiator:
>                   50:01:43:80:16:77:99:38; 50:01:43:80:16:77:99:3a;
>                   50:01:43:80:16:77:99:3b; 50:01:43:80:16:77:99:39
> 
> Target:
> 		(see below)
> 
> This also started to happen a few months ago.  Everything was fine for a
> few years before that.
> 
> 
> Log snippet:
> 
> Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-e818:
> is_send_status=1, cmd->bufflen=73728, cmd->sg_cnt=0,
> cmd->dma_data_direction=2 se_cmd[00000000cc1dc466] qp 0
> Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-e874:2:
> qlt_free_cmd: se_cmd[00000000cc1dc466] ox_id 00e1
> Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-e872:2:
> qlt_24xx_atio_pkt_all_vps: qla_target(0): type 6 ox_id 0110
> Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-e818:
> is_send_status=1, cmd->bufflen=10240, cmd->sg_cnt=0,
> cmd->dma_data_direction=2 se_cmd[00000000cc1dc466] qp 0
> Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-e874:2:
> qlt_free_cmd: se_cmd[00000000cc1dc466] ox_id 0110
> Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-e872:2:
> qlt_24xx_atio_pkt_all_vps: qla_target(0): type d ox_id 0000
> Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-e82e:2:
> IMMED_NOTIFY ATIO
> Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-f826:2:
> qla_target(0): Port ID: 01:09:00 ELS opcode: 0x03 lid 7
> 50:01:43:80:16:77:99:3a
> Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-f897:2: Linking
> sess 00000000088a6dfa [0] wwn 50:01:43:80:16:77:99:3a with PLOGI ACK to
> wwn 50:01:43:80:16:77:99:3a s_id 01:09:00, ref=1 pla 000000004bdf1d76 link 0
> Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-28f9:2:
> qlt_handle_login 4772 50:01:43:80:16:77:99:3a  DS 8
> Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-28f9:2:
> qlt_handle_login 4803 50:01:43:80:16:77:99:3a post del sess
> Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-e801:2:
> Scheduling sess 00000000088a6dfa for deletion
> Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-f826:2:
> qla_target(0): Exit ELS opcode: 0x03 res 0
> Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-290a:2:
> qlt_unreg_sess sess 00000000088a6dfa for deletion 50:01:43:80:16:77:99:3a
> Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-287d:2: FCPort
> 50:01:43:80:16:77:99:3a state transitioned from ONLINE to LOST -
> portid=010900.
> Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-28a3:2: Port
> login retry 500143801677993a, lid 0x0007 retry cnt=45.
> Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-f884:2:
> qlt_free_session_done: se_sess 000000001f2eac78 / sess 00000000088a6dfa
> from port 50:01:43:80:16:77:99:3a loop_id 0x07 s_id 01:09:00 logout 1
> keep 1 els_logo 0
> Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-f886:2:
> qlt_free_session_done: waiting for sess 00000000088a6dfa logout
> Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-2870:2:
> Async-logout - hdl=172 loop-id=7 portid=010900 50:01:43:80:16:77:99:3a.
> Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-5836:2:
> Async-logout complete - 50:01:43:80:16:77:99:3a hdl=172 portid=010900
> iop0=0.
> Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-f893:2:
> qlt_logo_completion_handler: se_sess 000000001f2eac78 / sess
> 00000000088a6dfa from port 50:01:43:80:16:77:99:3a loop_id 0x07 s_id
> 01:09:00 LOGO failed: 0x0
> Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-e872:2:
> qlt_24xx_atio_pkt_all_vps: qla_target(0): type 6 ox_id 008a
> Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-e818:
> is_send_status=1, cmd->bufflen=4096, cmd->sg_cnt=0,
> cmd->dma_data_direction=2 se_cmd[00000000cc1dc466] qp 0
> Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-e874:2:
> qlt_free_cmd: se_cmd[00000000cc1dc466] ox_id 008a
> Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-680b:2:
> isp_abort_needed=0 loop_resync_needed=0 fcport_update_needed=0
> start_dpc=0 reset_marker_needed=0
> Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-680c:2:
> beacon_blink_needed=0 isp_unrecoverable=0 fcoe_ctx_reset_needed=0
> vp_dpc_needed=0 relogin_needed=1.
> Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-4801:2: DPC
> handler waking up, dpc_flags=0x100.
> Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-480d:2: Relogin
> scheduled.
> Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-4800:2: DPC
> handler sleeping.
> Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-2908:2:
> qla2x00_relogin 21:01:00:1b:32:a1:81:21 DS 0 LS 7
> Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-2902:2:
> qla24xx_handle_relogin_event 21:01:00:1b:32:a1:81:21 DS 0 LS 7 P 0 del 2
> cnfl           (null) rscn 0|0 login 0|0 fl 3
> Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-28d8:2:
> qla24xx_fcport_handle_login 21:01:00:1b:32:a1:81:21 DS 0 LS 7 P 0 fl 3
> confl           (null) rscn 0|0 login 0 retry 45 lid 4096 scan 2
> Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-2908:2:
> qla2x00_relogin 50:01:43:80:16:77:99:38 DS 3 LS 4
> Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-2902:2:
> qla24xx_handle_relogin_event 50:01:43:80:16:77:99:38 DS 3 LS 4 P 0 del 1
> cnfl           (null) rscn 0|0 login 4|18 fl 1
> Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-28d8:2:
> qla24xx_fcport_handle_login 50:01:43:80:16:77:99:38 DS 3 LS 4 P 0 fl 1
> confl           (null) rscn 0|0 login 18 retry 45 lid 4 scan 2
> Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-2908:2:
> qla2x00_relogin 50:01:43:80:16:77:99:3a DS 10 LS 3
> Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-2902:2:
> qla24xx_handle_relogin_event 50:01:43:80:16:77:99:3a DS 10 LS 3 P 0 del
> 1 cnfl           (null) rscn 0|0 login 8|9 fl 1
> Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-480e:2: Relogin end.
> Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-f887:2:
> qlt_free_session_done: sess 00000000088a6dfa logout completed
> Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-f89a:2: se_sess
>           (null) / sess 00000000088a6dfa port 50:01:43:80:16:77:99:3a is
> gone, releasing own PLOGI (ref=1)
> Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-5889:2: Sending
> PLOGI ACK to wwn 50:01:43:80:16:77:99:3a s_id 01:09:00 loop_id 0x07 exch
> 0x11223c ox_id 0xae
> Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-f801:2:
> Unregistration of sess 00000000088a6dfa 50:01:43:80:16:77:99:3a finished
> fcp_cnt 4
> Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-28f4:2:
> Async-nack 50:01:43:80:16:77:99:3a hndl 175 PLOGI
> Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-28f2:2: Async
> done-nack res 0 50:01:43:80:16:77:99:3a  type 16
> Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-5811:2:
> Asynchronous PORT UPDATE ignored 0007/0004/0600.
> Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-f83d:2:
> qla_target(0): Port update async event 0x8014 occurred: updating the
> ports database (m[0]=8014, m[1]=7, m[2]=4, m[3]=600)
> Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-f83e:2: Async MB
> 2: Got PLOGI Complete
> 
> 
> 
> 
> 
> Thanks,
> 
> 
> 
> 
> Seems similar to this issue:
> ht tps://www.spinics.net/lists/linux-scsi/msg136470.html
> 
> 
> Log link:
> 
> ht
> tps://www.microdevsys.com/WordPressDownloads/qla2xxx-hba.log-
> recent.start.end.event.txt
> 
> Infra Setup (image):
> 
> ht tps://www.microdevsys.com/WordPressImages/brocade-san-issue.png
> 
> 
> 
> Adapter details on the target.
> 
> # QLogic Corp. ISP2432-based 4Gb Fibre Channel to PCI Express HBA ^C
> You have mail in /var/spool/mail/root
> 
> 
> 
> [root@mbpc-pc 1]# lspci -vvv -s 03:00.0
> 03:00.0 Fibre Channel: QLogic Corp. ISP2432-based 4Gb Fibre Channel to
> PCI Express HBA (rev 03)
>           Subsystem: QLogic Corp. Device 0146
>           Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop-
> ParErr+ Stepping- SERR+ FastB2B- DisINTx-
>           Status: Cap+ 66MHz- UDF- FastB2B- ParErr- DEVSEL=fast >TAbort-
> <TAbort- <MAbort- >SERR- <PERR- INTx-
>           Latency: 0, Cache Line Size: 64 bytes
>           Interrupt: pin A routed to IRQ 18
>           Region 0: I/O ports at ee00 [size=256]
>           Region 1: Memory at fdffc000 (64-bit, non-prefetchable) [size=16K]
>           [virtual] Expansion ROM at fdf00000 [disabled] [size=256K]
>           Capabilities: [44] Power Management version 2
>                   Flags: PMEClk- DSI- D1- D2- AuxCurrent=0mA
> PME(D0-,D1-,D2-,D3hot-,D3cold-)
>                   Status: D0 NoSoftRst- PME-Enable- DSel=0 DScale=0 PME-
>           Capabilities: [4c] Express (v1) Endpoint, MSI 00
>                   DevCap: MaxPayload 1024 bytes, PhantFunc 0, Latency
> L0s <4us, L1 <1us
>                           ExtTag- AttnBtn+ AttnInd+ PwrInd+ RBE- FLReset-
>                   DevCtl: Report errors: Correctable+ Non-Fatal+ Fatal+
> Unsupported+
>                           RlxdOrd+ ExtTag- PhantFunc- AuxPwr- NoSnoop+
>                           MaxPayload 128 bytes, MaxReadReq 4096 bytes
>                   DevSta: CorrErr- UncorrErr+ FatalErr- UnsuppReq+
> AuxPwr- TransPend-
>                   LnkCap: Port #0, Speed 2.5GT/s, Width x4, ASPM L0s,
> Latency L0 <4us, L1 unlimited
>                           ClockPM- Surprise- LLActRep- BwNot-
>                   LnkCtl: ASPM Disabled; RCB 64 bytes Disabled- Retrain-
> CommClk-
>                           ExtSynch- ClockPM- AutWidDis- BWInt- AutBWInt-
>                   LnkSta: Speed 2.5GT/s, Width x4, TrErr- Train-
> SlotClk+ DLActive- BWMgmt- ABWMgmt-
>           Capabilities: [64] MSI: Enable- Count=1/16 Maskable- 64bit+
>                   Address: 0000000000000000  Data: 0000
>           Capabilities: [74] Vital Product Data
>                   Product Name: PCI-Express Quad Channel 4Gb Fibre
> Channel HBA
>                   Read-only fields:
>                           [PN] Part number: QLE2464
>                           [SN] Serial number: GFC0840A74113
>                           [V0] Vendor specific: PW=15W
>                           [MN] Manufacture ID: 50 58 32 36 31 30 34 30
> 31 2d 31 31 20 20 41
>                           [V1] Vendor specific: 06.12
>                           [V3] Vendor specific: 08.01.02
>                           [V4] Vendor specific: 03.29
>                           [V5] Vendor specific: 03.23
>                           [YA] Asset tag:
>                           [RV] Reserved: checksum good, 0 byte(s) reserved
>                   End
>           Capabilities: [7c] MSI-X: Enable- Count=16 Masked-
>                   Vector table: BAR=1 offset=00002000
>                   PBA: BAR=1 offset=00003000
>           Capabilities: [100 v1] Advanced Error Reporting
>                   UESta:  DLP- SDES- TLP- FCP- CmpltTO- CmpltAbrt-
> UnxCmplt- RxOF- MalfTLP- ECRC- UnsupReq+ ACSViol-
>                   UEMsk:  DLP- SDES- TLP- FCP- CmpltTO- CmpltAbrt-
> UnxCmplt- RxOF- MalfTLP- ECRC- UnsupReq- ACSViol-
>                   UESvrt: DLP+ SDES- TLP- FCP+ CmpltTO- CmpltAbrt-
> UnxCmplt- RxOF+ MalfTLP+ ECRC- UnsupReq- ACSViol-
>                   CESta:  RxErr- BadTLP- BadDLLP- Rollover- Timeout-
> NonFatalErr-
>                   CEMsk:  RxErr- BadTLP- BadDLLP- Rollover- Timeout-
> NonFatalErr-
>                   AERCap: First Error Pointer: 14, GenCap+ CGenEn-
> ChkCap+ ChkEn-
>           Capabilities: [138 v1] Power Budgeting <?>
>           Kernel driver in use: qla2xxx
> 
> 
> 
> Target details:
> 
> 
> [root@mbpc-pc 1]# systool -c fc_host -v
> Class = "fc_host"
> 
>     Class Device = "host0"
>     Class Device path =
> "/sys/devices/pci0000:00/0000:00:02.0/0000:01:00.0/0000:02:00.0/0000:03:00.
> 0/host0/fc_host/host0"
>       dev_loss_tmo        = "45"
>       fabric_name         = "0xffffffffffffffff"
>       issue_lip           = <store method only>
>       max_npiv_vports     = "127"
>       node_name           = "0x2002001b32c18121"
>       npiv_vports_inuse   = "0"
>       port_id             = "0x000000"
>       port_name           = "0x2102001b32c18121"
>       port_state          = "Linkdown"
>       port_type           = "Unknown"
>       speed               = "unknown"
>       supported_classes   = "Class 3"
>       supported_speeds    = "1 Gbit, 2 Gbit, 4 Gbit"
>       symbolic_name       = "QLE2464 FW:v8.06.02 DVR:v10.00.00.07-k-debug"
>       system_hostname     = ""
>       tgtid_bind_type     = "wwpn (World Wide Port Name)"
>       uevent              =
>       vport_create        = <store method only>
>       vport_delete        = <store method only>
> 
>       Device = "host0"
>       Device path =
> "/sys/devices/pci0000:00/0000:00:02.0/0000:01:00.0/0000:02:00.0/0000:03:00.
> 0/host0"
>         fw_dump             =
>         issue_logo          = <store method only>
>         nvram               = "ISP "
>         optrom_ctl          = <store method only>
>         optrom              =
>         reset               = <store method only>
>         sfp                 = ""
>         uevent              = "DEVTYPE=scsi_host"
>         vpd                 = "▒."
> 
> 
>     Class Device = "host1"
>     Class Device path =
> "/sys/devices/pci0000:00/0000:00:02.0/0000:01:00.0/0000:02:00.0/0000:03:00.
> 1/host1/fc_host/host1"
>       dev_loss_tmo        = "45"
>       fabric_name         = "0xffffffffffffffff"
>       issue_lip           = <store method only>
>       max_npiv_vports     = "127"
>       node_name           = "0x2003001b32e18121"
>       npiv_vports_inuse   = "0"
>       port_id             = "0x000000"
>       port_name           = "0x2103001b32e18121"
>       port_state          = "Linkdown"
>       port_type           = "Unknown"
>       speed               = "unknown"
>       supported_classes   = "Class 3"
>       supported_speeds    = "1 Gbit, 2 Gbit, 4 Gbit"
>       symbolic_name       = "QLE2464 FW:v8.06.02 DVR:v10.00.00.07-k-debug"
>       system_hostname     = ""
>       tgtid_bind_type     = "wwpn (World Wide Port Name)"
>       uevent              =
>       vport_create        = <store method only>
>       vport_delete        = <store method only>
> 
>       Device = "host1"
>       Device path =
> "/sys/devices/pci0000:00/0000:00:02.0/0000:01:00.0/0000:02:00.0/0000:03:00.
> 1/host1"
>         fw_dump             =
>         issue_logo          = <store method only>
>         nvram               = "ISP "
>         optrom_ctl          = <store method only>
>         optrom              =
>         reset               = <store method only>
>         sfp                 = ""
>         uevent              = "DEVTYPE=scsi_host"
>         vpd                 = "▒."
> 
> 
>     Class Device = "host2"
>     Class Device path =
> "/sys/devices/pci0000:00/0000:00:02.0/0000:01:00.0/0000:02:01.0/0000:04:00.
> 0/host2/fc_host/host2"
>       dev_loss_tmo        = "45"
>       fabric_name         = "0x100000051e903dab"
>       issue_lip           = <store method only>
>       max_npiv_vports     = "127"
>       node_name           = "0x2000001b32818121"
>       npiv_vports_inuse   = "0"
>       port_id             = "0x011300"
>       port_name           = "0x2100001b32818121"
>       port_state          = "Online"
>       port_type           = "NPort (fabric via point-to-point)"
>       speed               = "4 Gbit"
>       supported_classes   = "Class 3"
>       supported_speeds    = "1 Gbit, 2 Gbit, 4 Gbit"
>       symbolic_name       = "QLE2464 FW:v8.06.02 DVR:v10.00.00.07-k-debug"
>       system_hostname     = ""
>       tgtid_bind_type     = "wwpn (World Wide Port Name)"
>       uevent              =
>       vport_create        = <store method only>
>       vport_delete        = <store method only>
> 
>       Device = "host2"
>       Device path =
> "/sys/devices/pci0000:00/0000:00:02.0/0000:01:00.0/0000:02:01.0/0000:04:00.
> 0/host2"
>         fw_dump             =
>         issue_logo          = <store method only>
>         nvram               = "ISP "
>         optrom_ctl          = <store method only>
>         optrom              =
>         reset               = <store method only>
>         sfp                 = ""
>         uevent              = "DEVTYPE=scsi_host"
>         vpd                 = "▒."
> 
> 
>     Class Device = "host3"
>     Class Device path =
> "/sys/devices/pci0000:00/0000:00:02.0/0000:01:00.0/0000:02:01.0/0000:04:00.
> 1/host3/fc_host/host3"
>       dev_loss_tmo        = "45"
>       fabric_name         = "0x100000051e903dab"
>       issue_lip           = <store method only>
>       max_npiv_vports     = "127"
>       node_name           = "0x2001001b32a18121"
>       npiv_vports_inuse   = "0"
>       port_id             = "0x011700"
>       port_name           = "0x2101001b32a18121"
>       port_state          = "Online"
>       port_type           = "NPort (fabric via point-to-point)"
>       speed               = "4 Gbit"
>       supported_classes   = "Class 3"
>       supported_speeds    = "1 Gbit, 2 Gbit, 4 Gbit"
>       symbolic_name       = "QLE2464 FW:v8.06.02 DVR:v10.00.00.07-k-debug"
>       system_hostname     = ""
>       tgtid_bind_type     = "wwpn (World Wide Port Name)"
>       uevent              =
>       vport_create        = <store method only>
>       vport_delete        = <store method only>
> 
>       Device = "host3"
>       Device path =
> "/sys/devices/pci0000:00/0000:00:02.0/0000:01:00.0/0000:02:01.0/0000:04:00.
> 1/host3"
>         fw_dump             =
>         issue_logo          = <store method only>
>         nvram               = "ISP "
>         optrom_ctl          = <store method only>
>         optrom              =
>         reset               = <store method only>
>         sfp                 = ""
>         uevent              = "DEVTYPE=scsi_host"
>         vpd                 = "▒."
> 
> 
> [root@mbpc-pc 1]#
> 
> 
> 
> 
> --
> Thx,
> TK.


^ permalink raw reply	[flat|nested] 4+ messages in thread

* QLE2432 initiator fails to see any LUN's on one of servers while using 5QLE2464 as a target.
@ 2021-03-28 21:54 TomK
  0 siblings, 0 replies; 4+ messages in thread
From: TomK @ 2021-03-28 21:54 UTC (permalink / raw)
  To: linux-scsi

[-- Attachment #1: Type: text/plain, Size: 18005 bytes --]

Hey Everyone,


In a basic SAN config using a few QLE2432 adapters connected to a target 
using a QLE2464 adapter via a EMC Brocade 5000B switch, one of the 
initiators fails to present a LUN after being online for sometime.  Over 
a period of time, few months, the initiator on one of the hosts stops 
showing available LUN's to the underlying VMware clients.

All the other hosts are fine, except for this specific one.  Unless that 
is, I reboot the target completely, affecting all the other working 
hosts in the process.

Digging a bit closer, I notice that the issue seems to strike an uncanny 
similarity to the following one:

https://www.spinics.net/lists/linux-scsi/msg136622.html

However, I'm wondering why only one of the servers is affected and not 
the others?  Seems it is a card issue with the first host (please see 
image) however I'm not familiar with all the messages printed so can't 
be sure of the reason nor link things as effectively. Neither of the two 
ports of the HBA on Server 1 work, when in this disconnected state, 
despite switching SFP's on the Brocade switch, switching cables etc. 
That is, again, until I reboot the target entirely.

Initiator:
                 50:01:43:80:16:77:99:38; 50:01:43:80:16:77:99:3a;
                 50:01:43:80:16:77:99:3b; 50:01:43:80:16:77:99:39

Target:
		(see below)

This also started to happen a few months ago.  Everything was fine for a 
few years before that.


Log snippet:

Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-e818: 
is_send_status=1, cmd->bufflen=73728, cmd->sg_cnt=0, 
cmd->dma_data_direction=2 se_cmd[00000000cc1dc466] qp 0
Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-e874:2: 
qlt_free_cmd: se_cmd[00000000cc1dc466] ox_id 00e1
Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-e872:2: 
qlt_24xx_atio_pkt_all_vps: qla_target(0): type 6 ox_id 0110
Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-e818: 
is_send_status=1, cmd->bufflen=10240, cmd->sg_cnt=0, 
cmd->dma_data_direction=2 se_cmd[00000000cc1dc466] qp 0
Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-e874:2: 
qlt_free_cmd: se_cmd[00000000cc1dc466] ox_id 0110
Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-e872:2: 
qlt_24xx_atio_pkt_all_vps: qla_target(0): type d ox_id 0000
Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-e82e:2: 
IMMED_NOTIFY ATIO
Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-f826:2: 
qla_target(0): Port ID: 01:09:00 ELS opcode: 0x03 lid 7 
50:01:43:80:16:77:99:3a
Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-f897:2: Linking 
sess 00000000088a6dfa [0] wwn 50:01:43:80:16:77:99:3a with PLOGI ACK to 
wwn 50:01:43:80:16:77:99:3a s_id 01:09:00, ref=1 pla 000000004bdf1d76 link 0
Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-28f9:2: 
qlt_handle_login 4772 50:01:43:80:16:77:99:3a  DS 8
Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-28f9:2: 
qlt_handle_login 4803 50:01:43:80:16:77:99:3a post del sess
Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-e801:2: 
Scheduling sess 00000000088a6dfa for deletion
Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-f826:2: 
qla_target(0): Exit ELS opcode: 0x03 res 0
Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-290a:2: 
qlt_unreg_sess sess 00000000088a6dfa for deletion 50:01:43:80:16:77:99:3a
Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-287d:2: FCPort 
50:01:43:80:16:77:99:3a state transitioned from ONLINE to LOST - 
portid=010900.
Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-28a3:2: Port 
login retry 500143801677993a, lid 0x0007 retry cnt=45.
Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-f884:2: 
qlt_free_session_done: se_sess 000000001f2eac78 / sess 00000000088a6dfa 
from port 50:01:43:80:16:77:99:3a loop_id 0x07 s_id 01:09:00 logout 1 
keep 1 els_logo 0
Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-f886:2: 
qlt_free_session_done: waiting for sess 00000000088a6dfa logout
Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-2870:2: 
Async-logout - hdl=172 loop-id=7 portid=010900 50:01:43:80:16:77:99:3a.
Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-5836:2: 
Async-logout complete - 50:01:43:80:16:77:99:3a hdl=172 portid=010900 
iop0=0.
Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-f893:2: 
qlt_logo_completion_handler: se_sess 000000001f2eac78 / sess 
00000000088a6dfa from port 50:01:43:80:16:77:99:3a loop_id 0x07 s_id 
01:09:00 LOGO failed: 0x0
Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-e872:2: 
qlt_24xx_atio_pkt_all_vps: qla_target(0): type 6 ox_id 008a
Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-e818: 
is_send_status=1, cmd->bufflen=4096, cmd->sg_cnt=0, 
cmd->dma_data_direction=2 se_cmd[00000000cc1dc466] qp 0
Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-e874:2: 
qlt_free_cmd: se_cmd[00000000cc1dc466] ox_id 008a
Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-680b:2: 
isp_abort_needed=0 loop_resync_needed=0 fcport_update_needed=0 
start_dpc=0 reset_marker_needed=0
Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-680c:2: 
beacon_blink_needed=0 isp_unrecoverable=0 fcoe_ctx_reset_needed=0 
vp_dpc_needed=0 relogin_needed=1.
Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-4801:2: DPC 
handler waking up, dpc_flags=0x100.
Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-480d:2: Relogin 
scheduled.
Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-4800:2: DPC 
handler sleeping.
Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-2908:2: 
qla2x00_relogin 21:01:00:1b:32:a1:81:21 DS 0 LS 7
Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-2902:2: 
qla24xx_handle_relogin_event 21:01:00:1b:32:a1:81:21 DS 0 LS 7 P 0 del 2 
cnfl           (null) rscn 0|0 login 0|0 fl 3
Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-28d8:2: 
qla24xx_fcport_handle_login 21:01:00:1b:32:a1:81:21 DS 0 LS 7 P 0 fl 3 
confl           (null) rscn 0|0 login 0 retry 45 lid 4096 scan 2
Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-2908:2: 
qla2x00_relogin 50:01:43:80:16:77:99:38 DS 3 LS 4
Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-2902:2: 
qla24xx_handle_relogin_event 50:01:43:80:16:77:99:38 DS 3 LS 4 P 0 del 1 
cnfl           (null) rscn 0|0 login 4|18 fl 1
Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-28d8:2: 
qla24xx_fcport_handle_login 50:01:43:80:16:77:99:38 DS 3 LS 4 P 0 fl 1 
confl           (null) rscn 0|0 login 18 retry 45 lid 4 scan 2
Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-2908:2: 
qla2x00_relogin 50:01:43:80:16:77:99:3a DS 10 LS 3
Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-2902:2: 
qla24xx_handle_relogin_event 50:01:43:80:16:77:99:3a DS 10 LS 3 P 0 del 
1 cnfl           (null) rscn 0|0 login 8|9 fl 1
Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-480e:2: Relogin end.
Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-f887:2: 
qlt_free_session_done: sess 00000000088a6dfa logout completed
Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-f89a:2: se_sess 
          (null) / sess 00000000088a6dfa port 50:01:43:80:16:77:99:3a is 
gone, releasing own PLOGI (ref=1)
Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-5889:2: Sending 
PLOGI ACK to wwn 50:01:43:80:16:77:99:3a s_id 01:09:00 loop_id 0x07 exch 
0x11223c ox_id 0xae
Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-f801:2: 
Unregistration of sess 00000000088a6dfa 50:01:43:80:16:77:99:3a finished 
fcp_cnt 4
Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-28f4:2: 
Async-nack 50:01:43:80:16:77:99:3a hndl 175 PLOGI
Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-28f2:2: Async 
done-nack res 0 50:01:43:80:16:77:99:3a  type 16
Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-5811:2: 
Asynchronous PORT UPDATE ignored 0007/0004/0600.
Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-f83d:2: 
qla_target(0): Port update async event 0x8014 occurred: updating the 
ports database (m[0]=8014, m[1]=7, m[2]=4, m[3]=600)
Mar 28 02:40:24 mbpc-pc kernel: qla2xxx [0000:04:00.0]-f83e:2: Async MB 
2: Got PLOGI Complete





Thanks,




Seems similar to this issue:
https://www.spinics.net/lists/linux-scsi/msg136470.html


Log link:

https://www.microdevsys.com/WordPressDownloads/qla2xxx-hba.log-recent.start.end.event.txt


Adapter details on the target.

# QLogic Corp. ISP2432-based 4Gb Fibre Channel to PCI Express HBA ^C
You have mail in /var/spool/mail/root
[root@mbpc-pc 1]# lspci -vvv -s 03:00.0
03:00.0 Fibre Channel: QLogic Corp. ISP2432-based 4Gb Fibre Channel to 
PCI Express HBA (rev 03)
         Subsystem: QLogic Corp. Device 0146
         Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- 
ParErr+ Stepping- SERR+ FastB2B- DisINTx-
         Status: Cap+ 66MHz- UDF- FastB2B- ParErr- DEVSEL=fast >TAbort- 
<TAbort- <MAbort- >SERR- <PERR- INTx-
         Latency: 0, Cache Line Size: 64 bytes
         Interrupt: pin A routed to IRQ 18
         Region 0: I/O ports at ee00 [size=256]
         Region 1: Memory at fdffc000 (64-bit, non-prefetchable) [size=16K]
         [virtual] Expansion ROM at fdf00000 [disabled] [size=256K]
         Capabilities: [44] Power Management version 2
                 Flags: PMEClk- DSI- D1- D2- AuxCurrent=0mA 
PME(D0-,D1-,D2-,D3hot-,D3cold-)
                 Status: D0 NoSoftRst- PME-Enable- DSel=0 DScale=0 PME-
         Capabilities: [4c] Express (v1) Endpoint, MSI 00
                 DevCap: MaxPayload 1024 bytes, PhantFunc 0, Latency L0s 
<4us, L1 <1us
                         ExtTag- AttnBtn+ AttnInd+ PwrInd+ RBE- FLReset-
                 DevCtl: Report errors: Correctable+ Non-Fatal+ Fatal+ 
Unsupported+
                         RlxdOrd+ ExtTag- PhantFunc- AuxPwr- NoSnoop+
                         MaxPayload 128 bytes, MaxReadReq 4096 bytes
                 DevSta: CorrErr- UncorrErr+ FatalErr- UnsuppReq+ 
AuxPwr- TransPend-
                 LnkCap: Port #0, Speed 2.5GT/s, Width x4, ASPM L0s, 
Latency L0 <4us, L1 unlimited
                         ClockPM- Surprise- LLActRep- BwNot-
                 LnkCtl: ASPM Disabled; RCB 64 bytes Disabled- Retrain- 
CommClk-
                         ExtSynch- ClockPM- AutWidDis- BWInt- AutBWInt-
                 LnkSta: Speed 2.5GT/s, Width x4, TrErr- Train- SlotClk+ 
DLActive- BWMgmt- ABWMgmt-
         Capabilities: [64] MSI: Enable- Count=1/16 Maskable- 64bit+
                 Address: 0000000000000000  Data: 0000
         Capabilities: [74] Vital Product Data
                 Product Name: PCI-Express Quad Channel 4Gb Fibre 
Channel HBA
                 Read-only fields:
                         [PN] Part number: QLE2464
                         [SN] Serial number: GFC0840A74113
                         [V0] Vendor specific: PW=15W
                         [MN] Manufacture ID: 50 58 32 36 31 30 34 30 31 
2d 31 31 20 20 41
                         [V1] Vendor specific: 06.12
                         [V3] Vendor specific: 08.01.02
                         [V4] Vendor specific: 03.29
                         [V5] Vendor specific: 03.23
                         [YA] Asset tag:
                         [RV] Reserved: checksum good, 0 byte(s) reserved
                 End
         Capabilities: [7c] MSI-X: Enable- Count=16 Masked-
                 Vector table: BAR=1 offset=00002000
                 PBA: BAR=1 offset=00003000
         Capabilities: [100 v1] Advanced Error Reporting
                 UESta:  DLP- SDES- TLP- FCP- CmpltTO- CmpltAbrt- 
UnxCmplt- RxOF- MalfTLP- ECRC- UnsupReq+ ACSViol-
                 UEMsk:  DLP- SDES- TLP- FCP- CmpltTO- CmpltAbrt- 
UnxCmplt- RxOF- MalfTLP- ECRC- UnsupReq- ACSViol-
                 UESvrt: DLP+ SDES- TLP- FCP+ CmpltTO- CmpltAbrt- 
UnxCmplt- RxOF+ MalfTLP+ ECRC- UnsupReq- ACSViol-
                 CESta:  RxErr- BadTLP- BadDLLP- Rollover- Timeout- 
NonFatalErr-
                 CEMsk:  RxErr- BadTLP- BadDLLP- Rollover- Timeout- 
NonFatalErr-
                 AERCap: First Error Pointer: 14, GenCap+ CGenEn- 
ChkCap+ ChkEn-
         Capabilities: [138 v1] Power Budgeting <?>
         Kernel driver in use: qla2xxx



Target details:


[root@mbpc-pc 1]# systool -c fc_host -v
Class = "fc_host"

   Class Device = "host0"
   Class Device path = 
"/sys/devices/pci0000:00/0000:00:02.0/0000:01:00.0/0000:02:00.0/0000:03:00.0/host0/fc_host/host0"
     dev_loss_tmo        = "45"
     fabric_name         = "0xffffffffffffffff"
     issue_lip           = <store method only>
     max_npiv_vports     = "127"
     node_name           = "0x2002001b32c18121"
     npiv_vports_inuse   = "0"
     port_id             = "0x000000"
     port_name           = "0x2102001b32c18121"
     port_state          = "Linkdown"
     port_type           = "Unknown"
     speed               = "unknown"
     supported_classes   = "Class 3"
     supported_speeds    = "1 Gbit, 2 Gbit, 4 Gbit"
     symbolic_name       = "QLE2464 FW:v8.06.02 DVR:v10.00.00.07-k-debug"
     system_hostname     = ""
     tgtid_bind_type     = "wwpn (World Wide Port Name)"
     uevent              =
     vport_create        = <store method only>
     vport_delete        = <store method only>

     Device = "host0"
     Device path = 
"/sys/devices/pci0000:00/0000:00:02.0/0000:01:00.0/0000:02:00.0/0000:03:00.0/host0"
       fw_dump             =
       issue_logo          = <store method only>
       nvram               = "ISP "
       optrom_ctl          = <store method only>
       optrom              =
       reset               = <store method only>
       sfp                 = ""
       uevent              = "DEVTYPE=scsi_host"
       vpd                 = "▒."


   Class Device = "host1"
   Class Device path = 
"/sys/devices/pci0000:00/0000:00:02.0/0000:01:00.0/0000:02:00.0/0000:03:00.1/host1/fc_host/host1"
     dev_loss_tmo        = "45"
     fabric_name         = "0xffffffffffffffff"
     issue_lip           = <store method only>
     max_npiv_vports     = "127"
     node_name           = "0x2003001b32e18121"
     npiv_vports_inuse   = "0"
     port_id             = "0x000000"
     port_name           = "0x2103001b32e18121"
     port_state          = "Linkdown"
     port_type           = "Unknown"
     speed               = "unknown"
     supported_classes   = "Class 3"
     supported_speeds    = "1 Gbit, 2 Gbit, 4 Gbit"
     symbolic_name       = "QLE2464 FW:v8.06.02 DVR:v10.00.00.07-k-debug"
     system_hostname     = ""
     tgtid_bind_type     = "wwpn (World Wide Port Name)"
     uevent              =
     vport_create        = <store method only>
     vport_delete        = <store method only>

     Device = "host1"
     Device path = 
"/sys/devices/pci0000:00/0000:00:02.0/0000:01:00.0/0000:02:00.0/0000:03:00.1/host1"
       fw_dump             =
       issue_logo          = <store method only>
       nvram               = "ISP "
       optrom_ctl          = <store method only>
       optrom              =
       reset               = <store method only>
       sfp                 = ""
       uevent              = "DEVTYPE=scsi_host"
       vpd                 = "▒."


   Class Device = "host2"
   Class Device path = 
"/sys/devices/pci0000:00/0000:00:02.0/0000:01:00.0/0000:02:01.0/0000:04:00.0/host2/fc_host/host2"
     dev_loss_tmo        = "45"
     fabric_name         = "0x100000051e903dab"
     issue_lip           = <store method only>
     max_npiv_vports     = "127"
     node_name           = "0x2000001b32818121"
     npiv_vports_inuse   = "0"
     port_id             = "0x011300"
     port_name           = "0x2100001b32818121"
     port_state          = "Online"
     port_type           = "NPort (fabric via point-to-point)"
     speed               = "4 Gbit"
     supported_classes   = "Class 3"
     supported_speeds    = "1 Gbit, 2 Gbit, 4 Gbit"
     symbolic_name       = "QLE2464 FW:v8.06.02 DVR:v10.00.00.07-k-debug"
     system_hostname     = ""
     tgtid_bind_type     = "wwpn (World Wide Port Name)"
     uevent              =
     vport_create        = <store method only>
     vport_delete        = <store method only>

     Device = "host2"
     Device path = 
"/sys/devices/pci0000:00/0000:00:02.0/0000:01:00.0/0000:02:01.0/0000:04:00.0/host2"
       fw_dump             =
       issue_logo          = <store method only>
       nvram               = "ISP "
       optrom_ctl          = <store method only>
       optrom              =
       reset               = <store method only>
       sfp                 = ""
       uevent              = "DEVTYPE=scsi_host"
       vpd                 = "▒."


   Class Device = "host3"
   Class Device path = 
"/sys/devices/pci0000:00/0000:00:02.0/0000:01:00.0/0000:02:01.0/0000:04:00.1/host3/fc_host/host3"
     dev_loss_tmo        = "45"
     fabric_name         = "0x100000051e903dab"
     issue_lip           = <store method only>
     max_npiv_vports     = "127"
     node_name           = "0x2001001b32a18121"
     npiv_vports_inuse   = "0"
     port_id             = "0x011700"
     port_name           = "0x2101001b32a18121"
     port_state          = "Online"
     port_type           = "NPort (fabric via point-to-point)"
     speed               = "4 Gbit"
     supported_classes   = "Class 3"
     supported_speeds    = "1 Gbit, 2 Gbit, 4 Gbit"
     symbolic_name       = "QLE2464 FW:v8.06.02 DVR:v10.00.00.07-k-debug"
     system_hostname     = ""
     tgtid_bind_type     = "wwpn (World Wide Port Name)"
     uevent              =
     vport_create        = <store method only>
     vport_delete        = <store method only>

     Device = "host3"
     Device path = 
"/sys/devices/pci0000:00/0000:00:02.0/0000:01:00.0/0000:02:01.0/0000:04:00.1/host3"
       fw_dump             =
       issue_logo          = <store method only>
       nvram               = "ISP "
       optrom_ctl          = <store method only>
       optrom              =
       reset               = <store method only>
       sfp                 = ""
       uevent              = "DEVTYPE=scsi_host"
       vpd                 = "▒."


[root@mbpc-pc 1]#




-- 
Thx,
TK.

[-- Attachment #2: brocade-san-issue.png --]
[-- Type: image/png, Size: 43367 bytes --]

^ permalink raw reply	[flat|nested] 4+ messages in thread

end of thread, other threads:[~2021-04-14  8:13 UTC | newest]

Thread overview: 4+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2021-04-10 15:27 QLE2432 initiator fails to see any LUN's on one of servers while using 5QLE2464 as a target TomK
2021-04-14  2:11 ` Roman Bolshakov
2021-04-14  8:12 ` Saurav Kashyap
  -- strict thread matches above, loose matches on Subject: below --
2021-03-28 21:54 TomK

This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.