* A path to a different device was added to an existing mapth
@ 2015-03-19 10:10 Ilan Steinberg
  2015-03-19 10:35 ` Ilan Steinberg
  0 siblings, 1 reply; 4+ messages in thread
From: Ilan Steinberg @ 2015-03-19 10:10 UTC (permalink / raw)
  To: dm-devel; +Cc: Guy Keren, Eli Malul, Shahar Salzman



Hello,


I have never posted here before, but I came across an interesting error.


I have run into a situation where a path (sdag) to LUN 15, whose SCSI serial number ends with 0xd5190010, was added to an mpath device (mpathiw) that belongs to LUN 11, whose SCSI serial number ends with 0xd5190026 (paths sdz and sdw).


I would appreciate any help understanding how/why this happened.


Initiator Info:

Linux il-ksm3-044 2.6.32-431.el6.x86_64 #1

device-mapper-multipath version 0.4.9


multipath.conf:

defaults {
        user_friendly_names     yes
        polling_interval        10
        verbosity               2
}
devices {
        device {
                vendor                  "KMNRIO"
                product                 "K2"
                path_grouping_policy    multibus
                getuid_callout          "/lib/udev/scsi_id --whitelisted --device=/dev/%n"
                path_checker            tur
                path_selector           "queue-length 0"
                no_path_retry           fail
                hardware_handler        "0"
                rr_weight               priorities
                rr_min_io               1
                failback                15
                fast_io_fail_tmo        5
                dev_loss_tmo            8
        }
}


I see that sdz and sdw used to belong to mpathiv (was it renamed to mpathiw?).

This might be useful info - in the bindings file I see:

...

mpathiv 20024f400d5190010    <-- this is the scsi_sn of sdag

mpathiw 20024f400d5190026    <-- this is the scsi_sn of sdz/w

...


I am adding the relevant host logs (I can add the full logs if needed). The error is in the last log line:

Mar 14 09:44:36 il-ksm3-044 kernel: sdag: detected capacity change from 0 to 322122547200

Mar 14 09:59:35 il-ksm3-044 kernel: sd 8:0:1:15: [sdag] Warning! Received an indication that the LUN reached a thin provisioning soft threshold.

Mar 14 10:00:06 il-ksm3-044 kernel: sd 8:0:1:15: [sdag] Warning! Received an indication that the LUN reached a thin provisioning soft threshold.

Mar 14 10:00:21 il-ksm3-044 kernel: sd 8:0:1:15: [sdag] Warning! Received an indication that the LUN reached a thin provisioning soft threshold.

Mar 14 10:26:31 il-ksm3-044 kernel: end_request: I/O error, dev sdz, sector 167369280
Mar 14 10:26:31 il-ksm3-044 kernel: end_request: I/O error, dev sdz, sector 80032448

Mar 14 10:26:44 il-ksm3-044 multipathd: sdz: remove path (uevent)
Mar 14 10:26:44 il-ksm3-044 multipathd: sdz [65:144]: path removed from map mpathiv
Mar 14 10:26:44 il-ksm3-044 multipathd: sdag: remove path (uevent)
Mar 14 10:26:44 il-ksm3-044 multipathd: sdag [66:0]: path removed from map mpathks

Mar 14 10:26:44 il-ksm3-044 multipathd: mpathiv: load table [0 209715200 multipath 0 0 1 1 queue-length 0 1 1 65:96 1]
Mar 14 10:26:44 il-ksm3-044 multipathd: mpathks: load table [0 629145600 multipath 0 0 1 1 queue-length 0 1 1 65:240 1]

Mar 14 10:53:50 il-ksm3-044 kernel: sd 8:0:1:11: [sdz] 209715200 512-byte logical blocks: (107 GB/100 GiB)
Mar 14 10:53:50 il-ksm3-044 kernel: sd 8:0:1:11: [sdz] Write Protect is off
Mar 14 10:53:50 il-ksm3-044 kernel: sd 8:0:1:11: [sdz] Write cache: disabled, read cache: enabled, supports DPO and FUA
Mar 14 10:53:50 il-ksm3-044 kernel: sdz:
Mar 14 10:53:50 il-ksm3-044 kernel: sd 8:0:1:11: [sdz] Attached SCSI disk
Mar 14 10:53:50 il-ksm3-044 kernel: sd 8:0:1:15: [sdag] 629145600 512-byte logical blocks: (322 GB/300 GiB)
Mar 14 10:53:50 il-ksm3-044 kernel: sd 8:0:1:15: [sdag] Write Protect is off
Mar 14 10:53:50 il-ksm3-044 kernel: sd 8:0:1:15: [sdag] Write cache: disabled, read cache: enabled, supports DPO and FUA
Mar 14 10:53:50 il-ksm3-044 kernel: sdag: unknown partition table
Mar 14 10:53:50 il-ksm3-044 kernel: sd 8:0:1:15: [sdag] Attached SCSI disk
Mar 14 10:53:50 il-ksm3-044 multipathd: sdag: add path (uevent)
Mar 14 10:53:50 il-ksm3-044 multipathd: mpathks: failed in domap for addition of new path sdag
Mar 14 10:53:50 il-ksm3-044 multipathd: sdz: add path (uevent)
Mar 14 10:53:50 il-ksm3-044 multipathd: sdz [65:144]: path added to devmap mpathiv

Mar 14 10:53:50 il-ksm3-044 multipathd: mpathiv: load table [0 209715200 multipath 0 0 1 1 queue-length 0 2 1 65:96 1 65:144 1]

Mar 14 12:29:12 il-ksm3-044 multipathd: sdw: remove path (uevent)
Mar 14 12:29:12 il-ksm3-044 multipathd: mpathiv: load table [0 209715200 multipath 0 0 1 1 queue-length 0 1 1 65:144 1]
Mar 14 12:29:12 il-ksm3-044 multipathd: sdw [65:96]: path removed from map mpathiv

Mar 15 12:45:59 il-ksm3-044 kernel: sd 8:0:1:11: [sdz] Warning! Received an indication that the LUN assignments on this target have changed. The Linux SCSI layer does not automatically remap LUN assignments.

Mar 14 12:59:18 il-ksm3-044 kernel: sd 7:0:1:11: [sdw] 209715200 512-byte logical blocks: (107 GB/100 GiB)
Mar 14 12:59:18 il-ksm3-044 kernel: sd 7:0:1:11: [sdw] Write Protect is off
Mar 14 12:59:18 il-ksm3-044 kernel: sd 7:0:1:11: [sdw] Write cache: disabled, read cache: enabled, supports DPO and FUA
Mar 14 12:59:18 il-ksm3-044 kernel: sd 7:0:1:11: [sdw] Attached SCSI disk

Mar 14 12:59:18 il-ksm3-044 multipathd: sdw: add path (uevent)
Mar 14 12:59:18 il-ksm3-044 multipathd: mpathiv: load table [0 209715200 multipath 0 0 1 1 queue-length 0 2 1 65:144 1 65:96 1]
Mar 14 12:59:18 il-ksm3-044 multipathd: sdw [65:96]: path added to devmap mpathiv

Mar 15 11:45:27 il-ksm3-044 kernel: sdz: detected capacity change from 0 to 107374182400

Mar 15 11:45:33 il-ksm3-044 multipathd: mpathiw: event checker started
Mar 15 11:45:33 il-ksm3-044 multipathd: mpathiw: devmap dm-13 added

Mar 15 12:28:12 il-ksm3-044 multipathd: mpathiw: event checker exit

Mar 15 12:28:15 il-ksm3-044 multipathd: mpathiw: event checker started
Mar 15 12:28:15 il-ksm3-044 multipathd: mpathiw: devmap dm-13 added

Mar 15 12:56:35 il-ksm3-044 kernel: end_request: I/O error, dev sdz, sector 204563968
Mar 15 12:56:35 il-ksm3-044 kernel: end_request: I/O error, dev sdz, sector 193484032
Mar 15 12:56:35 il-ksm3-044 multipathd: mpathiw: remaining active paths: 1

Mar 15 12:56:48 il-ksm3-044 multipathd: sdz: remove path (uevent)
Mar 15 12:56:48 il-ksm3-044 multipathd: mpathiw: load table [0 209715200 multipath 0 0 1 1 queue-length 0 1 1 65:96 1]
Mar 15 12:56:48 il-ksm3-044 multipathd: sdz [65:144]: path removed from map mpathiw
Mar 15 12:56:48 il-ksm3-044 multipathd: sdag: remove path (uevent)

Mar 15 13:27:00 il-ksm3-044 kernel: sd 8:0:1:11: [sdz] 209715200 512-byte logical blocks: (107 GB/100 GiB)
Mar 15 13:27:00 il-ksm3-044 kernel: sd 8:0:1:11: [sdz] Write Protect is off
Mar 15 13:27:00 il-ksm3-044 kernel: sd 8:0:1:11: [sdz] Write cache: disabled, read cache: enabled, supports DPO and FUA
Mar 15 13:27:00 il-ksm3-044 kernel: sdz: unknown partition table
Mar 15 13:27:00 il-ksm3-044 kernel: sd 8:0:1:11: [sdz] Attached SCSI disk
Mar 15 13:27:00 il-ksm3-044 multipathd: sdz: add path (uevent)
Mar 15 13:27:00 il-ksm3-044 multipathd: mpathiw: failed in domap for addition of new path sdz

Mar 15 15:20:41 il-ksm3-044 multipathd: mpathiw: devmap dm-13 added

Mar 15 17:38:36 il-ksm3-044 kernel: sd 8:0:1:11: [sdz] Warning! Received an indication that the LUN assignments on this target have changed. The Linux SCSI layer does not automatically remap LUN assignments.

Mar 15 18:44:54 il-ksm3-044 kernel: sd 8:0:1:15: [sdag] 629145600 512-byte logical blocks: (322 GB/300 GiB)
Mar 15 18:44:54 il-ksm3-044 kernel: sd 8:0:1:15: [sdag] Write Protect is off
Mar 15 18:44:54 il-ksm3-044 kernel: sd 8:0:1:15: [sdag] Write cache: disabled, read cache: enabled, supports DPO and FUA
Mar 15 18:44:54 il-ksm3-044 kernel: sdag: unknown partition table
Mar 15 18:44:54 il-ksm3-044 kernel: sd 8:0:1:15: [sdag] Attached SCSI disk

Mar 15 18:44:55 il-ksm3-044 multipathd: sdag: add path (uevent)
Mar 15 18:44:55 il-ksm3-044 multipathd: mpathiw: load table [0 209715200 multipath 0 0 1 1 queue-length 0 3 1 65:96 1 65:144 1 66:0 1]
Mar 15 18:44:55 il-ksm3-044 multipathd: sdag [66:0]: path added to devmap mpathiw
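
[Editorial sketch, not part of the original message.] The last "load table" line can be decoded to make the problem visible. The Python below is a minimal sketch that assumes the usual dm-multipath table layout (feature count, hardware-handler count, path-group count, then per group a selector and its paths); the function names are hypothetical. It compares the map length with the sector counts the kernel logged for each path. Note that multipathd groups paths by WWID, not by size, so the size comparison is only a quick sanity check.

```python
def parse_multipath_table(table):
    """Return (length_in_sectors, [(selector, [dev, ...]), ...]) for one table line."""
    tok = table.split()
    length, target = int(tok[1]), tok[2]
    if target != "multipath":
        raise ValueError("not a multipath table")
    i = 3
    i += 1 + int(tok[i])          # skip feature count and feature args
    i += 1 + int(tok[i])          # skip hw-handler count and its args
    n_groups = int(tok[i])
    i += 2                        # skip group count and initial group number
    groups = []
    for _ in range(n_groups):
        selector = tok[i]
        i += 2 + int(tok[i + 1])  # skip selector name, arg count and args
        n_paths, n_path_args = int(tok[i]), int(tok[i + 1])
        i += 2
        paths = []
        for _ in range(n_paths):
            paths.append(tok[i])  # major:minor of the path device
            i += 1 + n_path_args
        groups.append((selector, paths))
    return length, groups


def mismatched_paths(length, groups, path_sectors):
    """Return paths whose own size differs from the map length."""
    return [dev for _, paths in groups for dev in paths
            if path_sectors.get(dev) != length]


# Values copied from the log lines above: 65:96 is sdw, 65:144 is sdz, 66:0 is sdag.
TABLE = "0 209715200 multipath 0 0 1 1 queue-length 0 3 1 65:96 1 65:144 1 66:0 1"
SECTORS = {"65:96": 209715200, "65:144": 209715200, "66:0": 629145600}

length, groups = parse_multipath_table(TABLE)
print(mismatched_paths(length, groups, SECTORS))
```

With the values from this log, the only path flagged is 66:0 (sdag), a 300 GiB device inside a 100 GiB map.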

Thank you,
Ilan Steinberg


* Re: A path to a different device was added to an existing mapth
  2015-03-19 10:10 A path to a different device was added to an existing mapth Ilan Steinberg
@ 2015-03-19 10:35 ` Ilan Steinberg
  2015-03-27 23:26   ` Stewart, Sean
  0 siblings, 1 reply; 4+ messages in thread
From: Ilan Steinberg @ 2015-03-19 10:35 UTC (permalink / raw)
  To: dm-devel; +Cc: Guy Keren, Eli Malul, Shahar Salzman



Hi,


I have some more info:


multipathd> show maps
name    sysfs uuid
mpathhs dm-2  20024f400d5190000
mpathit dm-16 20024f400d519002b
mpathis dm-15 20024f400d519001c
mpathin dm-11 20024f400d5190012
mpathim dm-14 20024f400d5190011
mpathiw dm-13 20024f400d5190010
mpathhz dm-7  20024f400d5190006
mpathhx dm-6  20024f400d5190005
mpathhw dm-5  20024f400d5190004
mpathht dm-8  20024f400d5190003
mpathij dm-12 20024f400d519000e
mpathil dm-10 20024f400d519000d
mpathhv dm-3  20024f400d5190001
mpathik dm-9  20024f400d519000f
mpathhu dm-4  20024f400d5190002


It seems that multipathd really does hold mpathiw with the uuid of sdag, but...


Note that if you run 'multipath -l', you will see mpathiw with the correct
signature (20024f400d5190026), because it queries the storage again.
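
[Editorial sketch, not part of the original message.] The disagreement between the bindings file quoted in the first message and the 'show maps' output above can be detected mechanically. This is a minimal Python sketch; the function names are hypothetical and the parsing assumes the two-column bindings format and the three-column 'show maps' format shown in this thread.

```python
def parse_bindings(text):
    """Return {alias: wwid} from bindings-file text, ignoring comments and blanks."""
    bindings = {}
    for line in text.splitlines():
        fields = line.split("#", 1)[0].split()
        if len(fields) >= 2:
            bindings[fields[0]] = fields[1]
    return bindings


def parse_show_maps(text):
    """Return {alias: wwid} from 'multipathd show maps' output (name sysfs uuid)."""
    maps = {}
    for line in text.splitlines():
        fields = line.split()
        if len(fields) == 3 and fields[1].startswith("dm-"):
            maps[fields[0]] = fields[2]
    return maps


def find_mismatches(bindings, maps):
    """Return (alias, wwid_in_bindings, wwid_in_daemon) where the two disagree."""
    return sorted((alias, bindings[alias], wwid)
                  for alias, wwid in maps.items()
                  if alias in bindings and bindings[alias] != wwid)


# Excerpts copied from this thread.
BINDINGS = """\
mpathiv 20024f400d5190010
mpathiw 20024f400d5190026
"""
SHOW_MAPS = """\
name    sysfs uuid
mpathim dm-14 20024f400d5190011
mpathiw dm-13 20024f400d5190010
"""

for alias, on_disk, in_daemon in find_mismatches(parse_bindings(BINDINGS),
                                                 parse_show_maps(SHOW_MAPS)):
    print(alias, "bindings:", on_disk, "multipathd:", in_daemon)
```

Only aliases present in both inputs are compared, so maps that never received a binding entry are not reported.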


Also:


multipathd> show paths format "%d %s %S %w"
dev  vend/prod/rev size uuid
.
.
sdaf KMNRIO,K2     300G 20024f400d519001e
.
sdw  KMNRIO,K2     100G 20024f400d5190010
.
.
sdz  KMNRIO,K2     100G 20024f400d5190026
.
sdag KMNRIO,K2     300G 20024f400d5190010


Unfortunately, the device for sdag was deleted from the storage, but I can still see that:

[root@il-ksm3-044 ~]# sg_inq -p 0x83 /dev/sdw
VPD INQUIRY: Device Identification page
  Designation descriptor number 1, descriptor length: 36
    designator_type: T10 vendor identification,  code_set: ASCII
    associated with the addressed logical unit
      vendor id: KMNRIO
      vendor specific: ATF-VG-1:EXP-SNAd5190026
  Designation descriptor number 2, descriptor length: 12
    designator_type: EUI-64 based,  code_set: Binary
    associated with the addressed logical unit
      EUI-64 based 8 byte identifier
      IEEE Company_id: 0x24f4
      Vendor Specific Extension Identifier: 0xd5190026
      [0x0024f400d5190026]
[root@il-ksm3-044 ~]# sg_inq -p 0x83 /dev/sdz
VPD INQUIRY: Device Identification page
  Designation descriptor number 1, descriptor length: 36
    designator_type: T10 vendor identification,  code_set: ASCII
    associated with the addressed logical unit
      vendor id: KMNRIO
      vendor specific: ATF-VG-1:EXP-SNAd5190026
  Designation descriptor number 2, descriptor length: 12
    designator_type: EUI-64 based,  code_set: Binary
    associated with the addressed logical unit
      EUI-64 based 8 byte identifier
      IEEE Company_id: 0x24f4
      Vendor Specific Extension Identifier: 0xd5190026
      [0x0024f400d5190026]

So, sdw and sdz are paths to the same device, and so are sdaf and sdag...
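
[Editorial sketch, not part of the original message.] The sg_inq output above can be turned back into the identifier multipathd uses and compared with what the daemon has cached. The Python below is a minimal sketch with hypothetical function names. It assumes that the WWID reported here is the EUI-64 value prefixed with the designator type digit (2), which is inferred from the values in this thread (0x0024f400d5190026 versus 20024f400d5190026), not taken from the scsi_id source.

```python
def wwid_from_eui64(eui64_hex, designator_type=2):
    """Build a scsi_id-style WWID from an EUI-64 value such as '0x0024f400d5190026'."""
    digits = eui64_hex.lower()
    if digits.startswith("0x"):
        digits = digits[2:]
    if len(digits) != 16 or any(c not in "0123456789abcdef" for c in digits):
        raise ValueError("expected 16 hex digits, got %r" % eui64_hex)
    return "%x%s" % (designator_type, digits)


def stale_paths(cached, fresh):
    """Return devices whose cached WWID differs from a freshly queried one."""
    return sorted(dev for dev, wwid in cached.items()
                  if dev in fresh and fresh[dev] != wwid)


# Cached values from 'show paths', fresh values from the sg_inq output above.
CACHED = {"sdw": "20024f400d5190010", "sdz": "20024f400d5190026"}
FRESH = {
    "sdw": wwid_from_eui64("0x0024f400d5190026"),
    "sdz": wwid_from_eui64("0x0024f400d5190026"),
}

print(stale_paths(CACHED, FRESH))
```

With the values from this message, sdw is the path whose cached WWID no longer matches what the storage reports.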

Regards,
Ilan Steinberg




* Re: A path to a different device was added to an existing mapth
  2015-03-19 10:35 ` Ilan Steinberg
@ 2015-03-27 23:26   ` Stewart, Sean
  2015-03-28 11:32     ` guy keren
  0 siblings, 1 reply; 4+ messages in thread
From: Stewart, Sean @ 2015-03-27 23:26 UTC (permalink / raw)
  To: device-mapper development; +Cc: Guy Keren, Eli Malul, Shahar Salzman

Hi Ilan,


I believe this is solved by the following patch:
http://git.opensvc.com/gitweb.cgi?p=multipath-tools/.git;a=commit;h=5adec73edcdee912821ca8378439dc105e82c60f 

Per the patch description:
When a system is booted to the SAN, a condition can occur where one
user friendly name is given to a disk during boot, but multipathd tries
to allocate a different one after boot. If the second alias is already
used by another device, multipathd can't rename it. Multipathd then has
incorrect information about the alias/wwid relationships, which can
result in paths being added to the wrong map.

On Thu, 2015-03-19 at 10:35 +0000, Ilan Steinberg wrote:

> multipathd> show maps
> name    sysfs uuid
> mpathiw dm-13 20024f400d5190010

This shows the current running configuration: multipathd is operating
in the belief that mpathiw should be 20024f400d5190010.

> This might be useful info - in the bindings file I see:
> 
> ...
> 
> mpathiv 20024f400d5190010
> <---------------------------------------- this is the scsi_sn of sdag
> 
> mpathiw 20024f400d5190026
> <---------------------------------------- this is the scsi_sn of sdz/w
> 
> ...
> 
So what must have happened is that the bindings file was out of sync
between the initramfs and the local fs (which you can check by unpacking
the initramfs and comparing the wwids in each file): the device was
created with one name, multipathd tried to rename it, couldn't, and then
started adding paths to the wrong map. It's hard to explain clearly. :)
If we could see what both bindings files say, maybe I could explain it
better.
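
[Editorial sketch, not part of the original message.] The comparison of the two bindings files described above can be automated once the initramfs copy has been extracted. This is a minimal Python sketch; the function names are hypothetical, and the sample contents are invented for illustration because the thread never shows the initramfs copy.

```python
def parse_bindings(text):
    """Return {alias: wwid} from bindings-file text, ignoring comments and blanks."""
    bindings = {}
    for line in text.splitlines():
        fields = line.split("#", 1)[0].split()
        if len(fields) >= 2:
            bindings[fields[0]] = fields[1]
    return bindings


def diff_bindings(initramfs_text, rootfs_text):
    """Return (alias, wwid_in_initramfs, wwid_on_rootfs) for conflicting aliases."""
    early = parse_bindings(initramfs_text)
    late = parse_bindings(rootfs_text)
    return [(alias, early[alias], late[alias])
            for alias in sorted(set(early) & set(late))
            if early[alias] != late[alias]]


# HYPOTHETICAL initramfs copy; the root-fs copy matches the excerpt in the thread.
INITRAMFS = "mpathiv 20024f400d5190026\n"
ROOTFS = "mpathiv 20024f400d5190010\nmpathiw 20024f400d5190026\n"

for alias, early, late in diff_bindings(INITRAMFS, ROOTFS):
    print(alias, "initramfs:", early, "rootfs:", late)
```

An empty result means the two files agree on every alias they share; aliases present in only one file are ignored here.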

To work around it, rebuilding the initramfs to sync the bindings should
suffice, or you could define aliases in the multipaths section of
multipath.conf.
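
[Editorial sketch, not part of the original message.] Fixed aliases in multipath.conf take the bindings file out of the picture for the listed WWIDs. The Python below is a minimal sketch that renders a multipaths section from a WWID-to-alias mapping; the function name and the aliases "lun11" and "lun15" are hypothetical, while the WWIDs are the two discussed in this thread.

```python
def multipaths_section(aliases):
    """Render a multipath.conf 'multipaths' section from {wwid: alias}."""
    lines = ["multipaths {"]
    for wwid, alias in sorted(aliases.items()):
        lines += [
            "        multipath {",
            "                wwid    %s" % wwid,
            "                alias   %s" % alias,
            "        }",
        ]
    lines.append("}")
    return "\n".join(lines)


# Hypothetical alias names for the two LUNs from this thread.
print(multipaths_section({
    "20024f400d5190026": "lun11",
    "20024f400d5190010": "lun15",
}))
```

The generated text is meant to be reviewed and merged into multipath.conf by hand, after which multipathd needs to reload its configuration.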

Hope this helps.


Thanks,
Sean Stewart


* Re: A path to a different device was added to an existing mapth
  2015-03-27 23:26   ` Stewart, Sean
@ 2015-03-28 11:32     ` guy keren
  0 siblings, 0 replies; 4+ messages in thread
From: guy keren @ 2015-03-28 11:32 UTC (permalink / raw)
  To: Stewart, Sean, device-mapper development; +Cc: Eli Malul, Shahar Salzman


hi Sean,

in the case stated, the host was not rebooted during this test. what
changed is the exposure of LUNs to the host (unmapping and re-mapping of
LUNs, deleting and re-creating of LUNs).

could these kinds of operations trigger similar issues?

from what we saw, multipathd's state made it think that some paths, that 
have different WWIDs, should be bound to the same multipath device 
(which is incorrect).

a different question comes to mind:

   if we avoid using the friendly names, and instead use the 
/dev/mapper/<wwid> device paths - is this likely to avoid the problem in 
the first place?

--guy


