linux-btrfs.vger.kernel.org archive mirror
 help / color / mirror / Atom feed
* Btrfs broken in massive transfar
@ 2018-03-12 21:57 MASAKI haruka
  2018-03-13  2:27 ` Qu Wenruo
  0 siblings, 1 reply; 9+ messages in thread
From: MASAKI haruka @ 2018-03-12 21:57 UTC (permalink / raw)
  To: linux-btrfs

I'm trying to clone 18TiB data between btrfs,
but it will crash anyway.

This probrem is occured even how to clone (btrfs send/receive, rsync or cp.)
I experienced same probrem in Linux 3.9 and Linux 3.10.

What happen:

1. Failed to write because I/O error (read only filesystem)
2. writing to the btrfs succeeds and fails randomly.
3. The btrfs unable to unmount (resource is busy.) Unable to umount even forcely, so cannot halt.

Example:
---
mkfile o7784-11-0
rename o7784-11-0 -> .filesystem/HDD/.XFV_pp/,fQO40jotqhUZ0/5JSSubx1Ph5xYNOcXhIAoIK3/XDGOWpbx,5zYWEi0L5LHdWBo/kVxX8RdGhryQiEMOm4II2qMw
utimes .filesystem/HDD/.XFV_pp/,fQO40jotqhUZ0/5JSSubx1Ph5xYNOcXhIAoIK3/XDGOWpbx,5zYWEi0L5LHdWBo
truncate .filesystem/HDD/.XFV_pp/,fQO40jotqhUZ0/5JSSubx1Ph5xYNOcXhIAoIK3/XDGOWpbx,5zYWEi0L5LHdWBo/kVxX8RdGhryQiEMOm4II2qMw size=1073698824
chown .filesystem/HDD/.XFV_pp/,fQO40jotqhUZ0/5JSSubx1Ph5xYNOcXhIAoIK3/XDGOWpbx,5zYWEi0L5LHdWBo/kVxX8RdGhryQiEMOm4II2qMw - uid=1000, gid=1000
chmod .filesystem/HDD/.XFV_pp/,fQO40jotqhUZ0/5JSSubx1Ph5xYNOcXhIAoIK3/XDGOWpbx,5zYWEi0L5LHdWBo/kVxX8RdGhryQiEMOm4II2qMw - mode=0600
utimes .filesystem/HDD/.XFV_pp/,fQO40jotqhUZ0/5JSSubx1Ph5xYNOcXhIAoIK3/XDGOWpbx,5zYWEi0L5LHdWBo/kVxX8RdGhryQiEMOm4II2qMw
mkfile o7785-12-0
rename o7785-12-0 -> .filesystem/HDD/.XFV_pp/,fQO40jotqhUZ0/5JSSubx1Ph5xYNOcXhIAoIK3/XDGOWpbx,5zYWEi0L5LHdWBo/lSABmfoArm9pAtade-gHmS6X
utimes .filesystem/HDD/.XFV_pp/,fQO40jotqhUZ0/5JSSubx1Ph5xYNOcXhIAoIK3/XDGOWpbx,5zYWEi0L5LHdWBo
truncate .filesystem/HDD/.XFV_pp/,fQO40jotqhUZ0/5JSSubx1Ph5xYNOcXhIAoIK3/XDGOWpbx,5zYWEi0L5LHdWBo/lSABmfoArm9pAtade-gHmS6X size=864067592
ERROR: truncate .filesystem/HDD/.XFV_pp/,fQO40jotqhUZ0/5JSSubx1Ph5xYNOcXhIAoIK3/XDGOWpbx,5zYWEi0L5LHdWBo/lSABmfoArm9pAtade-gHmS6X failed: Input/output error
btrfs send 180310235348  0.09s user 11.98s system 16% cpu 1:14.42 total
---

Tries:
1.
Connect between host A (btrfs, 4disks) and B with socat (TCP).
Host B write to iSCSI disk (btrfs, single).
clone with btrfs send/receive. Linux 4.15.
-> Crashed at transfarred 1.78TB

2.
Delete snapshot and retry.
Connect between host A and B with SSH and socat (UNIX).
Host B write to iSCSI disk (btrfs, single).
clone with btrfs send/receive. Linux 4.15.
-> Crashed at transfarred 90GB

3.
Recreate btrfs.
Host A write to iSCSI disk.
clone with btrfs send/receive. Linux 4.15.
-> Crashed at transfarred 260GB

4.
Recreate btrfs.
Original disk attach to other computer (having more resource.)
clone with btrfs send/receive. Linux 4.15.
-> Crashed at transfarred 120GB

5.
Recreate btrfs.
Clone with rsync. Linux 4.15.
-> Crashed at transfarred 100GB

6.
Recreate btrfs.
Try with Linux 4.14, btrfs send/receive.
-> Crashed at transfarred 3.98TB

7.
Recreate btrfs.
Connect between host and NAS (iSCSI) with GbE cable directly.
Mounted with options relatime, spase_cache, compress=lzo.
clone with btrfs send/receive. Linux 4.14.
-> Crashed at transfarred 2.13TB
-- 
MASAKI haruka <yek@reasonset.net>

^ permalink raw reply	[flat|nested] 9+ messages in thread

* Re: Btrfs broken in massive transfar
  2018-03-12 21:57 Btrfs broken in massive transfar MASAKI haruka
@ 2018-03-13  2:27 ` Qu Wenruo
  2018-03-13  4:19   ` MASAKI haruka
  0 siblings, 1 reply; 9+ messages in thread
From: Qu Wenruo @ 2018-03-13  2:27 UTC (permalink / raw)
  To: MASAKI haruka, linux-btrfs


[-- Attachment #1.1: Type: text/plain, Size: 3525 bytes --]



On 2018年03月13日 05:57, MASAKI haruka wrote:
> I'm trying to clone 18TiB data between btrfs,
> but it will crash anyway.
> 
> This probrem is occured even how to clone (btrfs send/receive, rsync or cp.)
> I experienced same probrem in Linux 3.9 and Linux 3.10.

Did you really mean *3*.9 and *3*.10?

That's too old for btrfs usage IIRC.

It would be *4*.9 or *4*.10 for a relative new kernel for btrfs.

Would you please try some latest mainline kernel again?

> 
> What happen:
> 
> 1. Failed to write because I/O error (read only filesystem)
> 2. writing to the btrfs succeeds and fails randomly.
> 3. The btrfs unable to unmount (resource is busy.) Unable to umount even forcely, so cannot halt.
> 
> Example:
> ---
> mkfile o7784-11-0
> rename o7784-11-0 -> .filesystem/HDD/.XFV_pp/,fQO40jotqhUZ0/5JSSubx1Ph5xYNOcXhIAoIK3/XDGOWpbx,5zYWEi0L5LHdWBo/kVxX8RdGhryQiEMOm4II2qMw
> utimes .filesystem/HDD/.XFV_pp/,fQO40jotqhUZ0/5JSSubx1Ph5xYNOcXhIAoIK3/XDGOWpbx,5zYWEi0L5LHdWBo
> truncate .filesystem/HDD/.XFV_pp/,fQO40jotqhUZ0/5JSSubx1Ph5xYNOcXhIAoIK3/XDGOWpbx,5zYWEi0L5LHdWBo/kVxX8RdGhryQiEMOm4II2qMw size=1073698824
> chown .filesystem/HDD/.XFV_pp/,fQO40jotqhUZ0/5JSSubx1Ph5xYNOcXhIAoIK3/XDGOWpbx,5zYWEi0L5LHdWBo/kVxX8RdGhryQiEMOm4II2qMw - uid=1000, gid=1000
> chmod .filesystem/HDD/.XFV_pp/,fQO40jotqhUZ0/5JSSubx1Ph5xYNOcXhIAoIK3/XDGOWpbx,5zYWEi0L5LHdWBo/kVxX8RdGhryQiEMOm4II2qMw - mode=0600
> utimes .filesystem/HDD/.XFV_pp/,fQO40jotqhUZ0/5JSSubx1Ph5xYNOcXhIAoIK3/XDGOWpbx,5zYWEi0L5LHdWBo/kVxX8RdGhryQiEMOm4II2qMw
> mkfile o7785-12-0
> rename o7785-12-0 -> .filesystem/HDD/.XFV_pp/,fQO40jotqhUZ0/5JSSubx1Ph5xYNOcXhIAoIK3/XDGOWpbx,5zYWEi0L5LHdWBo/lSABmfoArm9pAtade-gHmS6X
> utimes .filesystem/HDD/.XFV_pp/,fQO40jotqhUZ0/5JSSubx1Ph5xYNOcXhIAoIK3/XDGOWpbx,5zYWEi0L5LHdWBo
> truncate .filesystem/HDD/.XFV_pp/,fQO40jotqhUZ0/5JSSubx1Ph5xYNOcXhIAoIK3/XDGOWpbx,5zYWEi0L5LHdWBo/lSABmfoArm9pAtade-gHmS6X size=864067592
> ERROR: truncate .filesystem/HDD/.XFV_pp/,fQO40jotqhUZ0/5JSSubx1Ph5xYNOcXhIAoIK3/XDGOWpbx,5zYWEi0L5LHdWBo/lSABmfoArm9pAtade-gHmS6X failed: Input/output error
> btrfs send 180310235348  0.09s user 11.98s system 16% cpu 1:14.42 total
> ---

In that case, we need kernel message to investigate.
(And of course, please use at least 4.x kernel)

Thanks,
Qu

> 
> Tries:
> 1.
> Connect between host A (btrfs, 4disks) and B with socat (TCP).
> Host B write to iSCSI disk (btrfs, single).
> clone with btrfs send/receive. Linux 4.15.
> -> Crashed at transfarred 1.78TB
> 
> 2.
> Delete snapshot and retry.
> Connect between host A and B with SSH and socat (UNIX).
> Host B write to iSCSI disk (btrfs, single).
> clone with btrfs send/receive. Linux 4.15.
> -> Crashed at transfarred 90GB
> 
> 3.
> Recreate btrfs.
> Host A write to iSCSI disk.
> clone with btrfs send/receive. Linux 4.15.
> -> Crashed at transfarred 260GB
> 
> 4.
> Recreate btrfs.
> Original disk attach to other computer (having more resource.)
> clone with btrfs send/receive. Linux 4.15.
> -> Crashed at transfarred 120GB
> 
> 5.
> Recreate btrfs.
> Clone with rsync. Linux 4.15.
> -> Crashed at transfarred 100GB
> 
> 6.
> Recreate btrfs.
> Try with Linux 4.14, btrfs send/receive.
> -> Crashed at transfarred 3.98TB
> 
> 7.
> Recreate btrfs.
> Connect between host and NAS (iSCSI) with GbE cable directly.
> Mounted with options relatime, spase_cache, compress=lzo.
> clone with btrfs send/receive. Linux 4.14.
> -> Crashed at transfarred 2.13TB
> 


[-- Attachment #2: OpenPGP digital signature --]
[-- Type: application/pgp-signature, Size: 520 bytes --]

^ permalink raw reply	[flat|nested] 9+ messages in thread

* Re: Btrfs broken in massive transfar
  2018-03-13  2:27 ` Qu Wenruo
@ 2018-03-13  4:19   ` MASAKI haruka
  2018-03-13  5:04     ` Qu Wenruo
  0 siblings, 1 reply; 9+ messages in thread
From: MASAKI haruka @ 2018-03-13  4:19 UTC (permalink / raw)
  To: Qu Wenruo; +Cc: linux-btrfs

*Now* I tried with Linux 4.14 and 4.15.
I experienced same probrem and reported in 2014 with Linux 3.9 and 3.10. (Perchance, actually the kernel was newer than 3.10, anyway I experienced same probrem with old 3.x kernel.)

> 
> 
> On 2018年03月13日 05:57, MASAKI haruka wrote:
> > I'm trying to clone 18TiB data between btrfs,
> > but it will crash anyway.
> > 
> > This probrem is occured even how to clone (btrfs send/receive, rsync or cp.)
> > I experienced same probrem in Linux 3.9 and Linux 3.10.
> 
> Did you really mean *3*.9 and *3*.10?
> 
> That's too old for btrfs usage IIRC.
> 
> It would be *4*.9 or *4*.10 for a relative new kernel for btrfs.
> 
> Would you please try some latest mainline kernel again?
> 
> > 
> > What happen:
> > 
> > 1. Failed to write because I/O error (read only filesystem)
> > 2. writing to the btrfs succeeds and fails randomly.
> > 3. The btrfs unable to unmount (resource is busy.) Unable to umount even forcely, so cannot halt.
> > 
> > Example:
> > ---
> > mkfile o7784-11-0
> > rename o7784-11-0 -> .filesystem/HDD/.XFV_pp/,fQO40jotqhUZ0/5JSSubx1Ph5xYNOcXhIAoIK3/XDGOWpbx,5zYWEi0L5LHdWBo/kVxX8RdGhryQiEMOm4II2qMw
> > utimes .filesystem/HDD/.XFV_pp/,fQO40jotqhUZ0/5JSSubx1Ph5xYNOcXhIAoIK3/XDGOWpbx,5zYWEi0L5LHdWBo
> > truncate .filesystem/HDD/.XFV_pp/,fQO40jotqhUZ0/5JSSubx1Ph5xYNOcXhIAoIK3/XDGOWpbx,5zYWEi0L5LHdWBo/kVxX8RdGhryQiEMOm4II2qMw size=1073698824
> > chown .filesystem/HDD/.XFV_pp/,fQO40jotqhUZ0/5JSSubx1Ph5xYNOcXhIAoIK3/XDGOWpbx,5zYWEi0L5LHdWBo/kVxX8RdGhryQiEMOm4II2qMw - uid=1000, gid=1000
> > chmod .filesystem/HDD/.XFV_pp/,fQO40jotqhUZ0/5JSSubx1Ph5xYNOcXhIAoIK3/XDGOWpbx,5zYWEi0L5LHdWBo/kVxX8RdGhryQiEMOm4II2qMw - mode=0600
> > utimes .filesystem/HDD/.XFV_pp/,fQO40jotqhUZ0/5JSSubx1Ph5xYNOcXhIAoIK3/XDGOWpbx,5zYWEi0L5LHdWBo/kVxX8RdGhryQiEMOm4II2qMw
> > mkfile o7785-12-0
> > rename o7785-12-0 -> .filesystem/HDD/.XFV_pp/,fQO40jotqhUZ0/5JSSubx1Ph5xYNOcXhIAoIK3/XDGOWpbx,5zYWEi0L5LHdWBo/lSABmfoArm9pAtade-gHmS6X
> > utimes .filesystem/HDD/.XFV_pp/,fQO40jotqhUZ0/5JSSubx1Ph5xYNOcXhIAoIK3/XDGOWpbx,5zYWEi0L5LHdWBo
> > truncate .filesystem/HDD/.XFV_pp/,fQO40jotqhUZ0/5JSSubx1Ph5xYNOcXhIAoIK3/XDGOWpbx,5zYWEi0L5LHdWBo/lSABmfoArm9pAtade-gHmS6X size=864067592
> > ERROR: truncate .filesystem/HDD/.XFV_pp/,fQO40jotqhUZ0/5JSSubx1Ph5xYNOcXhIAoIK3/XDGOWpbx,5zYWEi0L5LHdWBo/lSABmfoArm9pAtade-gHmS6X failed: Input/output error
> > btrfs send 180310235348  0.09s user 11.98s system 16% cpu 1:14.42 total
> > ---
> 
> In that case, we need kernel message to investigate.
> (And of course, please use at least 4.x kernel)
> 
> Thanks,
> Qu
> 
> > 
> > Tries:
> > 1.
> > Connect between host A (btrfs, 4disks) and B with socat (TCP).
> > Host B write to iSCSI disk (btrfs, single).
> > clone with btrfs send/receive. Linux 4.15.
> > -> Crashed at transfarred 1.78TB
> > 
> > 2.
> > Delete snapshot and retry.
> > Connect between host A and B with SSH and socat (UNIX).
> > Host B write to iSCSI disk (btrfs, single).
> > clone with btrfs send/receive. Linux 4.15.
> > -> Crashed at transfarred 90GB
> > 
> > 3.
> > Recreate btrfs.
> > Host A write to iSCSI disk.
> > clone with btrfs send/receive. Linux 4.15.
> > -> Crashed at transfarred 260GB
> > 
> > 4.
> > Recreate btrfs.
> > Original disk attach to other computer (having more resource.)
> > clone with btrfs send/receive. Linux 4.15.
> > -> Crashed at transfarred 120GB
> > 
> > 5.
> > Recreate btrfs.
> > Clone with rsync. Linux 4.15.
> > -> Crashed at transfarred 100GB
> > 
> > 6.
> > Recreate btrfs.
> > Try with Linux 4.14, btrfs send/receive.
> > -> Crashed at transfarred 3.98TB
> > 
> > 7.
> > Recreate btrfs.
> > Connect between host and NAS (iSCSI) with GbE cable directly.
> > Mounted with options relatime, spase_cache, compress=lzo.
> > clone with btrfs send/receive. Linux 4.14.
> > -> Crashed at transfarred 2.13TB
> > 
> 


-- 
MASAKI haruka <yek@reasonset.net>

^ permalink raw reply	[flat|nested] 9+ messages in thread

* Re: Btrfs broken in massive transfar
  2018-03-13  4:19   ` MASAKI haruka
@ 2018-03-13  5:04     ` Qu Wenruo
  2018-03-13 19:25       ` MASAKI haruka
  0 siblings, 1 reply; 9+ messages in thread
From: Qu Wenruo @ 2018-03-13  5:04 UTC (permalink / raw)
  To: MASAKI haruka; +Cc: linux-btrfs


[-- Attachment #1.1: Type: text/plain, Size: 4125 bytes --]



On 2018年03月13日 12:19, MASAKI haruka wrote:
> *Now* I tried with Linux 4.14 and 4.15.
> I experienced same probrem and reported in 2014 with Linux 3.9 and 3.10. (Perchance, actually the kernel was newer than 3.10, anyway I experienced same probrem with old 3.x kernel.)

Then kernel message please.

Especially for the readonly case.

And "btrfs check" output please.

Thanks,
Qu


> 
>>
>>
>> On 2018年03月13日 05:57, MASAKI haruka wrote:
>>> I'm trying to clone 18TiB data between btrfs,
>>> but it will crash anyway.
>>>
>>> This probrem is occured even how to clone (btrfs send/receive, rsync or cp.)
>>> I experienced same probrem in Linux 3.9 and Linux 3.10.
>>
>> Did you really mean *3*.9 and *3*.10?
>>
>> That's too old for btrfs usage IIRC.
>>
>> It would be *4*.9 or *4*.10 for a relative new kernel for btrfs.
>>
>> Would you please try some latest mainline kernel again?
>>
>>>
>>> What happen:
>>>
>>> 1. Failed to write because I/O error (read only filesystem)
>>> 2. writing to the btrfs succeeds and fails randomly.
>>> 3. The btrfs unable to unmount (resource is busy.) Unable to umount even forcely, so cannot halt.
>>>
>>> Example:
>>> ---
>>> mkfile o7784-11-0
>>> rename o7784-11-0 -> .filesystem/HDD/.XFV_pp/,fQO40jotqhUZ0/5JSSubx1Ph5xYNOcXhIAoIK3/XDGOWpbx,5zYWEi0L5LHdWBo/kVxX8RdGhryQiEMOm4II2qMw
>>> utimes .filesystem/HDD/.XFV_pp/,fQO40jotqhUZ0/5JSSubx1Ph5xYNOcXhIAoIK3/XDGOWpbx,5zYWEi0L5LHdWBo
>>> truncate .filesystem/HDD/.XFV_pp/,fQO40jotqhUZ0/5JSSubx1Ph5xYNOcXhIAoIK3/XDGOWpbx,5zYWEi0L5LHdWBo/kVxX8RdGhryQiEMOm4II2qMw size=1073698824
>>> chown .filesystem/HDD/.XFV_pp/,fQO40jotqhUZ0/5JSSubx1Ph5xYNOcXhIAoIK3/XDGOWpbx,5zYWEi0L5LHdWBo/kVxX8RdGhryQiEMOm4II2qMw - uid=1000, gid=1000
>>> chmod .filesystem/HDD/.XFV_pp/,fQO40jotqhUZ0/5JSSubx1Ph5xYNOcXhIAoIK3/XDGOWpbx,5zYWEi0L5LHdWBo/kVxX8RdGhryQiEMOm4II2qMw - mode=0600
>>> utimes .filesystem/HDD/.XFV_pp/,fQO40jotqhUZ0/5JSSubx1Ph5xYNOcXhIAoIK3/XDGOWpbx,5zYWEi0L5LHdWBo/kVxX8RdGhryQiEMOm4II2qMw
>>> mkfile o7785-12-0
>>> rename o7785-12-0 -> .filesystem/HDD/.XFV_pp/,fQO40jotqhUZ0/5JSSubx1Ph5xYNOcXhIAoIK3/XDGOWpbx,5zYWEi0L5LHdWBo/lSABmfoArm9pAtade-gHmS6X
>>> utimes .filesystem/HDD/.XFV_pp/,fQO40jotqhUZ0/5JSSubx1Ph5xYNOcXhIAoIK3/XDGOWpbx,5zYWEi0L5LHdWBo
>>> truncate .filesystem/HDD/.XFV_pp/,fQO40jotqhUZ0/5JSSubx1Ph5xYNOcXhIAoIK3/XDGOWpbx,5zYWEi0L5LHdWBo/lSABmfoArm9pAtade-gHmS6X size=864067592
>>> ERROR: truncate .filesystem/HDD/.XFV_pp/,fQO40jotqhUZ0/5JSSubx1Ph5xYNOcXhIAoIK3/XDGOWpbx,5zYWEi0L5LHdWBo/lSABmfoArm9pAtade-gHmS6X failed: Input/output error
>>> btrfs send 180310235348  0.09s user 11.98s system 16% cpu 1:14.42 total
>>> ---
>>
>> In that case, we need kernel message to investigate.
>> (And of course, please use at least 4.x kernel)
>>
>> Thanks,
>> Qu
>>
>>>
>>> Tries:
>>> 1.
>>> Connect between host A (btrfs, 4disks) and B with socat (TCP).
>>> Host B write to iSCSI disk (btrfs, single).
>>> clone with btrfs send/receive. Linux 4.15.
>>> -> Crashed at transfarred 1.78TB
>>>
>>> 2.
>>> Delete snapshot and retry.
>>> Connect between host A and B with SSH and socat (UNIX).
>>> Host B write to iSCSI disk (btrfs, single).
>>> clone with btrfs send/receive. Linux 4.15.
>>> -> Crashed at transfarred 90GB
>>>
>>> 3.
>>> Recreate btrfs.
>>> Host A write to iSCSI disk.
>>> clone with btrfs send/receive. Linux 4.15.
>>> -> Crashed at transfarred 260GB
>>>
>>> 4.
>>> Recreate btrfs.
>>> Original disk attach to other computer (having more resource.)
>>> clone with btrfs send/receive. Linux 4.15.
>>> -> Crashed at transfarred 120GB
>>>
>>> 5.
>>> Recreate btrfs.
>>> Clone with rsync. Linux 4.15.
>>> -> Crashed at transfarred 100GB
>>>
>>> 6.
>>> Recreate btrfs.
>>> Try with Linux 4.14, btrfs send/receive.
>>> -> Crashed at transfarred 3.98TB
>>>
>>> 7.
>>> Recreate btrfs.
>>> Connect between host and NAS (iSCSI) with GbE cable directly.
>>> Mounted with options relatime, spase_cache, compress=lzo.
>>> clone with btrfs send/receive. Linux 4.14.
>>> -> Crashed at transfarred 2.13TB
>>>
>>
> 
> 


[-- Attachment #2: OpenPGP digital signature --]
[-- Type: application/pgp-signature, Size: 520 bytes --]

^ permalink raw reply	[flat|nested] 9+ messages in thread

* Re: Btrfs broken in massive transfar
  2018-03-13  5:04     ` Qu Wenruo
@ 2018-03-13 19:25       ` MASAKI haruka
  2018-03-14  4:09         ` Chris Murphy
  0 siblings, 1 reply; 9+ messages in thread
From: MASAKI haruka @ 2018-03-13 19:25 UTC (permalink / raw)
  To: Qu Wenruo; +Cc: linux-btrfs

journal(Kernel log), 7th try (to be readonly):

---
 3月 12 16:25:51 lily kernel: BTRFS info (device dm-6): creating UUID tree
 3月 12 16:25:53 lily iscsid[1406]: Connection-1:0 to [target: iqn.1994-11.com.netgear:eggplant-01:edc9adcf:btr1group, portal: 192.168.1.166,3260] through [iface: default] is shutdown.
 3月 12 16:25:53 lily iscsid[1406]: IPC qtask write failed: Broken pipe
 3月 12 16:26:18 lily kernel:  connection1:0: detected conn error (1020)
 3月 12 16:26:19 lily iscsid[1406]: Kernel reported iSCSI connection 1:0 error (1020 - ISCSI_ERR_TCP_CONN_CLOSE: TCP connection closed) state (3)
 3月 12 16:26:21 lily kernel: sd 8:0:0:0: [sdg] tag#5 UNKNOWN(0x2003) Result: hostbyte=0x00 driverbyte=0x08
 3月 12 16:26:21 lily kernel: sd 8:0:0:0: [sdg] tag#5 Sense Key : 0x2 [current] [descriptor] 
 3月 12 16:26:21 lily kernel: sd 8:0:0:0: [sdg] tag#5 ASC=0x8 ASCQ=0x0 
 3月 12 16:26:21 lily kernel: sd 8:0:0:0: [sdg] tag#5 CDB: opcode=0x8a 8a 00 00 00 00 00 00 42 5c 00 00 00 34 00 00 00
 3月 12 16:26:21 lily kernel: print_req_error: I/O error, dev sdg, sector 4348928
 3月 12 16:26:21 lily kernel: BTRFS error (device dm-6): bdev /dev/mapper/hymaster_1 errs: wr 1, rd 0, flush 0, corrupt 0, gen 0
 3月 12 16:26:21 lily kernel: BTRFS error (device dm-6): bdev /dev/mapper/hymaster_1 errs: wr 2, rd 0, flush 0, corrupt 0, gen 0
 3月 12 16:26:21 lily kernel: BTRFS error (device dm-6): bdev /dev/mapper/hymaster_1 errs: wr 3, rd 0, flush 0, corrupt 0, gen 0
 3月 12 16:26:21 lily kernel: BTRFS error (device dm-6): bdev /dev/mapper/hymaster_1 errs: wr 4, rd 0, flush 0, corrupt 0, gen 0
 3月 12 16:26:21 lily kernel: BTRFS error (device dm-6): bdev /dev/mapper/hymaster_1 errs: wr 5, rd 0, flush 0, corrupt 0, gen 0
 3月 12 16:26:21 lily kernel: BTRFS error (device dm-6): bdev /dev/mapper/hymaster_1 errs: wr 6, rd 0, flush 0, corrupt 0, gen 0
 3月 12 16:26:21 lily kernel: BTRFS error (device dm-6): bdev /dev/mapper/hymaster_1 errs: wr 7, rd 0, flush 0, corrupt 0, gen 0
 3月 12 16:26:21 lily kernel: BTRFS error (device dm-6): bdev /dev/mapper/hymaster_1 errs: wr 8, rd 0, flush 0, corrupt 0, gen 0
 3月 12 16:26:21 lily kernel: BTRFS error (device dm-6): bdev /dev/mapper/hymaster_1 errs: wr 9, rd 0, flush 0, corrupt 0, gen 0
 3月 12 16:26:21 lily kernel: BTRFS error (device dm-6): bdev /dev/mapper/hymaster_1 errs: wr 10, rd 0, flush 0, corrupt 0, gen 0
 3月 12 16:26:21 lily kernel: sd 8:0:0:0: [sdg] tag#6 UNKNOWN(0x2003) Result: hostbyte=0x00 driverbyte=0x08
 3月 12 16:26:21 lily kernel: sd 8:0:0:0: [sdg] tag#6 Sense Key : 0x2 [current] [descriptor] 
 3月 12 16:26:21 lily kernel: sd 8:0:0:0: [sdg] tag#6 ASC=0x8 ASCQ=0x0 
 3月 12 16:26:21 lily kernel: sd 8:0:0:0: [sdg] tag#6 CDB: opcode=0x8a 8a 00 00 00 00 00 00 42 20 00 00 00 3c 00 00 00
 3月 12 16:26:21 lily kernel: print_req_error: I/O error, dev sdg, sector 4333568
 3月 12 16:26:21 lily kernel:  connection1:0: detected conn error (1020)
 3月 12 16:26:21 lily iscsid[1406]: connection1:0 is operational after recovery (1 attempts)
 3月 12 16:26:21 lily iscsid[1406]: Kernel reported iSCSI connection 1:0 error (1020 - ISCSI_ERR_TCP_CONN_CLOSE: TCP connection closed) state (3)
 3月 12 16:26:23 lily kernel:  connection1:0: detected conn error (1020)
 3月 12 16:26:23 lily iscsid[1406]: connection1:0 is operational after recovery (1 attempts)
 3月 12 16:26:23 lily iscsid[1406]: Kernel reported iSCSI connection 1:0 error (1020 - ISCSI_ERR_TCP_CONN_CLOSE: TCP connection closed) state (3)
 3月 12 16:26:25 lily kernel:  connection1:0: detected conn error (1020)
 3月 12 16:26:25 lily iscsid[1406]: connection1:0 is operational after recovery (1 attempts)
 3月 12 16:26:25 lily iscsid[1406]: Kernel reported iSCSI connection 1:0 error (1020 - ISCSI_ERR_TCP_CONN_CLOSE: TCP connection closed) state (3)
 3月 12 16:26:27 lily iscsid[1406]: connection1:0 is operational after recovery (1 attempts)
 3月 12 16:31:31 lily kernel:  connection1:0: detected conn error (1020)
 3月 12 16:31:32 lily iscsid[1406]: Kernel reported iSCSI connection 1:0 error (1020 - ISCSI_ERR_TCP_CONN_CLOSE: TCP connection closed) state (3)
 3月 12 16:31:34 lily iscsid[1406]: connection1:0 is operational after recovery (1 attempts)
 3月 12 16:35:13 lily systemd[1]: Starting Cleanup of Temporary Directories...
 3月 12 16:35:13 lily systemd[1]: Started Cleanup of Temporary Directories.
...
 3月 12 16:53:40 lily kernel: hrtimer: interrupt took 14162 ns
 3月 12 17:01:01 lily CROND[1926]: (root) CMD (run-parts /etc/cron.hourly)
 3月 12 18:01:01 lily CROND[2095]: (root) CMD (run-parts /etc/cron.hourly)
 3月 12 18:11:32 lily kernel: sd 8:0:0:0: [sdg] tag#15 UNKNOWN(0x2003) Result: hostbyte=0x00 driverbyte=0x08
 3月 12 18:11:32 lily kernel: sd 8:0:0:0: [sdg] tag#15 Sense Key : 0x2 [current] [descriptor] 
 3月 12 18:11:32 lily kernel: sd 8:0:0:0: [sdg] tag#15 ASC=0x8 ASCQ=0x0 
 3月 12 18:11:32 lily kernel: sd 8:0:0:0: [sdg] tag#15 CDB: opcode=0x8a 8a 00 00 00 00 00 35 9f df 30 00 00 40 00 00 00
 3月 12 18:11:32 lily kernel: print_req_error: I/O error, dev sdg, sector 899669808
 3月 12 18:11:32 lily kernel: btrfs_dev_stat_print_on_error: 18 callbacks suppressed
 3月 12 18:11:32 lily kernel: BTRFS error (device dm-6): bdev /dev/mapper/hymaster_1 errs: wr 29, rd 0, flush 0, corrupt 0, gen 0
 3月 12 18:11:32 lily kernel: BTRFS error (device dm-6): bdev /dev/mapper/hymaster_1 errs: wr 30, rd 0, flush 0, corrupt 0, gen 0
 3月 12 18:11:32 lily kernel: BTRFS error (device dm-6): bdev /dev/mapper/hymaster_1 errs: wr 31, rd 0, flush 0, corrupt 0, gen 0
 3月 12 18:11:32 lily kernel: BTRFS error (device dm-6): bdev /dev/mapper/hymaster_1 errs: wr 32, rd 0, flush 0, corrupt 0, gen 0
 3月 12 18:11:32 lily kernel: BTRFS error (device dm-6): bdev /dev/mapper/hymaster_1 errs: wr 33, rd 0, flush 0, corrupt 0, gen 0
 3月 12 18:11:32 lily kernel: BTRFS error (device dm-6): bdev /dev/mapper/hymaster_1 errs: wr 34, rd 0, flush 0, corrupt 0, gen 0
 3月 12 18:11:32 lily kernel: BTRFS error (device dm-6): bdev /dev/mapper/hymaster_1 errs: wr 35, rd 0, flush 0, corrupt 0, gen 0
 3月 12 18:11:32 lily kernel: BTRFS error (device dm-6): bdev /dev/mapper/hymaster_1 errs: wr 36, rd 0, flush 0, corrupt 0, gen 0
 3月 12 18:11:32 lily kernel: BTRFS error (device dm-6): bdev /dev/mapper/hymaster_1 errs: wr 37, rd 0, flush 0, corrupt 0, gen 0
 3月 12 18:11:32 lily kernel: BTRFS error (device dm-6): bdev /dev/mapper/hymaster_1 errs: wr 38, rd 0, flush 0, corrupt 0, gen 0
 3月 12 19:01:01 lily CROND[2257]: (root) CMD (run-parts /etc/cron.hourly)
 3月 12 19:14:24 lily kernel:  connection1:0: detected conn error (1020)
 3月 12 19:14:25 lily iscsid[1406]: Kernel reported iSCSI connection 1:0 error (1020 - ISCSI_ERR_TCP_CONN_CLOSE: TCP connection closed) state (3)
 3月 12 19:14:27 lily iscsid[1406]: connection1:0 is operational after recovery (1 attempts)
 3月 12 19:14:34 lily kernel:  connection1:0: detected conn error (1020)
 3月 12 19:14:35 lily iscsid[1406]: Kernel reported iSCSI connection 1:0 error (1020 - ISCSI_ERR_TCP_CONN_CLOSE: TCP connection closed) state (3)
 3月 12 19:14:37 lily iscsid[1406]: connection1:0 is operational after recovery (1 attempts)
...
 3月 13 00:15:26 lily kernel: sd 8:0:0:0: [sdg] tag#2 UNKNOWN(0x2003) Result: hostbyte=0x00 driverbyte=0x08
 3月 13 00:15:26 lily kernel: sd 8:0:0:0: [sdg] tag#2 Sense Key : 0x2 [current] [descriptor] 
 3月 13 00:15:26 lily kernel: sd 8:0:0:0: [sdg] tag#2 ASC=0x8 ASCQ=0x0 
 3月 13 00:15:26 lily kernel: sd 8:0:0:0: [sdg] tag#2 CDB: opcode=0x8a 8a 00 00 00 00 00 c9 e0 8b 80 00 00 40 00 00 00
 3月 13 00:15:26 lily kernel: print_req_error: I/O error, dev sdg, sector 3386936192
 3月 13 00:15:26 lily kernel: btrfs_dev_stat_print_on_error: 6 callbacks suppressed
 3月 13 00:15:26 lily kernel: BTRFS error (device dm-6): bdev /dev/mapper/hymaster_1 errs: wr 45, rd 0, flush 0, corrupt 0, gen 0
 3月 13 00:15:26 lily kernel: BTRFS error (device dm-6): bdev /dev/mapper/hymaster_1 errs: wr 46, rd 0, flush 0, corrupt 0, gen 0
 3月 13 00:15:26 lily kernel: BTRFS error (device dm-6): bdev /dev/mapper/hymaster_1 errs: wr 47, rd 0, flush 0, corrupt 0, gen 0
 3月 13 00:15:26 lily kernel: BTRFS error (device dm-6): bdev /dev/mapper/hymaster_1 errs: wr 48, rd 0, flush 0, corrupt 0, gen 0
 3月 13 00:27:47 lily systemd[1]: Started Update locate database.
 3月 13 00:36:43 lily kernel: sd 8:0:0:0: [sdg] tag#22 UNKNOWN(0x2003) Result: hostbyte=0x00 driverbyte=0x08
 3月 13 00:36:43 lily kernel: sd 8:0:0:0: [sdg] tag#22 Sense Key : 0x2 [current] [descriptor] 
 3月 13 00:36:43 lily kernel: sd 8:0:0:0: [sdg] tag#22 ASC=0x8 ASCQ=0x0 
 3月 13 00:36:43 lily kernel: sd 8:0:0:0: [sdg] tag#22 CDB: opcode=0x8a 8a 00 00 00 00 00 ee 9e bb c0 00 00 40 00 00 00
 3月 13 00:36:43 lily kernel: print_req_error: I/O error, dev sdg, sector 4003380160
 3月 13 00:36:43 lily kernel: BTRFS error (device dm-6): bdev /dev/mapper/hymaster_1 errs: wr 49, rd 0, flush 0, corrupt 0, gen 0
 3月 13 00:36:43 lily kernel: BTRFS error (device dm-6): bdev /dev/mapper/hymaster_1 errs: wr 50, rd 0, flush 0, corrupt 0, gen 0
 3月 13 00:36:43 lily kernel: BTRFS error (device dm-6): bdev /dev/mapper/hymaster_1 errs: wr 51, rd 0, flush 0, corrupt 0, gen 0
 3月 13 00:36:43 lily kernel: BTRFS error (device dm-6): bdev /dev/mapper/hymaster_1 errs: wr 52, rd 0, flush 0, corrupt 0, gen 0
 3月 13 00:36:43 lily kernel: BTRFS error (device dm-6): bdev /dev/mapper/hymaster_1 errs: wr 53, rd 0, flush 0, corrupt 0, gen 0
 3月 13 00:36:43 lily kernel: BTRFS error (device dm-6): bdev /dev/mapper/hymaster_1 errs: wr 54, rd 0, flush 0, corrupt 0, gen 0
 3月 13 00:36:43 lily kernel: BTRFS error (device dm-6): bdev /dev/mapper/hymaster_1 errs: wr 55, rd 0, flush 0, corrupt 0, gen 0
 3月 13 00:36:43 lily kernel: sd 8:0:0:0: [sdg] tag#23 UNKNOWN(0x2003) Result: hostbyte=0x00 driverbyte=0x08
 3月 13 00:36:43 lily kernel: sd 8:0:0:0: [sdg] tag#23 Sense Key : 0x2 [current] [descriptor] 
 3月 13 00:36:43 lily kernel: sd 8:0:0:0: [sdg] tag#23 ASC=0x8 ASCQ=0x0 
 3月 13 00:36:43 lily kernel: sd 8:0:0:0: [sdg] tag#23 CDB: opcode=0x8a 8a 00 00 00 00 00 ee 9e fb c0 00 00 38 80 00 00
 3月 13 00:36:43 lily kernel: print_req_error: I/O error, dev sdg, sector 4003396544
 3月 13 00:36:43 lily kernel: BTRFS error (device dm-6): bdev /dev/mapper/hymaster_1 errs: wr 56, rd 0, flush 0, corrupt 0, gen 0
 3月 13 00:36:43 lily kernel: BTRFS error (device dm-6): bdev /dev/mapper/hymaster_1 errs: wr 57, rd 0, flush 0, corrupt 0, gen 0
 3月 13 00:36:43 lily kernel: BTRFS error (device dm-6): bdev /dev/mapper/hymaster_1 errs: wr 58, rd 0, flush 0, corrupt 0, gen 0
 3月 13 00:36:47 lily kernel: sd 8:0:0:0: [sdg] tag#1 UNKNOWN(0x2003) Result: hostbyte=0x00 driverbyte=0x08
 3月 13 00:36:47 lily kernel: sd 8:0:0:0: [sdg] tag#1 Sense Key : 0x2 [current] [descriptor] 
 3月 13 00:36:47 lily kernel: sd 8:0:0:0: [sdg] tag#1 ASC=0x8 ASCQ=0x0 
 3月 13 00:36:47 lily kernel: sd 8:0:0:0: [sdg] tag#1 CDB: opcode=0x8a 8a 00 00 00 00 00 cd 6f 0b 80 00 00 2a 20 00 00
 3月 13 00:36:47 lily kernel: print_req_error: I/O error, dev sdg, sector 3446606720
 3月 13 00:36:47 lily kernel: sd 8:0:0:0: [sdg] tag#2 UNKNOWN(0x2003) Result: hostbyte=0x00 driverbyte=0x08
 3月 13 00:36:47 lily kernel: sd 8:0:0:0: [sdg] tag#2 Sense Key : 0x2 [current] [descriptor] 
 3月 13 00:36:47 lily kernel: sd 8:0:0:0: [sdg] tag#2 ASC=0x8 ASCQ=0x0 
 3月 13 00:36:47 lily kernel: sd 8:0:0:0: [sdg] tag#2 CDB: opcode=0x8a 8a 00 00 00 00 00 cd 6f 35 c0 00 00 3f c0 00 00
 3月 13 00:36:47 lily kernel: print_req_error: I/O error, dev sdg, sector 3446617536
 3月 13 00:36:47 lily kernel: sd 8:0:0:0: [sdg] tag#3 UNKNOWN(0x2003) Result: hostbyte=0x00 driverbyte=0x08
3月 13 00:36:47 lily kernel: sd 8:0:0:0: [sdg] tag#1 Sense Key : 0x2 [current] [descriptor] 
 3月 13 00:36:47 lily kernel: sd 8:0:0:0: [sdg] tag#1 ASC=0x8 ASCQ=0x0 
 3月 13 00:36:47 lily kernel: sd 8:0:0:0: [sdg] tag#1 CDB: opcode=0x8a 8a 00 00 00 00 00 cd 6f 0b 80 00 00 2a 20 00 00
 3月 13 00:36:47 lily kernel: print_req_error: I/O error, dev sdg, sector 3446606720
 3月 13 00:36:47 lily kernel: sd 8:0:0:0: [sdg] tag#2 UNKNOWN(0x2003) Result: hostbyte=0x00 driverbyte=0x08
 3月 13 00:36:47 lily kernel: sd 8:0:0:0: [sdg] tag#2 Sense Key : 0x2 [current] [descriptor] 
 3月 13 00:36:47 lily kernel: sd 8:0:0:0: [sdg] tag#2 ASC=0x8 ASCQ=0x0 
 3月 13 00:36:47 lily kernel: sd 8:0:0:0: [sdg] tag#2 CDB: opcode=0x8a 8a 00 00 00 00 00 cd 6f 35 c0 00 00 3f c0 00 00
 3月 13 00:36:47 lily kernel: print_req_error: I/O error, dev sdg, sector 3446617536
 3月 13 00:36:47 lily kernel: sd 8:0:0:0: [sdg] tag#3 UNKNOWN(0x2003) Result: hostbyte=0x00 driverbyte=0x08
 3月 13 00:36:47 lily kernel: sd 8:0:0:0: [sdg] tag#3 Sense Key : 0x2 [current] [descriptor] 
 3月 13 00:36:47 lily kernel: sd 8:0:0:0: [sdg] tag#3 ASC=0x8 ASCQ=0x0 
 3月 13 00:36:47 lily kernel: sd 8:0:0:0: [sdg] tag#3 CDB: opcode=0x8a 8a 00 00 00 00 00 cd 6f 75 80 00 00 30 80 00 00
 3月 13 00:36:47 lily kernel: print_req_error: I/O error, dev sdg, sector 3446633856
 3月 13 00:36:47 lily kernel: sd 8:0:0:0: [sdg] tag#5 UNKNOWN(0x2003) Result: hostbyte=0x00 driverbyte=0x08
 3月 13 00:36:47 lily kernel: sd 8:0:0:0: [sdg] tag#5 Sense Key : 0x2 [current] [descriptor] 
 3月 13 00:36:47 lily kernel: sd 8:0:0:0: [sdg] tag#5 ASC=0x8 ASCQ=0x0 
 3月 13 00:36:47 lily kernel: sd 8:0:0:0: [sdg] tag#5 CDB: opcode=0x8a 8a 00 00 00 00 00 cd 7f 35 c0 00 00 3f c0 00 00
 3月 13 00:36:47 lily kernel: print_req_error: I/O error, dev sdg, sector 3447666112
 3月 13 00:36:47 lily kernel: sd 8:0:0:0: [sdg] tag#6 UNKNOWN(0x2003) Result: hostbyte=0x00 driverbyte=0x08
 3月 13 00:36:47 lily kernel: sd 8:0:0:0: [sdg] tag#6 Sense Key : 0x2 [current] [descriptor] 
 3月 13 00:36:47 lily kernel: sd 8:0:0:0: [sdg] tag#6 ASC=0x8 ASCQ=0x0 
 3月 13 00:36:47 lily kernel: sd 8:0:0:0: [sdg] tag#6 CDB: opcode=0x8a 8a 00 00 00 00 00 cd 7f 75 80 00 00 30 80 00 00
 3月 13 00:36:47 lily kernel: print_req_error: I/O error, dev sdg, sector 3447682432
 3月 13 00:36:47 lily kernel: BTRFS: error (device dm-6) in btrfs_commit_transaction:2253: errno=-5 IO failure (Error while writing out transaction)
 3月 13 00:36:47 lily kernel: BTRFS info (device dm-6): forced readonly
 3月 13 00:36:47 lily kernel: BTRFS warning (device dm-6): Skipping commit of aborted transaction.
 3月 13 00:36:47 lily kernel: BTRFS: error (device dm-6) in cleanup_transaction:1873: errno=-5 IO failure
 3月 13 00:36:47 lily kernel: BTRFS info (device dm-6): delayed_refs has NO entry
---


btrfs check output, 11th try (to be readonly):

---
checking extents
checking free space cache
checking fs roots
root 257 inode 503608 errors 100, file extent discount
Found file extent holes:
        start: 1610612736, len: 134217728
ERROR: errors found in fs roots
Checking filesystem on /dev/mapper/hymaster_1
UUID: ed789994-420e-4781-bab8-d395cfff27ed
found 1333560619008 bytes used, error(s) found
total csum bytes: 1299853404
total tree bytes: 2184232960
total fs tree bytes: 560693248
total extent tree bytes: 76267520
btree space waste bytes: 302431759
file data blocks allocated: 1331376386048
 referenced 1332755861504
---

Note: This system's structure is;
Computer (Linux 4.14/4.15) - btrfs (original) - dm-crypt plain - internal 4 disks
 \_ btrfs (destination) - dm-crypt plain - iSCSI (single) - NAS - Hardware RAID5 - 8 disks




> 
> 
> On 2018年03月13日 12:19, MASAKI haruka wrote:
> > *Now* I tried with Linux 4.14 and 4.15.
> > I experienced same probrem and reported in 2014 with Linux 3.9 and 3.10. (Perchance, actually the kernel was newer than 3.10, anyway I experienced same probrem with old 3.x kernel.)
> 
> Then kernel message please.
> 
> Especially for the readonly case.
> 
> And "btrfs check" output please.
> 
> Thanks,
> Qu
> 
> 
> > 
> >>
> >>
> >> On 2018年03月13日 05:57, MASAKI haruka wrote:
> >>> I'm trying to clone 18TiB data between btrfs,
> >>> but it will crash anyway.
> >>>
> >>> This probrem is occured even how to clone (btrfs send/receive, rsync or cp.)
> >>> I experienced same probrem in Linux 3.9 and Linux 3.10.
> >>
> >> Did you really mean *3*.9 and *3*.10?
> >>
> >> That's too old for btrfs usage IIRC.
> >>
> >> It would be *4*.9 or *4*.10 for a relative new kernel for btrfs.
> >>
> >> Would you please try some latest mainline kernel again?
> >>
> >>>
> >>> What happen:
> >>>
> >>> 1. Failed to write because I/O error (read only filesystem)
> >>> 2. writing to the btrfs succeeds and fails randomly.
> >>> 3. The btrfs unable to unmount (resource is busy.) Unable to umount even forcely, so cannot halt.
> >>>
> >>> Example:
> >>> ---
> >>> mkfile o7784-11-0
> >>> rename o7784-11-0 -> .filesystem/HDD/.XFV_pp/,fQO40jotqhUZ0/5JSSubx1Ph5xYNOcXhIAoIK3/XDGOWpbx,5zYWEi0L5LHdWBo/kVxX8RdGhryQiEMOm4II2qMw
> >>> utimes .filesystem/HDD/.XFV_pp/,fQO40jotqhUZ0/5JSSubx1Ph5xYNOcXhIAoIK3/XDGOWpbx,5zYWEi0L5LHdWBo
> >>> truncate .filesystem/HDD/.XFV_pp/,fQO40jotqhUZ0/5JSSubx1Ph5xYNOcXhIAoIK3/XDGOWpbx,5zYWEi0L5LHdWBo/kVxX8RdGhryQiEMOm4II2qMw size=1073698824
> >>> chown .filesystem/HDD/.XFV_pp/,fQO40jotqhUZ0/5JSSubx1Ph5xYNOcXhIAoIK3/XDGOWpbx,5zYWEi0L5LHdWBo/kVxX8RdGhryQiEMOm4II2qMw - uid=1000, gid=1000
> >>> chmod .filesystem/HDD/.XFV_pp/,fQO40jotqhUZ0/5JSSubx1Ph5xYNOcXhIAoIK3/XDGOWpbx,5zYWEi0L5LHdWBo/kVxX8RdGhryQiEMOm4II2qMw - mode=0600
> >>> utimes .filesystem/HDD/.XFV_pp/,fQO40jotqhUZ0/5JSSubx1Ph5xYNOcXhIAoIK3/XDGOWpbx,5zYWEi0L5LHdWBo/kVxX8RdGhryQiEMOm4II2qMw
> >>> mkfile o7785-12-0
> >>> rename o7785-12-0 -> .filesystem/HDD/.XFV_pp/,fQO40jotqhUZ0/5JSSubx1Ph5xYNOcXhIAoIK3/XDGOWpbx,5zYWEi0L5LHdWBo/lSABmfoArm9pAtade-gHmS6X
> >>> utimes .filesystem/HDD/.XFV_pp/,fQO40jotqhUZ0/5JSSubx1Ph5xYNOcXhIAoIK3/XDGOWpbx,5zYWEi0L5LHdWBo
> >>> truncate .filesystem/HDD/.XFV_pp/,fQO40jotqhUZ0/5JSSubx1Ph5xYNOcXhIAoIK3/XDGOWpbx,5zYWEi0L5LHdWBo/lSABmfoArm9pAtade-gHmS6X size=864067592
> >>> ERROR: truncate .filesystem/HDD/.XFV_pp/,fQO40jotqhUZ0/5JSSubx1Ph5xYNOcXhIAoIK3/XDGOWpbx,5zYWEi0L5LHdWBo/lSABmfoArm9pAtade-gHmS6X failed: Input/output error
> >>> btrfs send 180310235348  0.09s user 11.98s system 16% cpu 1:14.42 total
> >>> ---
> >>
> >> In that case, we need kernel message to investigate.
> >> (And of course, please use at least 4.x kernel)
> >>
> >> Thanks,
> >> Qu
> >>
> >>>
> >>> Tries:
> >>> 1.
> >>> Connect between host A (btrfs, 4disks) and B with socat (TCP).
> >>> Host B write to iSCSI disk (btrfs, single).
> >>> clone with btrfs send/receive. Linux 4.15.
> >>> -> Crashed at transfarred 1.78TB
> >>>
> >>> 2.
> >>> Delete snapshot and retry.
> >>> Connect between host A and B with SSH and socat (UNIX).
> >>> Host B write to iSCSI disk (btrfs, single).
> >>> clone with btrfs send/receive. Linux 4.15.
> >>> -> Crashed at transfarred 90GB
> >>>
> >>> 3.
> >>> Recreate btrfs.
> >>> Host A write to iSCSI disk.
> >>> clone with btrfs send/receive. Linux 4.15.
> >>> -> Crashed at transfarred 260GB
> >>>
> >>> 4.
> >>> Recreate btrfs.
> >>> Original disk attach to other computer (having more resource.)
> >>> clone with btrfs send/receive. Linux 4.15.
> >>> -> Crashed at transfarred 120GB
> >>>
> >>> 5.
> >>> Recreate btrfs.
> >>> Clone with rsync. Linux 4.15.
> >>> -> Crashed at transfarred 100GB
> >>>
> >>> 6.
> >>> Recreate btrfs.
> >>> Try with Linux 4.14, btrfs send/receive.
> >>> -> Crashed at transfarred 3.98TB
> >>>
> >>> 7.
> >>> Recreate btrfs.
> >>> Connect between host and NAS (iSCSI) with GbE cable directly.
> >>> Mounted with options relatime, spase_cache, compress=lzo.
> >>> clone with btrfs send/receive. Linux 4.14.
> >>> -> Crashed at transfarred 2.13TB
> >>>
> >>
> > 
> > 
> 


-- 
MASAKI haruka <yek@reasonset.net>

^ permalink raw reply	[flat|nested] 9+ messages in thread

* Re: Btrfs broken in massive transfar
  2018-03-13 19:25       ` MASAKI haruka
@ 2018-03-14  4:09         ` Chris Murphy
  2018-03-14  4:56           ` MASAKI haruka
  0 siblings, 1 reply; 9+ messages in thread
From: Chris Murphy @ 2018-03-14  4:09 UTC (permalink / raw)
  To: MASAKI haruka; +Cc: Qu Wenruo, Btrfs BTRFS

On Tue, Mar 13, 2018 at 1:25 PM, MASAKI haruka <yek@reasonset.net> wrote:
> journal(Kernel log), 7th try (to be readonly):
>
> ---
>  3月 12 16:25:51 lily kernel: BTRFS info (device dm-6): creating UUID tree
>  3月 12 16:25:53 lily iscsid[1406]: Connection-1:0 to [target: iqn.1994-11.com.netgear:eggplant-01:edc9adcf:btr1group, portal: 192.168.1.166,3260] through [iface: default] is shutdown.
>  3月 12 16:25:53 lily iscsid[1406]: IPC qtask write failed: Broken pipe
>  3月 12 16:26:18 lily kernel:  connection1:0: detected conn error (1020)
>  3月 12 16:26:19 lily iscsid[1406]: Kernel reported iSCSI connection 1:0 error (1020 - ISCSI_ERR_TCP_CONN_CLOSE: TCP connection closed) state (3)
>  3月 12 16:26:21 lily kernel: sd 8:0:0:0: [sdg] tag#5 UNKNOWN(0x2003) Result: hostbyte=0x00 driverbyte=0x08
>  3月 12 16:26:21 lily kernel: sd 8:0:0:0: [sdg] tag#5 Sense Key : 0x2 [current] [descriptor]
>  3月 12 16:26:21 lily kernel: sd 8:0:0:0: [sdg] tag#5 ASC=0x8 ASCQ=0x0
>  3月 12 16:26:21 lily kernel: sd 8:0:0:0: [sdg] tag#5 CDB: opcode=0x8a 8a 00 00 00 00 00 00 42 5c 00 00 00 34 00 00 00
>  3月 12 16:26:21 lily kernel: print_req_error: I/O error, dev sdg, sector 4348928


Looks like network problems. Is one of these Btrfs volumes on an iSCSI
device? Because there's a bunch of iSCSI errors followed by an I/O
error with sector LBA reported, and then you get a bunch of Btrfs
write errors.

What's the relationship between /dev/sdg and device (dm-6)
/dev/mapper/hymaster_1 ?


>
> Note: This system's structure is;
> Computer (Linux 4.14/4.15) - btrfs (original) - dm-crypt plain - internal 4 disks
>  \_ btrfs (destination) - dm-crypt plain - iSCSI (single) - NAS - Hardware RAID5 - 8 disks

dm-6 is what btrfs is directly using and is complaining about, and I
will guess that this is a dmcrypt device backed by /dev/sdg which is
iSCSI to the NAS. Correct? Looks like either network problems, or
possibly there is a real hardware problem with an error that's only
partly passing through iSCSI. I can't parse this:

 3月 13 00:36:47 lily kernel: sd 8:0:0:0: [sdg] tag#1 UNKNOWN(0x2003)
Result: hostbyte=0x00 driverbyte=0x08
 3月 13 00:36:47 lily kernel: sd 8:0:0:0: [sdg] tag#1 Sense Key : 0x2
[current] [descriptor]
 3月 13 00:36:47 lily kernel: sd 8:0:0:0: [sdg] tag#1 ASC=0x8 ASCQ=0x0
 3月 13 00:36:47 lily kernel: sd 8:0:0:0: [sdg] tag#1 CDB: opcode=0x8a
8a 00 00 00 00 00 cd 6f 0b 80 00 00 2a 20 00 00

Anyway, Btrfs detects the write failures, and is going read-only in
order to prevent corrupting the file system. So I think you've got
some iSCSI troubleshooting to do, and fix that. Doesn't seem like it's
a Btrfs specific problem to me.





-- 
Chris Murphy

^ permalink raw reply	[flat|nested] 9+ messages in thread

* Re: Btrfs broken in massive transfar
  2018-03-14  4:09         ` Chris Murphy
@ 2018-03-14  4:56           ` MASAKI haruka
  2018-03-14  7:33             ` MASAKI,  Aki  Yuhsuke
  2018-03-15 21:26             ` Chris Murphy
  0 siblings, 2 replies; 9+ messages in thread
From: MASAKI haruka @ 2018-03-14  4:56 UTC (permalink / raw)
  To: linux-btrfs

> On Tue, Mar 13, 2018 at 1:25 PM, MASAKI haruka <yek@reasonset.net> wrote:
> > journal(Kernel log), 7th try (to be readonly):
> >
> > ---
> >  3月 12 16:25:51 lily kernel: BTRFS info (device dm-6): creating UUID tree
> >  3月 12 16:25:53 lily iscsid[1406]: Connection-1:0 to [target: iqn.1994-11.com.netgear:eggplant-01:edc9adcf:btr1group, portal: 192.168.1.166,3260] through [iface: default] is shutdown.
> >  3月 12 16:25:53 lily iscsid[1406]: IPC qtask write failed: Broken pipe
> >  3月 12 16:26:18 lily kernel:  connection1:0: detected conn error (1020)
> >  3月 12 16:26:19 lily iscsid[1406]: Kernel reported iSCSI connection 1:0 error (1020 - ISCSI_ERR_TCP_CONN_CLOSE: TCP connection closed) state (3)
> >  3月 12 16:26:21 lily kernel: sd 8:0:0:0: [sdg] tag#5 UNKNOWN(0x2003) Result: hostbyte=0x00 driverbyte=0x08
> >  3月 12 16:26:21 lily kernel: sd 8:0:0:0: [sdg] tag#5 Sense Key : 0x2 [current] [descriptor]
> >  3月 12 16:26:21 lily kernel: sd 8:0:0:0: [sdg] tag#5 ASC=0x8 ASCQ=0x0
> >  3月 12 16:26:21 lily kernel: sd 8:0:0:0: [sdg] tag#5 CDB: opcode=0x8a 8a 00 00 00 00 00 00 42 5c 00 00 00 34 00 00 00
> >  3月 12 16:26:21 lily kernel: print_req_error: I/O error, dev sdg, sector 4348928
> 
> 
> Looks like network problems. Is one of these Btrfs volumes on an iSCSI
> device? Because there's a bunch of iSCSI errors followed by an I/O
> error with sector LBA reported, and then you get a bunch of Btrfs
> write errors.
> 
> What's the relationship between /dev/sdg and device (dm-6)
> /dev/mapper/hymaster_1 ?
> 


/dev/mapper/hymaster_1 is dm-crypt plain device.
Its real device is /dev/sdg it is an iSCSI disk
connected to NAS over GbE link local.

If this probrem from network, it's looked difficult to solve
because I tried with two different computers without any other network device...

> 
> >
> > Note: This system's structure is;
> > Computer (Linux 4.14/4.15) - btrfs (original) - dm-crypt plain - internal 4 disks
> >  \_ btrfs (destination) - dm-crypt plain - iSCSI (single) - NAS - Hardware RAID5 - 8 disks
> 
> dm-6 is what btrfs is directly using and is complaining about, and I
> will guess that this is a dmcrypt device backed by /dev/sdg which is
> iSCSI to the NAS. Correct? Looks like either network problems, or
> possibly there is a real hardware problem with an error that's only
> partly passing through iSCSI. I can't parse this:
> 
>  3月 13 00:36:47 lily kernel: sd 8:0:0:0: [sdg] tag#1 UNKNOWN(0x2003)
> Result: hostbyte=0x00 driverbyte=0x08
>  3月 13 00:36:47 lily kernel: sd 8:0:0:0: [sdg] tag#1 Sense Key : 0x2
> [current] [descriptor]
>  3月 13 00:36:47 lily kernel: sd 8:0:0:0: [sdg] tag#1 ASC=0x8 ASCQ=0x0
>  3月 13 00:36:47 lily kernel: sd 8:0:0:0: [sdg] tag#1 CDB: opcode=0x8a
> 8a 00 00 00 00 00 cd 6f 0b 80 00 00 2a 20 00 00
> 
> Anyway, Btrfs detects the write failures, and is going read-only in
> order to prevent corrupting the file system. So I think you've got
> some iSCSI troubleshooting to do, and fix that. Doesn't seem like it's
> a Btrfs specific problem to me.
> 

> and I will guess that this is a dmcrypt device backed by /dev/sdg which is
> iSCSI to the NAS. Correct?

Yes.

The log looks network (disk?) probrem me too, but I think it is unlikely
because I didn't used iSCSI in case of I experienced (when Linux 3.9.)
Altough then btrfs disks are little unstable, so it's guessable that
target device (disks, iSCSI or network) reason...

I didn't see iSCSI error without btrfs transfaring.
I thought if most people didn't see probrem like this,
maybe the reason is some difference... dm-crypt plain?

I'm trying to use encrypt function on NAS (LUKS?) instead of dm-crypt plain on iSCSI disk.

(I don't know how to find iSCSI probrem...)

Thank you.

> -- 
> Chris Murphy
> --
> To unsubscribe from this list: send the line "unsubscribe linux-btrfs" in
> the body of a message to majordomo@vger.kernel.org
> More majordomo info at  http://vger.kernel.org/majordomo-info.html


-- 
MASAKI haruka <yek@reasonset.net>

^ permalink raw reply	[flat|nested] 9+ messages in thread

* Re: Btrfs broken in massive transfar
  2018-03-14  4:56           ` MASAKI haruka
@ 2018-03-14  7:33             ` MASAKI,  Aki  Yuhsuke
  2018-03-15 21:26             ` Chris Murphy
  1 sibling, 0 replies; 9+ messages in thread
From: MASAKI,  Aki  Yuhsuke @ 2018-03-14  7:33 UTC (permalink / raw)
  To: MASAKI haruka; +Cc: linux-btrfs

Changing crypt layer didn't solve this probrem...


> > On Tue, Mar 13, 2018 at 1:25 PM, MASAKI haruka <yek@reasonset.net> wrote:
> > > journal(Kernel log), 7th try (to be readonly):
> > >
> > > ---
> > >  3月 12 16:25:51 lily kernel: BTRFS info (device dm-6): creating UUID tree
> > >  3月 12 16:25:53 lily iscsid[1406]: Connection-1:0 to [target: iqn.1994-11.com.netgear:eggplant-01:edc9adcf:btr1group, portal: 192.168.1.166,3260] through [iface: default] is shutdown.
> > >  3月 12 16:25:53 lily iscsid[1406]: IPC qtask write failed: Broken pipe
> > >  3月 12 16:26:18 lily kernel:  connection1:0: detected conn error (1020)
> > >  3月 12 16:26:19 lily iscsid[1406]: Kernel reported iSCSI connection 1:0 error (1020 - ISCSI_ERR_TCP_CONN_CLOSE: TCP connection closed) state (3)
> > >  3月 12 16:26:21 lily kernel: sd 8:0:0:0: [sdg] tag#5 UNKNOWN(0x2003) Result: hostbyte=0x00 driverbyte=0x08
> > >  3月 12 16:26:21 lily kernel: sd 8:0:0:0: [sdg] tag#5 Sense Key : 0x2 [current] [descriptor]
> > >  3月 12 16:26:21 lily kernel: sd 8:0:0:0: [sdg] tag#5 ASC=0x8 ASCQ=0x0
> > >  3月 12 16:26:21 lily kernel: sd 8:0:0:0: [sdg] tag#5 CDB: opcode=0x8a 8a 00 00 00 00 00 00 42 5c 00 00 00 34 00 00 00
> > >  3月 12 16:26:21 lily kernel: print_req_error: I/O error, dev sdg, sector 4348928
> > 
> > 
> > Looks like network problems. Is one of these Btrfs volumes on an iSCSI
> > device? Because there's a bunch of iSCSI errors followed by an I/O
> > error with sector LBA reported, and then you get a bunch of Btrfs
> > write errors.
> > 
> > What's the relationship between /dev/sdg and device (dm-6)
> > /dev/mapper/hymaster_1 ?
> > 
> 
> 
> /dev/mapper/hymaster_1 is dm-crypt plain device.
> Its real device is /dev/sdg it is an iSCSI disk
> connected to NAS over GbE link local.
> 
> If this probrem from network, it's looked difficult to solve
> because I tried with two different computers without any other network device...
> 
> > 
> > >
> > > Note: This system's structure is;
> > > Computer (Linux 4.14/4.15) - btrfs (original) - dm-crypt plain - internal 4 disks
> > >  \_ btrfs (destination) - dm-crypt plain - iSCSI (single) - NAS - Hardware RAID5 - 8 disks
> > 
> > dm-6 is what btrfs is directly using and is complaining about, and I
> > will guess that this is a dmcrypt device backed by /dev/sdg which is
> > iSCSI to the NAS. Correct? Looks like either network problems, or
> > possibly there is a real hardware problem with an error that's only
> > partly passing through iSCSI. I can't parse this:
> > 
> >  3月 13 00:36:47 lily kernel: sd 8:0:0:0: [sdg] tag#1 UNKNOWN(0x2003)
> > Result: hostbyte=0x00 driverbyte=0x08
> >  3月 13 00:36:47 lily kernel: sd 8:0:0:0: [sdg] tag#1 Sense Key : 0x2
> > [current] [descriptor]
> >  3月 13 00:36:47 lily kernel: sd 8:0:0:0: [sdg] tag#1 ASC=0x8 ASCQ=0x0
> >  3月 13 00:36:47 lily kernel: sd 8:0:0:0: [sdg] tag#1 CDB: opcode=0x8a
> > 8a 00 00 00 00 00 cd 6f 0b 80 00 00 2a 20 00 00
> > 
> > Anyway, Btrfs detects the write failures, and is going read-only in
> > order to prevent corrupting the file system. So I think you've got
> > some iSCSI troubleshooting to do, and fix that. Doesn't seem like it's
> > a Btrfs specific problem to me.
> > 
> 
> > and I will guess that this is a dmcrypt device backed by /dev/sdg which is
> > iSCSI to the NAS. Correct?
> 
> Yes.
> 
> The log looks network (disk?) probrem me too, but I think it is unlikely
> because I didn't used iSCSI in case of I experienced (when Linux 3.9.)
> Altough then btrfs disks are little unstable, so it's guessable that
> target device (disks, iSCSI or network) reason...
> 
> I didn't see iSCSI error without btrfs transfaring.
> I thought if most people didn't see probrem like this,
> maybe the reason is some difference... dm-crypt plain?
> 
> I'm trying to use encrypt function on NAS (LUKS?) instead of dm-crypt plain on iSCSI disk.
> 
> (I don't know how to find iSCSI probrem...)
> 
> Thank you.
> 
> > -- 
> > Chris Murphy
> > --
> > To unsubscribe from this list: send the line "unsubscribe linux-btrfs" in
> > the body of a message to majordomo@vger.kernel.org
> > More majordomo info at  http://vger.kernel.org/majordomo-info.html
> 
> 
> -- 
> MASAKI haruka <yek@reasonset.net>
> --
> To unsubscribe from this list: send the line "unsubscribe linux-btrfs" in
> the body of a message to majordomo@vger.kernel.org
> More majordomo info at  http://vger.kernel.org/majordomo-info.html


-- 
MASAKI, "Aki" Yuhsuke <aki@reasonset.net>

^ permalink raw reply	[flat|nested] 9+ messages in thread

* Re: Btrfs broken in massive transfar
  2018-03-14  4:56           ` MASAKI haruka
  2018-03-14  7:33             ` MASAKI,  Aki  Yuhsuke
@ 2018-03-15 21:26             ` Chris Murphy
  1 sibling, 0 replies; 9+ messages in thread
From: Chris Murphy @ 2018-03-15 21:26 UTC (permalink / raw)
  To: MASAKI haruka; +Cc: Btrfs BTRFS

On Tue, Mar 13, 2018 at 10:56 PM, MASAKI haruka <yek@reasonset.net> wrote:
>> On Tue, Mar 13, 2018 at 1:25 PM, MASAKI haruka <yek@reasonset.net> wrote:
>> > journal(Kernel log), 7th try (to be readonly):
>> >
>> > ---
>> >  3月 12 16:25:51 lily kernel: BTRFS info (device dm-6): creating UUID tree
>> >  3月 12 16:25:53 lily iscsid[1406]: Connection-1:0 to [target: iqn.1994-11.com.netgear:eggplant-01:edc9adcf:btr1group, portal: 192.168.1.166,3260] through [iface: default] is shutdown.
>> >  3月 12 16:25:53 lily iscsid[1406]: IPC qtask write failed: Broken pipe
>> >  3月 12 16:26:18 lily kernel:  connection1:0: detected conn error (1020)
>> >  3月 12 16:26:19 lily iscsid[1406]: Kernel reported iSCSI connection 1:0 error (1020 - ISCSI_ERR_TCP_CONN_CLOSE: TCP connection closed) state (3)
>> >  3月 12 16:26:21 lily kernel: sd 8:0:0:0: [sdg] tag#5 UNKNOWN(0x2003) Result: hostbyte=0x00 driverbyte=0x08
>> >  3月 12 16:26:21 lily kernel: sd 8:0:0:0: [sdg] tag#5 Sense Key : 0x2 [current] [descriptor]
>> >  3月 12 16:26:21 lily kernel: sd 8:0:0:0: [sdg] tag#5 ASC=0x8 ASCQ=0x0
>> >  3月 12 16:26:21 lily kernel: sd 8:0:0:0: [sdg] tag#5 CDB: opcode=0x8a 8a 00 00 00 00 00 00 42 5c 00 00 00 34 00 00 00
>> >  3月 12 16:26:21 lily kernel: print_req_error: I/O error, dev sdg, sector 4348928
>>
>>
>> Looks like network problems. Is one of these Btrfs volumes on an iSCSI
>> device? Because there's a bunch of iSCSI errors followed by an I/O
>> error with sector LBA reported, and then you get a bunch of Btrfs
>> write errors.
>>
>> What's the relationship between /dev/sdg and device (dm-6)
>> /dev/mapper/hymaster_1 ?
>>
>
>
> /dev/mapper/hymaster_1 is dm-crypt plain device.
> Its real device is /dev/sdg it is an iSCSI disk
> connected to NAS over GbE link local.
>
> If this probrem from network, it's looked difficult to solve
> because I tried with two different computers without any other network device...

There's a lot of complexity here. It might be drives on the NAS, it
might networking hardware on the NAS, it could be networking hardware
like switch or cables. It all has to be narrowed down. You'll need
logs for the NAS and try to go from there.

Maybe http://vger.kernel.org/vger-lists.html#linux-scsi is a place to
start to see if these are drive or iSCSI specific problems, or network
problems and then just process of elimination.

But based on available information I don't think this is Btrfs. It's
complaining about a preexisting problem.


-- 
Chris Murphy

^ permalink raw reply	[flat|nested] 9+ messages in thread

end of thread, other threads:[~2018-03-15 21:26 UTC | newest]

Thread overview: 9+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2018-03-12 21:57 Btrfs broken in massive transfar MASAKI haruka
2018-03-13  2:27 ` Qu Wenruo
2018-03-13  4:19   ` MASAKI haruka
2018-03-13  5:04     ` Qu Wenruo
2018-03-13 19:25       ` MASAKI haruka
2018-03-14  4:09         ` Chris Murphy
2018-03-14  4:56           ` MASAKI haruka
2018-03-14  7:33             ` MASAKI,  Aki  Yuhsuke
2018-03-15 21:26             ` Chris Murphy

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).