All of lore.kernel.org
 help / color / mirror / Atom feed
* IO errors when building RAID1.... ?
@ 2018-08-31 16:35 Pierre Couderc
  2018-08-31 18:52 ` Chris Murphy
  0 siblings, 1 reply; 10+ messages in thread
From: Pierre Couderc @ 2018-08-31 16:35 UTC (permalink / raw)
  To: Btrfs BTRFS

When trying to build a RAID1 on main fs. After  normal debian stretch 
install :

<code>

root@server:/home/nous# btrfs device add /dev/sdb1 /
root@server:/home/nous# btrfs fi show
Label: none  uuid: ef0b9dad-c0eb-4a3b-9b41-e5e249363abc
         Total devices 2 FS bytes used 824.60MiB
         devid    1 size 1.82TiB used 3.02GiB path /dev/sda1
         devid    2 size 1.82TiB used 0.00B path /dev/sdb1

root@server:/home/nous# btrfs balance start -v -mconvert=raid1 
-dconvert=raid1 /
Dumping filters: flags 0x7, state 0x0, force is off
   DATA (flags 0x100): converting, target=16, soft is off
   METADATA (flags 0x100): converting, target=16, soft is off
   SYSTEM (flags 0x100): converting, target=16, soft is off
Killed
root@server:/home/nous# btrfs fi show
Label: none  uuid: ef0b9dad-c0eb-4a3b-9b41-e5e249363abc
         Total devices 2 FS bytes used 1.29GiB
         devid    2 size 1.82TiB used 1.00GiB path /dev/sdb1
         *** Some devices missing
</code>

Some IO errors on /dev/sda are found in journalctl (see them below)

I cannot believe that /dev/sda has no hard disk errors when installing 
without problems, but has many ones when I "btrfs device add /dev/sdb1 /".

I can reproduce the problem : reinstall (3times...) and try "btrfs 
device add /dev/sdb1 /" with the same results...


<journalctl>

Aug 31 17:34:55 server su[559]: Successful su for root by nous
Aug 31 17:34:55 server su[559]: + /dev/pts/1 nous:root
Aug 31 17:34:55 server su[559]: pam_unix(su:session): session opened for 
user root by nous(uid=1000)
Aug 31 17:34:55 server su[559]: pam_systemd(su:session): Cannot create 
session: Already running in a session
Aug 31 17:35:03 server kernel: BTRFS info (device sda1): disk added 
/dev/sdb1
Aug 31 17:35:40 server kernel: BTRFS info (device sda1): relocating 
block group 1103101952 flags 1
Aug 31 17:36:12 server sshd[572]: Accepted password for nous from 
2a01:e34:eeaf:c5f0:e54:15ff:feb1:b1c9 port 49308 ssh2
Aug 31 17:36:12 server sshd[572]: pam_unix(sshd:session): session opened 
for user nous by (uid=0)
Aug 31 17:36:12 server systemd-logind[415]: New session 4 of user nous.
Aug 31 17:36:12 server systemd[1]: Started Session 4 of user nous.
Aug 31 17:36:16 server kernel: ata1: lost interrupt (Status 0x50)
Aug 31 17:36:16 server kernel: ata1.00: exception Emask 0x50 SAct 0x0 
SErr 0x40d0802 action 0xe frozen
Aug 31 17:36:16 server kernel: ata1.00: SError: { RecovComm HostInt 
PHYRdyChg CommWake 10B8B DevExch }
Aug 31 17:36:16 server kernel: ata1.00: failed command: READ DMA
Aug 31 17:36:16 server kernel: ata1.00: cmd 
c8/00:60:00:cd:02/00:00:00:00:00/e0 tag 0 dma 49152 in
                                         res 
40/00:01:00:00:00/00:00:00:00:00/40 Emask 0x54 (ATA bus error)
Aug 31 17:36:16 server kernel: ata1.00: status: { DRDY }
Aug 31 17:36:16 server kernel: ata1.00: hard resetting link
Aug 31 17:36:17 server kernel: ata1.01: hard resetting link
Aug 31 17:36:18 server kernel: ata1.01: failed to resume link (SControl 0)
Aug 31 17:36:18 server kernel: ata1.00: SATA link up 6.0 Gbps (SStatus 
133 SControl 300)
Aug 31 17:36:18 server kernel: ata1.01: SATA link down (SStatus 4 
SControl 0)
Aug 31 17:36:18 server kernel: ata1.00: NODEV after polling detection
Aug 31 17:36:18 server kernel: ata1.00: revalidation failed (errno=-2)
Aug 31 17:36:20 server su[590]: Successful su for root by nous
Aug 31 17:36:20 server su[590]: + /dev/pts/2 nous:root
Aug 31 17:36:20 server su[590]: pam_unix(su:session): session opened for 
user root by nous(uid=1000)
Aug 31 17:36:20 server su[590]: pam_systemd(su:session): Cannot create 
session: Already running in a session
Aug 31 17:36:23 server kernel: ata1.00: hard resetting link
Aug 31 17:36:23 server kernel: ata1.01: hard resetting link
Aug 31 17:36:24 server kernel: ata1.01: failed to resume link (SControl 0)
Aug 31 17:36:25 server kernel: ata1.00: SATA link up 6.0 Gbps (SStatus 
133 SControl 300)
Aug 31 17:36:25 server kernel: ata1.01: SATA link down (SStatus 4 
SControl 0)
Aug 31 17:36:25 server kernel: ata1.00: NODEV after polling detection
Aug 31 17:36:25 server kernel: ata1.00: revalidation failed (errno=-2)
Aug 31 17:36:30 server kernel: ata1.00: hard resetting link
Aug 31 17:36:30 server kernel: ata1.01: hard resetting link
Aug 31 17:36:31 server kernel: ata1.01: failed to resume link (SControl 0)
Aug 31 17:36:31 server kernel: ata1.00: SATA link up 6.0 Gbps (SStatus 
133 SControl 300)
Aug 31 17:36:31 server kernel: ata1.01: SATA link down (SStatus 4 
SControl 0)
Aug 31 17:36:31 server kernel: ata1.00: NODEV after polling detection
Aug 31 17:36:31 server kernel: ata1.00: revalidation failed (errno=-2)
Aug 31 17:36:31 server kernel: ata1.00: disabled
Aug 31 17:36:36 server kernel: ata1.00: hard resetting link
Aug 31 17:36:37 server kernel: ata1.01: hard resetting link
Aug 31 17:36:38 server kernel: ata1.01: failed to resume link (SControl 0)
Aug 31 17:36:38 server kernel: ata1.00: SATA link up 6.0 Gbps (SStatus 
133 SControl 300)
Aug 31 17:36:38 server kernel: ata1.01: SATA link down (SStatus 4 
SControl 0)
Aug 31 17:36:38 server kernel: ata1.00: NODEV after polling detection
Aug 31 17:36:38 server kernel: sd 0:0:0:0: [sda] tag#0 FAILED Result: 
hostbyte=DID_OK driverbyte=DRIVER_SENSE
Aug 31 17:36:38 server kernel: sd 0:0:0:0: [sda] tag#0 Sense Key : 
Illegal Request [current]
Aug 31 17:36:38 server kernel: sd 0:0:0:0: [sda] tag#0 Add. Sense: 
Unaligned write command
Aug 31 17:36:38 server kernel: sd 0:0:0:0: [sda] tag#0 CDB: Read(10) 28 
00 00 02 cd 00 00 00 60 00
Aug 31 17:36:38 server kernel: blk_update_request: I/O error, dev sda, 
sector 183552
Aug 31 17:36:38 server kernel: BTRFS error (device sda1): bdev /dev/sda1 
errs: wr 0, rd 1, flush 0, corrupt 0, gen 0
Aug 31 17:36:38 server kernel: BTRFS error (device sda1): bdev /dev/sda1 
errs: wr 0, rd 2, flush 0, corrupt 0, gen 0
Aug 31 17:36:38 server kernel: BTRFS error (device sda1): bdev /dev/sda1 
errs: wr 0, rd 3, flush 0, corrupt 0, gen 0
Aug 31 17:36:38 server kernel: sd 0:0:0:0: rejecting I/O to offline device
Aug 31 17:36:38 server kernel: sd 0:0:0:0: [sda] killing request
Aug 31 17:36:38 server kernel: sd 0:0:0:0: rejecting I/O to offline device
Aug 31 17:36:38 server kernel: BTRFS error (device sda1): bdev /dev/sda1 
errs: wr 1, rd 3, flush 0, corrupt 0, gen 0
Aug 31 17:36:38 server kernel: BTRFS error (device sda1): bdev /dev/sda1 
errs: wr 2, rd 3, flush 0, corrupt 0, gen 0
Aug 31 17:36:38 server kernel: BTRFS error (device sda1): bdev /dev/sda1 
errs: wr 3, rd 3, flush 0, corrupt 0, gen 0
Aug 31 17:36:38 server kernel: BTRFS error (device sda1): bdev /dev/sda1 
errs: wr 4, rd 3, flush 0, corrupt 0, gen 0
Aug 31 17:36:38 server kernel: BTRFS error (device sda1): bdev /dev/sda1 
errs: wr 5, rd 3, flush 0, corrupt 0, gen 0
Aug 31 17:36:38 server kernel: BTRFS error (device sda1): bdev /dev/sda1 
errs: wr 6, rd 3, flush 0, corrupt 0, gen 0
Aug 31 17:36:38 server kernel: BTRFS error (device sda1): bdev /dev/sda1 
errs: wr 7, rd 3, flush 0, corrupt 0, gen 0
Aug 31 17:36:38 server kernel: sd 0:0:0:0: [sda] FAILED Result: 
hostbyte=DID_NO_CONNECT driverbyte=DRIVER_OK
Aug 31 17:36:38 server kernel: sd 0:0:0:0: [sda] CDB: Write(10) 2a 00 00 
61 9c 00 00 0a 00 00
Aug 31 17:36:38 server kernel: blk_update_request: I/O error, dev sda, 
sector 6396928
Aug 31 17:36:38 server kernel: sd 0:0:0:0: rejecting I/O to offline device
Aug 31 17:36:38 server kernel: sd 0:0:0:0: rejecting I/O to offline device

more than 100 identical lines...

Aug 31 17:36:38 server kernel: sd 0:0:0:0: rejecting I/O to offline device
Aug 31 17:36:38 server kernel: ata1: EH complete
Aug 31 17:36:38 server kernel: ata1.00: detaching (SCSI 0:0:0:0)
Aug 31 17:36:38 server kernel: sd 0:0:0:0: [sda] Synchronizing SCSI cache
Aug 31 17:36:38 server kernel: sd 0:0:0:0: [sda] Synchronize Cache(10) 
failed: Result: hostbyte=DID_BAD_TARGET driverbyte=DRIVER_OK
Aug 31 17:36:38 server kernel: sd 0:0:0:0: [sda] Stopping disk
Aug 31 17:36:38 server kernel: sd 0:0:0:0: [sda] Start/Stop Unit failed: 
Result: hostbyte=DID_BAD_TARGET driverbyte=DRIVER_OK
Aug 31 17:36:38 server kernel: Buffer I/O error on dev sda1, logical 
block 488378352, async page read
Aug 31 17:36:38 server kernel: scsi 0:0:0:0: rejecting I/O to dead device
Aug 31 17:36:38 server kernel: blk_update_request: I/O error, dev sda, 
sector 6762624
Aug 31 17:36:38 server kernel: BTRFS: error (device sda1) in 
btrfs_commit_transaction:2227: errno=-5 IO failure (Error while writing 
out transaction)
Aug 31 17:36:38 server kernel: BTRFS info (device sda1): forced readonly
Aug 31 17:36:38 server kernel: BTRFS warning (device sda1): Skipping 
commit of aborted transaction.
Aug 31 17:36:38 server kernel: ------------[ cut here ]------------
Aug 31 17:36:38 server kernel: WARNING: CPU: 1 PID: 159 at 
/build/linux-cRtIym/linux-4.9.30/fs/btrfs/transaction.c:1850 
cleanup_transaction+0x1f0/0x2e0 [btrfs]
Aug 31 17:36:38 server kernel: BTRFS: Transaction aborted (error -5)
Aug 31 17:36:38 server kernel: Modules linked in: intel_rapl 
x86_pkg_temp_thermal intel_powerclamp coretemp kvm irqbypass eeepc_wmi 
asus_wmi crct10dif_pclmul sparse_keymap crc32_pclmul g

^ permalink raw reply	[flat|nested] 10+ messages in thread

end of thread, other threads:[~2018-09-03 23:57 UTC | newest]

Thread overview: 10+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2018-08-31 16:35 IO errors when building RAID1.... ? Pierre Couderc
2018-08-31 18:52 ` Chris Murphy
2018-08-31 19:02   ` Chris Murphy
2018-09-01  1:35     ` Duncan
2018-09-01 15:46       ` Pierre Couderc
2018-09-01  7:03   ` Pierre Couderc
2018-09-03  3:15     ` Chris Murphy
2018-09-03  6:21       ` Pierre Couderc
2018-09-03 10:23       ` Adam Borowski
2018-09-03 19:35         ` Chris Murphy

This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.