[4.8 hang] xfstests generic/361 hangs on dax enabled filesystems

All of lore.kernel.org
 help / color / mirror / Atom feed

* [4.8 hang] xfstests generic/361 hangs on dax enabled filesystems
@ 2016-08-03  0:33 ` Dave Chinner
  0 siblings, 0 replies; 25+ messages in thread
From: Dave Chinner @ 2016-08-03  0:33 UTC (permalink / raw)
  To: xfs; +Cc: linux-fsdevel, linux-nvdimm

Hi folks,

Just hit a reproducable hang in generic/361. Essentially this on
a 8GB pmem device:

mkfs.xfs -f /dev/pmem1
mount -o dax /dev/pmem1 /mnt/scratch
xfs_io -f -c "truncate 1g" test.img
losetup -f --show /mnt/scratch/test.img
mkfs.xfs -f /dev/loop0

And the mkfs.xfs command hangs with a discard that never completes:

[  243.413918] INFO: task mkfs.xfs:5708 blocked for more than 120 seconds.
[  243.415678]       Not tainted 4.7.0-dgc+ #862
[  243.416772] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
[  243.418769] mkfs.xfs        D ffff880835143c18 13848  5708   5441 0x00000000
[  243.420620]  ffff880835143c18 ffff880835143c20 ffff88083a244780 ffff8808358ba3c0
[  243.422636]  ffff88023aa20000 ffff880835144000 7fffffffffffffff 7fffffffffffffff
[  243.424586]  ffff8808358ba3c0 00000000024000c0 ffff880835143c30 ffffffff81e5e38c
[  243.426466] Call Trace:
[  243.427050]  [<ffffffff81e5e38c>] schedule+0x3c/0x90
[  243.428224]  [<ffffffff81e62be5>] schedule_timeout+0x265/0x330
[  243.429563]  [<ffffffff8109f125>] ? kvm_clock_read+0x25/0x40
[  243.430896]  [<ffffffff8109f149>] ? kvm_clock_get_cycles+0x9/0x10
[  243.432360]  [<ffffffff81125edc>] ? ktime_get+0x3c/0xb0
[  243.433556]  [<ffffffff81e5db54>] io_schedule_timeout+0xa4/0x110
[  243.434932]  [<ffffffff81e5eed6>] wait_for_completion_io+0xd6/0x110
[  243.436297]  [<ffffffff810decd0>] ? wake_up_q+0x70/0x70
[  243.437436]  [<ffffffff817d6f06>] submit_bio_wait+0x56/0x70
[  243.438671]  [<ffffffff817e851a>] blkdev_issue_discard+0x6a/0xb0
[  243.439980]  [<ffffffff810dab69>] ? __might_sleep+0x49/0x80
[  243.441182]  [<ffffffff817eea87>] blk_ioctl_discard+0x97/0xb0
[  243.442370]  [<ffffffff817ef7bb>] blkdev_ioctl+0x7eb/0x9a0
[  243.443485]  [<ffffffff81236a1d>] block_ioctl+0x3d/0x50
[  243.444552]  [<ffffffff812100df>] do_vfs_ioctl+0x8f/0x670
[  243.445630]  [<ffffffff81002434>] ? exit_to_usermode_loop+0x94/0xb0
[  243.446902]  [<ffffffff81210739>] SyS_ioctl+0x79/0x90
[  243.447927]  [<ffffffff81002bc5>] ? syscall_return_slowpath+0xf5/0x190
[  243.449236]  [<ffffffff81e63d32>] entry_SYSCALL_64_fastpath+0x1a/0xa4

This only reproduces when the underlying filesystem is mounted with
-o dax, so there is a bad interaction with loop devices and DAX
occurring somewhere. generic/361 is a recent test (committed june 14)
so this probably hasn't actually been tested until now.

I haven't got time to look at this right now, hence the report.

Cheers,

Dave.
-- 
Dave Chinner
david@fromorbit.com
_______________________________________________
Linux-nvdimm mailing list
Linux-nvdimm@lists.01.org
https://lists.01.org/mailman/listinfo/linux-nvdimm

^ permalink raw reply	[flat|nested] 25+ messages in thread

* [4.8 hang] xfstests generic/361 hangs on dax enabled filesystems
@ 2016-08-03  0:33 ` Dave Chinner
  0 siblings, 0 replies; 25+ messages in thread
From: Dave Chinner @ 2016-08-03  0:33 UTC (permalink / raw)
  To: xfs; +Cc: linux-fsdevel, linux-nvdimm

Hi folks,

Just hit a reproducable hang in generic/361. Essentially this on
a 8GB pmem device:

mkfs.xfs -f /dev/pmem1
mount -o dax /dev/pmem1 /mnt/scratch
xfs_io -f -c "truncate 1g" test.img
losetup -f --show /mnt/scratch/test.img
mkfs.xfs -f /dev/loop0

And the mkfs.xfs command hangs with a discard that never completes:

[  243.413918] INFO: task mkfs.xfs:5708 blocked for more than 120 seconds.
[  243.415678]       Not tainted 4.7.0-dgc+ #862
[  243.416772] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
[  243.418769] mkfs.xfs        D ffff880835143c18 13848  5708   5441 0x00000000
[  243.420620]  ffff880835143c18 ffff880835143c20 ffff88083a244780 ffff8808358ba3c0
[  243.422636]  ffff88023aa20000 ffff880835144000 7fffffffffffffff 7fffffffffffffff
[  243.424586]  ffff8808358ba3c0 00000000024000c0 ffff880835143c30 ffffffff81e5e38c
[  243.426466] Call Trace:
[  243.427050]  [<ffffffff81e5e38c>] schedule+0x3c/0x90
[  243.428224]  [<ffffffff81e62be5>] schedule_timeout+0x265/0x330
[  243.429563]  [<ffffffff8109f125>] ? kvm_clock_read+0x25/0x40
[  243.430896]  [<ffffffff8109f149>] ? kvm_clock_get_cycles+0x9/0x10
[  243.432360]  [<ffffffff81125edc>] ? ktime_get+0x3c/0xb0
[  243.433556]  [<ffffffff81e5db54>] io_schedule_timeout+0xa4/0x110
[  243.434932]  [<ffffffff81e5eed6>] wait_for_completion_io+0xd6/0x110
[  243.436297]  [<ffffffff810decd0>] ? wake_up_q+0x70/0x70
[  243.437436]  [<ffffffff817d6f06>] submit_bio_wait+0x56/0x70
[  243.438671]  [<ffffffff817e851a>] blkdev_issue_discard+0x6a/0xb0
[  243.439980]  [<ffffffff810dab69>] ? __might_sleep+0x49/0x80
[  243.441182]  [<ffffffff817eea87>] blk_ioctl_discard+0x97/0xb0
[  243.442370]  [<ffffffff817ef7bb>] blkdev_ioctl+0x7eb/0x9a0
[  243.443485]  [<ffffffff81236a1d>] block_ioctl+0x3d/0x50
[  243.444552]  [<ffffffff812100df>] do_vfs_ioctl+0x8f/0x670
[  243.445630]  [<ffffffff81002434>] ? exit_to_usermode_loop+0x94/0xb0
[  243.446902]  [<ffffffff81210739>] SyS_ioctl+0x79/0x90
[  243.447927]  [<ffffffff81002bc5>] ? syscall_return_slowpath+0xf5/0x190
[  243.449236]  [<ffffffff81e63d32>] entry_SYSCALL_64_fastpath+0x1a/0xa4

This only reproduces when the underlying filesystem is mounted with
-o dax, so there is a bad interaction with loop devices and DAX
occurring somewhere. generic/361 is a recent test (committed june 14)
so this probably hasn't actually been tested until now.

I haven't got time to look at this right now, hence the report.

Cheers,

Dave.
-- 
Dave Chinner
david@fromorbit.com

^ permalink raw reply	[flat|nested] 25+ messages in thread

* [4.8 hang] xfstests generic/361 hangs on dax enabled filesystems
@ 2016-08-03  0:33 ` Dave Chinner
  0 siblings, 0 replies; 25+ messages in thread
From: Dave Chinner @ 2016-08-03  0:33 UTC (permalink / raw)
  To: xfs; +Cc: linux-fsdevel, linux-nvdimm

Hi folks,

Just hit a reproducable hang in generic/361. Essentially this on
a 8GB pmem device:

mkfs.xfs -f /dev/pmem1
mount -o dax /dev/pmem1 /mnt/scratch
xfs_io -f -c "truncate 1g" test.img
losetup -f --show /mnt/scratch/test.img
mkfs.xfs -f /dev/loop0

And the mkfs.xfs command hangs with a discard that never completes:

[  243.413918] INFO: task mkfs.xfs:5708 blocked for more than 120 seconds.
[  243.415678]       Not tainted 4.7.0-dgc+ #862
[  243.416772] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
[  243.418769] mkfs.xfs        D ffff880835143c18 13848  5708   5441 0x00000000
[  243.420620]  ffff880835143c18 ffff880835143c20 ffff88083a244780 ffff8808358ba3c0
[  243.422636]  ffff88023aa20000 ffff880835144000 7fffffffffffffff 7fffffffffffffff
[  243.424586]  ffff8808358ba3c0 00000000024000c0 ffff880835143c30 ffffffff81e5e38c
[  243.426466] Call Trace:
[  243.427050]  [<ffffffff81e5e38c>] schedule+0x3c/0x90
[  243.428224]  [<ffffffff81e62be5>] schedule_timeout+0x265/0x330
[  243.429563]  [<ffffffff8109f125>] ? kvm_clock_read+0x25/0x40
[  243.430896]  [<ffffffff8109f149>] ? kvm_clock_get_cycles+0x9/0x10
[  243.432360]  [<ffffffff81125edc>] ? ktime_get+0x3c/0xb0
[  243.433556]  [<ffffffff81e5db54>] io_schedule_timeout+0xa4/0x110
[  243.434932]  [<ffffffff81e5eed6>] wait_for_completion_io+0xd6/0x110
[  243.436297]  [<ffffffff810decd0>] ? wake_up_q+0x70/0x70
[  243.437436]  [<ffffffff817d6f06>] submit_bio_wait+0x56/0x70
[  243.438671]  [<ffffffff817e851a>] blkdev_issue_discard+0x6a/0xb0
[  243.439980]  [<ffffffff810dab69>] ? __might_sleep+0x49/0x80
[  243.441182]  [<ffffffff817eea87>] blk_ioctl_discard+0x97/0xb0
[  243.442370]  [<ffffffff817ef7bb>] blkdev_ioctl+0x7eb/0x9a0
[  243.443485]  [<ffffffff81236a1d>] block_ioctl+0x3d/0x50
[  243.444552]  [<ffffffff812100df>] do_vfs_ioctl+0x8f/0x670
[  243.445630]  [<ffffffff81002434>] ? exit_to_usermode_loop+0x94/0xb0
[  243.446902]  [<ffffffff81210739>] SyS_ioctl+0x79/0x90
[  243.447927]  [<ffffffff81002bc5>] ? syscall_return_slowpath+0xf5/0x190
[  243.449236]  [<ffffffff81e63d32>] entry_SYSCALL_64_fastpath+0x1a/0xa4

This only reproduces when the underlying filesystem is mounted with
-o dax, so there is a bad interaction with loop devices and DAX
occurring somewhere. generic/361 is a recent test (committed june 14)
so this probably hasn't actually been tested until now.

I haven't got time to look at this right now, hence the report.

Cheers,

Dave.
-- 
Dave Chinner
david@fromorbit.com

_______________________________________________
xfs mailing list
xfs@oss.sgi.com
http://oss.sgi.com/mailman/listinfo/xfs

^ permalink raw reply	[flat|nested] 25+ messages in thread

* Re: [4.8 hang] xfstests generic/361 hangs on dax enabled filesystems
  2016-08-03  0:33 ` Dave Chinner
  (?)
@ 2016-08-03  0:59   ` Dan Williams
  -1 siblings, 0 replies; 25+ messages in thread
From: Dan Williams @ 2016-08-03  0:59 UTC (permalink / raw)
  To: Dave Chinner; +Cc: linux-fsdevel, linux-nvdimm, XFS Developers

On Tue, Aug 2, 2016 at 5:33 PM, Dave Chinner <david@fromorbit.com> wrote:
> Hi folks,
>
> Just hit a reproducable hang in generic/361. Essentially this on
> a 8GB pmem device:
>
> mkfs.xfs -f /dev/pmem1
> mount -o dax /dev/pmem1 /mnt/scratch
> xfs_io -f -c "truncate 1g" test.img
> losetup -f --show /mnt/scratch/test.img
> mkfs.xfs -f /dev/loop0
>
> And the mkfs.xfs command hangs with a discard that never completes:
>
> [  243.413918] INFO: task mkfs.xfs:5708 blocked for more than 120 seconds.
> [  243.415678]       Not tainted 4.7.0-dgc+ #862
> [  243.416772] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
> [  243.418769] mkfs.xfs        D ffff880835143c18 13848  5708   5441 0x00000000
> [  243.420620]  ffff880835143c18 ffff880835143c20 ffff88083a244780 ffff8808358ba3c0
> [  243.422636]  ffff88023aa20000 ffff880835144000 7fffffffffffffff 7fffffffffffffff
> [  243.424586]  ffff8808358ba3c0 00000000024000c0 ffff880835143c30 ffffffff81e5e38c
> [  243.426466] Call Trace:
> [  243.427050]  [<ffffffff81e5e38c>] schedule+0x3c/0x90
> [  243.428224]  [<ffffffff81e62be5>] schedule_timeout+0x265/0x330
> [  243.429563]  [<ffffffff8109f125>] ? kvm_clock_read+0x25/0x40
> [  243.430896]  [<ffffffff8109f149>] ? kvm_clock_get_cycles+0x9/0x10
> [  243.432360]  [<ffffffff81125edc>] ? ktime_get+0x3c/0xb0
> [  243.433556]  [<ffffffff81e5db54>] io_schedule_timeout+0xa4/0x110
> [  243.434932]  [<ffffffff81e5eed6>] wait_for_completion_io+0xd6/0x110
> [  243.436297]  [<ffffffff810decd0>] ? wake_up_q+0x70/0x70
> [  243.437436]  [<ffffffff817d6f06>] submit_bio_wait+0x56/0x70
> [  243.438671]  [<ffffffff817e851a>] blkdev_issue_discard+0x6a/0xb0
> [  243.439980]  [<ffffffff810dab69>] ? __might_sleep+0x49/0x80
> [  243.441182]  [<ffffffff817eea87>] blk_ioctl_discard+0x97/0xb0
> [  243.442370]  [<ffffffff817ef7bb>] blkdev_ioctl+0x7eb/0x9a0
> [  243.443485]  [<ffffffff81236a1d>] block_ioctl+0x3d/0x50
> [  243.444552]  [<ffffffff812100df>] do_vfs_ioctl+0x8f/0x670
> [  243.445630]  [<ffffffff81002434>] ? exit_to_usermode_loop+0x94/0xb0
> [  243.446902]  [<ffffffff81210739>] SyS_ioctl+0x79/0x90
> [  243.447927]  [<ffffffff81002bc5>] ? syscall_return_slowpath+0xf5/0x190
> [  243.449236]  [<ffffffff81e63d32>] entry_SYSCALL_64_fastpath+0x1a/0xa4
>
> This only reproduces when the underlying filesystem is mounted with
> -o dax, so there is a bad interaction with loop devices and DAX
> occurring somewhere. generic/361 is a recent test (committed june 14)
> so this probably hasn't actually been tested until now.
>
> I haven't got time to look at this right now, hence the report.
>

Thanks Dave, we'll take a look.
_______________________________________________
Linux-nvdimm mailing list
Linux-nvdimm@lists.01.org
https://lists.01.org/mailman/listinfo/linux-nvdimm

^ permalink raw reply	[flat|nested] 25+ messages in thread

* Re: [4.8 hang] xfstests generic/361 hangs on dax enabled filesystems
@ 2016-08-03  0:59   ` Dan Williams
  0 siblings, 0 replies; 25+ messages in thread
From: Dan Williams @ 2016-08-03  0:59 UTC (permalink / raw)
  To: Dave Chinner; +Cc: XFS Developers, linux-fsdevel, linux-nvdimm

On Tue, Aug 2, 2016 at 5:33 PM, Dave Chinner <david@fromorbit.com> wrote:
> Hi folks,
>
> Just hit a reproducable hang in generic/361. Essentially this on
> a 8GB pmem device:
>
> mkfs.xfs -f /dev/pmem1
> mount -o dax /dev/pmem1 /mnt/scratch
> xfs_io -f -c "truncate 1g" test.img
> losetup -f --show /mnt/scratch/test.img
> mkfs.xfs -f /dev/loop0
>
> And the mkfs.xfs command hangs with a discard that never completes:
>
> [  243.413918] INFO: task mkfs.xfs:5708 blocked for more than 120 seconds.
> [  243.415678]       Not tainted 4.7.0-dgc+ #862
> [  243.416772] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
> [  243.418769] mkfs.xfs        D ffff880835143c18 13848  5708   5441 0x00000000
> [  243.420620]  ffff880835143c18 ffff880835143c20 ffff88083a244780 ffff8808358ba3c0
> [  243.422636]  ffff88023aa20000 ffff880835144000 7fffffffffffffff 7fffffffffffffff
> [  243.424586]  ffff8808358ba3c0 00000000024000c0 ffff880835143c30 ffffffff81e5e38c
> [  243.426466] Call Trace:
> [  243.427050]  [<ffffffff81e5e38c>] schedule+0x3c/0x90
> [  243.428224]  [<ffffffff81e62be5>] schedule_timeout+0x265/0x330
> [  243.429563]  [<ffffffff8109f125>] ? kvm_clock_read+0x25/0x40
> [  243.430896]  [<ffffffff8109f149>] ? kvm_clock_get_cycles+0x9/0x10
> [  243.432360]  [<ffffffff81125edc>] ? ktime_get+0x3c/0xb0
> [  243.433556]  [<ffffffff81e5db54>] io_schedule_timeout+0xa4/0x110
> [  243.434932]  [<ffffffff81e5eed6>] wait_for_completion_io+0xd6/0x110
> [  243.436297]  [<ffffffff810decd0>] ? wake_up_q+0x70/0x70
> [  243.437436]  [<ffffffff817d6f06>] submit_bio_wait+0x56/0x70
> [  243.438671]  [<ffffffff817e851a>] blkdev_issue_discard+0x6a/0xb0
> [  243.439980]  [<ffffffff810dab69>] ? __might_sleep+0x49/0x80
> [  243.441182]  [<ffffffff817eea87>] blk_ioctl_discard+0x97/0xb0
> [  243.442370]  [<ffffffff817ef7bb>] blkdev_ioctl+0x7eb/0x9a0
> [  243.443485]  [<ffffffff81236a1d>] block_ioctl+0x3d/0x50
> [  243.444552]  [<ffffffff812100df>] do_vfs_ioctl+0x8f/0x670
> [  243.445630]  [<ffffffff81002434>] ? exit_to_usermode_loop+0x94/0xb0
> [  243.446902]  [<ffffffff81210739>] SyS_ioctl+0x79/0x90
> [  243.447927]  [<ffffffff81002bc5>] ? syscall_return_slowpath+0xf5/0x190
> [  243.449236]  [<ffffffff81e63d32>] entry_SYSCALL_64_fastpath+0x1a/0xa4
>
> This only reproduces when the underlying filesystem is mounted with
> -o dax, so there is a bad interaction with loop devices and DAX
> occurring somewhere. generic/361 is a recent test (committed june 14)
> so this probably hasn't actually been tested until now.
>
> I haven't got time to look at this right now, hence the report.
>

Thanks Dave, we'll take a look.

^ permalink raw reply	[flat|nested] 25+ messages in thread

* Re: [4.8 hang] xfstests generic/361 hangs on dax enabled filesystems
@ 2016-08-03  0:59   ` Dan Williams
  0 siblings, 0 replies; 25+ messages in thread
From: Dan Williams @ 2016-08-03  0:59 UTC (permalink / raw)
  To: Dave Chinner; +Cc: linux-fsdevel, linux-nvdimm, XFS Developers

On Tue, Aug 2, 2016 at 5:33 PM, Dave Chinner <david@fromorbit.com> wrote:
> Hi folks,
>
> Just hit a reproducable hang in generic/361. Essentially this on
> a 8GB pmem device:
>
> mkfs.xfs -f /dev/pmem1
> mount -o dax /dev/pmem1 /mnt/scratch
> xfs_io -f -c "truncate 1g" test.img
> losetup -f --show /mnt/scratch/test.img
> mkfs.xfs -f /dev/loop0
>
> And the mkfs.xfs command hangs with a discard that never completes:
>
> [  243.413918] INFO: task mkfs.xfs:5708 blocked for more than 120 seconds.
> [  243.415678]       Not tainted 4.7.0-dgc+ #862
> [  243.416772] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
> [  243.418769] mkfs.xfs        D ffff880835143c18 13848  5708   5441 0x00000000
> [  243.420620]  ffff880835143c18 ffff880835143c20 ffff88083a244780 ffff8808358ba3c0
> [  243.422636]  ffff88023aa20000 ffff880835144000 7fffffffffffffff 7fffffffffffffff
> [  243.424586]  ffff8808358ba3c0 00000000024000c0 ffff880835143c30 ffffffff81e5e38c
> [  243.426466] Call Trace:
> [  243.427050]  [<ffffffff81e5e38c>] schedule+0x3c/0x90
> [  243.428224]  [<ffffffff81e62be5>] schedule_timeout+0x265/0x330
> [  243.429563]  [<ffffffff8109f125>] ? kvm_clock_read+0x25/0x40
> [  243.430896]  [<ffffffff8109f149>] ? kvm_clock_get_cycles+0x9/0x10
> [  243.432360]  [<ffffffff81125edc>] ? ktime_get+0x3c/0xb0
> [  243.433556]  [<ffffffff81e5db54>] io_schedule_timeout+0xa4/0x110
> [  243.434932]  [<ffffffff81e5eed6>] wait_for_completion_io+0xd6/0x110
> [  243.436297]  [<ffffffff810decd0>] ? wake_up_q+0x70/0x70
> [  243.437436]  [<ffffffff817d6f06>] submit_bio_wait+0x56/0x70
> [  243.438671]  [<ffffffff817e851a>] blkdev_issue_discard+0x6a/0xb0
> [  243.439980]  [<ffffffff810dab69>] ? __might_sleep+0x49/0x80
> [  243.441182]  [<ffffffff817eea87>] blk_ioctl_discard+0x97/0xb0
> [  243.442370]  [<ffffffff817ef7bb>] blkdev_ioctl+0x7eb/0x9a0
> [  243.443485]  [<ffffffff81236a1d>] block_ioctl+0x3d/0x50
> [  243.444552]  [<ffffffff812100df>] do_vfs_ioctl+0x8f/0x670
> [  243.445630]  [<ffffffff81002434>] ? exit_to_usermode_loop+0x94/0xb0
> [  243.446902]  [<ffffffff81210739>] SyS_ioctl+0x79/0x90
> [  243.447927]  [<ffffffff81002bc5>] ? syscall_return_slowpath+0xf5/0x190
> [  243.449236]  [<ffffffff81e63d32>] entry_SYSCALL_64_fastpath+0x1a/0xa4
>
> This only reproduces when the underlying filesystem is mounted with
> -o dax, so there is a bad interaction with loop devices and DAX
> occurring somewhere. generic/361 is a recent test (committed june 14)
> so this probably hasn't actually been tested until now.
>
> I haven't got time to look at this right now, hence the report.
>

Thanks Dave, we'll take a look.

_______________________________________________
xfs mailing list
xfs@oss.sgi.com
http://oss.sgi.com/mailman/listinfo/xfs

^ permalink raw reply	[flat|nested] 25+ messages in thread

* Re: [4.8 hang] xfstests generic/361 hangs on dax enabled filesystems
  2016-08-03  0:33 ` Dave Chinner
  (?)
@ 2016-08-03 17:11   ` Ross Zwisler
  -1 siblings, 0 replies; 25+ messages in thread
From: Ross Zwisler @ 2016-08-03 17:11 UTC (permalink / raw)
  To: Dave Chinner; +Cc: linux-fsdevel, linux-nvdimm, xfs

On Wed, Aug 03, 2016 at 10:33:54AM +1000, Dave Chinner wrote:
> Hi folks,
> 
> Just hit a reproducable hang in generic/361. Essentially this on
> a 8GB pmem device:
> 
> mkfs.xfs -f /dev/pmem1
> mount -o dax /dev/pmem1 /mnt/scratch
> xfs_io -f -c "truncate 1g" test.img
> losetup -f --show /mnt/scratch/test.img
> mkfs.xfs -f /dev/loop0
> 
> And the mkfs.xfs command hangs with a discard that never completes:
> 
> [  243.413918] INFO: task mkfs.xfs:5708 blocked for more than 120 seconds.
> [  243.415678]       Not tainted 4.7.0-dgc+ #862
> [  243.416772] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
> [  243.418769] mkfs.xfs        D ffff880835143c18 13848  5708   5441 0x00000000
> [  243.420620]  ffff880835143c18 ffff880835143c20 ffff88083a244780 ffff8808358ba3c0
> [  243.422636]  ffff88023aa20000 ffff880835144000 7fffffffffffffff 7fffffffffffffff
> [  243.424586]  ffff8808358ba3c0 00000000024000c0 ffff880835143c30 ffffffff81e5e38c
> [  243.426466] Call Trace:
> [  243.427050]  [<ffffffff81e5e38c>] schedule+0x3c/0x90
> [  243.428224]  [<ffffffff81e62be5>] schedule_timeout+0x265/0x330
> [  243.429563]  [<ffffffff8109f125>] ? kvm_clock_read+0x25/0x40
> [  243.430896]  [<ffffffff8109f149>] ? kvm_clock_get_cycles+0x9/0x10
> [  243.432360]  [<ffffffff81125edc>] ? ktime_get+0x3c/0xb0
> [  243.433556]  [<ffffffff81e5db54>] io_schedule_timeout+0xa4/0x110
> [  243.434932]  [<ffffffff81e5eed6>] wait_for_completion_io+0xd6/0x110
> [  243.436297]  [<ffffffff810decd0>] ? wake_up_q+0x70/0x70
> [  243.437436]  [<ffffffff817d6f06>] submit_bio_wait+0x56/0x70
> [  243.438671]  [<ffffffff817e851a>] blkdev_issue_discard+0x6a/0xb0
> [  243.439980]  [<ffffffff810dab69>] ? __might_sleep+0x49/0x80
> [  243.441182]  [<ffffffff817eea87>] blk_ioctl_discard+0x97/0xb0
> [  243.442370]  [<ffffffff817ef7bb>] blkdev_ioctl+0x7eb/0x9a0
> [  243.443485]  [<ffffffff81236a1d>] block_ioctl+0x3d/0x50
> [  243.444552]  [<ffffffff812100df>] do_vfs_ioctl+0x8f/0x670
> [  243.445630]  [<ffffffff81002434>] ? exit_to_usermode_loop+0x94/0xb0
> [  243.446902]  [<ffffffff81210739>] SyS_ioctl+0x79/0x90
> [  243.447927]  [<ffffffff81002bc5>] ? syscall_return_slowpath+0xf5/0x190
> [  243.449236]  [<ffffffff81e63d32>] entry_SYSCALL_64_fastpath+0x1a/0xa4
> 
> This only reproduces when the underlying filesystem is mounted with
> -o dax, so there is a bad interaction with loop devices and DAX
> occurring somewhere. generic/361 is a recent test (committed june 14)
> so this probably hasn't actually been tested until now.
> 
> I haven't got time to look at this right now, hence the report.

Cool, thanks for the report.  I've reproduced this with linux/master, and the
test passes with v4.7.

Running a bisect...
_______________________________________________
Linux-nvdimm mailing list
Linux-nvdimm@lists.01.org
https://lists.01.org/mailman/listinfo/linux-nvdimm

^ permalink raw reply	[flat|nested] 25+ messages in thread

* Re: [4.8 hang] xfstests generic/361 hangs on dax enabled filesystems
@ 2016-08-03 17:11   ` Ross Zwisler
  0 siblings, 0 replies; 25+ messages in thread
From: Ross Zwisler @ 2016-08-03 17:11 UTC (permalink / raw)
  To: Dave Chinner; +Cc: xfs, linux-fsdevel, linux-nvdimm

On Wed, Aug 03, 2016 at 10:33:54AM +1000, Dave Chinner wrote:
> Hi folks,
> 
> Just hit a reproducable hang in generic/361. Essentially this on
> a 8GB pmem device:
> 
> mkfs.xfs -f /dev/pmem1
> mount -o dax /dev/pmem1 /mnt/scratch
> xfs_io -f -c "truncate 1g" test.img
> losetup -f --show /mnt/scratch/test.img
> mkfs.xfs -f /dev/loop0
> 
> And the mkfs.xfs command hangs with a discard that never completes:
> 
> [  243.413918] INFO: task mkfs.xfs:5708 blocked for more than 120 seconds.
> [  243.415678]       Not tainted 4.7.0-dgc+ #862
> [  243.416772] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
> [  243.418769] mkfs.xfs        D ffff880835143c18 13848  5708   5441 0x00000000
> [  243.420620]  ffff880835143c18 ffff880835143c20 ffff88083a244780 ffff8808358ba3c0
> [  243.422636]  ffff88023aa20000 ffff880835144000 7fffffffffffffff 7fffffffffffffff
> [  243.424586]  ffff8808358ba3c0 00000000024000c0 ffff880835143c30 ffffffff81e5e38c
> [  243.426466] Call Trace:
> [  243.427050]  [<ffffffff81e5e38c>] schedule+0x3c/0x90
> [  243.428224]  [<ffffffff81e62be5>] schedule_timeout+0x265/0x330
> [  243.429563]  [<ffffffff8109f125>] ? kvm_clock_read+0x25/0x40
> [  243.430896]  [<ffffffff8109f149>] ? kvm_clock_get_cycles+0x9/0x10
> [  243.432360]  [<ffffffff81125edc>] ? ktime_get+0x3c/0xb0
> [  243.433556]  [<ffffffff81e5db54>] io_schedule_timeout+0xa4/0x110
> [  243.434932]  [<ffffffff81e5eed6>] wait_for_completion_io+0xd6/0x110
> [  243.436297]  [<ffffffff810decd0>] ? wake_up_q+0x70/0x70
> [  243.437436]  [<ffffffff817d6f06>] submit_bio_wait+0x56/0x70
> [  243.438671]  [<ffffffff817e851a>] blkdev_issue_discard+0x6a/0xb0
> [  243.439980]  [<ffffffff810dab69>] ? __might_sleep+0x49/0x80
> [  243.441182]  [<ffffffff817eea87>] blk_ioctl_discard+0x97/0xb0
> [  243.442370]  [<ffffffff817ef7bb>] blkdev_ioctl+0x7eb/0x9a0
> [  243.443485]  [<ffffffff81236a1d>] block_ioctl+0x3d/0x50
> [  243.444552]  [<ffffffff812100df>] do_vfs_ioctl+0x8f/0x670
> [  243.445630]  [<ffffffff81002434>] ? exit_to_usermode_loop+0x94/0xb0
> [  243.446902]  [<ffffffff81210739>] SyS_ioctl+0x79/0x90
> [  243.447927]  [<ffffffff81002bc5>] ? syscall_return_slowpath+0xf5/0x190
> [  243.449236]  [<ffffffff81e63d32>] entry_SYSCALL_64_fastpath+0x1a/0xa4
> 
> This only reproduces when the underlying filesystem is mounted with
> -o dax, so there is a bad interaction with loop devices and DAX
> occurring somewhere. generic/361 is a recent test (committed june 14)
> so this probably hasn't actually been tested until now.
> 
> I haven't got time to look at this right now, hence the report.

Cool, thanks for the report.  I've reproduced this with linux/master, and the
test passes with v4.7.

Running a bisect...

^ permalink raw reply	[flat|nested] 25+ messages in thread

* Re: [4.8 hang] xfstests generic/361 hangs on dax enabled filesystems
@ 2016-08-03 17:11   ` Ross Zwisler
  0 siblings, 0 replies; 25+ messages in thread
From: Ross Zwisler @ 2016-08-03 17:11 UTC (permalink / raw)
  To: Dave Chinner; +Cc: linux-fsdevel, linux-nvdimm, xfs

On Wed, Aug 03, 2016 at 10:33:54AM +1000, Dave Chinner wrote:
> Hi folks,
> 
> Just hit a reproducable hang in generic/361. Essentially this on
> a 8GB pmem device:
> 
> mkfs.xfs -f /dev/pmem1
> mount -o dax /dev/pmem1 /mnt/scratch
> xfs_io -f -c "truncate 1g" test.img
> losetup -f --show /mnt/scratch/test.img
> mkfs.xfs -f /dev/loop0
> 
> And the mkfs.xfs command hangs with a discard that never completes:
> 
> [  243.413918] INFO: task mkfs.xfs:5708 blocked for more than 120 seconds.
> [  243.415678]       Not tainted 4.7.0-dgc+ #862
> [  243.416772] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
> [  243.418769] mkfs.xfs        D ffff880835143c18 13848  5708   5441 0x00000000
> [  243.420620]  ffff880835143c18 ffff880835143c20 ffff88083a244780 ffff8808358ba3c0
> [  243.422636]  ffff88023aa20000 ffff880835144000 7fffffffffffffff 7fffffffffffffff
> [  243.424586]  ffff8808358ba3c0 00000000024000c0 ffff880835143c30 ffffffff81e5e38c
> [  243.426466] Call Trace:
> [  243.427050]  [<ffffffff81e5e38c>] schedule+0x3c/0x90
> [  243.428224]  [<ffffffff81e62be5>] schedule_timeout+0x265/0x330
> [  243.429563]  [<ffffffff8109f125>] ? kvm_clock_read+0x25/0x40
> [  243.430896]  [<ffffffff8109f149>] ? kvm_clock_get_cycles+0x9/0x10
> [  243.432360]  [<ffffffff81125edc>] ? ktime_get+0x3c/0xb0
> [  243.433556]  [<ffffffff81e5db54>] io_schedule_timeout+0xa4/0x110
> [  243.434932]  [<ffffffff81e5eed6>] wait_for_completion_io+0xd6/0x110
> [  243.436297]  [<ffffffff810decd0>] ? wake_up_q+0x70/0x70
> [  243.437436]  [<ffffffff817d6f06>] submit_bio_wait+0x56/0x70
> [  243.438671]  [<ffffffff817e851a>] blkdev_issue_discard+0x6a/0xb0
> [  243.439980]  [<ffffffff810dab69>] ? __might_sleep+0x49/0x80
> [  243.441182]  [<ffffffff817eea87>] blk_ioctl_discard+0x97/0xb0
> [  243.442370]  [<ffffffff817ef7bb>] blkdev_ioctl+0x7eb/0x9a0
> [  243.443485]  [<ffffffff81236a1d>] block_ioctl+0x3d/0x50
> [  243.444552]  [<ffffffff812100df>] do_vfs_ioctl+0x8f/0x670
> [  243.445630]  [<ffffffff81002434>] ? exit_to_usermode_loop+0x94/0xb0
> [  243.446902]  [<ffffffff81210739>] SyS_ioctl+0x79/0x90
> [  243.447927]  [<ffffffff81002bc5>] ? syscall_return_slowpath+0xf5/0x190
> [  243.449236]  [<ffffffff81e63d32>] entry_SYSCALL_64_fastpath+0x1a/0xa4
> 
> This only reproduces when the underlying filesystem is mounted with
> -o dax, so there is a bad interaction with loop devices and DAX
> occurring somewhere. generic/361 is a recent test (committed june 14)
> so this probably hasn't actually been tested until now.
> 
> I haven't got time to look at this right now, hence the report.

Cool, thanks for the report.  I've reproduced this with linux/master, and the
test passes with v4.7.

Running a bisect...

_______________________________________________
xfs mailing list
xfs@oss.sgi.com
http://oss.sgi.com/mailman/listinfo/xfs

^ permalink raw reply	[flat|nested] 25+ messages in thread

* Re: [4.8 hang] xfstests generic/361 hangs on dax enabled filesystems
  2016-08-03 17:11   ` Ross Zwisler
  (?)
@ 2016-08-03 22:37     ` Ross Zwisler
  -1 siblings, 0 replies; 25+ messages in thread
From: Ross Zwisler @ 2016-08-03 22:37 UTC (permalink / raw)
  To: Ross Zwisler, Dave Chinner, xfs, linux-fsdevel, linux-nvdimm

On Wed, Aug 03, 2016 at 11:11:27AM -0600, Ross Zwisler wrote:
> On Wed, Aug 03, 2016 at 10:33:54AM +1000, Dave Chinner wrote:
> > Hi folks,
> > 
> > Just hit a reproducable hang in generic/361. Essentially this on
> > a 8GB pmem device:
> > 
> > mkfs.xfs -f /dev/pmem1
> > mount -o dax /dev/pmem1 /mnt/scratch
> > xfs_io -f -c "truncate 1g" test.img
> > losetup -f --show /mnt/scratch/test.img
> > mkfs.xfs -f /dev/loop0
> > 
> > And the mkfs.xfs command hangs with a discard that never completes:
> > 
> > [  243.413918] INFO: task mkfs.xfs:5708 blocked for more than 120 seconds.
> > [  243.415678]       Not tainted 4.7.0-dgc+ #862
> > [  243.416772] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
> > [  243.418769] mkfs.xfs        D ffff880835143c18 13848  5708   5441 0x00000000
> > [  243.420620]  ffff880835143c18 ffff880835143c20 ffff88083a244780 ffff8808358ba3c0
> > [  243.422636]  ffff88023aa20000 ffff880835144000 7fffffffffffffff 7fffffffffffffff
> > [  243.424586]  ffff8808358ba3c0 00000000024000c0 ffff880835143c30 ffffffff81e5e38c
> > [  243.426466] Call Trace:
> > [  243.427050]  [<ffffffff81e5e38c>] schedule+0x3c/0x90
> > [  243.428224]  [<ffffffff81e62be5>] schedule_timeout+0x265/0x330
> > [  243.429563]  [<ffffffff8109f125>] ? kvm_clock_read+0x25/0x40
> > [  243.430896]  [<ffffffff8109f149>] ? kvm_clock_get_cycles+0x9/0x10
> > [  243.432360]  [<ffffffff81125edc>] ? ktime_get+0x3c/0xb0
> > [  243.433556]  [<ffffffff81e5db54>] io_schedule_timeout+0xa4/0x110
> > [  243.434932]  [<ffffffff81e5eed6>] wait_for_completion_io+0xd6/0x110
> > [  243.436297]  [<ffffffff810decd0>] ? wake_up_q+0x70/0x70
> > [  243.437436]  [<ffffffff817d6f06>] submit_bio_wait+0x56/0x70
> > [  243.438671]  [<ffffffff817e851a>] blkdev_issue_discard+0x6a/0xb0
> > [  243.439980]  [<ffffffff810dab69>] ? __might_sleep+0x49/0x80
> > [  243.441182]  [<ffffffff817eea87>] blk_ioctl_discard+0x97/0xb0
> > [  243.442370]  [<ffffffff817ef7bb>] blkdev_ioctl+0x7eb/0x9a0
> > [  243.443485]  [<ffffffff81236a1d>] block_ioctl+0x3d/0x50
> > [  243.444552]  [<ffffffff812100df>] do_vfs_ioctl+0x8f/0x670
> > [  243.445630]  [<ffffffff81002434>] ? exit_to_usermode_loop+0x94/0xb0
> > [  243.446902]  [<ffffffff81210739>] SyS_ioctl+0x79/0x90
> > [  243.447927]  [<ffffffff81002bc5>] ? syscall_return_slowpath+0xf5/0x190
> > [  243.449236]  [<ffffffff81e63d32>] entry_SYSCALL_64_fastpath+0x1a/0xa4
> > 
> > This only reproduces when the underlying filesystem is mounted with
> > -o dax, so there is a bad interaction with loop devices and DAX
> > occurring somewhere. generic/361 is a recent test (committed june 14)
> > so this probably hasn't actually been tested until now.
> > 
> > I haven't got time to look at this right now, hence the report.
> 
> Cool, thanks for the report.  I've reproduced this with linux/master, and the
> test passes with v4.7.
> 
> Running a bisect...

This bisected to a commit to the block layer code.  I've sent a bug report to
the author of the commit.

- Ross
_______________________________________________
Linux-nvdimm mailing list
Linux-nvdimm@lists.01.org
https://lists.01.org/mailman/listinfo/linux-nvdimm

^ permalink raw reply	[flat|nested] 25+ messages in thread

* Re: [4.8 hang] xfstests generic/361 hangs on dax enabled filesystems
@ 2016-08-03 22:37     ` Ross Zwisler
  0 siblings, 0 replies; 25+ messages in thread
From: Ross Zwisler @ 2016-08-03 22:37 UTC (permalink / raw)
  To: Ross Zwisler, Dave Chinner, xfs, linux-fsdevel, linux-nvdimm

On Wed, Aug 03, 2016 at 11:11:27AM -0600, Ross Zwisler wrote:
> On Wed, Aug 03, 2016 at 10:33:54AM +1000, Dave Chinner wrote:
> > Hi folks,
> > 
> > Just hit a reproducable hang in generic/361. Essentially this on
> > a 8GB pmem device:
> > 
> > mkfs.xfs -f /dev/pmem1
> > mount -o dax /dev/pmem1 /mnt/scratch
> > xfs_io -f -c "truncate 1g" test.img
> > losetup -f --show /mnt/scratch/test.img
> > mkfs.xfs -f /dev/loop0
> > 
> > And the mkfs.xfs command hangs with a discard that never completes:
> > 
> > [  243.413918] INFO: task mkfs.xfs:5708 blocked for more than 120 seconds.
> > [  243.415678]       Not tainted 4.7.0-dgc+ #862
> > [  243.416772] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
> > [  243.418769] mkfs.xfs        D ffff880835143c18 13848  5708   5441 0x00000000
> > [  243.420620]  ffff880835143c18 ffff880835143c20 ffff88083a244780 ffff8808358ba3c0
> > [  243.422636]  ffff88023aa20000 ffff880835144000 7fffffffffffffff 7fffffffffffffff
> > [  243.424586]  ffff8808358ba3c0 00000000024000c0 ffff880835143c30 ffffffff81e5e38c
> > [  243.426466] Call Trace:
> > [  243.427050]  [<ffffffff81e5e38c>] schedule+0x3c/0x90
> > [  243.428224]  [<ffffffff81e62be5>] schedule_timeout+0x265/0x330
> > [  243.429563]  [<ffffffff8109f125>] ? kvm_clock_read+0x25/0x40
> > [  243.430896]  [<ffffffff8109f149>] ? kvm_clock_get_cycles+0x9/0x10
> > [  243.432360]  [<ffffffff81125edc>] ? ktime_get+0x3c/0xb0
> > [  243.433556]  [<ffffffff81e5db54>] io_schedule_timeout+0xa4/0x110
> > [  243.434932]  [<ffffffff81e5eed6>] wait_for_completion_io+0xd6/0x110
> > [  243.436297]  [<ffffffff810decd0>] ? wake_up_q+0x70/0x70
> > [  243.437436]  [<ffffffff817d6f06>] submit_bio_wait+0x56/0x70
> > [  243.438671]  [<ffffffff817e851a>] blkdev_issue_discard+0x6a/0xb0
> > [  243.439980]  [<ffffffff810dab69>] ? __might_sleep+0x49/0x80
> > [  243.441182]  [<ffffffff817eea87>] blk_ioctl_discard+0x97/0xb0
> > [  243.442370]  [<ffffffff817ef7bb>] blkdev_ioctl+0x7eb/0x9a0
> > [  243.443485]  [<ffffffff81236a1d>] block_ioctl+0x3d/0x50
> > [  243.444552]  [<ffffffff812100df>] do_vfs_ioctl+0x8f/0x670
> > [  243.445630]  [<ffffffff81002434>] ? exit_to_usermode_loop+0x94/0xb0
> > [  243.446902]  [<ffffffff81210739>] SyS_ioctl+0x79/0x90
> > [  243.447927]  [<ffffffff81002bc5>] ? syscall_return_slowpath+0xf5/0x190
> > [  243.449236]  [<ffffffff81e63d32>] entry_SYSCALL_64_fastpath+0x1a/0xa4
> > 
> > This only reproduces when the underlying filesystem is mounted with
> > -o dax, so there is a bad interaction with loop devices and DAX
> > occurring somewhere. generic/361 is a recent test (committed june 14)
> > so this probably hasn't actually been tested until now.
> > 
> > I haven't got time to look at this right now, hence the report.
> 
> Cool, thanks for the report.  I've reproduced this with linux/master, and the
> test passes with v4.7.
> 
> Running a bisect...

This bisected to a commit to the block layer code.  I've sent a bug report to
the author of the commit.

- Ross

^ permalink raw reply	[flat|nested] 25+ messages in thread

* Re: [4.8 hang] xfstests generic/361 hangs on dax enabled filesystems
@ 2016-08-03 22:37     ` Ross Zwisler
  0 siblings, 0 replies; 25+ messages in thread
From: Ross Zwisler @ 2016-08-03 22:37 UTC (permalink / raw)
  To: Ross Zwisler, Dave Chinner, xfs, linux-fsdevel, linux-nvdimm

On Wed, Aug 03, 2016 at 11:11:27AM -0600, Ross Zwisler wrote:
> On Wed, Aug 03, 2016 at 10:33:54AM +1000, Dave Chinner wrote:
> > Hi folks,
> > 
> > Just hit a reproducable hang in generic/361. Essentially this on
> > a 8GB pmem device:
> > 
> > mkfs.xfs -f /dev/pmem1
> > mount -o dax /dev/pmem1 /mnt/scratch
> > xfs_io -f -c "truncate 1g" test.img
> > losetup -f --show /mnt/scratch/test.img
> > mkfs.xfs -f /dev/loop0
> > 
> > And the mkfs.xfs command hangs with a discard that never completes:
> > 
> > [  243.413918] INFO: task mkfs.xfs:5708 blocked for more than 120 seconds.
> > [  243.415678]       Not tainted 4.7.0-dgc+ #862
> > [  243.416772] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
> > [  243.418769] mkfs.xfs        D ffff880835143c18 13848  5708   5441 0x00000000
> > [  243.420620]  ffff880835143c18 ffff880835143c20 ffff88083a244780 ffff8808358ba3c0
> > [  243.422636]  ffff88023aa20000 ffff880835144000 7fffffffffffffff 7fffffffffffffff
> > [  243.424586]  ffff8808358ba3c0 00000000024000c0 ffff880835143c30 ffffffff81e5e38c
> > [  243.426466] Call Trace:
> > [  243.427050]  [<ffffffff81e5e38c>] schedule+0x3c/0x90
> > [  243.428224]  [<ffffffff81e62be5>] schedule_timeout+0x265/0x330
> > [  243.429563]  [<ffffffff8109f125>] ? kvm_clock_read+0x25/0x40
> > [  243.430896]  [<ffffffff8109f149>] ? kvm_clock_get_cycles+0x9/0x10
> > [  243.432360]  [<ffffffff81125edc>] ? ktime_get+0x3c/0xb0
> > [  243.433556]  [<ffffffff81e5db54>] io_schedule_timeout+0xa4/0x110
> > [  243.434932]  [<ffffffff81e5eed6>] wait_for_completion_io+0xd6/0x110
> > [  243.436297]  [<ffffffff810decd0>] ? wake_up_q+0x70/0x70
> > [  243.437436]  [<ffffffff817d6f06>] submit_bio_wait+0x56/0x70
> > [  243.438671]  [<ffffffff817e851a>] blkdev_issue_discard+0x6a/0xb0
> > [  243.439980]  [<ffffffff810dab69>] ? __might_sleep+0x49/0x80
> > [  243.441182]  [<ffffffff817eea87>] blk_ioctl_discard+0x97/0xb0
> > [  243.442370]  [<ffffffff817ef7bb>] blkdev_ioctl+0x7eb/0x9a0
> > [  243.443485]  [<ffffffff81236a1d>] block_ioctl+0x3d/0x50
> > [  243.444552]  [<ffffffff812100df>] do_vfs_ioctl+0x8f/0x670
> > [  243.445630]  [<ffffffff81002434>] ? exit_to_usermode_loop+0x94/0xb0
> > [  243.446902]  [<ffffffff81210739>] SyS_ioctl+0x79/0x90
> > [  243.447927]  [<ffffffff81002bc5>] ? syscall_return_slowpath+0xf5/0x190
> > [  243.449236]  [<ffffffff81e63d32>] entry_SYSCALL_64_fastpath+0x1a/0xa4
> > 
> > This only reproduces when the underlying filesystem is mounted with
> > -o dax, so there is a bad interaction with loop devices and DAX
> > occurring somewhere. generic/361 is a recent test (committed june 14)
> > so this probably hasn't actually been tested until now.
> > 
> > I haven't got time to look at this right now, hence the report.
> 
> Cool, thanks for the report.  I've reproduced this with linux/master, and the
> test passes with v4.7.
> 
> Running a bisect...

This bisected to a commit to the block layer code.  I've sent a bug report to
the author of the commit.

- Ross

_______________________________________________
xfs mailing list
xfs@oss.sgi.com
http://oss.sgi.com/mailman/listinfo/xfs

^ permalink raw reply	[flat|nested] 25+ messages in thread

* Re: [4.8 hang] xfstests generic/361 hangs on dax enabled filesystems
  2016-08-03 22:37     ` Ross Zwisler
  (?)
@ 2016-08-03 23:16       ` Dave Chinner
  -1 siblings, 0 replies; 25+ messages in thread
From: Dave Chinner @ 2016-08-03 23:16 UTC (permalink / raw)
  To: Ross Zwisler, xfs, linux-fsdevel, linux-nvdimm

On Wed, Aug 03, 2016 at 04:37:13PM -0600, Ross Zwisler wrote:
> On Wed, Aug 03, 2016 at 11:11:27AM -0600, Ross Zwisler wrote:
> > On Wed, Aug 03, 2016 at 10:33:54AM +1000, Dave Chinner wrote:
> > > Hi folks,
> > > 
> > > Just hit a reproducable hang in generic/361. Essentially this on
> > > a 8GB pmem device:
> > > 
> > > mkfs.xfs -f /dev/pmem1
> > > mount -o dax /dev/pmem1 /mnt/scratch
> > > xfs_io -f -c "truncate 1g" test.img
> > > losetup -f --show /mnt/scratch/test.img
> > > mkfs.xfs -f /dev/loop0
> > > 
> > > And the mkfs.xfs command hangs with a discard that never completes:
....
> > > This only reproduces when the underlying filesystem is mounted with
> > > -o dax, so there is a bad interaction with loop devices and DAX
> > > occurring somewhere. generic/361 is a recent test (committed june 14)
> > > so this probably hasn't actually been tested until now.
> > > 
> > > I haven't got time to look at this right now, hence the report.
> > 
> > Cool, thanks for the report.  I've reproduced this with linux/master, and the
> > test passes with v4.7.
> > 
> > Running a bisect...
> 
> This bisected to a commit to the block layer code.  I've sent a bug report to
> the author of the commit.

And the commit id is ...?

Cheers,

Dave.
-- 
Dave Chinner
david@fromorbit.com
_______________________________________________
Linux-nvdimm mailing list
Linux-nvdimm@lists.01.org
https://lists.01.org/mailman/listinfo/linux-nvdimm

^ permalink raw reply	[flat|nested] 25+ messages in thread

* Re: [4.8 hang] xfstests generic/361 hangs on dax enabled filesystems
@ 2016-08-03 23:16       ` Dave Chinner
  0 siblings, 0 replies; 25+ messages in thread
From: Dave Chinner @ 2016-08-03 23:16 UTC (permalink / raw)
  To: Ross Zwisler, xfs, linux-fsdevel, linux-nvdimm

On Wed, Aug 03, 2016 at 04:37:13PM -0600, Ross Zwisler wrote:
> On Wed, Aug 03, 2016 at 11:11:27AM -0600, Ross Zwisler wrote:
> > On Wed, Aug 03, 2016 at 10:33:54AM +1000, Dave Chinner wrote:
> > > Hi folks,
> > > 
> > > Just hit a reproducable hang in generic/361. Essentially this on
> > > a 8GB pmem device:
> > > 
> > > mkfs.xfs -f /dev/pmem1
> > > mount -o dax /dev/pmem1 /mnt/scratch
> > > xfs_io -f -c "truncate 1g" test.img
> > > losetup -f --show /mnt/scratch/test.img
> > > mkfs.xfs -f /dev/loop0
> > > 
> > > And the mkfs.xfs command hangs with a discard that never completes:
....
> > > This only reproduces when the underlying filesystem is mounted with
> > > -o dax, so there is a bad interaction with loop devices and DAX
> > > occurring somewhere. generic/361 is a recent test (committed june 14)
> > > so this probably hasn't actually been tested until now.
> > > 
> > > I haven't got time to look at this right now, hence the report.
> > 
> > Cool, thanks for the report.  I've reproduced this with linux/master, and the
> > test passes with v4.7.
> > 
> > Running a bisect...
> 
> This bisected to a commit to the block layer code.  I've sent a bug report to
> the author of the commit.

And the commit id is ...?

Cheers,

Dave.
-- 
Dave Chinner
david@fromorbit.com

^ permalink raw reply	[flat|nested] 25+ messages in thread

* Re: [4.8 hang] xfstests generic/361 hangs on dax enabled filesystems
@ 2016-08-03 23:16       ` Dave Chinner
  0 siblings, 0 replies; 25+ messages in thread
From: Dave Chinner @ 2016-08-03 23:16 UTC (permalink / raw)
  To: Ross Zwisler, xfs, linux-fsdevel, linux-nvdimm

On Wed, Aug 03, 2016 at 04:37:13PM -0600, Ross Zwisler wrote:
> On Wed, Aug 03, 2016 at 11:11:27AM -0600, Ross Zwisler wrote:
> > On Wed, Aug 03, 2016 at 10:33:54AM +1000, Dave Chinner wrote:
> > > Hi folks,
> > > 
> > > Just hit a reproducable hang in generic/361. Essentially this on
> > > a 8GB pmem device:
> > > 
> > > mkfs.xfs -f /dev/pmem1
> > > mount -o dax /dev/pmem1 /mnt/scratch
> > > xfs_io -f -c "truncate 1g" test.img
> > > losetup -f --show /mnt/scratch/test.img
> > > mkfs.xfs -f /dev/loop0
> > > 
> > > And the mkfs.xfs command hangs with a discard that never completes:
....
> > > This only reproduces when the underlying filesystem is mounted with
> > > -o dax, so there is a bad interaction with loop devices and DAX
> > > occurring somewhere. generic/361 is a recent test (committed june 14)
> > > so this probably hasn't actually been tested until now.
> > > 
> > > I haven't got time to look at this right now, hence the report.
> > 
> > Cool, thanks for the report.  I've reproduced this with linux/master, and the
> > test passes with v4.7.
> > 
> > Running a bisect...
> 
> This bisected to a commit to the block layer code.  I've sent a bug report to
> the author of the commit.

And the commit id is ...?

Cheers,

Dave.
-- 
Dave Chinner
david@fromorbit.com

_______________________________________________
xfs mailing list
xfs@oss.sgi.com
http://oss.sgi.com/mailman/listinfo/xfs

^ permalink raw reply	[flat|nested] 25+ messages in thread

* Re: [4.8 hang] xfstests generic/361 hangs on dax enabled filesystems
  2016-08-03 23:16       ` Dave Chinner
  (?)
@ 2016-08-04  2:52         ` Dan Williams
  -1 siblings, 0 replies; 25+ messages in thread
From: Dan Williams @ 2016-08-04  2:52 UTC (permalink / raw)
  To: Dave Chinner; +Cc: linux-fsdevel, linux-nvdimm, XFS Developers

On Wed, Aug 3, 2016 at 4:16 PM, Dave Chinner <david@fromorbit.com> wrote:
> On Wed, Aug 03, 2016 at 04:37:13PM -0600, Ross Zwisler wrote:
>> On Wed, Aug 03, 2016 at 11:11:27AM -0600, Ross Zwisler wrote:
>> > On Wed, Aug 03, 2016 at 10:33:54AM +1000, Dave Chinner wrote:
>> > > Hi folks,
>> > >
>> > > Just hit a reproducable hang in generic/361. Essentially this on
>> > > a 8GB pmem device:
>> > >
>> > > mkfs.xfs -f /dev/pmem1
>> > > mount -o dax /dev/pmem1 /mnt/scratch
>> > > xfs_io -f -c "truncate 1g" test.img
>> > > losetup -f --show /mnt/scratch/test.img
>> > > mkfs.xfs -f /dev/loop0
>> > >
>> > > And the mkfs.xfs command hangs with a discard that never completes:
> ....
>> > > This only reproduces when the underlying filesystem is mounted with
>> > > -o dax, so there is a bad interaction with loop devices and DAX
>> > > occurring somewhere. generic/361 is a recent test (committed june 14)
>> > > so this probably hasn't actually been tested until now.
>> > >
>> > > I haven't got time to look at this right now, hence the report.
>> >
>> > Cool, thanks for the report.  I've reproduced this with linux/master, and the
>> > test passes with v4.7.
>> >
>> > Running a bisect...
>>
>> This bisected to a commit to the block layer code.  I've sent a bug report to
>> the author of the commit.
>
> And the commit id is ...?

c2df40dfb8c0 drivers: use req op accessor
_______________________________________________
Linux-nvdimm mailing list
Linux-nvdimm@lists.01.org
https://lists.01.org/mailman/listinfo/linux-nvdimm

^ permalink raw reply	[flat|nested] 25+ messages in thread

* Re: [4.8 hang] xfstests generic/361 hangs on dax enabled filesystems
@ 2016-08-04  2:52         ` Dan Williams
  0 siblings, 0 replies; 25+ messages in thread
From: Dan Williams @ 2016-08-04  2:52 UTC (permalink / raw)
  To: Dave Chinner; +Cc: Ross Zwisler, XFS Developers, linux-fsdevel, linux-nvdimm

On Wed, Aug 3, 2016 at 4:16 PM, Dave Chinner <david@fromorbit.com> wrote:
> On Wed, Aug 03, 2016 at 04:37:13PM -0600, Ross Zwisler wrote:
>> On Wed, Aug 03, 2016 at 11:11:27AM -0600, Ross Zwisler wrote:
>> > On Wed, Aug 03, 2016 at 10:33:54AM +1000, Dave Chinner wrote:
>> > > Hi folks,
>> > >
>> > > Just hit a reproducable hang in generic/361. Essentially this on
>> > > a 8GB pmem device:
>> > >
>> > > mkfs.xfs -f /dev/pmem1
>> > > mount -o dax /dev/pmem1 /mnt/scratch
>> > > xfs_io -f -c "truncate 1g" test.img
>> > > losetup -f --show /mnt/scratch/test.img
>> > > mkfs.xfs -f /dev/loop0
>> > >
>> > > And the mkfs.xfs command hangs with a discard that never completes:
> ....
>> > > This only reproduces when the underlying filesystem is mounted with
>> > > -o dax, so there is a bad interaction with loop devices and DAX
>> > > occurring somewhere. generic/361 is a recent test (committed june 14)
>> > > so this probably hasn't actually been tested until now.
>> > >
>> > > I haven't got time to look at this right now, hence the report.
>> >
>> > Cool, thanks for the report.  I've reproduced this with linux/master, and the
>> > test passes with v4.7.
>> >
>> > Running a bisect...
>>
>> This bisected to a commit to the block layer code.  I've sent a bug report to
>> the author of the commit.
>
> And the commit id is ...?

c2df40dfb8c0 drivers: use req op accessor

^ permalink raw reply	[flat|nested] 25+ messages in thread

* Re: [4.8 hang] xfstests generic/361 hangs on dax enabled filesystems
@ 2016-08-04  2:52         ` Dan Williams
  0 siblings, 0 replies; 25+ messages in thread
From: Dan Williams @ 2016-08-04  2:52 UTC (permalink / raw)
  To: Dave Chinner; +Cc: linux-fsdevel, Ross Zwisler, linux-nvdimm, XFS Developers

On Wed, Aug 3, 2016 at 4:16 PM, Dave Chinner <david@fromorbit.com> wrote:
> On Wed, Aug 03, 2016 at 04:37:13PM -0600, Ross Zwisler wrote:
>> On Wed, Aug 03, 2016 at 11:11:27AM -0600, Ross Zwisler wrote:
>> > On Wed, Aug 03, 2016 at 10:33:54AM +1000, Dave Chinner wrote:
>> > > Hi folks,
>> > >
>> > > Just hit a reproducable hang in generic/361. Essentially this on
>> > > a 8GB pmem device:
>> > >
>> > > mkfs.xfs -f /dev/pmem1
>> > > mount -o dax /dev/pmem1 /mnt/scratch
>> > > xfs_io -f -c "truncate 1g" test.img
>> > > losetup -f --show /mnt/scratch/test.img
>> > > mkfs.xfs -f /dev/loop0
>> > >
>> > > And the mkfs.xfs command hangs with a discard that never completes:
> ....
>> > > This only reproduces when the underlying filesystem is mounted with
>> > > -o dax, so there is a bad interaction with loop devices and DAX
>> > > occurring somewhere. generic/361 is a recent test (committed june 14)
>> > > so this probably hasn't actually been tested until now.
>> > >
>> > > I haven't got time to look at this right now, hence the report.
>> >
>> > Cool, thanks for the report.  I've reproduced this with linux/master, and the
>> > test passes with v4.7.
>> >
>> > Running a bisect...
>>
>> This bisected to a commit to the block layer code.  I've sent a bug report to
>> the author of the commit.
>
> And the commit id is ...?

c2df40dfb8c0 drivers: use req op accessor

_______________________________________________
xfs mailing list
xfs@oss.sgi.com
http://oss.sgi.com/mailman/listinfo/xfs

^ permalink raw reply	[flat|nested] 25+ messages in thread

* Re: [4.8 hang] xfstests generic/361 hangs on dax enabled filesystems
  2016-08-03  0:33 ` Dave Chinner
@ 2016-08-04 15:48   ` Christoph Hellwig
  -1 siblings, 0 replies; 25+ messages in thread
From: Christoph Hellwig @ 2016-08-04 15:48 UTC (permalink / raw)
  To: Dave Chinner; +Cc: linux-fsdevel, linux-nvdimm, xfs

Just send a fix that you're Cc'ed on.  But now xfs/049 hangs, although
only on pmem devices, loop on non-pmem seems to be fine.  4.7 was fine
as well.

_______________________________________________
xfs mailing list
xfs@oss.sgi.com
http://oss.sgi.com/mailman/listinfo/xfs

^ permalink raw reply	[flat|nested] 25+ messages in thread

* Re: [4.8 hang] xfstests generic/361 hangs on dax enabled filesystems
@ 2016-08-04 15:48   ` Christoph Hellwig
  0 siblings, 0 replies; 25+ messages in thread
From: Christoph Hellwig @ 2016-08-04 15:48 UTC (permalink / raw)
  To: Dave Chinner; +Cc: xfs, linux-fsdevel, linux-nvdimm

Just send a fix that you're Cc'ed on.  But now xfs/049 hangs, although
only on pmem devices, loop on non-pmem seems to be fine.  4.7 was fine
as well.

^ permalink raw reply	[flat|nested] 25+ messages in thread

* Re: [4.8 hang] xfstests generic/361 hangs on dax enabled filesystems
  2016-08-04 15:48   ` Christoph Hellwig
  (?)
@ 2016-08-04 18:46       ` Ross Zwisler
  -1 siblings, 0 replies; 25+ messages in thread
From: Ross Zwisler @ 2016-08-04 18:46 UTC (permalink / raw)
  To: Christoph Hellwig
  Cc: linux-fsdevel-u79uwXL29TY76Z2rM5mHXA,
	linux-nvdimm-y27Ovi1pjclAfugRpC6u6w, Dave Chinner,
	xfs-VZNHf3L845pBDgjK7y7TUQ

On Thu, Aug 04, 2016 at 08:48:05AM -0700, Christoph Hellwig wrote:
> Just send a fix that you're Cc'ed on.  But now xfs/049 hangs, although
> only on pmem devices, loop on non-pmem seems to be fine.  4.7 was fine
> as well.

I don't think Mike's fix solves this issue.  I'm still able to reproduce the
hang with linux/master + the patch he sent out.

Did you retest and get a different result?

^ permalink raw reply	[flat|nested] 25+ messages in thread

* Re: [4.8 hang] xfstests generic/361 hangs on dax enabled filesystems
@ 2016-08-04 18:46       ` Ross Zwisler
  0 siblings, 0 replies; 25+ messages in thread
From: Ross Zwisler @ 2016-08-04 18:46 UTC (permalink / raw)
  To: Christoph Hellwig; +Cc: Dave Chinner, linux-fsdevel, linux-nvdimm, xfs

On Thu, Aug 04, 2016 at 08:48:05AM -0700, Christoph Hellwig wrote:
> Just send a fix that you're Cc'ed on.  But now xfs/049 hangs, although
> only on pmem devices, loop on non-pmem seems to be fine.  4.7 was fine
> as well.

I don't think Mike's fix solves this issue.  I'm still able to reproduce the
hang with linux/master + the patch he sent out.

Did you retest and get a different result?

^ permalink raw reply	[flat|nested] 25+ messages in thread

* Re: [4.8 hang] xfstests generic/361 hangs on dax enabled filesystems
@ 2016-08-04 18:46       ` Ross Zwisler
  0 siblings, 0 replies; 25+ messages in thread
From: Ross Zwisler @ 2016-08-04 18:46 UTC (permalink / raw)
  To: Christoph Hellwig; +Cc: linux-fsdevel, linux-nvdimm, xfs

On Thu, Aug 04, 2016 at 08:48:05AM -0700, Christoph Hellwig wrote:
> Just send a fix that you're Cc'ed on.  But now xfs/049 hangs, although
> only on pmem devices, loop on non-pmem seems to be fine.  4.7 was fine
> as well.

I don't think Mike's fix solves this issue.  I'm still able to reproduce the
hang with linux/master + the patch he sent out.

Did you retest and get a different result?

_______________________________________________
xfs mailing list
xfs@oss.sgi.com
http://oss.sgi.com/mailman/listinfo/xfs

^ permalink raw reply	[flat|nested] 25+ messages in thread

* Re: [4.8 hang] xfstests generic/361 hangs on dax enabled filesystems
  2016-08-04 18:46       ` Ross Zwisler
@ 2016-08-04 18:54         ` Ross Zwisler
  -1 siblings, 0 replies; 25+ messages in thread
From: Ross Zwisler @ 2016-08-04 18:54 UTC (permalink / raw)
  To: Ross Zwisler, Christoph Hellwig, Dave Chinner, linux-fsdevel,
	linux-nvdimm, xfs

On Thu, Aug 04, 2016 at 12:46:58PM -0600, Ross Zwisler wrote:
> On Thu, Aug 04, 2016 at 08:48:05AM -0700, Christoph Hellwig wrote:
> > Just send a fix that you're Cc'ed on.  But now xfs/049 hangs, although
> > only on pmem devices, loop on non-pmem seems to be fine.  4.7 was fine
> > as well.
> 
> I don't think Mike's fix solves this issue.  I'm still able to reproduce the
> hang with linux/master + the patch he sent out.
> 
> Did you retest and get a different result?

Never mind, didn't see that you had sent out patches as well.

_______________________________________________
xfs mailing list
xfs@oss.sgi.com
http://oss.sgi.com/mailman/listinfo/xfs

^ permalink raw reply	[flat|nested] 25+ messages in thread

* Re: [4.8 hang] xfstests generic/361 hangs on dax enabled filesystems
@ 2016-08-04 18:54         ` Ross Zwisler
  0 siblings, 0 replies; 25+ messages in thread
From: Ross Zwisler @ 2016-08-04 18:54 UTC (permalink / raw)
  To: Ross Zwisler, Christoph Hellwig, Dave Chinner, linux-fsdevel,
	linux-nvdimm, xfs

On Thu, Aug 04, 2016 at 12:46:58PM -0600, Ross Zwisler wrote:
> On Thu, Aug 04, 2016 at 08:48:05AM -0700, Christoph Hellwig wrote:
> > Just send a fix that you're Cc'ed on.  But now xfs/049 hangs, although
> > only on pmem devices, loop on non-pmem seems to be fine.  4.7 was fine
> > as well.
> 
> I don't think Mike's fix solves this issue.  I'm still able to reproduce the
> hang with linux/master + the patch he sent out.
> 
> Did you retest and get a different result?

Never mind, didn't see that you had sent out patches as well.

^ permalink raw reply	[flat|nested] 25+ messages in thread

end of thread, other threads:[~2016-08-04 18:54 UTC | newest]

Thread overview: 25+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2016-08-03  0:33 [4.8 hang] xfstests generic/361 hangs on dax enabled filesystems Dave Chinner
2016-08-03  0:33 ` Dave Chinner
2016-08-03  0:33 ` Dave Chinner
2016-08-03  0:59 ` Dan Williams
2016-08-03  0:59   ` Dan Williams
2016-08-03  0:59   ` Dan Williams
2016-08-03 17:11 ` Ross Zwisler
2016-08-03 17:11   ` Ross Zwisler
2016-08-03 17:11   ` Ross Zwisler
2016-08-03 22:37   ` Ross Zwisler
2016-08-03 22:37     ` Ross Zwisler
2016-08-03 22:37     ` Ross Zwisler
2016-08-03 23:16     ` Dave Chinner
2016-08-03 23:16       ` Dave Chinner
2016-08-03 23:16       ` Dave Chinner
2016-08-04  2:52       ` Dan Williams
2016-08-04  2:52         ` Dan Williams
2016-08-04  2:52         ` Dan Williams
2016-08-04 15:48 ` Christoph Hellwig
2016-08-04 15:48   ` Christoph Hellwig
     [not found]   ` <20160804154805.GA24025-wEGCiKHe2LqWVfeAwA7xHQ@public.gmane.org>
2016-08-04 18:46     ` Ross Zwisler
2016-08-04 18:46       ` Ross Zwisler
2016-08-04 18:46       ` Ross Zwisler
2016-08-04 18:54       ` Ross Zwisler
2016-08-04 18:54         ` Ross Zwisler

This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.