* kernel BUG at nvme/host/pci.c
@ 2017-07-10 18:03 Andreas Pflug
  2017-07-10 19:08 ` Keith Busch
  0 siblings, 1 reply; 16+ messages in thread
From: Andreas Pflug @ 2017-07-10 18:03 UTC (permalink / raw)


I'm running a patched (see below) Debian 4.9.30 kernel with Xen 4.8.1 on 
Debian 9. When I start a specific virtual machine, the kernel very soon emits

     kernel BUG at /usr/src/kernel/linux-4.9.30/drivers/nvme/host/pci.c:495!

via netconsole to my logging host and becomes unstable until a hard reset. 
Hardware is dual E5-2620v4 on a Supermicro X10DRI-T with two Samsung 
MZQLW960HMJP-00003 NVMe disks (mdadm RAID-1) backing the VHDs (OS on a 
separate SSD).

The bug was reported to Debian as https://bugs.debian.org/866511 . 
Following Ben Hutchings' advice, I patched the standard kernel with 
0001-swiotlb-ensure-that-page-sized-mappings-are-page-ali.patch since 
its description sounded promising, but the bug remains.

The log is attached, cut after 460 lines: the last trace on CPU 15 is 
repeated over and over, eventually leading to "Fixing recursive fault 
but reboot is needed!"

Regards,
Andreas
-------------- next part --------------
A non-text attachment was scrubbed...
Name: xen2-kernel.log
Type: text/x-log
Size: 17073 bytes
Desc: not available
URL: <http://lists.infradead.org/pipermail/linux-nvme/attachments/20170710/e8cb46ce/attachment.bin>


* kernel BUG at nvme/host/pci.c
  2017-07-10 18:03 kernel BUG at nvme/host/pci.c Andreas Pflug
@ 2017-07-10 19:08 ` Keith Busch
  2017-07-11  7:44   ` Andreas Pflug
  0 siblings, 1 reply; 16+ messages in thread
From: Keith Busch @ 2017-07-10 19:08 UTC (permalink / raw)


On Mon, Jul 10, 2017 at 08:03:16PM +0200, Andreas Pflug wrote:
> I'm running a patched (see below) Debian 4.9.30 kernel with Xen 4.8.1 on
> Debian 9. When I start a specific virtual machine, the kernel very soon emits
> 
>     kernel BUG at /usr/src/kernel/linux-4.9.30/drivers/nvme/host/pci.c:495!
> 
> via netconsole to my logging host and becomes unstable until a hard reset.
> Hardware is dual E5-2620v4 on a Supermicro X10DRI-T with two Samsung
> MZQLW960HMJP-00003 NVMe disks (mdadm RAID-1) backing the VHDs (OS on a
> separate SSD).
> 
> The bug was reported to Debian as https://bugs.debian.org/866511 . Following
> Ben Hutchings' advice, I patched the standard kernel with
> 0001-swiotlb-ensure-that-page-sized-mappings-are-page-ali.patch since its
> description sounded promising, but the bug remains.

The BUG_ON means the nvme driver was given a scatter list that is invalid
for the constraints the NVMe device was registered with. There have been
issues in the past when NVMe is used with stacking devices like RAID,
but I think they are all resolved. Would you happen to know if this
succeeds with the 4.12 kernel? If so, I might be able to find the
patch(es) for 4.9-stable; otherwise we'll need to fix it upstream first.
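
For reference, the check that fires is the BUG_ON(dma_len < 0) at the end of
the PRP-setup walk in nvme_setup_prps(). A toy userspace model of that walk
-- illustration only, not the driver source -- shows the invariant it
enforces: once the first element's in-page offset has been consumed, every
mapped element except the last has to cover a whole number of device pages.

/*
 * Toy userspace model of the PRP walk in nvme_setup_prps() -- illustration
 * only, not the driver source.  page_size is the device page size the
 * controller was registered with (4096 here).
 */
#include <stdio.h>
#include <stdint.h>

struct seg {
	uint64_t dma_addr;
	int dma_len;
};

/* Returns 0 if the mapped list can be expressed as PRPs, -1 where the real
 * driver would hit BUG_ON(dma_len < 0). */
static int prp_walk(const struct seg *sg, int nents, int length, int page_size)
{
	int i = 0;
	uint64_t dma_addr = sg[i].dma_addr;
	int dma_len = sg[i].dma_len;
	int offset = dma_addr & (page_size - 1);

	length -= (page_size - offset);
	if (length <= 0)
		return 0;

	dma_len -= (page_size - offset);
	if (dma_len) {
		dma_addr += (page_size - offset);
	} else {
		if (++i == nents)
			return -1;
		dma_addr = sg[i].dma_addr;
		dma_len = sg[i].dma_len;
	}

	for (;;) {
		dma_len -= page_size;
		dma_addr += page_size;
		length -= page_size;
		if (length <= 0)
			break;
		if (dma_len > 0)
			continue;
		if (dma_len < 0)	/* BUG_ON(dma_len < 0) in the driver */
			return -1;
		if (++i == nents)
			return -1;
		dma_addr = sg[i].dma_addr;
		dma_len = sg[i].dma_len;
	}
	return 0;
}

int main(void)
{
	/* Valid: the first element ends exactly on a page boundary. */
	const struct seg good[] = { { 0x1000c00, 1024 }, { 0x2000000, 4096 } };
	/* Invalid: a 9216-byte element mapped at in-page offset 0 is not a
	 * whole number of pages, so the walk overshoots it by 3072 bytes. */
	const struct seg bad[]  = { { 0x3000000, 9216 }, { 0x4000000, 4096 },
				    { 0x5000000, 4096 } };

	printf("good: %d\n", prp_walk(good, 2, 1024 + 4096, 4096));       /* 0 */
	printf("bad:  %d\n", prp_walk(bad, 3, 9216 + 4096 + 4096, 4096)); /* -1 */
	return 0;
}

Anything the block layer or the DMA-mapping layer hands the driver that
violates that assumption trips the check.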


* kernel BUG at nvme/host/pci.c
  2017-07-10 19:08 ` Keith Busch
@ 2017-07-11  7:44   ` Andreas Pflug
  2017-07-11 19:45     ` Keith Busch
  0 siblings, 1 reply; 16+ messages in thread
From: Andreas Pflug @ 2017-07-11  7:44 UTC (permalink / raw)


On 10.07.17 at 21:08, Keith Busch wrote:
> On Mon, Jul 10, 2017 at 08:03:16PM +0200, Andreas Pflug wrote:
>> I'm running a patched (see below) Debian 4.9.30 kernel with Xen 4.8.1 on
>> Debian 9. When I start a specific virtual machine, the kernel very soon emits
>>
>>     kernel BUG at /usr/src/kernel/linux-4.9.30/drivers/nvme/host/pci.c:495!
>>
>> via netconsole to my logging host and becomes unstable until a hard reset.
>> Hardware is dual E5-2620v4 on a Supermicro X10DRI-T with two Samsung
>> MZQLW960HMJP-00003 NVMe disks (mdadm RAID-1) backing the VHDs (OS on a
>> separate SSD).
>>
>> The bug was reported to Debian as https://bugs.debian.org/866511 . Following
>> Ben Hutchings' advice, I patched the standard kernel with
>> 0001-swiotlb-ensure-that-page-sized-mappings-are-page-ali.patch since its
>> description sounded promising, but the bug remains.
> The BUG_ON means the nvme driver was given a scatter list that is invalid
> for the constraints the NVMe device was registered with. There have been
> issues in the past when NVMe is used with stacking devices like RAID,
> but I think they are all resolved. Would you happen to know if this
> succeeds with the 4.12 kernel? If so, I might be able to find the
> patch(es) for 4.9-stable; otherwise we'll need to fix it upstream first.

Tested with 4.12.0, result is

  kernel BUG at drivers/nvme/host/pci.c:610!

Kernel seems to recover from that, but I did a reboot anyway.

Log file attached.


Regards,

Andreas

-------------- next part --------------
Jul 11 09:37:28 xen2 [  110.002253] ------------[ cut here ]------------
Jul 11 09:37:28 xen2 [  110.002310] kernel BUG at drivers/nvme/host/pci.c:610!
Jul 11 09:37:28 xen2 [  110.002336] invalid opcode: 0000 [#1] SMP
Jul 11 09:37:28 xen2 [  110.002357] Modules linked in: xt_physdev br_netfilter iptable_filter xen_netback xen_blkback netconsole configfs bridge xen_gntdev xen_evtchn xenfs xen_privcmd dm_snapshot dm_bufio intel_rapl sb_edac x86_pkg_temp_thermal intel_powerclamp coretemp crct10dif_pclmul crc32_pclmul ghash_clmulni_intel pcbc aesni_intel aes_x86_64 iTCO_wdt crypto_simd iTCO_vendor_support glue_helper mxm_wmi cryptd snd_pcm snd_timer snd soundcore intel_rapl_perf pcspkr ast ttm e1000e drm_kms_helper joydev i2c_i801 ixgbe mei_me nvme drm ehci_pci ptp lpc_ich i2c_algo_bit sg mfd_core ehci_hcd mei pps_core nvme_core mdio ioatdma shpchp dca wmi acpi_power_meter 8021q garp mrp stp llc button ipmi_si ipmi_devintf ipmi_msghandler drbd lru_cache sunrpc ip_tables x_tables autofs4 ext4 crc16 jbd2 fscrypto mbcache raid10 raid456 libcrc32c
Jul 11 09:37:28 xen2 [  110.002638]  crc32c_generic async_raid6_recov async_memcpy async_pq async_xor xor async_tx evdev hid_generic usbhid hid raid6_pq raid0 multipath linear bcache dm_mod raid1 md_mod sd_mod crc32c_intel ahci libahci xhci_pci xhci_hcd libata usbcore scsi_mod
Jul 11 09:37:28 xen2 [  110.002746] CPU: 0 PID: 5522 Comm: 2.hda-0 Tainted: G        W       4.12.0pse #2
Jul 11 09:37:28 xen2 [  110.002775] Hardware name: Supermicro X10DRi/X10DRI-T, BIOS 2.1 09/13/2016
Jul 11 09:37:28 xen2 [  110.002807] task: ffff88015fb3e140 task.stack: ffffc90047b64000
Jul 11 09:37:28 xen2 [  110.002838] RIP: e030:nvme_queue_rq+0x644/0x7c0 [nvme]
Jul 11 09:37:28 xen2 [  110.002864] RSP: e02b:ffffc90047b67a10 EFLAGS: 00010286
Jul 11 09:37:28 xen2 [  110.002889] RAX: 0000000000000008 RBX: 00000000fffff400 RCX: 0000000000001000
Jul 11 09:37:28 xen2 [  110.002922] RDX: 0000000000000000 RSI: 0000000000000200 RDI: 0000000000000200
Jul 11 09:37:28 xen2 [  110.002954] RBP: 0000000000711000 R08: 0000000000001400 R09: ffff880171a82a00
Jul 11 09:37:28 xen2 [  110.002987] R10: 0000000000001000 R11: ffff880161316d00 R12: 0000000000006000
Jul 11 09:37:28 xen2 [  110.003019] R13: 0000000000000200 R14: ffff880161316d00 R15: 0000000000000002
Jul 11 09:37:28 xen2 [  110.003056] FS:  0000000000000000(0000) GS:ffff880186a00000(0000) knlGS:ffff880186a00000
Jul 11 09:37:28 xen2 [  110.003088] CS:  e033 DS: 0000 ES: 0000 CR0: 0000000080050033
Jul 11 09:37:28 xen2 [  110.003115] CR2: 00007fed265e5fe8 CR3: 000000016eec0000 CR4: 0000000000042660
Jul 11 09:37:28 xen2 [  110.003148] Call Trace:
Jul 11 09:37:28 xen2 [  110.003169]  ? blk_mq_dispatch_rq_list+0x201/0x400
Jul 11 09:37:28 xen2 [  110.003193]  ? blk_mq_flush_busy_ctxs+0xc1/0x120
Jul 11 09:37:28 xen2 [  110.003217]  ? blk_mq_sched_dispatch_requests+0x1b1/0x1e0
Jul 11 09:37:28 xen2 [  110.003243]  ? __blk_mq_delay_run_hw_queue+0x91/0xa0
Jul 11 09:37:28 xen2 [  110.003265]  ? blk_mq_flush_plug_list+0x184/0x260
Jul 11 09:37:28 xen2 [  110.003290]  ? blk_flush_plug_list+0xf2/0x280
Jul 11 09:37:28 xen2 [  110.003312]  ? blk_finish_plug+0x27/0x40
Jul 11 09:37:28 xen2 [  110.003335]  ? dispatch_rw_block_io+0x732/0x9c0 [xen_blkback]
Jul 11 09:37:28 xen2 [  110.003363]  ? __do_block_io_op+0x362/0x690 [xen_blkback]
Jul 11 09:37:28 xen2 [  110.003393]  ? _raw_spin_unlock_irqrestore+0x16/0x20
Jul 11 09:37:28 xen2 [  110.003415]  ? __do_block_io_op+0x362/0x690 [xen_blkback]
Jul 11 09:37:28 xen2 [  110.003442]  ? xen_blkif_schedule+0x116/0x7f0 [xen_blkback]
Jul 11 09:37:28 xen2 [  110.003469]  ? __schedule+0x3cd/0x850
Jul 11 09:37:28 xen2 [  110.003488]  ? remove_wait_queue+0x60/0x60
Jul 11 09:37:28 xen2 [  110.003511]  ? kthread+0xfc/0x130
Jul 11 09:37:28 xen2 [  110.003530]  ? xen_blkif_be_int+0x30/0x30 [xen_blkback]
Jul 11 09:37:28 xen2 [  110.003556]  ? kthread_create_on_node+0x70/0x70
Jul 11 09:37:28 xen2 [  110.003581]  ? do_group_exit+0x3a/0xa0
Jul 11 09:37:28 xen2 [  110.004573]  ? ret_from_fork+0x25/0x30
Jul 11 09:37:28 xen2 [  110.005560] Code: ff 4c 89 ef 89 54 24 20 89 4c 24 18 e8 66 e0 e9 c0 8b 54 24 20 48 89 44 24 10 4c 8b 48 10 44 8b 40 18 8b 4c 24 18 e9 74 fd ff ff <0f> 0b 49 8b 77 68 48 8b 3c 24 e8 8d b3 e8 c0 83 e8 01 74 55 41 
Jul 11 09:37:28 xen2 [  110.007650] RIP: nvme_queue_rq+0x644/0x7c0 [nvme] RSP: ffffc90047b67a10
Jul 11 09:37:28 xen2 [  110.008708] ---[ end trace ad956c9e07e27784 ]---
Jul 11 09:37:28 xen2 [  110.009061] systemd-journald[413]: Compressed data object 809 -> 751 using LZ4
Jul 11 09:37:32 xen2 [  113.693382] BUG: unable to handle kernel paging request at 0000010000030000
Jul 11 09:37:32 xen2 [  113.694614] IP: __list_add_valid+0xc/0x70
Jul 11 09:37:32 xen2 [  113.695634] PGD 0 
Jul 11 09:37:32 xen2 [  113.695635] P4D 0 
Jul 11 09:37:32 xen2 [  113.696613] 
Jul 11 09:37:32 xen2 [  113.698441] Oops: 0000 [#2] SMP
Jul 11 09:37:32 xen2 [  113.699307] Modules linked in: xt_physdev br_netfilter iptable_filter xen_netback xen_blkback netconsole configfs bridge xen_gntdev xen_evtchn xenfs xen_privcmd dm_snapshot dm_bufio intel_rapl sb_edac x86_pkg_temp_thermal intel_powerclamp coretemp crct10dif_pclmul crc32_pclmul ghash_clmulni_intel pcbc aesni_intel aes_x86_64 iTCO_wdt crypto_simd iTCO_vendor_support glue_helper mxm_wmi cryptd snd_pcm snd_timer snd soundcore intel_rapl_perf pcspkr ast ttm e1000e drm_kms_helper joydev i2c_i801 ixgbe mei_me nvme drm ehci_pci ptp lpc_ich i2c_algo_bit sg mfd_core ehci_hcd mei pps_core nvme_core mdio ioatdma shpchp dca wmi acpi_power_meter 8021q garp mrp stp llc button ipmi_si ipmi_devintf ipmi_msghandler drbd lru_cache sunrpc ip_tables x_tables autofs4 ext4 crc16 jbd2 fscrypto mbcache raid10 raid456 libcrc32c
Jul 11 09:37:32 xen2 [  113.705697]  crc32c_generic async_raid6_recov async_memcpy async_pq async_xor xor async_tx evdev hid_generic usbhid hid raid6_pq raid0 multipath linear bcache dm_mod raid1 md_mod sd_mod crc32c_intel ahci libahci xhci_pci xhci_hcd libata usbcore scsi_mod
Jul 11 09:37:32 xen2 [  113.707697] CPU: 11 PID: 106 Comm: xenwatch Tainted: G      D W       4.12.0pse #2
Jul 11 09:37:32 xen2 [  113.708720] Hardware name: Supermicro X10DRi/X10DRI-T, BIOS 2.1 09/13/2016
Jul 11 09:37:32 xen2 [  113.709754] task: ffff88017be3b040 task.stack: ffffc900466c4000
Jul 11 09:37:32 xen2 [  113.710794] RIP: e030:__list_add_valid+0xc/0x70
Jul 11 09:37:32 xen2 [  113.711838] RSP: e02b:ffffc900466c7c78 EFLAGS: 00010046
Jul 11 09:37:32 xen2 [  113.712890] RAX: ffff88016741bb48 RBX: ffff88016741bb40 RCX: 0000000000000000
Jul 11 09:37:32 xen2 [  113.713936] RDX: ffff88016741bb48 RSI: 0000010000030000 RDI: ffffc900466c7c98
Jul 11 09:37:32 xen2 [  113.714977] RBP: 0000010000030000 R08: 0000010000030000 R09: 0000000000000000
Jul 11 09:37:32 xen2 [  113.716015] R10: ffffc900466c7d50 R11: ffffffff81f333e0 R12: ffff88016741bb38
Jul 11 09:37:32 xen2 [  113.717055] R13: ffffc900466c7c98 R14: ffff88016741bb48 R15: ffff88017be90f38
Jul 11 09:37:32 xen2 [  113.718100] FS:  0000000000000000(0000) GS:ffff880186cc0000(0000) knlGS:ffff880186cc0000
Jul 11 09:37:32 xen2 [  113.719157] CS:  e033 DS: 0000 ES: 0000 CR0: 0000000080050033
Jul 11 09:37:32 xen2 [  113.720213] CR2: 0000010000030000 CR3: 000000015d554000 CR4: 0000000000042660
Jul 11 09:37:32 xen2 [  113.721283] Call Trace:
Jul 11 09:37:32 xen2 [  113.722350]  ? wait_for_completion+0xd1/0x190
Jul 11 09:37:32 xen2 [  113.723429]  ? wake_up_q+0x70/0x70
Jul 11 09:37:32 xen2 [  113.724497]  ? kthread_stop+0x43/0xf0
Jul 11 09:37:32 xen2 [  113.725581]  ? xen_blkif_disconnect+0x62/0x290 [xen_blkback]
Jul 11 09:37:32 xen2 [  113.726655]  ? xen_blkbk_remove+0x59/0xf0 [xen_blkback]
Jul 11 09:37:32 xen2 [  113.727724]  ? xenbus_dev_remove+0x4c/0xa0
Jul 11 09:37:32 xen2 [  113.728633]  ? device_release_driver_internal+0x154/0x210
Jul 11 09:37:32 xen2 [  113.729546]  ? bus_remove_device+0xf5/0x160
Jul 11 09:37:32 xen2 [  113.730461]  ? device_del+0x1cc/0x300
Jul 11 09:37:32 xen2 [  113.731526]  ? device_unregister+0x16/0x60
Jul 11 09:37:32 xen2 [  113.732436]  ? frontend_changed+0x9d/0x580 [xen_blkback]
Jul 11 09:37:32 xen2 [  113.733503]  ? xenbus_read_driver_state+0x39/0x60
Jul 11 09:37:32 xen2 [  113.734572]  ? prepare_to_wait_event+0x7a/0x150
Jul 11 09:37:32 xen2 [  113.735648]  ? xenwatch_thread+0xb7/0x150
Jul 11 09:37:32 xen2 [  113.736697]  ? remove_wait_queue+0x60/0x60
Jul 11 09:37:32 xen2 [  113.737721]  ? kthread+0xfc/0x130
Jul 11 09:37:32 xen2 [  113.738728]  ? find_watch+0x40/0x40
Jul 11 09:37:32 xen2 [  113.739723]  ? kthread_create_on_node+0x70/0x70
Jul 11 09:37:32 xen2 [  113.740569]  ? ret_from_fork+0x25/0x30
Jul 11 09:37:32 xen2 [  113.741542] Code: ff ff 48 89 e8 4c 8b 6c 24 10 48 83 c8 01 e9 0d ff ff ff e8 87 f9 d0 ff 0f 1f 80 00 00 00 00 4c 8b 42 08 48 89 d0 49 39 f0 75 18 <49> 8b 10 48 39 d0 75 27 49 39 f8 74 39 48 39 f8 74 34 b8 01 00 
Jul 11 09:37:32 xen2 [  113.743589] RIP: __list_add_valid+0xc/0x70 RSP: ffffc900466c7c78
Jul 11 09:37:32 xen2 [  113.744631] CR2: 0000010000030000
Jul 11 09:37:32 xen2 [  113.745682] ---[ end trace ad956c9e07e27785 ]---


* kernel BUG at nvme/host/pci.c
  2017-07-11 19:45     ` Keith Busch
@ 2017-07-11 19:44       ` Scott Bauer
  2017-07-12  6:06       ` Andreas Pflug
  1 sibling, 0 replies; 16+ messages in thread
From: Scott Bauer @ 2017-07-11 19:44 UTC (permalink / raw)


On Tue, Jul 11, 2017 at 03:45:24PM -0400, Keith Busch wrote:
> On Tue, Jul 11, 2017 at 09:44:47AM +0200, Andreas Pflug wrote:
> > Tested with 4.12.0, result is
> > 
> >   kernel BUG at drivers/nvme/host/pci.c:610!
> > 
> > Kernel seems to recover from that, but I did a reboot anyway.
>  
> Ugh, still observing invalid scatter lists on 4.12. Definitely recommend
> rebooting after hitting this.
> 
> There should only be two possibilities: either the block layer didn't
> split a bio that it should have, or it merged two that it shouldn't. To
> determine which, could you disable merging for NVMe before running your
> test? Something like this should accomplish that:
> 
>   # echo 1 | tee /sys/block/nvme*/queue/nomerges
> 
> On a side note, I think we should make the BUG_ON a WARN_ON, and return
> an IO error. While it'd fail IO, it should leave the system stable to
> do more prodding.

FWIW, I agree that switching the BUG_ON to a WARN_ON is a good idea.
Linus has ranted numerous times about BUG_ON'ing when a WARN_ON will do:
http://lkml.iu.edu/hypermail/linux/kernel/1610.0/00878.html


* kernel BUG at nvme/host/pci.c
  2017-07-11  7:44   ` Andreas Pflug
@ 2017-07-11 19:45     ` Keith Busch
  2017-07-11 19:44       ` Scott Bauer
  2017-07-12  6:06       ` Andreas Pflug
  0 siblings, 2 replies; 16+ messages in thread
From: Keith Busch @ 2017-07-11 19:45 UTC (permalink / raw)


On Tue, Jul 11, 2017 at 09:44:47AM +0200, Andreas Pflug wrote:
> Tested with 4.12.0, result is
> 
>   kernel BUG at drivers/nvme/host/pci.c:610!
> 
> Kernel seems to recover from that, but I did a reboot anyway.
 
Ugh, still observing invalid scatter lists on 4.12. Definitely recommend
rebooting after hitting this.

There should only be two possibilities: either the block layer didn't
split a bio that it should have, or it merged two that it shouldn't. To
determine which, could you disable merging for NVMe before running your
test? Something like this should accomplish that:

  # echo 1 | tee /sys/block/nvme*/queue/nomerges

On a side note, I think we should make the BUG_ON a WARN_ON, and return
an IO error. While it'd fail IO, it should leave the system stable to
do more prodding.

 
> Jul 11 09:37:28 xen2 [  110.002253] ------------[ cut here ]------------
> Jul 11 09:37:28 xen2 [  110.002310] kernel BUG at drivers/nvme/host/pci.c:610!
> Jul 11 09:37:28 xen2 [  110.002336] invalid opcode: 0000 [#1] SMP
> Jul 11 09:37:28 xen2 [  110.002357] Modules linked in: xt_physdev br_netfilter iptable_filter xen_netback xen_blkback netconsole configfs bridge xen_gntdev xen_evtchn xenfs xen_privcmd dm_snapshot dm_bufio intel_rapl sb_edac x86_pkg_temp_thermal intel_powerclamp coretemp crct10dif_pclmul crc32_pclmul ghash_clmulni_intel pcbc aesni_intel aes_x86_64 iTCO_wdt crypto_simd iTCO_vendor_support glue_helper mxm_wmi cryptd snd_pcm snd_timer snd soundcore intel_rapl_perf pcspkr ast ttm e1000e drm_kms_helper joydev i2c_i801 ixgbe mei_me nvme drm ehci_pci ptp lpc_ich i2c_algo_bit sg mfd_core ehci_hcd mei pps_core nvme_core mdio ioatdma shpchp dca wmi acpi_power_meter 8021q garp mrp stp llc button ipmi_si ipmi_devintf ipmi_msghandler drbd lru_cache sunrpc ip_tables x_tables autofs4 ext4 crc16 jbd2 fscrypto mbcache raid10 raid456 libcrc32c
> Jul 11 09:37:28 xen2 [  110.002638]  crc32c_generic async_raid6_recov async_memcpy async_pq async_xor xor async_tx evdev hid_generic usbhid hid raid6_pq raid0 multipath linear bcache dm_mod raid1 md_mod sd_mod crc32c_intel ahci libahci xhci_pci xhci_hcd libata usbcore scsi_mod
> Jul 11 09:37:28 xen2 [  110.002746] CPU: 0 PID: 5522 Comm: 2.hda-0 Tainted: G        W       4.12.0pse #2
> Jul 11 09:37:28 xen2 [  110.002775] Hardware name: Supermicro X10DRi/X10DRI-T, BIOS 2.1 09/13/2016
> Jul 11 09:37:28 xen2 [  110.002807] task: ffff88015fb3e140 task.stack: ffffc90047b64000
> Jul 11 09:37:28 xen2 [  110.002838] RIP: e030:nvme_queue_rq+0x644/0x7c0 [nvme]
> Jul 11 09:37:28 xen2 [  110.002864] RSP: e02b:ffffc90047b67a10 EFLAGS: 00010286
> Jul 11 09:37:28 xen2 [  110.002889] RAX: 0000000000000008 RBX: 00000000fffff400 RCX: 0000000000001000
> Jul 11 09:37:28 xen2 [  110.002922] RDX: 0000000000000000 RSI: 0000000000000200 RDI: 0000000000000200
> Jul 11 09:37:28 xen2 [  110.002954] RBP: 0000000000711000 R08: 0000000000001400 R09: ffff880171a82a00
> Jul 11 09:37:28 xen2 [  110.002987] R10: 0000000000001000 R11: ffff880161316d00 R12: 0000000000006000
> Jul 11 09:37:28 xen2 [  110.003019] R13: 0000000000000200 R14: ffff880161316d00 R15: 0000000000000002
> Jul 11 09:37:28 xen2 [  110.003056] FS:  0000000000000000(0000) GS:ffff880186a00000(0000) knlGS:ffff880186a00000
> Jul 11 09:37:28 xen2 [  110.003088] CS:  e033 DS: 0000 ES: 0000 CR0: 0000000080050033
> Jul 11 09:37:28 xen2 [  110.003115] CR2: 00007fed265e5fe8 CR3: 000000016eec0000 CR4: 0000000000042660
> Jul 11 09:37:28 xen2 [  110.003148] Call Trace:
> Jul 11 09:37:28 xen2 [  110.003169]  ? blk_mq_dispatch_rq_list+0x201/0x400
> Jul 11 09:37:28 xen2 [  110.003193]  ? blk_mq_flush_busy_ctxs+0xc1/0x120
> Jul 11 09:37:28 xen2 [  110.003217]  ? blk_mq_sched_dispatch_requests+0x1b1/0x1e0
> Jul 11 09:37:28 xen2 [  110.003243]  ? __blk_mq_delay_run_hw_queue+0x91/0xa0
> Jul 11 09:37:28 xen2 [  110.003265]  ? blk_mq_flush_plug_list+0x184/0x260
> Jul 11 09:37:28 xen2 [  110.003290]  ? blk_flush_plug_list+0xf2/0x280
> Jul 11 09:37:28 xen2 [  110.003312]  ? blk_finish_plug+0x27/0x40
> Jul 11 09:37:28 xen2 [  110.003335]  ? dispatch_rw_block_io+0x732/0x9c0 [xen_blkback]
> Jul 11 09:37:28 xen2 [  110.003363]  ? __do_block_io_op+0x362/0x690 [xen_blkback]
> Jul 11 09:37:28 xen2 [  110.003393]  ? _raw_spin_unlock_irqrestore+0x16/0x20
> Jul 11 09:37:28 xen2 [  110.003415]  ? __do_block_io_op+0x362/0x690 [xen_blkback]
> Jul 11 09:37:28 xen2 [  110.003442]  ? xen_blkif_schedule+0x116/0x7f0 [xen_blkback]
> Jul 11 09:37:28 xen2 [  110.003469]  ? __schedule+0x3cd/0x850
> Jul 11 09:37:28 xen2 [  110.003488]  ? remove_wait_queue+0x60/0x60
> Jul 11 09:37:28 xen2 [  110.003511]  ? kthread+0xfc/0x130
> Jul 11 09:37:28 xen2 [  110.003530]  ? xen_blkif_be_int+0x30/0x30 [xen_blkback]
> Jul 11 09:37:28 xen2 [  110.003556]  ? kthread_create_on_node+0x70/0x70
> Jul 11 09:37:28 xen2 [  110.003581]  ? do_group_exit+0x3a/0xa0
> Jul 11 09:37:28 xen2 [  110.004573]  ? ret_from_fork+0x25/0x30
> Jul 11 09:37:28 xen2 [  110.005560] Code: ff 4c 89 ef 89 54 24 20 89 4c 24 18 e8 66 e0 e9 c0 8b 54 24 20 48 89 44 24 10 4c 8b 48 10 44 8b 40 18 8b 4c 24 18 e9 74 fd ff ff <0f> 0b 49 8b 77 68 48 8b 3c 24 e8 8d b3 e8 c0 83 e8 01 74 55 41 
> Jul 11 09:37:28 xen2 [  110.007650] RIP: nvme_queue_rq+0x644/0x7c0 [nvme] RSP: ffffc90047b67a10
> Jul 11 09:37:28 xen2 [  110.008708] ---[ end trace ad956c9e07e27784 ]---


* kernel BUG at nvme/host/pci.c
  2017-07-11 19:45     ` Keith Busch
  2017-07-11 19:44       ` Scott Bauer
@ 2017-07-12  6:06       ` Andreas Pflug
  2017-07-12 19:50         ` Keith Busch
  1 sibling, 1 reply; 16+ messages in thread
From: Andreas Pflug @ 2017-07-12  6:06 UTC (permalink / raw)


On 11.07.17 at 21:45, Keith Busch wrote:
> On Tue, Jul 11, 2017 at 09:44:47AM +0200, Andreas Pflug wrote:
>> Tested with 4.12.0, result is
>>
>>   kernel BUG at drivers/nvme/host/pci.c:610!
>>
>> Kernel seems to recover from that, but I did a reboot anyway.
>  
> Ugh, still observing invalid scatter lists on 4.12. Definitely recommend
> rebooting after hitting this.
>
> There should only be two possibilities: either the block layer didn't
> split a bio that it should have, or it merged two that it shouldn't. To
> determine which, could you disable merging for NVMe before running your
> test? Something like this should accomplish that:
>
>   # echo 1 | tee /sys/block/nvme*/queue/nomerges
nomerges set to 1 on both devices, same BUG_ON.

Regards,
Andreas


* kernel BUG at nvme/host/pci.c
  2017-07-12  6:06       ` Andreas Pflug
@ 2017-07-12 19:50         ` Keith Busch
  2017-07-13  8:46           ` Andreas Pflug
  0 siblings, 1 reply; 16+ messages in thread
From: Keith Busch @ 2017-07-12 19:50 UTC (permalink / raw)


On Wed, Jul 12, 2017 at 08:06:29AM +0200, Andreas Pflug wrote:
> nomerges set to 1 on both devices, same BUG_ON.

Thanks for the info.

Could you possibly recreate this with the patch below? It will simply
return an IO error rather than panic, and show exactly how the invalid SGL
is constructed.

The block layer takes into account all the cases I can think of that might
break NVMe, so these details should help explain how we got here.

I'll send this as a proper patch for upstream consideration as well.

---
diff --git a/drivers/nvme/host/pci.c b/drivers/nvme/host/pci.c
index c4343c4..8cb3e89 100644
--- a/drivers/nvme/host/pci.c
+++ b/drivers/nvme/host/pci.c
@@ -533,7 +533,7 @@ static void nvme_dif_complete(u32 p, u32 v, struct t10_pi_tuple *pi)
 }
 #endif
 
-static bool nvme_setup_prps(struct nvme_dev *dev, struct request *req)
+static blk_status_t nvme_setup_prps(struct nvme_dev *dev, struct request *req)
 {
 	struct nvme_iod *iod = blk_mq_rq_to_pdu(req);
 	struct dma_pool *pool;
@@ -550,7 +550,7 @@ static bool nvme_setup_prps(struct nvme_dev *dev, struct request *req)
 
 	length -= (page_size - offset);
 	if (length <= 0)
-		return true;
+		return BLK_STS_OK;
 
 	dma_len -= (page_size - offset);
 	if (dma_len) {
@@ -563,7 +563,7 @@ static bool nvme_setup_prps(struct nvme_dev *dev, struct request *req)
 
 	if (length <= page_size) {
 		iod->first_dma = dma_addr;
-		return true;
+		return BLK_STS_OK;
 	}
 
 	nprps = DIV_ROUND_UP(length, page_size);
@@ -579,7 +579,7 @@ static bool nvme_setup_prps(struct nvme_dev *dev, struct request *req)
 	if (!prp_list) {
 		iod->first_dma = dma_addr;
 		iod->npages = -1;
-		return false;
+		return BLK_STS_RESOURCE;
 	}
 	list[0] = prp_list;
 	iod->first_dma = prp_dma;
@@ -589,7 +589,7 @@ static bool nvme_setup_prps(struct nvme_dev *dev, struct request *req)
 			__le64 *old_prp_list = prp_list;
 			prp_list = dma_pool_alloc(pool, GFP_ATOMIC, &prp_dma);
 			if (!prp_list)
-				return false;
+				return BLK_STS_RESOURCE;
 			list[iod->npages++] = prp_list;
 			prp_list[0] = old_prp_list[i - 1];
 			old_prp_list[i - 1] = cpu_to_le64(prp_dma);
@@ -603,13 +603,29 @@ static bool nvme_setup_prps(struct nvme_dev *dev, struct request *req)
 			break;
 		if (dma_len > 0)
 			continue;
-		BUG_ON(dma_len < 0);
+		if (unlikely(dma_len < 0))
+			goto bad_sgl;
 		sg = sg_next(sg);
 		dma_addr = sg_dma_address(sg);
 		dma_len = sg_dma_len(sg);
 	}
 
-	return true;
+	return BLK_STS_OK;
+
+ bad_sgl:
+	if (WARN_ONCE(1, "Invalid SGL for payload:%d nents:%d\n",
+				blk_rq_payload_bytes(req), iod->nents)) {
+		for_each_sg(iod->sg, sg, iod->nents, i) {
+			dma_addr_t phys = sg_phys(sg);
+			printk("sg[%d] phys_addr:%pad offset:%d length:%d "
+			       "dma_address:%pad dma_length:%d\n", i, &phys,
+					sg->offset, sg->length,
+					&sg_dma_address(sg),
+					sg_dma_len(sg));
+		}
+	}
+	return BLK_STS_IOERR;
+
 }
 
 static blk_status_t nvme_map_data(struct nvme_dev *dev, struct request *req,
@@ -631,7 +647,8 @@ static blk_status_t nvme_map_data(struct nvme_dev *dev, struct request *req,
 				DMA_ATTR_NO_WARN))
 		goto out;
 
-	if (!nvme_setup_prps(dev, req))
+	ret = nvme_setup_prps(dev, req);
+	if (ret != BLK_STS_OK)
 		goto out_unmap;
 
 	ret = BLK_STS_IOERR;
--


* kernel BUG at nvme/host/pci.c
  2017-07-12 19:50         ` Keith Busch
@ 2017-07-13  8:46           ` Andreas Pflug
  2017-07-13  9:00             ` Sagi Grimberg
  2017-07-13 13:47             ` Keith Busch
  0 siblings, 2 replies; 16+ messages in thread
From: Andreas Pflug @ 2017-07-13  8:46 UTC (permalink / raw)


On 12.07.17 at 21:50, Keith Busch wrote:
> On Wed, Jul 12, 2017 at 08:06:29AM +0200, Andreas Pflug wrote:
>> nomerges set to 1 on both devices, same BUG_ON.
> Thanks for the info.
>
> Could you possibly recreate this with the patch below? It will simply
> return an IO error rather than panic, and show exactly how the invalid SGL
> is constructed.
The patch won't compile against 4.12.0, since BLK_STS_* and blk_status_t
aren't present there. I got the latest sources from git, applied the patch
and got "Invalid SGL for payload:36864 nents:7". The system kept complaining
loudly about I/O errors on NVMe, so I rebooted.

Log attached.

Regards,
Andreas
-------------- next part --------------
Jul 13 10:37:37 xen2 [  202.688278] Invalid SGL for payload:36864 nents:7
Jul 13 10:37:37 xen2 [  202.688342] ------------[ cut here ]------------
Jul 13 10:37:37 xen2 [  202.688374] WARNING: CPU: 0 PID: 6970 at drivers/nvme/host/pci.c:623 nvme_queue_rq+0x81b/0x840 [nvme]
Jul 13 10:37:37 xen2 [  202.688413] Modules linked in: xt_physdev br_netfilter iptable_filter xen_netback xen_blkback netconsole configfs bridge xen_gntdev xen_evtchn xenfs xen_privcmd intel_rapl iTCO_wdt iTCO_vendor_support mxm_wmi x86_pkg_temp_thermal intel_powerclamp coretemp crct10dif_pclmul crc32_pclmul ghash_clmulni_intel pcbc aesni_intel aes_x86_64 crypto_simd glue_helper cryptd intel_rapl_perf snd_pcm snd_timer snd soundcore pcspkr i2c_i801 ast ttm drm_kms_helper sg joydev drm i2c_algo_bit lpc_ich mfd_core ehci_pci ehci_hcd mei_me mei e1000e ixgbe ptp nvme pps_core nvme_core mdio ioatdma shpchp dca wmi acpi_power_meter 8021q garp mrp stp llc button ipmi_si ipmi_devintf ipmi_msghandler drbd lru_cache sunrpc ip_tables x_tables autofs4 ext4 crc16 mbcache jbd2 fscrypto raid10 raid456 libcrc32c crc32c_generic async_raid6_recov
Jul 13 10:37:37 xen2 [  202.688695]  async_memcpy async_pq async_xor xor async_tx evdev hid_generic usbhid hid raid6_pq raid0 multipath linear bcache dm_mod raid1 md_mod sd_mod crc32c_intel ahci libahci xhci_pci xhci_hcd libata usbcore scsi_mod
Jul 13 10:37:37 xen2 [  202.688780] CPU: 0 PID: 6970 Comm: 2.hda-0 Tainted: G        W       4.12.0-20170713+ #1
Jul 13 10:37:37 xen2 [  202.688817] Hardware name: Supermicro X10DRi/X10DRI-T, BIOS 2.1 09/13/2016
Jul 13 10:37:37 xen2 [  202.688850] task: ffff880179ef5080 task.stack: ffffc9004874c000
Jul 13 10:37:37 xen2 [  202.688876] RIP: e030:nvme_queue_rq+0x81b/0x840 [nvme]
Jul 13 10:37:37 xen2 [  202.688899] RSP: e02b:ffffc9004874fa00 EFLAGS: 00010286
Jul 13 10:37:37 xen2 [  202.688925] RAX: 0000000000000025 RBX: 00000000fffff400 RCX: 0000000000000000
Jul 13 10:37:37 xen2 [  202.688954] RDX: 0000000000000000 RSI: ffff880186a0de98 RDI: ffff880186a0de98
Jul 13 10:37:37 xen2 [  202.688988] RBP: ffff88017b50f000 R08: 0000000000000001 R09: 00000000000009e7
Jul 13 10:37:37 xen2 [  202.689021] R10: 0000000000001000 R11: 0000000000000001 R12: 0000000000000200
Jul 13 10:37:37 xen2 [  202.689053] R13: 0000000000001000 R14: ffff880160134600 R15: ffff880170d91800
Jul 13 10:37:37 xen2 [  202.689091] FS:  0000000000000000(0000) GS:ffff880186a00000(0000) knlGS:ffff880186a00000
Jul 13 10:37:37 xen2 [  202.689128] CS:  e033 DS: 0000 ES: 0000 CR0: 0000000080050033
Jul 13 10:37:37 xen2 [  202.689155] CR2: 00007f91740959a8 CR3: 0000000161716000 CR4: 0000000000042660
Jul 13 10:37:37 xen2 [  202.689189] Call Trace:
Jul 13 10:37:37 xen2 [  202.689210]  ? __sbitmap_get_word+0x2a/0x80
Jul 13 10:37:37 xen2 [  202.689235]  ? blk_mq_dispatch_rq_list+0x200/0x3d0
Jul 13 10:37:37 xen2 [  202.689257]  ? blk_mq_flush_busy_ctxs+0xd1/0x120
Jul 13 10:37:37 xen2 [  202.689279]  ? blk_mq_sched_dispatch_requests+0x1c0/0x1f0
Jul 13 10:37:37 xen2 [  202.689306]  ? __blk_mq_delay_run_hw_queue+0x8f/0xa0
Jul 13 10:37:37 xen2 [  202.689328]  ? blk_mq_flush_plug_list+0x184/0x260
Jul 13 10:37:37 xen2 [  202.689353]  ? blk_flush_plug_list+0xf2/0x280
Jul 13 10:37:37 xen2 [  202.689376]  ? blk_finish_plug+0x27/0x40
Jul 13 10:37:37 xen2 [  202.689400]  ? dispatch_rw_block_io+0x732/0x9c0 [xen_blkback]
Jul 13 10:37:37 xen2 [  202.690364]  ? __do_block_io_op+0x362/0x690 [xen_blkback]
Jul 13 10:37:37 xen2 [  202.691408]  ? _raw_spin_unlock_irqrestore+0x16/0x20
Jul 13 10:37:37 xen2 [  202.692440]  ? __do_block_io_op+0x362/0x690 [xen_blkback]
Jul 13 10:37:37 xen2 [  202.693420]  ? xen_blkif_schedule+0x116/0x7f0 [xen_blkback]
Jul 13 10:37:37 xen2 [  202.694361]  ? __schedule+0x3cd/0x850
Jul 13 10:37:37 xen2 [  202.695410]  ? remove_wait_queue+0x60/0x60
Jul 13 10:37:37 xen2 [  202.696432]  ? kthread+0xfc/0x130
Jul 13 10:37:37 xen2 [  202.697377]  ? xen_blkif_be_int+0x30/0x30 [xen_blkback]
Jul 13 10:37:37 xen2 [  202.698290]  ? kthread_create_on_node+0x70/0x70
Jul 13 10:37:37 xen2 [  202.699293]  ? do_group_exit+0x3a/0xa0
Jul 13 10:37:37 xen2 [  202.700206]  ? ret_from_fork+0x25/0x30
Jul 13 10:37:37 xen2 [  202.701184] Code: f9 ff ff 41 f6 47 4a 04 c6 05 7a 3e 00 00 01 41 8b 97 70 01 00 00 74 28 41 8b b7 90 00 00 00 48 c7 c7 b8 17 54 c0 e8 40 14 b9 c0 <0f> ff e9 4d fe ff ff 0f 0b 4c 8b 2d c5 05 6e c1 e9 53 ff ff ff 
Jul 13 10:37:37 xen2 [  202.703231] ---[ end trace 5b778353298dbe78 ]---
Jul 13 10:37:37 xen2 [  202.704217] sg[0] phys_addr:0x0000000aff50ec00 offset:3072 length:9216 dma_address:0x000000000070f000 dma_length:9216
Jul 13 10:37:37 xen2 [  202.705197] sg[1] phys_addr:0x0000000aff511000 offset:0 length:4096 dma_address:0x00000008755a1000 dma_length:4096
Jul 13 10:37:37 xen2 [  202.706275] sg[2] phys_addr:0x0000000aff5ef000 offset:0 length:8192 dma_address:0x0000000000712000 dma_length:8192
Jul 13 10:37:37 xen2 [  202.707315] sg[3] phys_addr:0x0000000aff564000 offset:0 length:4096 dma_address:0x0000000874fc0000 dma_length:4096
Jul 13 10:37:37 xen2 [  202.708202] sg[4] phys_addr:0x0000000aff5a7000 offset:0 length:4096 dma_address:0x0000000874fc0000 dma_length:4096
Jul 13 10:37:37 xen2 [  202.709030] sg[5] phys_addr:0x0000000aff5a6000 offset:0 length:4096 dma_address:0x0000000874fc0000 dma_length:4096
Jul 13 10:37:37 xen2 [  202.709960] sg[6] phys_addr:0x0000000aff5a5000 offset:0 length:3072 dma_address:0x0000000874fc0000 dma_length:3072
Jul 13 10:37:37 xen2 [  202.710755] print_req_error: I/O error, dev nvme0n1, sector 1188548943
Jul 13 10:37:37 xen2 [  202.711527] md/raid1:md1: nvme0n1p1: rescheduling sector 1188284751
Jul 13 10:37:37 xen2 [  202.712926] sg[0] phys_addr:0x0000000aff50ec00 offset:3072 length:9216 dma_address:0x0000000000716000 dma_length:9216
Jul 13 10:37:37 xen2 [  202.712928] sg[0] phys_addr:0x0000000aff559c00 offset:3072 length:17408 dma_address:0x000000000071b000 dma_length:17408
Jul 13 10:37:37 xen2 [  202.712931] sg[1] phys_addr:0x0000000aff5f5000 offset:0 length:4096 dma_address:0x0000000874fc0000 dma_length:4096
Jul 13 10:37:37 xen2 [  202.712932] sg[2] phys_addr:0x0000000aff586000 offset:0 length:4096 dma_address:0x0000000874fc0000 dma_length:4096


* kernel BUG at nvme/host/pci.c
  2017-07-13  8:46           ` Andreas Pflug
@ 2017-07-13  9:00             ` Sagi Grimberg
  2017-07-13 13:47             ` Keith Busch
  1 sibling, 0 replies; 16+ messages in thread
From: Sagi Grimberg @ 2017-07-13  9:00 UTC (permalink / raw)


>>> nomerges set to 1 on both devices, same BUG_ON.
>> Thanks for the info.
>>
>> Could you possibly recreate this with the patch below? It will simply
>> return an IO error rather than panic, and show exactly how the invalid SGL
>> is constructed.
> The patch won't compile against 4.12.0, since BLK_STS_* and blk_status_t
> aren't present there. I got the latest sources from git, applied the patch
> and got "Invalid SGL for payload:36864 nents:7". The system kept complaining
> loudly about I/O errors on NVMe, so I rebooted.

I think if we iterated over the SGL entries, logging their dma_address,
offset and length, it'd be more useful.


* kernel BUG at nvme/host/pci.c
  2017-07-13  8:46           ` Andreas Pflug
  2017-07-13  9:00             ` Sagi Grimberg
@ 2017-07-13 13:47             ` Keith Busch
  2017-07-14 16:47               ` Andreas Pflug
  1 sibling, 1 reply; 16+ messages in thread
From: Keith Busch @ 2017-07-13 13:47 UTC (permalink / raw)


On Thu, Jul 13, 2017 at 10:46:27AM +0200, Andreas Pflug wrote:
> On 12.07.17 at 21:50, Keith Busch wrote:
> The patch won't compile against 4.12.0, since BLK_STS_* and blk_status_t
> aren't present there. I got the latest sources from git, applied the patch
> and got "Invalid SGL for payload:36864 nents:7". The system kept complaining
> loudly about I/O errors on NVMe, so I rebooted.

Thanks for getting this. Exactly what we needed, and IO errors would be
expected in your scenario.

> Jul 13 10:37:37 xen2 [  202.688278] Invalid SGL for payload:36864 nents:7
> Jul 13 10:37:37 xen2 [  202.688342] ------------[ cut here ]------------

<snip>

> Jul 13 10:37:37 xen2 [  202.703231] ---[ end trace 5b778353298dbe78 ]---
> Jul 13 10:37:37 xen2 [  202.704217] sg[0] phys_addr:0x0000000aff50ec00 offset:3072 length:9216 dma_address:0x000000000070f000 dma_length:9216
> Jul 13 10:37:37 xen2 [  202.705197] sg[1] phys_addr:0x0000000aff511000 offset:0 length:4096 dma_address:0x00000008755a1000 dma_length:4096
> Jul 13 10:37:37 xen2 [  202.706275] sg[2] phys_addr:0x0000000aff5ef000 offset:0 length:8192 dma_address:0x0000000000712000 dma_length:8192
> Jul 13 10:37:37 xen2 [  202.707315] sg[3] phys_addr:0x0000000aff564000 offset:0 length:4096 dma_address:0x0000000874fc0000 dma_length:4096
> Jul 13 10:37:37 xen2 [  202.708202] sg[4] phys_addr:0x0000000aff5a7000 offset:0 length:4096 dma_address:0x0000000874fc0000 dma_length:4096
> Jul 13 10:37:37 xen2 [  202.709030] sg[5] phys_addr:0x0000000aff5a6000 offset:0 length:4096 dma_address:0x0000000874fc0000 dma_length:4096
> Jul 13 10:37:37 xen2 [  202.709960] sg[6] phys_addr:0x0000000aff5a5000 offset:0 length:3072 dma_address:0x0000000874fc0000 dma_length:3072
> Jul 13 10:37:37 xen2 [  202.710755] print_req_error: I/O error, dev nvme0n1, sector 1188548943
> Jul 13 10:37:37 xen2 [  202.711527] md/raid1:md1: nvme0n1p1: rescheduling sector 1188284751

The first SGL entry has phys addr aff50ec00, which is a page offset of 3072,
but its dma addr is 70f000, which is offset 0. Since the DMA page offset
doesn't match the physical one, this isn't compatible with the
nvme implementation.
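
A quick check of the numbers (the constants below are copied from the sg[0]
line in the log above; illustration only):

#include <stdio.h>
#include <stdint.h>

int main(void)
{
	const uint64_t page_mask = 4096 - 1;
	const uint64_t phys = 0xaff50ec00ULL;	/* sg[0] phys_addr    */
	const uint64_t dma  = 0x70f000ULL;	/* sg[0] dma_address  */

	printf("phys in-page offset: %llu\n",
	       (unsigned long long)(phys & page_mask));	/* prints 3072 */
	printf("dma  in-page offset: %llu\n",
	       (unsigned long long)(dma & page_mask));	/* prints 0    */
	return 0;
}

With an in-page offset of 0 but a dma_length of 9216 (not a multiple of
4096), the PRP walk runs 3072 bytes past the end of sg[0] before it ever
reaches sg[1], which is exactly where the old BUG_ON fired.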


* kernel BUG at nvme/host/pci.c
  2017-07-13 13:47             ` Keith Busch
@ 2017-07-14 16:47               ` Andreas Pflug
  2017-07-14 17:08                 ` Keith Busch
  0 siblings, 1 reply; 16+ messages in thread
From: Andreas Pflug @ 2017-07-14 16:47 UTC (permalink / raw)


On 13.07.17 at 15:47, Keith Busch wrote:
>
>> Jul 13 10:37:37 xen2 [  202.703231] ---[ end trace 5b778353298dbe78 ]---
>> Jul 13 10:37:37 xen2 [  202.704217] sg[0] phys_addr:0x0000000aff50ec00 offset:3072 length:9216 dma_address:0x000000000070f000 dma_length:9216
>> Jul 13 10:37:37 xen2 [  202.705197] sg[1] phys_addr:0x0000000aff511000 offset:0 length:4096 dma_address:0x00000008755a1000 dma_length:4096
>> Jul 13 10:37:37 xen2 [  202.706275] sg[2] phys_addr:0x0000000aff5ef000 offset:0 length:8192 dma_address:0x0000000000712000 dma_length:8192
>> Jul 13 10:37:37 xen2 [  202.707315] sg[3] phys_addr:0x0000000aff564000 offset:0 length:4096 dma_address:0x0000000874fc0000 dma_length:4096
>> Jul 13 10:37:37 xen2 [  202.708202] sg[4] phys_addr:0x0000000aff5a7000 offset:0 length:4096 dma_address:0x0000000874fc0000 dma_length:4096
>> Jul 13 10:37:37 xen2 [  202.709030] sg[5] phys_addr:0x0000000aff5a6000 offset:0 length:4096 dma_address:0x0000000874fc0000 dma_length:4096
>> Jul 13 10:37:37 xen2 [  202.709960] sg[6] phys_addr:0x0000000aff5a5000 offset:0 length:3072 dma_address:0x0000000874fc0000 dma_length:3072
>> Jul 13 10:37:37 xen2 [  202.710755] print_req_error: I/O error, dev nvme0n1, sector 1188548943
>> Jul 13 10:37:37 xen2 [  202.711527] md/raid1:md1: nvme0n1p1: rescheduling sector 1188284751
> The first SGL entry has phys addr aff50ec00, which is a page offset of 3072,
> but its dma addr is 70f000, which is offset 0. Since the DMA page offset
> doesn't match the physical one, this isn't compatible with the
> nvme implementation.
So LVM2 backed by md raid1 isn't compatible with newer hardware... Any
suggestions?

Regards,
Andreas


* kernel BUG at nvme/host/pci.c
  2017-07-14 16:47               ` Andreas Pflug
@ 2017-07-14 17:08                 ` Keith Busch
  2017-07-15  8:51                     ` Christoph Hellwig
  0 siblings, 1 reply; 16+ messages in thread
From: Keith Busch @ 2017-07-14 17:08 UTC (permalink / raw)


On Fri, Jul 14, 2017 at 06:47:43PM +0200, Andreas Pflug wrote:
> On 13.07.17 at 15:47, Keith Busch wrote:
> >
> >> Jul 13 10:37:37 xen2 [  202.703231] ---[ end trace 5b778353298dbe78 ]---
> >> Jul 13 10:37:37 xen2 [  202.704217] sg[0] phys_addr:0x0000000aff50ec00 offset:3072 length:9216 dma_address:0x000000000070f000 dma_length:9216
> >> Jul 13 10:37:37 xen2 [  202.705197] sg[1] phys_addr:0x0000000aff511000 offset:0 length:4096 dma_address:0x00000008755a1000 dma_length:4096
> >> Jul 13 10:37:37 xen2 [  202.706275] sg[2] phys_addr:0x0000000aff5ef000 offset:0 length:8192 dma_address:0x0000000000712000 dma_length:8192
> >> Jul 13 10:37:37 xen2 [  202.707315] sg[3] phys_addr:0x0000000aff564000 offset:0 length:4096 dma_address:0x0000000874fc0000 dma_length:4096
> >> Jul 13 10:37:37 xen2 [  202.708202] sg[4] phys_addr:0x0000000aff5a7000 offset:0 length:4096 dma_address:0x0000000874fc0000 dma_length:4096
> >> Jul 13 10:37:37 xen2 [  202.709030] sg[5] phys_addr:0x0000000aff5a6000 offset:0 length:4096 dma_address:0x0000000874fc0000 dma_length:4096
> >> Jul 13 10:37:37 xen2 [  202.709960] sg[6] phys_addr:0x0000000aff5a5000 offset:0 length:3072 dma_address:0x0000000874fc0000 dma_length:3072
> >> Jul 13 10:37:37 xen2 [  202.710755] print_req_error: I/O error, dev nvme0n1, sector 1188548943
> >> Jul 13 10:37:37 xen2 [  202.711527] md/raid1:md1: nvme0n1p1: rescheduling sector 1188284751
> > The first SGL entry has phys addr aff50ec00, which is a page offset of 3072,
> > but its dma addr is 70f000, which is offset 0. Since the DMA page offset
> > doesn't match the physical one, this isn't compatible with the
> > nvme implementation.
>
> So LVM2 backed by md raid1 isn't compatible with newer hardware... Any
> suggestions?

It's not that LVM2 or RAID isn't compatible. Either the IOMMU isn't
compatible if it can use different page offsets for DMA addresses than the
physical addresses, or the driver for it is broken. The DMA addresses
in this mapped SGL look completely broken in any case, since the last 4
entries are all the same address. That'll corrupt data.
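
Something along these lines could be used to spot such mappings while
chasing this; the helper below is purely illustrative (the name is made up
and it is not in any tree), but it only uses the standard scatterlist
accessors:

#include <linux/kernel.h>
#include <linux/scatterlist.h>

/*
 * Hypothetical debug helper -- not in any tree.  Flags mapped elements whose
 * DMA in-page offset differs from the physical one, or that repeat the
 * previous element's DMA address (both visible in the log above).
 */
static void sgl_sanity_check(struct scatterlist *sgl, int nents)
{
	struct scatterlist *sg;
	dma_addr_t prev = 0;
	int i;

	for_each_sg(sgl, sg, nents, i) {
		dma_addr_t phys = sg_phys(sg);
		dma_addr_t dma = sg_dma_address(sg);

		if ((phys & (PAGE_SIZE - 1)) != (dma & (PAGE_SIZE - 1)))
			pr_warn("sg[%d]: phys offset %#lx != dma offset %#lx\n",
				i, (unsigned long)(phys & (PAGE_SIZE - 1)),
				(unsigned long)(dma & (PAGE_SIZE - 1)));
		if (i && dma == prev)
			pr_warn("sg[%d]: same dma address as sg[%d]: %pad\n",
				i, i - 1, &dma);
		prev = dma;
	}
}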


* kernel BUG at nvme/host/pci.c
  2017-07-14 17:08                 ` Keith Busch
@ 2017-07-15  8:51                     ` Christoph Hellwig
  0 siblings, 0 replies; 16+ messages in thread
From: Christoph Hellwig @ 2017-07-15  8:51 UTC (permalink / raw)


On Fri, Jul 14, 2017 at 01:08:47PM -0400, Keith Busch wrote:
> > So LVM2 backed by md raid1 isn't compatible with newer hardware... Any
> > suggestions?
> 
> It's not that LVM2 or RAID isn't compatible. Either the IOMMU isn't
> compatible if it can use different page offsets for DMA addresses than the
> physical addresses, or the driver for it is broken. The DMA addresses
> in this mapped SGL look completely broken in any case, since the last 4
> entries are all the same address. That'll corrupt data.

Given that this is a Xen system I wonder if swiotlb-xen is involved
here, which does some odd chunking of dma translations?




* kernel BUG at nvme/host/pci.c
  2017-07-15  8:51                     ` Christoph Hellwig
@ 2017-07-15 13:34                       ` Andreas Pflug
  -1 siblings, 0 replies; 16+ messages in thread
From: Andreas Pflug @ 2017-07-15 13:34 UTC (permalink / raw)


On 15.07.17 at 10:51, Christoph Hellwig wrote:
> On Fri, Jul 14, 2017 at 01:08:47PM -0400, Keith Busch wrote:
>>> So LVM2 backed by md raid1 isn't compatible with newer hardware... Any
>>> suggestions?
>> It's not that LVM2 or RAID isn't compatible. Either the IOMMU isn't
>> compatible if it can use different page offsets for DMA addresses than the
>> physical addresses, or the driver for it is broken. The DMA addresses
>> in this mapped SGL look completely broken in any case, since the last 4
>> entries are all the same address. That'll corrupt data.
> Given that this is a Xen system I wonder if swiotlb-xen is involved
> here, which does some odd chunking of dma translations?

I did some more testing.

With the data stored on SATA disks with md1 and lvm2 (i.e. just replacing
NVMe with SATA), nothing happens.
With the data stored directly on /dev/nvme1n1p1, i.e. without any
device-mapping layers in between, I get the same problem.
Log attached.

Regards,
Andreas
-------------- next part --------------
Jul 15 15:25:06 xen2 [ 4376.149215] Invalid SGL for payload:20992 nents:5
Jul 15 15:25:06 xen2 [ 4376.150382] ------------[ cut here ]------------
Jul 15 15:25:06 xen2 [ 4376.151261] WARNING: CPU: 0 PID: 29095 at drivers/nvme/host/pci.c:623 nvme_queue_rq+0x81b/0x840 [nvme]
Jul 15 15:25:06 xen2 [ 4376.152194] Modules linked in: xt_physdev br_netfilter iptable_filter xen_netback xen_blkback netconsole configfs bridge xen_gntdev xen_evtchn xenfs xen_privcmd iTCO_wdt intel_rapl iTCO_vendor_support mxm_wmi x86_pkg_temp_thermal intel_powerclamp coretemp crct10dif_pclmul crc32_pclmul ghash_clmulni_intel pcbc aesni_intel aes_x86_64 crypto_simd glue_helper cryptd intel_rapl_perf snd_pcm snd_timer snd soundcore pcspkr i2c_i801 joydev ast ttm drm_kms_helper drm sg i2c_algo_bit lpc_ich ehci_pci mfd_core ehci_hcd mei_me mei e1000e ixgbe ptp nvme pps_core mdio nvme_core ioatdma shpchp dca wmi acpi_power_meter 8021q garp mrp stp llc button ipmi_si ipmi_devintf ipmi_msghandler sunrpc drbd lru_cache ip_tables x_tables autofs4 ext4 crc16 mbcache jbd2 fscrypto raid10 raid456 libcrc32c crc32c_generic async_raid6_recov
Jul 15 15:25:06 xen2 [ 4376.158582]  async_memcpy async_pq async_xor xor async_tx raid6_pq raid0 multipath linear evdev hid_generic usbhid hid bcache dm_mod raid1 md_mod sd_mod crc32c_intel ahci libahci xhci_pci xhci_hcd libata usbcore scsi_mod
Jul 15 15:25:06 xen2 [ 4376.160593] CPU: 0 PID: 29095 Comm: 8.hda-0 Tainted: G      D W       4.12.0-20170713+ #1
Jul 15 15:25:06 xen2 [ 4376.161678] Hardware name: Supermicro X10DRi/X10DRI-T, BIOS 2.1 09/13/2016
Jul 15 15:25:06 xen2 [ 4376.162649] task: ffff88015fdc5000 task.stack: ffffc90048134000
Jul 15 15:25:06 xen2 [ 4376.163676] RIP: e030:nvme_queue_rq+0x81b/0x840 [nvme]
Jul 15 15:25:06 xen2 [ 4376.164804] RSP: e02b:ffffc90048137a00 EFLAGS: 00010286
Jul 15 15:25:06 xen2 [ 4376.165890] RAX: 0000000000000025 RBX: 00000000fffff200 RCX: 0000000000000000
Jul 15 15:25:06 xen2 [ 4376.166982] RDX: 0000000000000000 RSI: ffff880186a0de98 RDI: ffff880186a0de98
Jul 15 15:25:06 xen2 [ 4376.168099] RBP: ffff8801732ff000 R08: 0000000000000001 R09: 0000000000000a57
Jul 15 15:25:06 xen2 [ 4376.169081] R10: 0000000000001000 R11: 0000000000000001 R12: 0000000000000200
Jul 15 15:25:06 xen2 [ 4376.170198] R13: 0000000000001000 R14: ffff88015f9d7800 R15: ffff88016fce1800
Jul 15 15:25:06 xen2 [ 4376.171330] FS:  0000000000000000(0000) GS:ffff880186a00000(0000) knlGS:ffff880186a00000
Jul 15 15:25:06 xen2 [ 4376.172474] CS:  e033 DS: 0000 ES: 0000 CR0: 0000000080050033
Jul 15 15:25:06 xen2 [ 4376.173600] CR2: 000000b0f98d1970 CR3: 0000000175d4f000 CR4: 0000000000042660
Jul 15 15:25:06 xen2 [ 4376.174643] Call Trace:
Jul 15 15:25:06 xen2 [ 4376.175743]  ? __sbitmap_get_word+0x2a/0x80
Jul 15 15:25:06 xen2 [ 4376.176814]  ? blk_mq_dispatch_rq_list+0x200/0x3d0
Jul 15 15:25:06 xen2 [ 4376.177932]  ? blk_mq_flush_busy_ctxs+0xd1/0x120
Jul 15 15:25:06 xen2 [ 4376.178961]  ? blk_mq_sched_dispatch_requests+0x1c0/0x1f0
Jul 15 15:25:06 xen2 [ 4376.179942]  ? __blk_mq_delay_run_hw_queue+0x8f/0xa0
Jul 15 15:25:06 xen2 [ 4376.180941]  ? blk_mq_flush_plug_list+0x184/0x260
Jul 15 15:25:06 xen2 [ 4376.181935]  ? blk_flush_plug_list+0xf2/0x280
Jul 15 15:25:06 xen2 [ 4376.182952]  ? blk_finish_plug+0x27/0x40
Jul 15 15:25:06 xen2 [ 4376.183985]  ? dispatch_rw_block_io+0x732/0x9c0 [xen_blkback]
Jul 15 15:25:06 xen2 [ 4376.185059]  ? _raw_spin_lock_irqsave+0x17/0x39
Jul 15 15:25:06 xen2 [ 4376.186103]  ? __do_block_io_op+0x362/0x690 [xen_blkback]
Jul 15 15:25:06 xen2 [ 4376.187167]  ? _raw_spin_unlock_irqrestore+0x16/0x20
Jul 15 15:25:06 xen2 [ 4376.188216]  ? __do_block_io_op+0x362/0x690 [xen_blkback]
Jul 15 15:25:06 xen2 [ 4376.189294]  ? xen_blkif_schedule+0x116/0x7f0 [xen_blkback]
Jul 15 15:25:06 xen2 [ 4376.190247]  ? __schedule+0x3cd/0x850
Jul 15 15:25:06 xen2 [ 4376.191152]  ? remove_wait_queue+0x60/0x60
Jul 15 15:25:06 xen2 [ 4376.192112]  ? kthread+0xfc/0x130
Jul 15 15:25:06 xen2 [ 4376.193169]  ? xen_blkif_be_int+0x30/0x30 [xen_blkback]
Jul 15 15:25:06 xen2 [ 4376.194105]  ? kthread_create_on_node+0x70/0x70
Jul 15 15:25:06 xen2 [ 4376.195059]  ? do_group_exit+0x3a/0xa0
Jul 15 15:25:06 xen2 [ 4376.196049]  ? ret_from_fork+0x25/0x30
Jul 15 15:25:06 xen2 [ 4376.197050] Code: f9 ff ff 41 f6 47 4a 04 c6 05 7a 3e 00 00 01 41 8b 97 70 01 00 00 74 28 41 8b b7 90 00 00 00 48 c7 c7 b8 87 48 c0 e8 40 a4 c4 c0 <0f> ff e9 4d fe ff ff 0f 0b 4c 8b 2d c5 95 79 c1 e9 53 ff ff ff 
Jul 15 15:25:06 xen2 [ 4376.198947] ---[ end trace 6d7d395a29c931b5 ]---
Jul 15 15:25:06 xen2 [ 4376.200012] sg[0] phys_addr:0x0000000aff549e00 offset:3584 length:4608 dma_address:0x00000000004a3000 dma_length:4608
Jul 15 15:25:06 xen2 [ 4376.200951] sg[1] phys_addr:0x0000000aff5c3000 offset:0 length:4096 dma_address:0x00000009f4a80000 dma_length:4096
Jul 15 15:25:06 xen2 [ 4376.202015] sg[2] phys_addr:0x0000000aff615000 offset:0 length:4096 dma_address:0x00000009f4a80000 dma_length:4096
Jul 15 15:25:06 xen2 [ 4376.203006] sg[3] phys_addr:0x0000000aff608000 offset:0 length:4096 dma_address:0x00000009f4a80000 dma_length:4096
Jul 15 15:25:06 xen2 [ 4376.203889] sg[4] phys_addr:0x0000000aff50e000 offset:0 length:4096 dma_address:0x00000009f5a4e000 dma_length:4096
Jul 15 15:25:06 xen2 [ 4376.204722] print_req_error: I/O error, dev nvme1n1, sector 14318951




Thread overview:
2017-07-10 18:03 kernel BUG at nvme/host/pci.c Andreas Pflug
2017-07-10 19:08 ` Keith Busch
2017-07-11  7:44   ` Andreas Pflug
2017-07-11 19:45     ` Keith Busch
2017-07-11 19:44       ` Scott Bauer
2017-07-12  6:06       ` Andreas Pflug
2017-07-12 19:50         ` Keith Busch
2017-07-13  8:46           ` Andreas Pflug
2017-07-13  9:00             ` Sagi Grimberg
2017-07-13 13:47             ` Keith Busch
2017-07-14 16:47               ` Andreas Pflug
2017-07-14 17:08                 ` Keith Busch
2017-07-15  8:51                   ` Christoph Hellwig
2017-07-15 13:34                     ` Andreas Pflug
