All of lore.kernel.org
 help / color / mirror / Atom feed
* change in xhci result in soft lockup
@ 2016-08-06 10:34 Kui Zhang
  2016-08-13 22:13 ` Kui Zhang
  0 siblings, 1 reply; 4+ messages in thread
From: Kui Zhang @ 2016-08-06 10:34 UTC (permalink / raw)
  To: mathias.nyman, gregkh, linux-kernel

Hello

After this patch

# first bad commit: [86065c2719a5685cef36945f09def3f0658c7860] xhci:
don't rely on precalculated value of needed trbs in the enqueue loop


My system would hang while installing android packages over USB

adb install -r VLC-Android-2.1.0-ARMv7.apk


Log:

Aug 01 01:49:17  kernel: CPU: 3 PID: 174 Comm: kworker/3:1 Tainted: P
         O    4.7.0+ #12
Aug 01 01:49:17  kernel: Hardware name: Notebook
  P15SM-A/SM1-A                    /P15SM-A/SM1-A                    ,
BIOS 4.6.5 04/20/2015
Aug 01 01:49:17  kernel: Workqueue: events
pkg_temp_thermal_threshold_work_fn [x86_pkg_temp_thermal]
Aug 01 01:49:17  kernel: task: ffff880222fac600 task.stack: ffff880222814000
Aug 01 01:49:17  kernel: RIP: 0010:[<ffffffff98175194>]
[<ffffffff98175194>] smp_call_function_single+0xd4/0x130
Aug 01 01:49:17  kernel: RSP: 0018:ffff880222817cb0  EFLAGS: 00000202
Aug 01 01:49:17  kernel: RAX: 0000000000000000 RBX: 0000000000000000
RCX: 0000000000000830
Aug 01 01:49:17  kernel: RDX: 0000000000000003 RSI: 00000000000000fb
RDI: 0000000000000001
Aug 01 01:49:17  kernel: RBP: ffff880222817cf8 R08: ffff88022fad7940
R09: 0000000000000000
Aug 01 01:49:17  kernel: R10: 0000000000000000 R11: 0000000000000000
R12: ffffffff985162f0
Aug 01 01:49:17  kernel: R13: ffff8802258624c0 R14: ffff880222817dd4
R15: ffff88022fad94c0
Aug 01 01:49:17  kernel: FS:  0000000000000000(0000)
GS:ffff88022fac0000(0000) knlGS:0000000000000000
Aug 01 01:49:17  kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Aug 01 01:49:17  kernel: CR2: 00002a6d3e049000 CR3: 000000003b206000
CR4: 00000000001406e0
Aug 01 01:49:17  kernel: DR0: 0000000000000000 DR1: 0000000000000000
DR2: 0000000000000000
Aug 01 01:49:17  kernel: DR3: 0000000000000000 DR6: 00000000fffe0ff0
DR7: 0000000000000400
Aug 01 01:49:17  kernel: Stack:
Aug 01 01:49:17  kernel:  ffff880222817cf0 ffffffff981259d5
0000000000000000 ffffffff985162f0
Aug 01 01:49:17  kernel:  ffff880222817d08 0000000000000003
00000000cb48f97b ffff880222817d54
Aug 01 01:49:17  kernel:  ffff880222817d50 ffff880222817d40
ffffffff98515ead 00000000000001b1
Aug 01 01:49:17  kernel: Call Trace:
Aug 01 01:49:17  kernel:  [<ffffffff981259d5>] ? update_curr+0xf5/0x170
Aug 01 01:49:17  kernel:  [<ffffffff985162f0>] ?
wrmsr_safe_regs_on_cpu+0x60/0x60
Aug 01 01:49:17  kernel:  [<ffffffff98515ead>] rdmsr_on_cpu+0x5d/0x90
Aug 01 01:49:17  kernel:  [<ffffffffc0d000f8>]
sys_get_curr_temp+0x48/0xb0 [x86_pkg_temp_thermal]
Aug 01 01:49:17  kernel:  [<ffffffff98866a28>] thermal_zone_get_temp+0x78/0x120
Aug 01 01:49:17  kernel:  [<ffffffff9812e1ca>] ? pick_next_task_fair+0x10a/0x4c0
Aug 01 01:49:17  kernel:  [<ffffffff98868803>]
thermal_zone_device_update+0x43/0x1a0
Aug 01 01:49:17  kernel:  [<ffffffffc0d00cd7>]
pkg_temp_thermal_threshold_work_fn+0x1a7/0x214 [x86_pkg_temp_thermal]
Aug 01 01:49:17  kernel:  [<ffffffff9810a38b>] process_one_work+0x14b/0x400
Aug 01 01:49:17  kernel:  [<ffffffff9810a9cb>] worker_thread+0x4b/0x480
Aug 01 01:49:17  kernel:  [<ffffffff9810a980>] ? rescuer_thread+0x340/0x340
Aug 01 01:49:17  kernel:  [<ffffffff9810a980>] ? rescuer_thread+0x340/0x340
Aug 01 01:49:17  kernel:  [<ffffffff9810fb38>] kthread+0xd8/0xf0
Aug 01 01:49:17  kernel:  [<ffffffff98a3347f>] ret_from_fork+0x1f/0x40
Aug 01 01:49:17  kernel:  [<ffffffff9810fa60>] ? kthread_park+0x60/0x60
Aug 01 01:49:17  kernel: Code: 00 00 75 70 48 83 c4 38 5b 41 5c 5d c3
48 8d 75 c8 48 89 d1 89 df 4c 89 e2 e8 39 fe ff ff 8b 55 e0 83 e2 01
74 0a f3 90 8b 55 e0 <83> e2
Aug 01 01:49:17  kernel: ata4.00: exception Emask 0x0 SAct 0x0 SErr
0x0 action 0x6 frozen
Aug 01 01:49:17  kernel: ata4.00: cmd
a0/00:00:00:08:00/00:00:00:00:00/a0 tag 28 pio 16392 in
                                  Get event status notification 4a 01
00 00 10 00 00 00 08 00res 40/00:01:00:00:02/00:00:00:00:00/00 Emask
0x4 (timeout)
Aug 01 01:49:17  kernel: ata4.00: status: { DRDY }
Aug 01 01:49:17  kernel: ata4: hard resetting link
Aug 01 01:49:17  kernel: ata4: SATA link up 1.5 Gbps (SStatus 113 SControl 300)
Aug 01 01:49:17  kernel: sd 0:0:0:0: [sda] tag#12 FAILED Result:
hostbyte=DID_OK driverbyte=DRIVER_TIMEOUT
Aug 01 01:49:17  kernel: sd 0:0:0:0: [sda] tag#12 CDB: Read(10) 28 00
05 9a c2 d0 00 01 00 00
Aug 01 01:49:17  kernel: blk_update_request: I/O error, dev sda, sector 94028496
Aug 01 01:49:17  kernel: sd 0:0:0:0: [sda] tag#11 FAILED Result:
hostbyte=DID_OK driverbyte=DRIVER_TIMEOUT
Aug 01 01:49:17  kernel: sd 0:0:0:0: [sda] tag#11 CDB: Read(10) 28 00
05 9a c1 d0 00 01 00 00
Aug 01 01:49:17  kernel: blk_update_request: I/O error, dev sda, sector 94028240
Aug 01 01:49:17  kernel: ata4.00: configured for UDMA/133
Aug 01 01:49:17  kernel: ata4: EH complete
Aug 01 01:49:17  kernel: NMI watchdog: BUG: soft lockup - CPU#3 stuck
for 22s! [kworker/3:1:174]




Thanks
Kui.Z

^ permalink raw reply	[flat|nested] 4+ messages in thread

* Re: change in xhci result in soft lockup
  2016-08-06 10:34 change in xhci result in soft lockup Kui Zhang
@ 2016-08-13 22:13 ` Kui Zhang
  2016-08-15 10:52   ` Mathias Nyman
  0 siblings, 1 reply; 4+ messages in thread
From: Kui Zhang @ 2016-08-13 22:13 UTC (permalink / raw)
  To: mathias.nyman, gregkh, linux-kernel

Hello,

System still hangs with 4.8.0-rc1+. There are new info in the logs.

00:1f.2 SATA controller: Intel Corporation 8 Series/C220 Series
Chipset Family 6-port SATA Controller 1 [AHCI mode] (rev 05)
00:14.0 USB controller: Intel Corporation 8 Series/C220 Series Chipset
Family USB xHCI (rev 05)
00:1a.0 USB controller: Intel Corporation 8 Series/C220 Series Chipset
Family USB EHCI #2 (rev 05)
00:1d.0 USB controller: Intel Corporation 8 Series/C220 Series Chipset
Family USB EHCI #1 (rev 05)


[  562.435838] scsi host6: runtime PM trying to activate child device
host6 but parent (3-2:1.0) is not active


Aug 13 14:47:36 kernel: xhci_hcd 0000:00:14.0: WARN: TRB error on endpoint
Aug 13 14:47:36 kernel: NMI watchdog: BUG: soft lockup - CPU#1 stuck
for 22s! [thermald:2511]
Aug 13 14:47:36 kernel: Modules linked in: cmac bnep bbswitch(O)
binfmt_misc iwlmvm btusb btrtl btbcm btintel bluetooth mac80211
iwlwifi cfg80211 input_
Aug 13 14:47:36 kernel: CPU: 1 PID: 2511 Comm: thermald Tainted: P
      O    4.8.0-rc1+ #10
Aug 13 14:47:36 kernel: Hardware name: Notebook
 P15SM-A/SM1-A                    /P15SM-A/SM1-A                    ,
BIOS 4.6.5
Aug 13 14:47:36 kernel: task: ffff9b08e4a05580 task.stack: ffff9b08e5708000
Aug 13 14:47:36 kernel: RIP: 0010:[<ffffffff92174e94>]
[<ffffffff92174e94>] smp_call_function_single+0xd4/0x130
Aug 13 14:47:36 kernel: RSP: 0018:ffff9b08e570bbf0  EFLAGS: 00000202
Aug 13 14:47:36 kernel: RAX: 0000000000000000 RBX: 0000000000000000
RCX: 0000000000000000
Aug 13 14:47:36 kernel: RDX: 0000000000000003 RSI: 00000000000000fb
RDI: 0000000001000000
Aug 13 14:47:36 kernel: RBP: ffff9b08e570bc38 R08: 000000000001b600
R09: ffff9b08dee8f000
Aug 13 14:47:36 kernel: R10: ffff9b08dee8f000 R11: ffff9b08e2b15e40
R12: ffffffff92517cf0
Aug 13 14:47:36 kernel: R13: ffff9b08e143bb90 R14: ffff9b08e570bd0c
R15: ffff9b08dee8f000
Aug 13 14:47:36 kernel: FS:  00007f00c9ead700(0000)
GS:ffff9b08efa40000(0000) knlGS:0000000000000000
Aug 13 14:47:36 kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Aug 13 14:47:36 kernel: CR2: 00005558ddf00000 CR3: 00000002200ba000
CR4: 00000000001406e0
Aug 13 14:47:36 kernel: DR0: 0000000000000000 DR1: 0000000000000000
DR2: 0000000000000000
Aug 13 14:47:36 kernel: DR3: 0000000000000000 DR6: 00000000fffe0ff0
DR7: 0000000000000400
Aug 13 14:47:36 kernel: Stack:
Aug 13 14:47:36 kernel:  ffff9b08050f2b00 ffff9b0805a0e840
0000000000000000 ffffffff92517cf0
Aug 13 14:47:36 kernel:  ffff9b08e570bc48 0000000000000003
0000000073535663 ffff9b08e570bc94
Aug 13 14:47:36 kernel:  ffff9b08e570bc90 ffff9b08e570bc80
ffffffff925178ad 00000000000001b1
Aug 13 14:47:36 kernel: Call Trace:
Aug 13 14:47:36 kernel:  [<ffffffff92517cf0>] ? wrmsr_safe_regs_on_cpu+0x60/0x60
Aug 13 14:47:36 kernel:  [<ffffffff925178ad>] rdmsr_on_cpu+0x5d/0x90
Aug 13 14:47:36 kernel:  [<ffffffffc0bcd0f1>]
sys_get_curr_temp+0x41/0xa0 [x86_pkg_temp_thermal]
Aug 13 14:47:36 kernel:  [<ffffffff92865468>] thermal_zone_get_temp+0x78/0x120
Aug 13 14:47:36 kernel:  [<ffffffff922a7b34>] ? mntput+0x24/0x40
Aug 13 14:47:36 kernel:  [<ffffffff9286576d>] temp_show+0x2d/0x70
Aug 13 14:47:36 kernel:  [<ffffffff92713250>] dev_attr_show+0x20/0x50
Aug 13 14:47:36 kernel:  [<ffffffff92308046>] sysfs_kf_seq_show+0xa6/0x140
Aug 13 14:47:36 kernel:  [<ffffffff92306853>] kernfs_seq_show+0x23/0x30
Aug 13 14:47:36 kernel:  [<ffffffff922ab2bf>] seq_read+0xef/0x390
Aug 13 14:47:36 kernel:  [<ffffffff92307019>] kernfs_fop_read+0x109/0x180
Aug 13 14:47:36 kernel:  [<ffffffff92285ba7>] __vfs_read+0x37/0x150
Aug 13 14:47:36 kernel:  [<ffffffff92447243>] ?
security_file_permission+0xa3/0xc0
Aug 13 14:47:36 kernel:  [<ffffffff92286175>] vfs_read+0x95/0x140
Aug 13 14:47:36 kernel:  [<ffffffff92287685>] SyS_read+0x55/0xc0
Aug 13 14:47:36 kernel:  [<ffffffff92a310df>]
entry_SYSCALL_64_fastpath+0x17/0x93
Aug 13 14:47:36 kernel: Code: 00 00 75 70 48 83 c4 38 5b 41 5c 5d c3
48 8d 75 c8 48 89 d1 89 df 4c 89 e2 e8 39 fe ff ff 8b 55 e0 83 e2 01
74 0a f3 90 8b
Aug 13 14:47:40 kernel: ata1.00: exception Emask 0x0 SAct 0xc00 SErr
0x0 action 0x6 frozen
Aug 13 14:47:40 kernel: ata1.00: failed command: READ FPDMA QUEUED
Aug 13 14:47:40 kernel: ata1.00: cmd
60/00:50:e8:e5:ca/01:00:04:00:00/40 tag 10 ncq dma 131072 in
                                 res
40/00:01:00:00:00/00:00:00:00:00/00 Emask 0x4 (timeout)
Aug 13 14:47:40 kernel: ata1.00: status: { DRDY }
Aug 13 14:47:40 kernel: ata1.00: failed command: READ FPDMA QUEUED
Aug 13 14:47:40 kernel: ata1.00: cmd
60/00:58:e8:e6:ca/01:00:04:00:00/40 tag 11 ncq dma 131072 in
                                 res
40/00:00:00:00:00/00:00:00:00:00/00 Emask 0x4 (timeout)
Aug 13 14:47:40 kernel: ata1.00: status: { DRDY }
Aug 13 14:47:40 kernel: ata1: hard resetting link
Aug 13 14:47:40 kernel: ata1: SATA link up 6.0 Gbps (SStatus 133 SControl 300)
Aug 13 14:47:40 kernel: ata1.00: configured for UDMA/133
Aug 13 14:47:40 kernel: ata1.00: device reported invalid CHS sector 0
Aug 13 14:47:40 kernel: ata1: EH complete
Aug 13 14:48:04 kernel: NMI watchdog: BUG: soft lockup - CPU#1 stuck
for 23s! [thermald:2511]
Aug 13 14:48:04 kernel: Modules linked in: cmac bnep bbswitch(O)
binfmt_misc iwlmvm btusb btrtl btbcm btintel bluetooth mac80211
iwlwifi cfg80211 input_
Aug 13 14:48:04 kernel: CPU: 1 PID: 2511 Comm: thermald Tainted: P
      O L  4.8.0-rc1+ #10
Aug 13 14:48:04 kernel: Hardware name: Notebook
 P15SM-A/SM1-A                    /P15SM-A/SM1-A                    ,
BIOS 4.6.5
Aug 13 14:48:04 kernel: task: ffff9b08e4a05580 task.stack: ffff9b08e5708000
Aug 13 14:48:04 kernel: RIP: 0010:[<ffffffff92174e94>]
[<ffffffff92174e94>] smp_call_function_single+0xd4/0x130
Aug 13 14:48:04 kernel: RSP: 0018:ffff9b08e570bbf0  EFLAGS: 00000202
Aug 13 14:48:04 kernel: RAX: 0000000000000000 RBX: 0000000000000000
RCX: 0000000000000000
Aug 13 14:48:04 kernel: RDX: 0000000000000003 RSI: 00000000000000fb
RDI: 0000000001000000
Aug 13 14:48:04 kernel: RBP: ffff9b08e570bc38 R08: 000000000001b600
R09: ffff9b08dee8f000
Aug 13 14:48:04 kernel: R10: ffff9b08dee8f000 R11: ffff9b08e2b15e40
R12: ffffffff92517cf0
Aug 13 14:48:04 kernel: R13: ffff9b08e143bb90 R14: ffff9b08e570bd0c
R15: ffff9b08dee8f000
Aug 13 14:48:04 kernel: FS:  00007f00c9ead700(0000)
GS:ffff9b08efa40000(0000) knlGS:0000000000000000
Aug 13 14:48:04 kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Aug 13 14:48:04 kernel: CR2: 00005558ddf00000 CR3: 00000002200ba000
CR4: 00000000001406e0
Aug 13 14:48:04 kernel: DR0: 0000000000000000 DR1: 0000000000000000
DR2: 0000000000000000
Aug 13 14:48:04 kernel: DR3: 0000000000000000 DR6: 00000000fffe0ff0
DR7: 0000000000000400
Aug 13 14:48:04 kernel: Stack:
Aug 13 14:48:04 kernel:  ffff9b08050f2b00 ffff9b0805a0e840
0000000000000000 ffffffff92517cf0
Aug 13 14:48:04 kernel:  ffff9b08e570bc48 0000000000000003
0000000073535663 ffff9b08e570bc94
Aug 13 14:48:04 kernel:  ffff9b08e570bc90 ffff9b08e570bc80
ffffffff925178ad 00000000000001b1
Aug 13 14:48:04 kernel: Call Trace:
Aug 13 14:48:04 kernel:  [<ffffffff92517cf0>] ? wrmsr_safe_regs_on_cpu+0x60/0x60
Aug 13 14:48:04 kernel:  [<ffffffff925178ad>] rdmsr_on_cpu+0x5d/0x90
Aug 13 14:48:04 kernel:  [<ffffffffc0bcd0f1>]
sys_get_curr_temp+0x41/0xa0 [x86_pkg_temp_thermal]
Aug 13 14:48:04 kernel:  [<ffffffff92865468>] thermal_zone_get_temp+0x78/0x120
Aug 13 14:48:04 kernel:  [<ffffffff922a7b34>] ? mntput+0x24/0x40
Aug 13 14:48:04 kernel:  [<ffffffff9286576d>] temp_show+0x2d/0x70
Aug 13 14:48:04 kernel:  [<ffffffff92713250>] dev_attr_show+0x20/0x50
Aug 13 14:48:04 kernel:  [<ffffffff92308046>] sysfs_kf_seq_show+0xa6/0x140
Aug 13 14:48:04 kernel:  [<ffffffff92306853>] kernfs_seq_show+0x23/0x30
Aug 13 14:48:04 kernel:  [<ffffffff922ab2bf>] seq_read+0xef/0x390
Aug 13 14:48:04 kernel:  [<ffffffff92307019>] kernfs_fop_read+0x109/0x180
Aug 13 14:48:04 kernel:  [<ffffffff92285ba7>] __vfs_read+0x37/0x150
Aug 13 14:48:04 kernel:  [<ffffffff92447243>] ?
security_file_permission+0xa3/0xc0
Aug 13 14:48:04 kernel:  [<ffffffff92286175>] vfs_read+0x95/0x140
Aug 13 14:48:04 kernel:  [<ffffffff92287685>] SyS_read+0x55/0xc0
Aug 13 14:48:04 kernel:  [<ffffffff92a310df>]
entry_SYSCALL_64_fastpath+0x17/0x93
Aug 13 14:48:04 kernel: Code: 00 00 75 70 48 83 c4 38 5b 41 5c 5d c3
48 8d 75 c8 48 89 d1 89 df 4c 89 e2 e8 39 fe ff ff 8b 55 e0 83 e2 01
74 0a f3 90 8b
Aug 13 14:48:09 kernel: INFO: rcu_sched detected stalls on CPUs/tasks:
Aug 13 14:48:09 kernel:         0-...: (0 ticks this GP) idle=f1f/1/0
softirq=14833/14833 fqs=6000
Aug 13 14:48:09 kernel:         (detected by 1, t=18002 jiffies,
g=13018, c=13017, q=0)
Aug 13 14:48:09 kernel: Task dump for CPU 0:
Aug 13 14:48:09 kernel: swapper/0       R  running task        0     0
     0 0x00000008
Aug 13 14:48:09 kernel:  ffffffff9287e685 0000000000000010
0000000000000246 ffffffff93203ed0
Aug 13 14:48:09 kernel:  0000000000000018 0000000000000000
000000856bce9e40 0000000000000000
Aug 13 14:48:09 kernel:  ffff9b08efa20200 ffffffff932e2f80
ffffffff93200000 ffffffff93200000
Aug 13 14:48:09 kernel: Call Trace:
Aug 13 14:48:09 kernel:  [<ffffffff9287e685>] ? cpuidle_enter_state+0x115/0x260
Aug 13 14:48:09 kernel:  [<ffffffff9287e807>] ? cpuidle_enter+0x17/0x20
Aug 13 14:48:09 kernel:  [<ffffffff92134a08>] ? cpu_startup_entry+0x2a8/0x360
Aug 13 14:48:09 kernel:  [<ffffffff92a2a6b7>] ? rest_init+0x77/0x80
Aug 13 14:48:09 kernel:  [<ffffffff933c7f0f>] ? start_kernel+0x453/0x474
Aug 13 14:48:09 kernel:  [<ffffffff933c7430>] ?
x86_64_start_reservations+0x24/0x26
Aug 13 14:48:09 kernel:  [<ffffffff933c751a>] ? x86_64_start_kernel+0xe8/0xeb
Aug 13 14:48:36 kernel: NMI watchdog: BUG: soft lockup - CPU#1 stuck
for 22s! [thermald:2511]
Aug 13 14:48:36 kernel: Modules linked in: cmac bnep bbswitch(O)
binfmt_misc iwlmvm btusb btrtl btbcm btintel bluetooth mac80211
iwlwifi cfg80211 input_
Aug 13 14:48:36 kernel: CPU: 1 PID: 2511 Comm: thermald Tainted: P
      O L  4.8.0-rc1+ #10
Aug 13 14:48:36 kernel: Hardware name: Notebook
 P15SM-A/SM1-A                    /P15SM-A/SM1-A                    ,
BIOS 4.6.5
Aug 13 14:48:36 kernel: task: ffff9b08e4a05580 task.stack: ffff9b08e5708000
Aug 13 14:48:36 kernel: RIP: 0010:[<ffffffff92174e91>]
[<ffffffff92174e91>] smp_call_function_single+0xd1/0x130
Aug 13 14:48:36 kernel: RSP: 0018:ffff9b08e570bbf0  EFLAGS: 00000202
Aug 13 14:48:36 kernel: RAX: 0000000000000000 RBX: 0000000000000000
RCX: 0000000000000000
Aug 13 14:48:36 kernel: RDX: 0000000000000001 RSI: 00000000000000fb
RDI: 0000000001000000
Aug 13 14:48:36 kernel: RBP: ffff9b08e570bc38 R08: 000000000001b600
R09: ffff9b08dee8f000
Aug 13 14:48:36 kernel: R10: ffff9b08dee8f000 R11: ffff9b08e2b15e40
R12: ffffffff92517cf0
Aug 13 14:48:36 kernel: R13: ffff9b08e143bb90 R14: ffff9b08e570bd0c
R15: ffff9b08dee8f000
Aug 13 14:48:36 kernel: FS:  00007f00c9ead700(0000)
GS:ffff9b08efa40000(0000) knlGS:0000000000000000
Aug 13 14:48:36 kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Aug 13 14:48:36 kernel: CR2: 00005558ddf00000 CR3: 00000002200ba000
CR4: 00000000001406e0
Aug 13 14:48:36 kernel: DR0: 0000000000000000 DR1: 0000000000000000
DR2: 0000000000000000
Aug 13 14:48:36 kernel: DR3: 0000000000000000 DR6: 00000000fffe0ff0
DR7: 0000000000000400
Aug 13 14:48:36 kernel: Stack:
Aug 13 14:48:36 kernel:  ffff9b08050f2b00 ffff9b0805a0e840
0000000000000000 ffffffff92517cf0
Aug 13 14:48:36 kernel:  ffff9b08e570bc48 0000000000000003
0000000073535663 ffff9b08e570bc94
Aug 13 14:48:36 kernel:  ffff9b08e570bc90 ffff9b08e570bc80
ffffffff925178ad 00000000000001b1
Aug 13 14:48:36 kernel: Call Trace:
Aug 13 14:48:36 kernel:  [<ffffffff92517cf0>] ? wrmsr_safe_regs_on_cpu+0x60/0x60
Aug 13 14:48:36 kernel:  [<ffffffff925178ad>] rdmsr_on_cpu+0x5d/0x90
Aug 13 14:48:36 kernel:  [<ffffffffc0bcd0f1>]
sys_get_curr_temp+0x41/0xa0 [x86_pkg_temp_thermal]
Aug 13 14:48:36 kernel:  [<ffffffff92865468>] thermal_zone_get_temp+0x78/0x120
Aug 13 14:48:36 kernel:  [<ffffffff922a7b34>] ? mntput+0x24/0x40
Aug 13 14:48:36 kernel:  [<ffffffff9286576d>] temp_show+0x2d/0x70
Aug 13 14:48:36 kernel:  [<ffffffff92713250>] dev_attr_show+0x20/0x50
Aug 13 14:48:36 kernel:  [<ffffffff92308046>] sysfs_kf_seq_show+0xa6/0x140
Aug 13 14:48:36 kernel:  [<ffffffff92306853>] kernfs_seq_show+0x23/0x30
Aug 13 14:48:36 kernel:  [<ffffffff922ab2bf>] seq_read+0xef/0x390
Aug 13 14:48:36 kernel:  [<ffffffff92307019>] kernfs_fop_read+0x109/0x180
Aug 13 14:48:36 kernel:  [<ffffffff92285ba7>] __vfs_read+0x37/0x150
Aug 13 14:48:36 kernel:  [<ffffffff92447243>] ?
security_file_permission+0xa3/0xc0
Aug 13 14:48:36 kernel:  [<ffffffff92286175>] vfs_read+0x95/0x140
Aug 13 14:48:36 kernel:  [<ffffffff92287685>] SyS_read+0x55/0xc0
Aug 13 14:48:36 kernel:  [<ffffffff92a310df>]
entry_SYSCALL_64_fastpath+0x17/0x93
Aug 13 14:48:36 kernel: Code: 25 28 00 00 00 75 70 48 83 c4 38 5b 41
5c 5d c3 48 8d 75 c8 48 89 d1 89 df 4c 89 e2 e8 39 fe ff ff 8b 55 e0
83 e2 01 74 0a
Aug 13 14:48:57 kernel: ata1.00: exception Emask 0x0 SAct 0x0 SErr 0x0
action 0x6 frozen
Aug 13 14:48:57 kernel: ata1.00: failed command: FLUSH CACHE EXT
Aug 13 14:48:57 kernel: ata1.00: cmd ea/00:00:00:00:00/00:00:00:00:00/a0 tag 27
                                 res
40/00:01:01:4f:c2/00:00:00:00:00/00 Emask 0x4 (timeout)
Aug 13 14:48:57 kernel: ata1.00: status: { DRDY }
Aug 13 14:48:57 kernel: ata1: hard resetting link
Aug 13 14:48:57 kernel: ata1: SATA link up 6.0 Gbps (SStatus 133 SControl 300)
Aug 13 14:48:57 kernel: ata1.00: configured for UDMA/133
Aug 13 14:48:57 kernel: ata1.00: retrying FLUSH 0xea Emask 0x4
Aug 13 14:48:57 kernel: ata1: EH complete
Aug 13 14:49:04 kernel: NMI watchdog: BUG: soft lockup - CPU#1 stuck
for 22s! [thermald:2511]
Aug 13 14:49:04 kernel: Modules linked in: cmac bnep bbswitch(O)
binfmt_misc iwlmvm btusb btrtl btbcm btintel bluetooth mac80211
iwlwifi cfg80211 input_
Aug 13 14:49:05 kernel: CPU: 1 PID: 2511 Comm: thermald Tainted: P
      O L  4.8.0-rc1+ #10
Aug 13 14:49:05 kernel: Hardware name: Notebook
 P15SM-A/SM1-A                    /P15SM-A/SM1-A                    ,
BIOS 4.6.5
Aug 13 14:49:05 kernel: task: ffff9b08e4a05580 task.stack: ffff9b08e5708000
Aug 13 14:49:05 kernel: RIP: 0010:[<ffffffff92174e94>]
[<ffffffff92174e94>] smp_call_function_single+0xd4/0x130
Aug 13 14:49:05 kernel: RSP: 0018:ffff9b08e570bbf0  EFLAGS: 00000202
Aug 13 14:49:05 kernel: RAX: 0000000000000000 RBX: 0000000000000000
RCX: 0000000000000000
Aug 13 14:49:05 kernel: RDX: 0000000000000003 RSI: 00000000000000fb
RDI: 0000000001000000
Aug 13 14:49:05 kernel: RBP: ffff9b08e570bc38 R08: 000000000001b600
R09: ffff9b08dee8f000
Aug 13 14:49:05 kernel: R10: ffff9b08dee8f000 R11: ffff9b08e2b15e40
R12: ffffffff92517cf0
Aug 13 14:49:05 kernel: R13: ffff9b08e143bb90 R14: ffff9b08e570bd0c
R15: ffff9b08dee8f000
Aug 13 14:49:05 kernel: FS:  00007f00c9ead700(0000)
GS:ffff9b08efa40000(0000) knlGS:0000000000000000
Aug 13 14:49:05 kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Aug 13 14:49:05 kernel: CR2: 00005558ddf00000 CR3: 00000002200ba000
CR4: 00000000001406e0
Aug 13 14:49:05 kernel: DR0: 0000000000000000 DR1: 0000000000000000
DR2: 0000000000000000
Aug 13 14:49:05 kernel: DR3: 0000000000000000 DR6: 00000000fffe0ff0
DR7: 0000000000000400
Aug 13 14:49:05 kernel: Stack:
Aug 13 14:49:05 kernel:  ffff9b08050f2b00 ffff9b0805a0e840
0000000000000000 ffffffff92517cf0
Aug 13 14:49:05 kernel:  ffff9b08e570bc48 0000000000000003
0000000073535663 ffff9b08e570bc94
Aug 13 14:49:05 kernel:  ffff9b08e570bc90 ffff9b08e570bc80
ffffffff925178ad 00000000000001b1
Aug 13 14:49:05 kernel: Call Trace:
Aug 13 14:49:05 kernel:  [<ffffffff92517cf0>] ? wrmsr_safe_regs_on_cpu+0x60/0x60
Aug 13 14:49:05 kernel:  [<ffffffff925178ad>] rdmsr_on_cpu+0x5d/0x90
Aug 13 14:49:05 kernel:  [<ffffffffc0bcd0f1>]
sys_get_curr_temp+0x41/0xa0 [x86_pkg_temp_thermal]
Aug 13 14:49:05 kernel:  [<ffffffff92865468>] thermal_zone_get_temp+0x78/0x120
Aug 13 14:49:05 kernel:  [<ffffffff922a7b34>] ? mntput+0x24/0x40
Aug 13 14:49:05 kernel:  [<ffffffff9286576d>] temp_show+0x2d/0x70
Aug 13 14:49:05 kernel:  [<ffffffff92713250>] dev_attr_show+0x20/0x50
Aug 13 14:49:05 kernel:  [<ffffffff92308046>] sysfs_kf_seq_show+0xa6/0x140
Aug 13 14:49:05 kernel:  [<ffffffff92306853>] kernfs_seq_show+0x23/0x30
Aug 13 14:49:05 kernel:  [<ffffffff922ab2bf>] seq_read+0xef/0x390
Aug 13 14:49:05 kernel:  [<ffffffff92307019>] kernfs_fop_read+0x109/0x180
Aug 13 14:49:05 kernel:  [<ffffffff92285ba7>] __vfs_read+0x37/0x150
Aug 13 14:49:05 kernel:  [<ffffffff92447243>] ?
security_file_permission+0xa3/0xc0
Aug 13 14:49:05 kernel:  [<ffffffff92286175>] vfs_read+0x95/0x140
Aug 13 14:49:05 kernel:  [<ffffffff92287685>] SyS_read+0x55/0xc0
Aug 13 14:49:05 kernel:  [<ffffffff92a310df>]
entry_SYSCALL_64_fastpath+0x17/0x93
Aug 13 14:49:05 kernel: Code: 00 00 75 70 48 83 c4 38 5b 41 5c 5d c3
48 8d 75 c8 48 89 d1 89 df 4c 89 e2 e8 39 fe ff ff 8b 55 e0 83 e2 01
74 0a f3 90 8b
Aug 13 14:49:32 kernel: NMI watchdog: BUG: soft lockup - CPU#1 stuck
for 23s! [thermald:2511]
-- Reboot --








On Sat, Aug 6, 2016 at 3:34 AM, Kui Zhang <kuizhang@gmail.com> wrote:
> Hello
>
> After this patch
>
> # first bad commit: [86065c2719a5685cef36945f09def3f0658c7860] xhci:
> don't rely on precalculated value of needed trbs in the enqueue loop
>
>
> My system would hang while installing android packages over USB
>
> adb install -r VLC-Android-2.1.0-ARMv7.apk
>
>
> Log:
>
> Aug 01 01:49:17  kernel: CPU: 3 PID: 174 Comm: kworker/3:1 Tainted: P
>          O    4.7.0+ #12
> Aug 01 01:49:17  kernel: Hardware name: Notebook
>   P15SM-A/SM1-A                    /P15SM-A/SM1-A                    ,
> BIOS 4.6.5 04/20/2015
> Aug 01 01:49:17  kernel: Workqueue: events
> pkg_temp_thermal_threshold_work_fn [x86_pkg_temp_thermal]
> Aug 01 01:49:17  kernel: task: ffff880222fac600 task.stack: ffff880222814000
> Aug 01 01:49:17  kernel: RIP: 0010:[<ffffffff98175194>]
> [<ffffffff98175194>] smp_call_function_single+0xd4/0x130
> Aug 01 01:49:17  kernel: RSP: 0018:ffff880222817cb0  EFLAGS: 00000202
> Aug 01 01:49:17  kernel: RAX: 0000000000000000 RBX: 0000000000000000
> RCX: 0000000000000830
> Aug 01 01:49:17  kernel: RDX: 0000000000000003 RSI: 00000000000000fb
> RDI: 0000000000000001
> Aug 01 01:49:17  kernel: RBP: ffff880222817cf8 R08: ffff88022fad7940
> R09: 0000000000000000
> Aug 01 01:49:17  kernel: R10: 0000000000000000 R11: 0000000000000000
> R12: ffffffff985162f0
> Aug 01 01:49:17  kernel: R13: ffff8802258624c0 R14: ffff880222817dd4
> R15: ffff88022fad94c0
> Aug 01 01:49:17  kernel: FS:  0000000000000000(0000)
> GS:ffff88022fac0000(0000) knlGS:0000000000000000
> Aug 01 01:49:17  kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
> Aug 01 01:49:17  kernel: CR2: 00002a6d3e049000 CR3: 000000003b206000
> CR4: 00000000001406e0
> Aug 01 01:49:17  kernel: DR0: 0000000000000000 DR1: 0000000000000000
> DR2: 0000000000000000
> Aug 01 01:49:17  kernel: DR3: 0000000000000000 DR6: 00000000fffe0ff0
> DR7: 0000000000000400
> Aug 01 01:49:17  kernel: Stack:
> Aug 01 01:49:17  kernel:  ffff880222817cf0 ffffffff981259d5
> 0000000000000000 ffffffff985162f0
> Aug 01 01:49:17  kernel:  ffff880222817d08 0000000000000003
> 00000000cb48f97b ffff880222817d54
> Aug 01 01:49:17  kernel:  ffff880222817d50 ffff880222817d40
> ffffffff98515ead 00000000000001b1
> Aug 01 01:49:17  kernel: Call Trace:
> Aug 01 01:49:17  kernel:  [<ffffffff981259d5>] ? update_curr+0xf5/0x170
> Aug 01 01:49:17  kernel:  [<ffffffff985162f0>] ?
> wrmsr_safe_regs_on_cpu+0x60/0x60
> Aug 01 01:49:17  kernel:  [<ffffffff98515ead>] rdmsr_on_cpu+0x5d/0x90
> Aug 01 01:49:17  kernel:  [<ffffffffc0d000f8>]
> sys_get_curr_temp+0x48/0xb0 [x86_pkg_temp_thermal]
> Aug 01 01:49:17  kernel:  [<ffffffff98866a28>] thermal_zone_get_temp+0x78/0x120
> Aug 01 01:49:17  kernel:  [<ffffffff9812e1ca>] ? pick_next_task_fair+0x10a/0x4c0
> Aug 01 01:49:17  kernel:  [<ffffffff98868803>]
> thermal_zone_device_update+0x43/0x1a0
> Aug 01 01:49:17  kernel:  [<ffffffffc0d00cd7>]
> pkg_temp_thermal_threshold_work_fn+0x1a7/0x214 [x86_pkg_temp_thermal]
> Aug 01 01:49:17  kernel:  [<ffffffff9810a38b>] process_one_work+0x14b/0x400
> Aug 01 01:49:17  kernel:  [<ffffffff9810a9cb>] worker_thread+0x4b/0x480
> Aug 01 01:49:17  kernel:  [<ffffffff9810a980>] ? rescuer_thread+0x340/0x340
> Aug 01 01:49:17  kernel:  [<ffffffff9810a980>] ? rescuer_thread+0x340/0x340
> Aug 01 01:49:17  kernel:  [<ffffffff9810fb38>] kthread+0xd8/0xf0
> Aug 01 01:49:17  kernel:  [<ffffffff98a3347f>] ret_from_fork+0x1f/0x40
> Aug 01 01:49:17  kernel:  [<ffffffff9810fa60>] ? kthread_park+0x60/0x60
> Aug 01 01:49:17  kernel: Code: 00 00 75 70 48 83 c4 38 5b 41 5c 5d c3
> 48 8d 75 c8 48 89 d1 89 df 4c 89 e2 e8 39 fe ff ff 8b 55 e0 83 e2 01
> 74 0a f3 90 8b 55 e0 <83> e2
> Aug 01 01:49:17  kernel: ata4.00: exception Emask 0x0 SAct 0x0 SErr
> 0x0 action 0x6 frozen
> Aug 01 01:49:17  kernel: ata4.00: cmd
> a0/00:00:00:08:00/00:00:00:00:00/a0 tag 28 pio 16392 in
>                                   Get event status notification 4a 01
> 00 00 10 00 00 00 08 00res 40/00:01:00:00:02/00:00:00:00:00/00 Emask
> 0x4 (timeout)
> Aug 01 01:49:17  kernel: ata4.00: status: { DRDY }
> Aug 01 01:49:17  kernel: ata4: hard resetting link
> Aug 01 01:49:17  kernel: ata4: SATA link up 1.5 Gbps (SStatus 113 SControl 300)
> Aug 01 01:49:17  kernel: sd 0:0:0:0: [sda] tag#12 FAILED Result:
> hostbyte=DID_OK driverbyte=DRIVER_TIMEOUT
> Aug 01 01:49:17  kernel: sd 0:0:0:0: [sda] tag#12 CDB: Read(10) 28 00
> 05 9a c2 d0 00 01 00 00
> Aug 01 01:49:17  kernel: blk_update_request: I/O error, dev sda, sector 94028496
> Aug 01 01:49:17  kernel: sd 0:0:0:0: [sda] tag#11 FAILED Result:
> hostbyte=DID_OK driverbyte=DRIVER_TIMEOUT
> Aug 01 01:49:17  kernel: sd 0:0:0:0: [sda] tag#11 CDB: Read(10) 28 00
> 05 9a c1 d0 00 01 00 00
> Aug 01 01:49:17  kernel: blk_update_request: I/O error, dev sda, sector 94028240
> Aug 01 01:49:17  kernel: ata4.00: configured for UDMA/133
> Aug 01 01:49:17  kernel: ata4: EH complete
> Aug 01 01:49:17  kernel: NMI watchdog: BUG: soft lockup - CPU#3 stuck
> for 22s! [kworker/3:1:174]
>
>
>
>
> Thanks
> Kui.Z

^ permalink raw reply	[flat|nested] 4+ messages in thread

* Re: change in xhci result in soft lockup
  2016-08-13 22:13 ` Kui Zhang
@ 2016-08-15 10:52   ` Mathias Nyman
  2016-08-16  8:35     ` Kui Zhang
  0 siblings, 1 reply; 4+ messages in thread
From: Mathias Nyman @ 2016-08-15 10:52 UTC (permalink / raw)
  To: Kui Zhang, gregkh, linux-kernel; +Cc: Alban Browaeys

On 14.08.2016 01:13, Kui Zhang wrote:
> Hello,
>
> System still hangs with 4.8.0-rc1+. There are new info in the logs.
>

Does Alban Browaeys Patch xhci: really enqueue zero length TRBs help?
It solves an ADB triggered issue in xhci in that same bad patch.

http://marc.info/?l=linux-usb&m=147102031717014&w=2

-Mathias

^ permalink raw reply	[flat|nested] 4+ messages in thread

* Re: change in xhci result in soft lockup
  2016-08-15 10:52   ` Mathias Nyman
@ 2016-08-16  8:35     ` Kui Zhang
  0 siblings, 0 replies; 4+ messages in thread
From: Kui Zhang @ 2016-08-16  8:35 UTC (permalink / raw)
  To: Mathias Nyman; +Cc: gregkh, linux-kernel, Alban Browaeys

Thanks.

This patch fixed my issue.


Kui.Z

On Mon, Aug 15, 2016 at 3:52 AM, Mathias Nyman
<mathias.nyman@linux.intel.com> wrote:
> On 14.08.2016 01:13, Kui Zhang wrote:
>>
>> Hello,
>>
>> System still hangs with 4.8.0-rc1+. There are new info in the logs.
>>
>
> Does Alban Browaeys Patch xhci: really enqueue zero length TRBs help?
> It solves an ADB triggered issue in xhci in that same bad patch.
>
> http://marc.info/?l=linux-usb&m=147102031717014&w=2
>
> -Mathias
>
>

^ permalink raw reply	[flat|nested] 4+ messages in thread

end of thread, other threads:[~2016-08-16  8:35 UTC | newest]

Thread overview: 4+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2016-08-06 10:34 change in xhci result in soft lockup Kui Zhang
2016-08-13 22:13 ` Kui Zhang
2016-08-15 10:52   ` Mathias Nyman
2016-08-16  8:35     ` Kui Zhang

This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.