All of lore.kernel.org
 help / color / mirror / Atom feed
* [BUG] hpsa: Controller lockup detected: 0x00150028
@ 2015-05-18 12:40 Peter Zijlstra
  2015-05-18 13:57 ` Oelke, Mark
  0 siblings, 1 reply; 11+ messages in thread
From: Peter Zijlstra @ 2015-05-18 12:40 UTC (permalink / raw)
  To: don.brace; +Cc: iss_storagedev, storagedev, linux-scsi

Hi,

On my HP-DL180-G6 with a HP Smart Array P212.

I can reliably trigger a controller lockup by running smartctl.

I'm trying to monitor my HDD temps using:

  for ((i=0; i<8; i++)) ; do
	smartctl -d cciss,$i -a /dev/sg0 | grep ^194 ;

  done | awk '{t=$10; if (t > T) T = t;} END {print T}'

After a few of those runs, I get:

[ 1540.277776] hpsa 0000:06:00.0: Controller lockup detected: 0x00150028

And my disks are gone.

With linux 3.16 the whole kernel came down with NMI watchdog timeouts /
RCU stalls in the detect_lockup() worklet.

On linux 4.0 those appear to be gone, but the controller isn't coming
back either.

It this a known 'feature'; is there anything I can do to help
diagnose/fix this issue?

 ~ Peter

^ permalink raw reply	[flat|nested] 11+ messages in thread

end of thread, other threads:[~2015-08-24 14:11 UTC | newest]

Thread overview: 11+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2015-05-18 12:40 [BUG] hpsa: Controller lockup detected: 0x00150028 Peter Zijlstra
2015-05-18 13:57 ` Oelke, Mark
2015-05-18 15:20   ` Peter Zijlstra
2015-05-18 16:03     ` Peter Zijlstra
2015-05-18 16:11       ` Peter Zijlstra
2015-05-22 15:10         ` Tomas Henzl
2015-05-22 16:40           ` Peter Zijlstra
2015-05-22 16:48             ` Handzik, Joe
2015-08-24  9:43               ` Wouter Depuydt
2015-08-24 10:02                 ` Wouter Depuydt
2015-08-24 14:11                   ` Don Brace

This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.