All of lore.kernel.org
 help / color / mirror / Atom feed
* Add support to debug unresponsive host
@ 2019-05-15 12:39 Jayanth Othayoth
  2019-05-15 18:26 ` Neeraj Ladkani
  2019-05-16  6:36 ` Deepak Kodihalli
  0 siblings, 2 replies; 7+ messages in thread
From: Jayanth Othayoth @ 2019-05-15 12:39 UTC (permalink / raw)
  To: openbmc, geissonator, bradleyb

[-- Attachment #1: Type: text/plain, Size: 2011 bytes --]

## Problem Description
Issue #457:  Add support to debug unresponsive host.

Scope: High level design direction to solve this problem,

## Background and References
There are situation at customer places where OPAL/Linux goes unresponsive
causing a system hang. And there is no way to figure out what went wrong
with Linux kernel or OPAL. Looking for a way to trigger a dump capture on
Linux host so that we can capture the OS dump for post analysis.

## Proposed Design for POWER processor based systems:
Get all Host CPUs in reset vector and Linux then has a mechanism to patch
it into panic-kdump path to trigger dump capture. This will enable us to
analyze and fix customer issue where we see Linux hang and unresponsive
system.

### Redfish Schema used:
* Reference: DSP2046 2018.3,
* ComputerSystem 1.6.0 schema provides an action called
#ComputerSystem.Reset”, This action is used to reset the system. ResetType
parameter is used  for indicating type of reset need to be performed. In
this use case we can use “Nmi” type
    * Nmi: Generate a Diagnostic Interrupt (usually an NMI on x86 systems)
to cease normal operations, perform diagnostic actions and typically halt
the system.
* ### d-bus :

Option 1:   Extending  the existing  d-bus interface  state.Host  name
space (
/openbmc/phosphor-dbus-interfaces/xyz/openbmc_project/State/Host.interface.yaml
) to support new RequestedHostTransition property called  “Nmi”.   d-bus
backend can internally invoke processor specific target to do Sreset(
equivalent to x86 NMI) and associated  actions.

Option 2: Introducing new d-bus interface in the control.state namespace (
/openbmc/phosphor-dbus-interfaces/xyz/openbmc_project/Control/Host/NMI.interface.yaml)
namespace and implement the new d-bus back-end for respective  processor
specific targets.

## Alternatives Considered
NA

## Impacts:
NA

## Testing
NA

Looking for input from the team on this  High level design direction
approach.

[-- Attachment #2: Type: text/html, Size: 2124 bytes --]

^ permalink raw reply	[flat|nested] 7+ messages in thread

end of thread, other threads:[~2019-05-27 12:42 UTC | newest]

Thread overview: 7+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2019-05-15 12:39 Add support to debug unresponsive host Jayanth Othayoth
2019-05-15 18:26 ` Neeraj Ladkani
2019-05-16  9:11   ` Artem Senichev
2019-05-16  6:36 ` Deepak Kodihalli
2019-05-16 13:01   ` Andrew Geissler
2019-05-27  7:15     ` Jayanth Othayoth
2019-05-27 12:42       ` vishwa

This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.