SError handling vs. SIGSEGV

From: Florian Fainelli @ 2020-03-28  4:31 UTC (permalink / raw)
  To: linux-arm-kernel, Catalin Marinas, Will Deacon, Mark Rutland,
	Dave.Martin, james.morse, Doug Berger, bcm-kernel-feedback-list,
	Scott Branden, Ray Jui

Hello,

Up until commit e4ba15debcfd27f60d43da940a58108783bff2a6 ("arm64:
fix for bad_mode() handler to always result in panic"), applications
running on Broadcom STB platforms were delivered a SIGSEGV when they
accessed register holes, or registers to which we have purposely blocked
access via the bus arbiter of the GISB (the proprietary bus for control
registers used on those SoCs). That commit arguably plugged a hole, in
that scheduling was possible when panic() was intended, so it is not
really the only culprit. We are actually relying on the SIGSEGV delivery
to pass a number of tests that specifically verify that register
blocking is effective without taking down the whole system.

Due to our SoC integration, all of those register access errors are
reported as SErrors, with the signature shown at the bottom of this
message.
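
For reference, the "code 0xbf000002" value printed in the log at the
bottom can be split into its architectural fields. The following is a
minimal sketch, assuming the standard ESR_ELx layout (EC in bits 31:26,
IL in bit 25, ISS in bits 24:0, and, for an SError, IDS in bit 24). The
helper name and the dictionary keys are made up for illustration and are
not kernel code.

```python
# Field positions follow the ESR_ELx layout; names are illustrative only.
ESR_EC_SHIFT = 26
ESR_EC_MASK = 0x3F
ESR_IL = 1 << 25                 # instruction length bit
ESR_ISS_MASK = (1 << 25) - 1     # bits 24:0
ESR_EC_SERROR = 0x2F             # exception class: SError interrupt
ESR_IDS = 1 << 24                # SError only: implementation defined syndrome


def decode_esr(esr):
    """Split a 32-bit ESR value into the fields relevant to an SError."""
    ec = (esr >> ESR_EC_SHIFT) & ESR_EC_MASK
    is_serror = ec == ESR_EC_SERROR
    return {
        "ec": ec,
        "is_serror": is_serror,
        "il": bool(esr & ESR_IL),
        "iss": esr & ESR_ISS_MASK,
        # IDS is only meaningful when the exception class is SError.
        "impl_defined": is_serror and bool(esr & ESR_IDS),
    }


if __name__ == "__main__":
    fields = decode_esr(0xBF000002)
    print("EC=%#x SError=%s IDS=%s ISS=%#x" % (
        fields["ec"], fields["is_serror"],
        fields["impl_defined"], fields["iss"]))
```

For 0xbf000002 this gives EC 0x2f (SError) with IDS set, meaning the
remaining syndrome bits are implementation defined and cannot be
interpreted generically.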

Doug had tried to submit a patch series that allowed a given platform to
install custom abort handlers, similar to what ARM 32-bit permits, but
this got shot down:

https://lkml.org/lkml/2017/3/24/413

and this was eventually merged in this shape:

https://lore.kernel.org/patchwork/cover/775056/

I understand that such an SError is deemed catastrophic and
unrecoverable, but rather than taking down the whole system, resolving
this with a SIGSEGV, provided the platform is known and hooks are in
place, would be more desirable IMHO. Otherwise we have a nice DoS
lurking around, and hard-to-debug systems in production, too.

As it stands today, I see no way to have a self-hosted test case that
verifies that our GISB bus arbiter blocking works correctly, because the
whole kernel is taken down when the test is successful :/
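
To illustrate the kind of self-hosted check meant here, below is a
minimal sketch of a harness that runs a command and reports whether it
was killed by SIGSEGV. It is an assumption about how such a test could
be structured, not one of our actual tests; the function name is
hypothetical, and the devmem command line mentioned afterwards uses a
placeholder rather than a real register address.

```python
import signal
import subprocess
import sys


def killed_by_sigsegv(argv):
    """Run argv and report whether the process was terminated by SIGSEGV."""
    result = subprocess.run(
        argv, stdout=subprocess.DEVNULL, stderr=subprocess.DEVNULL)
    # On POSIX, a returncode of -N means the child was killed by signal N.
    return result.returncode == -signal.SIGSEGV


if __name__ == "__main__":
    # Stand-in for the blocked register access: a child that raises
    # SIGSEGV on itself, so the harness can be tried on any machine.
    fake_access = [sys.executable, "-c",
                   "import os, signal; os.kill(os.getpid(), signal.SIGSEGV)"]
    print("blocked access detected:", killed_by_sigsegv(fake_access))
```

On the target, argv would be something like ["devmem", "<blocked
register address>"]. With SIGSEGV delivery the function returns True;
with the current behavior it never returns, because the kernel panics
first.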

Thank you!

[   14.460690] SError Interrupt on CPU3, code 0xbf000002 -- SError
[   14.460695] CPU: 3 PID: 177 Comm: devmem Not tainted 5.6.0-rc7-g3893c2025fec #82
[   14.460696] Hardware name: BCX972160DV (DT)
[   14.460697] pstate: 60000000 (nZCv daif -PAN -UAO)
[   14.460699] pc : 00000000004087b0
[   14.460700] lr : 0000000000407b54
[   14.460701] sp : 0000007fea6fd740
[   14.460702] x29: 0000007fea6fd7e0 x28: 0000000000000000
[   14.460706] x27: 0000000000000000 x26: 0000000000000000
[   14.460709] x25: 0000000000000000 x24: 0000000000000000
[   14.460712] x23: 0000000000000000 x22: 0000000000000004
[   14.460714] x21: 00000000004cf000 x20: 0000007fea6fd918
[   14.460717] x19: 0000000000000029 x18: 0000000000050600
[   14.460720] x17: 00000000004cf408 x16: 0000007fba21b3d8
[   14.460723] x15: 000000000000013b x14: 0000000000000000
[   14.460726] x13: 0000000000000000 x12: 0000000000000007
[   14.460729] x11: 0000000000000008 x10: 0101010101010101
[   14.460731] x9 : 0000007fba25b1a8 x8 : 00000000000000de
[   14.460734] x7 : 1fffffffffffffff x6 : 0000007fba2999f0
[   14.460737] x5 : 0000000009902000 x4 : 0000000000000003
[   14.460739] x3 : 0000000000000001 x2 : 0000007fea6fdf71
[   14.460742] x1 : 0000000000000030 x0 : 0000000000000000
[   14.460745] Kernel panic - not syncing: Asynchronous SError Interrupt
[   14.460747] CPU: 3 PID: 177 Comm: devmem Not tainted 5.6.0-rc7-g3893c2025fec #82
[   14.460749] Hardware name: BCX972160DV (DT)
[   14.460750] Call trace:
[   14.460751]  dump_backtrace+0x0/0x1d8
[   14.460752]  show_stack+0x24/0x30
[   14.460753]  dump_stack+0xdc/0x14c
[   14.460754]  panic+0x13c/0x320
[   14.460755]  nmi_panic+0x50/0x70
[   14.460757]  arm64_serror_panic+0x74/0x80
[   14.460758]  do_serror+0xb4/0x158
[   14.460759]  el0_error_naked+0x14/0x1c
[   14.460781] SMP: stopping secondary CPUs
[   14.460782] Kernel Offset: disabled
[   14.460783] CPU features: 0x00002,24002004
[   14.460784] Memory Limit: none
-- 
Florian
