linux-kernel.vger.kernel.org archive mirror
 help / color / mirror / Atom feed
* 4.18.0-rc1-next-20180619 boot failed on beagle board x15
@ 2018-06-20  5:52 Naresh Kamboju
  2018-06-20 10:29 ` Tony Lindgren
  0 siblings, 1 reply; 17+ messages in thread
From: Naresh Kamboju @ 2018-06-20  5:52 UTC (permalink / raw)
  To: linux-omap
  Cc: Linux-Next Mailing List, open list, Arnd Bergmann, ssantosh,
	Olof Johansson

Linux next (4.18.0-rc1-next-20180619) boot failed on beagle board x15.

Full log of trace please refer this link,
https://lkft.validation.linaro.org/scheduler/job/291535#L946

Here is back trace,
[    3.561640] omap_rng 48090000.rng: Random Number Generator ver. 20
[    3.611949] brd: module loaded
[    3.641298] loop: module loaded
[    3.646435] at24 0-0050: 4096 byte 24c32 EEPROM, writable, 1 bytes/write
[    3.658481] ------------[ cut here ]------------
[    3.663196] WARNING: CPU: 0 PID: 1 at
/srv/oe/build/tmp-rpb-glibc/work-shared/am57xx-evm/kernel-source/drivers/bus/omap_l3_noc.c:147
l3_interrupt_handler+0x27c/0x39c
[    3.678052] 44000000.ocp:L3 Custom Error: MASTER MPU TARGET L4_CFG
(Read): Data Access in User mode during Functional access
[    3.689330] Modules linked in:
[    3.692426] CPU: 0 PID: 1 Comm: swapper/0 Not tainted
4.18.0-rc1-next-20180619 #1
[    3.699952] Hardware name: Generic DRA74X (Flattened Device Tree)
[    3.706099] [<c04156f0>] (unwind_backtrace) from [<c040eec4>]
(show_stack+0x20/0x24)
[    3.713897] [<c040eec4>] (show_stack) from [<c116a668>]
(dump_stack+0xb0/0xdc)
[    3.721171] [<c116a668>] (dump_stack) from [<c0450a14>] (__warn+0x118/0x130)
[    3.728270] [<c0450a14>] (__warn) from [<c0450a84>]
(warn_slowpath_fmt+0x58/0x74)
[    3.735808] [<c0450a84>] (warn_slowpath_fmt) from [<c08be648>]
(l3_interrupt_handler+0x27c/0x39c)
[    3.744742] [<c08be648>] (l3_interrupt_handler) from [<c04cd270>]
(__handle_irq_event_percpu+0x90/0x45c)
[    3.754286] [<c04cd270>] (__handle_irq_event_percpu) from
[<c04cd678>] (handle_irq_event_percpu+0x3c/0x8c)
[    3.764001] [<c04cd678>] (handle_irq_event_percpu) from
[<c04cd710>] (handle_irq_event+0x48/0x6c)
[    3.772933] [<c04cd710>] (handle_irq_event) from [<c04d170c>]
(handle_fasteoi_irq+0xb0/0x164)
[    3.781513] [<c04d170c>] (handle_fasteoi_irq) from [<c04cc344>]
(generic_handle_irq+0x34/0x44)
[    3.790183] [<c04cc344>] (generic_handle_irq) from [<c04cca48>]
(__handle_domain_irq+0x6c/0xc4)
[    3.798941] [<c04cca48>] (__handle_domain_irq) from [<c0402528>]
(gic_handle_irq+0x68/0xac)
[    3.807347] [<c0402528>] (gic_handle_irq) from [<c04019bc>]
(__irq_svc+0x5c/0x7c)
[    3.814874] Exception stack(0xee179c70 to 0xee179cb8)
[    3.819963] 9c60:                                     fc140000
ee3e2f10 ee3a9810 00000000
[    3.828189] 9c80: 00000000 ee3e2f10 fc140004 fc140000 ee3a9800
ee3a9810 00000000 ee179cdc
[    3.836414] 9ca0: ee179ce0 ee179cc0 c0c3b364 c0c3b2ec 60000013 ffffffff
[    3.843079] [<c04019bc>] (__irq_svc) from [<c0c3b2ec>]
(ahci_enable_ahci+0x24/0x70)
[    3.850789] [<c0c3b2ec>] (ahci_enable_ahci) from [<c0c3b364>]
(ahci_save_initial_config+0x2c/0x388)
[    3.859894] [<c0c3b364>] (ahci_save_initial_config) from
[<c0c3cbdc>] (ahci_platform_init_host+0x9c/0x4f4)
[    3.869609] [<c0c3cbdc>] (ahci_platform_init_host) from
[<c0c3c33c>] (ahci_probe+0xa0/0xc0)
[    3.878018] [<c0c3c33c>] (ahci_probe) from [<c0ba40f0>]
(platform_drv_probe+0x58/0xac)
[    3.885989] [<c0ba40f0>] (platform_drv_probe) from [<c0ba1aec>]
(driver_probe_device+0x25c/0x338)
[    3.894915] [<c0ba1aec>] (driver_probe_device) from [<c0ba1cbc>]
(__driver_attach+0xf4/0xf8)
[    3.903411] [<c0ba1cbc>] (__driver_attach) from [<c0b9f9ac>]
(bus_for_each_dev+0x78/0xc4)
[    3.911643] [<c0b9f9ac>] (bus_for_each_dev) from [<c0ba13b8>]
(driver_attach+0x2c/0x30)
[    3.919696] [<c0ba13b8>] (driver_attach) from [<c0ba0cfc>]
(bus_add_driver+0x114/0x220)
[    3.927752] [<c0ba0cfc>] (bus_add_driver) from [<c0ba2e2c>]
(driver_register+0x88/0x11c)
[    3.935896] [<c0ba2e2c>] (driver_register) from [<c0ba4044>]
(__platform_driver_register+0x50/0x58)
[    3.945003] [<c0ba4044>] (__platform_driver_register) from
[<c1aa9cf0>] (ahci_driver_init+0x24/0x28)
[    3.954195] [<c1aa9cf0>] (ahci_driver_init) from [<c0403a8c>]
(do_one_initcall+0x9c/0x434)
[    3.962517] [<c0403a8c>] (do_one_initcall) from [<c1a01410>]
(kernel_init_freeable+0x3b8/0x53c)
[    3.971276] [<c1a01410>] (kernel_init_freeable) from [<c118551c>]
(kernel_init+0x18/0x128)
[    3.979597] [<c118551c>] (kernel_init) from [<c04010ac>]
(ret_from_fork+0x14/0x28)
[    3.987210] Exception stack(0xee179fb0 to 0xee179ff8)
[    3.992298] 9fa0:                                     00000000
00000000 00000000 00000000
[    4.000524] 9fc0: 00000000 00000000 00000000 00000000 00000000
00000000 00000000 00000000
[    4.008750] 9fe0: 00000000 00000000 00000000 00000000 00000013 00000000
[    4.015405] irq event stamp: 663130
[    4.018926] hardirqs last  enabled at (663129): [<c118d9d4>]
_raw_spin_unlock_irqrestore+0x40/0x6c
[    4.027939] hardirqs last disabled at (663130): [<c04019ac>]
__irq_svc+0x4c/0x7c
[    4.035384] softirqs last  enabled at (661486): [<c0402c98>]
__do_softirq+0x2a8/0x5e8
[    4.043262] softirqs last disabled at (661449): [<c04594a4>]
irq_exit+0x178/0x1a4
[    4.050826] ---[ end trace 095abac70926448f ]---

Best regards
Naresh Kamboju

^ permalink raw reply	[flat|nested] 17+ messages in thread

* Re: 4.18.0-rc1-next-20180619 boot failed on beagle board x15
  2018-06-20  5:52 4.18.0-rc1-next-20180619 boot failed on beagle board x15 Naresh Kamboju
@ 2018-06-20 10:29 ` Tony Lindgren
  2018-06-20 12:57   ` Samuel Morris
  2018-06-20 12:58   ` Roger Quadros
  0 siblings, 2 replies; 17+ messages in thread
From: Tony Lindgren @ 2018-06-20 10:29 UTC (permalink / raw)
  To: Naresh Kamboju, Samuel Morris, Tejun Heo, Roger Quadros
  Cc: linux-omap, Linux-Next Mailing List, open list, Arnd Bergmann,
	ssantosh, Olof Johansson

Hi,

* Naresh Kamboju <naresh.kamboju@linaro.org> [180620 05:55]:
> Linux next (4.18.0-rc1-next-20180619) boot failed on beagle board x15.

Bisect points to commit aece27a2f01b ("ata: ahci_platform: allow disabling of
hotplug to save power").

Reverting the patch makes things work again. Any ideas what
might be going wrong here? Things clearly idle but then there
seems to be some register access with clocks disabled.

> Full log of trace please refer this link,
> https://lkft.validation.linaro.org/scheduler/job/291535#L946
> 
> Here is back trace,
> [    3.561640] omap_rng 48090000.rng: Random Number Generator ver. 20
> [    3.611949] brd: module loaded
> [    3.641298] loop: module loaded
> [    3.646435] at24 0-0050: 4096 byte 24c32 EEPROM, writable, 1 bytes/write
> [    3.658481] ------------[ cut here ]------------
> [    3.663196] WARNING: CPU: 0 PID: 1 at
> /srv/oe/build/tmp-rpb-glibc/work-shared/am57xx-evm/kernel-source/drivers/bus/omap_l3_noc.c:147
> l3_interrupt_handler+0x27c/0x39c
> [    3.678052] 44000000.ocp:L3 Custom Error: MASTER MPU TARGET L4_CFG
> (Read): Data Access in User mode during Functional access
> [    3.689330] Modules linked in:
> [    3.692426] CPU: 0 PID: 1 Comm: swapper/0 Not tainted
> 4.18.0-rc1-next-20180619 #1
> [    3.699952] Hardware name: Generic DRA74X (Flattened Device Tree)
> [    3.706099] [<c04156f0>] (unwind_backtrace) from [<c040eec4>]
> (show_stack+0x20/0x24)
> [    3.713897] [<c040eec4>] (show_stack) from [<c116a668>]
> (dump_stack+0xb0/0xdc)
> [    3.721171] [<c116a668>] (dump_stack) from [<c0450a14>] (__warn+0x118/0x130)
> [    3.728270] [<c0450a14>] (__warn) from [<c0450a84>]
> (warn_slowpath_fmt+0x58/0x74)
> [    3.735808] [<c0450a84>] (warn_slowpath_fmt) from [<c08be648>]
> (l3_interrupt_handler+0x27c/0x39c)
> [    3.744742] [<c08be648>] (l3_interrupt_handler) from [<c04cd270>]
> (__handle_irq_event_percpu+0x90/0x45c)
> [    3.754286] [<c04cd270>] (__handle_irq_event_percpu) from
> [<c04cd678>] (handle_irq_event_percpu+0x3c/0x8c)
> [    3.764001] [<c04cd678>] (handle_irq_event_percpu) from
> [<c04cd710>] (handle_irq_event+0x48/0x6c)
> [    3.772933] [<c04cd710>] (handle_irq_event) from [<c04d170c>]
> (handle_fasteoi_irq+0xb0/0x164)
> [    3.781513] [<c04d170c>] (handle_fasteoi_irq) from [<c04cc344>]
> (generic_handle_irq+0x34/0x44)
> [    3.790183] [<c04cc344>] (generic_handle_irq) from [<c04cca48>]
> (__handle_domain_irq+0x6c/0xc4)
> [    3.798941] [<c04cca48>] (__handle_domain_irq) from [<c0402528>]
> (gic_handle_irq+0x68/0xac)
> [    3.807347] [<c0402528>] (gic_handle_irq) from [<c04019bc>]
> (__irq_svc+0x5c/0x7c)
> [    3.814874] Exception stack(0xee179c70 to 0xee179cb8)
> [    3.819963] 9c60:                                     fc140000
> ee3e2f10 ee3a9810 00000000
> [    3.828189] 9c80: 00000000 ee3e2f10 fc140004 fc140000 ee3a9800
> ee3a9810 00000000 ee179cdc
> [    3.836414] 9ca0: ee179ce0 ee179cc0 c0c3b364 c0c3b2ec 60000013 ffffffff
> [    3.843079] [<c04019bc>] (__irq_svc) from [<c0c3b2ec>]
> (ahci_enable_ahci+0x24/0x70)
> [    3.850789] [<c0c3b2ec>] (ahci_enable_ahci) from [<c0c3b364>]
> (ahci_save_initial_config+0x2c/0x388)
> [    3.859894] [<c0c3b364>] (ahci_save_initial_config) from
> [<c0c3cbdc>] (ahci_platform_init_host+0x9c/0x4f4)
> [    3.869609] [<c0c3cbdc>] (ahci_platform_init_host) from
> [<c0c3c33c>] (ahci_probe+0xa0/0xc0)
> [    3.878018] [<c0c3c33c>] (ahci_probe) from [<c0ba40f0>]
> (platform_drv_probe+0x58/0xac)
> [    3.885989] [<c0ba40f0>] (platform_drv_probe) from [<c0ba1aec>]
> (driver_probe_device+0x25c/0x338)
> [    3.894915] [<c0ba1aec>] (driver_probe_device) from [<c0ba1cbc>]
> (__driver_attach+0xf4/0xf8)
> [    3.903411] [<c0ba1cbc>] (__driver_attach) from [<c0b9f9ac>]
> (bus_for_each_dev+0x78/0xc4)
> [    3.911643] [<c0b9f9ac>] (bus_for_each_dev) from [<c0ba13b8>]
> (driver_attach+0x2c/0x30)
> [    3.919696] [<c0ba13b8>] (driver_attach) from [<c0ba0cfc>]
> (bus_add_driver+0x114/0x220)
> [    3.927752] [<c0ba0cfc>] (bus_add_driver) from [<c0ba2e2c>]
> (driver_register+0x88/0x11c)
> [    3.935896] [<c0ba2e2c>] (driver_register) from [<c0ba4044>]
> (__platform_driver_register+0x50/0x58)
> [    3.945003] [<c0ba4044>] (__platform_driver_register) from
> [<c1aa9cf0>] (ahci_driver_init+0x24/0x28)
> [    3.954195] [<c1aa9cf0>] (ahci_driver_init) from [<c0403a8c>]
> (do_one_initcall+0x9c/0x434)
> [    3.962517] [<c0403a8c>] (do_one_initcall) from [<c1a01410>]
> (kernel_init_freeable+0x3b8/0x53c)
> [    3.971276] [<c1a01410>] (kernel_init_freeable) from [<c118551c>]
> (kernel_init+0x18/0x128)
> [    3.979597] [<c118551c>] (kernel_init) from [<c04010ac>]
> (ret_from_fork+0x14/0x28)
> [    3.987210] Exception stack(0xee179fb0 to 0xee179ff8)
> [    3.992298] 9fa0:                                     00000000
> 00000000 00000000 00000000
> [    4.000524] 9fc0: 00000000 00000000 00000000 00000000 00000000
> 00000000 00000000 00000000
> [    4.008750] 9fe0: 00000000 00000000 00000000 00000000 00000013 00000000
> [    4.015405] irq event stamp: 663130
> [    4.018926] hardirqs last  enabled at (663129): [<c118d9d4>]
> _raw_spin_unlock_irqrestore+0x40/0x6c
> [    4.027939] hardirqs last disabled at (663130): [<c04019ac>]
> __irq_svc+0x4c/0x7c
> [    4.035384] softirqs last  enabled at (661486): [<c0402c98>]
> __do_softirq+0x2a8/0x5e8
> [    4.043262] softirqs last disabled at (661449): [<c04594a4>]
> irq_exit+0x178/0x1a4
> [    4.050826] ---[ end trace 095abac70926448f ]---
> 
> Best regards
> Naresh Kamboju
> --
> To unsubscribe from this list: send the line "unsubscribe linux-omap" in
> the body of a message to majordomo@vger.kernel.org
> More majordomo info at  http://vger.kernel.org/majordomo-info.html

^ permalink raw reply	[flat|nested] 17+ messages in thread

* Re: 4.18.0-rc1-next-20180619 boot failed on beagle board x15
  2018-06-20 10:29 ` Tony Lindgren
@ 2018-06-20 12:57   ` Samuel Morris
  2018-06-20 12:58   ` Roger Quadros
  1 sibling, 0 replies; 17+ messages in thread
From: Samuel Morris @ 2018-06-20 12:57 UTC (permalink / raw)
  To: Tony Lindgren
  Cc: Naresh Kamboju, Tejun Heo, Roger Quadros, linux-omap,
	Linux-Next Mailing List, open list, Arnd Bergmann, ssantosh,
	Olof Johansson

On Wed, Jun 20, 2018 at 6:29 AM, Tony Lindgren <tony@atomide.com> wrote:
> Hi,
>
> * Naresh Kamboju <naresh.kamboju@linaro.org> [180620 05:55]:
>> Linux next (4.18.0-rc1-next-20180619) boot failed on beagle board x15.
>
> Bisect points to commit aece27a2f01b ("ata: ahci_platform: allow disabling of
> hotplug to save power").
>
> Reverting the patch makes things work again. Any ideas what
> might be going wrong here? Things clearly idle but then there
> seems to be some register access with clocks disabled.
>
>> Full log of trace please refer this link,
>> https://lkft.validation.linaro.org/scheduler/job/291535#L946
>>
>> Here is back trace,
>> [    3.561640] omap_rng 48090000.rng: Random Number Generator ver. 20
>> [    3.611949] brd: module loaded
>> [    3.641298] loop: module loaded
>> [    3.646435] at24 0-0050: 4096 byte 24c32 EEPROM, writable, 1 bytes/write
>> [    3.658481] ------------[ cut here ]------------
>> [    3.663196] WARNING: CPU: 0 PID: 1 at
>> /srv/oe/build/tmp-rpb-glibc/work-shared/am57xx-evm/kernel-source/drivers/bus/omap_l3_noc.c:147
>> l3_interrupt_handler+0x27c/0x39c
>> [    3.678052] 44000000.ocp:L3 Custom Error: MASTER MPU TARGET L4_CFG
>> (Read): Data Access in User mode during Functional access
>> [    3.689330] Modules linked in:
>> [    3.692426] CPU: 0 PID: 1 Comm: swapper/0 Not tainted
>> 4.18.0-rc1-next-20180619 #1
>> [    3.699952] Hardware name: Generic DRA74X (Flattened Device Tree)
>> [    3.706099] [<c04156f0>] (unwind_backtrace) from [<c040eec4>]
>> (show_stack+0x20/0x24)
>> [    3.713897] [<c040eec4>] (show_stack) from [<c116a668>]
>> (dump_stack+0xb0/0xdc)
>> [    3.721171] [<c116a668>] (dump_stack) from [<c0450a14>] (__warn+0x118/0x130)
>> [    3.728270] [<c0450a14>] (__warn) from [<c0450a84>]
>> (warn_slowpath_fmt+0x58/0x74)
>> [    3.735808] [<c0450a84>] (warn_slowpath_fmt) from [<c08be648>]
>> (l3_interrupt_handler+0x27c/0x39c)
>> [    3.744742] [<c08be648>] (l3_interrupt_handler) from [<c04cd270>]
>> (__handle_irq_event_percpu+0x90/0x45c)
>> [    3.754286] [<c04cd270>] (__handle_irq_event_percpu) from
>> [<c04cd678>] (handle_irq_event_percpu+0x3c/0x8c)
>> [    3.764001] [<c04cd678>] (handle_irq_event_percpu) from
>> [<c04cd710>] (handle_irq_event+0x48/0x6c)
>> [    3.772933] [<c04cd710>] (handle_irq_event) from [<c04d170c>]
>> (handle_fasteoi_irq+0xb0/0x164)
>> [    3.781513] [<c04d170c>] (handle_fasteoi_irq) from [<c04cc344>]
>> (generic_handle_irq+0x34/0x44)
>> [    3.790183] [<c04cc344>] (generic_handle_irq) from [<c04cca48>]
>> (__handle_domain_irq+0x6c/0xc4)
>> [    3.798941] [<c04cca48>] (__handle_domain_irq) from [<c0402528>]
>> (gic_handle_irq+0x68/0xac)
>> [    3.807347] [<c0402528>] (gic_handle_irq) from [<c04019bc>]
>> (__irq_svc+0x5c/0x7c)
>> [    3.814874] Exception stack(0xee179c70 to 0xee179cb8)
>> [    3.819963] 9c60:                                     fc140000
>> ee3e2f10 ee3a9810 00000000
>> [    3.828189] 9c80: 00000000 ee3e2f10 fc140004 fc140000 ee3a9800
>> ee3a9810 00000000 ee179cdc
>> [    3.836414] 9ca0: ee179ce0 ee179cc0 c0c3b364 c0c3b2ec 60000013 ffffffff
>> [    3.843079] [<c04019bc>] (__irq_svc) from [<c0c3b2ec>]
>> (ahci_enable_ahci+0x24/0x70)
>> [    3.850789] [<c0c3b2ec>] (ahci_enable_ahci) from [<c0c3b364>]
>> (ahci_save_initial_config+0x2c/0x388)
>> [    3.859894] [<c0c3b364>] (ahci_save_initial_config) from
>> [<c0c3cbdc>] (ahci_platform_init_host+0x9c/0x4f4)

Now that the pm_runtime callbacks are actually implemented, the
runtime_forbid() I think will call ahci_platform_runtime_resume(), and
that will resume the host. That happens in
ahci_platform_get_resources(), before ahci_platform_init_host(). I
could see that putting the host in a bad state. Perhaps I should move
the runtime_forbid() clause to ahci_platform_init_host() instead,
after everything has been initialized. I can just make another patch
version after I do some testing.

>> [    3.869609] [<c0c3cbdc>] (ahci_platform_init_host) from
>> [<c0c3c33c>] (ahci_probe+0xa0/0xc0)
>> [    3.878018] [<c0c3c33c>] (ahci_probe) from [<c0ba40f0>]
>> (platform_drv_probe+0x58/0xac)
>> [    3.885989] [<c0ba40f0>] (platform_drv_probe) from [<c0ba1aec>]
>> (driver_probe_device+0x25c/0x338)
>> [    3.894915] [<c0ba1aec>] (driver_probe_device) from [<c0ba1cbc>]
>> (__driver_attach+0xf4/0xf8)
>> [    3.903411] [<c0ba1cbc>] (__driver_attach) from [<c0b9f9ac>]
>> (bus_for_each_dev+0x78/0xc4)
>> [    3.911643] [<c0b9f9ac>] (bus_for_each_dev) from [<c0ba13b8>]
>> (driver_attach+0x2c/0x30)
>> [    3.919696] [<c0ba13b8>] (driver_attach) from [<c0ba0cfc>]
>> (bus_add_driver+0x114/0x220)
>> [    3.927752] [<c0ba0cfc>] (bus_add_driver) from [<c0ba2e2c>]
>> (driver_register+0x88/0x11c)
>> [    3.935896] [<c0ba2e2c>] (driver_register) from [<c0ba4044>]
>> (__platform_driver_register+0x50/0x58)
>> [    3.945003] [<c0ba4044>] (__platform_driver_register) from
>> [<c1aa9cf0>] (ahci_driver_init+0x24/0x28)
>> [    3.954195] [<c1aa9cf0>] (ahci_driver_init) from [<c0403a8c>]
>> (do_one_initcall+0x9c/0x434)
>> [    3.962517] [<c0403a8c>] (do_one_initcall) from [<c1a01410>]
>> (kernel_init_freeable+0x3b8/0x53c)
>> [    3.971276] [<c1a01410>] (kernel_init_freeable) from [<c118551c>]
>> (kernel_init+0x18/0x128)
>> [    3.979597] [<c118551c>] (kernel_init) from [<c04010ac>]
>> (ret_from_fork+0x14/0x28)
>> [    3.987210] Exception stack(0xee179fb0 to 0xee179ff8)
>> [    3.992298] 9fa0:                                     00000000
>> 00000000 00000000 00000000
>> [    4.000524] 9fc0: 00000000 00000000 00000000 00000000 00000000
>> 00000000 00000000 00000000
>> [    4.008750] 9fe0: 00000000 00000000 00000000 00000000 00000013 00000000
>> [    4.015405] irq event stamp: 663130
>> [    4.018926] hardirqs last  enabled at (663129): [<c118d9d4>]
>> _raw_spin_unlock_irqrestore+0x40/0x6c
>> [    4.027939] hardirqs last disabled at (663130): [<c04019ac>]
>> __irq_svc+0x4c/0x7c
>> [    4.035384] softirqs last  enabled at (661486): [<c0402c98>]
>> __do_softirq+0x2a8/0x5e8
>> [    4.043262] softirqs last disabled at (661449): [<c04594a4>]
>> irq_exit+0x178/0x1a4
>> [    4.050826] ---[ end trace 095abac70926448f ]---
>>
>> Best regards
>> Naresh Kamboju
>> --
>> To unsubscribe from this list: send the line "unsubscribe linux-omap" in
>> the body of a message to majordomo@vger.kernel.org
>> More majordomo info at  http://vger.kernel.org/majordomo-info.html

thanks,
Sam

^ permalink raw reply	[flat|nested] 17+ messages in thread

* Re: 4.18.0-rc1-next-20180619 boot failed on beagle board x15
  2018-06-20 10:29 ` Tony Lindgren
  2018-06-20 12:57   ` Samuel Morris
@ 2018-06-20 12:58   ` Roger Quadros
  2018-06-20 15:30     ` Samuel Morris
  1 sibling, 1 reply; 17+ messages in thread
From: Roger Quadros @ 2018-06-20 12:58 UTC (permalink / raw)
  To: Tony Lindgren, Naresh Kamboju, Samuel Morris, Tejun Heo
  Cc: linux-omap, Linux-Next Mailing List, open list, Arnd Bergmann,
	ssantosh, Olof Johansson, Tero Kristo

Tony,

On 20/06/18 13:29, Tony Lindgren wrote:
> Hi,
> 
> * Naresh Kamboju <naresh.kamboju@linaro.org> [180620 05:55]:
>> Linux next (4.18.0-rc1-next-20180619) boot failed on beagle board x15.
> 
> Bisect points to commit aece27a2f01b ("ata: ahci_platform: allow disabling of
> hotplug to save power").
> 
> Reverting the patch makes things work again. Any ideas what
> might be going wrong here? Things clearly idle but then there
> seems to be some register access with clocks disabled.

The commit is doing this in probe.

+	pm_runtime_set_active(dev);
 	pm_runtime_enable(dev);
-	pm_runtime_get_sync(dev);
+	pm_runtime_forbid(dev);

On OMAP, the device is not guaranteed to be active at probe and so we can't
say pm_runtime_set_active() and get rid of pm_runtime_get_sync().


> 
>> Full log of trace please refer this link,
>> https://lkft.validation.linaro.org/scheduler/job/291535#L946
>>
>> Here is back trace,
>> [    3.561640] omap_rng 48090000.rng: Random Number Generator ver. 20
>> [    3.611949] brd: module loaded
>> [    3.641298] loop: module loaded
>> [    3.646435] at24 0-0050: 4096 byte 24c32 EEPROM, writable, 1 bytes/write
>> [    3.658481] ------------[ cut here ]------------
>> [    3.663196] WARNING: CPU: 0 PID: 1 at
>> /srv/oe/build/tmp-rpb-glibc/work-shared/am57xx-evm/kernel-source/drivers/bus/omap_l3_noc.c:147
>> l3_interrupt_handler+0x27c/0x39c
>> [    3.678052] 44000000.ocp:L3 Custom Error: MASTER MPU TARGET L4_CFG
>> (Read): Data Access in User mode during Functional access
>> [    3.689330] Modules linked in:
>> [    3.692426] CPU: 0 PID: 1 Comm: swapper/0 Not tainted
>> 4.18.0-rc1-next-20180619 #1
>> [    3.699952] Hardware name: Generic DRA74X (Flattened Device Tree)
>> [    3.706099] [<c04156f0>] (unwind_backtrace) from [<c040eec4>]
>> (show_stack+0x20/0x24)
>> [    3.713897] [<c040eec4>] (show_stack) from [<c116a668>]
>> (dump_stack+0xb0/0xdc)
>> [    3.721171] [<c116a668>] (dump_stack) from [<c0450a14>] (__warn+0x118/0x130)
>> [    3.728270] [<c0450a14>] (__warn) from [<c0450a84>]
>> (warn_slowpath_fmt+0x58/0x74)
>> [    3.735808] [<c0450a84>] (warn_slowpath_fmt) from [<c08be648>]
>> (l3_interrupt_handler+0x27c/0x39c)
>> [    3.744742] [<c08be648>] (l3_interrupt_handler) from [<c04cd270>]
>> (__handle_irq_event_percpu+0x90/0x45c)
>> [    3.754286] [<c04cd270>] (__handle_irq_event_percpu) from
>> [<c04cd678>] (handle_irq_event_percpu+0x3c/0x8c)
>> [    3.764001] [<c04cd678>] (handle_irq_event_percpu) from
>> [<c04cd710>] (handle_irq_event+0x48/0x6c)
>> [    3.772933] [<c04cd710>] (handle_irq_event) from [<c04d170c>]
>> (handle_fasteoi_irq+0xb0/0x164)
>> [    3.781513] [<c04d170c>] (handle_fasteoi_irq) from [<c04cc344>]
>> (generic_handle_irq+0x34/0x44)
>> [    3.790183] [<c04cc344>] (generic_handle_irq) from [<c04cca48>]
>> (__handle_domain_irq+0x6c/0xc4)
>> [    3.798941] [<c04cca48>] (__handle_domain_irq) from [<c0402528>]
>> (gic_handle_irq+0x68/0xac)
>> [    3.807347] [<c0402528>] (gic_handle_irq) from [<c04019bc>]
>> (__irq_svc+0x5c/0x7c)
>> [    3.814874] Exception stack(0xee179c70 to 0xee179cb8)
>> [    3.819963] 9c60:                                     fc140000
>> ee3e2f10 ee3a9810 00000000
>> [    3.828189] 9c80: 00000000 ee3e2f10 fc140004 fc140000 ee3a9800
>> ee3a9810 00000000 ee179cdc
>> [    3.836414] 9ca0: ee179ce0 ee179cc0 c0c3b364 c0c3b2ec 60000013 ffffffff
>> [    3.843079] [<c04019bc>] (__irq_svc) from [<c0c3b2ec>]
>> (ahci_enable_ahci+0x24/0x70)
>> [    3.850789] [<c0c3b2ec>] (ahci_enable_ahci) from [<c0c3b364>]
>> (ahci_save_initial_config+0x2c/0x388)
>> [    3.859894] [<c0c3b364>] (ahci_save_initial_config) from
>> [<c0c3cbdc>] (ahci_platform_init_host+0x9c/0x4f4)
>> [    3.869609] [<c0c3cbdc>] (ahci_platform_init_host) from
>> [<c0c3c33c>] (ahci_probe+0xa0/0xc0)
>> [    3.878018] [<c0c3c33c>] (ahci_probe) from [<c0ba40f0>]
>> (platform_drv_probe+0x58/0xac)
>> [    3.885989] [<c0ba40f0>] (platform_drv_probe) from [<c0ba1aec>]
>> (driver_probe_device+0x25c/0x338)
>> [    3.894915] [<c0ba1aec>] (driver_probe_device) from [<c0ba1cbc>]
>> (__driver_attach+0xf4/0xf8)
>> [    3.903411] [<c0ba1cbc>] (__driver_attach) from [<c0b9f9ac>]
>> (bus_for_each_dev+0x78/0xc4)
>> [    3.911643] [<c0b9f9ac>] (bus_for_each_dev) from [<c0ba13b8>]
>> (driver_attach+0x2c/0x30)
>> [    3.919696] [<c0ba13b8>] (driver_attach) from [<c0ba0cfc>]
>> (bus_add_driver+0x114/0x220)
>> [    3.927752] [<c0ba0cfc>] (bus_add_driver) from [<c0ba2e2c>]
>> (driver_register+0x88/0x11c)
>> [    3.935896] [<c0ba2e2c>] (driver_register) from [<c0ba4044>]
>> (__platform_driver_register+0x50/0x58)
>> [    3.945003] [<c0ba4044>] (__platform_driver_register) from
>> [<c1aa9cf0>] (ahci_driver_init+0x24/0x28)
>> [    3.954195] [<c1aa9cf0>] (ahci_driver_init) from [<c0403a8c>]
>> (do_one_initcall+0x9c/0x434)
>> [    3.962517] [<c0403a8c>] (do_one_initcall) from [<c1a01410>]
>> (kernel_init_freeable+0x3b8/0x53c)
>> [    3.971276] [<c1a01410>] (kernel_init_freeable) from [<c118551c>]
>> (kernel_init+0x18/0x128)
>> [    3.979597] [<c118551c>] (kernel_init) from [<c04010ac>]
>> (ret_from_fork+0x14/0x28)
>> [    3.987210] Exception stack(0xee179fb0 to 0xee179ff8)
>> [    3.992298] 9fa0:                                     00000000
>> 00000000 00000000 00000000
>> [    4.000524] 9fc0: 00000000 00000000 00000000 00000000 00000000
>> 00000000 00000000 00000000
>> [    4.008750] 9fe0: 00000000 00000000 00000000 00000000 00000013 00000000
>> [    4.015405] irq event stamp: 663130
>> [    4.018926] hardirqs last  enabled at (663129): [<c118d9d4>]
>> _raw_spin_unlock_irqrestore+0x40/0x6c
>> [    4.027939] hardirqs last disabled at (663130): [<c04019ac>]
>> __irq_svc+0x4c/0x7c
>> [    4.035384] softirqs last  enabled at (661486): [<c0402c98>]
>> __do_softirq+0x2a8/0x5e8
>> [    4.043262] softirqs last disabled at (661449): [<c04594a4>]
>> irq_exit+0x178/0x1a4
>> [    4.050826] ---[ end trace 095abac70926448f ]---
>>
>> Best regards
>> Naresh Kamboju
>> --
>> To unsubscribe from this list: send the line "unsubscribe linux-omap" in
>> the body of a message to majordomo@vger.kernel.org
>> More majordomo info at  http://vger.kernel.org/majordomo-info.html

-- 
cheers,
-roger

Texas Instruments Finland Oy, Porkkalankatu 22, 00180 Helsinki.
Y-tunnus/Business ID: 0615521-4. Kotipaikka/Domicile: Helsinki

^ permalink raw reply	[flat|nested] 17+ messages in thread

* Re: 4.18.0-rc1-next-20180619 boot failed on beagle board x15
  2018-06-20 12:58   ` Roger Quadros
@ 2018-06-20 15:30     ` Samuel Morris
  2018-06-21  7:58       ` Roger Quadros
  0 siblings, 1 reply; 17+ messages in thread
From: Samuel Morris @ 2018-06-20 15:30 UTC (permalink / raw)
  To: Roger Quadros
  Cc: Tony Lindgren, Naresh Kamboju, Tejun Heo, linux-omap,
	Linux-Next Mailing List, open list, Arnd Bergmann, ssantosh,
	Olof Johansson, Tero Kristo

On Wed, Jun 20, 2018 at 8:58 AM, Roger Quadros <rogerq@ti.com> wrote:
> Tony,
>
> On 20/06/18 13:29, Tony Lindgren wrote:
>> Hi,
>>
>> * Naresh Kamboju <naresh.kamboju@linaro.org> [180620 05:55]:
>>> Linux next (4.18.0-rc1-next-20180619) boot failed on beagle board x15.
>>
>> Bisect points to commit aece27a2f01b ("ata: ahci_platform: allow disabling of
>> hotplug to save power").
>>
>> Reverting the patch makes things work again. Any ideas what
>> might be going wrong here? Things clearly idle but then there
>> seems to be some register access with clocks disabled.
>
> The commit is doing this in probe.
>
> +       pm_runtime_set_active(dev);
>         pm_runtime_enable(dev);
> -       pm_runtime_get_sync(dev);
> +       pm_runtime_forbid(dev);
>
> On OMAP, the device is not guaranteed to be active at probe and so we can't
> say pm_runtime_set_active() and get rid of pm_runtime_get_sync().

Okay, by calling set_active(), I'm preventing the rpm_resume from
completing that would normally happen in pm_runtime_forbid(). I assume
you mean that there are parent devices that need to be resumed before
this device may be assumed active. I'm going to try removing the
set_active(), then move that clause to the end of
ahci_platform_init_host(). The pm_runtime_forbid() is effectively the
same as get_sync(), it just also sets the runtime_auto flag to false.
I don't think we should be saying the device is active until the host
is initialized, so that seems like a better, common place for the
pm_runtime init callbacks anyway. How does that sound?

>
>
>>
>>> Full log of trace please refer this link,
>>> https://lkft.validation.linaro.org/scheduler/job/291535#L946
>>>
>>> Here is back trace,
>>> [    3.561640] omap_rng 48090000.rng: Random Number Generator ver. 20
>>> [    3.611949] brd: module loaded
>>> [    3.641298] loop: module loaded
>>> [    3.646435] at24 0-0050: 4096 byte 24c32 EEPROM, writable, 1 bytes/write
>>> [    3.658481] ------------[ cut here ]------------
>>> [    3.663196] WARNING: CPU: 0 PID: 1 at
>>> /srv/oe/build/tmp-rpb-glibc/work-shared/am57xx-evm/kernel-source/drivers/bus/omap_l3_noc.c:147
>>> l3_interrupt_handler+0x27c/0x39c
>>> [    3.678052] 44000000.ocp:L3 Custom Error: MASTER MPU TARGET L4_CFG
>>> (Read): Data Access in User mode during Functional access
>>> [    3.689330] Modules linked in:
>>> [    3.692426] CPU: 0 PID: 1 Comm: swapper/0 Not tainted
>>> 4.18.0-rc1-next-20180619 #1
>>> [    3.699952] Hardware name: Generic DRA74X (Flattened Device Tree)
>>> [    3.706099] [<c04156f0>] (unwind_backtrace) from [<c040eec4>]
>>> (show_stack+0x20/0x24)
>>> [    3.713897] [<c040eec4>] (show_stack) from [<c116a668>]
>>> (dump_stack+0xb0/0xdc)
>>> [    3.721171] [<c116a668>] (dump_stack) from [<c0450a14>] (__warn+0x118/0x130)
>>> [    3.728270] [<c0450a14>] (__warn) from [<c0450a84>]
>>> (warn_slowpath_fmt+0x58/0x74)
>>> [    3.735808] [<c0450a84>] (warn_slowpath_fmt) from [<c08be648>]
>>> (l3_interrupt_handler+0x27c/0x39c)
>>> [    3.744742] [<c08be648>] (l3_interrupt_handler) from [<c04cd270>]
>>> (__handle_irq_event_percpu+0x90/0x45c)
>>> [    3.754286] [<c04cd270>] (__handle_irq_event_percpu) from
>>> [<c04cd678>] (handle_irq_event_percpu+0x3c/0x8c)
>>> [    3.764001] [<c04cd678>] (handle_irq_event_percpu) from
>>> [<c04cd710>] (handle_irq_event+0x48/0x6c)
>>> [    3.772933] [<c04cd710>] (handle_irq_event) from [<c04d170c>]
>>> (handle_fasteoi_irq+0xb0/0x164)
>>> [    3.781513] [<c04d170c>] (handle_fasteoi_irq) from [<c04cc344>]
>>> (generic_handle_irq+0x34/0x44)
>>> [    3.790183] [<c04cc344>] (generic_handle_irq) from [<c04cca48>]
>>> (__handle_domain_irq+0x6c/0xc4)
>>> [    3.798941] [<c04cca48>] (__handle_domain_irq) from [<c0402528>]
>>> (gic_handle_irq+0x68/0xac)
>>> [    3.807347] [<c0402528>] (gic_handle_irq) from [<c04019bc>]
>>> (__irq_svc+0x5c/0x7c)
>>> [    3.814874] Exception stack(0xee179c70 to 0xee179cb8)
>>> [    3.819963] 9c60:                                     fc140000
>>> ee3e2f10 ee3a9810 00000000
>>> [    3.828189] 9c80: 00000000 ee3e2f10 fc140004 fc140000 ee3a9800
>>> ee3a9810 00000000 ee179cdc
>>> [    3.836414] 9ca0: ee179ce0 ee179cc0 c0c3b364 c0c3b2ec 60000013 ffffffff
>>> [    3.843079] [<c04019bc>] (__irq_svc) from [<c0c3b2ec>]
>>> (ahci_enable_ahci+0x24/0x70)
>>> [    3.850789] [<c0c3b2ec>] (ahci_enable_ahci) from [<c0c3b364>]
>>> (ahci_save_initial_config+0x2c/0x388)
>>> [    3.859894] [<c0c3b364>] (ahci_save_initial_config) from
>>> [<c0c3cbdc>] (ahci_platform_init_host+0x9c/0x4f4)
>>> [    3.869609] [<c0c3cbdc>] (ahci_platform_init_host) from
>>> [<c0c3c33c>] (ahci_probe+0xa0/0xc0)
>>> [    3.878018] [<c0c3c33c>] (ahci_probe) from [<c0ba40f0>]
>>> (platform_drv_probe+0x58/0xac)
>>> [    3.885989] [<c0ba40f0>] (platform_drv_probe) from [<c0ba1aec>]
>>> (driver_probe_device+0x25c/0x338)
>>> [    3.894915] [<c0ba1aec>] (driver_probe_device) from [<c0ba1cbc>]
>>> (__driver_attach+0xf4/0xf8)
>>> [    3.903411] [<c0ba1cbc>] (__driver_attach) from [<c0b9f9ac>]
>>> (bus_for_each_dev+0x78/0xc4)
>>> [    3.911643] [<c0b9f9ac>] (bus_for_each_dev) from [<c0ba13b8>]
>>> (driver_attach+0x2c/0x30)
>>> [    3.919696] [<c0ba13b8>] (driver_attach) from [<c0ba0cfc>]
>>> (bus_add_driver+0x114/0x220)
>>> [    3.927752] [<c0ba0cfc>] (bus_add_driver) from [<c0ba2e2c>]
>>> (driver_register+0x88/0x11c)
>>> [    3.935896] [<c0ba2e2c>] (driver_register) from [<c0ba4044>]
>>> (__platform_driver_register+0x50/0x58)
>>> [    3.945003] [<c0ba4044>] (__platform_driver_register) from
>>> [<c1aa9cf0>] (ahci_driver_init+0x24/0x28)
>>> [    3.954195] [<c1aa9cf0>] (ahci_driver_init) from [<c0403a8c>]
>>> (do_one_initcall+0x9c/0x434)
>>> [    3.962517] [<c0403a8c>] (do_one_initcall) from [<c1a01410>]
>>> (kernel_init_freeable+0x3b8/0x53c)
>>> [    3.971276] [<c1a01410>] (kernel_init_freeable) from [<c118551c>]
>>> (kernel_init+0x18/0x128)
>>> [    3.979597] [<c118551c>] (kernel_init) from [<c04010ac>]
>>> (ret_from_fork+0x14/0x28)
>>> [    3.987210] Exception stack(0xee179fb0 to 0xee179ff8)
>>> [    3.992298] 9fa0:                                     00000000
>>> 00000000 00000000 00000000
>>> [    4.000524] 9fc0: 00000000 00000000 00000000 00000000 00000000
>>> 00000000 00000000 00000000
>>> [    4.008750] 9fe0: 00000000 00000000 00000000 00000000 00000013 00000000
>>> [    4.015405] irq event stamp: 663130
>>> [    4.018926] hardirqs last  enabled at (663129): [<c118d9d4>]
>>> _raw_spin_unlock_irqrestore+0x40/0x6c
>>> [    4.027939] hardirqs last disabled at (663130): [<c04019ac>]
>>> __irq_svc+0x4c/0x7c
>>> [    4.035384] softirqs last  enabled at (661486): [<c0402c98>]
>>> __do_softirq+0x2a8/0x5e8
>>> [    4.043262] softirqs last disabled at (661449): [<c04594a4>]
>>> irq_exit+0x178/0x1a4
>>> [    4.050826] ---[ end trace 095abac70926448f ]---
>>>
>>> Best regards
>>> Naresh Kamboju
>>> --
>>> To unsubscribe from this list: send the line "unsubscribe linux-omap" in
>>> the body of a message to majordomo@vger.kernel.org
>>> More majordomo info at  http://vger.kernel.org/majordomo-info.html
>
> --
> cheers,
> -roger
>
> Texas Instruments Finland Oy, Porkkalankatu 22, 00180 Helsinki.
> Y-tunnus/Business ID: 0615521-4. Kotipaikka/Domicile: Helsinki

^ permalink raw reply	[flat|nested] 17+ messages in thread

* Re: 4.18.0-rc1-next-20180619 boot failed on beagle board x15
  2018-06-20 15:30     ` Samuel Morris
@ 2018-06-21  7:58       ` Roger Quadros
  2018-06-21 12:02         ` Rafael J. Wysocki
  2018-06-21 14:31         ` Samuel Morris
  0 siblings, 2 replies; 17+ messages in thread
From: Roger Quadros @ 2018-06-21  7:58 UTC (permalink / raw)
  To: Samuel Morris
  Cc: Tony Lindgren, Naresh Kamboju, Tejun Heo, linux-omap,
	Linux-Next Mailing List, open list, Arnd Bergmann, ssantosh,
	Olof Johansson, Tero Kristo, rafael

+Rafael

On 20/06/18 18:30, Samuel Morris wrote:
> On Wed, Jun 20, 2018 at 8:58 AM, Roger Quadros <rogerq@ti.com> wrote:
>> Tony,
>>
>> On 20/06/18 13:29, Tony Lindgren wrote:
>>> Hi,
>>>
>>> * Naresh Kamboju <naresh.kamboju@linaro.org> [180620 05:55]:
>>>> Linux next (4.18.0-rc1-next-20180619) boot failed on beagle board x15.
>>>
>>> Bisect points to commit aece27a2f01b ("ata: ahci_platform: allow disabling of
>>> hotplug to save power").
>>>
>>> Reverting the patch makes things work again. Any ideas what
>>> might be going wrong here? Things clearly idle but then there
>>> seems to be some register access with clocks disabled.
>>
>> The commit is doing this in probe.
>>
>> +       pm_runtime_set_active(dev);
>>         pm_runtime_enable(dev);
>> -       pm_runtime_get_sync(dev);
>> +       pm_runtime_forbid(dev);
>>
>> On OMAP, the device is not guaranteed to be active at probe and so we can't
>> say pm_runtime_set_active() and get rid of pm_runtime_get_sync().
> 
> Okay, by calling set_active(), I'm preventing the rpm_resume from
> completing that would normally happen in pm_runtime_forbid(). I assume
> you mean that there are parent devices that need to be resumed before

Actually, in the OMAP case, the AHCI controller device isn't active when probe is called.
For other platforms this might not be the case. So we need to be careful here.

> this device may be assumed active. I'm going to try removing the
> set_active(), then move that clause to the end of
> ahci_platform_init_host(). The pm_runtime_forbid() is effectively the
> same as get_sync(), it just also sets the runtime_auto flag to false.
> I don't think we should be saying the device is active until the host
> is initialized, so that seems like a better, common place for the
> pm_runtime init callbacks anyway. How does that sound?

Device active and initialized are different things. If the device is powered up
and can be accessed it is active, even if it is not yet initialized.
I don't think we should club the two.

Why do you need to call pm_runtime_set_active() at all in the probe sequence?

Documentation for pm_runtime_set_active() says,
"(it is only valid to use this function if 'power.runtime_error' is set
or 'power.disable_depth' is greater than zero);"

> 
>>
>>
>>>
>>>> Full log of trace please refer this link,
>>>> https://lkft.validation.linaro.org/scheduler/job/291535#L946
>>>>
>>>> Here is back trace,
>>>> [    3.561640] omap_rng 48090000.rng: Random Number Generator ver. 20
>>>> [    3.611949] brd: module loaded
>>>> [    3.641298] loop: module loaded
>>>> [    3.646435] at24 0-0050: 4096 byte 24c32 EEPROM, writable, 1 bytes/write
>>>> [    3.658481] ------------[ cut here ]------------
>>>> [    3.663196] WARNING: CPU: 0 PID: 1 at
>>>> /srv/oe/build/tmp-rpb-glibc/work-shared/am57xx-evm/kernel-source/drivers/bus/omap_l3_noc.c:147
>>>> l3_interrupt_handler+0x27c/0x39c
>>>> [    3.678052] 44000000.ocp:L3 Custom Error: MASTER MPU TARGET L4_CFG
>>>> (Read): Data Access in User mode during Functional access
>>>> [    3.689330] Modules linked in:
>>>> [    3.692426] CPU: 0 PID: 1 Comm: swapper/0 Not tainted
>>>> 4.18.0-rc1-next-20180619 #1
>>>> [    3.699952] Hardware name: Generic DRA74X (Flattened Device Tree)
>>>> [    3.706099] [<c04156f0>] (unwind_backtrace) from [<c040eec4>]
>>>> (show_stack+0x20/0x24)
>>>> [    3.713897] [<c040eec4>] (show_stack) from [<c116a668>]
>>>> (dump_stack+0xb0/0xdc)
>>>> [    3.721171] [<c116a668>] (dump_stack) from [<c0450a14>] (__warn+0x118/0x130)
>>>> [    3.728270] [<c0450a14>] (__warn) from [<c0450a84>]
>>>> (warn_slowpath_fmt+0x58/0x74)
>>>> [    3.735808] [<c0450a84>] (warn_slowpath_fmt) from [<c08be648>]
>>>> (l3_interrupt_handler+0x27c/0x39c)
>>>> [    3.744742] [<c08be648>] (l3_interrupt_handler) from [<c04cd270>]
>>>> (__handle_irq_event_percpu+0x90/0x45c)
>>>> [    3.754286] [<c04cd270>] (__handle_irq_event_percpu) from
>>>> [<c04cd678>] (handle_irq_event_percpu+0x3c/0x8c)
>>>> [    3.764001] [<c04cd678>] (handle_irq_event_percpu) from
>>>> [<c04cd710>] (handle_irq_event+0x48/0x6c)
>>>> [    3.772933] [<c04cd710>] (handle_irq_event) from [<c04d170c>]
>>>> (handle_fasteoi_irq+0xb0/0x164)
>>>> [    3.781513] [<c04d170c>] (handle_fasteoi_irq) from [<c04cc344>]
>>>> (generic_handle_irq+0x34/0x44)
>>>> [    3.790183] [<c04cc344>] (generic_handle_irq) from [<c04cca48>]
>>>> (__handle_domain_irq+0x6c/0xc4)
>>>> [    3.798941] [<c04cca48>] (__handle_domain_irq) from [<c0402528>]
>>>> (gic_handle_irq+0x68/0xac)
>>>> [    3.807347] [<c0402528>] (gic_handle_irq) from [<c04019bc>]
>>>> (__irq_svc+0x5c/0x7c)
>>>> [    3.814874] Exception stack(0xee179c70 to 0xee179cb8)
>>>> [    3.819963] 9c60:                                     fc140000
>>>> ee3e2f10 ee3a9810 00000000
>>>> [    3.828189] 9c80: 00000000 ee3e2f10 fc140004 fc140000 ee3a9800
>>>> ee3a9810 00000000 ee179cdc
>>>> [    3.836414] 9ca0: ee179ce0 ee179cc0 c0c3b364 c0c3b2ec 60000013 ffffffff
>>>> [    3.843079] [<c04019bc>] (__irq_svc) from [<c0c3b2ec>]
>>>> (ahci_enable_ahci+0x24/0x70)
>>>> [    3.850789] [<c0c3b2ec>] (ahci_enable_ahci) from [<c0c3b364>]
>>>> (ahci_save_initial_config+0x2c/0x388)
>>>> [    3.859894] [<c0c3b364>] (ahci_save_initial_config) from
>>>> [<c0c3cbdc>] (ahci_platform_init_host+0x9c/0x4f4)
>>>> [    3.869609] [<c0c3cbdc>] (ahci_platform_init_host) from
>>>> [<c0c3c33c>] (ahci_probe+0xa0/0xc0)
>>>> [    3.878018] [<c0c3c33c>] (ahci_probe) from [<c0ba40f0>]
>>>> (platform_drv_probe+0x58/0xac)
>>>> [    3.885989] [<c0ba40f0>] (platform_drv_probe) from [<c0ba1aec>]
>>>> (driver_probe_device+0x25c/0x338)
>>>> [    3.894915] [<c0ba1aec>] (driver_probe_device) from [<c0ba1cbc>]
>>>> (__driver_attach+0xf4/0xf8)
>>>> [    3.903411] [<c0ba1cbc>] (__driver_attach) from [<c0b9f9ac>]
>>>> (bus_for_each_dev+0x78/0xc4)
>>>> [    3.911643] [<c0b9f9ac>] (bus_for_each_dev) from [<c0ba13b8>]
>>>> (driver_attach+0x2c/0x30)
>>>> [    3.919696] [<c0ba13b8>] (driver_attach) from [<c0ba0cfc>]
>>>> (bus_add_driver+0x114/0x220)
>>>> [    3.927752] [<c0ba0cfc>] (bus_add_driver) from [<c0ba2e2c>]
>>>> (driver_register+0x88/0x11c)
>>>> [    3.935896] [<c0ba2e2c>] (driver_register) from [<c0ba4044>]
>>>> (__platform_driver_register+0x50/0x58)
>>>> [    3.945003] [<c0ba4044>] (__platform_driver_register) from
>>>> [<c1aa9cf0>] (ahci_driver_init+0x24/0x28)
>>>> [    3.954195] [<c1aa9cf0>] (ahci_driver_init) from [<c0403a8c>]
>>>> (do_one_initcall+0x9c/0x434)
>>>> [    3.962517] [<c0403a8c>] (do_one_initcall) from [<c1a01410>]
>>>> (kernel_init_freeable+0x3b8/0x53c)
>>>> [    3.971276] [<c1a01410>] (kernel_init_freeable) from [<c118551c>]
>>>> (kernel_init+0x18/0x128)
>>>> [    3.979597] [<c118551c>] (kernel_init) from [<c04010ac>]
>>>> (ret_from_fork+0x14/0x28)
>>>> [    3.987210] Exception stack(0xee179fb0 to 0xee179ff8)
>>>> [    3.992298] 9fa0:                                     00000000
>>>> 00000000 00000000 00000000
>>>> [    4.000524] 9fc0: 00000000 00000000 00000000 00000000 00000000
>>>> 00000000 00000000 00000000
>>>> [    4.008750] 9fe0: 00000000 00000000 00000000 00000000 00000013 00000000
>>>> [    4.015405] irq event stamp: 663130
>>>> [    4.018926] hardirqs last  enabled at (663129): [<c118d9d4>]
>>>> _raw_spin_unlock_irqrestore+0x40/0x6c
>>>> [    4.027939] hardirqs last disabled at (663130): [<c04019ac>]
>>>> __irq_svc+0x4c/0x7c
>>>> [    4.035384] softirqs last  enabled at (661486): [<c0402c98>]
>>>> __do_softirq+0x2a8/0x5e8
>>>> [    4.043262] softirqs last disabled at (661449): [<c04594a4>]
>>>> irq_exit+0x178/0x1a4
>>>> [    4.050826] ---[ end trace 095abac70926448f ]---
>>>>
>>>> Best regards
>>>> Naresh Kamboju
>>>> --
>>>> To unsubscribe from this list: send the line "unsubscribe linux-omap" in
>>>> the body of a message to majordomo@vger.kernel.org
>>>> More majordomo info at  http://vger.kernel.org/majordomo-info.html
>>
>> --
>> cheers,
>> -roger
>>
>> Texas Instruments Finland Oy, Porkkalankatu 22, 00180 Helsinki.
>> Y-tunnus/Business ID: 0615521-4. Kotipaikka/Domicile: Helsinki

-- 
cheers,
-roger

Texas Instruments Finland Oy, Porkkalankatu 22, 00180 Helsinki.
Y-tunnus/Business ID: 0615521-4. Kotipaikka/Domicile: Helsinki

^ permalink raw reply	[flat|nested] 17+ messages in thread

* Re: 4.18.0-rc1-next-20180619 boot failed on beagle board x15
  2018-06-21  7:58       ` Roger Quadros
@ 2018-06-21 12:02         ` Rafael J. Wysocki
  2018-06-21 14:31         ` Samuel Morris
  1 sibling, 0 replies; 17+ messages in thread
From: Rafael J. Wysocki @ 2018-06-21 12:02 UTC (permalink / raw)
  To: Roger Quadros
  Cc: Samuel Morris, Tony Lindgren, Naresh Kamboju, Tejun Heo,
	Linux OMAP Mailing List, Linux-Next Mailing List, open list,
	Arnd Bergmann, Santosh Shilimkar, Olof Johansson, Tero Kristo,
	Rafael J. Wysocki, Linux PM

On Thu, Jun 21, 2018 at 9:58 AM, Roger Quadros <rogerq@ti.com> wrote:
> +Rafael
>
> On 20/06/18 18:30, Samuel Morris wrote:
>> On Wed, Jun 20, 2018 at 8:58 AM, Roger Quadros <rogerq@ti.com> wrote:
>>> Tony,
>>>
>>> On 20/06/18 13:29, Tony Lindgren wrote:
>>>> Hi,
>>>>
>>>> * Naresh Kamboju <naresh.kamboju@linaro.org> [180620 05:55]:
>>>>> Linux next (4.18.0-rc1-next-20180619) boot failed on beagle board x15.
>>>>
>>>> Bisect points to commit aece27a2f01b ("ata: ahci_platform: allow disabling of
>>>> hotplug to save power").
>>>>
>>>> Reverting the patch makes things work again. Any ideas what
>>>> might be going wrong here? Things clearly idle but then there
>>>> seems to be some register access with clocks disabled.
>>>
>>> The commit is doing this in probe.
>>>
>>> +       pm_runtime_set_active(dev);
>>>         pm_runtime_enable(dev);
>>> -       pm_runtime_get_sync(dev);
>>> +       pm_runtime_forbid(dev);
>>>
>>> On OMAP, the device is not guaranteed to be active at probe and so we can't
>>> say pm_runtime_set_active() and get rid of pm_runtime_get_sync().
>>
>> Okay, by calling set_active(), I'm preventing the rpm_resume from
>> completing that would normally happen in pm_runtime_forbid(). I assume
>> you mean that there are parent devices that need to be resumed before
>
> Actually, in the OMAP case, the AHCI controller device isn't active when probe is called.
> For other platforms this might not be the case. So we need to be careful here.
>
>> this device may be assumed active. I'm going to try removing the
>> set_active(), then move that clause to the end of
>> ahci_platform_init_host(). The pm_runtime_forbid() is effectively the
>> same as get_sync(), it just also sets the runtime_auto flag to false.
>> I don't think we should be saying the device is active until the host
>> is initialized, so that seems like a better, common place for the
>> pm_runtime init callbacks anyway. How does that sound?
>
> Device active and initialized are different things. If the device is powered up
> and can be accessed it is active, even if it is not yet initialized.
> I don't think we should club the two.
>
> Why do you need to call pm_runtime_set_active() at all in the probe sequence?
>
> Documentation for pm_runtime_set_active() says,
> "(it is only valid to use this function if 'power.runtime_error' is set
> or 'power.disable_depth' is greater than zero);"

I guess on some platforms the AHCI controller actually is active
initially and that's a matter of setting the initial status to reflect
the real situation.

If different things can happen on different platforms, there needs to
be a way to discover the initial state instead of making assumptions
on it.

Thanks,
Rafael

^ permalink raw reply	[flat|nested] 17+ messages in thread

* Re: 4.18.0-rc1-next-20180619 boot failed on beagle board x15
  2018-06-21  7:58       ` Roger Quadros
  2018-06-21 12:02         ` Rafael J. Wysocki
@ 2018-06-21 14:31         ` Samuel Morris
  2018-06-21 14:53           ` Roger Quadros
  1 sibling, 1 reply; 17+ messages in thread
From: Samuel Morris @ 2018-06-21 14:31 UTC (permalink / raw)
  To: Roger Quadros
  Cc: Tony Lindgren, Naresh Kamboju, Tejun Heo, linux-omap,
	Linux-Next Mailing List, open list, Arnd Bergmann, ssantosh,
	Olof Johansson, Tero Kristo, Rafael J. Wysocki

On Thu, Jun 21, 2018 at 3:58 AM, Roger Quadros <rogerq@ti.com> wrote:
> +Rafael
>
> On 20/06/18 18:30, Samuel Morris wrote:
>> On Wed, Jun 20, 2018 at 8:58 AM, Roger Quadros <rogerq@ti.com> wrote:
>>> Tony,
>>>
>>> On 20/06/18 13:29, Tony Lindgren wrote:
>>>> Hi,
>>>>
>>>> * Naresh Kamboju <naresh.kamboju@linaro.org> [180620 05:55]:
>>>>> Linux next (4.18.0-rc1-next-20180619) boot failed on beagle board x15.
>>>>
>>>> Bisect points to commit aece27a2f01b ("ata: ahci_platform: allow disabling of
>>>> hotplug to save power").
>>>>
>>>> Reverting the patch makes things work again. Any ideas what
>>>> might be going wrong here? Things clearly idle but then there
>>>> seems to be some register access with clocks disabled.
>>>
>>> The commit is doing this in probe.
>>>
>>> +       pm_runtime_set_active(dev);
>>>         pm_runtime_enable(dev);
>>> -       pm_runtime_get_sync(dev);
>>> +       pm_runtime_forbid(dev);
>>>
>>> On OMAP, the device is not guaranteed to be active at probe and so we can't
>>> say pm_runtime_set_active() and get rid of pm_runtime_get_sync().
>>
>> Okay, by calling set_active(), I'm preventing the rpm_resume from
>> completing that would normally happen in pm_runtime_forbid(). I assume
>> you mean that there are parent devices that need to be resumed before
>
> Actually, in the OMAP case, the AHCI controller device isn't active when probe is called.
> For other platforms this might not be the case. So we need to be careful here.
>
>> this device may be assumed active. I'm going to try removing the
>> set_active(), then move that clause to the end of
>> ahci_platform_init_host(). The pm_runtime_forbid() is effectively the
>> same as get_sync(), it just also sets the runtime_auto flag to false.
>> I don't think we should be saying the device is active until the host
>> is initialized, so that seems like a better, common place for the
>> pm_runtime init callbacks anyway. How does that sound?
>
> Device active and initialized are different things. If the device is powered up
> and can be accessed it is active, even if it is not yet initialized.
> I don't think we should club the two.
>
> Why do you need to call pm_runtime_set_active() at all in the probe sequence?

What I meant was, I'm not going to call pm_runtime_set_active()
anymore. The device status would be changed to RPM_ACTIVE at the end
of rpm_resume() in either runtime_get() or runtime_forbid(). It is
expected to be active after those have been called. Calling
set_active() before the runtime_forbid() before was to prevent the
device from actually calling the resume() callbacks, because the host
had not yet been initialized, and resuming the host at that point was
causing hangs. I was hoping that actually doing a runtime_resume()
after the host has been initialized would work, and make all parent
devices happy, though I am turning on a host that's already on and
initialized. It should be just about the same as what was there
before, just runtime_forbid() replacing get_sync(), and all that moved
to the ahci_platform_init_host() function.

Now that I think about it though, I want all the parent devices
resumed before I resume the current device, and probably before host
initialization, and I suspect that's why you need to call
runtime_get_sync(). Maybe I should just put the
ahci_platform_init_host() in the runtime_resume callback, and only run
it the first time that resume() is called. That would require
modifying all drivers that call ahci_platform_init_host() though. Or,
I could just not call ahci_platform_resume_host() on the first
runtime_resume(). That seems like the simplest solution. How about
that?

>
> Documentation for pm_runtime_set_active() says,
> "(it is only valid to use this function if 'power.runtime_error' is set
> or 'power.disable_depth' is greater than zero);"
>
>>
>>>
>>>
>>>>
>>>>> Full log of trace please refer this link,
>>>>> https://lkft.validation.linaro.org/scheduler/job/291535#L946
>>>>>
>>>>> Here is back trace,
>>>>> [    3.561640] omap_rng 48090000.rng: Random Number Generator ver. 20
>>>>> [    3.611949] brd: module loaded
>>>>> [    3.641298] loop: module loaded
>>>>> [    3.646435] at24 0-0050: 4096 byte 24c32 EEPROM, writable, 1 bytes/write
>>>>> [    3.658481] ------------[ cut here ]------------
>>>>> [    3.663196] WARNING: CPU: 0 PID: 1 at
>>>>> /srv/oe/build/tmp-rpb-glibc/work-shared/am57xx-evm/kernel-source/drivers/bus/omap_l3_noc.c:147
>>>>> l3_interrupt_handler+0x27c/0x39c
>>>>> [    3.678052] 44000000.ocp:L3 Custom Error: MASTER MPU TARGET L4_CFG
>>>>> (Read): Data Access in User mode during Functional access
>>>>> [    3.689330] Modules linked in:
>>>>> [    3.692426] CPU: 0 PID: 1 Comm: swapper/0 Not tainted
>>>>> 4.18.0-rc1-next-20180619 #1
>>>>> [    3.699952] Hardware name: Generic DRA74X (Flattened Device Tree)
>>>>> [    3.706099] [<c04156f0>] (unwind_backtrace) from [<c040eec4>]
>>>>> (show_stack+0x20/0x24)
>>>>> [    3.713897] [<c040eec4>] (show_stack) from [<c116a668>]
>>>>> (dump_stack+0xb0/0xdc)
>>>>> [    3.721171] [<c116a668>] (dump_stack) from [<c0450a14>] (__warn+0x118/0x130)
>>>>> [    3.728270] [<c0450a14>] (__warn) from [<c0450a84>]
>>>>> (warn_slowpath_fmt+0x58/0x74)
>>>>> [    3.735808] [<c0450a84>] (warn_slowpath_fmt) from [<c08be648>]
>>>>> (l3_interrupt_handler+0x27c/0x39c)
>>>>> [    3.744742] [<c08be648>] (l3_interrupt_handler) from [<c04cd270>]
>>>>> (__handle_irq_event_percpu+0x90/0x45c)
>>>>> [    3.754286] [<c04cd270>] (__handle_irq_event_percpu) from
>>>>> [<c04cd678>] (handle_irq_event_percpu+0x3c/0x8c)
>>>>> [    3.764001] [<c04cd678>] (handle_irq_event_percpu) from
>>>>> [<c04cd710>] (handle_irq_event+0x48/0x6c)
>>>>> [    3.772933] [<c04cd710>] (handle_irq_event) from [<c04d170c>]
>>>>> (handle_fasteoi_irq+0xb0/0x164)
>>>>> [    3.781513] [<c04d170c>] (handle_fasteoi_irq) from [<c04cc344>]
>>>>> (generic_handle_irq+0x34/0x44)
>>>>> [    3.790183] [<c04cc344>] (generic_handle_irq) from [<c04cca48>]
>>>>> (__handle_domain_irq+0x6c/0xc4)
>>>>> [    3.798941] [<c04cca48>] (__handle_domain_irq) from [<c0402528>]
>>>>> (gic_handle_irq+0x68/0xac)
>>>>> [    3.807347] [<c0402528>] (gic_handle_irq) from [<c04019bc>]
>>>>> (__irq_svc+0x5c/0x7c)
>>>>> [    3.814874] Exception stack(0xee179c70 to 0xee179cb8)
>>>>> [    3.819963] 9c60:                                     fc140000
>>>>> ee3e2f10 ee3a9810 00000000
>>>>> [    3.828189] 9c80: 00000000 ee3e2f10 fc140004 fc140000 ee3a9800
>>>>> ee3a9810 00000000 ee179cdc
>>>>> [    3.836414] 9ca0: ee179ce0 ee179cc0 c0c3b364 c0c3b2ec 60000013 ffffffff
>>>>> [    3.843079] [<c04019bc>] (__irq_svc) from [<c0c3b2ec>]
>>>>> (ahci_enable_ahci+0x24/0x70)
>>>>> [    3.850789] [<c0c3b2ec>] (ahci_enable_ahci) from [<c0c3b364>]
>>>>> (ahci_save_initial_config+0x2c/0x388)
>>>>> [    3.859894] [<c0c3b364>] (ahci_save_initial_config) from
>>>>> [<c0c3cbdc>] (ahci_platform_init_host+0x9c/0x4f4)
>>>>> [    3.869609] [<c0c3cbdc>] (ahci_platform_init_host) from
>>>>> [<c0c3c33c>] (ahci_probe+0xa0/0xc0)
>>>>> [    3.878018] [<c0c3c33c>] (ahci_probe) from [<c0ba40f0>]
>>>>> (platform_drv_probe+0x58/0xac)
>>>>> [    3.885989] [<c0ba40f0>] (platform_drv_probe) from [<c0ba1aec>]
>>>>> (driver_probe_device+0x25c/0x338)
>>>>> [    3.894915] [<c0ba1aec>] (driver_probe_device) from [<c0ba1cbc>]
>>>>> (__driver_attach+0xf4/0xf8)
>>>>> [    3.903411] [<c0ba1cbc>] (__driver_attach) from [<c0b9f9ac>]
>>>>> (bus_for_each_dev+0x78/0xc4)
>>>>> [    3.911643] [<c0b9f9ac>] (bus_for_each_dev) from [<c0ba13b8>]
>>>>> (driver_attach+0x2c/0x30)
>>>>> [    3.919696] [<c0ba13b8>] (driver_attach) from [<c0ba0cfc>]
>>>>> (bus_add_driver+0x114/0x220)
>>>>> [    3.927752] [<c0ba0cfc>] (bus_add_driver) from [<c0ba2e2c>]
>>>>> (driver_register+0x88/0x11c)
>>>>> [    3.935896] [<c0ba2e2c>] (driver_register) from [<c0ba4044>]
>>>>> (__platform_driver_register+0x50/0x58)
>>>>> [    3.945003] [<c0ba4044>] (__platform_driver_register) from
>>>>> [<c1aa9cf0>] (ahci_driver_init+0x24/0x28)
>>>>> [    3.954195] [<c1aa9cf0>] (ahci_driver_init) from [<c0403a8c>]
>>>>> (do_one_initcall+0x9c/0x434)
>>>>> [    3.962517] [<c0403a8c>] (do_one_initcall) from [<c1a01410>]
>>>>> (kernel_init_freeable+0x3b8/0x53c)
>>>>> [    3.971276] [<c1a01410>] (kernel_init_freeable) from [<c118551c>]
>>>>> (kernel_init+0x18/0x128)
>>>>> [    3.979597] [<c118551c>] (kernel_init) from [<c04010ac>]
>>>>> (ret_from_fork+0x14/0x28)
>>>>> [    3.987210] Exception stack(0xee179fb0 to 0xee179ff8)
>>>>> [    3.992298] 9fa0:                                     00000000
>>>>> 00000000 00000000 00000000
>>>>> [    4.000524] 9fc0: 00000000 00000000 00000000 00000000 00000000
>>>>> 00000000 00000000 00000000
>>>>> [    4.008750] 9fe0: 00000000 00000000 00000000 00000000 00000013 00000000
>>>>> [    4.015405] irq event stamp: 663130
>>>>> [    4.018926] hardirqs last  enabled at (663129): [<c118d9d4>]
>>>>> _raw_spin_unlock_irqrestore+0x40/0x6c
>>>>> [    4.027939] hardirqs last disabled at (663130): [<c04019ac>]
>>>>> __irq_svc+0x4c/0x7c
>>>>> [    4.035384] softirqs last  enabled at (661486): [<c0402c98>]
>>>>> __do_softirq+0x2a8/0x5e8
>>>>> [    4.043262] softirqs last disabled at (661449): [<c04594a4>]
>>>>> irq_exit+0x178/0x1a4
>>>>> [    4.050826] ---[ end trace 095abac70926448f ]---
>>>>>
>>>>> Best regards
>>>>> Naresh Kamboju
>>>>> --
>>>>> To unsubscribe from this list: send the line "unsubscribe linux-omap" in
>>>>> the body of a message to majordomo@vger.kernel.org
>>>>> More majordomo info at  http://vger.kernel.org/majordomo-info.html
>>>
>>> --
>>> cheers,
>>> -roger
>>>
>>> Texas Instruments Finland Oy, Porkkalankatu 22, 00180 Helsinki.
>>> Y-tunnus/Business ID: 0615521-4. Kotipaikka/Domicile: Helsinki
>
> --
> cheers,
> -roger
>
> Texas Instruments Finland Oy, Porkkalankatu 22, 00180 Helsinki.
> Y-tunnus/Business ID: 0615521-4. Kotipaikka/Domicile: Helsinki

^ permalink raw reply	[flat|nested] 17+ messages in thread

* Re: 4.18.0-rc1-next-20180619 boot failed on beagle board x15
  2018-06-21 14:31         ` Samuel Morris
@ 2018-06-21 14:53           ` Roger Quadros
  2018-07-02  9:32             ` Tony Lindgren
  0 siblings, 1 reply; 17+ messages in thread
From: Roger Quadros @ 2018-06-21 14:53 UTC (permalink / raw)
  To: Samuel Morris
  Cc: Tony Lindgren, Naresh Kamboju, Tejun Heo, linux-omap,
	Linux-Next Mailing List, open list, Arnd Bergmann, ssantosh,
	Olof Johansson, Tero Kristo, Rafael J. Wysocki

On 21/06/18 17:31, Samuel Morris wrote:
> On Thu, Jun 21, 2018 at 3:58 AM, Roger Quadros <rogerq@ti.com> wrote:
>> +Rafael
>>
>> On 20/06/18 18:30, Samuel Morris wrote:
>>> On Wed, Jun 20, 2018 at 8:58 AM, Roger Quadros <rogerq@ti.com> wrote:
>>>> Tony,
>>>>
>>>> On 20/06/18 13:29, Tony Lindgren wrote:
>>>>> Hi,
>>>>>
>>>>> * Naresh Kamboju <naresh.kamboju@linaro.org> [180620 05:55]:
>>>>>> Linux next (4.18.0-rc1-next-20180619) boot failed on beagle board x15.
>>>>>
>>>>> Bisect points to commit aece27a2f01b ("ata: ahci_platform: allow disabling of
>>>>> hotplug to save power").
>>>>>
>>>>> Reverting the patch makes things work again. Any ideas what
>>>>> might be going wrong here? Things clearly idle but then there
>>>>> seems to be some register access with clocks disabled.
>>>>
>>>> The commit is doing this in probe.
>>>>
>>>> +       pm_runtime_set_active(dev);
>>>>         pm_runtime_enable(dev);
>>>> -       pm_runtime_get_sync(dev);
>>>> +       pm_runtime_forbid(dev);
>>>>
>>>> On OMAP, the device is not guaranteed to be active at probe and so we can't
>>>> say pm_runtime_set_active() and get rid of pm_runtime_get_sync().
>>>
>>> Okay, by calling set_active(), I'm preventing the rpm_resume from
>>> completing that would normally happen in pm_runtime_forbid(). I assume
>>> you mean that there are parent devices that need to be resumed before
>>
>> Actually, in the OMAP case, the AHCI controller device isn't active when probe is called.
>> For other platforms this might not be the case. So we need to be careful here.
>>
>>> this device may be assumed active. I'm going to try removing the
>>> set_active(), then move that clause to the end of
>>> ahci_platform_init_host(). The pm_runtime_forbid() is effectively the
>>> same as get_sync(), it just also sets the runtime_auto flag to false.
>>> I don't think we should be saying the device is active until the host
>>> is initialized, so that seems like a better, common place for the
>>> pm_runtime init callbacks anyway. How does that sound?
>>
>> Device active and initialized are different things. If the device is powered up
>> and can be accessed it is active, even if it is not yet initialized.
>> I don't think we should club the two.
>>
>> Why do you need to call pm_runtime_set_active() at all in the probe sequence?
> 
> What I meant was, I'm not going to call pm_runtime_set_active()
> anymore. The device status would be changed to RPM_ACTIVE at the end
> of rpm_resume() in either runtime_get() or runtime_forbid(). It is
> expected to be active after those have been called. Calling
> set_active() before the runtime_forbid() before was to prevent the
> device from actually calling the resume() callbacks, because the host
> had not yet been initialized, and resuming the host at that point was
> causing hangs. I was hoping that actually doing a runtime_resume()
> after the host has been initialized would work, and make all parent
> devices happy, though I am turning on a host that's already on and
> initialized. It should be just about the same as what was there
> before, just runtime_forbid() replacing get_sync(), and all that moved
> to the ahci_platform_init_host() function.
> 
> Now that I think about it though, I want all the parent devices
> resumed before I resume the current device, and probably before host
> initialization, and I suspect that's why you need to call
> runtime_get_sync(). Maybe I should just put the
> ahci_platform_init_host() in the runtime_resume callback, and only run
> it the first time that resume() is called. That would require
> modifying all drivers that call ahci_platform_init_host() though. Or,
> I could just not call ahci_platform_resume_host() on the first
> runtime_resume(). That seems like the simplest solution. How about
> that?

This approach is quirky and might not work properly on all platforms.

As Rafael mentioned, we can't really assume weather the device was active
or not during probe. It might be active on some platforms and not on others.

We need a way for the driver to figure out what's the actual hardware state
and then execute a pm_runtime_get_sync() if it is not active during probe.

I don't know how though. DT flag maybe?

> 
>>
>> Documentation for pm_runtime_set_active() says,
>> "(it is only valid to use this function if 'power.runtime_error' is set
>> or 'power.disable_depth' is greater than zero);"
>>
>>>
>>>>
>>>>
>>>>>
>>>>>> Full log of trace please refer this link,
>>>>>> https://lkft.validation.linaro.org/scheduler/job/291535#L946
>>>>>>
>>>>>> Here is back trace,
>>>>>> [    3.561640] omap_rng 48090000.rng: Random Number Generator ver. 20
>>>>>> [    3.611949] brd: module loaded
>>>>>> [    3.641298] loop: module loaded
>>>>>> [    3.646435] at24 0-0050: 4096 byte 24c32 EEPROM, writable, 1 bytes/write
>>>>>> [    3.658481] ------------[ cut here ]------------
>>>>>> [    3.663196] WARNING: CPU: 0 PID: 1 at
>>>>>> /srv/oe/build/tmp-rpb-glibc/work-shared/am57xx-evm/kernel-source/drivers/bus/omap_l3_noc.c:147
>>>>>> l3_interrupt_handler+0x27c/0x39c
>>>>>> [    3.678052] 44000000.ocp:L3 Custom Error: MASTER MPU TARGET L4_CFG
>>>>>> (Read): Data Access in User mode during Functional access
>>>>>> [    3.689330] Modules linked in:
>>>>>> [    3.692426] CPU: 0 PID: 1 Comm: swapper/0 Not tainted
>>>>>> 4.18.0-rc1-next-20180619 #1
>>>>>> [    3.699952] Hardware name: Generic DRA74X (Flattened Device Tree)
>>>>>> [    3.706099] [<c04156f0>] (unwind_backtrace) from [<c040eec4>]
>>>>>> (show_stack+0x20/0x24)
>>>>>> [    3.713897] [<c040eec4>] (show_stack) from [<c116a668>]
>>>>>> (dump_stack+0xb0/0xdc)
>>>>>> [    3.721171] [<c116a668>] (dump_stack) from [<c0450a14>] (__warn+0x118/0x130)
>>>>>> [    3.728270] [<c0450a14>] (__warn) from [<c0450a84>]
>>>>>> (warn_slowpath_fmt+0x58/0x74)
>>>>>> [    3.735808] [<c0450a84>] (warn_slowpath_fmt) from [<c08be648>]
>>>>>> (l3_interrupt_handler+0x27c/0x39c)
>>>>>> [    3.744742] [<c08be648>] (l3_interrupt_handler) from [<c04cd270>]
>>>>>> (__handle_irq_event_percpu+0x90/0x45c)
>>>>>> [    3.754286] [<c04cd270>] (__handle_irq_event_percpu) from
>>>>>> [<c04cd678>] (handle_irq_event_percpu+0x3c/0x8c)
>>>>>> [    3.764001] [<c04cd678>] (handle_irq_event_percpu) from
>>>>>> [<c04cd710>] (handle_irq_event+0x48/0x6c)
>>>>>> [    3.772933] [<c04cd710>] (handle_irq_event) from [<c04d170c>]
>>>>>> (handle_fasteoi_irq+0xb0/0x164)
>>>>>> [    3.781513] [<c04d170c>] (handle_fasteoi_irq) from [<c04cc344>]
>>>>>> (generic_handle_irq+0x34/0x44)
>>>>>> [    3.790183] [<c04cc344>] (generic_handle_irq) from [<c04cca48>]
>>>>>> (__handle_domain_irq+0x6c/0xc4)
>>>>>> [    3.798941] [<c04cca48>] (__handle_domain_irq) from [<c0402528>]
>>>>>> (gic_handle_irq+0x68/0xac)
>>>>>> [    3.807347] [<c0402528>] (gic_handle_irq) from [<c04019bc>]
>>>>>> (__irq_svc+0x5c/0x7c)
>>>>>> [    3.814874] Exception stack(0xee179c70 to 0xee179cb8)
>>>>>> [    3.819963] 9c60:                                     fc140000
>>>>>> ee3e2f10 ee3a9810 00000000
>>>>>> [    3.828189] 9c80: 00000000 ee3e2f10 fc140004 fc140000 ee3a9800
>>>>>> ee3a9810 00000000 ee179cdc
>>>>>> [    3.836414] 9ca0: ee179ce0 ee179cc0 c0c3b364 c0c3b2ec 60000013 ffffffff
>>>>>> [    3.843079] [<c04019bc>] (__irq_svc) from [<c0c3b2ec>]
>>>>>> (ahci_enable_ahci+0x24/0x70)
>>>>>> [    3.850789] [<c0c3b2ec>] (ahci_enable_ahci) from [<c0c3b364>]
>>>>>> (ahci_save_initial_config+0x2c/0x388)
>>>>>> [    3.859894] [<c0c3b364>] (ahci_save_initial_config) from
>>>>>> [<c0c3cbdc>] (ahci_platform_init_host+0x9c/0x4f4)
>>>>>> [    3.869609] [<c0c3cbdc>] (ahci_platform_init_host) from
>>>>>> [<c0c3c33c>] (ahci_probe+0xa0/0xc0)
>>>>>> [    3.878018] [<c0c3c33c>] (ahci_probe) from [<c0ba40f0>]
>>>>>> (platform_drv_probe+0x58/0xac)
>>>>>> [    3.885989] [<c0ba40f0>] (platform_drv_probe) from [<c0ba1aec>]
>>>>>> (driver_probe_device+0x25c/0x338)
>>>>>> [    3.894915] [<c0ba1aec>] (driver_probe_device) from [<c0ba1cbc>]
>>>>>> (__driver_attach+0xf4/0xf8)
>>>>>> [    3.903411] [<c0ba1cbc>] (__driver_attach) from [<c0b9f9ac>]
>>>>>> (bus_for_each_dev+0x78/0xc4)
>>>>>> [    3.911643] [<c0b9f9ac>] (bus_for_each_dev) from [<c0ba13b8>]
>>>>>> (driver_attach+0x2c/0x30)
>>>>>> [    3.919696] [<c0ba13b8>] (driver_attach) from [<c0ba0cfc>]
>>>>>> (bus_add_driver+0x114/0x220)
>>>>>> [    3.927752] [<c0ba0cfc>] (bus_add_driver) from [<c0ba2e2c>]
>>>>>> (driver_register+0x88/0x11c)
>>>>>> [    3.935896] [<c0ba2e2c>] (driver_register) from [<c0ba4044>]
>>>>>> (__platform_driver_register+0x50/0x58)
>>>>>> [    3.945003] [<c0ba4044>] (__platform_driver_register) from
>>>>>> [<c1aa9cf0>] (ahci_driver_init+0x24/0x28)
>>>>>> [    3.954195] [<c1aa9cf0>] (ahci_driver_init) from [<c0403a8c>]
>>>>>> (do_one_initcall+0x9c/0x434)
>>>>>> [    3.962517] [<c0403a8c>] (do_one_initcall) from [<c1a01410>]
>>>>>> (kernel_init_freeable+0x3b8/0x53c)
>>>>>> [    3.971276] [<c1a01410>] (kernel_init_freeable) from [<c118551c>]
>>>>>> (kernel_init+0x18/0x128)
>>>>>> [    3.979597] [<c118551c>] (kernel_init) from [<c04010ac>]
>>>>>> (ret_from_fork+0x14/0x28)
>>>>>> [    3.987210] Exception stack(0xee179fb0 to 0xee179ff8)
>>>>>> [    3.992298] 9fa0:                                     00000000
>>>>>> 00000000 00000000 00000000
>>>>>> [    4.000524] 9fc0: 00000000 00000000 00000000 00000000 00000000
>>>>>> 00000000 00000000 00000000
>>>>>> [    4.008750] 9fe0: 00000000 00000000 00000000 00000000 00000013 00000000
>>>>>> [    4.015405] irq event stamp: 663130
>>>>>> [    4.018926] hardirqs last  enabled at (663129): [<c118d9d4>]
>>>>>> _raw_spin_unlock_irqrestore+0x40/0x6c
>>>>>> [    4.027939] hardirqs last disabled at (663130): [<c04019ac>]
>>>>>> __irq_svc+0x4c/0x7c
>>>>>> [    4.035384] softirqs last  enabled at (661486): [<c0402c98>]
>>>>>> __do_softirq+0x2a8/0x5e8
>>>>>> [    4.043262] softirqs last disabled at (661449): [<c04594a4>]
>>>>>> irq_exit+0x178/0x1a4
>>>>>> [    4.050826] ---[ end trace 095abac70926448f ]---
>>>>>>
>>>>>> Best regards
>>>>>> Naresh Kamboju
>>>>>> --
>>>>>> To unsubscribe from this list: send the line "unsubscribe linux-omap" in
>>>>>> the body of a message to majordomo@vger.kernel.org
>>>>>> More majordomo info at  http://vger.kernel.org/majordomo-info.html
>>>>
>>>> --
>>>> cheers,
>>>> -roger
>>>>
>>>> Texas Instruments Finland Oy, Porkkalankatu 22, 00180 Helsinki.
>>>> Y-tunnus/Business ID: 0615521-4. Kotipaikka/Domicile: Helsinki
>>
>> --
>> cheers,
>> -roger
>>
>> Texas Instruments Finland Oy, Porkkalankatu 22, 00180 Helsinki.
>> Y-tunnus/Business ID: 0615521-4. Kotipaikka/Domicile: Helsinki

-- 
cheers,
-roger

Texas Instruments Finland Oy, Porkkalankatu 22, 00180 Helsinki.
Y-tunnus/Business ID: 0615521-4. Kotipaikka/Domicile: Helsinki

^ permalink raw reply	[flat|nested] 17+ messages in thread

* Re: 4.18.0-rc1-next-20180619 boot failed on beagle board x15
  2018-06-21 14:53           ` Roger Quadros
@ 2018-07-02  9:32             ` Tony Lindgren
  2018-07-02 13:31               ` Samuel Morris
  0 siblings, 1 reply; 17+ messages in thread
From: Tony Lindgren @ 2018-07-02  9:32 UTC (permalink / raw)
  To: Roger Quadros
  Cc: Samuel Morris, Naresh Kamboju, Tejun Heo, linux-omap,
	Linux-Next Mailing List, open list, Arnd Bergmann, ssantosh,
	Olof Johansson, Tero Kristo, Rafael J. Wysocki

Hi,

* Roger Quadros <rogerq@ti.com> [180621 14:56]:
> On 21/06/18 17:31, Samuel Morris wrote:
> > On Thu, Jun 21, 2018 at 3:58 AM, Roger Quadros <rogerq@ti.com> wrote:
> >> +Rafael
> >>
> >> On 20/06/18 18:30, Samuel Morris wrote:
> >>> On Wed, Jun 20, 2018 at 8:58 AM, Roger Quadros <rogerq@ti.com> wrote:
> >>>> Tony,
> >>>>
> >>>> On 20/06/18 13:29, Tony Lindgren wrote:
> >>>>> Hi,
> >>>>>
> >>>>> * Naresh Kamboju <naresh.kamboju@linaro.org> [180620 05:55]:
> >>>>>> Linux next (4.18.0-rc1-next-20180619) boot failed on beagle board x15.
> >>>>>
> >>>>> Bisect points to commit aece27a2f01b ("ata: ahci_platform: allow disabling of
> >>>>> hotplug to save power").
> >>>>>
> >>>>> Reverting the patch makes things work again. Any ideas what
> >>>>> might be going wrong here? Things clearly idle but then there
> >>>>> seems to be some register access with clocks disabled.

So this issue is still happening as of next-20180702. Can you guys
please revert the commit above while working on a better solution?

Regards,

Tony

^ permalink raw reply	[flat|nested] 17+ messages in thread

* Re: 4.18.0-rc1-next-20180619 boot failed on beagle board x15
  2018-07-02  9:32             ` Tony Lindgren
@ 2018-07-02 13:31               ` Samuel Morris
  2018-07-17  6:08                 ` Tony Lindgren
  2018-08-05 23:30                 ` Stephen Rothwell
  0 siblings, 2 replies; 17+ messages in thread
From: Samuel Morris @ 2018-07-02 13:31 UTC (permalink / raw)
  To: Tony Lindgren
  Cc: Roger Quadros, Naresh Kamboju, Tejun Heo, linux-omap,
	Linux-Next Mailing List, open list, Arnd Bergmann, ssantosh,
	Olof Johansson, Tero Kristo, Rafael J. Wysocki

On Mon, Jul 2, 2018 at 5:32 AM, Tony Lindgren <tony@atomide.com> wrote:
> Hi,
>
> * Roger Quadros <rogerq@ti.com> [180621 14:56]:
>> On 21/06/18 17:31, Samuel Morris wrote:
>> > On Thu, Jun 21, 2018 at 3:58 AM, Roger Quadros <rogerq@ti.com> wrote:
>> >> +Rafael
>> >>
>> >> On 20/06/18 18:30, Samuel Morris wrote:
>> >>> On Wed, Jun 20, 2018 at 8:58 AM, Roger Quadros <rogerq@ti.com> wrote:
>> >>>> Tony,
>> >>>>
>> >>>> On 20/06/18 13:29, Tony Lindgren wrote:
>> >>>>> Hi,
>> >>>>>
>> >>>>> * Naresh Kamboju <naresh.kamboju@linaro.org> [180620 05:55]:
>> >>>>>> Linux next (4.18.0-rc1-next-20180619) boot failed on beagle board x15.
>> >>>>>
>> >>>>> Bisect points to commit aece27a2f01b ("ata: ahci_platform: allow disabling of
>> >>>>> hotplug to save power").
>> >>>>>
>> >>>>> Reverting the patch makes things work again. Any ideas what
>> >>>>> might be going wrong here? Things clearly idle but then there
>> >>>>> seems to be some register access with clocks disabled.
>
> So this issue is still happening as of next-20180702. Can you guys
> please revert the commit above while working on a better solution?

That's fine with me. I'm not very familiar with the process here, does
this require anything on my end? And would that require the
accompanying patch to be reverted: "ata: ahci: rpm_put port on
port_stop to match rpm_get in port_start"? There shouldn't be any
problem leaving that one in, but I just want to know before submitting
my next patch set.

>
> Regards,
>
> Tony

^ permalink raw reply	[flat|nested] 17+ messages in thread

* Re: 4.18.0-rc1-next-20180619 boot failed on beagle board x15
  2018-07-02 13:31               ` Samuel Morris
@ 2018-07-17  6:08                 ` Tony Lindgren
  2018-07-17  8:00                   ` Stephen Rothwell
  2018-08-05 23:30                 ` Stephen Rothwell
  1 sibling, 1 reply; 17+ messages in thread
From: Tony Lindgren @ 2018-07-17  6:08 UTC (permalink / raw)
  To: Tejun Heo, Stephen Rothwell
  Cc: Roger Quadros, Naresh Kamboju, Tejun Heo, linux-omap,
	Linux-Next Mailing List, open list, Arnd Bergmann, ssantosh,
	Olof Johansson, Tero Kristo, Rafael J. Wysocki, Samuel Morris

Tejun & Stephen,

The following regression is still pending in next, see below.

* Samuel Morris <samorris@lexmark.com> [180702 13:35]:
> On Mon, Jul 2, 2018 at 5:32 AM, Tony Lindgren <tony@atomide.com> wrote:
> > Hi,
> >
> > * Roger Quadros <rogerq@ti.com> [180621 14:56]:
> >> On 21/06/18 17:31, Samuel Morris wrote:
> >> > On Thu, Jun 21, 2018 at 3:58 AM, Roger Quadros <rogerq@ti.com> wrote:
> >> >> +Rafael
> >> >>
> >> >> On 20/06/18 18:30, Samuel Morris wrote:
> >> >>> On Wed, Jun 20, 2018 at 8:58 AM, Roger Quadros <rogerq@ti.com> wrote:
> >> >>>> Tony,
> >> >>>>
> >> >>>> On 20/06/18 13:29, Tony Lindgren wrote:
> >> >>>>> Hi,
> >> >>>>>
> >> >>>>> * Naresh Kamboju <naresh.kamboju@linaro.org> [180620 05:55]:
> >> >>>>>> Linux next (4.18.0-rc1-next-20180619) boot failed on beagle board x15.
> >> >>>>>
> >> >>>>> Bisect points to commit aece27a2f01b ("ata: ahci_platform: allow disabling of
> >> >>>>> hotplug to save power").
> >> >>>>>
> >> >>>>> Reverting the patch makes things work again. Any ideas what
> >> >>>>> might be going wrong here? Things clearly idle but then there
> >> >>>>> seems to be some register access with clocks disabled.
> >
> > So this issue is still happening as of next-20180702. Can you guys
> > please revert the commit above while working on a better solution?
> 
> That's fine with me. I'm not very familiar with the process here, does
> this require anything on my end? And would that require the
> accompanying patch to be reverted: "ata: ahci: rpm_put port on
> port_stop to match rpm_get in port_start"? There shouldn't be any
> problem leaving that one in, but I just want to know before submitting
> my next patch set.

Well usually the maintainer just reverts the regression causing
patch in the related branch and that's it.

Stephen, can you please revert in next until we hear back from
Tejun?

Regards,

Tony

^ permalink raw reply	[flat|nested] 17+ messages in thread

* Re: 4.18.0-rc1-next-20180619 boot failed on beagle board x15
  2018-07-17  6:08                 ` Tony Lindgren
@ 2018-07-17  8:00                   ` Stephen Rothwell
  2018-07-17  8:03                     ` Tony Lindgren
  0 siblings, 1 reply; 17+ messages in thread
From: Stephen Rothwell @ 2018-07-17  8:00 UTC (permalink / raw)
  To: Tony Lindgren
  Cc: Tejun Heo, Roger Quadros, Naresh Kamboju, linux-omap,
	Linux-Next Mailing List, open list, Arnd Bergmann, ssantosh,
	Olof Johansson, Tero Kristo, Rafael J. Wysocki, Samuel Morris

[-- Attachment #1: Type: text/plain, Size: 2228 bytes --]

Hi Tony,

On Mon, 16 Jul 2018 23:08:08 -0700 Tony Lindgren <tony@atomide.com> wrote:
>
> The following regression is still pending in next, see below.
> 
> * Samuel Morris <samorris@lexmark.com> [180702 13:35]:
> > On Mon, Jul 2, 2018 at 5:32 AM, Tony Lindgren <tony@atomide.com> wrote:  
> > > Hi,
> > >
> > > * Roger Quadros <rogerq@ti.com> [180621 14:56]:  
> > >> On 21/06/18 17:31, Samuel Morris wrote:  
> > >> > On Thu, Jun 21, 2018 at 3:58 AM, Roger Quadros <rogerq@ti.com> wrote:  
> > >> >> +Rafael
> > >> >>
> > >> >> On 20/06/18 18:30, Samuel Morris wrote:  
> > >> >>> On Wed, Jun 20, 2018 at 8:58 AM, Roger Quadros <rogerq@ti.com> wrote:  
> > >> >>>> Tony,
> > >> >>>>
> > >> >>>> On 20/06/18 13:29, Tony Lindgren wrote:  
> > >> >>>>> Hi,
> > >> >>>>>
> > >> >>>>> * Naresh Kamboju <naresh.kamboju@linaro.org> [180620 05:55]:  
> > >> >>>>>> Linux next (4.18.0-rc1-next-20180619) boot failed on beagle board x15.  
> > >> >>>>>
> > >> >>>>> Bisect points to commit aece27a2f01b ("ata: ahci_platform: allow disabling of
> > >> >>>>> hotplug to save power").
> > >> >>>>>
> > >> >>>>> Reverting the patch makes things work again. Any ideas what
> > >> >>>>> might be going wrong here? Things clearly idle but then there
> > >> >>>>> seems to be some register access with clocks disabled.  
> > >
> > > So this issue is still happening as of next-20180702. Can you guys
> > > please revert the commit above while working on a better solution?  
> > 
> > That's fine with me. I'm not very familiar with the process here, does
> > this require anything on my end? And would that require the
> > accompanying patch to be reverted: "ata: ahci: rpm_put port on
> > port_stop to match rpm_get in port_start"? There shouldn't be any
> > problem leaving that one in, but I just want to know before submitting
> > my next patch set.  
> 
> Well usually the maintainer just reverts the regression causing
> patch in the related branch and that's it.
> 
> Stephen, can you please revert in next until we hear back from
> Tejun?

OK, I have reverted that commit from today.  Please let me know when the
problem is fixed in the libata tree ...

-- 
Cheers,
Stephen Rothwell

[-- Attachment #2: OpenPGP digital signature --]
[-- Type: application/pgp-signature, Size: 488 bytes --]

^ permalink raw reply	[flat|nested] 17+ messages in thread

* Re: 4.18.0-rc1-next-20180619 boot failed on beagle board x15
  2018-07-17  8:00                   ` Stephen Rothwell
@ 2018-07-17  8:03                     ` Tony Lindgren
  2018-07-18  5:05                       ` Keerthy
  0 siblings, 1 reply; 17+ messages in thread
From: Tony Lindgren @ 2018-07-17  8:03 UTC (permalink / raw)
  To: Stephen Rothwell
  Cc: Tejun Heo, Roger Quadros, Naresh Kamboju, linux-omap,
	Linux-Next Mailing List, open list, Arnd Bergmann, ssantosh,
	Olof Johansson, Tero Kristo, Rafael J. Wysocki, Samuel Morris

* Stephen Rothwell <sfr@canb.auug.org.au> [180717 08:03]:
> Hi Tony,
> 
> On Mon, 16 Jul 2018 23:08:08 -0700 Tony Lindgren <tony@atomide.com> wrote:
> >
> > The following regression is still pending in next, see below.
> > 
> > * Samuel Morris <samorris@lexmark.com> [180702 13:35]:
> > > On Mon, Jul 2, 2018 at 5:32 AM, Tony Lindgren <tony@atomide.com> wrote:  
> > > > Hi,
> > > >
> > > > * Roger Quadros <rogerq@ti.com> [180621 14:56]:  
> > > >> On 21/06/18 17:31, Samuel Morris wrote:  
> > > >> > On Thu, Jun 21, 2018 at 3:58 AM, Roger Quadros <rogerq@ti.com> wrote:  
> > > >> >> +Rafael
> > > >> >>
> > > >> >> On 20/06/18 18:30, Samuel Morris wrote:  
> > > >> >>> On Wed, Jun 20, 2018 at 8:58 AM, Roger Quadros <rogerq@ti.com> wrote:  
> > > >> >>>> Tony,
> > > >> >>>>
> > > >> >>>> On 20/06/18 13:29, Tony Lindgren wrote:  
> > > >> >>>>> Hi,
> > > >> >>>>>
> > > >> >>>>> * Naresh Kamboju <naresh.kamboju@linaro.org> [180620 05:55]:  
> > > >> >>>>>> Linux next (4.18.0-rc1-next-20180619) boot failed on beagle board x15.  
> > > >> >>>>>
> > > >> >>>>> Bisect points to commit aece27a2f01b ("ata: ahci_platform: allow disabling of
> > > >> >>>>> hotplug to save power").
> > > >> >>>>>
> > > >> >>>>> Reverting the patch makes things work again. Any ideas what
> > > >> >>>>> might be going wrong here? Things clearly idle but then there
> > > >> >>>>> seems to be some register access with clocks disabled.  
> > > >
> > > > So this issue is still happening as of next-20180702. Can you guys
> > > > please revert the commit above while working on a better solution?  
> > > 
> > > That's fine with me. I'm not very familiar with the process here, does
> > > this require anything on my end? And would that require the
> > > accompanying patch to be reverted: "ata: ahci: rpm_put port on
> > > port_stop to match rpm_get in port_start"? There shouldn't be any
> > > problem leaving that one in, but I just want to know before submitting
> > > my next patch set.  
> > 
> > Well usually the maintainer just reverts the regression causing
> > patch in the related branch and that's it.
> > 
> > Stephen, can you please revert in next until we hear back from
> > Tejun?
> 
> OK, I have reverted that commit from today.  Please let me know when the
> problem is fixed in the libata tree ...

Thanks!

Tony



^ permalink raw reply	[flat|nested] 17+ messages in thread

* Re: 4.18.0-rc1-next-20180619 boot failed on beagle board x15
  2018-07-17  8:03                     ` Tony Lindgren
@ 2018-07-18  5:05                       ` Keerthy
  0 siblings, 0 replies; 17+ messages in thread
From: Keerthy @ 2018-07-18  5:05 UTC (permalink / raw)
  To: Tony Lindgren, Stephen Rothwell
  Cc: Tejun Heo, Roger Quadros, Naresh Kamboju, linux-omap,
	Linux-Next Mailing List, open list, Arnd Bergmann, ssantosh,
	Olof Johansson, Tero Kristo, Rafael J. Wysocki, Samuel Morris



On Tuesday 17 July 2018 01:33 PM, Tony Lindgren wrote:
> * Stephen Rothwell <sfr@canb.auug.org.au> [180717 08:03]:
>> Hi Tony,
>>
>> On Mon, 16 Jul 2018 23:08:08 -0700 Tony Lindgren <tony@atomide.com> wrote:
>>>
>>> The following regression is still pending in next, see below.
>>>
>>> * Samuel Morris <samorris@lexmark.com> [180702 13:35]:
>>>> On Mon, Jul 2, 2018 at 5:32 AM, Tony Lindgren <tony@atomide.com> wrote:  
>>>>> Hi,
>>>>>
>>>>> * Roger Quadros <rogerq@ti.com> [180621 14:56]:  
>>>>>> On 21/06/18 17:31, Samuel Morris wrote:  
>>>>>>> On Thu, Jun 21, 2018 at 3:58 AM, Roger Quadros <rogerq@ti.com> wrote:  
>>>>>>>> +Rafael
>>>>>>>>
>>>>>>>> On 20/06/18 18:30, Samuel Morris wrote:  
>>>>>>>>> On Wed, Jun 20, 2018 at 8:58 AM, Roger Quadros <rogerq@ti.com> wrote:  
>>>>>>>>>> Tony,
>>>>>>>>>>
>>>>>>>>>> On 20/06/18 13:29, Tony Lindgren wrote:  
>>>>>>>>>>> Hi,
>>>>>>>>>>>
>>>>>>>>>>> * Naresh Kamboju <naresh.kamboju@linaro.org> [180620 05:55]:  
>>>>>>>>>>>> Linux next (4.18.0-rc1-next-20180619) boot failed on beagle board x15.  
>>>>>>>>>>>
>>>>>>>>>>> Bisect points to commit aece27a2f01b ("ata: ahci_platform: allow disabling of
>>>>>>>>>>> hotplug to save power").
>>>>>>>>>>>
>>>>>>>>>>> Reverting the patch makes things work again. Any ideas what
>>>>>>>>>>> might be going wrong here? Things clearly idle but then there
>>>>>>>>>>> seems to be some register access with clocks disabled.  
>>>>>
>>>>> So this issue is still happening as of next-20180702. Can you guys
>>>>> please revert the commit above while working on a better solution?  
>>>>
>>>> That's fine with me. I'm not very familiar with the process here, does
>>>> this require anything on my end? And would that require the
>>>> accompanying patch to be reverted: "ata: ahci: rpm_put port on
>>>> port_stop to match rpm_get in port_start"? There shouldn't be any
>>>> problem leaving that one in, but I just want to know before submitting
>>>> my next patch set.  
>>>
>>> Well usually the maintainer just reverts the regression causing
>>> patch in the related branch and that's it.
>>>
>>> Stephen, can you please revert in next until we hear back from
>>> Tejun?
>>
>> OK, I have reverted that commit from today.  Please let me know when the
>> problem is fixed in the libata tree ...

Hi Stephen,

Thanks for the revert.

commit 1dcbe5f2c615337cb7d4e13fab198ab716180733
Author: Stephen Rothwell <sfr@canb.auug.org.au>
Date:   Tue Jul 17 19:02:59 2018 +1000

With the above top commit i confirm that BEAGLE-X15, AM572X-IDK,
AM574X-IDK, DRA7, DRA72 TI platforms booted to prompt.

Regards,
Keerthy

> 
> Thanks!
> 
> Tony
> 
> 
> --
> To unsubscribe from this list: send the line "unsubscribe linux-omap" in
> the body of a message to majordomo@vger.kernel.org
> More majordomo info at  http://vger.kernel.org/majordomo-info.html
> 

^ permalink raw reply	[flat|nested] 17+ messages in thread

* Re: 4.18.0-rc1-next-20180619 boot failed on beagle board x15
  2018-07-02 13:31               ` Samuel Morris
  2018-07-17  6:08                 ` Tony Lindgren
@ 2018-08-05 23:30                 ` Stephen Rothwell
  2018-08-06 17:24                   ` Tejun Heo
  1 sibling, 1 reply; 17+ messages in thread
From: Stephen Rothwell @ 2018-08-05 23:30 UTC (permalink / raw)
  To: Samuel Morris, Tejun Heo
  Cc: Tony Lindgren, Roger Quadros, Naresh Kamboju, linux-omap,
	Linux-Next Mailing List, open list, Arnd Bergmann, ssantosh,
	Olof Johansson, Tero Kristo, Rafael J. Wysocki

[-- Attachment #1: Type: text/plain, Size: 1848 bytes --]

Hi Samuel,

On Mon, 2 Jul 2018 09:31:56 -0400 Samuel Morris <samorris@lexmark.com> wrote:
>
> On Mon, Jul 2, 2018 at 5:32 AM, Tony Lindgren <tony@atomide.com> wrote:
> > Hi,
> >
> > * Roger Quadros <rogerq@ti.com> [180621 14:56]:  
> >> On 21/06/18 17:31, Samuel Morris wrote:  
> >> > On Thu, Jun 21, 2018 at 3:58 AM, Roger Quadros <rogerq@ti.com> wrote:  
> >> >> +Rafael
> >> >>
> >> >> On 20/06/18 18:30, Samuel Morris wrote:  
> >> >>> On Wed, Jun 20, 2018 at 8:58 AM, Roger Quadros <rogerq@ti.com> wrote:  
> >> >>>> Tony,
> >> >>>>
> >> >>>> On 20/06/18 13:29, Tony Lindgren wrote:  
> >> >>>>> Hi,
> >> >>>>>
> >> >>>>> * Naresh Kamboju <naresh.kamboju@linaro.org> [180620 05:55]:  
> >> >>>>>> Linux next (4.18.0-rc1-next-20180619) boot failed on beagle board x15.  
> >> >>>>>
> >> >>>>> Bisect points to commit aece27a2f01b ("ata: ahci_platform: allow disabling of
> >> >>>>> hotplug to save power").
> >> >>>>>
> >> >>>>> Reverting the patch makes things work again. Any ideas what
> >> >>>>> might be going wrong here? Things clearly idle but then there
> >> >>>>> seems to be some register access with clocks disabled.  
> >
> > So this issue is still happening as of next-20180702. Can you guys
> > please revert the commit above while working on a better solution?  
> 
> That's fine with me. I'm not very familiar with the process here, does
> this require anything on my end? And would that require the
> accompanying patch to be reverted: "ata: ahci: rpm_put port on
> port_stop to match rpm_get in port_start"? There shouldn't be any
> problem leaving that one in, but I just want to know before submitting
> my next patch set.

I am still reverting that commit from linux-next myself.  Has this
issue been fixed some other way, or is it still ongoing?

-- 
Cheers,
Stephen Rothwell

[-- Attachment #2: OpenPGP digital signature --]
[-- Type: application/pgp-signature, Size: 488 bytes --]

^ permalink raw reply	[flat|nested] 17+ messages in thread

* Re: 4.18.0-rc1-next-20180619 boot failed on beagle board x15
  2018-08-05 23:30                 ` Stephen Rothwell
@ 2018-08-06 17:24                   ` Tejun Heo
  0 siblings, 0 replies; 17+ messages in thread
From: Tejun Heo @ 2018-08-06 17:24 UTC (permalink / raw)
  To: Stephen Rothwell
  Cc: Samuel Morris, Tony Lindgren, Roger Quadros, Naresh Kamboju,
	linux-omap, Linux-Next Mailing List, open list, Arnd Bergmann,
	ssantosh, Olof Johansson, Tero Kristo, Rafael J. Wysocki

Hello,

On Mon, Aug 06, 2018 at 09:30:15AM +1000, Stephen Rothwell wrote:
> > > So this issue is still happening as of next-20180702. Can you guys
> > > please revert the commit above while working on a better solution?  
> > 
> > That's fine with me. I'm not very familiar with the process here, does
> > this require anything on my end? And would that require the
> > accompanying patch to be reverted: "ata: ahci: rpm_put port on
> > port_stop to match rpm_get in port_start"? There shouldn't be any
> > problem leaving that one in, but I just want to know before submitting
> > my next patch set.
> 
> I am still reverting that commit from linux-next myself.  Has this
> issue been fixed some other way, or is it still ongoing?

Hasn't been resolved yet.  Reverting from libata/for-4.19.

Thanks.

-- 
tejun

^ permalink raw reply	[flat|nested] 17+ messages in thread

end of thread, other threads:[~2018-08-06 17:24 UTC | newest]

Thread overview: 17+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2018-06-20  5:52 4.18.0-rc1-next-20180619 boot failed on beagle board x15 Naresh Kamboju
2018-06-20 10:29 ` Tony Lindgren
2018-06-20 12:57   ` Samuel Morris
2018-06-20 12:58   ` Roger Quadros
2018-06-20 15:30     ` Samuel Morris
2018-06-21  7:58       ` Roger Quadros
2018-06-21 12:02         ` Rafael J. Wysocki
2018-06-21 14:31         ` Samuel Morris
2018-06-21 14:53           ` Roger Quadros
2018-07-02  9:32             ` Tony Lindgren
2018-07-02 13:31               ` Samuel Morris
2018-07-17  6:08                 ` Tony Lindgren
2018-07-17  8:00                   ` Stephen Rothwell
2018-07-17  8:03                     ` Tony Lindgren
2018-07-18  5:05                       ` Keerthy
2018-08-05 23:30                 ` Stephen Rothwell
2018-08-06 17:24                   ` Tejun Heo

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).