All of lore.kernel.org
 help / color / mirror / Atom feed
* [cip-dev] 4.19.106-cip21-rt8 problems on de0-nano
@ 2020-03-03  9:15 Pavel Machek
  2020-03-03  9:23 ` Pavel Machek
  0 siblings, 1 reply; 12+ messages in thread
From: Pavel Machek @ 2020-03-03  9:15 UTC (permalink / raw)
  To: Chris.Paterson2, cip-dev


[-- Attachment #1.1: Type: text/plain, Size: 1113 bytes --]

Hi!

I pushed candidate for -cip-rt, but it seems to fail on de0-nano
board. Code under testing is at:

https://gitlab.com/cip-project/cip-kernel/linux-cip/tree/ci/pavel/linux-cip

So far testing fails, I did few steps to narrow it down.

? 0f2960c75dd68d339f0aff2935f51652b5625fbf 4.19.106-rt44
? 68541faf398c02bfa6a65ffcf4484e71b8e0306d v4.19.103-cip20.
ok  122663369 c7691f9c497d0f8393724875a8875b61daa9fc29 v4.19.100-rt41.
? b7f7a3d15716de6d78b6d4807e23fa46de106c2f v4.19.103-rt42.
ok  122667425 51184ac81b2de92545a3b9d330d4d53a176e0976 v4.19.104-rt43.
bad 122671461 f019fe97731d222ee59ac8e54a62753e9c69ef31 v4.19.106-rt44.
bad 122657523 881b771ab11616e2aab6e914d950467f7c8914bd v4.19.106-cip21.

So it seems failure was introduced between v4.19.104-rt43 and
v4.19.106-rt44. Unfortunately, I don't understand the logs enough to
know what the real failure is. If someone can help...

I'll proceed to narrow it down.

Best regards,
									Pavel
-- 
(english) http://www.livejournal.com/~pavelmachek
(cesky, pictures) http://atrey.karlin.mff.cuni.cz/~pavel/picture/horses/blog.html

[-- Attachment #1.2: signature.asc --]
[-- Type: application/pgp-signature, Size: 195 bytes --]

[-- Attachment #2: Type: text/plain, Size: 154 bytes --]

_______________________________________________
cip-dev mailing list
cip-dev@lists.cip-project.org
https://lists.cip-project.org/mailman/listinfo/cip-dev

^ permalink raw reply	[flat|nested] 12+ messages in thread

* Re: [cip-dev] 4.19.106-cip21-rt8 problems on de0-nano
  2020-03-03  9:15 [cip-dev] 4.19.106-cip21-rt8 problems on de0-nano Pavel Machek
@ 2020-03-03  9:23 ` Pavel Machek
  2020-03-03 17:10   ` Pavel Machek
  0 siblings, 1 reply; 12+ messages in thread
From: Pavel Machek @ 2020-03-03  9:23 UTC (permalink / raw)
  To: Pavel Machek; +Cc: cip-dev


[-- Attachment #1.1: Type: text/plain, Size: 562 bytes --]

On Tue 2020-03-03 10:15:56, Pavel Machek wrote:
> Hi!
> 
> I pushed candidate for -cip-rt, but it seems to fail on de0-nano
> board. Code under testing is at:
> 
> https://gitlab.com/cip-project/cip-kernel/linux-cip/tree/ci/pavel/linux-cip

It is pipeline

https://gitlab.com/cip-project/cip-kernel/linux-cip/pipelines/122762401

I'll reuse the branch for more testing.

Best regards,
								Pavel
-- 
DENX Software Engineering GmbH,      Managing Director: Wolfgang Denk
HRB 165235 Munich, Office: Kirchenstr.5, D-82194 Groebenzell, Germany

[-- Attachment #1.2: signature.asc --]
[-- Type: application/pgp-signature, Size: 195 bytes --]

[-- Attachment #2: Type: text/plain, Size: 154 bytes --]

_______________________________________________
cip-dev mailing list
cip-dev@lists.cip-project.org
https://lists.cip-project.org/mailman/listinfo/cip-dev

^ permalink raw reply	[flat|nested] 12+ messages in thread

* Re: [cip-dev] 4.19.106-cip21-rt8 problems on de0-nano
  2020-03-03  9:23 ` Pavel Machek
@ 2020-03-03 17:10   ` Pavel Machek
  2020-03-03 21:28     ` Pavel Machek
  0 siblings, 1 reply; 12+ messages in thread
From: Pavel Machek @ 2020-03-03 17:10 UTC (permalink / raw)
  To: Pavel Machek; +Cc: cip-dev


[-- Attachment #1.1: Type: text/plain, Size: 1974 bytes --]

Hi!

> > I pushed candidate for -cip-rt, but it seems to fail on de0-nano
> > board. Code under testing is at:
> > 
> > https://gitlab.com/cip-project/cip-kernel/linux-cip/tree/ci/pavel/linux-cip
> 
> It is pipeline
> 
> https://gitlab.com/cip-project/cip-kernel/linux-cip/pipelines/122762401
> 
> I'll reuse the branch for more testing.

I managed to narrow the bad commit to the -rt tree, between:

OK 122904930 pick 69aa73357e6a rcu: Don't allow to change rcu_normal_after_boot on RT
pick 849ef8789077 pci/switchtec: fix stream_open.cocci warnings
pick ad8a5e8279c4 sched/core: Drop a preempt_disable_rt() statement
pick 966f066d96cb timers: Redo the notification of canceling timers on -RT
pick 0393fd5a4f9a Revert "futex: Ensure lock/unlock symetry versus pi_lock and hash bucket lock"
pick 84eb0b64a27a Revert "futex: Fix bug on when a requeued RT task times out"
pick fcc893280f4e Revert "rtmutex: Handle the various new futex race conditions"
pick 2eac93cf9d16 Revert "futex: workaround migrate_disable/enable in different context"
pick 9b8964629f4f futex: Make the futex_hash_bucket lock raw
pick cc1812bf198b futex: Delay deallocation of pi_state
> pick f5e115c43100 mm/zswap: Do not disable preemption in zswap_frontswap_store()
pick e0d0d09a08ad revert-aio
pick a0a40bfb4300 fs/aio: simple simple work
pick 0fae581d8c5e revert-thermal
pick c0d95b4a8a1b thermal: Defer thermal wakups to threads
pick 700fbb4afb6e revert-block
pick 4cda50ff12cf block: blk-mq: move blk_queue_usage_counter_release() into process context
pick 9e982f55745b workqueue: rework
pick c0db53dc3bf4 i2c: exynos5: Remove IRQF_ONESHOT
pick 1f160d170203 i2c: hix5hd2: Remove IRQF_ONESHOT
BAD 122882826 eae5a7cab722 sched/deadline: Ensure inactive_timer runs in hardirq context

Best regards,
								Pavel
-- 
DENX Software Engineering GmbH,      Managing Director: Wolfgang Denk
HRB 165235 Munich, Office: Kirchenstr.5, D-82194 Groebenzell, Germany

[-- Attachment #1.2: signature.asc --]
[-- Type: application/pgp-signature, Size: 195 bytes --]

[-- Attachment #2: Type: text/plain, Size: 154 bytes --]

_______________________________________________
cip-dev mailing list
cip-dev@lists.cip-project.org
https://lists.cip-project.org/mailman/listinfo/cip-dev

^ permalink raw reply	[flat|nested] 12+ messages in thread

* Re: [cip-dev] 4.19.106-cip21-rt8 problems on de0-nano
  2020-03-03 17:10   ` Pavel Machek
@ 2020-03-03 21:28     ` Pavel Machek
  2020-03-04 19:53       ` Jan Kiszka
  0 siblings, 1 reply; 12+ messages in thread
From: Pavel Machek @ 2020-03-03 21:28 UTC (permalink / raw)
  To: Pavel Machek; +Cc: cip-dev


[-- Attachment #1.1: Type: text/plain, Size: 2447 bytes --]

Hi!

> > > I pushed candidate for -cip-rt, but it seems to fail on de0-nano
> > > board. Code under testing is at:
> > > 
> > > https://gitlab.com/cip-project/cip-kernel/linux-cip/tree/ci/pavel/linux-cip
> > 
> > It is pipeline
> > 
> > https://gitlab.com/cip-project/cip-kernel/linux-cip/pipelines/122762401
> > 
> > I'll reuse the branch for more testing.
> 
> I managed to narrow the bad commit to the -rt tree, between:
> 
> OK 122904930 pick 69aa73357e6a rcu: Don't allow to change rcu_normal_after_boot on RT
> pick 849ef8789077 pci/switchtec: fix stream_open.cocci warnings
> pick ad8a5e8279c4 sched/core: Drop a preempt_disable_rt() statement
> pick 966f066d96cb timers: Redo the notification of canceling timers on -RT
> pick 0393fd5a4f9a Revert "futex: Ensure lock/unlock symetry versus pi_lock and hash bucket lock"
> pick 84eb0b64a27a Revert "futex: Fix bug on when a requeued RT task times out"
> pick fcc893280f4e Revert "rtmutex: Handle the various new futex race conditions"
> pick 2eac93cf9d16 Revert "futex: workaround migrate_disable/enable in different context"
> pick 9b8964629f4f futex: Make the futex_hash_bucket lock raw
> pick cc1812bf198b futex: Delay deallocation of pi_state
> > pick f5e115c43100 mm/zswap: Do not disable preemption in zswap_frontswap_store()
> pick e0d0d09a08ad revert-aio
> pick a0a40bfb4300 fs/aio: simple simple work
> pick 0fae581d8c5e revert-thermal
> pick c0d95b4a8a1b thermal: Defer thermal wakups to threads
> pick 700fbb4afb6e revert-block
> pick 4cda50ff12cf block: blk-mq: move blk_queue_usage_counter_release() into process context
> pick 9e982f55745b workqueue: rework
> pick c0db53dc3bf4 i2c: exynos5: Remove IRQF_ONESHOT
> pick 1f160d170203 i2c: hix5hd2: Remove IRQF_ONESHOT
> BAD 122882826 eae5a7cab722 sched/deadline: Ensure inactive_timer runs in hardirq context
>

And something went seriously wrong after these tests. I submitted same
tree twice, and got different results.

First this -- de0-nano succeeds:

https://gitlab.com/cip-project/cip-kernel/linux-cip/pipelines/122904930

Now this -- de0-nano fails (and ipc227e is unfinished for long time):

https://gitlab.com/cip-project/cip-kernel/linux-cip/pipelines/122959477

I'll need some help here.

Best regards,
								Pavel
-- 
DENX Software Engineering GmbH,      Managing Director: Wolfgang Denk
HRB 165235 Munich, Office: Kirchenstr.5, D-82194 Groebenzell, Germany

[-- Attachment #1.2: signature.asc --]
[-- Type: application/pgp-signature, Size: 195 bytes --]

[-- Attachment #2: Type: text/plain, Size: 154 bytes --]

_______________________________________________
cip-dev mailing list
cip-dev@lists.cip-project.org
https://lists.cip-project.org/mailman/listinfo/cip-dev

^ permalink raw reply	[flat|nested] 12+ messages in thread

* Re: [cip-dev] 4.19.106-cip21-rt8 problems on de0-nano
  2020-03-03 21:28     ` Pavel Machek
@ 2020-03-04 19:53       ` Jan Kiszka
  2020-03-09 10:21         ` [cip-dev] I need de0-nano testing for -rt release was " Pavel Machek
  0 siblings, 1 reply; 12+ messages in thread
From: Jan Kiszka @ 2020-03-04 19:53 UTC (permalink / raw)
  To: Pavel Machek; +Cc: cip-dev

On 03.03.20 22:28, Pavel Machek wrote:
> Hi!
> 
>>>> I pushed candidate for -cip-rt, but it seems to fail on de0-nano
>>>> board. Code under testing is at:
>>>>
>>>> https://gitlab.com/cip-project/cip-kernel/linux-cip/tree/ci/pavel/linux-cip
>>>
>>> It is pipeline
>>>
>>> https://gitlab.com/cip-project/cip-kernel/linux-cip/pipelines/122762401
>>>
>>> I'll reuse the branch for more testing.
>>
>> I managed to narrow the bad commit to the -rt tree, between:
>>
>> OK 122904930 pick 69aa73357e6a rcu: Don't allow to change rcu_normal_after_boot on RT
>> pick 849ef8789077 pci/switchtec: fix stream_open.cocci warnings
>> pick ad8a5e8279c4 sched/core: Drop a preempt_disable_rt() statement
>> pick 966f066d96cb timers: Redo the notification of canceling timers on -RT
>> pick 0393fd5a4f9a Revert "futex: Ensure lock/unlock symetry versus pi_lock and hash bucket lock"
>> pick 84eb0b64a27a Revert "futex: Fix bug on when a requeued RT task times out"
>> pick fcc893280f4e Revert "rtmutex: Handle the various new futex race conditions"
>> pick 2eac93cf9d16 Revert "futex: workaround migrate_disable/enable in different context"
>> pick 9b8964629f4f futex: Make the futex_hash_bucket lock raw
>> pick cc1812bf198b futex: Delay deallocation of pi_state
>>> pick f5e115c43100 mm/zswap: Do not disable preemption in zswap_frontswap_store()
>> pick e0d0d09a08ad revert-aio
>> pick a0a40bfb4300 fs/aio: simple simple work
>> pick 0fae581d8c5e revert-thermal
>> pick c0d95b4a8a1b thermal: Defer thermal wakups to threads
>> pick 700fbb4afb6e revert-block
>> pick 4cda50ff12cf block: blk-mq: move blk_queue_usage_counter_release() into process context
>> pick 9e982f55745b workqueue: rework
>> pick c0db53dc3bf4 i2c: exynos5: Remove IRQF_ONESHOT
>> pick 1f160d170203 i2c: hix5hd2: Remove IRQF_ONESHOT
>> BAD 122882826 eae5a7cab722 sched/deadline: Ensure inactive_timer runs in hardirq context
>>
> 
> And something went seriously wrong after these tests. I submitted same
> tree twice, and got different results.
> 
> First this -- de0-nano succeeds:
> 
> https://gitlab.com/cip-project/cip-kernel/linux-cip/pipelines/122904930
> 
> Now this -- de0-nano fails (and ipc227e is unfinished for long time):
> 
> https://gitlab.com/cip-project/cip-kernel/linux-cip/pipelines/122959477
> 
> I'll need some help here.
> 

The logs read like the targets are not (always) coming up, e.g.
https://gitlab.com/cip-project/cip-kernel/linux-cip/-/jobs/457824214#L377

Jan

-- 
Siemens AG, Corporate Technology, CT RDA IOT SES-DE
Corporate Competence Center Embedded Linux
_______________________________________________
cip-dev mailing list
cip-dev@lists.cip-project.org
https://lists.cip-project.org/mailman/listinfo/cip-dev

^ permalink raw reply	[flat|nested] 12+ messages in thread

* [cip-dev] I need de0-nano testing for -rt release was Re: 4.19.106-cip21-rt8 problems on de0-nano
  2020-03-04 19:53       ` Jan Kiszka
@ 2020-03-09 10:21         ` Pavel Machek
  2020-03-09 18:52           ` Jan Kiszka
  0 siblings, 1 reply; 12+ messages in thread
From: Pavel Machek @ 2020-03-09 10:21 UTC (permalink / raw)
  To: Jan Kiszka; +Cc: cip-dev


[-- Attachment #1.1: Type: text/plain, Size: 3251 bytes --]

Hi!

> > > > > I pushed candidate for -cip-rt, but it seems to fail on de0-nano
> > > > > board. Code under testing is at:
> > > > > 
> > > > > https://gitlab.com/cip-project/cip-kernel/linux-cip/tree/ci/pavel/linux-cip
> > > > 
> > > > It is pipeline
> > > > 
> > > > https://gitlab.com/cip-project/cip-kernel/linux-cip/pipelines/122762401
> > > > 
> > > > I'll reuse the branch for more testing.
> > > 
> > > I managed to narrow the bad commit to the -rt tree, between:
> > > 
> > > OK 122904930 pick 69aa73357e6a rcu: Don't allow to change rcu_normal_after_boot on RT
> > > pick 849ef8789077 pci/switchtec: fix stream_open.cocci warnings
> > > pick ad8a5e8279c4 sched/core: Drop a preempt_disable_rt() statement
> > > pick 966f066d96cb timers: Redo the notification of canceling timers on -RT
> > > pick 0393fd5a4f9a Revert "futex: Ensure lock/unlock symetry versus pi_lock and hash bucket lock"
> > > pick 84eb0b64a27a Revert "futex: Fix bug on when a requeued RT task times out"
> > > pick fcc893280f4e Revert "rtmutex: Handle the various new futex race conditions"
> > > pick 2eac93cf9d16 Revert "futex: workaround migrate_disable/enable in different context"
> > > pick 9b8964629f4f futex: Make the futex_hash_bucket lock raw
> > > pick cc1812bf198b futex: Delay deallocation of pi_state
> > > > pick f5e115c43100 mm/zswap: Do not disable preemption in zswap_frontswap_store()
> > > pick e0d0d09a08ad revert-aio
> > > pick a0a40bfb4300 fs/aio: simple simple work
> > > pick 0fae581d8c5e revert-thermal
> > > pick c0d95b4a8a1b thermal: Defer thermal wakups to threads
> > > pick 700fbb4afb6e revert-block
> > > pick 4cda50ff12cf block: blk-mq: move blk_queue_usage_counter_release() into process context
> > > pick 9e982f55745b workqueue: rework
> > > pick c0db53dc3bf4 i2c: exynos5: Remove IRQF_ONESHOT
> > > pick 1f160d170203 i2c: hix5hd2: Remove IRQF_ONESHOT
> > > BAD 122882826 eae5a7cab722 sched/deadline: Ensure inactive_timer runs in hardirq context
> > > 
> > 
> > And something went seriously wrong after these tests. I submitted same
> > tree twice, and got different results.
> > 
> > First this -- de0-nano succeeds:
> > 
> > https://gitlab.com/cip-project/cip-kernel/linux-cip/pipelines/122904930
> > 
> > Now this -- de0-nano fails (and ipc227e is unfinished for long time):
> > 
> > https://gitlab.com/cip-project/cip-kernel/linux-cip/pipelines/122959477
> > 
> > I'll need some help here.
> 
> The logs read like the targets are not (always) coming up, e.g.
> https://gitlab.com/cip-project/cip-kernel/linux-cip/-/jobs/457824214#L377

Yes... I don't need that target, but I need de0-nano... and it did not
work last time I checked.

On a related note... it would be good to somehow show difference
between "kernel test failure" and "target failure".

If we see bootloader in the logs, and then test fails/timeouts =>
"kernel test failure", I need to solve it.

If we don't get messages from the bootloader => "target failure",
someone needs to check the power relays or something...

Best regards,
								Pavel

-- 
DENX Software Engineering GmbH,      Managing Director: Wolfgang Denk
HRB 165235 Munich, Office: Kirchenstr.5, D-82194 Groebenzell, Germany

[-- Attachment #1.2: signature.asc --]
[-- Type: application/pgp-signature, Size: 195 bytes --]

[-- Attachment #2: Type: text/plain, Size: 154 bytes --]

_______________________________________________
cip-dev mailing list
cip-dev@lists.cip-project.org
https://lists.cip-project.org/mailman/listinfo/cip-dev

^ permalink raw reply	[flat|nested] 12+ messages in thread

* Re: [cip-dev] I need de0-nano testing for -rt release was Re: 4.19.106-cip21-rt8 problems on de0-nano
  2020-03-09 10:21         ` [cip-dev] I need de0-nano testing for -rt release was " Pavel Machek
@ 2020-03-09 18:52           ` Jan Kiszka
  2020-03-10 16:50             ` Bhola, Bikram
       [not found]             ` <78fd4f1df8034021ae195b8287cf2f29@SVR-IES-MBX-03.mgc.mentorg.com>
  0 siblings, 2 replies; 12+ messages in thread
From: Jan Kiszka @ 2020-03-09 18:52 UTC (permalink / raw)
  To: Pavel Machek, Bhola, Bikram, Quirin Gylstorff; +Cc: cip-dev

On 09.03.20 11:21, Pavel Machek wrote:
> Hi!
> 
>>>>>> I pushed candidate for -cip-rt, but it seems to fail on de0-nano
>>>>>> board. Code under testing is at:
>>>>>>
>>>>>> https://gitlab.com/cip-project/cip-kernel/linux-cip/tree/ci/pavel/linux-cip
>>>>>
>>>>> It is pipeline
>>>>>
>>>>> https://gitlab.com/cip-project/cip-kernel/linux-cip/pipelines/122762401
>>>>>
>>>>> I'll reuse the branch for more testing.
>>>>
>>>> I managed to narrow the bad commit to the -rt tree, between:
>>>>
>>>> OK 122904930 pick 69aa73357e6a rcu: Don't allow to change rcu_normal_after_boot on RT
>>>> pick 849ef8789077 pci/switchtec: fix stream_open.cocci warnings
>>>> pick ad8a5e8279c4 sched/core: Drop a preempt_disable_rt() statement
>>>> pick 966f066d96cb timers: Redo the notification of canceling timers on -RT
>>>> pick 0393fd5a4f9a Revert "futex: Ensure lock/unlock symetry versus pi_lock and hash bucket lock"
>>>> pick 84eb0b64a27a Revert "futex: Fix bug on when a requeued RT task times out"
>>>> pick fcc893280f4e Revert "rtmutex: Handle the various new futex race conditions"
>>>> pick 2eac93cf9d16 Revert "futex: workaround migrate_disable/enable in different context"
>>>> pick 9b8964629f4f futex: Make the futex_hash_bucket lock raw
>>>> pick cc1812bf198b futex: Delay deallocation of pi_state
>>>>> pick f5e115c43100 mm/zswap: Do not disable preemption in zswap_frontswap_store()
>>>> pick e0d0d09a08ad revert-aio
>>>> pick a0a40bfb4300 fs/aio: simple simple work
>>>> pick 0fae581d8c5e revert-thermal
>>>> pick c0d95b4a8a1b thermal: Defer thermal wakups to threads
>>>> pick 700fbb4afb6e revert-block
>>>> pick 4cda50ff12cf block: blk-mq: move blk_queue_usage_counter_release() into process context
>>>> pick 9e982f55745b workqueue: rework
>>>> pick c0db53dc3bf4 i2c: exynos5: Remove IRQF_ONESHOT
>>>> pick 1f160d170203 i2c: hix5hd2: Remove IRQF_ONESHOT
>>>> BAD 122882826 eae5a7cab722 sched/deadline: Ensure inactive_timer runs in hardirq context
>>>>
>>>
>>> And something went seriously wrong after these tests. I submitted same
>>> tree twice, and got different results.
>>>
>>> First this -- de0-nano succeeds:
>>>
>>> https://gitlab.com/cip-project/cip-kernel/linux-cip/pipelines/122904930
>>>
>>> Now this -- de0-nano fails (and ipc227e is unfinished for long time):
>>>
>>> https://gitlab.com/cip-project/cip-kernel/linux-cip/pipelines/122959477
>>>
>>> I'll need some help here.
>>
>> The logs read like the targets are not (always) coming up, e.g.
>> https://gitlab.com/cip-project/cip-kernel/linux-cip/-/jobs/457824214#L377
> 
> Yes... I don't need that target, but I need de0-nano... and it did not
> work last time I checked.

Bikram, could someone on your side check the board status in the Mentor 
lab? Thanks!

> 
> On a related note... it would be good to somehow show difference
> between "kernel test failure" and "target failure".
> 
> If we see bootloader in the logs, and then test fails/timeouts =>
> "kernel test failure", I need to solve it.
> 
> If we don't get messages from the bootloader => "target failure",
> someone needs to check the power relays or something...

I'm not happy about the parsability of those LAVA logs either, but I 
have no idea if/how that can be improved best. Maybe Quirin has some 
idea based on his work with them.

Jan

-- 
Siemens AG, Corporate Technology, CT RDA IOT SES-DE
Corporate Competence Center Embedded Linux
_______________________________________________
cip-dev mailing list
cip-dev@lists.cip-project.org
https://lists.cip-project.org/mailman/listinfo/cip-dev

^ permalink raw reply	[flat|nested] 12+ messages in thread

* Re: [cip-dev] I need de0-nano testing for -rt release was Re: 4.19.106-cip21-rt8 problems on de0-nano
  2020-03-09 18:52           ` Jan Kiszka
@ 2020-03-10 16:50             ` Bhola, Bikram
       [not found]             ` <78fd4f1df8034021ae195b8287cf2f29@SVR-IES-MBX-03.mgc.mentorg.com>
  1 sibling, 0 replies; 12+ messages in thread
From: Bhola, Bikram @ 2020-03-10 16:50 UTC (permalink / raw)
  To: Jan Kiszka, Pavel Machek, Quirin Gylstorff; +Cc: cip-dev

Hi Jan and All,,

We are working on it. 

Looks like we have a slow network in last few days in our lab that results in rootfs download timeout failure. Time being we need to increase the current timeout from 15 mins to 30 mins for safer side (its failing in between 90% completion). Meantime I am working with our network team to diagnose the slowness. 

Thank You!!

Regards,
Bikram 

-----Original Message-----
From: Jan Kiszka [mailto:jan.kiszka@siemens.com] 
Sent: 10 March 2020 00:23
To: Pavel Machek <pavel@denx.de>; Bhola, Bikram <Bikram_Bhola@mentor.com>; Quirin Gylstorff <quirin.gylstorff@siemens.com>
Cc: cip-dev@lists.cip-project.org
Subject: Re: I need de0-nano testing for -rt release was Re: [cip-dev] 4.19.106-cip21-rt8 problems on de0-nano

On 09.03.20 11:21, Pavel Machek wrote:
> Hi!
> 
>>>>>> I pushed candidate for -cip-rt, but it seems to fail on de0-nano 
>>>>>> board. Code under testing is at:
>>>>>>
>>>>>> https://gitlab.com/cip-project/cip-kernel/linux-cip/tree/ci/pavel
>>>>>> /linux-cip
>>>>>
>>>>> It is pipeline
>>>>>
>>>>> https://gitlab.com/cip-project/cip-kernel/linux-cip/pipelines/1227
>>>>> 62401
>>>>>
>>>>> I'll reuse the branch for more testing.
>>>>
>>>> I managed to narrow the bad commit to the -rt tree, between:
>>>>
>>>> OK 122904930 pick 69aa73357e6a rcu: Don't allow to change 
>>>> rcu_normal_after_boot on RT pick 849ef8789077 pci/switchtec: fix 
>>>> stream_open.cocci warnings pick ad8a5e8279c4 sched/core: Drop a 
>>>> preempt_disable_rt() statement pick 966f066d96cb timers: Redo the 
>>>> notification of canceling timers on -RT pick 0393fd5a4f9a Revert "futex: Ensure lock/unlock symetry versus pi_lock and hash bucket lock"
>>>> pick 84eb0b64a27a Revert "futex: Fix bug on when a requeued RT task times out"
>>>> pick fcc893280f4e Revert "rtmutex: Handle the various new futex race conditions"
>>>> pick 2eac93cf9d16 Revert "futex: workaround migrate_disable/enable in different context"
>>>> pick 9b8964629f4f futex: Make the futex_hash_bucket lock raw pick 
>>>> cc1812bf198b futex: Delay deallocation of pi_state
>>>>> pick f5e115c43100 mm/zswap: Do not disable preemption in 
>>>>> zswap_frontswap_store()
>>>> pick e0d0d09a08ad revert-aio
>>>> pick a0a40bfb4300 fs/aio: simple simple work pick 0fae581d8c5e 
>>>> revert-thermal pick c0d95b4a8a1b thermal: Defer thermal wakups to 
>>>> threads pick 700fbb4afb6e revert-block pick 4cda50ff12cf block: 
>>>> blk-mq: move blk_queue_usage_counter_release() into process context 
>>>> pick 9e982f55745b workqueue: rework pick c0db53dc3bf4 i2c: exynos5: 
>>>> Remove IRQF_ONESHOT pick 1f160d170203 i2c: hix5hd2: Remove 
>>>> IRQF_ONESHOT BAD 122882826 eae5a7cab722 sched/deadline: Ensure 
>>>> inactive_timer runs in hardirq context
>>>>
>>>
>>> And something went seriously wrong after these tests. I submitted 
>>> same tree twice, and got different results.
>>>
>>> First this -- de0-nano succeeds:
>>>
>>> https://gitlab.com/cip-project/cip-kernel/linux-cip/pipelines/122904
>>> 930
>>>
>>> Now this -- de0-nano fails (and ipc227e is unfinished for long time):
>>>
>>> https://gitlab.com/cip-project/cip-kernel/linux-cip/pipelines/122959
>>> 477
>>>
>>> I'll need some help here.
>>
>> The logs read like the targets are not (always) coming up, e.g.
>> https://gitlab.com/cip-project/cip-kernel/linux-cip/-/jobs/457824214#
>> L377
> 
> Yes... I don't need that target, but I need de0-nano... and it did not 
> work last time I checked.

Bikram, could someone on your side check the board status in the Mentor lab? Thanks!

> 
> On a related note... it would be good to somehow show difference 
> between "kernel test failure" and "target failure".
> 
> If we see bootloader in the logs, and then test fails/timeouts => 
> "kernel test failure", I need to solve it.
> 
> If we don't get messages from the bootloader => "target failure", 
> someone needs to check the power relays or something...

I'm not happy about the parsability of those LAVA logs either, but I have no idea if/how that can be improved best. Maybe Quirin has some idea based on his work with them.

Jan

--
Siemens AG, Corporate Technology, CT RDA IOT SES-DE Corporate Competence Center Embedded Linux
_______________________________________________
cip-dev mailing list
cip-dev@lists.cip-project.org
https://lists.cip-project.org/mailman/listinfo/cip-dev

^ permalink raw reply	[flat|nested] 12+ messages in thread

* Re: I need de0-nano testing for -rt release was Re: 4.19.106-cip21-rt8 problems on de0-nano
       [not found]             ` <78fd4f1df8034021ae195b8287cf2f29@SVR-IES-MBX-03.mgc.mentorg.com>
@ 2020-03-11 15:36               ` Bhola, Bikram
  2020-03-15  9:42                 ` [cip-dev] " Pavel Machek
  0 siblings, 1 reply; 12+ messages in thread
From: Bhola, Bikram @ 2020-03-11 15:36 UTC (permalink / raw)
  To: Jan Kiszka, Pavel Machek, Quirin Gylstorff; +Cc: cip-dev

Hi Jan and All,

Both de0-nano and IPC227E targets are up and running. I have monitored for test jobs on it and those completed successfully. 

Thank You!!

Regards,
Bikram 
-----Original Message-----
From: Bhola, Bikram 
Sent: 10 March 2020 22:20
To: 'Jan Kiszka' <jan.kiszka@siemens.com>; Pavel Machek <pavel@denx.de>; Quirin Gylstorff <quirin.gylstorff@siemens.com>
Cc: cip-dev@lists.cip-project.org
Subject: RE: I need de0-nano testing for -rt release was Re: [cip-dev] 4.19.106-cip21-rt8 problems on de0-nano

Hi Jan and All,,

We are working on it. 

Looks like we have a slow network in last few days in our lab that results in rootfs download timeout failure. Time being we need to increase the current timeout from 15 mins to 30 mins for safer side (its failing in between 90% completion). Meantime I am working with our network team to diagnose the slowness. 

Thank You!!

Regards,
Bikram 

-----Original Message-----
From: Jan Kiszka [mailto:jan.kiszka@siemens.com]
Sent: 10 March 2020 00:23
To: Pavel Machek <pavel@denx.de>; Bhola, Bikram <Bikram_Bhola@mentor.com>; Quirin Gylstorff <quirin.gylstorff@siemens.com>
Cc: cip-dev@lists.cip-project.org
Subject: Re: I need de0-nano testing for -rt release was Re: [cip-dev] 4.19.106-cip21-rt8 problems on de0-nano

On 09.03.20 11:21, Pavel Machek wrote:
> Hi!
> 
>>>>>> I pushed candidate for -cip-rt, but it seems to fail on de0-nano 
>>>>>> board. Code under testing is at:
>>>>>>
>>>>>> https://gitlab.com/cip-project/cip-kernel/linux-cip/tree/ci/pavel
>>>>>> /linux-cip
>>>>>
>>>>> It is pipeline
>>>>>
>>>>> https://gitlab.com/cip-project/cip-kernel/linux-cip/pipelines/1227
>>>>> 62401
>>>>>
>>>>> I'll reuse the branch for more testing.
>>>>
>>>> I managed to narrow the bad commit to the -rt tree, between:
>>>>
>>>> OK 122904930 pick 69aa73357e6a rcu: Don't allow to change 
>>>> rcu_normal_after_boot on RT pick 849ef8789077 pci/switchtec: fix 
>>>> stream_open.cocci warnings pick ad8a5e8279c4 sched/core: Drop a
>>>> preempt_disable_rt() statement pick 966f066d96cb timers: Redo the 
>>>> notification of canceling timers on -RT pick 0393fd5a4f9a Revert "futex: Ensure lock/unlock symetry versus pi_lock and hash bucket lock"
>>>> pick 84eb0b64a27a Revert "futex: Fix bug on when a requeued RT task times out"
>>>> pick fcc893280f4e Revert "rtmutex: Handle the various new futex race conditions"
>>>> pick 2eac93cf9d16 Revert "futex: workaround migrate_disable/enable in different context"
>>>> pick 9b8964629f4f futex: Make the futex_hash_bucket lock raw pick 
>>>> cc1812bf198b futex: Delay deallocation of pi_state
>>>>> pick f5e115c43100 mm/zswap: Do not disable preemption in
>>>>> zswap_frontswap_store()
>>>> pick e0d0d09a08ad revert-aio
>>>> pick a0a40bfb4300 fs/aio: simple simple work pick 0fae581d8c5e 
>>>> revert-thermal pick c0d95b4a8a1b thermal: Defer thermal wakups to 
>>>> threads pick 700fbb4afb6e revert-block pick 4cda50ff12cf block:
>>>> blk-mq: move blk_queue_usage_counter_release() into process context 
>>>> pick 9e982f55745b workqueue: rework pick c0db53dc3bf4 i2c: exynos5:
>>>> Remove IRQF_ONESHOT pick 1f160d170203 i2c: hix5hd2: Remove 
>>>> IRQF_ONESHOT BAD 122882826 eae5a7cab722 sched/deadline: Ensure 
>>>> inactive_timer runs in hardirq context
>>>>
>>>
>>> And something went seriously wrong after these tests. I submitted 
>>> same tree twice, and got different results.
>>>
>>> First this -- de0-nano succeeds:
>>>
>>> https://gitlab.com/cip-project/cip-kernel/linux-cip/pipelines/122904
>>> 930
>>>
>>> Now this -- de0-nano fails (and ipc227e is unfinished for long time):
>>>
>>> https://gitlab.com/cip-project/cip-kernel/linux-cip/pipelines/122959
>>> 477
>>>
>>> I'll need some help here.
>>
>> The logs read like the targets are not (always) coming up, e.g.
>> https://gitlab.com/cip-project/cip-kernel/linux-cip/-/jobs/457824214#
>> L377
> 
> Yes... I don't need that target, but I need de0-nano... and it did not 
> work last time I checked.

Bikram, could someone on your side check the board status in the Mentor lab? Thanks!

> 
> On a related note... it would be good to somehow show difference 
> between "kernel test failure" and "target failure".
> 
> If we see bootloader in the logs, and then test fails/timeouts => 
> "kernel test failure", I need to solve it.
> 
> If we don't get messages from the bootloader => "target failure", 
> someone needs to check the power relays or something...

I'm not happy about the parsability of those LAVA logs either, but I have no idea if/how that can be improved best. Maybe Quirin has some idea based on his work with them.

Jan

--
Siemens AG, Corporate Technology, CT RDA IOT SES-DE Corporate Competence Center Embedded Linux



^ permalink raw reply	[flat|nested] 12+ messages in thread

* Re: [cip-dev] I need de0-nano testing for -rt release was Re: 4.19.106-cip21-rt8 problems on de0-nano
  2020-03-11 15:36               ` Bhola, Bikram
@ 2020-03-15  9:42                 ` Pavel Machek
  2020-03-16 11:59                   ` Pavel Machek
  0 siblings, 1 reply; 12+ messages in thread
From: Pavel Machek @ 2020-03-15  9:42 UTC (permalink / raw)
  To: Bhola, Bikram; +Cc: Jan Kiszka, cip-dev


[-- Attachment #1.1: Type: text/plain, Size: 531 bytes --]

Hi!

> Both de0-nano and IPC227E targets are up and running. I have monitored for test jobs on it and those completed successfully. 
> 
> Thank You!!

There's still something broken with the testing. renesas_shmobile
initially failed (okay after restart), rest failed:

https://gitlab.com/cip-project/cip-kernel/linux-cip/pipelines/126355890

Best regards,
								Pavel
-- 
DENX Software Engineering GmbH,      Managing Director: Wolfgang Denk
HRB 165235 Munich, Office: Kirchenstr.5, D-82194 Groebenzell, Germany

[-- Attachment #1.2: signature.asc --]
[-- Type: application/pgp-signature, Size: 195 bytes --]

[-- Attachment #2: Type: text/plain, Size: 154 bytes --]

_______________________________________________
cip-dev mailing list
cip-dev@lists.cip-project.org
https://lists.cip-project.org/mailman/listinfo/cip-dev

^ permalink raw reply	[flat|nested] 12+ messages in thread

* Re: [cip-dev] I need de0-nano testing for -rt release was Re: 4.19.106-cip21-rt8 problems on de0-nano
  2020-03-15  9:42                 ` [cip-dev] " Pavel Machek
@ 2020-03-16 11:59                   ` Pavel Machek
  2020-03-16 16:05                     ` Bhola, Bikram
  0 siblings, 1 reply; 12+ messages in thread
From: Pavel Machek @ 2020-03-16 11:59 UTC (permalink / raw)
  To: Pavel Machek; +Cc: Jan Kiszka, Bhola, Bikram, cip-dev


[-- Attachment #1.1: Type: text/plain, Size: 1239 bytes --]

Hi!

> > Both de0-nano and IPC227E targets are up and running. I have monitored for test jobs on it and those completed successfully. 
> > 
> > Thank You!!
> 
> There's still something broken with the testing. renesas_shmobile
> initially failed (okay after restart), rest failed:
> 
> https://gitlab.com/cip-project/cip-kernel/linux-cip/pipelines/126355890

https://lava.ciplatform.org/scheduler/job/12718

Going through the logs:

progress  90% (81MB)
progress  95% (86MB)
progress 100% (90MB)
90MB downloaded in 383.03s (0.24MB/s)
end: 1.3.1 http-download (duration 00:06:23) [common]
case: http-download
case_id: 403737
definition: lava
duration: 383.03
extra: ...
level: 1.3.1
namespace: common
result: pass
tftp-deploy timed out after 1283 seconds
end: 1.3 download-retry (duration 00:06:24) [common

You are not trying to do tftp over WAN, are you?

Seeing the download speeds... would it make sense to do downloads with
rsync? Root filesystems (etc) are not changing too often, so that
should provide some speedups.

Best regards,
								Pavel
-- 
DENX Software Engineering GmbH,      Managing Director: Wolfgang Denk
HRB 165235 Munich, Office: Kirchenstr.5, D-82194 Groebenzell, Germany

[-- Attachment #1.2: signature.asc --]
[-- Type: application/pgp-signature, Size: 195 bytes --]

[-- Attachment #2: Type: text/plain, Size: 154 bytes --]

_______________________________________________
cip-dev mailing list
cip-dev@lists.cip-project.org
https://lists.cip-project.org/mailman/listinfo/cip-dev

^ permalink raw reply	[flat|nested] 12+ messages in thread

* Re: [cip-dev] I need de0-nano testing for -rt release was Re: 4.19.106-cip21-rt8 problems on de0-nano
  2020-03-16 11:59                   ` Pavel Machek
@ 2020-03-16 16:05                     ` Bhola, Bikram
  0 siblings, 0 replies; 12+ messages in thread
From: Bhola, Bikram @ 2020-03-16 16:05 UTC (permalink / raw)
  To: Pavel Machek; +Cc: Jan Kiszka, Gumansingh, Smita, cip-dev

Hi Pavel,

All these random failures are happening because of the slowness in network. 
Because of the COVID-19 situation and in preparation of all employees to work from home office, Our IT team was doing some experiments/setup by reserving some network bandwidth. That caused the slowness in network and the timeout.

The board is connected through network cable. If Root filesystems (etc) are not changing too often, it totally make sense to do rsync to pull additional changes if any. We will give it a try on that. 

I saw all of the jobs started working fine today again for both 127E and 227E board. But we may see these random failures as IT told to have these network thing will settle in another day or two. 

Sorry for the trouble and thank you for being patient. 

Regards,
Bikram 

-----Original Message-----
From: Pavel Machek [mailto:pavel@denx.de] 
Sent: 16 March 2020 17:29
To: Pavel Machek <pavel@denx.de>
Cc: Bhola, Bikram <Bikram_Bhola@mentor.com>; Jan Kiszka <jan.kiszka@siemens.com>; Quirin Gylstorff <quirin.gylstorff@siemens.com>; cip-dev@lists.cip-project.org
Subject: Re: I need de0-nano testing for -rt release was Re: [cip-dev] 4.19.106-cip21-rt8 problems on de0-nano

Hi!

> > Both de0-nano and IPC227E targets are up and running. I have monitored for test jobs on it and those completed successfully. 
> > 
> > Thank You!!
> 
> There's still something broken with the testing. renesas_shmobile 
> initially failed (okay after restart), rest failed:
> 
> https://gitlab.com/cip-project/cip-kernel/linux-cip/pipelines/12635589
> 0

https://lava.ciplatform.org/scheduler/job/12718

Going through the logs:

progress  90% (81MB)
progress  95% (86MB)
progress 100% (90MB)
90MB downloaded in 383.03s (0.24MB/s)
end: 1.3.1 http-download (duration 00:06:23) [common]
case: http-download
case_id: 403737
definition: lava
duration: 383.03
extra: ...
level: 1.3.1
namespace: common
result: pass
tftp-deploy timed out after 1283 seconds
end: 1.3 download-retry (duration 00:06:24) [common

You are not trying to do tftp over WAN, are you?

Seeing the download speeds... would it make sense to do downloads with rsync? Root filesystems (etc) are not changing too often, so that should provide some speedups.

Best regards,
								Pavel
-- 
DENX Software Engineering GmbH,      Managing Director: Wolfgang Denk
HRB 165235 Munich, Office: Kirchenstr.5, D-82194 Groebenzell, Germany
_______________________________________________
cip-dev mailing list
cip-dev@lists.cip-project.org
https://lists.cip-project.org/mailman/listinfo/cip-dev

^ permalink raw reply	[flat|nested] 12+ messages in thread

end of thread, other threads:[~2020-03-16 16:13 UTC | newest]

Thread overview: 12+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2020-03-03  9:15 [cip-dev] 4.19.106-cip21-rt8 problems on de0-nano Pavel Machek
2020-03-03  9:23 ` Pavel Machek
2020-03-03 17:10   ` Pavel Machek
2020-03-03 21:28     ` Pavel Machek
2020-03-04 19:53       ` Jan Kiszka
2020-03-09 10:21         ` [cip-dev] I need de0-nano testing for -rt release was " Pavel Machek
2020-03-09 18:52           ` Jan Kiszka
2020-03-10 16:50             ` Bhola, Bikram
     [not found]             ` <78fd4f1df8034021ae195b8287cf2f29@SVR-IES-MBX-03.mgc.mentorg.com>
2020-03-11 15:36               ` Bhola, Bikram
2020-03-15  9:42                 ` [cip-dev] " Pavel Machek
2020-03-16 11:59                   ` Pavel Machek
2020-03-16 16:05                     ` Bhola, Bikram

This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.