* Prompt timeouts on ipc227e board -- randomness related?
@ 2021-09-25 20:06 ` Pavel Machek
0 siblings, 0 replies; 8+ messages in thread
From: Pavel Machek @ 2021-09-25 20:06 UTC (permalink / raw)
To: Chris.Paterson2, cip-dev
[-- Attachment #1: Type: text/plain, Size: 1775 bytes --]
Hi!
It is not first time I see this failure:
https://lava.ciplatform.org/scheduler/job/444336
[[0;32m OK [0m] Started Login Service.
[[0m[0;31m* [0m] (1 of 2) A start job is running for…ate sshd host keys (7s / no limit)[K[[0;1;31m*[0m[0;31m* [0m] (1 of 2) A start job is running for…ate sshd host keys (8s / no limit)[K[[0;31m*[0;1;31m*[0m[0;31m* [0m] (1 of 2) A start job is running for…ate sshd host keys (9s / no limit)[K[ [0;31m*[0;1;31m*[0m[0;31m* [0m] (2 of 2) A start job is running for…evices-eth0.device (8s / 1min 30s)[ 19.855328] systemd[1]: apt-daily-upgrade.timer: Adding 3min 2.027476s random time.
[ 19.864207] systemd[1]: apt-daily.timer: Adding 1h 54min 15.794344s random time.
[ 21.406490] systemd[1]: apt-daily-upgrade.timer: Adding 55min 47.041488s random time.
[ 21.415357] systemd[1]: apt-daily.timer: Adding 11h 48min 4.457495s random time.
[ 22.049807] systemd[1]: apt-daily-upgrade.timer: Adding 3min 54.125406s random time.
[ 22.058500] systemd[1]: apt-daily.timer: Adding 8h 34min 47.388595s random time.
[ 22.511646] systemd[1]: apt-daily-upgrade.timer: Adding 25min 13.015405s random time.
[ 22.520510] systemd[1]: apt-daily.timer: Adding 11h 58min 24.212170s random time.
[K[[0;32m OK [0m] Started Regenerate sshd host keys.
wait for prompt timed out
end: 2.3.4.1 login-action (duration 00:00:24) [common]
case: login-action
case_id: 9417066
definition: lava
duration: 23.98
Any idea what is going on there? Is it just a test problem, or do we
have kernel regression that only happens sometimes?
Best regards,
Pavel
--
DENX Software Engineering GmbH, Managing Director: Wolfgang Denk
HRB 165235 Munich, Office: Kirchenstr.5, D-82194 Groebenzell, Germany
[-- Attachment #2: signature.asc --]
[-- Type: application/pgp-signature, Size: 195 bytes --]
^ permalink raw reply [flat|nested] 8+ messages in thread
* [cip-dev] Prompt timeouts on ipc227e board -- randomness related?
@ 2021-09-25 20:06 ` Pavel Machek
0 siblings, 0 replies; 8+ messages in thread
From: Pavel Machek @ 2021-09-25 20:06 UTC (permalink / raw)
To: Chris.Paterson2, cip-dev
[-- Attachment #1.1: Type: text/plain, Size: 1775 bytes --]
Hi!
It is not first time I see this failure:
https://lava.ciplatform.org/scheduler/job/444336
[[0;32m OK [0m] Started Login Service.
[[0m[0;31m* [0m] (1 of 2) A start job is running for…ate sshd host keys (7s / no limit)[K[[0;1;31m*[0m[0;31m* [0m] (1 of 2) A start job is running for…ate sshd host keys (8s / no limit)[K[[0;31m*[0;1;31m*[0m[0;31m* [0m] (1 of 2) A start job is running for…ate sshd host keys (9s / no limit)[K[ [0;31m*[0;1;31m*[0m[0;31m* [0m] (2 of 2) A start job is running for…evices-eth0.device (8s / 1min 30s)[ 19.855328] systemd[1]: apt-daily-upgrade.timer: Adding 3min 2.027476s random time.
[ 19.864207] systemd[1]: apt-daily.timer: Adding 1h 54min 15.794344s random time.
[ 21.406490] systemd[1]: apt-daily-upgrade.timer: Adding 55min 47.041488s random time.
[ 21.415357] systemd[1]: apt-daily.timer: Adding 11h 48min 4.457495s random time.
[ 22.049807] systemd[1]: apt-daily-upgrade.timer: Adding 3min 54.125406s random time.
[ 22.058500] systemd[1]: apt-daily.timer: Adding 8h 34min 47.388595s random time.
[ 22.511646] systemd[1]: apt-daily-upgrade.timer: Adding 25min 13.015405s random time.
[ 22.520510] systemd[1]: apt-daily.timer: Adding 11h 58min 24.212170s random time.
[K[[0;32m OK [0m] Started Regenerate sshd host keys.
wait for prompt timed out
end: 2.3.4.1 login-action (duration 00:00:24) [common]
case: login-action
case_id: 9417066
definition: lava
duration: 23.98
Any idea what is going on there? Is it just a test problem, or do we
have kernel regression that only happens sometimes?
Best regards,
Pavel
--
DENX Software Engineering GmbH, Managing Director: Wolfgang Denk
HRB 165235 Munich, Office: Kirchenstr.5, D-82194 Groebenzell, Germany
[-- Attachment #1.2: signature.asc --]
[-- Type: application/pgp-signature, Size: 195 bytes --]
[-- Attachment #2: Type: text/plain, Size: 429 bytes --]
-=-=-=-=-=-=-=-=-=-=-=-
Links: You receive all messages sent to this group.
View/Reply Online (#6749): https://lists.cip-project.org/g/cip-dev/message/6749
Mute This Topic: https://lists.cip-project.org/mt/85867572/4520388
Group Owner: cip-dev+owner@lists.cip-project.org
Unsubscribe: https://lists.cip-project.org/g/cip-dev/leave/10495289/4520388/727948398/xyzzy [cip-dev@archiver.kernel.org]
-=-=-=-=-=-=-=-=-=-=-=-
^ permalink raw reply [flat|nested] 8+ messages in thread
* RE: Prompt timeouts on ipc227e board -- randomness related?
@ 2021-09-28 10:08 ` Chris Paterson
0 siblings, 0 replies; 8+ messages in thread
From: Chris Paterson @ 2021-09-28 10:08 UTC (permalink / raw)
To: Pavel Machek, Bhola, Bikram; +Cc: cip-dev, Jan Kiszka
Hello Pavel,
> From: Pavel Machek <pavel@denx.de>
> Sent: 25 September 2021 21:06
>
> Hi!
>
> It is not first time I see this failure:
Thank you for reporting the issue.
Bikram is going to take a look for us (thank you).
Kind regards, Chris
>
> https://lava.ciplatform.org/scheduler/job/444336
>
>
> [[0;32m OK [0m] Started Login Service.
> [[0m[0;31m* [0m] (1 of 2) A start job is running for…ate sshd host keys (7s /
> no limit)[K[[0;1;31m*[0m[0;31m* [0m] (1 of 2) A start job is running for…ate
> sshd host keys (8s / no limit)[K[[0;31m*[0;1;31m*[0m[0;31m* [0m] (1 of 2)
> A start job is running for…ate sshd host keys (9s / no limit)[K[
> [0;31m*[0;1;31m*[0m[0;31m* [0m] (2 of 2) A start job is running for…evices-
> eth0.device (8s / 1min 30s)[ 19.855328] systemd[1]: apt-daily-
> upgrade.timer: Adding 3min 2.027476s random time.
> [ 19.864207] systemd[1]: apt-daily.timer: Adding 1h 54min 15.794344s
> random time.
> [ 21.406490] systemd[1]: apt-daily-upgrade.timer: Adding 55min 47.041488s
> random time.
> [ 21.415357] systemd[1]: apt-daily.timer: Adding 11h 48min 4.457495s
> random time.
> [ 22.049807] systemd[1]: apt-daily-upgrade.timer: Adding 3min 54.125406s
> random time.
> [ 22.058500] systemd[1]: apt-daily.timer: Adding 8h 34min 47.388595s
> random time.
> [ 22.511646] systemd[1]: apt-daily-upgrade.timer: Adding 25min 13.015405s
> random time.
> [ 22.520510] systemd[1]: apt-daily.timer: Adding 11h 58min 24.212170s
> random time.
> [K[[0;32m OK [0m] Started Regenerate sshd host keys.
> wait for prompt timed out
> end: 2.3.4.1 login-action (duration 00:00:24) [common]
> case: login-action
> case_id: 9417066
> definition: lava
> duration: 23.98
>
> Any idea what is going on there? Is it just a test problem, or do we
> have kernel regression that only happens sometimes?
>
> Best regards,
> Pavel
> --
> DENX Software Engineering GmbH, Managing Director: Wolfgang Denk
> HRB 165235 Munich, Office: Kirchenstr.5, D-82194 Groebenzell, Germany
^ permalink raw reply [flat|nested] 8+ messages in thread
* Re: [cip-dev] Prompt timeouts on ipc227e board -- randomness related?
@ 2021-09-28 10:08 ` Chris Paterson
0 siblings, 0 replies; 8+ messages in thread
From: Chris Paterson @ 2021-09-28 10:08 UTC (permalink / raw)
To: Pavel Machek, Bhola, Bikram; +Cc: cip-dev, Jan Kiszka
[-- Attachment #1: Type: text/plain, Size: 2087 bytes --]
Hello Pavel,
> From: Pavel Machek <pavel@denx.de>
> Sent: 25 September 2021 21:06
>
> Hi!
>
> It is not first time I see this failure:
Thank you for reporting the issue.
Bikram is going to take a look for us (thank you).
Kind regards, Chris
>
> https://lava.ciplatform.org/scheduler/job/444336
>
>
> [[0;32m OK [0m] Started Login Service.
> [[0m[0;31m* [0m] (1 of 2) A start job is running for…ate sshd host keys (7s /
> no limit)[K[[0;1;31m*[0m[0;31m* [0m] (1 of 2) A start job is running for…ate
> sshd host keys (8s / no limit)[K[[0;31m*[0;1;31m*[0m[0;31m* [0m] (1 of 2)
> A start job is running for…ate sshd host keys (9s / no limit)[K[
> [0;31m*[0;1;31m*[0m[0;31m* [0m] (2 of 2) A start job is running for…evices-
> eth0.device (8s / 1min 30s)[ 19.855328] systemd[1]: apt-daily-
> upgrade.timer: Adding 3min 2.027476s random time.
> [ 19.864207] systemd[1]: apt-daily.timer: Adding 1h 54min 15.794344s
> random time.
> [ 21.406490] systemd[1]: apt-daily-upgrade.timer: Adding 55min 47.041488s
> random time.
> [ 21.415357] systemd[1]: apt-daily.timer: Adding 11h 48min 4.457495s
> random time.
> [ 22.049807] systemd[1]: apt-daily-upgrade.timer: Adding 3min 54.125406s
> random time.
> [ 22.058500] systemd[1]: apt-daily.timer: Adding 8h 34min 47.388595s
> random time.
> [ 22.511646] systemd[1]: apt-daily-upgrade.timer: Adding 25min 13.015405s
> random time.
> [ 22.520510] systemd[1]: apt-daily.timer: Adding 11h 58min 24.212170s
> random time.
> [K[[0;32m OK [0m] Started Regenerate sshd host keys.
> wait for prompt timed out
> end: 2.3.4.1 login-action (duration 00:00:24) [common]
> case: login-action
> case_id: 9417066
> definition: lava
> duration: 23.98
>
> Any idea what is going on there? Is it just a test problem, or do we
> have kernel regression that only happens sometimes?
>
> Best regards,
> Pavel
> --
> DENX Software Engineering GmbH, Managing Director: Wolfgang Denk
> HRB 165235 Munich, Office: Kirchenstr.5, D-82194 Groebenzell, Germany
[-- Attachment #2: Type: text/plain, Size: 429 bytes --]
-=-=-=-=-=-=-=-=-=-=-=-
Links: You receive all messages sent to this group.
View/Reply Online (#6754): https://lists.cip-project.org/g/cip-dev/message/6754
Mute This Topic: https://lists.cip-project.org/mt/85867572/4520388
Group Owner: cip-dev+owner@lists.cip-project.org
Unsubscribe: https://lists.cip-project.org/g/cip-dev/leave/10495289/4520388/727948398/xyzzy [cip-dev@archiver.kernel.org]
-=-=-=-=-=-=-=-=-=-=-=-
^ permalink raw reply [flat|nested] 8+ messages in thread
* RE: Prompt timeouts on ipc227e board -- randomness related?
@ 2021-10-05 13:41 ` Chris Paterson
0 siblings, 0 replies; 8+ messages in thread
From: Chris Paterson @ 2021-10-05 13:41 UTC (permalink / raw)
To: Bhola, Bikram, Pavel Machek; +Cc: cip-dev, Jan Kiszka
Hello Bikram,
> From: Bhola, Bikram <Bikram_Bhola@mentor.com>
> Sent: 30 September 2021 12:19
>
> Hi Chris,
>
> We investigated the failure job and looks like before getting login prompt job
> timeout is happening . In the job definition file - job timeout is mentioned
> 15mins and sometimes due to slow network issue, it takes more time while
> downloading, untar and deploying image. So we are seeing timeout during
> login prompt or in some cases in earlier stages also. The work in progress to
> double up the network bandwidth within a few weeks, which will reduce the
> occurrence of this type of issues.
Thank you for your investigation.
I've have increased the timeout as you have suggested:
https://gitlab.com/cip-project/cip-testing/linux-cip-ci/-/merge_requests/49
One additional thing I've noticed, the default x86 character delay during boot is 500ms, which seems a long time inbetween each character sent to the platform
https://lava.ciplatform.org/scheduler/device/x86-simatic-ipc227e-01/devicedict#defline5
Has a lower value for boot_character_delay ever been tried?
Kind regards, Chris
>
> Time being, with an increased job timeout to 20mins, failure is not observed.
> We tested 10 times to be working fine.
> Example :
> https://lava.ciplatform.org/scheduler/device/x86-simatic-ipc227e-01
>
>
> changes in the job definition file
> https://lava.ciplatform.org/scheduler/job/444336/definition
> Current implementation
> ------------------------
> timeouts:
> job:
> minutes: 15
>
> Need to Modify
> -----------------------------------
> timeouts:
> job:
> minutes: 20
>
>
> Regards,
> Bikram
>
> -----Original Message-----
> From: Chris Paterson <Chris.Paterson2@renesas.com>
> Sent: 28 September 2021 15:38
> To: Pavel Machek <pavel@denx.de>; Bhola, Bikram
> <Bikram_Bhola@mentor.com>
> Cc: cip-dev@lists.cip-project.org; Jan Kiszka <jan.kiszka@siemens.com>
> Subject: RE: Prompt timeouts on ipc227e board -- randomness related?
>
> Hello Pavel,
>
> > From: Pavel Machek <pavel@denx.de>
> > Sent: 25 September 2021 21:06
> >
> > Hi!
> >
> > It is not first time I see this failure:
>
> Thank you for reporting the issue.
>
> Bikram is going to take a look for us (thank you).
>
> Kind regards, Chris
>
> >
> >
> https://jpn01.safelinks.protection.outlook.com/?url=https%3A%2F%2Flava.c
> iplatform.org%2Fscheduler%2Fjob%2F444336&data=04%7C01%7CChris.
> Paterson2%40renesas.com%7Cacb37b995a6c41d2090808d98404189e%7C53d
> 82571da1947e49cb4625a166a4a2a%7C0%7C0%7C637685975391318359%7CUn
> known%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6
> Ik1haWwiLCJXVCI6Mn0%3D%7C1000&sdata=yIPWAcblC17x6PpT3TSiGT
> QWiiinQiSuu9a3HRg4u3Q%3D&reserved=0
> >
> >
> > [[0;32m OK [0m] Started Login Service.
> > [[0m[0;31m* [0m] (1 of 2) A start job is running for…ate sshd host keys
> (7s /
> > no limit)[K[[0;1;31m*[0m[0;31m* [0m] (1 of 2) A start job is running
> for…ate
> > sshd host keys (8s / no limit)[K[[0;31m*[0;1;31m*[0m[0;31m* [0m] (1 of 2)
> > A start job is running for…ate sshd host keys (9s / no limit)[K[
> > [0;31m*[0;1;31m*[0m[0;31m* [0m] (2 of 2) A start job is running
> for…evices-
> > eth0.device (8s / 1min 30s)[ 19.855328] systemd[1]: apt-daily-
> > upgrade.timer: Adding 3min 2.027476s random time.
> > [ 19.864207] systemd[1]: apt-daily.timer: Adding 1h 54min 15.794344s
> > random time.
> > [ 21.406490] systemd[1]: apt-daily-upgrade.timer: Adding 55min
> 47.041488s
> > random time.
> > [ 21.415357] systemd[1]: apt-daily.timer: Adding 11h 48min 4.457495s
> > random time.
> > [ 22.049807] systemd[1]: apt-daily-upgrade.timer: Adding 3min 54.125406s
> > random time.
> > [ 22.058500] systemd[1]: apt-daily.timer: Adding 8h 34min 47.388595s
> > random time.
> > [ 22.511646] systemd[1]: apt-daily-upgrade.timer: Adding 25min
> 13.015405s
> > random time.
> > [ 22.520510] systemd[1]: apt-daily.timer: Adding 11h 58min 24.212170s
> > random time.
> > [K[[0;32m OK [0m] Started Regenerate sshd host keys.
> > wait for prompt timed out
> > end: 2.3.4.1 login-action (duration 00:00:24) [common]
> > case: login-action
> > case_id: 9417066
> > definition: lava
> > duration: 23.98
> >
> > Any idea what is going on there? Is it just a test problem, or do we
> > have kernel regression that only happens sometimes?
> >
> > Best regards,
> > Pavel
> > --
> > DENX Software Engineering GmbH, Managing Director: Wolfgang Denk
> > HRB 165235 Munich, Office: Kirchenstr.5, D-82194 Groebenzell, Germany
^ permalink raw reply [flat|nested] 8+ messages in thread
* Re: [cip-dev] Prompt timeouts on ipc227e board -- randomness related?
@ 2021-10-05 13:41 ` Chris Paterson
0 siblings, 0 replies; 8+ messages in thread
From: Chris Paterson @ 2021-10-05 13:41 UTC (permalink / raw)
To: Bhola, Bikram, Pavel Machek; +Cc: cip-dev, Jan Kiszka
[-- Attachment #1: Type: text/plain, Size: 4653 bytes --]
Hello Bikram,
> From: Bhola, Bikram <Bikram_Bhola@mentor.com>
> Sent: 30 September 2021 12:19
>
> Hi Chris,
>
> We investigated the failure job and looks like before getting login prompt job
> timeout is happening . In the job definition file - job timeout is mentioned
> 15mins and sometimes due to slow network issue, it takes more time while
> downloading, untar and deploying image. So we are seeing timeout during
> login prompt or in some cases in earlier stages also. The work in progress to
> double up the network bandwidth within a few weeks, which will reduce the
> occurrence of this type of issues.
Thank you for your investigation.
I've have increased the timeout as you have suggested:
https://gitlab.com/cip-project/cip-testing/linux-cip-ci/-/merge_requests/49
One additional thing I've noticed, the default x86 character delay during boot is 500ms, which seems a long time inbetween each character sent to the platform
https://lava.ciplatform.org/scheduler/device/x86-simatic-ipc227e-01/devicedict#defline5
Has a lower value for boot_character_delay ever been tried?
Kind regards, Chris
>
> Time being, with an increased job timeout to 20mins, failure is not observed.
> We tested 10 times to be working fine.
> Example :
> https://lava.ciplatform.org/scheduler/device/x86-simatic-ipc227e-01
>
>
> changes in the job definition file
> https://lava.ciplatform.org/scheduler/job/444336/definition
> Current implementation
> ------------------------
> timeouts:
> job:
> minutes: 15
>
> Need to Modify
> -----------------------------------
> timeouts:
> job:
> minutes: 20
>
>
> Regards,
> Bikram
>
> -----Original Message-----
> From: Chris Paterson <Chris.Paterson2@renesas.com>
> Sent: 28 September 2021 15:38
> To: Pavel Machek <pavel@denx.de>; Bhola, Bikram
> <Bikram_Bhola@mentor.com>
> Cc: cip-dev@lists.cip-project.org; Jan Kiszka <jan.kiszka@siemens.com>
> Subject: RE: Prompt timeouts on ipc227e board -- randomness related?
>
> Hello Pavel,
>
> > From: Pavel Machek <pavel@denx.de>
> > Sent: 25 September 2021 21:06
> >
> > Hi!
> >
> > It is not first time I see this failure:
>
> Thank you for reporting the issue.
>
> Bikram is going to take a look for us (thank you).
>
> Kind regards, Chris
>
> >
> >
> https://jpn01.safelinks.protection.outlook.com/?url=https%3A%2F%2Flava.c
> iplatform.org%2Fscheduler%2Fjob%2F444336&data=04%7C01%7CChris.
> Paterson2%40renesas.com%7Cacb37b995a6c41d2090808d98404189e%7C53d
> 82571da1947e49cb4625a166a4a2a%7C0%7C0%7C637685975391318359%7CUn
> known%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6
> Ik1haWwiLCJXVCI6Mn0%3D%7C1000&sdata=yIPWAcblC17x6PpT3TSiGT
> QWiiinQiSuu9a3HRg4u3Q%3D&reserved=0
> >
> >
> > [[0;32m OK [0m] Started Login Service.
> > [[0m[0;31m* [0m] (1 of 2) A start job is running for…ate sshd host keys
> (7s /
> > no limit)[K[[0;1;31m*[0m[0;31m* [0m] (1 of 2) A start job is running
> for…ate
> > sshd host keys (8s / no limit)[K[[0;31m*[0;1;31m*[0m[0;31m* [0m] (1 of 2)
> > A start job is running for…ate sshd host keys (9s / no limit)[K[
> > [0;31m*[0;1;31m*[0m[0;31m* [0m] (2 of 2) A start job is running
> for…evices-
> > eth0.device (8s / 1min 30s)[ 19.855328] systemd[1]: apt-daily-
> > upgrade.timer: Adding 3min 2.027476s random time.
> > [ 19.864207] systemd[1]: apt-daily.timer: Adding 1h 54min 15.794344s
> > random time.
> > [ 21.406490] systemd[1]: apt-daily-upgrade.timer: Adding 55min
> 47.041488s
> > random time.
> > [ 21.415357] systemd[1]: apt-daily.timer: Adding 11h 48min 4.457495s
> > random time.
> > [ 22.049807] systemd[1]: apt-daily-upgrade.timer: Adding 3min 54.125406s
> > random time.
> > [ 22.058500] systemd[1]: apt-daily.timer: Adding 8h 34min 47.388595s
> > random time.
> > [ 22.511646] systemd[1]: apt-daily-upgrade.timer: Adding 25min
> 13.015405s
> > random time.
> > [ 22.520510] systemd[1]: apt-daily.timer: Adding 11h 58min 24.212170s
> > random time.
> > [K[[0;32m OK [0m] Started Regenerate sshd host keys.
> > wait for prompt timed out
> > end: 2.3.4.1 login-action (duration 00:00:24) [common]
> > case: login-action
> > case_id: 9417066
> > definition: lava
> > duration: 23.98
> >
> > Any idea what is going on there? Is it just a test problem, or do we
> > have kernel regression that only happens sometimes?
> >
> > Best regards,
> > Pavel
> > --
> > DENX Software Engineering GmbH, Managing Director: Wolfgang Denk
> > HRB 165235 Munich, Office: Kirchenstr.5, D-82194 Groebenzell, Germany
[-- Attachment #2: Type: text/plain, Size: 429 bytes --]
-=-=-=-=-=-=-=-=-=-=-=-
Links: You receive all messages sent to this group.
View/Reply Online (#6785): https://lists.cip-project.org/g/cip-dev/message/6785
Mute This Topic: https://lists.cip-project.org/mt/85867572/4520388
Group Owner: cip-dev+owner@lists.cip-project.org
Unsubscribe: https://lists.cip-project.org/g/cip-dev/leave/10495289/4520388/727948398/xyzzy [cip-dev@archiver.kernel.org]
-=-=-=-=-=-=-=-=-=-=-=-
^ permalink raw reply [flat|nested] 8+ messages in thread
* Re: Prompt timeouts on ipc227e board -- randomness related?
@ 2021-10-05 17:47 ` Pavel Machek
0 siblings, 0 replies; 8+ messages in thread
From: Pavel Machek @ 2021-10-05 17:47 UTC (permalink / raw)
To: Chris Paterson; +Cc: Bhola, Bikram, Pavel Machek, cip-dev, Jan Kiszka
[-- Attachment #1: Type: text/plain, Size: 2274 bytes --]
Hi!
> > We investigated the failure job and looks like before getting login prompt job
> > timeout is happening . In the job definition file - job timeout is mentioned
> > 15mins and sometimes due to slow network issue, it takes more time while
> > downloading, untar and deploying image. So we are seeing timeout during
> > login prompt or in some cases in earlier stages also. The work in progress to
> > double up the network bandwidth within a few weeks, which will reduce the
> > occurrence of this type of issues.
>
> Thank you for your investigation.
> I've have increased the timeout as you have suggested:
> https://gitlab.com/cip-project/cip-testing/linux-cip-ci/-/merge_requests/49
>
> One additional thing I've noticed, the default x86 character delay during boot is 500ms, which seems a long time inbetween each character sent to the platform
> https://lava.ciplatform.org/scheduler/device/x86-simatic-ipc227e-01/devicedict#defline5
>
> Has a lower value for boot_character_delay ever been tried?>
Thank you, but that board seems to still have problems:
https://lava.ciplatform.org/scheduler/job/458108
expect-shell-connection: Wait for prompt ['root@ebsy-isar:~#'] (timeout 00:10:00)
Waiting using forced prompt support (timeout 00:05:00)
end: 2.3.5 expect-shell-connection (duration 00:00:00) [common]
start: 2.3.6 export-device-env (timeout 00:01:00) [common]
Sending with 500 millisecond of delay
export NFS_ROOTFS='/var/lib/lava/dispatcher/tmp/458108/extract-nfsrootfs-5xpf3q6i'
root@ebsy-isar:~# export NFS_ROOTFS='/var/lib/lava/dispatcher/tmp/458108/extract-nfsrootfs-5xpf3q6i'
export NFS_ROOTFS='/var/lib/lava/dispatcher/tmp/458108/extract -nfsrootfs-5xpf3q6i'
Sending with 500 millisecond of delay
export NFS_SERVER_IP='134.86.254.28'
export-device-env timed out after 60 seconds
end: 2.3.6 export-device-env (duration 00:01:00) [common]
case: export-device-env
case_id: 9715377
definition: lava
(Of course, I can't rule out kernel problem at the moment, but failing
at setting environment variable would be strange.)
Best regards,
Pavel
--
DENX Software Engineering GmbH, Managing Director: Wolfgang Denk
HRB 165235 Munich, Office: Kirchenstr.5, D-82194 Groebenzell, Germany
[-- Attachment #2: signature.asc --]
[-- Type: application/pgp-signature, Size: 195 bytes --]
^ permalink raw reply [flat|nested] 8+ messages in thread
* Re: [cip-dev] Prompt timeouts on ipc227e board -- randomness related?
@ 2021-10-05 17:47 ` Pavel Machek
0 siblings, 0 replies; 8+ messages in thread
From: Pavel Machek @ 2021-10-05 17:47 UTC (permalink / raw)
To: Chris Paterson; +Cc: Bhola, Bikram, Pavel Machek, cip-dev, Jan Kiszka
[-- Attachment #1.1: Type: text/plain, Size: 2274 bytes --]
Hi!
> > We investigated the failure job and looks like before getting login prompt job
> > timeout is happening . In the job definition file - job timeout is mentioned
> > 15mins and sometimes due to slow network issue, it takes more time while
> > downloading, untar and deploying image. So we are seeing timeout during
> > login prompt or in some cases in earlier stages also. The work in progress to
> > double up the network bandwidth within a few weeks, which will reduce the
> > occurrence of this type of issues.
>
> Thank you for your investigation.
> I've have increased the timeout as you have suggested:
> https://gitlab.com/cip-project/cip-testing/linux-cip-ci/-/merge_requests/49
>
> One additional thing I've noticed, the default x86 character delay during boot is 500ms, which seems a long time inbetween each character sent to the platform
> https://lava.ciplatform.org/scheduler/device/x86-simatic-ipc227e-01/devicedict#defline5
>
> Has a lower value for boot_character_delay ever been tried?>
Thank you, but that board seems to still have problems:
https://lava.ciplatform.org/scheduler/job/458108
expect-shell-connection: Wait for prompt ['root@ebsy-isar:~#'] (timeout 00:10:00)
Waiting using forced prompt support (timeout 00:05:00)
end: 2.3.5 expect-shell-connection (duration 00:00:00) [common]
start: 2.3.6 export-device-env (timeout 00:01:00) [common]
Sending with 500 millisecond of delay
export NFS_ROOTFS='/var/lib/lava/dispatcher/tmp/458108/extract-nfsrootfs-5xpf3q6i'
root@ebsy-isar:~# export NFS_ROOTFS='/var/lib/lava/dispatcher/tmp/458108/extract-nfsrootfs-5xpf3q6i'
export NFS_ROOTFS='/var/lib/lava/dispatcher/tmp/458108/extract -nfsrootfs-5xpf3q6i'
Sending with 500 millisecond of delay
export NFS_SERVER_IP='134.86.254.28'
export-device-env timed out after 60 seconds
end: 2.3.6 export-device-env (duration 00:01:00) [common]
case: export-device-env
case_id: 9715377
definition: lava
(Of course, I can't rule out kernel problem at the moment, but failing
at setting environment variable would be strange.)
Best regards,
Pavel
--
DENX Software Engineering GmbH, Managing Director: Wolfgang Denk
HRB 165235 Munich, Office: Kirchenstr.5, D-82194 Groebenzell, Germany
[-- Attachment #1.2: signature.asc --]
[-- Type: application/pgp-signature, Size: 195 bytes --]
[-- Attachment #2: Type: text/plain, Size: 429 bytes --]
-=-=-=-=-=-=-=-=-=-=-=-
Links: You receive all messages sent to this group.
View/Reply Online (#6791): https://lists.cip-project.org/g/cip-dev/message/6791
Mute This Topic: https://lists.cip-project.org/mt/85867572/4520388
Group Owner: cip-dev+owner@lists.cip-project.org
Unsubscribe: https://lists.cip-project.org/g/cip-dev/leave/10495289/4520388/727948398/xyzzy [cip-dev@archiver.kernel.org]
-=-=-=-=-=-=-=-=-=-=-=-
^ permalink raw reply [flat|nested] 8+ messages in thread
end of thread, other threads:[~2021-10-05 17:47 UTC | newest]
Thread overview: 8+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2021-09-25 20:06 Prompt timeouts on ipc227e board -- randomness related? Pavel Machek
2021-09-25 20:06 ` [cip-dev] " Pavel Machek
2021-09-28 10:08 ` Chris Paterson
2021-09-28 10:08 ` [cip-dev] " Chris Paterson
[not found] ` <a34a0fcb84f449ef83e2843bec5a6d02@SVR-IES-MBX-03.mgc.mentorg.com>
2021-10-05 13:41 ` Chris Paterson
2021-10-05 13:41 ` [cip-dev] " Chris Paterson
2021-10-05 17:47 ` Pavel Machek
2021-10-05 17:47 ` [cip-dev] " Pavel Machek
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.