* Prompt timeouts on ipc227e board -- randomness related? @ 2021-09-25 20:06 Pavel Machek 2021-09-25 20:06 ` [cip-dev] " Pavel Machek 2021-09-28 10:08 ` Chris Paterson 0 siblings, 2 replies; 8+ messages in thread From: Pavel Machek @ 2021-09-25 20:06 UTC (permalink / raw) To: Chris.Paterson2, cip-dev [-- Attachment #1: Type: text/plain, Size: 1775 bytes --] Hi! It is not first time I see this failure: https://lava.ciplatform.org/scheduler/job/444336 [[0;32m OK [0m] Started Login Service. [[0m[0;31m* [0m] (1 of 2) A start job is running for…ate sshd host keys (7s / no limit)[K[[0;1;31m*[0m[0;31m* [0m] (1 of 2) A start job is running for…ate sshd host keys (8s / no limit)[K[[0;31m*[0;1;31m*[0m[0;31m* [0m] (1 of 2) A start job is running for…ate sshd host keys (9s / no limit)[K[ [0;31m*[0;1;31m*[0m[0;31m* [0m] (2 of 2) A start job is running for…evices-eth0.device (8s / 1min 30s)[ 19.855328] systemd[1]: apt-daily-upgrade.timer: Adding 3min 2.027476s random time. [ 19.864207] systemd[1]: apt-daily.timer: Adding 1h 54min 15.794344s random time. [ 21.406490] systemd[1]: apt-daily-upgrade.timer: Adding 55min 47.041488s random time. [ 21.415357] systemd[1]: apt-daily.timer: Adding 11h 48min 4.457495s random time. [ 22.049807] systemd[1]: apt-daily-upgrade.timer: Adding 3min 54.125406s random time. [ 22.058500] systemd[1]: apt-daily.timer: Adding 8h 34min 47.388595s random time. [ 22.511646] systemd[1]: apt-daily-upgrade.timer: Adding 25min 13.015405s random time. [ 22.520510] systemd[1]: apt-daily.timer: Adding 11h 58min 24.212170s random time. [K[[0;32m OK [0m] Started Regenerate sshd host keys. wait for prompt timed out end: 2.3.4.1 login-action (duration 00:00:24) [common] case: login-action case_id: 9417066 definition: lava duration: 23.98 Any idea what is going on there? Is it just a test problem, or do we have kernel regression that only happens sometimes? Best regards, Pavel -- DENX Software Engineering GmbH, Managing Director: Wolfgang Denk HRB 165235 Munich, Office: Kirchenstr.5, D-82194 Groebenzell, Germany [-- Attachment #2: signature.asc --] [-- Type: application/pgp-signature, Size: 195 bytes --] ^ permalink raw reply [flat|nested] 8+ messages in thread
* [cip-dev] Prompt timeouts on ipc227e board -- randomness related? 2021-09-25 20:06 Prompt timeouts on ipc227e board -- randomness related? Pavel Machek @ 2021-09-25 20:06 ` Pavel Machek 2021-09-28 10:08 ` Chris Paterson 1 sibling, 0 replies; 8+ messages in thread From: Pavel Machek @ 2021-09-25 20:06 UTC (permalink / raw) To: Chris.Paterson2, cip-dev [-- Attachment #1.1: Type: text/plain, Size: 1775 bytes --] Hi! It is not first time I see this failure: https://lava.ciplatform.org/scheduler/job/444336 [[0;32m OK [0m] Started Login Service. [[0m[0;31m* [0m] (1 of 2) A start job is running for…ate sshd host keys (7s / no limit)[K[[0;1;31m*[0m[0;31m* [0m] (1 of 2) A start job is running for…ate sshd host keys (8s / no limit)[K[[0;31m*[0;1;31m*[0m[0;31m* [0m] (1 of 2) A start job is running for…ate sshd host keys (9s / no limit)[K[ [0;31m*[0;1;31m*[0m[0;31m* [0m] (2 of 2) A start job is running for…evices-eth0.device (8s / 1min 30s)[ 19.855328] systemd[1]: apt-daily-upgrade.timer: Adding 3min 2.027476s random time. [ 19.864207] systemd[1]: apt-daily.timer: Adding 1h 54min 15.794344s random time. [ 21.406490] systemd[1]: apt-daily-upgrade.timer: Adding 55min 47.041488s random time. [ 21.415357] systemd[1]: apt-daily.timer: Adding 11h 48min 4.457495s random time. [ 22.049807] systemd[1]: apt-daily-upgrade.timer: Adding 3min 54.125406s random time. [ 22.058500] systemd[1]: apt-daily.timer: Adding 8h 34min 47.388595s random time. [ 22.511646] systemd[1]: apt-daily-upgrade.timer: Adding 25min 13.015405s random time. [ 22.520510] systemd[1]: apt-daily.timer: Adding 11h 58min 24.212170s random time. [K[[0;32m OK [0m] Started Regenerate sshd host keys. wait for prompt timed out end: 2.3.4.1 login-action (duration 00:00:24) [common] case: login-action case_id: 9417066 definition: lava duration: 23.98 Any idea what is going on there? Is it just a test problem, or do we have kernel regression that only happens sometimes? Best regards, Pavel -- DENX Software Engineering GmbH, Managing Director: Wolfgang Denk HRB 165235 Munich, Office: Kirchenstr.5, D-82194 Groebenzell, Germany [-- Attachment #1.2: signature.asc --] [-- Type: application/pgp-signature, Size: 195 bytes --] [-- Attachment #2: Type: text/plain, Size: 429 bytes --] -=-=-=-=-=-=-=-=-=-=-=- Links: You receive all messages sent to this group. View/Reply Online (#6749): https://lists.cip-project.org/g/cip-dev/message/6749 Mute This Topic: https://lists.cip-project.org/mt/85867572/4520388 Group Owner: cip-dev+owner@lists.cip-project.org Unsubscribe: https://lists.cip-project.org/g/cip-dev/leave/10495289/4520388/727948398/xyzzy [cip-dev@archiver.kernel.org] -=-=-=-=-=-=-=-=-=-=-=- ^ permalink raw reply [flat|nested] 8+ messages in thread
* RE: Prompt timeouts on ipc227e board -- randomness related? 2021-09-25 20:06 Prompt timeouts on ipc227e board -- randomness related? Pavel Machek 2021-09-25 20:06 ` [cip-dev] " Pavel Machek @ 2021-09-28 10:08 ` Chris Paterson 2021-09-28 10:08 ` [cip-dev] " Chris Paterson [not found] ` <a34a0fcb84f449ef83e2843bec5a6d02@SVR-IES-MBX-03.mgc.mentorg.com> 1 sibling, 2 replies; 8+ messages in thread From: Chris Paterson @ 2021-09-28 10:08 UTC (permalink / raw) To: Pavel Machek, Bhola, Bikram; +Cc: cip-dev, Jan Kiszka Hello Pavel, > From: Pavel Machek <pavel@denx.de> > Sent: 25 September 2021 21:06 > > Hi! > > It is not first time I see this failure: Thank you for reporting the issue. Bikram is going to take a look for us (thank you). Kind regards, Chris > > https://lava.ciplatform.org/scheduler/job/444336 > > > [[0;32m OK [0m] Started Login Service. > [[0m[0;31m* [0m] (1 of 2) A start job is running for…ate sshd host keys (7s / > no limit)[K[[0;1;31m*[0m[0;31m* [0m] (1 of 2) A start job is running for…ate > sshd host keys (8s / no limit)[K[[0;31m*[0;1;31m*[0m[0;31m* [0m] (1 of 2) > A start job is running for…ate sshd host keys (9s / no limit)[K[ > [0;31m*[0;1;31m*[0m[0;31m* [0m] (2 of 2) A start job is running for…evices- > eth0.device (8s / 1min 30s)[ 19.855328] systemd[1]: apt-daily- > upgrade.timer: Adding 3min 2.027476s random time. > [ 19.864207] systemd[1]: apt-daily.timer: Adding 1h 54min 15.794344s > random time. > [ 21.406490] systemd[1]: apt-daily-upgrade.timer: Adding 55min 47.041488s > random time. > [ 21.415357] systemd[1]: apt-daily.timer: Adding 11h 48min 4.457495s > random time. > [ 22.049807] systemd[1]: apt-daily-upgrade.timer: Adding 3min 54.125406s > random time. > [ 22.058500] systemd[1]: apt-daily.timer: Adding 8h 34min 47.388595s > random time. > [ 22.511646] systemd[1]: apt-daily-upgrade.timer: Adding 25min 13.015405s > random time. > [ 22.520510] systemd[1]: apt-daily.timer: Adding 11h 58min 24.212170s > random time. > [K[[0;32m OK [0m] Started Regenerate sshd host keys. > wait for prompt timed out > end: 2.3.4.1 login-action (duration 00:00:24) [common] > case: login-action > case_id: 9417066 > definition: lava > duration: 23.98 > > Any idea what is going on there? Is it just a test problem, or do we > have kernel regression that only happens sometimes? > > Best regards, > Pavel > -- > DENX Software Engineering GmbH, Managing Director: Wolfgang Denk > HRB 165235 Munich, Office: Kirchenstr.5, D-82194 Groebenzell, Germany ^ permalink raw reply [flat|nested] 8+ messages in thread
* Re: [cip-dev] Prompt timeouts on ipc227e board -- randomness related? 2021-09-28 10:08 ` Chris Paterson @ 2021-09-28 10:08 ` Chris Paterson [not found] ` <a34a0fcb84f449ef83e2843bec5a6d02@SVR-IES-MBX-03.mgc.mentorg.com> 1 sibling, 0 replies; 8+ messages in thread From: Chris Paterson @ 2021-09-28 10:08 UTC (permalink / raw) To: Pavel Machek, Bhola, Bikram; +Cc: cip-dev, Jan Kiszka [-- Attachment #1: Type: text/plain, Size: 2087 bytes --] Hello Pavel, > From: Pavel Machek <pavel@denx.de> > Sent: 25 September 2021 21:06 > > Hi! > > It is not first time I see this failure: Thank you for reporting the issue. Bikram is going to take a look for us (thank you). Kind regards, Chris > > https://lava.ciplatform.org/scheduler/job/444336 > > > [[0;32m OK [0m] Started Login Service. > [[0m[0;31m* [0m] (1 of 2) A start job is running for…ate sshd host keys (7s / > no limit)[K[[0;1;31m*[0m[0;31m* [0m] (1 of 2) A start job is running for…ate > sshd host keys (8s / no limit)[K[[0;31m*[0;1;31m*[0m[0;31m* [0m] (1 of 2) > A start job is running for…ate sshd host keys (9s / no limit)[K[ > [0;31m*[0;1;31m*[0m[0;31m* [0m] (2 of 2) A start job is running for…evices- > eth0.device (8s / 1min 30s)[ 19.855328] systemd[1]: apt-daily- > upgrade.timer: Adding 3min 2.027476s random time. > [ 19.864207] systemd[1]: apt-daily.timer: Adding 1h 54min 15.794344s > random time. > [ 21.406490] systemd[1]: apt-daily-upgrade.timer: Adding 55min 47.041488s > random time. > [ 21.415357] systemd[1]: apt-daily.timer: Adding 11h 48min 4.457495s > random time. > [ 22.049807] systemd[1]: apt-daily-upgrade.timer: Adding 3min 54.125406s > random time. > [ 22.058500] systemd[1]: apt-daily.timer: Adding 8h 34min 47.388595s > random time. > [ 22.511646] systemd[1]: apt-daily-upgrade.timer: Adding 25min 13.015405s > random time. > [ 22.520510] systemd[1]: apt-daily.timer: Adding 11h 58min 24.212170s > random time. > [K[[0;32m OK [0m] Started Regenerate sshd host keys. > wait for prompt timed out > end: 2.3.4.1 login-action (duration 00:00:24) [common] > case: login-action > case_id: 9417066 > definition: lava > duration: 23.98 > > Any idea what is going on there? Is it just a test problem, or do we > have kernel regression that only happens sometimes? > > Best regards, > Pavel > -- > DENX Software Engineering GmbH, Managing Director: Wolfgang Denk > HRB 165235 Munich, Office: Kirchenstr.5, D-82194 Groebenzell, Germany [-- Attachment #2: Type: text/plain, Size: 429 bytes --] -=-=-=-=-=-=-=-=-=-=-=- Links: You receive all messages sent to this group. View/Reply Online (#6754): https://lists.cip-project.org/g/cip-dev/message/6754 Mute This Topic: https://lists.cip-project.org/mt/85867572/4520388 Group Owner: cip-dev+owner@lists.cip-project.org Unsubscribe: https://lists.cip-project.org/g/cip-dev/leave/10495289/4520388/727948398/xyzzy [cip-dev@archiver.kernel.org] -=-=-=-=-=-=-=-=-=-=-=- ^ permalink raw reply [flat|nested] 8+ messages in thread
[parent not found: <a34a0fcb84f449ef83e2843bec5a6d02@SVR-IES-MBX-03.mgc.mentorg.com>]
* RE: Prompt timeouts on ipc227e board -- randomness related? [not found] ` <a34a0fcb84f449ef83e2843bec5a6d02@SVR-IES-MBX-03.mgc.mentorg.com> @ 2021-10-05 13:41 ` Chris Paterson 2021-10-05 13:41 ` [cip-dev] " Chris Paterson 2021-10-05 17:47 ` Pavel Machek 0 siblings, 2 replies; 8+ messages in thread From: Chris Paterson @ 2021-10-05 13:41 UTC (permalink / raw) To: Bhola, Bikram, Pavel Machek; +Cc: cip-dev, Jan Kiszka Hello Bikram, > From: Bhola, Bikram <Bikram_Bhola@mentor.com> > Sent: 30 September 2021 12:19 > > Hi Chris, > > We investigated the failure job and looks like before getting login prompt job > timeout is happening . In the job definition file - job timeout is mentioned > 15mins and sometimes due to slow network issue, it takes more time while > downloading, untar and deploying image. So we are seeing timeout during > login prompt or in some cases in earlier stages also. The work in progress to > double up the network bandwidth within a few weeks, which will reduce the > occurrence of this type of issues. Thank you for your investigation. I've have increased the timeout as you have suggested: https://gitlab.com/cip-project/cip-testing/linux-cip-ci/-/merge_requests/49 One additional thing I've noticed, the default x86 character delay during boot is 500ms, which seems a long time inbetween each character sent to the platform https://lava.ciplatform.org/scheduler/device/x86-simatic-ipc227e-01/devicedict#defline5 Has a lower value for boot_character_delay ever been tried? Kind regards, Chris > > Time being, with an increased job timeout to 20mins, failure is not observed. > We tested 10 times to be working fine. > Example : > https://lava.ciplatform.org/scheduler/device/x86-simatic-ipc227e-01 > > > changes in the job definition file > https://lava.ciplatform.org/scheduler/job/444336/definition > Current implementation > ------------------------ > timeouts: > job: > minutes: 15 > > Need to Modify > ----------------------------------- > timeouts: > job: > minutes: 20 > > > Regards, > Bikram > > -----Original Message----- > From: Chris Paterson <Chris.Paterson2@renesas.com> > Sent: 28 September 2021 15:38 > To: Pavel Machek <pavel@denx.de>; Bhola, Bikram > <Bikram_Bhola@mentor.com> > Cc: cip-dev@lists.cip-project.org; Jan Kiszka <jan.kiszka@siemens.com> > Subject: RE: Prompt timeouts on ipc227e board -- randomness related? > > Hello Pavel, > > > From: Pavel Machek <pavel@denx.de> > > Sent: 25 September 2021 21:06 > > > > Hi! > > > > It is not first time I see this failure: > > Thank you for reporting the issue. > > Bikram is going to take a look for us (thank you). > > Kind regards, Chris > > > > > > https://jpn01.safelinks.protection.outlook.com/?url=https%3A%2F%2Flava.c > iplatform.org%2Fscheduler%2Fjob%2F444336&data=04%7C01%7CChris. > Paterson2%40renesas.com%7Cacb37b995a6c41d2090808d98404189e%7C53d > 82571da1947e49cb4625a166a4a2a%7C0%7C0%7C637685975391318359%7CUn > known%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6 > Ik1haWwiLCJXVCI6Mn0%3D%7C1000&sdata=yIPWAcblC17x6PpT3TSiGT > QWiiinQiSuu9a3HRg4u3Q%3D&reserved=0 > > > > > > [[0;32m OK [0m] Started Login Service. > > [[0m[0;31m* [0m] (1 of 2) A start job is running for…ate sshd host keys > (7s / > > no limit)[K[[0;1;31m*[0m[0;31m* [0m] (1 of 2) A start job is running > for…ate > > sshd host keys (8s / no limit)[K[[0;31m*[0;1;31m*[0m[0;31m* [0m] (1 of 2) > > A start job is running for…ate sshd host keys (9s / no limit)[K[ > > [0;31m*[0;1;31m*[0m[0;31m* [0m] (2 of 2) A start job is running > for…evices- > > eth0.device (8s / 1min 30s)[ 19.855328] systemd[1]: apt-daily- > > upgrade.timer: Adding 3min 2.027476s random time. > > [ 19.864207] systemd[1]: apt-daily.timer: Adding 1h 54min 15.794344s > > random time. > > [ 21.406490] systemd[1]: apt-daily-upgrade.timer: Adding 55min > 47.041488s > > random time. > > [ 21.415357] systemd[1]: apt-daily.timer: Adding 11h 48min 4.457495s > > random time. > > [ 22.049807] systemd[1]: apt-daily-upgrade.timer: Adding 3min 54.125406s > > random time. > > [ 22.058500] systemd[1]: apt-daily.timer: Adding 8h 34min 47.388595s > > random time. > > [ 22.511646] systemd[1]: apt-daily-upgrade.timer: Adding 25min > 13.015405s > > random time. > > [ 22.520510] systemd[1]: apt-daily.timer: Adding 11h 58min 24.212170s > > random time. > > [K[[0;32m OK [0m] Started Regenerate sshd host keys. > > wait for prompt timed out > > end: 2.3.4.1 login-action (duration 00:00:24) [common] > > case: login-action > > case_id: 9417066 > > definition: lava > > duration: 23.98 > > > > Any idea what is going on there? Is it just a test problem, or do we > > have kernel regression that only happens sometimes? > > > > Best regards, > > Pavel > > -- > > DENX Software Engineering GmbH, Managing Director: Wolfgang Denk > > HRB 165235 Munich, Office: Kirchenstr.5, D-82194 Groebenzell, Germany ^ permalink raw reply [flat|nested] 8+ messages in thread
* Re: [cip-dev] Prompt timeouts on ipc227e board -- randomness related? 2021-10-05 13:41 ` Chris Paterson @ 2021-10-05 13:41 ` Chris Paterson 2021-10-05 17:47 ` Pavel Machek 1 sibling, 0 replies; 8+ messages in thread From: Chris Paterson @ 2021-10-05 13:41 UTC (permalink / raw) To: Bhola, Bikram, Pavel Machek; +Cc: cip-dev, Jan Kiszka [-- Attachment #1: Type: text/plain, Size: 4653 bytes --] Hello Bikram, > From: Bhola, Bikram <Bikram_Bhola@mentor.com> > Sent: 30 September 2021 12:19 > > Hi Chris, > > We investigated the failure job and looks like before getting login prompt job > timeout is happening . In the job definition file - job timeout is mentioned > 15mins and sometimes due to slow network issue, it takes more time while > downloading, untar and deploying image. So we are seeing timeout during > login prompt or in some cases in earlier stages also. The work in progress to > double up the network bandwidth within a few weeks, which will reduce the > occurrence of this type of issues. Thank you for your investigation. I've have increased the timeout as you have suggested: https://gitlab.com/cip-project/cip-testing/linux-cip-ci/-/merge_requests/49 One additional thing I've noticed, the default x86 character delay during boot is 500ms, which seems a long time inbetween each character sent to the platform https://lava.ciplatform.org/scheduler/device/x86-simatic-ipc227e-01/devicedict#defline5 Has a lower value for boot_character_delay ever been tried? Kind regards, Chris > > Time being, with an increased job timeout to 20mins, failure is not observed. > We tested 10 times to be working fine. > Example : > https://lava.ciplatform.org/scheduler/device/x86-simatic-ipc227e-01 > > > changes in the job definition file > https://lava.ciplatform.org/scheduler/job/444336/definition > Current implementation > ------------------------ > timeouts: > job: > minutes: 15 > > Need to Modify > ----------------------------------- > timeouts: > job: > minutes: 20 > > > Regards, > Bikram > > -----Original Message----- > From: Chris Paterson <Chris.Paterson2@renesas.com> > Sent: 28 September 2021 15:38 > To: Pavel Machek <pavel@denx.de>; Bhola, Bikram > <Bikram_Bhola@mentor.com> > Cc: cip-dev@lists.cip-project.org; Jan Kiszka <jan.kiszka@siemens.com> > Subject: RE: Prompt timeouts on ipc227e board -- randomness related? > > Hello Pavel, > > > From: Pavel Machek <pavel@denx.de> > > Sent: 25 September 2021 21:06 > > > > Hi! > > > > It is not first time I see this failure: > > Thank you for reporting the issue. > > Bikram is going to take a look for us (thank you). > > Kind regards, Chris > > > > > > https://jpn01.safelinks.protection.outlook.com/?url=https%3A%2F%2Flava.c > iplatform.org%2Fscheduler%2Fjob%2F444336&data=04%7C01%7CChris. > Paterson2%40renesas.com%7Cacb37b995a6c41d2090808d98404189e%7C53d > 82571da1947e49cb4625a166a4a2a%7C0%7C0%7C637685975391318359%7CUn > known%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6 > Ik1haWwiLCJXVCI6Mn0%3D%7C1000&sdata=yIPWAcblC17x6PpT3TSiGT > QWiiinQiSuu9a3HRg4u3Q%3D&reserved=0 > > > > > > [[0;32m OK [0m] Started Login Service. > > [[0m[0;31m* [0m] (1 of 2) A start job is running for…ate sshd host keys > (7s / > > no limit)[K[[0;1;31m*[0m[0;31m* [0m] (1 of 2) A start job is running > for…ate > > sshd host keys (8s / no limit)[K[[0;31m*[0;1;31m*[0m[0;31m* [0m] (1 of 2) > > A start job is running for…ate sshd host keys (9s / no limit)[K[ > > [0;31m*[0;1;31m*[0m[0;31m* [0m] (2 of 2) A start job is running > for…evices- > > eth0.device (8s / 1min 30s)[ 19.855328] systemd[1]: apt-daily- > > upgrade.timer: Adding 3min 2.027476s random time. > > [ 19.864207] systemd[1]: apt-daily.timer: Adding 1h 54min 15.794344s > > random time. > > [ 21.406490] systemd[1]: apt-daily-upgrade.timer: Adding 55min > 47.041488s > > random time. > > [ 21.415357] systemd[1]: apt-daily.timer: Adding 11h 48min 4.457495s > > random time. > > [ 22.049807] systemd[1]: apt-daily-upgrade.timer: Adding 3min 54.125406s > > random time. > > [ 22.058500] systemd[1]: apt-daily.timer: Adding 8h 34min 47.388595s > > random time. > > [ 22.511646] systemd[1]: apt-daily-upgrade.timer: Adding 25min > 13.015405s > > random time. > > [ 22.520510] systemd[1]: apt-daily.timer: Adding 11h 58min 24.212170s > > random time. > > [K[[0;32m OK [0m] Started Regenerate sshd host keys. > > wait for prompt timed out > > end: 2.3.4.1 login-action (duration 00:00:24) [common] > > case: login-action > > case_id: 9417066 > > definition: lava > > duration: 23.98 > > > > Any idea what is going on there? Is it just a test problem, or do we > > have kernel regression that only happens sometimes? > > > > Best regards, > > Pavel > > -- > > DENX Software Engineering GmbH, Managing Director: Wolfgang Denk > > HRB 165235 Munich, Office: Kirchenstr.5, D-82194 Groebenzell, Germany [-- Attachment #2: Type: text/plain, Size: 429 bytes --] -=-=-=-=-=-=-=-=-=-=-=- Links: You receive all messages sent to this group. View/Reply Online (#6785): https://lists.cip-project.org/g/cip-dev/message/6785 Mute This Topic: https://lists.cip-project.org/mt/85867572/4520388 Group Owner: cip-dev+owner@lists.cip-project.org Unsubscribe: https://lists.cip-project.org/g/cip-dev/leave/10495289/4520388/727948398/xyzzy [cip-dev@archiver.kernel.org] -=-=-=-=-=-=-=-=-=-=-=- ^ permalink raw reply [flat|nested] 8+ messages in thread
* Re: Prompt timeouts on ipc227e board -- randomness related? 2021-10-05 13:41 ` Chris Paterson 2021-10-05 13:41 ` [cip-dev] " Chris Paterson @ 2021-10-05 17:47 ` Pavel Machek 2021-10-05 17:47 ` [cip-dev] " Pavel Machek 1 sibling, 1 reply; 8+ messages in thread From: Pavel Machek @ 2021-10-05 17:47 UTC (permalink / raw) To: Chris Paterson; +Cc: Bhola, Bikram, Pavel Machek, cip-dev, Jan Kiszka [-- Attachment #1: Type: text/plain, Size: 2274 bytes --] Hi! > > We investigated the failure job and looks like before getting login prompt job > > timeout is happening . In the job definition file - job timeout is mentioned > > 15mins and sometimes due to slow network issue, it takes more time while > > downloading, untar and deploying image. So we are seeing timeout during > > login prompt or in some cases in earlier stages also. The work in progress to > > double up the network bandwidth within a few weeks, which will reduce the > > occurrence of this type of issues. > > Thank you for your investigation. > I've have increased the timeout as you have suggested: > https://gitlab.com/cip-project/cip-testing/linux-cip-ci/-/merge_requests/49 > > One additional thing I've noticed, the default x86 character delay during boot is 500ms, which seems a long time inbetween each character sent to the platform > https://lava.ciplatform.org/scheduler/device/x86-simatic-ipc227e-01/devicedict#defline5 > > Has a lower value for boot_character_delay ever been tried?> Thank you, but that board seems to still have problems: https://lava.ciplatform.org/scheduler/job/458108 expect-shell-connection: Wait for prompt ['root@ebsy-isar:~#'] (timeout 00:10:00) Waiting using forced prompt support (timeout 00:05:00) end: 2.3.5 expect-shell-connection (duration 00:00:00) [common] start: 2.3.6 export-device-env (timeout 00:01:00) [common] Sending with 500 millisecond of delay export NFS_ROOTFS='/var/lib/lava/dispatcher/tmp/458108/extract-nfsrootfs-5xpf3q6i' root@ebsy-isar:~# export NFS_ROOTFS='/var/lib/lava/dispatcher/tmp/458108/extract-nfsrootfs-5xpf3q6i' export NFS_ROOTFS='/var/lib/lava/dispatcher/tmp/458108/extract -nfsrootfs-5xpf3q6i' Sending with 500 millisecond of delay export NFS_SERVER_IP='134.86.254.28' export-device-env timed out after 60 seconds end: 2.3.6 export-device-env (duration 00:01:00) [common] case: export-device-env case_id: 9715377 definition: lava (Of course, I can't rule out kernel problem at the moment, but failing at setting environment variable would be strange.) Best regards, Pavel -- DENX Software Engineering GmbH, Managing Director: Wolfgang Denk HRB 165235 Munich, Office: Kirchenstr.5, D-82194 Groebenzell, Germany [-- Attachment #2: signature.asc --] [-- Type: application/pgp-signature, Size: 195 bytes --] ^ permalink raw reply [flat|nested] 8+ messages in thread
* Re: [cip-dev] Prompt timeouts on ipc227e board -- randomness related? 2021-10-05 17:47 ` Pavel Machek @ 2021-10-05 17:47 ` Pavel Machek 0 siblings, 0 replies; 8+ messages in thread From: Pavel Machek @ 2021-10-05 17:47 UTC (permalink / raw) To: Chris Paterson; +Cc: Bhola, Bikram, Pavel Machek, cip-dev, Jan Kiszka [-- Attachment #1.1: Type: text/plain, Size: 2274 bytes --] Hi! > > We investigated the failure job and looks like before getting login prompt job > > timeout is happening . In the job definition file - job timeout is mentioned > > 15mins and sometimes due to slow network issue, it takes more time while > > downloading, untar and deploying image. So we are seeing timeout during > > login prompt or in some cases in earlier stages also. The work in progress to > > double up the network bandwidth within a few weeks, which will reduce the > > occurrence of this type of issues. > > Thank you for your investigation. > I've have increased the timeout as you have suggested: > https://gitlab.com/cip-project/cip-testing/linux-cip-ci/-/merge_requests/49 > > One additional thing I've noticed, the default x86 character delay during boot is 500ms, which seems a long time inbetween each character sent to the platform > https://lava.ciplatform.org/scheduler/device/x86-simatic-ipc227e-01/devicedict#defline5 > > Has a lower value for boot_character_delay ever been tried?> Thank you, but that board seems to still have problems: https://lava.ciplatform.org/scheduler/job/458108 expect-shell-connection: Wait for prompt ['root@ebsy-isar:~#'] (timeout 00:10:00) Waiting using forced prompt support (timeout 00:05:00) end: 2.3.5 expect-shell-connection (duration 00:00:00) [common] start: 2.3.6 export-device-env (timeout 00:01:00) [common] Sending with 500 millisecond of delay export NFS_ROOTFS='/var/lib/lava/dispatcher/tmp/458108/extract-nfsrootfs-5xpf3q6i' root@ebsy-isar:~# export NFS_ROOTFS='/var/lib/lava/dispatcher/tmp/458108/extract-nfsrootfs-5xpf3q6i' export NFS_ROOTFS='/var/lib/lava/dispatcher/tmp/458108/extract -nfsrootfs-5xpf3q6i' Sending with 500 millisecond of delay export NFS_SERVER_IP='134.86.254.28' export-device-env timed out after 60 seconds end: 2.3.6 export-device-env (duration 00:01:00) [common] case: export-device-env case_id: 9715377 definition: lava (Of course, I can't rule out kernel problem at the moment, but failing at setting environment variable would be strange.) Best regards, Pavel -- DENX Software Engineering GmbH, Managing Director: Wolfgang Denk HRB 165235 Munich, Office: Kirchenstr.5, D-82194 Groebenzell, Germany [-- Attachment #1.2: signature.asc --] [-- Type: application/pgp-signature, Size: 195 bytes --] [-- Attachment #2: Type: text/plain, Size: 429 bytes --] -=-=-=-=-=-=-=-=-=-=-=- Links: You receive all messages sent to this group. View/Reply Online (#6791): https://lists.cip-project.org/g/cip-dev/message/6791 Mute This Topic: https://lists.cip-project.org/mt/85867572/4520388 Group Owner: cip-dev+owner@lists.cip-project.org Unsubscribe: https://lists.cip-project.org/g/cip-dev/leave/10495289/4520388/727948398/xyzzy [cip-dev@archiver.kernel.org] -=-=-=-=-=-=-=-=-=-=-=- ^ permalink raw reply [flat|nested] 8+ messages in thread
end of thread, other threads:[~2021-10-05 17:47 UTC | newest] Thread overview: 8+ messages (download: mbox.gz / follow: Atom feed) -- links below jump to the message on this page -- 2021-09-25 20:06 Prompt timeouts on ipc227e board -- randomness related? Pavel Machek 2021-09-25 20:06 ` [cip-dev] " Pavel Machek 2021-09-28 10:08 ` Chris Paterson 2021-09-28 10:08 ` [cip-dev] " Chris Paterson [not found] ` <a34a0fcb84f449ef83e2843bec5a6d02@SVR-IES-MBX-03.mgc.mentorg.com> 2021-10-05 13:41 ` Chris Paterson 2021-10-05 13:41 ` [cip-dev] " Chris Paterson 2021-10-05 17:47 ` Pavel Machek 2021-10-05 17:47 ` [cip-dev] " Pavel Machek
This is a public inbox, see mirroring instructions for how to clone and mirror all data and code used for this inbox; as well as URLs for NNTP newsgroup(s).