All of lore.kernel.org
 help / color / mirror / Atom feed
* kselftest: net: tls: hangs
@ 2022-03-25  7:40 Naresh Kamboju
  2022-03-25 22:13 ` Shuah Khan
  0 siblings, 1 reply; 9+ messages in thread
From: Naresh Kamboju @ 2022-03-25  7:40 UTC (permalink / raw)
  To: open list:KERNEL SELFTEST FRAMEWORK, open list,
	Linux-Next Mailing List, Netdev, lkft-triage
  Cc: Shuah Khan, David S. Miller, Jakub Kicinski, Paolo Abeni

While running kselftest net tls test case on Linux next and mainline kernels
the test case fails at following sub test cases and hangs every time.
Please investigate this hang issue.

kconfigs are generated from kselftest-merge config.

metadata:
  git_ref: master
  git_repo: https://gitlab.com/Linaro/lkft/mirrors/next/linux-next
  git_sha: b61581ae229d8eb9f21f8753be3f4011f7692384
  git_describe: next-20220323
  kernel_version: 5.17.0
  kernel-config: https://builds.tuxbuild.com/26mKij4yB5Q6WUpOyHHEoHLstVJ/config

Test log link,
--------------
# selftests: net: tls
# TAP version 13
# 1..502
# # Starting 502 tests from 14 test cases.
# #  RUN           global.non_established ...
# #            OK  global.non_established
# ok 1 global.non_established
# #  RUN           global.keysizes ...
# #            OK  global.keysizes
<trim>

# #  RUN           tls.12_aes_gcm.splice_cmsg_to_pipe ...
# # tls.c:688:splice_cmsg_to_pipe:Expected splice(self->cfd, NULL,
p[1], NULL, send_len, 0) (10) == -1 (-1)
# # tls.c:689:splice_cmsg_to_pipe:Expected errno (2) == EINVAL (22)
# # splice_cmsg_to_pipe: Test terminated by timeout
# #          FAIL  tls.12_aes_gcm.splice_cmsg_to_pipe
# not ok 21 tls.12_aes_gcm.splice_cmsg_to_pipe
# #  RUN           tls.12_aes_gcm.splice_dec_cmsg_to_pipe ...
# # tls.c:708:splice_dec_cmsg_to_pipe:Expected recv(self->cfd, buf,
send_len, 0) (10) == -1 (-1)
# # tls.c:709:splice_dec_cmsg_to_pipe:Expected errno (2) == EIO (5)
# # splice_dec_cmsg_to_pipe: Test terminated by timeout
# #          FAIL  tls.12_aes_gcm.splice_dec_cmsg_to_pipe
# not ok 22 tls.12_aes_gcm.splice_dec_cmsg_to_pipe
# #  RUN           tls.12_aes_gcm.recv_and_splice ...
# #            OK  tls.12_aes_gcm.recv_and_splice

<trim>

# #  RUN           tls.13_aes_gcm.splice_cmsg_to_pipe ...
# # tls.c:688:splice_cmsg_to_pipe:Expected splice(self->cfd, NULL,
p[1], NULL, send_len, 0) (10) == -1 (-1)
# # tls.c:689:splice_cmsg_to_pipe:Expected errno (2) == EINVAL (22)
# # splice_cmsg_to_pipe: Test terminated by timeout
# #          FAIL  tls.13_aes_gcm.splice_cmsg_to_pipe
# not ok 70 tls.13_aes_gcm.splice_cmsg_to_pipe
# #  RUN           tls.13_aes_gcm.splice_dec_cmsg_to_pipe ...
# # tls.c:708:splice_dec_cmsg_to_pipe:Expected recv(self->cfd, buf,
send_len, 0) (10) == -1 (-1)
# # tls.c:709:splice_dec_cmsg_to_pipe:Expected errno (2) == EIO (5)
# # splice_dec_cmsg_to_pipe: Test terminated by timeout
# #          FAIL  tls.13_aes_gcm.splice_dec_cmsg_to_pipe
# not ok 71 tls.13_aes_gcm.splice_dec_cmsg_to_pipe


<trim>

# #  RUN           tls.12_chacha.splice_cmsg_to_pipe ...
# # tls.c:688:splice_cmsg_to_pipe:Expected splice(self->cfd, NULL,
p[1], NULL, send_len, 0) (10) == -1 (-1)
# # tls.c:689:splice_cmsg_to_pipe:Expected errno (2) == EINVAL (22)
# # splice_cmsg_to_pipe: Test terminated by timeout
# #          FAIL  tls.12_chacha.splice_cmsg_to_pipe
# not ok 119 tls.12_chacha.splice_cmsg_to_pipe
# #  RUN           tls.12_chacha.splice_dec_cmsg_to_pipe ...
# # tls.c:708:splice_dec_cmsg_to_pipe:Expected recv(self->cfd, buf,
send_len, 0) (10) == -1 (-1)
# # tls.c:709:splice_dec_cmsg_to_pipe:Expected errno (2) == EIO (5)
# # splice_dec_cmsg_to_pipe: Test terminated by timeout
# #          FAIL  tls.12_chacha.splice_dec_cmsg_to_pipe
# not ok 120 tls.12_chacha.splice_dec_cmsg_to_pipe

<trim>

# #  RUN           tls.13_chacha.splice_cmsg_to_pipe ...
# # tls.c:688:splice_cmsg_to_pipe:Expected splice(self->cfd, NULL,
p[1], NULL, send_len, 0) (10) == -1 (-1)
# # tls.c:689:splice_cmsg_to_pipe:Expected errno (2) == EINVAL (22)
# # splice_cmsg_to_pipe: Test terminated by timeout
# #          FAIL  tls.13_chacha.splice_cmsg_to_pipe
# not ok 168 tls.13_chacha.splice_cmsg_to_pipe
# #  RUN           tls.13_chacha.splice_dec_cmsg_to_pipe ...
# # tls.c:708:splice_dec_cmsg_to_pipe:Expected recv(self->cfd, buf,
send_len, 0) (10) == -1 (-1)
# # tls.c:709:splice_dec_cmsg_to_pipe:Expected errno (2) == EIO (5)
# # splice_dec_cmsg_to_pipe: Test terminated by timeout
# #          FAIL  tls.13_chacha.splice_dec_cmsg_to_pipe
# not ok 169 tls.13_chacha.splice_dec_cmsg_to_pipe

<trim>

# #  RUN           tls.13_sm4_gcm.splice_cmsg_to_pipe ...
# # tls.c:688:splice_cmsg_to_pipe:Expected splice(self->cfd, NULL,
p[1], NULL, send_len, 0) (10) == -1 (-1)
# # tls.c:689:splice_cmsg_to_pipe:Expected errno (2) == EINVAL (22)
# # splice_cmsg_to_pipe: Test terminated by timeout
# #          FAIL  tls.13_sm4_gcm.splice_cmsg_to_pipe
# not ok 217 tls.13_sm4_gcm.splice_cmsg_to_pipe
# #  RUN           tls.13_sm4_gcm.splice_dec_cmsg_to_pipe ...
# # tls.c:708:splice_dec_cmsg_to_pipe:Expected recv(self->cfd, buf,
send_len, 0) (10) == -1 (-1)
# # tls.c:709:splice_dec_cmsg_to_pipe:Expected errno (2) == EIO (5)
[  661.901558] kworker/dying (49) used greatest stack depth: 10576 bytes left

Test case HANG here.

Full test log links [1] including boot log and test run log.

Reported-by: Linux Kernel Functional Testing <lkft@linaro.org>

--
Linaro LKFT
https://lkft.linaro.org

https://lkft.validation.linaro.org/scheduler/job/4770773#L2700

^ permalink raw reply	[flat|nested] 9+ messages in thread

* Re: kselftest: net: tls: hangs
  2022-03-25  7:40 kselftest: net: tls: hangs Naresh Kamboju
@ 2022-03-25 22:13 ` Shuah Khan
  2022-03-25 23:12   ` Jakub Kicinski
  0 siblings, 1 reply; 9+ messages in thread
From: Shuah Khan @ 2022-03-25 22:13 UTC (permalink / raw)
  To: Naresh Kamboju, Jakub Kicinski
  Cc: Shuah Khan, David S. Miller, Paolo Abeni, Shuah Khan,
	Linux-Next Mailing List, open list:KERNEL SELFTEST FRAMEWORK,
	open list, lkft-triage, Netdev

On 3/25/22 1:40 AM, Naresh Kamboju wrote:
> While running kselftest net tls test case on Linux next and mainline kernels
> the test case fails at following sub test cases and hangs every time.
> Please investigate this hang issue.
> 
> kconfigs are generated from kselftest-merge config.
> 
> metadata:
>    git_ref: master
>    git_repo: https://gitlab.com/Linaro/lkft/mirrors/next/linux-next
>    git_sha: b61581ae229d8eb9f21f8753be3f4011f7692384
>    git_describe: next-20220323
>    kernel_version: 5.17.0
>    kernel-config: https://builds.tuxbuild.com/26mKij4yB5Q6WUpOyHHEoHLstVJ/config
> 
> Test log link,
> --------------
> # selftests: net: tls
> # TAP version 13
> # 1..502
> # # Starting 502 tests from 14 test cases.
> # #  RUN           global.non_established ...
> # #            OK  global.non_established
> # ok 1 global.non_established
> # #  RUN           global.keysizes ...
> # #            OK  global.keysizes
> <trim>
> 
> # #  RUN           tls.12_aes_gcm.splice_cmsg_to_pipe ...
> # # tls.c:688:splice_cmsg_to_pipe:Expected splice(self->cfd, NULL,
> p[1], NULL, send_len, 0) (10) == -1 (-1)
> # # tls.c:689:splice_cmsg_to_pipe:Expected errno (2) == EINVAL (22)
> # # splice_cmsg_to_pipe: Test terminated by timeout
> # #          FAIL  tls.12_aes_gcm.splice_cmsg_to_pipe
> # not ok 21 tls.12_aes_gcm.splice_cmsg_to_pipe
> # #  RUN           tls.12_aes_gcm.splice_dec_cmsg_to_pipe ...
> # # tls.c:708:splice_dec_cmsg_to_pipe:Expected recv(self->cfd, buf,
> send_len, 0) (10) == -1 (-1)
> # # tls.c:709:splice_dec_cmsg_to_pipe:Expected errno (2) == EIO (5)
> # # splice_dec_cmsg_to_pipe: Test terminated by timeout
> # #          FAIL  tls.12_aes_gcm.splice_dec_cmsg_to_pipe
> # not ok 22 tls.12_aes_gcm.splice_dec_cmsg_to_pipe
> # #  RUN           tls.12_aes_gcm.recv_and_splice ...
> # #            OK  tls.12_aes_gcm.recv_and_splice
> 
> <trim>
> 
> # #  RUN           tls.13_aes_gcm.splice_cmsg_to_pipe ...
> # # tls.c:688:splice_cmsg_to_pipe:Expected splice(self->cfd, NULL,
> p[1], NULL, send_len, 0) (10) == -1 (-1)
> # # tls.c:689:splice_cmsg_to_pipe:Expected errno (2) == EINVAL (22)
> # # splice_cmsg_to_pipe: Test terminated by timeout
> # #          FAIL  tls.13_aes_gcm.splice_cmsg_to_pipe
> # not ok 70 tls.13_aes_gcm.splice_cmsg_to_pipe
> # #  RUN           tls.13_aes_gcm.splice_dec_cmsg_to_pipe ...
> # # tls.c:708:splice_dec_cmsg_to_pipe:Expected recv(self->cfd, buf,
> send_len, 0) (10) == -1 (-1)
> # # tls.c:709:splice_dec_cmsg_to_pipe:Expected errno (2) == EIO (5)
> # # splice_dec_cmsg_to_pipe: Test terminated by timeout
> # #          FAIL  tls.13_aes_gcm.splice_dec_cmsg_to_pipe
> # not ok 71 tls.13_aes_gcm.splice_dec_cmsg_to_pipe
> 
> 
> <trim>
> 
> # #  RUN           tls.12_chacha.splice_cmsg_to_pipe ...
> # # tls.c:688:splice_cmsg_to_pipe:Expected splice(self->cfd, NULL,
> p[1], NULL, send_len, 0) (10) == -1 (-1)
> # # tls.c:689:splice_cmsg_to_pipe:Expected errno (2) == EINVAL (22)
> # # splice_cmsg_to_pipe: Test terminated by timeout
> # #          FAIL  tls.12_chacha.splice_cmsg_to_pipe
> # not ok 119 tls.12_chacha.splice_cmsg_to_pipe
> # #  RUN           tls.12_chacha.splice_dec_cmsg_to_pipe ...
> # # tls.c:708:splice_dec_cmsg_to_pipe:Expected recv(self->cfd, buf,
> send_len, 0) (10) == -1 (-1)
> # # tls.c:709:splice_dec_cmsg_to_pipe:Expected errno (2) == EIO (5)
> # # splice_dec_cmsg_to_pipe: Test terminated by timeout
> # #          FAIL  tls.12_chacha.splice_dec_cmsg_to_pipe
> # not ok 120 tls.12_chacha.splice_dec_cmsg_to_pipe
> 
> <trim>
> 
> # #  RUN           tls.13_chacha.splice_cmsg_to_pipe ...
> # # tls.c:688:splice_cmsg_to_pipe:Expected splice(self->cfd, NULL,
> p[1], NULL, send_len, 0) (10) == -1 (-1)
> # # tls.c:689:splice_cmsg_to_pipe:Expected errno (2) == EINVAL (22)
> # # splice_cmsg_to_pipe: Test terminated by timeout
> # #          FAIL  tls.13_chacha.splice_cmsg_to_pipe
> # not ok 168 tls.13_chacha.splice_cmsg_to_pipe
> # #  RUN           tls.13_chacha.splice_dec_cmsg_to_pipe ...
> # # tls.c:708:splice_dec_cmsg_to_pipe:Expected recv(self->cfd, buf,
> send_len, 0) (10) == -1 (-1)
> # # tls.c:709:splice_dec_cmsg_to_pipe:Expected errno (2) == EIO (5)
> # # splice_dec_cmsg_to_pipe: Test terminated by timeout
> # #          FAIL  tls.13_chacha.splice_dec_cmsg_to_pipe
> # not ok 169 tls.13_chacha.splice_dec_cmsg_to_pipe
> 
> <trim>
> 
> # #  RUN           tls.13_sm4_gcm.splice_cmsg_to_pipe ...
> # # tls.c:688:splice_cmsg_to_pipe:Expected splice(self->cfd, NULL,
> p[1], NULL, send_len, 0) (10) == -1 (-1)
> # # tls.c:689:splice_cmsg_to_pipe:Expected errno (2) == EINVAL (22)
> # # splice_cmsg_to_pipe: Test terminated by timeout
> # #          FAIL  tls.13_sm4_gcm.splice_cmsg_to_pipe
> # not ok 217 tls.13_sm4_gcm.splice_cmsg_to_pipe
> # #  RUN           tls.13_sm4_gcm.splice_dec_cmsg_to_pipe ...
> # # tls.c:708:splice_dec_cmsg_to_pipe:Expected recv(self->cfd, buf,
> send_len, 0) (10) == -1 (-1)
> # # tls.c:709:splice_dec_cmsg_to_pipe:Expected errno (2) == EIO (5)
> [  661.901558] kworker/dying (49) used greatest stack depth: 10576 bytes left

This seems to be the problem perhaps.

Jakub, any thoughts. The last change to tls.c was a while back.

> 
> Test case HANG here.
> 
> Full test log links [1] including boot log and test run log.
> 
> Reported-by: Linux Kernel Functional Testing <lkft@linaro.org>
> 
> --
> Linaro LKFT
> https://lkft.linaro.org
> 
> https://lkft.validation.linaro.org/scheduler/job/4770773#L2700
> 

thanks,
-- Shuah
thanks,
-- Shuah

^ permalink raw reply	[flat|nested] 9+ messages in thread

* Re: kselftest: net: tls: hangs
  2022-03-25 22:13 ` Shuah Khan
@ 2022-03-25 23:12   ` Jakub Kicinski
  2022-03-25 23:34     ` Shuah Khan
  0 siblings, 1 reply; 9+ messages in thread
From: Jakub Kicinski @ 2022-03-25 23:12 UTC (permalink / raw)
  To: Shuah Khan
  Cc: Naresh Kamboju, Shuah Khan, David S. Miller, Paolo Abeni,
	Linux-Next Mailing List, open list:KERNEL SELFTEST FRAMEWORK,
	open list, lkft-triage, Netdev

On Fri, 25 Mar 2022 16:13:33 -0600 Shuah Khan wrote:
> > # #  RUN           tls.13_sm4_gcm.splice_cmsg_to_pipe ...
> > # # tls.c:688:splice_cmsg_to_pipe:Expected splice(self->cfd, NULL,
> > p[1], NULL, send_len, 0) (10) == -1 (-1)
> > # # tls.c:689:splice_cmsg_to_pipe:Expected errno (2) == EINVAL (22)
> > # # splice_cmsg_to_pipe: Test terminated by timeout
> > # #          FAIL  tls.13_sm4_gcm.splice_cmsg_to_pipe
> > # not ok 217 tls.13_sm4_gcm.splice_cmsg_to_pipe
> > # #  RUN           tls.13_sm4_gcm.splice_dec_cmsg_to_pipe ...
> > # # tls.c:708:splice_dec_cmsg_to_pipe:Expected recv(self->cfd, buf,
> > send_len, 0) (10) == -1 (-1)
> > # # tls.c:709:splice_dec_cmsg_to_pipe:Expected errno (2) == EIO (5)
> > [  661.901558] kworker/dying (49) used greatest stack depth: 10576 bytes left  
> 
> This seems to be the problem perhaps.
>
> Jakub, any thoughts. The last change to tls.c was a while back.

Yes, sorry, kicked off a build and got distracted.

I can repro the failures, TLS=n in the config I must have not tested
that in the new cases.

But I can't repro the hung, and we have a timer at the hardness level
IIUC so IDK how this could "hang"?

Naresh, is there any stack trace in the logs? Can you repro on Linus's
tree?

^ permalink raw reply	[flat|nested] 9+ messages in thread

* Re: kselftest: net: tls: hangs
  2022-03-25 23:12   ` Jakub Kicinski
@ 2022-03-25 23:34     ` Shuah Khan
  2022-03-29 14:49       ` Naresh Kamboju
  0 siblings, 1 reply; 9+ messages in thread
From: Shuah Khan @ 2022-03-25 23:34 UTC (permalink / raw)
  To: Jakub Kicinski
  Cc: Naresh Kamboju, Shuah Khan, David S. Miller, Paolo Abeni,
	Linux-Next Mailing List, open list:KERNEL SELFTEST FRAMEWORK,
	open list, lkft-triage, Netdev, Shuah Khan

On 3/25/22 5:12 PM, Jakub Kicinski wrote:
> On Fri, 25 Mar 2022 16:13:33 -0600 Shuah Khan wrote:
>>> # #  RUN           tls.13_sm4_gcm.splice_cmsg_to_pipe ...
>>> # # tls.c:688:splice_cmsg_to_pipe:Expected splice(self->cfd, NULL,
>>> p[1], NULL, send_len, 0) (10) == -1 (-1)
>>> # # tls.c:689:splice_cmsg_to_pipe:Expected errno (2) == EINVAL (22)
>>> # # splice_cmsg_to_pipe: Test terminated by timeout
>>> # #          FAIL  tls.13_sm4_gcm.splice_cmsg_to_pipe
>>> # not ok 217 tls.13_sm4_gcm.splice_cmsg_to_pipe
>>> # #  RUN           tls.13_sm4_gcm.splice_dec_cmsg_to_pipe ...
>>> # # tls.c:708:splice_dec_cmsg_to_pipe:Expected recv(self->cfd, buf,
>>> send_len, 0) (10) == -1 (-1)
>>> # # tls.c:709:splice_dec_cmsg_to_pipe:Expected errno (2) == EIO (5)
>>> [  661.901558] kworker/dying (49) used greatest stack depth: 10576 bytes left
>>
>> This seems to be the problem perhaps.
>>
>> Jakub, any thoughts. The last change to tls.c was a while back.
> 
> Yes, sorry, kicked off a build and got distracted.
> 
> I can repro the failures, TLS=n in the config I must have not tested
> that in the new cases.
> 
> But I can't repro the hung, and we have a timer at the hardness level
> IIUC so IDK how this could "hang"?
> 
> Naresh, is there any stack trace in the logs? Can you repro on Linus's
> tree?
> 

I couldn't reproduce this either - I have TLS=m

thanks,
-- Shuah

^ permalink raw reply	[flat|nested] 9+ messages in thread

* Re: kselftest: net: tls: hangs
  2022-03-25 23:34     ` Shuah Khan
@ 2022-03-29 14:49       ` Naresh Kamboju
       [not found]         ` <20220329102649.507bbf2a@kernel.org>
  0 siblings, 1 reply; 9+ messages in thread
From: Naresh Kamboju @ 2022-03-29 14:49 UTC (permalink / raw)
  To: Jakub Kicinski
  Cc: Shuah Khan, Shuah Khan, David S. Miller, Paolo Abeni,
	Linux-Next Mailing List, open list:KERNEL SELFTEST FRAMEWORK,
	open list, lkft-triage, Netdev

Hi Jakub,

> >> This seems to be the problem perhaps.
> >>
> >> Jakub, any thoughts. The last change to tls.c was a while back.
> >
> > Yes, sorry, kicked off a build and got distracted.
> >
> > I can repro the failures, TLS=n in the config I must have not tested
> > that in the new cases.

Thanks for investigation and reproducing the test failures.

> >
> > But I can't repro the hung, and we have a timer at the hardness level
> > IIUC so IDK how this could "hang"?

Did you get a chance to build and test with the config file that I
have provided ?
I am using kselftest-merge configs for my daily CI system [1] on gitlab.

If you think the Kconfig is a test

> > Naresh, is there any stack trace in the logs? Can you repro on Linus's
> > tree?

There is no stack trace print, it was only hung.

> I couldn't reproduce this either - I have TLS=m

If you want to reproduce the same build or re-use the bzImage /
vmlinux / System.map
you may check the provided link [1].

Step to reproduce [1]:

# TuxMake supports the concept of runtimes.
# See https://docs.tuxmake.org/runtimes/, for that to work it requires
# that you install podman or docker on your system.
#
# To install tuxmake on your system globally:
# sudo pip3 install -U tuxmake
#
# See https://docs.tuxmake.org/ for complete documentation.
# Original tuxmake command with fragments listed below.

tuxmake --runtime podman --target-arch x86_64 --toolchain gcc-11
--kconfig defconfig --kconfig-add
https://raw.githubusercontent.com/Linaro/meta-lkft/sumo/recipes-kernel/linux/files/lkft.config
--kconfig-add https://raw.githubusercontent.com/Linaro/meta-lkft/sumo/recipes-kernel/linux/files/lkft-crypto.config
--kconfig-add https://raw.githubusercontent.com/Linaro/meta-lkft/sumo/recipes-kernel/linux/files/distro-overrides.config
--kconfig-add https://raw.githubusercontent.com/Linaro/meta-lkft/sumo/recipes-kernel/linux/files/systemd.config
--kconfig-add https://raw.githubusercontent.com/Linaro/meta-lkft/sumo/recipes-kernel/linux/files/virtio.config
--kconfig-add CONFIG_IGB=y --kconfig-add
CONFIG_UNWINDER_FRAME_POINTER=y cpupower headers kernel kselftest
kselftest-merge modules


[1] https://builds.tuxbuild.com/26mKij4yB5Q6WUpOyHHEoHLstVJ/

- Naresh

^ permalink raw reply	[flat|nested] 9+ messages in thread

* Re: kselftest: net: tls: hangs
       [not found]         ` <20220329102649.507bbf2a@kernel.org>
@ 2022-03-29 17:46           ` Naresh Kamboju
  2022-03-29 17:56             ` Jakub Kicinski
  2022-03-31  7:48           ` Naresh Kamboju
  1 sibling, 1 reply; 9+ messages in thread
From: Naresh Kamboju @ 2022-03-29 17:46 UTC (permalink / raw)
  To: Jakub Kicinski
  Cc: Shuah Khan, Shuah Khan, David S. Miller, Paolo Abeni,
	Linux-Next Mailing List,
	open list:KERNEL SELFTEST FRAMEWORK
	<linux-kselftest@vger.kernel.org>,
	open list <linux-kernel@vger.kernel.org>,
	lkft-triage@lists.linaro.org, Netdev

Hi Jakub,

On Tue, 29 Mar 2022 at 22:56, Jakub Kicinski <kuba@kernel.org> wrote:
>
> On Tue, 29 Mar 2022 20:19:10 +0530 Naresh Kamboju wrote:
> > > > But I can't repro the hung, and we have a timer at the hardness level
> > > > IIUC so IDK how this could "hang"?
> >
> > Did you get a chance to build and test with the config file that I
> > have provided ?
> > I am using kselftest-merge configs for my daily CI system [1] on gitlab.
> >
> > If you think the Kconfig is a test
>
> I have not used the exact config, just a config with TLS=n.
> I also have a different GCC version (11.1).
> Unfortunately I don't have access to docker or podman on the machines
> I'm using.
>
> > > > Naresh, is there any stack trace in the logs? Can you repro on Linus's
> > > > tree?
> >
> > There is no stack trace print, it was only hung.
>
> Can you check where the process is stuck and it's state?
> /proc/$pid/stack and run that thru scripts/decode_stacktrace

I generally use remote devices. which have single serial connection only.
However, I will try your suggestions tomorrow and get back to you.

> > > I couldn't reproduce this either - I have TLS=m
> >
> > If you want to reproduce the same build or re-use the bzImage /
> > vmlinux / System.map
> > you may check the provided link [1].
> >
> > Step to reproduce [1]:
> >
> > # TuxMake supports the concept of runtimes.
> > # See https://docs.tuxmake.org/runtimes/, for that to work it requires
> > # that you install podman or docker on your system.
> > #
> > # To install tuxmake on your system globally:
> > # sudo pip3 install -U tuxmake
> > #
> > # See https://docs.tuxmake.org/ for complete documentation.
> > # Original tuxmake command with fragments listed below.
> >
> > tuxmake --runtime podman --target-arch x86_64 --toolchain gcc-11
> > --kconfig defconfig --kconfig-add
> > https://raw.githubusercontent.com/Linaro/meta-lkft/sumo/recipes-kernel/linux/files/lkft.config
> > --kconfig-add https://raw.githubusercontent.com/Linaro/meta-lkft/sumo/recipes-kernel/linux/files/lkft-crypto.config
> > --kconfig-add https://raw.githubusercontent.com/Linaro/meta-lkft/sumo/recipes-kernel/linux/files/distro-overrides.config
> > --kconfig-add https://raw.githubusercontent.com/Linaro/meta-lkft/sumo/recipes-kernel/linux/files/systemd.config
> > --kconfig-add https://raw.githubusercontent.com/Linaro/meta-lkft/sumo/recipes-kernel/linux/files/virtio.config
> > --kconfig-add CONFIG_IGB=y --kconfig-add
> > CONFIG_UNWINDER_FRAME_POINTER=y cpupower headers kernel kselftest
> > kselftest-merge modules
> >
> >
> > [1] https://builds.tuxbuild.com/26mKij4yB5Q6WUpOyHHEoHLstVJ/
>
> I only see build logs here, are there logs for the run?

Yes. Those are only build logs.
The test log is here but not very useful.
 https://lkft.validation.linaro.org/scheduler/job/4770773#L2700

- Naresh

^ permalink raw reply	[flat|nested] 9+ messages in thread

* Re: kselftest: net: tls: hangs
  2022-03-29 17:46           ` Naresh Kamboju
@ 2022-03-29 17:56             ` Jakub Kicinski
  0 siblings, 0 replies; 9+ messages in thread
From: Jakub Kicinski @ 2022-03-29 17:56 UTC (permalink / raw)
  To: Naresh Kamboju
  Cc: Shuah Khan, Shuah Khan, David S. Miller, Paolo Abeni,
	Linux-Next Mailing List,
	open list:KERNEL SELFTEST FRAMEWORK
	<linux-kselftest@vger.kernel.org>,
	 open list <linux-kernel@vger.kernel.org>,
	lkft-triage@lists.linaro.org,  Netdev

On Tue, 29 Mar 2022 23:16:25 +0530 Naresh Kamboju wrote:
> > I only see build logs here, are there logs for the run?  
> 
> Yes. Those are only build logs.
> The test log is here but not very useful.
>  https://lkft.validation.linaro.org/scheduler/job/4770773#L2700

Thanks, interesting. We have the timeout set to 30 sec so if there's no
progress for 30 sec the test should just fail. But the test setup says
no progress for 900 sec, so something must have gotten hard stuck :S

Looking forward to the update tomorrow, something to consider long term
for the test setup would be enabling CONFIG_*LOCKUP_DETECTOR and
CONFIG_DETECT_HUNG_TASK.

^ permalink raw reply	[flat|nested] 9+ messages in thread

* Re: kselftest: net: tls: hangs
       [not found]         ` <20220329102649.507bbf2a@kernel.org>
  2022-03-29 17:46           ` Naresh Kamboju
@ 2022-03-31  7:48           ` Naresh Kamboju
  2022-03-31 18:54             ` Jakub Kicinski
  1 sibling, 1 reply; 9+ messages in thread
From: Naresh Kamboju @ 2022-03-31  7:48 UTC (permalink / raw)
  To: Jakub Kicinski
  Cc: Shuah Khan, Shuah Khan, David S. Miller, Paolo Abeni,
	Linux-Next Mailing List,
	open list:KERNEL SELFTEST FRAMEWORK
	<linux-kselftest@vger.kernel.org>,
	open list <linux-kernel@vger.kernel.org>,
	lkft-triage@lists.linaro.org, Netdev, Anders Roxell

Hi Jakub,

> Can you check where the process is stuck and it's state?
> /proc/$pid/stack and run that thru scripts/decode_stacktrace
>

Steps to reproduce:
          - cd /opt/kselftests/default-in-kernel/net
          - ./tls &
          - tests_pid=$!
          - echo $tests_pid
          - sleep 90
          - cat /proc/$tests_pid/stack | tee tests_pid_stack.log
          - cat tests_pid_stack.log

[<0>] do_wait+0x191/0x3a0
[<0>] kernel_wait4+0xaf/0x160
[<0>] __do_sys_wait4+0x85/0x90
[<0>] __x64_sys_wait4+0x1c/0x20
[<0>] do_syscall_64+0x5c/0x80
[<0>] entry_SYSCALL_64_after_hwframe+0x44/0xae

Detail test log can be found here in this link [1]

I do not see any output from
./scripts/decode_stacktrace.sh  stack-dump.txt


- Naresh

[1] https://lkft.validation.linaro.org/scheduler/job/4812800#L2256

^ permalink raw reply	[flat|nested] 9+ messages in thread

* Re: kselftest: net: tls: hangs
  2022-03-31  7:48           ` Naresh Kamboju
@ 2022-03-31 18:54             ` Jakub Kicinski
  0 siblings, 0 replies; 9+ messages in thread
From: Jakub Kicinski @ 2022-03-31 18:54 UTC (permalink / raw)
  To: Naresh Kamboju
  Cc: Shuah Khan, Shuah Khan, David S. Miller, Paolo Abeni,
	Linux-Next Mailing List,
	open list:KERNEL SELFTEST FRAMEWORK
	<linux-kselftest@vger.kernel.org>,
	 open list <linux-kernel@vger.kernel.org>,
	lkft-triage@lists.linaro.org,  Netdev, Anders Roxell

On Thu, 31 Mar 2022 13:18:56 +0530 Naresh Kamboju wrote:
> Hi Jakub,
> 
> > Can you check where the process is stuck and it's state?
> > /proc/$pid/stack and run that thru scripts/decode_stacktrace
> >  
> 
> Steps to reproduce:
>           - cd /opt/kselftests/default-in-kernel/net
>           - ./tls &
>           - tests_pid=$!
>           - echo $tests_pid
>           - sleep 90
>           - cat /proc/$tests_pid/stack | tee tests_pid_stack.log
>           - cat tests_pid_stack.log
> 
> [<0>] do_wait+0x191/0x3a0
> [<0>] kernel_wait4+0xaf/0x160
> [<0>] __do_sys_wait4+0x85/0x90
> [<0>] __x64_sys_wait4+0x1c/0x20
> [<0>] do_syscall_64+0x5c/0x80
> [<0>] entry_SYSCALL_64_after_hwframe+0x44/0xae

The kernel test harness runs the tests in a separate process,
this is the stack trace for the main process. We'll need a stack trace
for the child, so:

	main_pid=$!
	tests_pid=$(pgrep -P $main_pid)

Also since each test runs in a separate child, we can try to catch the
one that got stuck for longer than 30 sec:

	pp=$(pgrep -P $main_pid)
	while true; do 
		sleep 30
		p=$(pgrep -P $main_pid)
		if [ $p != $pp ]; then
			pp=$p
			echo "New PID $p, continue waiting..."
			continue
		fi

		echo "PID $p is stuck!"
		cat /proc/$p/stack
	done

> Detail test log can be found here in this link [1]
> 
> I do not see any output from
> ./scripts/decode_stacktrace.sh  stack-dump.txt
> 
> 
> - Naresh
> 
> [1] https://lkft.validation.linaro.org/scheduler/job/4812800#L2256


^ permalink raw reply	[flat|nested] 9+ messages in thread

end of thread, other threads:[~2022-03-31 18:54 UTC | newest]

Thread overview: 9+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2022-03-25  7:40 kselftest: net: tls: hangs Naresh Kamboju
2022-03-25 22:13 ` Shuah Khan
2022-03-25 23:12   ` Jakub Kicinski
2022-03-25 23:34     ` Shuah Khan
2022-03-29 14:49       ` Naresh Kamboju
     [not found]         ` <20220329102649.507bbf2a@kernel.org>
2022-03-29 17:46           ` Naresh Kamboju
2022-03-29 17:56             ` Jakub Kicinski
2022-03-31  7:48           ` Naresh Kamboju
2022-03-31 18:54             ` Jakub Kicinski

This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.