From: Naresh Kamboju <naresh.kamboju@linaro.org>
To: Mike Kravetz <mike.kravetz@oracle.com>
Cc: Michal Hocko <mhocko@kernel.org>,
linux- stable <stable@vger.kernel.org>,
open list <linux-kernel@vger.kernel.org>,
linux-mm <linux-mm@kvack.org>, Hugh Dickins <hughd@google.com>,
Naoya Horiguchi <n-horiguchi@ah.jp.nec.com>,
"Aneesh Kumar K.V" <aneesh.kumar@linux.vnet.ibm.com>,
Andrea Arcangeli <aarcange@redhat.com>,
"Kirill A.Shutemov" <kirill.shutemov@linux.intel.com>,
Davidlohr Bueso <dave@stgolabs.net>,
Prakash Sangappa <prakash.sangappa@oracle.com>,
Andrew Morton <akpm@linux-foundation.org>,
lkft-triage@lists.linaro.org, Arnd Bergmann <arnd@arndb.de>,
John Stultz <john.stultz@linaro.org>
Subject: Re: stable-rc 5.4: libhugetlbfs fallocate_stress.sh: Unable to handle kernel paging request at virtual address ffff00006772f000
Date: Wed, 10 Jun 2020 12:38:56 +0530 [thread overview]
Message-ID: <CA+G9fYvyLj1Mp=+4R+ah250fCmhboWhVd=Phg8PsYGJ98LWR3Q@mail.gmail.com> (raw)
In-Reply-To: <f3a2db67-f7b7-1bb7-340f-24806a999192@oracle.com>
On Thu, 14 May 2020 at 22:01, Mike Kravetz <mike.kravetz@oracle.com> wrote:
>
> On 5/13/20 11:40 PM, Michal Hocko wrote:
> > On Wed 13-05-20 23:11:40, Naresh Kamboju wrote:
> >> While running libhugetlbfs fallocate_stress.sh on stable-rc 5.4 branch kernel
> >> on arm64 hikey device. The following kernel Internal error: Oops:
> >> crash dump noticed.
> >
> > Is the same problem reproducible on vanilla 5.4 without any stable
> > patches?
> >
>
> Or, an earlier version of 5.4-stable? Nothing in the changelog for 5.4.41
> looks related to this issue. There was an arm specific hugetlb change
> "arm64: hugetlb: avoid potential NULL dereference", but that is pretty
> straight forward.
>
> I'm guessing this may not reproduce easily. To help reproduce, you could
> change the
> #define FALLOCATE_ITERATIONS 100000
> in .../libhugetlbfs/tests/fallocate_stress.c to a larger number to force
> the stress test to run longer.
Sorry i did not get a chance to run as per your suggestion.
But this issue is reproducible on stable-rc 5.4.46-rc1 on arm64 hikey device
./runltp -p -q -f hugetlb
<>
ksm05.c:78: PASS: still alive.
ksm05.c:78: PASS: still alive.
ksm05.c:78: PASS: still alive.
ksm05.c:78: PASS: still alive.
[ 383.751513] oom01 invoked oom-killer:
gfp_mask=0x100cca(GFP_HIGHUSER_MOVABLE), order=0, oom_score_adj=0
[ 384.715831] EC = 0x25: DABT (current EL), IL = 32 bits
[ 384.725478] CPU: 0 PID: 10948 Comm: oom01 Not tainted
5.4.46-rc1-00035-g12a5ce113626 #1
[ 384.730887] SET = 0, FnV = 0
[ 384.739060] Hardware name: HiKey Development Board (DT)
[ 384.739066] Call trace:
[ 384.739081] dump_backtrace+0x0/0x140
[ 384.739090] show_stack+0x14/0x20
[ 384.742209] EA = 0, S1PTW = 0
[ 384.746701] dwmmc_k3 f723d000.dwmmc0: Unexpected interrupt latency
[ 384.747550] dump_stack+0xb4/0xf8
[ 384.747559] dump_header+0x44/0x1ec
[ 384.747565] oom_kill_process+0x1d4/0x1d8
[ 384.747572] out_of_memory+0x170/0x4e0
[ 384.750070] Data abort info:
[ 384.753813] __alloc_pages_slowpath+0x954/0x9f8
[ 384.753819] __alloc_pages_nodemask+0x21c/0x280
[ 384.753826] alloc_pages_vma+0x88/0x210
[ 384.753836] __handle_mm_fault+0x638/0x1080
[ 384.757236] ISV = 0, ISS = 0x00000047
[ 384.760428] handle_mm_fault+0xdc/0x1a8
[ 384.760436] do_page_fault+0x130/0x460
[ 384.760442] do_translation_fault+0x5c/0x78
[ 384.760450] do_mem_abort+0x3c/0x98
[ 384.766776] CM = 0, WnR = 1
[ 384.770154] el0_da+0x1c/0x20
[ 384.773735] swapper pgtable: 4k pages, 48-bit VAs, pgdp=00000000013c5000
[ 384.777949] Mem-Info:
[ 384.781679] [ffff0000641ff000] pgd=0000000077ff7003,
pud=0000000077e0d003, pmd=0000000077cec003, pte=0000000000000000
[ 384.781694] Internal error: Oops: 96000047 [#1] PREEMPT SMP
[ 384.781698] Modules linked in: wl18xx wlcore mac80211 cfg80211
hci_uart snd_soc_audio_graph_card btbcm snd_soc_simple_card_utils
crct10dif_ce wlcore_sdio adv7511 bluetooth kirin_drm cec dw_drm_dsi
rfkill drm_kms_helper drm fuse
[ 384.784854] active_anon:472313 inactive_anon:2168 isolated_anon:0
[ 384.784854] active_file:63 inactive_file:0 isolated_file:0
[ 384.784854] unevictable:0 dirty:0 writeback:0 unstable:0
[ 384.784854] slab_reclaimable:2625 slab_unreclaimable:7426
[ 384.784854] mapped:202 shmem:2175 pagetables:1188 bounce:0
[ 384.784854] free:5469 free_pcp:1684 free_cma:14
[ 384.789304] CPU: 5 PID: 10945 Comm: oom01 Not tainted
5.4.46-rc1-00035-g12a5ce113626 #1
[ 384.789309] Hardware name: HiKey Development Board (DT)
[ 384.789315] pstate: 80000005 (Nzcv daif -PAN -UAO)
[ 384.789328] pc : clear_page+0x10/0x24
[ 384.789339] lr : __cpu_clear_user_page+0xc/0x18
[ 384.794000] Node 0 active_anon:1889252kB inactive_anon:8672kB
active_file:412kB inactive_file:0kB unevictable:0kB isolated(anon):0kB
isolated(file):0kB mapped:800kB dirty:0kB writeback:0kB shmem:8700kB
shmem_thp: 0kB shmem_pmdmapped: 0kB anon_thp: 1478656kB
writeback_tmp:0kB unstable:0kB all_unreclaimable? yes
[ 384.797884] sp : ffff8000190ebc10
[ 384.797888] x29: ffff8000190ebc10 x28: ffff000066cf2a00
[ 384.797895] x27: 0000000000000002 x26: fffffe00019f2400
[ 384.797901] x25: ffff00006e51df00 x24: 0000000000001000
[ 384.802205] Node 0 DMA32 free:21876kB min:22528kB low:28160kB
high:33792kB active_anon:1889252kB inactive_anon:8672kB
active_file:0kB inactive_file:484kB unevictable:0kB writepending:0kB
present:2061364kB managed:1995396kB mlocked:0kB kernel_stack:2800kB
pagetables:4752kB bounce:0kB free_pcp:6864kB local_pcp:1320kB
free_cma:56kB
[ 384.806099] x23: 0000000000000000 x22: fffffe0001700000
[ 384.806106] x21: 0000000000000000 x20: 0000fffef3800000
[ 384.806112] x19: ffff000066cf2a00 x18: 0000000000000000
[ 384.806117] x17: 0000000000000000 x16: 0000000000000000
[ 384.810066] lowmem_reserve[]: 0 0 0
[ 384.813873] x15: 0000000000000000 x14: 0000000000000000
[ 384.813879] x13: 0000000000000000 x12: 0000000000000000
[ 384.813885] x11: 0000000000000000 x10: 0000000000000000
[ 384.813891] x9 : ffff800066671000 x8 : 0000000000000200
[ 384.818192] Node 0 DMA32: 899*4kB (UME) 471*8kB (UMEC) 205*16kB
(UEC) 153*32kB (UMEC) 61*64kB (UME) 15*128kB (UE) 2*256kB (ME) 0*512kB
0*1024kB 0*2048kB 0*4096kB = 21876kB
[ 384.821730] x7 : ffff800066671000 x6 : 0000000000000000
[ 384.821737] x5 : 0000000000000000 x4 : 0000020000200000
[ 384.821743] x3 : 0000000000007fc0 x2 : 0000000000000004
[ 384.821748] x1 : 0000000000000040 x0 : ffff0000641ff000
[ 384.821754] Call trace:
[ 384.821762] clear_page+0x10/0x24
[ 384.821771] clear_subpage+0x54/0x90
[ 384.821780] clear_huge_page+0x6c/0x208
[ 384.824842] Node 0 hugepages_total=0 hugepages_free=0
hugepages_surp=0 hugepages_size=1048576kB
[ 384.827843] do_huge_pmd_anonymous_page+0x1a4/0x7a0
[ 384.827851] __handle_mm_fault+0x83c/0x1080
[ 384.827857] handle_mm_fault+0xdc/0x1a8
[ 384.827863] do_page_fault+0x130/0x460
[ 384.827872] do_translation_fault+0x5c/0x78
[ 384.834786] Node 0 hugepages_total=0 hugepages_free=0
hugepages_surp=0 hugepages_size=32768kB
[ 384.837038] do_mem_abort+0x3c/0x98
[ 384.837044] el0_da+0x1c/0x20
[ 384.837056] Code: d53b00e1 12000c21 d2800082 9ac12041 (d50b7420)
[ 384.847921] Node 0 hugepages_total=0 hugepages_free=0
hugepages_surp=0 hugepages_size=2048kB
[ 384.853582] ---[ end trace 298eea3ec03b10c2 ]---
[ 384.853619] note: oom01[10945] exited with preempt_count 1
[ 384.874237] Node 0 hugepages_total=0 hugepages_free=0
hugepages_surp=0 hugepages_size=64kB
[ 385.070710] dwmmc_k3 f723d000.dwmmc0: Unexpected interrupt latency
[ 385.551002] 2384 total pagecache pages
[ 385.563572] 0 pages in swap cache
[ 385.575536] Swap cache stats: add 0, delete 0, find 0/0
[ 385.589403] Free swap = 0kB
[ 385.600885] Total swap = 0kB
[ 385.612403] 515341 pages RAM
[ 385.623860] 0 pages HighMem/MovableOnly
[ 385.636339] 16492 pages reserved
[ 385.648140] 32768 pages cma reserved
[ 385.660271] 0 pages hwpoisoned
[ 385.671865] Tasks state (memory values in pages):
[ 385.685192] [ pid ] uid tgid total_vm rss pgtables_bytes
swapents oom_score_adj name
[ 385.702711] [ 377] 0 377 3387 315 61440
0 0 systemd-journal
[ 385.720672] [ 413] 0 413 3520 310 49152
0 -1000 systemd-udevd
[ 385.738467] [ 435] 993 435 1526 79 53248
0 0 systemd-network
[ 385.756400] [ 459] 992 459 1665 99 49152
0 0 systemd-resolve
[ 385.774283] [ 463] 0 463 553 21 40960
0 0 tee-supplicant
[ 385.792002] [ 464] 0 464 1479 111 45056
0 0 systemd-logind
[ 385.809585] [ 472] 995 472 1197 105 45056
0 0 avahi-daemon
[ 385.826656] [ 473] 995 473 1166 66 45056
0 0 avahi-daemon
[ 385.843371] [ 474] 0 474 771 19 40960
0 0 syslogd
[ 385.859600] [ 475] 0 475 771 18 45056
0 0 klogd
[ 385.875602] [ 476] 0 476 1382 62 49152
0 0 bluetoothd
[ 385.892028] [ 479] 996 479 1151 187 45056
0 -900 dbus-daemon
[ 385.908563] [ 481] 0 481 78394 563 106496
0 0 NetworkManager
[ 385.925332] [ 482] 0 482 698 133 40960
0 0 crond
[ 385.941236] [ 527] 65534 527 629 44 40960
0 0 dnsmasq
[ 385.957226] [ 529] 0 529 578 32 40960
0 0 agetty
[ 385.973067] [ 530] 0 530 1173 107 49152
0 0 login
[ 385.988832] [ 531] 0 531 578 32 40960
0 0 agetty
[ 386.004670] [ 536] 0 536 2385 148 49152
0 0 wpa_supplicant
[ 386.020865] [ 537] 998 537 115916 1319 131072
0 0 polkitd
[ 386.036060] [ 563] 0 563 24661 430 69632
0 0 dhclient
[ 386.051325] [ 602] 0 602 1899 214 57344
0 0 systemd
[ 386.066521] [ 603] 0 603 2569 477 61440
0 0 (sd-pam)
[ 386.081811] [ 607] 0 607 910 102 40960
0 0 sh
[ 386.096558] [ 611] 0 611 1039 81 45056
0 0 su
[ 386.110911] [ 612] 0 612 910 97 40960
0 0 sh
[ 386.124866] [ 615] 0 615 756 55 40960
0 0 lava-test-runne
[ 386.139888] [ 1327] 0 1327 756 50 40960
0 0 lava-test-shell
[ 386.154903] [ 1328] 0 1328 756 52 36864
0 0 sh
[ 386.168797] [ 1330] 0 1330 822 133 40960
0 0 ltp.sh
[ 386.183055] [ 1348] 0 1348 822 133 40960
0 0 ltp.sh
[ 386.197278] [ 1349] 0 1349 822 133 40960
0 0 ltp.sh
[ 386.211413] [ 1350] 0 1350 822 133 40960
0 0 ltp.sh
[ 386.225573] [ 1351] 0 1351 921 230 45056
0 0 runltp
[ 386.239776] [ 1352] 0 1352 452 15 40960
0 0 tee
[ 386.253702] [ 1426] 0 1426 451 28 40960
0 0 ltp-pan
[ 386.267740] [ 10933] 0 10933 494 18 32768
0 0 oom01
[ 386.281352] [ 10934] 0 10934 527 31 36864
0 0 oom01
[ 386.294984] [ 10944] 0 10944 5519894 467709 3833856
0 0 oom01
[ 386.308646] oom-kill:constraint=CONSTRAINT_NONE,nodemask=(null),cpuset=/,mems_allowed=0,global_oom,task_memcg=/,task=oom01,pid=10944,uid=0
[ 386.326954] Out of memory: Killed process 10944 (oom01)
total-vm:22079576kB, anon-rss:1870836kB, file-rss:0kB, shmem-rss:0kB,
UID:0 pgtables:3744kB oom_score_adj:0
ksm05.c:78: PASS: still alive.
ksm05.c:78: PASS: still alive.
> --
> Mike Kravetz
next prev parent reply other threads:[~2020-06-10 7:09 UTC|newest]
Thread overview: 6+ messages / expand[flat|nested] mbox.gz Atom feed top
2020-05-13 17:41 stable-rc 5.4: libhugetlbfs fallocate_stress.sh: Unable to handle kernel paging request at virtual address ffff00006772f000 Naresh Kamboju
2020-05-13 17:41 ` Naresh Kamboju
2020-05-14 6:40 ` Michal Hocko
2020-05-14 16:31 ` Mike Kravetz
2020-06-10 7:08 ` Naresh Kamboju [this message]
2020-06-10 7:08 ` Naresh Kamboju
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to='CA+G9fYvyLj1Mp=+4R+ah250fCmhboWhVd=Phg8PsYGJ98LWR3Q@mail.gmail.com' \
--to=naresh.kamboju@linaro.org \
--cc=aarcange@redhat.com \
--cc=akpm@linux-foundation.org \
--cc=aneesh.kumar@linux.vnet.ibm.com \
--cc=arnd@arndb.de \
--cc=dave@stgolabs.net \
--cc=hughd@google.com \
--cc=john.stultz@linaro.org \
--cc=kirill.shutemov@linux.intel.com \
--cc=linux-kernel@vger.kernel.org \
--cc=linux-mm@kvack.org \
--cc=lkft-triage@lists.linaro.org \
--cc=mhocko@kernel.org \
--cc=mike.kravetz@oracle.com \
--cc=n-horiguchi@ah.jp.nec.com \
--cc=prakash.sangappa@oracle.com \
--cc=stable@vger.kernel.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.