linux-kernel.vger.kernel.org archive mirror
 help / color / mirror / Atom feed
* kernel bug(VM_BUG_ON_PAGE) with 3.18.13 in mm/migrate.c
@ 2015-05-27 18:05 Jovi Zhangwei
  2015-05-27 18:42 ` Jovi Zhangwei
  2015-05-28 12:00 ` Mel Gorman
  0 siblings, 2 replies; 7+ messages in thread
From: Jovi Zhangwei @ 2015-05-27 18:05 UTC (permalink / raw)
  To: linux-kernel, mgorman, sasha.levin, n-horiguchi, akpm, hughd,
	linux-mm, vbabka, rientjes

Hi,

I got below kernel bug error in our 3.18.13 stable kernel.
"kernel BUG at mm/migrate.c:1661!"

Source code:

1657    static int numamigrate_isolate_page(pg_data_t *pgdat, struct page *page)
1658   {
1659            int page_lru;
1660
1661           VM_BUG_ON_PAGE(compound_order(page) &&
!PageTransHuge(page), page);

It's easy to trigger the error by run tcpdump in our system.(not sure
it will easily be reproduced in another system)
"sudo tcpdump -i bond0.100 'tcp port 4242' -c 100000000000 -w 4242.pcap"

Any comments for this bug would be great appreciated. thanks.

-----------------------------------------------------------------
dmesg:

[Mon May 25 05:29:33 2015] page:ffffea0015414000 count:66 mapcount:1
mapping:          (null) index:0x0
[Mon May 25 05:29:33 2015] flags: 0x20047580004000(head)
[Mon May 25 05:29:33 2015] page dumped because:
VM_BUG_ON_PAGE(compound_order(page) && !PageTransHuge(page))
[Mon May 25 05:29:33 2015] ------------[ cut here ]------------
[Mon May 25 05:29:33 2015] kernel BUG at mm/migrate.c:1661!
[Mon May 25 05:29:33 2015] invalid opcode: 0000 [#1] SMP
[Mon May 25 05:29:33 2015] Modules linked in: veth xt_comment xt_CT
iptable_raw xt_addrtype ipt_MASQUERADE nf_nat_masquerade_ipv4
iptable_nat nf_nat_ipv4 nf_nat bridge overlay tcp_cubic binfmt_misc
nf_conntrack_ipv6 nf_defrag_ipv6 xt_tcpudp ip6table_filter ip6_tables
nf_conntrack_ipv4 nf_defrag_ipv4 xt_conntrack nf_conntrack
xt_multiport iptable_filter ip_tables x_tables rpcsec_gss_krb5
auth_rpcgss oid_registry nfsv4 nfs lockd grace sunrpc fscache ses
enclosure 8021q garp stp llc bonding ext4 crc16 jbd2 mbcache sg sd_mod
ipmi_watchdog x86_pkg_temp_thermal coretemp kvm_intel iTCO_wdt evdev
kvm crc32c_intel aesni_intel aes_x86_64 lrw gf128mul glue_helper
ablk_helper cryptd ahci libahci ehci_pci mpt3sas raid_class ehci_hcd
ixgbe libata igb scsi_transport_sas mdio usbcore ptp lpc_ich i2c_i801
mfd_core i2c_algo_bit
[Mon May 25 05:29:33 2015]  pps_core usb_common scsi_mod dca i2c_core
wmi acpi_pad acpi_cpufreq md_mod processor thermal_sys button ipmi_si
ipmi_poweroff ipmi_devintf ipmi_msghandler autofs4
[Mon May 25 05:29:33 2015] CPU: 8 PID: 25835 Comm: tcpdump Not tainted
3.18.13-cloudflare #1
[Mon May 25 05:29:33 2015] Hardware name: Quanta Computer Inc D51B-2U
(dual 1G LoM)/S2B-MB (dual 1G LoM), BIOS S2B_3A17 11/07/2014
[Mon May 25 05:29:34 2015] task: ffff880fb4605580 ti: ffff880f7ca54000
task.ti: ffff880f7ca54000
[Mon May 25 05:29:34 2015] RIP: 0010:[<ffffffff8112346c>]
[<ffffffff8112346c>] migrate_misplaced_page+0xeb/0x2a1
[Mon May 25 05:29:34 2015] RSP: 0000:ffff880f7ca57d28  EFLAGS: 00010246
[Mon May 25 05:29:34 2015] RAX: 0000000000000000 RBX: ffffea0015414000
RCX: 0000000000000000
[Mon May 25 05:29:34 2015] RDX: 0000000000000000 RSI: ffff88207fc0c1a8
RDI: 0000000000000540
[Mon May 25 05:29:34 2015] RBP: ffff88207ffd7000 R08: 0000000000000000
R09: 0000000000000000
[Mon May 25 05:29:34 2015] R10: ffffffff81678b40 R11: ffff88207ff9aa00
R12: ffff880f7ca57d38
[Mon May 25 05:29:34 2015] R13: 0000000000000001 R14: 0000000000000000
R15: 0000000000000000
[Mon May 25 05:29:34 2015] FS:  00007fcb89855700(0000)
GS:ffff88207fc00000(0000) knlGS:0000000000000000
[Mon May 25 05:29:34 2015] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[Mon May 25 05:29:34 2015] CR2: 00007fcb88859008 CR3: 0000000f6a3f9000
CR4: 00000000001407e0
[Mon May 25 05:29:34 2015] Stack:
[Mon May 25 05:29:34 2015]  0000000000000000 ffffea0015414000
ffff880f7ca57d38 ffff880f7ca57d38
[Mon May 25 05:29:34 2015]  0000000000000001 ffff880f6a3b8450
ffffea003fa43330 0000000000000001
[Mon May 25 05:29:34 2015]  ffffea0015414000 0000000000000000
0000000000000000 ffffffff81100c85
[Mon May 25 05:29:34 2015] Call Trace:
[Mon May 25 05:29:34 2015]  [<ffffffff81100c85>] ? handle_mm_fault+0x945/0xa62
[Mon May 25 05:29:34 2015]  [<ffffffff81105bc6>] ? change_protection+0x12a/0x580
[Mon May 25 05:29:34 2015]  [<ffffffff81034502>] ? __do_page_fault+0x2bf/0x395
[Mon May 25 05:29:34 2015]  [<ffffffff8112dcf6>] ? new_sync_write+0x6a/0x8e
[Mon May 25 05:29:34 2015]  [<ffffffff81158cfe>] ? fsnotify+0x276/0x2bf
[Mon May 25 05:29:34 2015]  [<ffffffff81061467>] ? vtime_account_user+0x35/0x40
[Mon May 25 05:29:34 2015]  [<ffffffff8103460f>] ? do_page_fault+0x37/0x58
[Mon May 25 05:29:34 2015]  [<ffffffff81491082>] ? page_fault+0x22/0x30
[Mon May 25 05:29:34 2015] Code: a5 00 00 00 48 ff c0 48 89 85 b8 5f
02 00 48 8b 03 f6 c4 40 74 17 83 7b 68 00 74 11 48 c7 c6 79 71 7f 81
48 89 df e8 b1 86 fd ff <0f> 0b 48 8b 03 31 c9 f6 c4 40 74 03 8b 4b 68
8b 85 40 5f 02 00
[Mon May 25 05:29:34 2015] RIP  [<ffffffff8112346c>]
migrate_misplaced_page+0xeb/0x2a1
[Mon May 25 05:29:34 2015]  RSP <ffff880f7ca57d28>
[Mon May 25 05:29:34 2015] ---[ end trace 83fa2f6761648dbd ]---
[Mon May 25 05:29:34 2015] device bond0.100 left promiscuous mode
[Mon May 25 05:29:34 2015] device bond0 left promiscuous mode
[Mon May 25 05:29:34 2015] device eth2 left promiscuous mode
[Mon May 25 05:29:34 2015] device eth3 left promiscuous mode
[Mon May 25 05:29:46 2015] device bond0.100 entered promiscuous mode
[Mon May 25 05:29:46 2015] device bond0 entered promiscuous mode
[Mon May 25 05:29:46 2015] device eth2 entered promiscuous mode
[Mon May 25 05:29:46 2015] device eth3 entered promiscuous mode

Thanks.

^ permalink raw reply	[flat|nested] 7+ messages in thread

* Re: kernel bug(VM_BUG_ON_PAGE) with 3.18.13 in mm/migrate.c
  2015-05-27 18:05 kernel bug(VM_BUG_ON_PAGE) with 3.18.13 in mm/migrate.c Jovi Zhangwei
@ 2015-05-27 18:42 ` Jovi Zhangwei
  2015-05-28 12:00 ` Mel Gorman
  1 sibling, 0 replies; 7+ messages in thread
From: Jovi Zhangwei @ 2015-05-27 18:42 UTC (permalink / raw)
  To: linux-kernel, Mel Gorman, sasha.levin, n-horiguchi, akpm,
	Hugh Dickins, linux-mm, vbabka, rientjes

On Wed, May 27, 2015 at 11:05 AM, Jovi Zhangwei <jovi@cloudflare.com> wrote:
> Hi,
>
> I got below kernel bug error in our 3.18.13 stable kernel.
> "kernel BUG at mm/migrate.c:1661!"
>
> Source code:
>
> 1657    static int numamigrate_isolate_page(pg_data_t *pgdat, struct page *page)
> 1658   {
> 1659            int page_lru;
> 1660
> 1661           VM_BUG_ON_PAGE(compound_order(page) &&
> !PageTransHuge(page), page);
>
> It's easy to trigger the error by run tcpdump in our system.(not sure
> it will easily be reproduced in another system)
> "sudo tcpdump -i bond0.100 'tcp port 4242' -c 100000000000 -w 4242.pcap"
>
> Any comments for this bug would be great appreciated. thanks.
>
> -----------------------------------------------------------------
> dmesg:
>
> [Mon May 25 05:29:33 2015] page:ffffea0015414000 count:66 mapcount:1
> mapping:          (null) index:0x0
> [Mon May 25 05:29:33 2015] flags: 0x20047580004000(head)
> [Mon May 25 05:29:33 2015] page dumped because:
> VM_BUG_ON_PAGE(compound_order(page) && !PageTransHuge(page))
> [Mon May 25 05:29:33 2015] ------------[ cut here ]------------
> [Mon May 25 05:29:33 2015] kernel BUG at mm/migrate.c:1661!
> [Mon May 25 05:29:33 2015] invalid opcode: 0000 [#1] SMP
> [Mon May 25 05:29:33 2015] Modules linked in: veth xt_comment xt_CT
> iptable_raw xt_addrtype ipt_MASQUERADE nf_nat_masquerade_ipv4
> iptable_nat nf_nat_ipv4 nf_nat bridge overlay tcp_cubic binfmt_misc
> nf_conntrack_ipv6 nf_defrag_ipv6 xt_tcpudp ip6table_filter ip6_tables
> nf_conntrack_ipv4 nf_defrag_ipv4 xt_conntrack nf_conntrack
> xt_multiport iptable_filter ip_tables x_tables rpcsec_gss_krb5
> auth_rpcgss oid_registry nfsv4 nfs lockd grace sunrpc fscache ses
> enclosure 8021q garp stp llc bonding ext4 crc16 jbd2 mbcache sg sd_mod
> ipmi_watchdog x86_pkg_temp_thermal coretemp kvm_intel iTCO_wdt evdev
> kvm crc32c_intel aesni_intel aes_x86_64 lrw gf128mul glue_helper
> ablk_helper cryptd ahci libahci ehci_pci mpt3sas raid_class ehci_hcd
> ixgbe libata igb scsi_transport_sas mdio usbcore ptp lpc_ich i2c_i801
> mfd_core i2c_algo_bit
> [Mon May 25 05:29:33 2015]  pps_core usb_common scsi_mod dca i2c_core
> wmi acpi_pad acpi_cpufreq md_mod processor thermal_sys button ipmi_si
> ipmi_poweroff ipmi_devintf ipmi_msghandler autofs4
> [Mon May 25 05:29:33 2015] CPU: 8 PID: 25835 Comm: tcpdump Not tainted
> 3.18.13-cloudflare #1
> [Mon May 25 05:29:33 2015] Hardware name: Quanta Computer Inc D51B-2U
> (dual 1G LoM)/S2B-MB (dual 1G LoM), BIOS S2B_3A17 11/07/2014
> [Mon May 25 05:29:34 2015] task: ffff880fb4605580 ti: ffff880f7ca54000
> task.ti: ffff880f7ca54000
> [Mon May 25 05:29:34 2015] RIP: 0010:[<ffffffff8112346c>]
> [<ffffffff8112346c>] migrate_misplaced_page+0xeb/0x2a1
> [Mon May 25 05:29:34 2015] RSP: 0000:ffff880f7ca57d28  EFLAGS: 00010246
> [Mon May 25 05:29:34 2015] RAX: 0000000000000000 RBX: ffffea0015414000
> RCX: 0000000000000000
> [Mon May 25 05:29:34 2015] RDX: 0000000000000000 RSI: ffff88207fc0c1a8
> RDI: 0000000000000540
> [Mon May 25 05:29:34 2015] RBP: ffff88207ffd7000 R08: 0000000000000000
> R09: 0000000000000000
> [Mon May 25 05:29:34 2015] R10: ffffffff81678b40 R11: ffff88207ff9aa00
> R12: ffff880f7ca57d38
> [Mon May 25 05:29:34 2015] R13: 0000000000000001 R14: 0000000000000000
> R15: 0000000000000000
> [Mon May 25 05:29:34 2015] FS:  00007fcb89855700(0000)
> GS:ffff88207fc00000(0000) knlGS:0000000000000000
> [Mon May 25 05:29:34 2015] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
> [Mon May 25 05:29:34 2015] CR2: 00007fcb88859008 CR3: 0000000f6a3f9000
> CR4: 00000000001407e0
> [Mon May 25 05:29:34 2015] Stack:
> [Mon May 25 05:29:34 2015]  0000000000000000 ffffea0015414000
> ffff880f7ca57d38 ffff880f7ca57d38
> [Mon May 25 05:29:34 2015]  0000000000000001 ffff880f6a3b8450
> ffffea003fa43330 0000000000000001
> [Mon May 25 05:29:34 2015]  ffffea0015414000 0000000000000000
> 0000000000000000 ffffffff81100c85
> [Mon May 25 05:29:34 2015] Call Trace:
> [Mon May 25 05:29:34 2015]  [<ffffffff81100c85>] ? handle_mm_fault+0x945/0xa62
> [Mon May 25 05:29:34 2015]  [<ffffffff81105bc6>] ? change_protection+0x12a/0x580
> [Mon May 25 05:29:34 2015]  [<ffffffff81034502>] ? __do_page_fault+0x2bf/0x395
> [Mon May 25 05:29:34 2015]  [<ffffffff8112dcf6>] ? new_sync_write+0x6a/0x8e
> [Mon May 25 05:29:34 2015]  [<ffffffff81158cfe>] ? fsnotify+0x276/0x2bf
> [Mon May 25 05:29:34 2015]  [<ffffffff81061467>] ? vtime_account_user+0x35/0x40
> [Mon May 25 05:29:34 2015]  [<ffffffff8103460f>] ? do_page_fault+0x37/0x58
> [Mon May 25 05:29:34 2015]  [<ffffffff81491082>] ? page_fault+0x22/0x30
> [Mon May 25 05:29:34 2015] Code: a5 00 00 00 48 ff c0 48 89 85 b8 5f
> 02 00 48 8b 03 f6 c4 40 74 17 83 7b 68 00 74 11 48 c7 c6 79 71 7f 81
> 48 89 df e8 b1 86 fd ff <0f> 0b 48 8b 03 31 c9 f6 c4 40 74 03 8b 4b 68
> 8b 85 40 5f 02 00
> [Mon May 25 05:29:34 2015] RIP  [<ffffffff8112346c>]
> migrate_misplaced_page+0xeb/0x2a1
> [Mon May 25 05:29:34 2015]  RSP <ffff880f7ca57d28>
> [Mon May 25 05:29:34 2015] ---[ end trace 83fa2f6761648dbd ]---
> [Mon May 25 05:29:34 2015] device bond0.100 left promiscuous mode
> [Mon May 25 05:29:34 2015] device bond0 left promiscuous mode
> [Mon May 25 05:29:34 2015] device eth2 left promiscuous mode
> [Mon May 25 05:29:34 2015] device eth3 left promiscuous mode
> [Mon May 25 05:29:46 2015] device bond0.100 entered promiscuous mode
> [Mon May 25 05:29:46 2015] device bond0 entered promiscuous mode
> [Mon May 25 05:29:46 2015] device eth2 entered promiscuous mode
> [Mon May 25 05:29:46 2015] device eth3 entered promiscuous mode
>
> Thanks.

Hi Mel,

Would you please give me some hints on this bug? it seems very wried
that the page is compounded but is not hugepage.

Thanks.

^ permalink raw reply	[flat|nested] 7+ messages in thread

* Re: kernel bug(VM_BUG_ON_PAGE) with 3.18.13 in mm/migrate.c
  2015-05-27 18:05 kernel bug(VM_BUG_ON_PAGE) with 3.18.13 in mm/migrate.c Jovi Zhangwei
  2015-05-27 18:42 ` Jovi Zhangwei
@ 2015-05-28 12:00 ` Mel Gorman
  2015-05-28 18:38   ` Jovi Zhangwei
  1 sibling, 1 reply; 7+ messages in thread
From: Mel Gorman @ 2015-05-28 12:00 UTC (permalink / raw)
  To: Jovi Zhangwei
  Cc: linux-kernel, sasha.levin, n-horiguchi, akpm, hughd, linux-mm,
	vbabka, rientjes

On Wed, May 27, 2015 at 11:05:33AM -0700, Jovi Zhangwei wrote:
> Hi,
> 
> I got below kernel bug error in our 3.18.13 stable kernel.
> "kernel BUG at mm/migrate.c:1661!"
> 
> Source code:
> 
> 1657    static int numamigrate_isolate_page(pg_data_t *pgdat, struct page *page)
> 1658   {
> 1659            int page_lru;
> 1660
> 1661           VM_BUG_ON_PAGE(compound_order(page) &&
> !PageTransHuge(page), page);
> 
> It's easy to trigger the error by run tcpdump in our system.(not sure
> it will easily be reproduced in another system)
> "sudo tcpdump -i bond0.100 'tcp port 4242' -c 100000000000 -w 4242.pcap"
> 
> Any comments for this bug would be great appreciated. thanks.
> 

What sort of compound page is it? What sort of VMA is it in? hugetlbfs
pages should never be tagged for NUMA migrate and never enter this
path. Transparent huge pages are handled properly so I'm wondering
exactly what type of compound page this is and what mapped it into
userspace.

-- 
Mel Gorman
SUSE Labs

^ permalink raw reply	[flat|nested] 7+ messages in thread

* Re: kernel bug(VM_BUG_ON_PAGE) with 3.18.13 in mm/migrate.c
  2015-05-28 12:00 ` Mel Gorman
@ 2015-05-28 18:38   ` Jovi Zhangwei
  2015-05-31  1:39     ` Jovi Zhangwei
  2015-06-02  7:19     ` Mel Gorman
  0 siblings, 2 replies; 7+ messages in thread
From: Jovi Zhangwei @ 2015-05-28 18:38 UTC (permalink / raw)
  To: Mel Gorman
  Cc: linux-kernel, sasha.levin, n-horiguchi, akpm, Hugh Dickins,
	linux-mm, vbabka, rientjes

Hi Mel,

On Thu, May 28, 2015 at 5:00 AM, Mel Gorman <mgorman@suse.de> wrote:
> On Wed, May 27, 2015 at 11:05:33AM -0700, Jovi Zhangwei wrote:
>> Hi,
>>
>> I got below kernel bug error in our 3.18.13 stable kernel.
>> "kernel BUG at mm/migrate.c:1661!"
>>
>> Source code:
>>
>> 1657    static int numamigrate_isolate_page(pg_data_t *pgdat, struct page *page)
>> 1658   {
>> 1659            int page_lru;
>> 1660
>> 1661           VM_BUG_ON_PAGE(compound_order(page) &&
>> !PageTransHuge(page), page);
>>
>> It's easy to trigger the error by run tcpdump in our system.(not sure
>> it will easily be reproduced in another system)
>> "sudo tcpdump -i bond0.100 'tcp port 4242' -c 100000000000 -w 4242.pcap"
>>
>> Any comments for this bug would be great appreciated. thanks.
>>
>
> What sort of compound page is it? What sort of VMA is it in? hugetlbfs
> pages should never be tagged for NUMA migrate and never enter this
> path. Transparent huge pages are handled properly so I'm wondering
> exactly what type of compound page this is and what mapped it into
> userspace.
>
Thanks for your reply.

After reading net/packet/af_packet.c:alloc_one_pg_vec_page, I found
there indeed have compound page maped into userspace.

I sent a patch for this issue(you may received it), but not sure it's
right to fix,
feel free to update it or use your own patch.

Thanks.

--------------------------------------------------------------------------------------------

[PATCH] mm/migrate: Avoid migrate mmaped compound pages

Below kernel vm bug can be triggered by tcpdump which mmaped a lot of
pages with GFP_COMP flag.

[Mon May 25 05:29:33 2015] page:ffffea0015414000 count:66 mapcount:1
mapping:          (null) index:0x0
[Mon May 25 05:29:33 2015] flags: 0x20047580004000(head)
[Mon May 25 05:29:33 2015] page dumped because:
VM_BUG_ON_PAGE(compound_order(page) && !PageTransHuge(page))
[Mon May 25 05:29:33 2015] ------------[ cut here ]------------
[Mon May 25 05:29:33 2015] kernel BUG at mm/migrate.c:1661!
[Mon May 25 05:29:33 2015] invalid opcode: 0000 [#1] SMP

The fix is simply disallow migrate mmaped compound pages, return 0 instead of
report vm bug.

Signed-off-by: Jovi Zhangwei <jovi.zhangwei@gmail.com>
---
 mm/migrate.c | 3 ++-
 1 file changed, 2 insertions(+), 1 deletion(-)

diff --git a/mm/migrate.c b/mm/migrate.c
index f53838f..839adef 100644
--- a/mm/migrate.c
+++ b/mm/migrate.c
@@ -1606,7 +1606,8 @@ static int numamigrate_isolate_page(pg_data_t
*pgdat, struct page *page)
 {
  int page_lru;

- VM_BUG_ON_PAGE(compound_order(page) && !PageTransHuge(page), page);
+ if (compound_order(page) && !PageTransHuge(page))
+ return 0;

  /* Avoid migrating to a node that is nearly full */
  if (!migrate_balanced_pgdat(pgdat, 1UL << compound_order(page)))
-- 
1.9.1

^ permalink raw reply related	[flat|nested] 7+ messages in thread

* Re: kernel bug(VM_BUG_ON_PAGE) with 3.18.13 in mm/migrate.c
  2015-05-28 18:38   ` Jovi Zhangwei
@ 2015-05-31  1:39     ` Jovi Zhangwei
  2015-06-02  7:19     ` Mel Gorman
  1 sibling, 0 replies; 7+ messages in thread
From: Jovi Zhangwei @ 2015-05-31  1:39 UTC (permalink / raw)
  To: Mel Gorman
  Cc: linux-kernel, sasha.levin, n-horiguchi, akpm, Hugh Dickins,
	linux-mm, vbabka, rientjes

On Thu, May 28, 2015 at 11:38 AM, Jovi Zhangwei <jovi@cloudflare.com> wrote:
> Hi Mel,
>
> On Thu, May 28, 2015 at 5:00 AM, Mel Gorman <mgorman@suse.de> wrote:
>> On Wed, May 27, 2015 at 11:05:33AM -0700, Jovi Zhangwei wrote:
>>> Hi,
>>>
>>> I got below kernel bug error in our 3.18.13 stable kernel.
>>> "kernel BUG at mm/migrate.c:1661!"
>>>
>>> Source code:
>>>
>>> 1657    static int numamigrate_isolate_page(pg_data_t *pgdat, struct page *page)
>>> 1658   {
>>> 1659            int page_lru;
>>> 1660
>>> 1661           VM_BUG_ON_PAGE(compound_order(page) &&
>>> !PageTransHuge(page), page);
>>>
>>> It's easy to trigger the error by run tcpdump in our system.(not sure
>>> it will easily be reproduced in another system)
>>> "sudo tcpdump -i bond0.100 'tcp port 4242' -c 100000000000 -w 4242.pcap"
>>>
>>> Any comments for this bug would be great appreciated. thanks.
>>>
>>
>> What sort of compound page is it? What sort of VMA is it in? hugetlbfs
>> pages should never be tagged for NUMA migrate and never enter this
>> path. Transparent huge pages are handled properly so I'm wondering
>> exactly what type of compound page this is and what mapped it into
>> userspace.
>>
> Thanks for your reply.
>
> After reading net/packet/af_packet.c:alloc_one_pg_vec_page, I found
> there indeed have compound page maped into userspace.
>
> I sent a patch for this issue(you may received it), but not sure it's
> right to fix,
> feel free to update it or use your own patch.
>
ping?

^ permalink raw reply	[flat|nested] 7+ messages in thread

* Re: kernel bug(VM_BUG_ON_PAGE) with 3.18.13 in mm/migrate.c
  2015-05-28 18:38   ` Jovi Zhangwei
  2015-05-31  1:39     ` Jovi Zhangwei
@ 2015-06-02  7:19     ` Mel Gorman
  1 sibling, 0 replies; 7+ messages in thread
From: Mel Gorman @ 2015-06-02  7:19 UTC (permalink / raw)
  To: Jovi Zhangwei
  Cc: linux-kernel, sasha.levin, n-horiguchi, akpm, Hugh Dickins,
	linux-mm, vbabka, rientjes

On Thu, May 28, 2015 at 11:38:36AM -0700, Jovi Zhangwei wrote:
> Hi Mel,
> 
> On Thu, May 28, 2015 at 5:00 AM, Mel Gorman <mgorman@suse.de> wrote:
> > On Wed, May 27, 2015 at 11:05:33AM -0700, Jovi Zhangwei wrote:
> >> Hi,
> >>
> >> I got below kernel bug error in our 3.18.13 stable kernel.
> >> "kernel BUG at mm/migrate.c:1661!"
> >>
> >> Source code:
> >>
> >> 1657    static int numamigrate_isolate_page(pg_data_t *pgdat, struct page *page)
> >> 1658   {
> >> 1659            int page_lru;
> >> 1660
> >> 1661           VM_BUG_ON_PAGE(compound_order(page) &&
> >> !PageTransHuge(page), page);
> >>
> >> It's easy to trigger the error by run tcpdump in our system.(not sure
> >> it will easily be reproduced in another system)
> >> "sudo tcpdump -i bond0.100 'tcp port 4242' -c 100000000000 -w 4242.pcap"
> >>
> >> Any comments for this bug would be great appreciated. thanks.
> >>
> >
> > What sort of compound page is it? What sort of VMA is it in? hugetlbfs
> > pages should never be tagged for NUMA migrate and never enter this
> > path. Transparent huge pages are handled properly so I'm wondering
> > exactly what type of compound page this is and what mapped it into
> > userspace.
> >
> Thanks for your reply.
> 
> After reading net/packet/af_packet.c:alloc_one_pg_vec_page, I found
> there indeed have compound page maped into userspace.
> 

Ok, it's clear now. Thanks very much.

> I sent a patch for this issue(you may received it), but not sure it's
> right to fix,
> feel free to update it or use your own patch.
> 

It avoids the problem but it's not the best fix because a lot of useless
overhead has been incurred for a page that can never be migrated. Can you
try the following instead please?

---8<---

sched, numa: Do not hint for NUMA balancing on VM_MIXEDMAP mappings

Jovi Zhangwei reported the following problem

  Below kernel vm bug can be triggered by tcpdump which mmaped a lot of pages
  with GFP_COMP flag.

  [Mon May 25 05:29:33 2015] page:ffffea0015414000 count:66 mapcount:1 mapping:          (null) index:0x0
  [Mon May 25 05:29:33 2015] flags: 0x20047580004000(head)
  [Mon May 25 05:29:33 2015] page dumped because: VM_BUG_ON_PAGE(compound_order(page) && !PageTransHuge(page))
  [Mon May 25 05:29:33 2015] ------------[ cut here ]------------
  [Mon May 25 05:29:33 2015] kernel BUG at mm/migrate.c:1661!
  [Mon May 25 05:29:33 2015] invalid opcode: 0000 [#1] SMP

Compound pages cannot be migrated and it was not expected that such pages
be marked for NUMA balancing. This did not take into account that drivers
such as net/packet/af_packet.c may insert compound pages into userspace
with vm_insert_page. This patch tells the NUMA balancing protection scanner
to skip all VM_MIXEDMAP mappings which avoids the possibility that compound
pages are marked for migration.

Signed-off-by: Mel Gorman <mgorman@suse.de>
Reported-by: Jovi Zhangwei <jovi@cloudflare.com>
---
 kernel/sched/fair.c | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/kernel/sched/fair.c b/kernel/sched/fair.c
index 241213be507c..486d00c408b0 100644
--- a/kernel/sched/fair.c
+++ b/kernel/sched/fair.c
@@ -2166,7 +2166,7 @@ void task_numa_work(struct callback_head *work)
 	}
 	for (; vma; vma = vma->vm_next) {
 		if (!vma_migratable(vma) || !vma_policy_mof(vma) ||
-			is_vm_hugetlb_page(vma)) {
+			is_vm_hugetlb_page(vma) || (vma->vm_flags & VM_MIXEDMAP)) {
 			continue;
 		}
 

^ permalink raw reply related	[flat|nested] 7+ messages in thread

* kernel bug(VM_BUG_ON_PAGE) with 3.18.13 in mm/migrate.c
@ 2015-05-26 23:44 Jovi Zhangwei
  0 siblings, 0 replies; 7+ messages in thread
From: Jovi Zhangwei @ 2015-05-26 23:44 UTC (permalink / raw)
  To: linux-kernel, mgorman, sasha.levin, n-horiguchi, akpm, hughd,
	linux-mm, vbabka, rientjes

Hi,

I got below kernel bug error in our 3.18.13 stable kernel.
"kernel BUG at mm/migrate.c:1661!"

Source code:

1657    static int numamigrate_isolate_page(pg_data_t *pgdat, struct page *page)
1658   {
1659            int page_lru;
1660
1661           VM_BUG_ON_PAGE(compound_order(page) &&
!PageTransHuge(page), page);

It's easy to trigger the error by run tcpdump in our system.(not sure
it will easily be reproduced in another system)
"sudo tcpdump -i bond0.100 'tcp port 4242' -c 100000000000 -w 4242.pcap"

Any comments for this bug would be great appreciated. thanks.


dmesg:

[Mon May 25 05:29:33 2015] page:ffffea0015414000 count:66 mapcount:1
mapping:          (null) index:0x0
[Mon May 25 05:29:33 2015] flags: 0x20047580004000(head)
[Mon May 25 05:29:33 2015] page dumped because:
VM_BUG_ON_PAGE(compound_order(page) && !PageTransHuge(page))
[Mon May 25 05:29:33 2015] ------------[ cut here ]------------
[Mon May 25 05:29:33 2015] kernel BUG at mm/migrate.c:1661!
[Mon May 25 05:29:33 2015] invalid opcode: 0000 [#1] SMP
[Mon May 25 05:29:33 2015] Modules linked in: veth xt_comment xt_CT
iptable_raw xt_addrtype ipt_MASQUERADE nf_nat_masquerade_ipv4
iptable_nat nf_nat_ipv4 nf_nat bridge overlay tcp_cubic binfmt_misc
nf_conntrack_ipv6 nf_defrag_ipv6 xt_tcpudp ip6table_filter ip6_tables
nf_conntrack_ipv4 nf_defrag_ipv4 xt_conntrack nf_conntrack
xt_multiport iptable_filter ip_tables x_tables rpcsec_gss_krb5
auth_rpcgss oid_registry nfsv4 nfs lockd grace sunrpc fscache ses
enclosure 8021q garp stp llc bonding ext4 crc16 jbd2 mbcache sg sd_mod
ipmi_watchdog x86_pkg_temp_thermal coretemp kvm_intel iTCO_wdt evdev
kvm crc32c_intel aesni_intel aes_x86_64 lrw gf128mul glue_helper
ablk_helper cryptd ahci libahci ehci_pci mpt3sas raid_class ehci_hcd
ixgbe libata igb scsi_transport_sas mdio usbcore ptp lpc_ich i2c_i801
mfd_core i2c_algo_bit
[Mon May 25 05:29:33 2015]  pps_core usb_common scsi_mod dca i2c_core
wmi acpi_pad acpi_cpufreq md_mod processor thermal_sys button ipmi_si
ipmi_poweroff ipmi_devintf ipmi_msghandler autofs4
[Mon May 25 05:29:33 2015] CPU: 8 PID: 25835 Comm: tcpdump Not tainted
3.18.13-cloudflare #1
[Mon May 25 05:29:33 2015] Hardware name: Quanta Computer Inc D51B-2U
(dual 1G LoM)/S2B-MB (dual 1G LoM), BIOS S2B_3A17 11/07/2014
[Mon May 25 05:29:34 2015] task: ffff880fb4605580 ti: ffff880f7ca54000
task.ti: ffff880f7ca54000
[Mon May 25 05:29:34 2015] RIP: 0010:[<ffffffff8112346c>]
[<ffffffff8112346c>] migrate_misplaced_page+0xeb/0x2a1
[Mon May 25 05:29:34 2015] RSP: 0000:ffff880f7ca57d28  EFLAGS: 00010246
[Mon May 25 05:29:34 2015] RAX: 0000000000000000 RBX: ffffea0015414000
RCX: 0000000000000000
[Mon May 25 05:29:34 2015] RDX: 0000000000000000 RSI: ffff88207fc0c1a8
RDI: 0000000000000540
[Mon May 25 05:29:34 2015] RBP: ffff88207ffd7000 R08: 0000000000000000
R09: 0000000000000000
[Mon May 25 05:29:34 2015] R10: ffffffff81678b40 R11: ffff88207ff9aa00
R12: ffff880f7ca57d38
[Mon May 25 05:29:34 2015] R13: 0000000000000001 R14: 0000000000000000
R15: 0000000000000000
[Mon May 25 05:29:34 2015] FS:  00007fcb89855700(0000)
GS:ffff88207fc00000(0000) knlGS:0000000000000000
[Mon May 25 05:29:34 2015] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[Mon May 25 05:29:34 2015] CR2: 00007fcb88859008 CR3: 0000000f6a3f9000
CR4: 00000000001407e0
[Mon May 25 05:29:34 2015] Stack:
[Mon May 25 05:29:34 2015]  0000000000000000 ffffea0015414000
ffff880f7ca57d38 ffff880f7ca57d38
[Mon May 25 05:29:34 2015]  0000000000000001 ffff880f6a3b8450
ffffea003fa43330 0000000000000001
[Mon May 25 05:29:34 2015]  ffffea0015414000 0000000000000000
0000000000000000 ffffffff81100c85
[Mon May 25 05:29:34 2015] Call Trace:
[Mon May 25 05:29:34 2015]  [<ffffffff81100c85>] ? handle_mm_fault+0x945/0xa62
[Mon May 25 05:29:34 2015]  [<ffffffff81105bc6>] ? change_protection+0x12a/0x580
[Mon May 25 05:29:34 2015]  [<ffffffff81034502>] ? __do_page_fault+0x2bf/0x395
[Mon May 25 05:29:34 2015]  [<ffffffff8112dcf6>] ? new_sync_write+0x6a/0x8e
[Mon May 25 05:29:34 2015]  [<ffffffff81158cfe>] ? fsnotify+0x276/0x2bf
[Mon May 25 05:29:34 2015]  [<ffffffff81061467>] ? vtime_account_user+0x35/0x40
[Mon May 25 05:29:34 2015]  [<ffffffff8103460f>] ? do_page_fault+0x37/0x58
[Mon May 25 05:29:34 2015]  [<ffffffff81491082>] ? page_fault+0x22/0x30
[Mon May 25 05:29:34 2015] Code: a5 00 00 00 48 ff c0 48 89 85 b8 5f
02 00 48 8b 03 f6 c4 40 74 17 83 7b 68 00 74 11 48 c7 c6 79 71 7f 81
48 89 df e8 b1 86 fd ff <0f> 0b 48 8b 03 31 c9 f6 c4 40 74 03 8b 4b 68
8b 85 40 5f 02 00
[Mon May 25 05:29:34 2015] RIP  [<ffffffff8112346c>]
migrate_misplaced_page+0xeb/0x2a1
[Mon May 25 05:29:34 2015]  RSP <ffff880f7ca57d28>
[Mon May 25 05:29:34 2015] ---[ end trace 83fa2f6761648dbd ]---
[Mon May 25 05:29:34 2015] device bond0.100 left promiscuous mode
[Mon May 25 05:29:34 2015] device bond0 left promiscuous mode
[Mon May 25 05:29:34 2015] device eth2 left promiscuous mode
[Mon May 25 05:29:34 2015] device eth3 left promiscuous mode
[Mon May 25 05:29:46 2015] device bond0.100 entered promiscuous mode
[Mon May 25 05:29:46 2015] device bond0 entered promiscuous mode
[Mon May 25 05:29:46 2015] device eth2 entered promiscuous mode
[Mon May 25 05:29:46 2015] device eth3 entered promiscuous mode
[Mon May 25 05:29:49 2015] page:ffffea00190d3000 count:66 mapcount:1
mapping:          (null) index:0x0
[Mon May 25 05:29:49 2015] flags: 0x20050100004000(head)
[Mon May 25 05:29:49 2015] page dumped because:
VM_BUG_ON_PAGE(compound_order(page) && !PageTransHuge(page))
[Mon May 25 05:29:49 2015] ------------[ cut here ]------------
[Mon May 25 05:29:49 2015] kernel BUG at mm/migrate.c:1661!
[Mon May 25 05:29:49 2015] invalid opcode: 0000 [#2] SMP
[Mon May 25 05:29:49 2015] Modules linked in: veth xt_comment xt_CT
iptable_raw xt_addrtype ipt_MASQUERADE nf_nat_masquerade_ipv4
iptable_nat nf_nat_ipv4 nf_nat bridge overlay tcp_cubic binfmt_misc
nf_conntrack_ipv6 nf_defrag_ipv6 xt_tcpudp ip6table_filter ip6_tables
nf_conntrack_ipv4 nf_defrag_ipv4 xt_conntrack nf_conntrack
xt_multiport iptable_filter ip_tables x_tables rpcsec_gss_krb5
auth_rpcgss oid_registry nfsv4 nfs lockd grace sunrpc fscache ses
enclosure 8021q garp stp llc bonding ext4 crc16 jbd2 mbcache sg sd_mod
ipmi_watchdog x86_pkg_temp_thermal coretemp kvm_intel iTCO_wdt evdev
kvm crc32c_intel aesni_intel aes_x86_64 lrw gf128mul glue_helper
ablk_helper cryptd ahci libahci ehci_pci mpt3sas raid_class ehci_hcd
ixgbe libata igb scsi_transport_sas mdio usbcore ptp lpc_ich i2c_i801
mfd_core i2c_algo_bit
[Mon May 25 05:29:49 2015]  pps_core usb_common scsi_mod dca i2c_core
wmi acpi_pad acpi_cpufreq md_mod processor thermal_sys button ipmi_si
ipmi_poweroff ipmi_devintf ipmi_msghandler autofs4
[Mon May 25 05:29:49 2015] CPU: 10 PID: 25858 Comm: tcpdump Tainted: G
     D        3.18.13-cloudflare #1
[Mon May 25 05:29:49 2015] Hardware name: Quanta Computer Inc D51B-2U
(dual 1G LoM)/S2B-MB (dual 1G LoM), BIOS S2B_3A17 11/07/2014
[Mon May 25 05:29:49 2015] task: ffff880fb4600e40 ti: ffff8805520ec000
task.ti: ffff8805520ec000
[Mon May 25 05:29:49 2015] RIP: 0010:[<ffffffff8112346c>]
[<ffffffff8112346c>] migrate_misplaced_page+0xeb/0x2a1
[Mon May 25 05:29:49 2015] RSP: 0000:ffff8805520efd28  EFLAGS: 00010246
[Mon May 25 05:29:49 2015] RAX: 0000000000000000 RBX: ffffea00190d3000
RCX: 000000000000649e
[Mon May 25 05:29:49 2015] RDX: 0000000000000000 RSI: 0000000000000296
RDI: 0000000000000900
[Mon May 25 05:29:49 2015] RBP: ffff88207ffd7000 R08: 0000000000000000
R09: 0000000000000000
[Mon May 25 05:29:49 2015] R10: 000000000000b4d0 R11: ffff88207ff9bad0
R12: ffff8805520efd38
[Mon May 25 05:29:49 2015] R13: 0000000000000001 R14: 0000000000000000
R15: 0000000000000000
[Mon May 25 05:29:49 2015] FS:  00007fd334ecd700(0000)
GS:ffff88207fc40000(0000) knlGS:0000000000000000
[Mon May 25 05:29:49 2015] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[Mon May 25 05:29:49 2015] CR2: 00007fd334051008 CR3: 0000000f6810a000
CR4: 00000000001407e0
[Mon May 25 05:29:49 2015] Stack:
[Mon May 25 05:29:49 2015]  0000000000000000 ffffea00190d3000
ffff8805520efd38 ffff8805520efd38
[Mon May 25 05:29:49 2015]  0000000000000001 ffff8805a875bd78
ffffea003daa4cf0 0000000000000001
[Mon May 25 05:29:49 2015]  ffffea00190d3000 0000000000000000
0000000000000000 ffffffff81100c85
[Mon May 25 05:29:49 2015] Call Trace:
[Mon May 25 05:29:49 2015]  [<ffffffff81100c85>] ? handle_mm_fault+0x945/0xa62
[Mon May 25 05:29:49 2015]  [<ffffffff81105bc6>] ? change_protection+0x12a/0x580
[Mon May 25 05:29:49 2015]  [<ffffffff81034502>] ? __do_page_fault+0x2bf/0x395
[Mon May 25 05:29:49 2015]  [<ffffffff8112dcf6>] ? new_sync_write+0x6a/0x8e
[Mon May 25 05:29:49 2015]  [<ffffffff81158cfe>] ? fsnotify+0x276/0x2bf
[Mon May 25 05:29:49 2015]  [<ffffffff81061467>] ? vtime_account_user+0x35/0x40
[Mon May 25 05:29:49 2015]  [<ffffffff8103460f>] ? do_page_fault+0x37/0x58
[Mon May 25 05:29:49 2015]  [<ffffffff81491082>] ? page_fault+0x22/0x30
[Mon May 25 05:29:49 2015] Code: a5 00 00 00 48 ff c0 48 89 85 b8 5f
02 00 48 8b 03 f6 c4 40 74 17 83 7b 68 00 74 11 48 c7 c6 79 71 7f 81
48 89 df e8 b1 86 fd ff <0f> 0b 48 8b 03 31 c9 f6 c4 40 74 03 8b 4b 68
8b 85 40 5f 02 00
[Mon May 25 05:29:49 2015] RIP  [<ffffffff8112346c>]
migrate_misplaced_page+0xeb/0x2a1
[Mon May 25 05:29:49 2015]  RSP <ffff8805520efd28>
[Mon May 25 05:29:49 2015] ---[ end trace 83fa2f6761648dbe ]---
[Mon May 25 05:29:49 2015] device bond0.100 left promiscuous mode
[Mon May 25 05:29:49 2015] device bond0 left promiscuous mode
[Mon May 25 05:29:49 2015] device eth2 left promiscuous mode
[Mon May 25 05:29:49 2015] device eth3 left promiscuous mode
[Mon May 25 05:30:07 2015] device bond0.100 entered promiscuous mode
[Mon May 25 05:30:07 2015] device bond0 entered promiscuous mode
[Mon May 25 05:30:07 2015] device eth2 entered promiscuous mode
[Mon May 25 05:30:07 2015] device eth3 entered promiscuous mode
[Mon May 25 05:30:42 2015] page:ffffea00153cf000 count:66 mapcount:1
mapping:          (null) index:0x0
[Mon May 25 05:30:42 2015] flags: 0x20050c80004000(head)
[Mon May 25 05:30:42 2015] page dumped because:
VM_BUG_ON_PAGE(compound_order(page) && !PageTransHuge(page))
[Mon May 25 05:30:42 2015] ------------[ cut here ]------------
[Mon May 25 05:30:42 2015] kernel BUG at mm/migrate.c:1661!
[Mon May 25 05:30:42 2015] invalid opcode: 0000 [#3]
[Mon May 25 05:30:42 2015] SMP

[Mon May 25 05:30:42 2015] Modules linked in:
[Mon May 25 05:30:42 2015]  veth
[Mon May 25 05:30:42 2015]  xt_comment
[Mon May 25 05:30:42 2015]  xt_CT
[Mon May 25 05:30:42 2015]  iptable_raw
[Mon May 25 05:30:42 2015]  xt_addrtype
[Mon May 25 05:30:42 2015]  ipt_MASQUERADE
[Mon May 25 05:30:42 2015]  nf_nat_masquerade_ipv4
[Mon May 25 05:30:42 2015]  iptable_nat
[Mon May 25 05:30:42 2015]  nf_nat_ipv4
[Mon May 25 05:30:42 2015]  nf_nat
[Mon May 25 05:30:42 2015]  bridge
[Mon May 25 05:30:42 2015]  overlay
[Mon May 25 05:30:42 2015]  tcp_cubic
[Mon May 25 05:30:42 2015]  binfmt_misc
[Mon May 25 05:30:42 2015]  nf_conntrack_ipv6
[Mon May 25 05:30:42 2015]  nf_defrag_ipv6
[Mon May 25 05:30:42 2015]  xt_tcpudp
[Mon May 25 05:30:42 2015]  ip6table_filter
[Mon May 25 05:30:42 2015]  ip6_tables
[Mon May 25 05:30:42 2015]  nf_conntrack_ipv4
[Mon May 25 05:30:42 2015]  nf_defrag_ipv4
[Mon May 25 05:30:42 2015]  xt_conntrack
[Mon May 25 05:30:42 2015]  nf_conntrack
[Mon May 25 05:30:42 2015]  xt_multiport
[Mon May 25 05:30:42 2015]  iptable_filter
[Mon May 25 05:30:42 2015]  ip_tables
[Mon May 25 05:30:42 2015]  x_tables
[Mon May 25 05:30:42 2015]  rpcsec_gss_krb5
[Mon May 25 05:30:42 2015]  auth_rpcgss
[Mon May 25 05:30:42 2015]  oid_registry
[Mon May 25 05:30:42 2015]  nfsv4
[Mon May 25 05:30:42 2015]  nfs
[Mon May 25 05:30:42 2015]  lockd
[Mon May 25 05:30:42 2015]  grace
[Mon May 25 05:30:42 2015]  sunrpc
[Mon May 25 05:30:42 2015]  fscache
[Mon May 25 05:30:42 2015]  ses
[Mon May 25 05:30:42 2015]  enclosure
[Mon May 25 05:30:42 2015]  8021q
[Mon May 25 05:30:42 2015]  garp
[Mon May 25 05:30:42 2015]  stp
[Mon May 25 05:30:42 2015]  llc
[Mon May 25 05:30:42 2015]  bonding
[Mon May 25 05:30:42 2015]  ext4
[Mon May 25 05:30:42 2015]  crc16
[Mon May 25 05:30:42 2015]  jbd2
[Mon May 25 05:30:42 2015]  mbcache
[Mon May 25 05:30:42 2015]  sg
[Mon May 25 05:30:42 2015]  sd_mod
[Mon May 25 05:30:42 2015]  ipmi_watchdog
[Mon May 25 05:30:42 2015]  x86_pkg_temp_thermal
[Mon May 25 05:30:42 2015]  coretemp kvm_intel iTCO_wdt evdev kvm
crc32c_intel aesni_intel aes_x86_64 lrw gf128mul glue_helper
ablk_helper cryptd ahci libahci ehci_pci mpt3sas raid_class ehci_hcd
ixgbe libata igb scsi_transport_sas mdio usbcore ptp lpc_ich i2c_i801
mfd_core i2c_algo_bit pps_core usb_common scsi_mod dca i2c_core wmi
acpi_pad acpi_cpufreq md_mod processor thermal_sys button ipmi_si
ipmi_poweroff ipmi_devintf ipmi_msghandler autofs4
[Mon May 25 05:30:42 2015] CPU: 10 PID: 25881 Comm: tcpdump Tainted: G
     D        3.18.13-cloudflare #1
[Mon May 25 05:30:42 2015] Hardware name: Quanta Computer Inc D51B-2U
(dual 1G LoM)/S2B-MB (dual 1G LoM), BIOS S2B_3A17 11/07/2014
[Mon May 25 05:30:42 2015] task: ffff880fb4601c80 ti: ffff880d790fc000
task.ti: ffff880d790fc000
[Mon May 25 05:30:42 2015] RIP: 0010:[<ffffffff8112346c>]
[<ffffffff8112346c>] migrate_misplaced_page+0xeb/0x2a1
[Mon May 25 05:30:42 2015] RSP: 0000:ffff880d790ffd28  EFLAGS: 00010246
[Mon May 25 05:30:42 2015] RAX: 0000000000000000 RBX: ffffea00153cf000
RCX: 000000000000670d
[Mon May 25 05:30:42 2015] RDX: 0000000000000000 RSI: 0000000000000296
RDI: 0000000000000520
[Mon May 25 05:30:42 2015] RBP: ffff88207ffd7000 R08: 0000000000000000
R09: 0000000000000000
[Mon May 25 05:30:42 2015] R10: 0000000000000000 R11: ffff88207ff9cb0c
R12: ffff880d790ffd38
[Mon May 25 05:30:42 2015] R13: 0000000000000001 R14: 0000000000000000
R15: 0000000000000000
[Mon May 25 05:30:42 2015] FS:  00007f9fa7c98700(0000)
GS:ffff88207fc40000(0000) knlGS:0000000000000000
[Mon May 25 05:30:42 2015] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[Mon May 25 05:30:42 2015] CR2: 00007f9fa6ddc008 CR3: 0000000fe9391000
CR4: 00000000001407e0
[Mon May 25 05:30:42 2015] Stack:
[Mon May 25 05:30:42 2015]  0000000000000001 ffffea00153cf000
ffff880d790ffd38 ffff880d790ffd38
[Mon May 25 05:30:42 2015]  ffffea00153cefc0 ffff880f89deeb80
ffffea003da83f70 0000000000000001
[Mon May 25 05:30:42 2015]  ffffea00153cf000 0000000000000000
0000000000000000 ffffffff81100c85
[Mon May 25 05:30:42 2015] Call Trace:
[Mon May 25 05:30:42 2015]  [<ffffffff81100c85>] ? handle_mm_fault+0x945/0xa62
[Mon May 25 05:30:42 2015]  [<ffffffff81034502>] ? __do_page_fault+0x2bf/0x395
[Mon May 25 05:30:42 2015]  [<ffffffff8112dcf6>] ? new_sync_write+0x6a/0x8e
[Mon May 25 05:30:42 2015]  [<ffffffff81158cfe>] ? fsnotify+0x276/0x2bf
[Mon May 25 05:30:42 2015]  [<ffffffff81061467>] ? vtime_account_user+0x35/0x40
[Mon May 25 05:30:42 2015]  [<ffffffff8103460f>] ? do_page_fault+0x37/0x58
[Mon May 25 05:30:42 2015]  [<ffffffff81491082>] ? page_fault+0x22/0x30
[Mon May 25 05:30:42 2015] Code: a5 00 00 00 48 ff c0 48 89 85 b8 5f
02 00 48 8b 03 f6 c4 40 74 17 83 7b 68 00 74 11 48 c7 c6 79 71 7f 81
48 89 df e8 b1 86 fd ff <0f> 0b 48 8b 03 31 c9 f6 c4 40 74 03 8b 4b 68
8b 85 40 5f 02 00
[Mon May 25 05:30:42 2015] RIP  [<ffffffff8112346c>]
migrate_misplaced_page+0xeb/0x2a1
[Mon May 25 05:30:42 2015]  RSP <ffff880d790ffd28>
[Mon May 25 05:30:42 2015] ---[ end trace 83fa2f6761648dbf ]---
[Mon May 25 05:30:42 2015] device bond0.100 left promiscuous mode
[Mon May 25 05:30:42 2015] device bond0 left promiscuous mode
[Mon May 25 05:30:42 2015] device eth2 left promiscuous mode
[Mon May 25 05:30:42 2015] device eth3 left promiscuous mode
[Mon May 25 05:32:03 2015] device bond0.100 entered promiscuous mode
[Mon May 25 05:32:03 2015] device bond0 entered promiscuous mode
[Mon May 25 05:32:03 2015] device eth2 entered promiscuous mode
[Mon May 25 05:32:03 2015] device eth3 entered promiscuous mode
[Mon May 25 05:32:29 2015] page:ffffea006c581000 count:66 mapcount:1
mapping:          (null) index:0x0
[Mon May 25 05:32:29 2015] flags: 0x6002ba00004000(head)
[Mon May 25 05:32:29 2015] page dumped because:
VM_BUG_ON_PAGE(compound_order(page) && !PageTransHuge(page))
[Mon May 25 05:32:29 2015] ------------[ cut here ]------------
[Mon May 25 05:32:29 2015] kernel BUG at mm/migrate.c:1661!
[Mon May 25 05:32:29 2015] invalid opcode: 0000 [#4] SMP
[Mon May 25 05:32:29 2015] Modules linked in: veth xt_comment xt_CT
iptable_raw xt_addrtype ipt_MASQUERADE nf_nat_masquerade_ipv4
iptable_nat nf_nat_ipv4 nf_nat bridge overlay tcp_cubic binfmt_misc
nf_conntrack_ipv6 nf_defrag_ipv6 xt_tcpudp ip6table_filter ip6_tables
nf_conntrack_ipv4 nf_defrag_ipv4 xt_conntrack nf_conntrack
xt_multiport iptable_filter ip_tables x_tables rpcsec_gss_krb5
auth_rpcgss oid_registry nfsv4 nfs lockd grace sunrpc fscache ses
enclosure 8021q garp stp llc bonding ext4 crc16 jbd2 mbcache sg sd_mod
ipmi_watchdog x86_pkg_temp_thermal coretemp kvm_intel iTCO_wdt evdev
kvm crc32c_intel aesni_intel aes_x86_64 lrw gf128mul glue_helper
ablk_helper cryptd ahci libahci ehci_pci mpt3sas raid_class ehci_hcd
ixgbe libata igb scsi_transport_sas mdio usbcore ptp lpc_ich i2c_i801
mfd_core i2c_algo_bit
[Mon May 25 05:32:29 2015]  pps_core usb_common scsi_mod dca i2c_core
wmi acpi_pad acpi_cpufreq md_mod processor thermal_sys button ipmi_si
ipmi_poweroff ipmi_devintf ipmi_msghandler autofs4
[Mon May 25 05:32:29 2015] CPU: 5 PID: 25972 Comm: tcpdump Tainted: G
    D        3.18.13-cloudflare #1
[Mon May 25 05:32:29 2015] Hardware name: Quanta Computer Inc D51B-2U
(dual 1G LoM)/S2B-MB (dual 1G LoM), BIOS S2B_3A17 11/07/2014
[Mon May 25 05:32:29 2015] task: ffff881dee014740 ti: ffff881d5eb3c000
task.ti: ffff881d5eb3c000
[Mon May 25 05:32:29 2015] RIP: 0010:[<ffffffff8112346c>]
[<ffffffff8112346c>] migrate_misplaced_page+0xeb/0x2a1
[Mon May 25 05:32:29 2015] RSP: 0000:ffff881d5eb3fd28  EFLAGS: 00010246
[Mon May 25 05:32:29 2015] RAX: 0000000000000000 RBX: ffffea006c581000
RCX: 0000000000006b84
[Mon May 25 05:32:29 2015] RDX: 0000000000000000 RSI: 0000000000000296
RDI: 0000000000000c40
[Mon May 25 05:32:29 2015] RBP: ffff88107ffda000 R08: 0000000000000000
R09: 0000000000000000
[Mon May 25 05:32:29 2015] R10: 000000000000bb40 R11: ffff88207ff9de94
R12: ffff881d5eb3fd38
[Mon May 25 05:32:29 2015] R13: 0000000000000000 R14: 0000000000000001
R15: 0000000000000000
[Mon May 25 05:32:29 2015] FS:  00007f31bdaf0700(0000)
GS:ffff88103fca0000(0000) knlGS:0000000000000000
[Mon May 25 05:32:29 2015] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[Mon May 25 05:32:29 2015] CR2: 00007f31bcc74008 CR3: 0000001d74b8c000
CR4: 00000000001407e0
[Mon May 25 05:32:29 2015] Stack:
[Mon May 25 05:32:29 2015]  0000000000000001 ffffea006c581000
ffff881d5eb3fd38 ffff881d5eb3fd38
[Mon May 25 05:32:29 2015]  ffffffffffffffff ffff880fe2332228
ffffea00757aadf0 0000000000000000
[Mon May 25 05:32:29 2015]  ffffea006c581000 0000000000000001
0000000000000000 ffffffff81100c85
[Mon May 25 05:32:29 2015] Call Trace:
[Mon May 25 05:32:29 2015]  [<ffffffff81100c85>] ? handle_mm_fault+0x945/0xa62
[Mon May 25 05:32:29 2015]  [<ffffffff81038cbc>] ? flush_tlb_mm_range+0xb5/0xdb
[Mon May 25 05:32:29 2015]  [<ffffffff81105bc6>] ? change_protection+0x12a/0x580
[Mon May 25 05:32:29 2015]  [<ffffffff81034502>] ? __do_page_fault+0x2bf/0x395
[Mon May 25 05:32:29 2015]  [<ffffffff81061467>] ? vtime_account_user+0x35/0x40
[Mon May 25 05:32:29 2015]  [<ffffffff8103460f>] ? do_page_fault+0x37/0x58
[Mon May 25 05:32:29 2015]  [<ffffffff81491082>] ? page_fault+0x22/0x30
[Mon May 25 05:32:29 2015] Code: a5 00 00 00 48 ff c0 48 89 85 b8 5f
02 00 48 8b 03 f6 c4 40 74 17 83 7b 68 00 74 11 48 c7 c6 79 71 7f 81
48 89 df e8 b1 86 fd ff <0f> 0b 48 8b 03 31 c9 f6 c4 40 74 03 8b 4b 68
8b 85 40 5f 02 00
[Mon May 25 05:32:29 2015] RIP  [<ffffffff8112346c>]
migrate_misplaced_page+0xeb/0x2a1
[Mon May 25 05:32:29 2015]  RSP <ffff881d5eb3fd28>
[Mon May 25 05:32:29 2015] ---[ end trace 83fa2f6761648dc0 ]---

^ permalink raw reply	[flat|nested] 7+ messages in thread

end of thread, other threads:[~2015-06-02  7:19 UTC | newest]

Thread overview: 7+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2015-05-27 18:05 kernel bug(VM_BUG_ON_PAGE) with 3.18.13 in mm/migrate.c Jovi Zhangwei
2015-05-27 18:42 ` Jovi Zhangwei
2015-05-28 12:00 ` Mel Gorman
2015-05-28 18:38   ` Jovi Zhangwei
2015-05-31  1:39     ` Jovi Zhangwei
2015-06-02  7:19     ` Mel Gorman
  -- strict thread matches above, loose matches on Subject: below --
2015-05-26 23:44 Jovi Zhangwei

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).