vhost_net panics under heavy load

asked 2014-09-05 12:16:27 -0600

I am seeing these oops messages on compute hosts with KVM under heavy network load (hadoop shuffle). Distribution is Ubuntu Precise, with kernel 3.2.0-67 and qemu-system-x86 2.0.0+dfsg-2ubuntu1.1~cloud0. Is it a known problem?

[693454.436345] BUG: unable to handle kernel NULL pointer dereference at 0000000000000030
[693454.446765] IP: [<ffffffff8117c7b9>] fput+0x9/0x30
[693454.451943] PGD 0 
[693454.456935] Oops: 0002 [#7] SMP 
[693454.461817] CPU 12 
[693454.461886] Modules linked in: ebtable_broute ebtable_filter veth bridge stp nf_conntrack_ipv6 nf_defrag_ipv6 xt_mac xt_tcpudp xt_state xt_physdev iptable_nat nf_nat nf_conntrack_ipv4 nf_conntrack nf_defrag_ipv4 ip6table_filter ip6_tables iptable_filter ip_tables ebtable_nat ebtables x_tables kvm_intel kvm nbd openvswitch(O) gre libcrc32c ib_iser rdma_cm ib_cm iw_cm ib_sa ib_mad ib_core ib_addr iscsi_tcp libiscsi_tcp libiscsi scsi_transport_iscsi bonding vesafb mei(C) sb_edac edac_core dm_multipath shpchp joydev dcdbas mac_hid wmi coretemp acpi_power_meter vhost_net macvtap macvlan lp parport usbhid hid usb_storage igb megaraid_sas dca
[693454.512115] 
[693454.516908] Pid: 11902, comm: qemu-system-x86 Tainted: G      D WC O 3.2.0-67-generic #101-Ubuntu Dell Inc. PowerEdge R620/0KCKR5
[693454.526920] RIP: 0010:[<ffffffff8117c7b9>]  [<ffffffff8117c7b9>] fput+0x9/0x30
[693454.537151] RSP: 0018:ffff880f924cbe28  EFLAGS: 00010292
[693454.542271] RAX: 0000000076b876b8 RBX: ffff880f82330000 RCX: ffff880fd7f507f0
[693454.552502] RDX: 00000000000076b8 RSI: ffff880f82330130 RDI: 0000000000000000
[693454.563116] RBP: ffff880f924cbe28 R08: ffff881fd5e5b868 R09: 00000000bff40000
[693454.574123] R10: ffff880f924cbfd8 R11: 0000000000000000 R12: 0000000000000000
[693454.585090] R13: ffff880f82330010 R14: ffff880f82330078 R15: ffff880f82330080
[693454.596075] FS:  00007f0a155ee940(0000) GS:ffff880fffcc0000(0000) knlGS:0000000000000000
[693454.607309] CS:  0010 DS: 0000 ES: 0000 CR0: 000000008005003b
[693454.613152] CR2: 0000000000000030 CR3: 0000000faccb5000 CR4: 00000000000426e0
[693454.624888] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
[693454.636655] DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400
[693454.648424] Process qemu-system-x86 (pid: 11902, threadinfo ffff880f924ca000, task ffff880f8224c500)
[693454.660272] Stack:
[693454.665953]  ffff880f924cbe88 ffffffffa0085ee7 ffff880f924cbe58 0000000000000000
[693454.677579]  0000000000000000 ffff880fd7f507f0 ffff880f924cbe78 ffff880f82330000
[693454.689496]  00007fff590f9340 ffff881fd5f09cf0 00007fff590f9340 0000000000000000
[693454.701427] Call Trace:
[693454.707261]  [<ffffffffa0085ee7>] vhost_net_set_backend+0x1f7/0x290 [vhost_net]
[693454.719022]  [<ffffffffa0086087>] vhost_net_ioctl+0x107/0x190 [vhost_net]
[693454.725006]  [<ffffffff8118d45a>] do_vfs_ioctl+0x8a/0x340
[693454.730867]  [<ffffffff8118d7a1>] sys_ioctl+0x91/0xa0
[693454.736604]  [<ffffffff8166bd42>] system_call_fastpath+0x16/0x1b
[693454.742254] Code: 85 c0 0f 84 b7 fe ff ff 31 d2 48 89 de 83 cf ff ff d0 e9 9f fe ff ff 66 66 2e 0f 1f 84 00 00 00 00 00 55 48 89 e5 66 66 66 66 90 <f0> 48 ff 4f 30 0f 94 c0 84 c0 75 0b 5d c3 66 0f 1f 84 00 00 00 
[693454.759632] RIP  [<ffffffff8117c7b9>] fput+0x9/0x30
[693454.765124]  RSP <ffff880f924cbe28>
[693454.770467] CR2: 0000000000000030
[693454.775696] ---[ end trace 6ede1b75ef4ed3eb ]---
edit retag flag offensive close merge delete