Compute deployment fails OVS kernel oops smp, unhandled paging request

asked 2015-10-08 12:18:17 -0600

rmhayes462 gravatar image

updated 2015-10-10 17:18:23 -0600

We have a test deployment of Kilo w/ neutron GRE using Fuel 7 - 3 controller, 2 storage, 4 compute. Three of the compute nodes deploy fine; they are running intel xeon e5620's with intel pro gbe nics.

The 4th compute node is a supermicro h8dg6-f w/ amd opteron 6344's, dp x540-at2's and dp intel pro gbe nics. Once ovs starts to create bridges we run into defunct ovs processes and on br-floating we get the SMP Oops - unable to handle kernel paging request at 0000000000001e08.

Originally we were deploying with the fuel pxe network on the 1gbe nic and the rest of the networks split between the 2 10gbe ports. Because the x540-at2's have been reported to cause problems in the past we have tried deploying with the fuel pxe network on one of the 10gbe ports and the remainder of the networks split between the two 1gbe ports with the same result.

We are deploying with neutron DVR also.

I've checked for bad memory and disk (just because) with no problems found. I've also installed just the OS from the fuel deployment and didn't run into any immediate problems. However, I have not tried a manual configuration of OVS.

I have the full dmesg output and everything from /var/log in tar.gz but I can't attach to this post. Any insights are greatly appreciated. Below is the relevant section of dmesg.

[  393.377894] br-ex: port 1(eth2.104) entered forwarding state
[  393.377906] br-ex: port 1(eth2.104) entered forwarding state
[  393.624396] device ovs-system entered promiscuous mode
[  393.624531] BUG: unable to handle kernel paging request at 0000000000001e08
[  393.692068] IP: [<ffffffff81158fae>] __alloc_pages_nodemask+0x8e/0xb80
[  393.692072] PGD 7f9004067 PUD 7fa42f067 PMD 0
[  393.692074] Oops: 0000 [#1] SMP
[  393.692129] Modules linked in: 8021q garp mrp bridge stp llc bonding openvswi                                                                             tch(OX) gre vxlan ip_tunnel iptable_filter ip_tables x_tables kvm_amd kvm crct10                                                                             dif_pclmul crc32_pclmul ghash_clmulni_intel aesni_intel amd64_edac_mod aes_x86_6                                                                             4 edac_core lrw fam15h_power k10temp edac_mce_amd gf128mul joydev glue_helper ab                                                                             lk_helper i2c_piix4 serio_raw cryptd shpchp mac_hid nf_conntrack_proto_gre nf_co                                                                             nntrack_ipv6 nf_defrag_ipv6 nf_conntrack_ipv4 nf_defrag_ipv4 nf_conntrack nls_ut                                                                             f8 isofs xfs libcrc32c raid10 raid456 async_raid6_recov async_memcpy async_pq as                                                                             ync_xor async_tx xor raid6_pq raid1 raid0 hid_generic igb ixgbe multipath usbhid                                                                              i2c_algo_bit dca pata_acpi mpt2sas linear psmouse hid ptp pata_atiixp raid_clas                                                                             s ahci usb_storage libahci scsi_transport_sas pps_core mdio
[  393.692132] CPU: 20 PID: 29281 Comm: ovs-vswitchd Tainted: G           OX 3.1                                                                             3.0-65-generic #105-Ubuntu
[  393.692133] Hardware name: Supermicro H8DG6/H8DGi/H8DG6/H8DGi, BIOS 3.5a                                                                                    08/01/2015
[  393.692135] task: ffff8807f6534800 ti: ffff8807fa932000 task.ti: ffff8807fa93                                                                             2000
[  393.692139] RIP: 0010:[<ffffffff81158fae>]  [<ffffffff81158fae>] __alloc_page                                                                             s_nodemask+0x8e/0xb80
[  393.692140] RSP: 0018:ffff8807fa9336d0  EFLAGS: 00010246
[  393.692141] RAX: 0000000000001e00 RBX: 00000000002012d0 RCX: 0000000000000000
[  393.692142] RDX: 0000000000001e00 RSI: 0000000000000000 RDI: 00000000002012d0
[  393.692143] RBP: ffff8807fa9337f8 R08: 0000000040000000 R09: ffffea003fe46e20
[  393.692144] R10: ffffffffa0383100 R11: ffff881006c00f90 R12: 0000000000000080
[  393.692144] R13: 00000000002012d0 R14: 0000000000000000 R15: 0000000000000000
[  393.692146] FS:  00007f3625a18980(0000) GS:ffff881007280000(0000) knlGS:00000 ...
edit retag flag offensive close merge delete