openstack-nova-compute fails to start. No messages logged.

2014-08-27 17:53:13 -0600

dt.turner

updated 2014-08-27 17:53:56 -0600

Our setup consists of the following: 1 control node, 1 db/cinder node, 1 network node, and 10 compute nodes.

We are running Oracle Linux 6 for our service/control nodes and Oracle VM server 3.3 for compute nodes.

I'm running the Icehouse build. All compute nodes are running the following package versions:

openstack-nova-compute-2014.1-2.0.1.el6.noarch
openstack-ceilometer-common-2014.1-1.el6.noarch
openstack-utils-2014.1-1.el6.noarch
openstack-neutron-openvswitch-2014.1-11.0.1.el6.noarch
openstack-neutron-2014.1-11.0.1.el6.noarch
openstack-ceilometer-compute-2014.1-1.el6.noarch
oraclevm-openstack-preinstall-1.0-5.el6.noarch
openstack-nova-common-2014.1-2.0.1.el6.noarch
openstack-neutron-ml2-2014.1-11.0.1.el6.noarch
openstack-selinux-0.1.3-2.el6ost.noarch

(Note that we currently have SELinux disabled.)

We've been running standard instance launch and deletion tests for the past few days. Every compute node has had instances launched on it as we added it to our site, to ensure proper configuration.

Today, I noticed that a compute node was reported down. The PID and lock files for the service were still there. The nova log simply showed that the service received a SIGTERM. No earlier messages indicated any pending doom. After removing the PID and lock files, I tried restarting the service. The service fails to start, and no log entries are written to /var/log/nova/compute.log.

After verifying that the node's nova.conf file matched our standard config, I verified that the remaining services (xend, libvirtd, messagebus, openvswitch) were running.

I tried rebooting the node. The nova-compute service still will not start, and again no messages are logged when attempting to start it.

Finally, I removed the openstack-nova-compute package, rebooted, and reinstalled the package, to no avail. The compute service refuses to start. No messages are logged to /var/log/nova/compute.log.

Any ideas?


1 answer

2014-08-27 20:37:41 -0600

dt.turner

Found the problem. I tried executing /usr/bin/nova-compute directly and could finally see that the service was failing because it could not parse nova.conf. As it turns out, a previously commented-out bit of text had been uncommented by mistake. So my earlier statement that nova.conf matched my other compute nodes was not true. :-)
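For anyone hitting the same thing, a quick scan of the config file can point at the offending line before the service is even started. The snippet below is only a rough sketch: `find_suspect_lines` and the sample text are made up for illustration, and the rules it applies (comments start with `#` or `;`, sections look like `[name]`, everything else needs a `=` or `:` after a non-empty key, indented lines continue the previous value) are an approximation of INI-style syntax, not the parser nova itself uses.

```python
def find_suspect_lines(text):
    """Return (line_number, line) pairs that do not look like valid INI content.

    Heuristic only: this approximates INI-style rules and is NOT the parser
    that nova uses to read nova.conf.
    """
    suspects = []
    have_key = False  # True while an indented continuation line would be legal
    for lineno, raw in enumerate(text.splitlines(), start=1):
        line = raw.rstrip()
        if not line:
            have_key = False  # a blank line ends a multi-line value
            continue
        if line[0] in " \t":
            # Indented line: only valid as a continuation of a previous value.
            if not have_key:
                suspects.append((lineno, line))
            continue
        if line.startswith(("#", ";")):
            have_key = False  # comment line
            continue
        if line.startswith("["):
            have_key = False
            # A section header must be closed and must have a name.
            if not line.endswith("]") or len(line) < 3:
                suspects.append((lineno, line))
            continue
        # Anything else must be "key = value" or "key: value".
        positions = [p for p in (line.find("="), line.find(":")) if p != -1]
        if not positions or not line[: min(positions)].strip():
            suspects.append((lineno, line))
            have_key = False
        else:
            have_key = True
    return suspects


# Hypothetical sample: line 3 is a comment that lost its leading '#'.
SAMPLE = """[DEFAULT]
# verbose=True
this text was meant to stay commented
debug=False
"""

if __name__ == "__main__":
    for lineno, line in find_suspect_lines(SAMPLE):
        print("line %d looks wrong: %s" % (lineno, line))
```

To check a real node, read the file's contents and pass them in instead of `SAMPLE` (nova.conf usually lives at /etc/nova/nova.conf, but that depends on the install). Running the service binary in the foreground, as described above, remains the authoritative check, since it shows the actual parse error.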
