
Nova-compute can't connect to Nova-controller

asked 2015-05-21 08:28:00 -0500

Egor

Hello, I'm running Ubuntu 14.04 LTS and trying to install Kilo following the install guide (http://docs.openstack.org/kilo/install-guide/install/apt/content/ch_nova.html#compute-service).

The issue: nova-compute doesn't connect to the controller, and it does not show up in the service list:

+----+------------------+--------+----------+---------+-------+----------------------------+-----------------+
| Id | Binary           | Host   | Zone     | Status  | State | Updated_at                 | Disabled Reason |
+----+------------------+--------+----------+---------+-------+----------------------------+-----------------+
| 1  | nova-cert        | ubuntu | internal | enabled | up    | 2015-05-21T12:17:52.000000 | -               |
| 2  | nova-consoleauth | ubuntu | internal | enabled | up    | 2015-05-21T12:17:52.000000 | -               |
| 3  | nova-scheduler   | ubuntu | internal | enabled | up    | 2015-05-21T12:17:49.000000 | -               |
| 4  | nova-conductor   | ubuntu | internal | enabled | up    | 2015-05-21T12:17:52.000000 | -               |
+----+------------------+--------+----------+---------+-------+----------------------------+-----------------+

After I moved

[oslo_messaging_rabbit]
rabbit_host = controller 
rabbit_userid = openstack
rabbit_password = RABBIT_PASS

to the [DEFAULT] section, nova-compute appeared with Id 5, status "enabled" and state "up", but later its state changed to "down".
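To make it easier to see which section actually sets the RabbitMQ options, here is a minimal Python sketch that reports where rabbit_host, rabbit_userid and rabbit_password are defined. The function name rabbit_option_sections is hypothetical and not part of Nova; the sketch only reads the text of a nova.conf and says nothing about which section a given Nova build honours.

```python
import configparser

# Option names taken from the config snippet quoted in the question.
RABBIT_KEYS = ("rabbit_host", "rabbit_userid", "rabbit_password")


def rabbit_option_sections(conf_text):
    """Map each rabbit_* option to the list of sections that set it.

    Hypothetical helper for illustration only.
    """
    # strict=False tolerates duplicate keys (the pasted file sets
    # "verbose" twice). Renaming default_section makes [DEFAULT] behave
    # like an ordinary section, so its options are not reported as
    # inherited by every other section.
    parser = configparser.RawConfigParser(
        strict=False, default_section="__unused__"
    )
    parser.read_string(conf_text)

    found = {key: [] for key in RABBIT_KEYS}
    for section in parser.sections():
        for key in RABBIT_KEYS:
            if parser.has_option(section, key):
                found[key].append(section)
    return found
```

It could be called as rabbit_option_sections(open("/etc/nova/nova.conf").read()) on the compute node. Commented-out lines such as "#rabbit_host = controller" are ignored, so an empty list for a key means that no section sets it.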

/var/log/nova/nova-compute.log

2015-05-21 08:13:22.073 2572 WARNING nova.compute.resource_tracker [-] No service record for host ubuntu
2015-05-21 08:13:22.120 2572 ERROR nova.openstack.common.threadgroup [-] Compute host ubuntu could not be found.
Traceback (most recent call last):
  File "/usr/lib/python2.7/dist-packages/oslo_messaging/rpc/server.py", line 142, in inner
    return func(*args, **kwargs)
  File "/usr/lib/python2.7/dist-packages/nova/conductor/manager.py", line 286, in service_get_all_by
    context, result['host'])
  File "/usr/lib/python2.7/dist-packages/nova/objects/base.py", line 163, in wrapper
    result = fn(cls, context, *args, **kwargs)
  File "/usr/lib/python2.7/dist-packages/nova/objects/compute_node.py", line 355, in get_all_by_host
    raise exception.ComputeHostNotFound(host=host)

I see the following messages in /var/log/rabbitmq/rabbit@ubuntu.log on the controller:

=INFO REPORT==== 21-May-2015::08:13:21 ===
accepting AMQP connection <0.1525.0> (10.0.0.31:51452 -> 10.0.0.11:5672)

=INFO REPORT==== 21-May-2015::08:13:21 ===
accepting AMQP connection <0.1540.0> (10.0.0.31:51453 -> 10.0.0.11:5672)

=INFO REPORT==== 21-May-2015::08:13:22 ===
closing AMQP connection <0.1540.0> (10.0.0.31:51453 -> 10.0.0.11:5672)

=WARNING REPORT==== 21-May-2015::08:13:22 ===
closing AMQP connection <0.1525.0> (10.0.0.31:51452 -> 10.0.0.11:5672):
connection_closed_abruptly
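For longer logs, the connections that were closed with a reason can be picked out mechanically. The following is a rough Python sketch that assumes the log layout is exactly as in the excerpt above (a "closing AMQP connection" line ending in a colon, followed by the reason on the next line); the function name abnormal_closes is made up for this example.

```python
import re

# Matches "closing AMQP connection <pid> (src -> dst)" and, when the line
# ends with a colon, captures the reason printed on the following line.
CLOSE_RE = re.compile(
    r"closing AMQP connection (?P<pid><[\d.]+>) "
    r"\((?P<src>[\d.]+:\d+) -> (?P<dst>[\d.]+:\d+)\)"
    r"(?::\n(?P<reason>[^\n]+))?"
)


def abnormal_closes(log_text):
    """Return (pid, source address, reason) for closes that carry a reason.

    Closes without a reason line are treated as normal and skipped.
    """
    return [
        (m.group("pid"), m.group("src"), m.group("reason"))
        for m in CLOSE_RE.finditer(log_text)
        if m.group("reason")
    ]
```

Run against the excerpt above, this would list only connection <0.1525.0> from 10.0.0.31:51452, since <0.1540.0> was closed without a reason.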

If I try to telnet from the compute node to the controller:

=INFO REPORT==== 21-May-2015::08:29:35 ===
accepting AMQP connection <0.1728.0> (10.0.0.31:51455 -> 10.0.0.11:5672)

=ERROR REPORT==== 21-May-2015::08:29:45 ===
closing AMQP connection <0.1728.0> (10.0.0.31:51455 -> 10.0.0.11:5672):
{handshake_timeout,handshake}
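A note on this telnet test: telnet only opens the TCP connection and never sends an AMQP protocol header, so a handshake_timeout about ten seconds after "accepting" (08:29:35 to 08:29:45 here) is what one would expect, and it suggests that port 5672 is reachable from the compute node. The same reachability check can be sketched in Python; tcp_port_open is a hypothetical helper name, and the sketch tests the TCP connection only, not RabbitMQ credentials.

```python
import socket


def tcp_port_open(host, port, timeout=3.0):
    """Return True if a TCP connection to host:port can be established.

    This checks network reachability only; it performs no AMQP
    handshake and no authentication.
    """
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts and name resolution errors.
        return False
```

On the compute node this could be run as tcp_port_open("controller", 5672). A True result combined with connection_closed_abruptly in the RabbitMQ log would point away from the network and toward the client side (configuration or package version).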

It seems to be an issue with RabbitMQ, but I don't see where.

/etc/nova/nova.conf     
[DEFAULT]
dhcpbridge_flagfile=/etc/nova/nova.conf
dhcpbridge=/usr/bin/nova-dhcpbridge
logdir=/var/log/nova
state_path=/var/lib/nova
lock_path=/var/lock/nova
force_dhcp_release=True
iscsi_helper=tgtadm
libvirt_use_virtio_for_bridges=True
connection_type=libvirt
root_helper=sudo nova-rootwrap /etc/nova/rootwrap.conf
verbose=True
ec2_private_dns_show_ip=True
api_paste_config=/etc/nova/api-paste.ini
volumes_path=/var/lib/nova/volumes
enabled_apis=ec2,osapi_compute,metadata
rpc_backend = rabbit
auth_strategy = keystone
my_ip = 10.0.0.31
vnc_enabled = True
vncserver_listen = 0.0.0.0
vncserver_proxyclient_address = 10.0.0.31
novncproxy_base_url = http://controller:6080/vnc_auto.html
verbose = True
rabbit_host = controller 
rabbit_userid = openstack
rabbit_password = RABBIT_PASS


[oslo_messaging_rabbit]
#rabbit_host = controller ...

Comments

Hi,

Has anyone found a resolution for this error? I am facing the same error.

prasad9920 (2015-08-06 06:17:12 -0500)

Hello Egor, Do you still have the issue or did you manage to resolve it?

conhew (2015-08-28 04:03:22 -0500)

3 answers


answered 2015-09-13 02:15:35 -0500

Waid

I had the same problem a few days ago and found no answer. I tried moving the rabbit authentication options to [DEFAULT], but it did not work. The log (/var/log/nova/nova-compute.log) on the compute node showed an error message along the lines of "Compute host not found".

I also rolled RabbitMQ back to version 3.2.4, but nothing changed.

I solved the problem when I found that my compute node did not have the correct nova-compute version installed. The file "/etc/apt/sources.list.d/cloudarchive-kilo.list" did not exist, so I had missed that step when installing the compute node. I reinstalled nova-compute from the correct package source and it finally worked (check the official installation guide: http://docs.openstack.org/kilo/instal...). Hope this helps.
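The check described in this answer can be scripted. The following Python sketch is an illustration only: the file path comes from the answer, while the helper names cloud_archive_entries and installed_version are hypothetical. It assumes a Debian/Ubuntu system where dpkg-query is available.

```python
import subprocess
from pathlib import Path

# Path quoted in the answer above.
KILO_LIST = Path("/etc/apt/sources.list.d/cloudarchive-kilo.list")


def cloud_archive_entries(list_file=KILO_LIST):
    """Return the active (non-comment) lines of the apt list file.

    Returns None when the file does not exist, which is the situation
    the answer describes.
    """
    path = Path(list_file)
    if not path.is_file():
        return None
    return [
        line.strip()
        for line in path.read_text().splitlines()
        if line.strip() and not line.lstrip().startswith("#")
    ]


def installed_version(package="nova-compute"):
    """Return the installed version of a package, or None if unknown."""
    try:
        result = subprocess.run(
            ["dpkg-query", "-W", "-f=${Version}", package],
            capture_output=True,
            text=True,
            check=True,
        )
    except (OSError, subprocess.CalledProcessError):
        # dpkg-query is missing or the package is not installed.
        return None
    return result.stdout.strip() or None
```

If cloud_archive_entries() returns None on the compute node, the repository step was skipped there. Comparing installed_version() between the controller and the compute node is one way to spot a release mismatch.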


Comments

Hi Waid,

Thanks for your comment. I was able to solve my issue, but I have a question: what is the difference between these two repositories?

cloud-archive:liberty and trusty-updates/kilo main (worked).

Is it the release? Does nova-compute from Liberty have a bug?

wtmpx (2015-12-25 08:31:41 -0500)

answered 2015-08-14 09:54:18 -0500

prasad9920

updated 2015-08-14 09:56:53 -0500

Hi All,

I tested with RabbitMQ 3.2.4 and it works fine.

Ping me if you have any issues.

Download the RabbitMQ packages from the following location:

https://www.rabbitmq.com/releases/rabbitmq-server/v3.2.4/


Comments

Hi, I'm having the same exact issue as stated above. I'm running rabbitmq 3.4.3. nova-compute connects to nova-controller and then the connection drops. rabbitmq log shows connection_closed_abruptly. Any help is appreciated.

langarita (2015-09-06 21:18:05 -0500)

answered 2015-08-09 06:20:12 -0500

mili

Hi,

I'm in the same boat with Kilo on Ubuntu.

Experts, please help.


Comments

I can also see some strange behavior with RabbitMQ:
1. To get things working in Kilo, I had to add the rabbit config to the [DEFAULT] section on the compute node.
2. After that, RabbitMQ on the controller reports connection_closed_abruptly.

mili (2015-08-10 06:36:11 -0500)

Hi, I'm having the exact same issue as stated above. I'm running RabbitMQ 3.4.3. nova-compute connects to the controller and then the connection drops. The RabbitMQ log shows connection_closed_abruptly. I too had to update nova.conf and move the rabbit config to the [DEFAULT] section. Did you solve it?

langarita (2015-09-06 21:19:11 -0500)

Same issue for me. I can also say that if I put rabbit_host = controller etc. in the [DEFAULT] section, compute1 appears in service-list but with state down, and on compute1 the nova-compute service status is down...

lgctelecom (2015-09-12 13:14:53 -0500)

...so for me it is not correct to put the rabbit settings in the [DEFAULT] section, but in this case there is no connection with the controller. I really cannot understand how to proceed.

lgctelecom (2015-09-12 13:16:01 -0500)


Stats

Asked: 2015-05-21 07:55:29 -0500

Seen: 2,853 times

Last updated: Sep 13 '15