Ask Your Question
0

Compute node unrecognized

asked 2014-08-11 11:05:37 -0500

bobyakov gravatar image

updated 2014-08-11 14:40:45 -0500

Hi All,

Set up Icehouse high availability cluster. Unfortunately I can get the compute nodes to be recognized. RabbitMQ is listening on proper 5672 port, and I am able to ping controllers from compute node.

nova-manager service list shows below:

nova-cert        cwtrct001                          internal         enabled    :-)   2014-08-11 16:01:08
nova-cert        cwtrct002                          internal         enabled    :-)   2014-08-11 16:01:09
nova-consoleauth cwtrct001                          internal         enabled    :-)   2014-08-11 16:01:08
nova-consoleauth cwtrct002                          internal         enabled    :-)   2014-08-11 16:01:10
nova-scheduler   cwtrct001                          internal         enabled    :-)   2014-08-11 16:01:07
nova-scheduler   cwtrct002                          internal         enabled    :-)   2014-08-11 16:01:13
nova-conductor   cwtrct001                          internal         enabled    :-)   2014-08-11 16:01:07
nova-conductor   cwtrct002                         internal         enabled    :-)   2014-08-11 16:01:09

Compute Neutron log:

2014-08-11 11:13:51.671 2023 ERROR neutron.openstack.common.rpc.common [req-0a5db9ba-af9d-478c-858b-d2086f006bce None] AMQP server on 10.1.0.5:5672 is unreachable: [ Errno 113] EHOSTUNREACH. Trying again in 1 seconds.
2014-08-11 11:13:55.691 2023 ERROR neutron.openstack.common.rpc.common [req-0a5db9ba-af9d-478c-858b-d2086f006bce None] AMQP server on 10.1.0.6:5672 is unreachable: [ Errno 113] EHOSTUNREACH. Trying again in 3 seconds.
2014-08-11 11:14:01.699 2023 ERROR neutron.openstack.common.rpc.common [req-0a5db9ba-af9d-478c-858b-d2086f006bce None] AMQP server on 10.1.0.5:5672 is unreachable: [ Errno 113] EHOSTUNREACH. Trying again in 5 seconds.
2014-08-11 11:14:09.707 2023 ERROR neutron.openstack.common.rpc.common [req-0a5db9ba-af9d-478c-858b-d2086f006bce None] AMQP server on 10.1.0.6:5672 is unreachable: [ Errno 113] EHOSTUNREACH. Trying again in 7 seconds.
2014-08-11 11:41:46.161 2023 ERROR neutron.agent.linux.ovsdb_monitor [-] Error received from ovsdb monitor: ovsdb-client: unix:/var/run/openvswitch/db.sock: receive  failed (End of file)
2014-08-11 11:42:22.793 2023 ERROR neutron.agent.linux.ovsdb_monitor [-] Error received from ovsdb monitor: 2014-08-11T15:42:22Z|00001|fatal_signal|WARN|terminating  with signal 15 (Terminated)
2014-08-11 11:54:28.546 5767 ERROR neutron.agent.linux.ovsdb_monitor [-] Error received from ovsdb monitor: 2014-08-11T15:54:28Z|00001|fatal_signal|WARN|terminating  with signal 15 (Terminated)
2014-08-11 11:54:34.704 7175 ERROR neutron.agent.linux.ovsdb_monitor [-] Error received from ovsdb monitor: ovsdb-client: unix:/var/run/openvswitch/db.sock: receive  failed (End of file)

Compute nova log: ( not sure why error states local:5672, my nova and neutron config point to 10.1.0.5,10.1.0.6)

2014-08-11 12:00:38.541 7067 INFO oslo.messaging._drivers.impl_rabbit [-] Reconnecting to AMQP server on localhost:5672
2014-08-11 12:00:38.542 7067 INFO oslo.messaging._drivers.impl_rabbit [-] Delaying reconnect for 1.0 seconds...
2014-08-11 12:00:39.555 7067 ERROR oslo.messaging._drivers.impl_rabbit [-] AMQP server on localhost:5672 is unreachable: [Errno 111] ECONNREFUSED. Trying again in 30 seconds.
2014-08-11 12:01:09.559 7067 INFO oslo.messaging._drivers.impl_rabbit [-] Reconnecting to AMQP server on localhost:5672
2014-08-11 12:01:09.559 7067 INFO oslo.messaging._drivers.impl_rabbit [-] Delaying reconnect for 1.0 seconds...
2014-08-11 12:01:10.581 7067 ERROR oslo.messaging._drivers.impl_rabbit [-] AMQP server on localhost:5672 is unreachable: [Errno 111] ECONNREFUSED. Trying again in 30 seconds.
edit retag flag offensive close merge delete

Comments

(feel like there are few things wrong)
did you try from your compute host to

telnet 10.1.0.5 5672

?

T u l gravatar imageT u l ( 2014-08-11 15:30:36 -0500 )edit

Yes connection is successful then get message connection closed. I think it may be my neutron set up. What Should neutron_url = be controller or network node, on compute node and controller?

bobyakov gravatar imagebobyakov ( 2014-08-11 17:34:56 -0500 )edit

Based on logs it seems only the compute node is having issues with connecting to RabbitMQ cluster.

bobyakov gravatar imagebobyakov ( 2014-08-12 09:18:26 -0500 )edit

looks like neutron_url @ compute host must point to the controller ( http://docs.openstack.org/havana/inst... )

T u l gravatar imageT u l ( 2014-08-12 14:29:49 -0500 )edit

It is (technically), I have it pointed to a VIP on a HAProxy server. Then that gets load balanced between the 2 controllers.

bobyakov gravatar imagebobyakov ( 2014-08-12 14:41:21 -0500 )edit

1 answer

Sort by ยป oldest newest most voted
0

answered 2014-08-12 14:45:37 -0500

bobyakov gravatar image

updated 2014-08-14 08:40:52 -0500

I was able to resolve the error on the neutron log. Appears Ubuntu by default loads virbr0 pointed at ddr:192.168.122.1. That does not match any of my IP's. I removed the default config with below commands and neutron connected to AMQP. Now only issue, nova-compute tries to connect to AMQP using localhost even though nova.conf is pointed at the 2 controllers.

virsh net-destroy default

virsh net-undefine default

** Issue reseolved moved rabbitmq information from rabbitmq section to Default section in nova.conf.

edit flag offensive delete link more

Comments

I have the exact same setup and I am running into the same issue but the given solution isn't working for me. Is there anything else I can try to get it working?

Sayali gravatar imageSayali ( 2014-09-16 07:10:21 -0500 )edit

I still see error AMQP server on 10.1.0.5:5672 is unreachable: [ Errno 113] EHOSTUNREACH. Trying again in 1 seconds, but it is not impacting functionality. I was able to resolve issue with unrecognized compute node, by moving rabbitmq information from rabbitmq section to Default section in nova.conf

bobyakov gravatar imagebobyakov ( 2014-09-24 12:31:49 -0500 )edit

Your Answer

Please start posting anonymously - your entry will be published after you log in or create a new account.

Add Answer

Get to know Ask OpenStack

Resources for moderators

Question Tools

1 follower

Stats

Asked: 2014-08-11 11:05:37 -0500

Seen: 3,127 times

Last updated: Aug 14 '14