How can we overcome any type of sudden server failure?

asked 2020-03-28 00:14:52 -0500

Devendra_Singh_Balihar gravatar image

During the failure of the server or when the server is not seen to be perfectly functioning, How can we manage this.

edit retag flag offensive close merge delete


Do you mean a Nova instance, a compute node or a controller?

Generally the answer is redundancy - several servers.

Bernd Bausch gravatar imageBernd Bausch ( 2020-03-28 02:32:20 -0500 )edit

1 answer

Sort by ยป oldest newest most voted

answered 2020-03-29 14:58:27 -0500

dalem gravatar image

It depends. For e.g a) Nova Compute: then you can configure the triggering of evacauation and migration procedure of the VMs to an healthy compute node. You can read more about it here:

b) Controllers: They run in a quorum and by deafult there are 3 running instances. So failure of any one of the server causes no issues.

c) Storage ( for e.g. volume storage in CEPH): Here the data is duplicated to additional server nodes in a cluster by replication or erasure coding. In ceph by default there are three server nodes in a cluster. So even if any one of the nodes fails, VM can still find its data from the remaining two health nodes of the cluster

edit flag offensive delete link more


There is no such default as three servers in ceph. You can configure it any way you want, even a one-node-ceph-cluster ist no problem. But the idea is correct, ceph is designed as a redundant storage system for several different applications, one of them is OpenStack.

eblock gravatar imageeblock ( 2020-03-30 05:03:10 -0500 )edit

Get to know Ask OpenStack

Resources for moderators

Question Tools

1 follower


Asked: 2020-03-28 00:14:52 -0500

Seen: 32 times

Last updated: Mar 29