How to scale an OpenStack Sahara cluster - Bigdata

asked 2018-02-04

Radhakrishnan Rk


I am running the Sahara service in my OpenStack POC infrastructure. How should I scale my Hadoop cluster in and out? I am using the Apache vanilla plugin to deploy the Hadoop cluster. Can anyone help me scale my cluster?

Best Regards,

Radhakrishnan Rk

1 answer

answered 2018-02-06

The Sahara documentation has a paragraph about scaling via the GUI:

The CLI has a command: dataprocessing cluster scale

Of course, you can also use the API:
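As a rough illustration of the API route, here is a minimal Python sketch that builds a scale request without sending it. It assumes the Sahara v1.1 REST API, where a cluster is scaled with a PUT to /v1.1/{project_id}/clusters/{cluster_id} and a body containing "resize_node_groups" and/or "add_node_groups"; please check this against the API reference for your release. The endpoint, project ID, cluster ID, token, node group names and template ID below are all made-up placeholders, and the helper function names are my own.

```python
import json
import urllib.request


def build_scale_body(resize=None, add=None):
    """Build the JSON body for a cluster scale request (assumed v1.1 format).

    resize: dict mapping an existing node group name to its new instance count
    add:    list of (name, node_group_template_id, count) tuples for new groups
    """
    body = {}
    if resize:
        body["resize_node_groups"] = [
            {"name": name, "count": count} for name, count in resize.items()
        ]
    if add:
        body["add_node_groups"] = [
            {"name": name, "node_group_template_id": template_id, "count": count}
            for name, template_id, count in add
        ]
    if not body:
        raise ValueError("nothing to scale: give resize and/or add")
    return body


def build_scale_request(endpoint, project_id, cluster_id, token, body):
    """Prepare (but do not send) the PUT request that scales the cluster."""
    # Assumption: 'endpoint' is the bare service URL without a version path.
    url = f"{endpoint.rstrip('/')}/v1.1/{project_id}/clusters/{cluster_id}"
    return urllib.request.Request(
        url,
        data=json.dumps(body).encode("utf-8"),
        method="PUT",
        headers={"X-Auth-Token": token, "Content-Type": "application/json"},
    )


if __name__ == "__main__":
    # Placeholder values only; replace with those of your own deployment.
    body = build_scale_body(resize={"worker": 5})
    request = build_scale_request(
        "http://controller:8386", "PROJECT_ID", "CLUSTER_ID", "TOKEN", body
    )
    print(request.get_method(), request.full_url)
    print(json.dumps(body, indent=2))
    # To actually send it: urllib.request.urlopen(request)
```

Scaling in is the same call with a smaller count for the node group. Nothing is sent until urlopen is called, so the request can be inspected first.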

Disclaimer: I am by no means an expert in big data or Sahara.

Thanks for the information. I have checked the doc. I am not sure whether it adds or removes cluster nodes with proper service decommissioning. If it decommissions a NodeManager or DataNode of a Hadoop cluster, it should back up all the data and stop the daemon gracefully. Let me check here.

Radhakrishnan Rk ( 2018-02-06 )

Some more info here: It doesn't specifically mention data backup, but "Decommissioning a Data Node may take some time because Hadoop rearranges data replicas around the Cluster" sounds good.

Bernd Bausch ( 2018-02-07 )

Thank you for the information.

Radhakrishnan Rk ( 2018-02-17 )

