Ask Your Question
2

implement Openstack on multiple laptops and an hadoop cluster over it [closed]

asked 2015-02-09 01:04:16 -0500

Akash gravatar image

updated 2015-02-09 09:26:14 -0500

smaffulli gravatar image

I have a project to do, in which a demonstration has to be made of performing string manipulation algorithms on huge amount of data(in GBs) stored under hadoop. This hadoop framework should again be installed on the openstack platform and I must demonstrate this by using multiple laptops(i.e not by using some virtual machines on a single laptop). I am new to this Big Data and Cloud field. Can you please give me some guidelines on how to implement this or some docs having the steps to implement this?

edit retag flag offensive reopen merge delete

Closed for the following reason too subjective and argumentative by smaffulli
close date 2015-02-09 09:26:23.357503

2 answers

Sort by ยป oldest newest most voted
0

answered 2015-02-09 09:27:27 -0500

smaffulli gravatar image

You had your answer so I'm closing the question that is way too wide to be useful. Please read https://ask.openstack.org/faq and if you want to have a conversation, consider using the OpenStack mailing list http://lists.openstack.org

edit flag offensive delete link more
0

answered 2015-02-09 02:35:27 -0500

9lives gravatar image

updated 2015-02-09 06:11:00 -0500

you might take a look at OpenStack Sahara project(EDP as Service) based on Hadoop, you can benefit from Sahara with two things, No1. hadoop cluster provisioning; No2. EDP on Hadoop cluster.

The only thing you might need to consider is if hadoop jobs need to be running on vm or physical box, currently Sahara only support vm based cluster.

For more information refer to the Sahara wiki here

@Akash

Theoretically speaking, yes. you may implement your cloud with multiple nodes(laptops) via two options:

  • Single dc deployment with multi-nodes

    1 controller + 1 network node + many compute nodes

  • multiple region deployment with shared 1 keystone service:

    region 1: 1 controller + 1 network node + many compute nodes;

    region 2: 1 controller( point to the keystone service in region 1) + 1 network node + many compute nodes;

Hope that helps!

Vic

edit flag offensive delete link more

Comments

Thanx, but what if I want to use actual physical boxes??what should I use in that case??

Akash gravatar imageAkash ( 2015-02-09 03:13:34 -0500 )edit

if you really want to use physical box, IMO there are two options: 1. setup hadoop cluster manually 2. use ironic or other baremetal provisioning tools to install hadoop on physical box which might need more effort than options 1.

9lives gravatar image9lives ( 2015-02-09 03:38:12 -0500 )edit

By using Sahara, can we implement the cloud over multiple laptops?? And thanx again

Akash gravatar imageAkash ( 2015-02-09 04:51:54 -0500 )edit

Get to know Ask OpenStack

Resources for moderators

Question Tools

2 followers

Stats

Asked: 2015-02-09 01:04:16 -0500

Seen: 458 times

Last updated: Feb 09 '15