Search This Blog

Friday 26 December 2014

Installing Ambari for Hadoop Installation and management

Apache Ambari  project is aimed at making Hadoop management simpler by developing software for provisioning, managing, and monitoring Apache Hadoop clusters. Ambari provides an intuitive, easy-to-use Hadoop management web UI backed by its RESTful APIs.In This post we will see how to install Apache Ambari on Redhat Instance on Amazon EWS cloud.

Features of Ambari:
  • Provision a Hadoop Cluster
    • Ambari provides a step-by-step wizard for installing Hadoop services across any number of hosts.
    • Ambari handles configuration of Hadoop services for the cluster.
  • Manage a Hadoop Cluster
    • Ambari provides central management for starting, stopping, and reconfiguring Hadoop services across the entire cluster.
  • Monitor a Hadoop Cluster
    • Ambari provides a dashboard for monitoring health and status of the Hadoop cluster.
    • Ambari leverages Ganglia for metrics collection.
    • Ambari leverages Nagios for system alerting and will send emails when your attention is needed (e.g., a node goes down, remaining disk space is low, etc).
Apache Ambari Installation: 

Lets start installation of Apache ambari follow the steps and in case of issues send mail at info@xcelframeworks.com.


Step: 1  Check the prerequisite.
  •         Amazon Web Services account with the ability to launch 7 large instances of EC2 nodes.
  •        A Mac or a Linux machine. You could also use Windows but you will have to install additional software such as SSH clients and SCP clients, etc.
  •        Lastly, we assume that you have basic familiarity with EC2 to the extent that you have created EC2 instances and SSH’d in.

Step:2 Get the information on connection details of your server instance from Administrator along with key and store the key locally. In current Environment keys are located on you’re your desktop as shown below:



Step:3 Get Information over your public and private DNS from the Batch Adminitrator. This information will be used to connect to instances using  Putty or Cygwin.This Lab considers following Public and private DNS:

Private DNS: ip-10-186-145-41.ec2.internal

Step:4 Connect to the instance using Putty .Click on the Putty icon and launch putty .



Step:5

In the Host Name box, enter user_name@public_dns_name. Be sure to specify the appropriate user name for your AMI. For example:

  •        For an Amazon Linux AMI, the user name is ec2-user.
  •        For a RHEL5 AMI, the user name is either root or ec2-user.
  •        For an Ubuntu AMI, the user name is ubuntu.
  •        For a Fedora AMI, the user name is either fedora or ec2-user.
  •        For SUSE Linux, the user name is root.
  •     Otherwise, if ec2-user and root don't work, check with the AMI provider.

  •        Under Connection type, select SSH.
  •        Ensure that Port is 22.




Step:6  

In  the Category pane, expand Connection, expand SSH, and then select Auth. Complete the following:

·      Click Browse.
  •        Select the .ppk file that you generated for your key pair, and then click Open.
  •        (Optional) If you plan to start this session again later, you can save the session information for future use. Select Session in the Category tree, enter a name for the session in Saved Sessions, and then click Save.
  •        Click Open to start the PuTTY session.

Step:7


Once  you get connected follow the following step.


Step:8

  • Setting up Ambari:

    Get the bits of HDP and add it to the repo:

    Step:9

    Refresh the repo:



Step:10

Install the Ambari server:
Step:11

Download bits required:
Step:12

 Agree to Download key



Step:13  

Ambari server Installation successful.
Step:13 Once Ambari server is install you need to configure it.
Accept all the defaults .

Step:14 Once Configuration step is done test for Ambari server startup.

Step:15 Test the Ambari-server launching 8080 using your Public DNS from browser.



So Now we have our Ambari DashBoard up.Next blog talks about installing HDP using Apache ambari .