DOC PREVIEW
UT Dallas CS 6350 - BigDataHadoop_PPT_Lesson10

This preview shows page 1-2-3-20-21-22-41-42-43 out of 43 pages.

Save
View full document
Premium Document
Do you want full access? Go Premium and unlock all 43 pages.
Access to all documents
Download any document
Ad free experience

Unformatted text preview:

Big Data and Hadoop Developer Lesson 10 Commercial Distribution of Hadoop Copyright 2014 Simplilearn All rights reserved Copyright 2014 Simplilearn All rights reserved Objectives By the end of this lesson you will be able to Identify the major commercial distributions of Hadoop Explain how to download and work on the Cloudera Quickstart VM Describe how to navigate the Hue interface Demonstrate how to navigate the Cloudera Manager interface Copyright 2014 Simplilearn All rights reserved Cloudera Introduction Cloudera is a commercial vendor for deploying Hadoop in an enterprise Following are the salient features of Cloudera It uses 100 open source distribution of Apache Hadoop and related projects like Apache Pig Apache Hive Apache HBase Apache Sqoop and so on It offers the user friendly Cloudera Manager for system management Cloudera Navigator for data management dedicated technical support and so on Copyright 2014 Simplilearn All rights reserved Cloudera CDH Cloudera s distribution is known as CDH Cloudera Distribution Including Apache Hadoop which delivers the core elements of Hadoop The core elements include scalable storage distributed computing and additional components and necessary enterprise capabilities such as security Image source cloudera com Copyright 2014 Simplilearn All rights reserved Downloading the Cloudera QuickStart Virtual Machine To explore the features of Cloudera download the Cloudera QuickStart Virtual Machine VM from the link given below http www cloudera com content support en downloads html Copyright 2014 Simplilearn All rights reserved Starting the Cloudera VM Perform the following steps to start Cloudera Quickstart VM 1 2 5 Click Open to continue after opening the vmx file Navigate to the location where you have extracted the zip file and select the vmx file Click File Open to open the existing Cloudera VM Start the VMware Workstation 4 3 6 Click the Power on this virtual machine link to start Cloudera VM The Cloudera VM web screen appears Copyright 2014 Simplilearn All rights reserved Starting the Cloudera VM Steps 1 and 2 Start VMware Workstation and click File Open to open the existing Cloudera VM that you have downloaded from the official website Copyright 2014 Simplilearn All rights reserved Starting the Cloudera VM Steps 3 and 4 Navigate to the location where you have extracted the zip file and select the vmx file as shown below Once the vmx file is selected click Open to continue Copyright 2014 Simplilearn All rights reserved Starting the Cloudera VM Step 5 The VM successfully opens in the VMware Workstation To start the Cloudera VM click the Power on this virtual machine link on the left of the VMware Workstation Introduction screen as shown here Copyright 2014 Simplilearn All rights reserved Starting the Cloudera VM Step 6 Once the machine starts the web screen appears and allows you to work on Cloudera VM Copyright 2014 Simplilearn All rights reserved Logging into Hue Hue is a web front end offered by the Cloudera VM to Apache Hadoop Click Hue to access the Hue interface Copyright 2014 Simplilearn All rights reserved Logging into Hue contd The steps to access Hue are given below 1 Use the default credentials to access the Hue interface Username cloudera Password cloudera 2 Click Sign in after entering the credentials Copyright 2014 Simplilearn All rights reserved Logging into Hue contd The Hue interface is displayed when you log in by entering your credentials Copyright 2014 Simplilearn All rights reserved Cloudera Manager Cloudera Manager is used to administer Apache Hadoop It helps in the configuration of the following components but is not limited to HDFS Hive engine Hue MapReduce Oozie ZooKeeper Flume HBase Cloudera Impala Cloudera Search and YARN Copyright 2014 Simplilearn All rights reserved Logging into Cloudera Manager Step 1 You need to perform the following steps to log in to Cloudera Manager The home screen interface of Cloudera Manager is displayed 1 Click Cloudera Manager 2 Enter the username and password The Cloudera Manager 3 home screen appears Copyright 2014 Simplilearn All rights reserved Logging into Cloudera Manager Step 2 1 Click Cloudera Manager 2 Enter the username and password Enter username and password as cloudera and click Login to continue The Cloudera Manager 3 home screen appears Copyright 2014 Simplilearn All rights reserved Logging into Cloudera Manager Step 3 The Cloudera Manager interface is displayed 1 Click Cloudera Manager the username and 2 Enter password Cloudera Manager 3 The home screen appears Copyright 2014 Simplilearn All rights reserved Business Scenario As a fast growing company Nutri Worldwide Inc realizes the need for moving to the next level in their usage of Hadoop With Olivia the EVP IT Operations successfully piloting some key features and tools for Hadoop the company decides to use the Cloudera enterprise version to leverage the efficiencies and Return on Investment ROI that can be achieved by using the same The demos in the subsequent screens illustrate how to work on Cloudera VM and use Eclipse with MapReduce in Cloudera s Quickstart VM Copyright 2014 Simplilearn All rights reserved Demo 1 Downloading starting and working with Cloudera VM Copyright 2014 Simplilearn All rights reserved Demo 2 Using Eclipse with MapReduce in Cloudera s Quickstart VM Copyright 2014 Simplilearn All rights reserved Hortonworks Data Platform Hortonworks Data Platform HDP enables enterprise Hadoop with a suite of essential capabilities that serve as the functional definition of any data platform technology Download available on http hortonworks com hdp downloads Copyright 2014 Simplilearn All rights reserved MapR Data Platform MapR data platform supports more than 20 open source projects It also supports multiple versions of the individual projects thereby allowing users to migrate to the latest versions at their own pace Download available on https www mapr com products hadoop download Copyright 2014 Simplilearn All rights reserved Pivotal HD Pivotal HD is a commercially supported enterprise capable distribution of Hadoop It consists of GemFire XD along with toolsets such as HAWQ MADlib OpenMPI GraphLab and Spring XD Download available on https network pivotal io products big data Copyright 2014 Simplilearn All rights reserved Pivotal HD contd Pivotal HD offers the following benefits It aims to accelerate data analytics projects It leverages existing skillsets It significantly expands Hadoop s capabilities


View Full Document

UT Dallas CS 6350 - BigDataHadoop_PPT_Lesson10

Documents in this Course
HW3

HW3

5 pages

NOSQL-CAP

NOSQL-CAP

23 pages

BigTable

BigTable

39 pages

HW3

HW3

5 pages

Load more
Download BigDataHadoop_PPT_Lesson10
Our administrator received your request to download this document. We will send you the file to your email shortly.
Loading Unlocking...
Login

Join to view BigDataHadoop_PPT_Lesson10 and access 3M+ class-specific study document.

or
We will never post anything without your permission.
Don't have an account?
Sign Up

Join to view BigDataHadoop_PPT_Lesson10 and access 3M+ class-specific study document.

or

By creating an account you agree to our Privacy Policy and Terms Of Use

Already a member?