UT Dallas CS 6350 - BigDataHadoop_PPT_Lesson10


Copyright 2014, Simplilearn, All rights reserved.

Lesson 10: Commercial Distribution of Hadoop
Big Data and Hadoop Developer

Objectives

By the end of this lesson, you will be able to:
● Identify the major commercial distributions of Hadoop
● Explain how to download and work on the Cloudera QuickStart VM
● Describe how to navigate the Hue interface
● Demonstrate how to navigate the Cloudera Manager interface

Cloudera: Introduction

Cloudera is a commercial vendor for deploying Hadoop in an enterprise. Its salient features are:
● It uses a 100% open-source distribution of Apache Hadoop and related projects such as Apache Pig, Apache Hive, Apache HBase, and Apache Sqoop.
● It offers the user-friendly Cloudera Manager for system management, Cloudera Navigator for data management, dedicated technical support, and so on.

Cloudera CDH

Cloudera’s distribution is known as CDH (Cloudera Distribution Including Apache Hadoop), which delivers the core elements of Hadoop. The core elements include:
● scalable storage;
● distributed computing and additional components; and
● necessary enterprise capabilities such as security.

Image source: cloudera.com

Downloading the Cloudera QuickStart Virtual Machine

To explore the features of Cloudera, download the Cloudera QuickStart Virtual Machine (VM) from the link given below:
http://www.cloudera.com/content/support/en/downloads.html

Starting the Cloudera VM

Perform the following steps to start the Cloudera QuickStart VM:
1. Start VMware Workstation.
2. Click File > Open to open the existing Cloudera VM.
3. Navigate to the location where you extracted the zip file and select the vmx file.
4. Click Open to continue after selecting the vmx file.
5. Click the Power on this virtual machine link to start the Cloudera VM.
6. The Cloudera VM web screen appears.

Starting the Cloudera VM: Steps 1 and 2

Start VMware Workstation and click File > Open to open the existing Cloudera VM that you downloaded from the official website.

Starting the Cloudera VM: Steps 3 and 4

Navigate to the location where you extracted the zip file and select the vmx file. Once the vmx file is selected, click Open to continue.

Starting the Cloudera VM: Step 5

The VM opens in VMware Workstation. To start the Cloudera VM, click the Power on this virtual machine link on the left of the VMware Workstation Introduction screen.

Starting the Cloudera VM: Step 6

Once the machine starts, the web screen appears and allows you to work on the Cloudera VM.

Logging into Hue

Hue is a web front end to Apache Hadoop offered by the Cloudera VM. Click Hue to access the Hue interface.

Logging into Hue (contd.)

The steps to access Hue are given below.
1. Use the default credentials to access the Hue interface:
● Username: cloudera
● Password: cloudera
2. Click ‘Sign in’ after entering the credentials.

Logging into Hue (contd.)

The Hue interface is displayed once you log in with your credentials.

Cloudera Manager

Cloudera Manager is used to administer Apache Hadoop. It helps configure components including, but not limited to, the following:
● HDFS;
● Hive engine;
● Hue;
● MapReduce;
● Oozie;
● ZooKeeper;
● Flume;
● HBase;
● Cloudera Impala;
● Cloudera Search; and
● YARN.

Logging into Cloudera Manager

You need to perform the following steps to log in to Cloudera Manager:
1. Click ‘Cloudera Manager’.
2. Enter the username and password.
3. The Cloudera Manager home screen appears.

Logging into Cloudera Manager: Step 1

The home screen interface of Cloudera Manager is displayed.

Logging into Cloudera Manager: Step 2

Enter ‘cloudera’ as both the username and the password, and click ‘Login’ to continue.

Logging into Cloudera Manager: Step 3

The Cloudera Manager interface is displayed.

Business Scenario

As a fast-growing company, Nutri Worldwide Inc. realizes the need to move to the next level in its usage of Hadoop. With Olivia, the EVP of IT Operations, having successfully piloted some key features and tools for Hadoop, the company decides to use the Cloudera enterprise version to leverage the efficiencies and Return on Investment (ROI) it can deliver.

The demos in the subsequent screens illustrate how to work on the Cloudera VM and use Eclipse with MapReduce in Cloudera’s QuickStart VM.

Demo 1: Downloading, starting, and working with Cloudera VM

Demo 2: Using Eclipse with MapReduce in Cloudera’s QuickStart VM

Hortonworks Data Platform

Hortonworks Data Platform (HDP) enables enterprise Hadoop with a suite of essential capabilities that serve as the functional definition of any data platform technology.

Download available on http://hortonworks.com/hdp/downloads/

MapR Data Platform

The MapR data platform supports more than 20 open source projects. It also supports multiple versions of the individual projects, thereby allowing users to migrate to the latest versions at their own pace.

Download available on: https://www.mapr.com/products/hadoop-download

Pivotal HD is a commercially supported, enterprise-capable distribution of Hadoop. It

