Please enable Javascript to correctly display the contents on Dot Net Tricks!

 

Hadoop Development cum Administration

Reviews | 316 Learners Course Agenda
The Hadoop Development cum Administration course is primarily designed for Software Professional(s)/ Manager(s)/ Architect(s) who want to learn how to manage large and complex data sets and scale up it from single servers to thousands of machines. In this course, you will learn the basic and advance in-depth concepts of Big-Data and Hadoop along with implementation. This course will give insight on Hadoop 2.0, Name Node, YARN, MapReduce, Spark, Oozie, Scala, Hadoop Cluster and Hadoop Administration etc.

Watch Pre-Recorded Live Class Video

BigData and Hadoop Training

Course Description

  • About the course
  • Course Curriculum
  • Assessment
  • Projects
  • F&Q's

About the course

The Hadoop Development cum Administration course is primarily designed for Software Professional(s)/ Manager(s)/ Architect(s) who want to learn how to manage large and complex data sets and scale up it from single servers to thousands of machines. In this course, you will learn the basic and advance in-depth concepts of Big-Data and Hadoop along with implementation. This course will give insight on Hadoop 2.0, Name Node, YARN, MapReduce, Spark, Oozie, Scala, Hadoop Cluster and Hadoop Administration etc.

This course helps you to learn how to deploy, configure, manage, monitor, and secure a Hadoop Cluster.

Course objective

At the completion of this course, attendees will be able to;

  1. Understand the need of BigData and Hadoop
  2. Understand Hadoop 2.x ecosystem in-depth concepts with its Architecture
  3. Setup and Configure Hadoop and its components.
  4. Understand the concepts of HDFS, MapReduce, Hbase, Hive etc.
  5. Understand Sqoop and Flume in-depth concepts with its architecture
  6. Perform data analytics using Pig, Hive and YARN
  7. Schedule Jobs using Oozie
  8. Perform deployment, backup and recovery
  9. Use Hortonworks/Ambari Distributions
  10. Understand machine learning with Mahout

Who can do this course?

All Software Professional(s)/ Manager(s)/ Architect(s) who are keen to learn how to manage large and complex data sets and scale up it from single machine to thousands of machines should go for this course.

Pre-requisites

There are no pre-requisites to join this course but if you have knowledge of Java and Linux, it might help you.

Tools

VM Player/Virtual Box (with all required softwares for Hadoop development)

Introduction to Big Data and Hadoop

  1. Big Data Definition and Concept
  2. Structured vs. Unstructured Data
  3. Types of Big Data
  4. Limitations and Solutions of existing Data Analytics Architecture RDBMS
  5. Traditional System Vs New Data Analytics System
  6. Assignment

The Hadoop Ecosystem

  1. Introduction to Hadoop 2.x
  2. Hadoop Features
  3. Hadoop Ecosystem
  4. Hadoop 2.x core components
  5. Hadoop Key Services
  6. Assignment

Hadoop Storage: HDFS

  1. Hadoop Storage: HDFS (Hadoop Distributed File System)
  2. HDFS vs. GFS (Google File System)
  3. Hadoop Processing: MapReduce Framework
  4. Anatomy of File Write and Read, Rack Awareness
  5. Assignment

Hadoop Architecture and HDFS Architecture

  1. Hadoop 2.x Cluster Architecture - Federation and High Availability
  2. A Typical Production Hadoop Cluster
  3. Hadoop Cluster Modes
  4. Common Hadoop Shell Commands
  5. Assignment

MapReduce

  1. Hadoop 2.x Configuration Files
  2. Password-Less SSH
  3. MapReduce Framework
  4. MapReduce Job Execution
  5. Data Loading Techniques: Hadoop Copy Commands
  6. Assignment

FLUME and Sqoop

  1. FLUME and Sqoop
  2. Use of FLUME in Hadoop
  3. Use of SQOOP in Hadoop
  4. Import with Sqoop
  5. Export with Sqoop
  6. Debug issue with Sqoop
  7. Assignment

Get started with Sqoop

  1. Introduction to Sqoop
  2. Advantages of Sqoop
  3. Workflow
  4. Import with Sqoop
  5. Export with Sqoop
  6. Debug issue with Sqoop

Deep Dive in MapReduce

  1. MapReduce Use Cases
  2. Traditional way vs. MapReduce way
  3. Why MapReduce
  4. Hadoop 2.x MapReduce Architecture
  5. Assignment

Yarn

  1. Hadoop 2.x MapReduce Components
  2. YARN MR Application Execution Flow
  3. Why YARN
  4. YARN Workflow
  5. Assignment

Hadoop MapReduce Framework

  1. Anatomy of MapReduce Program
  2. Demo on MapReduce
  3. Input Splits
  4. Relation between Input Splits and HDFS Blocks
  5. MapReduce Job Submission Flow
  6. Assignment

Hands-On Hadoop MapReduce

  1. Demo of Input Splits
  2. MapReduce: Combiner & Practitioner
  3. Demo on de-identifying Health Care Data set
  4. Demo on Weather Dataset
  5. Assignment

Advance MapReduce

  1. Counters
  2. Distributed Cache
  3. MRunit
  4. Reduce Join
  5. Custom Input Format
  6. Sequence Input Format
  7. Assignment

PIG

  1. About Pig
  2. MapReduce vs. Pig
  3. Pig Use Cases
  4. Programming Structure in Pig
  5. Pig Running Modes
  6. Pig components
  7. Pig Execution
  8. Practical example

PIG Fundamentals

  1. Pig Latin Program
  2. Data Models in Pig
  3. Pig Data Types
  4. Pig Latin: Relational Operators, File Loaders
  5. Group Operator, COGROUP Operator, Joins and COGROUP, Union, Diagnostic Operators
  6. Assignment

PIG and Hive

  1. Pig UDF
  2. Pig Demo on Healthcare Data set
  3. Hive Background
  4. Hive Use case

Hive

  1. About Hive
  2. Hive vs. Pig
  3. Hive Architecture and Components
  4. Metastore in Hive
  5. Limitations of Hive
  6. Comparison with Traditional Database

Hive fundamentals and Queries

  1. Hive Data Types and Data Models
  2. Partitions and Buckets
  3. Hive Tables (Managed Tables and External Tables)
  4. Querying Data
  5. Managing Outputs
  6. Hive Script
  7. Hive Demo on Healthcare Data set

Advance Hive and HBase

  1. Hive QL: Joining Tables, Dynamic Partitioning, Custom Map/Reduce Scripts
  2. Hive: Thrift Server, User Defined Functions
  3. HBase: Introduction to NoSQL Databases and HBase, HBase vs. RDBMS, HBase

Advance HBase

  1. HBase Workflow
  2. HBase Data Model
  3. HBase Shell
  4. HBase Client API
  5. Data Loading Techniques

Zookeeper

  1. Zookeeper Data Model
  2. Zookeeper Service
  3. Zookeeper
  4. Demos on Bulk Loading
  5. Getting and Inserting Data
  6. Filters in HBase

Oozie

  1. Flume and Sqoop Demo Case study
  2. Oozie
  3. Oozie Components
  4. Oozie Workflow

Job Scheduling with Oozie

  1. Scheduling with Oozie
  2. Demo on Oozie Workflow
  3. Oozie Co-ordinator
  4. Oozie Commands
  5. Oozie Web Console

Hortonworks and Ambari

  1. Introduction to Hortonworks
  2. Introduction to Ambari
  3. Hands-On Ambari

Machine learning with Mahout

  1. Introduction to Machine Learning and Apache Mahout
  2. Mahout and Hadoop
  3. Clustering
  4. Clustering Extended
  5. Classification
  6. Implementing a Recommender and Recommendation Platform

Hands-on Projects

  1. Project1- Analyze Social Bookmarking sites to find insights
  2. Project2-Customer Complaints Analysis
  3. Project3-Analyze YouTube Data

Mock-up Tests and Assignments

Dot Net Tricks’ mock-up tests and assignments help the professionals to work on real-world projects and to get an edge in their careers and make their lives better. This training program includes 4 mock-up tests and 6 assignments. Each mock-up test includes 20 multiple choice questions.

Mockup-Tests objective

  1. Help you to monitor your learning progress.
  2. Help you to evaluate yourself.
  3. Help you to crack your technical interview first round with objective questions.

Assignments objective

  1. Gain confidence to work on Hadoop project.
  2. Help you to evaluate your development skills.
  3. Prepare yourself for real-application development.

Projects

Description

The primary goal of hands over projects is to understand how to use Hadoop in real application database management and analysis. Here we will be using PIG, HIVE, HBase and MapReduce to perform Data analytics.

Project #1 - Analyze Social Bookmarking sites to find insights

  1. Fetch Data into HDFS and analyze it by using MapReduce, Pig and Hive to find top rated links based on insights.
  2. Using MapReduce convert semi structured data into structured format.
  3. Push HDFS output and then feed into PIG.

Project #2 - Analyze YouTube Data

  1. Find out the top 5 categories in which the most number of videos are uploaded, the top 10 rated videos, the top 10 most viewed videos.

Frequently Asked Questions

Q1. Do you provide any course material?

Yes we do. All relevant course material and exercises you will be get through our mentors.

Q2. Do you provide any class video?

Yes we do. You will get the recorded sessions of your own online training classes, so that you can revise your class when you want.

Q3. What If I miss my online training class?

All online training classes are recorded. You will get the recorded sessions so that you can watch the online classes when you want. Also, you can join other class to do your missing classes.

Q4. What If I miss my classroom training class?

You can join other on going classroom batch classes to do your missing classes.

Q5. Do you prepare me for the job interview?

Yes, we do. We will discuss all possible technical interview questions and answers during the training program so that you can prepare yourself for interview.

Q6. Do you provide hands-on real application development?

Yes, we do provide you to do hands-on real application development.

Q7. Whom do I contact, if I have more queries?

You can give us a CALL at +91 113 303 4100 OR email us at enquiry@dotnettricks.com
Instructor Led Training
  • 365 days of access to interactive live sessions videos
  • Interactive instructor-led online/classroom sessions
  • 365 days of access to high-quality, interactive learning content through LMS

17,367/289
16,499/275

5% OFF

Dot Net Tricks For Corporate Training

Our corporate training programs are specially as per industry requirements, so that your employees can develop applications with latest technologies. As a trusted and resourceful training partner, Dot Net Tricks helps you achieve success with its industry-leading instructional design and customer training initiatives.

Course Overview

Type
: Project-based Program
Audience(s)
: Software Professional(s), Manager(s), Architect(s)
Tools/IDE
: Eclipse, Cloudera
Delivery method
: Instructor-led Sessions
Duration
: 40 Hours
Language
: English

Course Features

All the instructor led online training sessions are recorded and made available in LMS within 24 hours. This includes video, course material, exercise files and ppts used during the session.
All the sessions are delivered using real-life examples. You will learn how to develop real project using industry recomended principles and practices.
Each Module will be followed by objective mockup tests and practical assignments which help you to monitor your learning progress and Evaluate yourself.
You get 365 days access to the Learning Management System (LMS). Class recordings and presentations can be viewed online from the LMS.
Towards the end of the course, you will be working on a project. Dot Net Tricks certifies you based on the project.
We have a community forum for all our learners wherein you can enrich their learning through peer interaction and knowledge sharing.
We have 24x7 online support team available to help you with any technical queries you may have during the course.
Students Speak
Previous Next
+