About Hadoop Developer Training Course

This Apache Hadoop Developer Training will give you a detailed understanding of Big Data and Hadoop. Topics include an introduction to the Hadoop ecosystem and an understanding of HDFS and MapReduce, including MapReduce abstraction. You will also learn to install and implement various Hadoop components such as Pig, Hive, Flume, Sqoop and YARN.

What will you learn in this Hadoop Developer Online Training Course?

  1. Learn the Hadoop Architecture and Hadoop basics for beginners
  2. Learn what Hadoop, HDFS and the MapReduce framework are
  3. Write MapReduce programs and deploy Hadoop clusters
  4. Develop applications for Big Data using Hadoop Technology
  5. Develop YARN programs on the Hadoop 2.X version
  6. Work on Big Data analytics using Hive, Pig and YARN
  7. Integrate MapReduce and HBase for advanced usage and indexing
  8. Learn the fundamentals of the Spark framework and how it works
  9. Understand RDDs in Apache Spark
  10. Learn Hadoop development best practices
  11. Schedule jobs using Oozie
  12. Prepare for the Cloudera Spark and Hadoop Developer Certification

Who should take this Hadoop Developer Training Course?

  • Software developers and analytics, BI, ETL and data warehousing professionals
  • Big Data Hadoop developers, architects and testing personnel

Big Data Hadoop Developer Course Content

    Introduction to Hadoop and its Ecosystem, MapReduce and HDFS

    • Big Data
    • Factors constituting Big Data
    • What is Hadoop?
    • Overview of the Hadoop Ecosystem
    • MapReduce: concepts of Map, Reduce, Ordering, Concurrency, Shuffle and Reducing
    • Hadoop Distributed File System (HDFS): concepts and importance
    • Deep Dive in MapReduce – Execution Framework, Partitioner, Combiner, Data Types, Key-Value Pairs
    • HDFS Deep Dive – Architecture, Data Replication, NameNode, DataNode, Data Flow
    • Parallel Copying with DistCp
    • Hadoop Archives

    Hands-on Exercises

    • Installing Hadoop in Pseudo-Distributed Mode
    • Understanding important configuration files, their properties and daemon threads
    • Accessing HDFS from the Command Line
    • MapReduce – Basic Exercises
    • Understanding the Hadoop Ecosystem
    • Introduction to Sqoop, use cases and installation
    • Introduction to Hive, use cases and installation
    • Introduction to Pig, use cases and installation
    • Introduction to Oozie, use cases and installation
    • Introduction to Flume, use cases and installation
    • Introduction to YARN

    Mini Project – Importing MySQL Data using Sqoop and Querying it using Hive

    Deep Dive in MapReduce and YARN

    • How to develop a MapReduce application and write unit tests
    • Best practices for developing, writing and debugging MapReduce applications
    • Joining data sets in MapReduce
    • Hadoop APIs
    • Introduction to Hadoop YARN
    • Differences between Hadoop 1.0 and 2.0

    Module 3.1

    Project 1 – Hands-on exercise – end-to-end PoC using YARN or Hadoop 2.

    • Handling real-world bank transactions
    • Moving data to HDFS using Sqoop
    • Incremental updates of data to HDFS
    • Running a MapReduce program
    • Running Hive queries for data analytics

    Project 2 – Hands-on exercise – end-to-end PoC using YARN or Hadoop 2.7

    Running MapReduce code on movie ratings to find each movie's fans and average rating
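
    As a rough illustration of the kind of logic this project involves, the sketch below simulates the map and reduce steps in plain Python rather than on a Hadoop cluster. The record layout (`user_id,movie_id,rating`), the `FAN_THRESHOLD` value, and the idea that a "fan" is a user who rated a movie at or above that threshold are all assumptions made for the example, not details taken from the course material.

```python
from collections import defaultdict

# Assumption: a "fan" is a user who rated the movie 4.0 or higher.
FAN_THRESHOLD = 4.0


def map_rating(record):
    """Mapper: turn one 'user_id,movie_id,rating' line into (movie_id, (user_id, rating))."""
    user_id, movie_id, rating = record.split(",")
    return movie_id, (user_id, float(rating))


def reduce_movie(movie_id, values):
    """Reducer: compute the average rating and the list of fans for one movie."""
    ratings = [rating for _, rating in values]
    fans = sorted(user for user, rating in values if rating >= FAN_THRESHOLD)
    return movie_id, {"average": sum(ratings) / len(ratings), "fans": fans}


def analyze(records):
    """Run map, group by key (the shuffle step), then reduce."""
    grouped = defaultdict(list)
    for record in records:
        key, value = map_rating(record)
        grouped[key].append(value)
    return dict(reduce_movie(key, values) for key, values in grouped.items())


if __name__ == "__main__":
    sample = ["u1,m1,5", "u2,m1,3", "u3,m2,4"]
    for movie, stats in analyze(sample).items():
        print(movie, stats)
```

    On a real cluster the grouping step is performed by the framework between the map and reduce phases; here it is a dictionary so the flow can be followed on a laptop.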

    Deep Dive in Pig

    A. Introduction to Pig

    • What Is Pig?
    • Pig’s Features
    • Pig Use Cases
    • Interacting with Pig

    B. Basic Data Analysis with Pig

    • Pig Latin Syntax
    • Loading Data
    • Simple Data Types
    • Field Definitions
    • Data Output
    • Viewing the Schema
    • Filtering and Sorting Data
    • Commonly-Used Functions
    • Hands-On Exercise: Using Pig for ETL Processing

    C. Processing Complex Data with Pig

    • Complex/Nested Data Types
    • Grouping
    • Iterating Grouped Data
    • Hands-On Exercise: Analyzing Data with Pig

    Deep Dive in Hive

    A. Introduction to Hive

    • What Is Hive?
    • Hive Schema and Data Storage
    • Comparing Hive to Traditional Databases
    • Hive vs. Pig
    • Hive Use Cases
    • Interacting with Hive

    B. Relational Data Analysis with Hive

    • Hive Databases and Tables
    • Basic HiveQL Syntax
    • Data Types
    • Joining Data Sets
    • Common Built-in Functions
    • Hands-On Exercise: Running Hive Queries on the Shell, Scripts, and Hue

    C. Hive Data Management

    • Hive Data Formats
    • Creating Databases and Hive-Managed Tables
    • Loading Data into Hive
    • Altering Databases and Tables
    • Self-Managed Tables
    • Simplifying Queries with Views
    • Storing Query Results
    • Controlling Access to Data
    • Hands-On Exercise: Data Management with Hive

    D. Hive Optimization

    • Understanding Query Performance
    • Partitioning
    • Bucketing
    • Indexing Data

    Introduction to HBase Architecture

    • What is HBase?
    • Where does it fit?
    • What is NoSQL?

    Hadoop Cluster Setup and Running MapReduce Jobs

    • Hadoop multi-node cluster setup using Amazon EC2 – creating a 4-node cluster
    • Running MapReduce jobs on the cluster

    Advanced MapReduce

    • Delving Deeper into the Hadoop API
    • More Advanced MapReduce Programming
    • Joining Data Sets in MapReduce
    • Graph Manipulation in Hadoop

    Job and certification support

    • Major Project
    • Hadoop Development
    • Cloudera Certification Tips and Guidance
    • Mock Interview Preparation
    • Practical Development Tips and Techniques
    • Certification Preparation

    Big Data Hadoop Developer Project

    Project Work

    1. Project – Working with MapReduce, Hive, Sqoop

    Problem Statement – This project covers how to import MySQL data using Sqoop and query it using
    Hive, and how to run the word count MapReduce job.
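
    For readers new to the word count job, the following is a minimal sketch of its map, shuffle and reduce phases, simulated in plain Python instead of being run on Hadoop. The function names (`map_phase`, `shuffle_phase`, `reduce_phase`, `word_count`) and the sample input are hypothetical and chosen only for illustration; the course project itself runs the job on a cluster.

```python
from collections import defaultdict


def map_phase(line):
    """Mapper: emit a (word, 1) pair for every whitespace-separated word in one line."""
    return [(word.lower(), 1) for word in line.split()]


def shuffle_phase(pairs):
    """Shuffle: group all emitted values by key, as the framework does between map and reduce."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped


def reduce_phase(key, values):
    """Reducer: sum the counts collected for one word."""
    return key, sum(values)


def word_count(lines):
    """Chain the three phases over an in-memory list of lines."""
    pairs = [pair for line in lines for pair in map_phase(line)]
    grouped = shuffle_phase(pairs)
    return dict(reduce_phase(key, values) for key, values in grouped.items())


if __name__ == "__main__":
    sample = ["big data with hadoop", "hadoop stores big data"]
    print(word_count(sample))
```

    In an actual Hadoop job the mapper and reducer run on different nodes and the shuffle moves data across the network; the in-memory dictionary here only stands in for that step.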

    2. Project – Hadoop YARN Project – End-to-End PoC

    Problem Statement – This project includes:

    • Importing movie data
    • Appending the data
    • Using Sqoop commands to bring the data into HDFS
    • End-to-end flow of transaction data
    • Processing real-world data or huge amounts of data (such as movie data) using a MapReduce program


  • Availability: In Stock
  • Rs8,000

  • Ex Tax: Rs8,000