HADOOP

HADOOP TRAINING

HADOOP

  • What is Hadoop?
  • Why Hadoop and its use cases.
  • Different Echosystems of Hadoop
  • Hadoop is Good For And Hadoop Is Not Good For

HDFS(Hadoop Distributed File System)

  • Significance of HDFS in Hadoop
  • Features of HDFS
  • 5 daemons of Hadoop
    • Name Node and its functionality
    • Data Node and its functionality
    • Secondary Name Node and its functionality
    • Job Tracker and its functionality
    • Task Tracker and its functionality
  • Data Storage in HDFS
    • Introduction about Blocks
    • Data replication
    • Accessing HDFS
      • CLI(Command Line Interface) and admin commands
      • How to store various types of data in HDFS using CLI-commands
      • Java Based Approach
    • Safemode concepts in HDFS

MAPREDUCE

  • Map Reduce Architecture
  • Map Reduce Programming Model
    • Different phases of MapReduce Algorithm
      • Mapper phase
      • Sort&Shuffle phase
      • Reducer phase
    • Different Data types in Map Reduce
    • How Write a basic Map Reduce Program
      • The Driver Code
      • The Mapper
      • The Reducer
  • Creating Input and Output Formats in Map Reduce Jobs
    • Text Input Format
    • Key Value InputFormat
    • Sequence File Input Format
  • Important features of MapReduce job
    • Data localization in Map Reduce
    • Combiner(Mini Reducer)
    • Partitioner

Apache PIG

  • Introduction to Apache Pig
  • Map Reduce Vs Apache Pig
  • SQL Vs Apache Pig
  • Different datatypes in Pig
  • Modes Of Execution in Pig
    • Local Mode
    • Map Reduce OR Distributed Mode
  • Exection Mehanism
    • Grunt Shell
    • Script
    • Embeddeb
  • Transformations in Pig
  • How to write a simple pig script
  • How to store Pig output data in Sqoop&HDFS
  • UDFs in Pig

HIVE

  • Hive Introduction
  • Hive architecture
  • Hive Meta Store
  • Hive Integration with Hadoop
  • Hive Tables
    • Managed Tables
    • External Tables
  • Hive Query Language(Hive QL)
  • How to load the data to Hive Tables
  • Altering Tables In Hive
  • Partitions In Hive
  • CTAS In Hive
  • Joins In Hive
  • SQL VS Hive QL
  • Hive Transform
  • UDF’s In Hive

SQOOP

  • Introduction to Sqoop
  • MySQL client and Server Installation
  • How to connect to Relational Database using Sqoop
  • Different Sqoop Commands
    • Different flavors of Imports
    • Sqoop Eval Functions
    • Export

HBase

  • Hbase introduction
  • Hbase usecases
  • Hbase basics
    • Column families
    • Scans
  • Hbase Architecture
    • Hmaster
    • Zookeeper
    • Region Servers
    • Regions
  • How to create the tables in Hbase
  • Introduction about OOZIE
  • Introduction about ZOOKEEPER

Flume

  • What is Flume?
  • How Flume work?
  • Flume Architecture
  • Flume Agents
  • Flume Examples

Oozie

  • What is Oozie?
  • How oozie will works?
  • Oozie workflow

Cassandra

  • What is Cassandra?
  • How Cassandra will work