MAIL ID : contact@suntrainings.com
PH NO : +91 9642434362
Course Objective Summary
Introduction to Big Data and Hadoop
Hadoop ecosystem - Concepts
Hadoop MapReduce concepts and features
Developing MapReduce applications
Pig concepts
Hive concepts
Oozie workflow concepts
Flume concepts
Hue concepts
HBase concepts
Real Life Use Cases
VirtualBox/VMware:
Basics
Installations
Backups
Snapshots
Linux:
Basics
Installations
Commands
Hadoop:
Why Hadoop?
Scaling
Distributed Framework
Hadoop vs. RDBMS
Brief history of Hadoop
Setting up Hadoop:
Pseudo mode
Cluster mode
IPv6
SSH
Installation of Java and Hadoop
Hadoop configuration
Hadoop processes (NN, SNN, JT, DN, TT)
Temporary directory
UI
Common errors when running a Hadoop cluster, and their solutions
HDFS - Hadoop Distributed File System:
HDFS Design and Architecture
HDFS Concepts
Interacting with HDFS using the command line
Interacting with HDFS using Java APIs
Dataflow
Blocks
Replicas
Hadoop Processes:
Name node
Secondary name node
Job tracker
Task tracker
Data node
MapReduce:
Developing a MapReduce application
Phases in the MapReduce framework
MapReduce input and output formats
Advanced Concepts
Sample Applications
Combiner
Reduce-side join
MapReduce – customization:
Custom InputFormat class
Hash Partitioner
Custom Partitioner
Sorting techniques
Custom OutputFormat class
Hadoop Programming Languages:
I) Hive
Introduction
Installation and Configuration
Interacting with HDFS using Hive
MapReduce programs through Hive
Hive commands
Loading, filtering, grouping…
Data types, operators…
Joins, groups…