You are on page 1of 2

RDBMS

ORACLE

DB2 SQOOP
HUE
MSSQL A Graphical Interface
for allowing
MySQL
Users to connect
PostgreSQL Hadoop
HBASE for DML

Hive Metastore on
PostgreSQL FDW MySQL

HiveServer
MySQL MySQL Applier for
Replication Hadoop

MSSQL Namenode
HDINSIGHT Jobtracker
AZURE
FLUME Ganglia Server
for monitoring
LOGS hadoop nodes

WEB
To HIVE N number of Datanode Server
SYSTEM Managed spread over different Racks
Tables
Datanode + Tasktracker
FILES

CSV/TSV

PIG Client for HDFS Client Beeline for


Map Reduce for Accessing connecting to
(Kerberos Auth) Hadoop Hive
SQOOP:
It is used to export/import table data from RDBMS to HIVE and vice versa

FDW: Foreign Data Wrapper – a component of PostgreSQL , hive managed tables can be accessed
Sitting on PostgreSQL prompt

MySQL Applier for Hadoop: This feature gives real time data transportation from MySQL replication to
Hive managed table in hadoop (Practical not done)

FLUME: It is used for real time logs from any log generating server to HDFS and can be used analysis

Ganglia: It is used for monitoring nodes of hadoop

Hive Server: To access hive managed tables remotely hive server is require

Hive Metastore: Hive managed tables metadata is stored in mysql

Beeline: It is client , used to access hive managed table remotely

HUE: It gives us graphical user interface and managing it data in its own sqlite database.
Creation of user, privileges to access hadoop

PIG: It is used to access hdfs and to run map reduce program

Security: It is applied through Kerberos based authentication and also through HUE

You might also like