
Performance of Hadoop on OpenStack
Andrew Lazarev
Mirantis, 2014
Agenda
Introduction
Environment description
Direct virtualization impact
Real-life workload
Data locality
Conclusion
What Is Hadoop?
Hadoop ecosystem (diagram):
Ambari (Management)
ZooKeeper (Coordination)
Oozie (Scheduling)
HDFS (File System)
HBase (NoSQL Store)
MapReduce (Programming Framework)
Pig (Data Flow)
Hive (SQL)
Storm (Real-time computation)
HDFS and MapReduce form core Apache Hadoop.
Why Virtualize Hadoop?
Easy-to-operate clusters
One-click self-service provisioning
Hardware sharing between several Hadoop clusters
Tenant isolation at the hypervisor and network layers
Comparable performance with much more flexibility
How To Virtualize?
Sahara - the OpenStack Data Processing project
Integrated with OpenStack
Supports Hadoop 1 and 2
Supports different vendors (Apache, Hortonworks, Intel*)
Provides cluster provisioning and on-demand job execution
Virtualization Impact
Direct impact:
Disk write
Disk read
Network
CPU
Indirect impact:
Lack of low-level system control
Resources consumed by hypervisor operation
Environment
Mirantis OpenStack Express cluster
20 nodes
CPU: 24 x 2.10 GHz (2 x Intel Xeon CPU E5-2620)
Memory: 8 x 4.0 GB, 32.0 GB total
Disk: 1 drive, 0.9 TB (WDC WD1003FBYX-0)
Network: 2 x 1 GbE
Environment (continued)
Host OS: CentOS 6.5
VM OS: CentOS 6.5
Mirantis OpenStack
QEMU-KVM 1.2.0
Network: Neutron + GRE
Open vSwitch 1.10.2
Environment (continued)
Hadoop: Vanilla Apache 1.2.1
Bare metal setup: 19 Hadoop nodes
OpenStack setup: 1 controller + 19 compute nodes, 19 (or 57) VMs with Hadoop
Disk Write (using dd)
*greater is better
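The deck does not show the exact dd invocation; a typical direct-I/O sequential write run looks like this sketch (the target path and sizes are assumptions):

    # Sequential write, bypassing the page cache (oflag=direct) so guest
    # and host caches do not inflate the result; /mnt/data is hypothetical.
    dd if=/dev/zero of=/mnt/data/dd.test bs=1M count=10240 oflag=direct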
Disk Write (hadoop test)
TestDFSIO - built-in Hadoop IO test
Write test and read test
1000 files of 1 GB each (1 TB total)
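For reference, a TestDFSIO run at this scale on Hadoop 1.2.1 would look roughly like the sketch below (the jar location is an assumption):

    # Write 1000 files of 1000 MB each (~1 TB total) into HDFS.
    hadoop jar $HADOOP_HOME/hadoop-test-1.2.1.jar TestDFSIO -write -nrFiles 1000 -fileSize 1000
    # The read test then re-reads the same files.
    hadoop jar $HADOOP_HOME/hadoop-test-1.2.1.jar TestDFSIO -read -nrFiles 1000 -fileSize 1000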
Disk Write (hadoop test)
*less is better
Disk Cache Mode
disk_cachemodes parameter in nova.conf:
writethrough (default) - guest disk write cache is disabled
writeback - guest disk write cache is enabled
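A minimal nova.conf sketch for switching to writeback (the config section depends on the OpenStack release; in releases of this era the libvirt options live under [libvirt]):

    [libvirt]
    # disk_cachemodes takes "<disk type>=<cache mode>" pairs;
    # "file" covers the usual file-backed ephemeral disks.
    disk_cachemodes = "file=writeback"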
Disk Write (dd, writeback cache)
Writeback cache enabled
One large VM per host, with all of the host's memory
*greater is better
Disk Write (hadoop test, writeback cache)
*less is better
Disk Write - Way To Improve
QEMU 1.4: high-performance virtio-blk data plane implementation
+108.0% on random write (based on a Red Hat presentation at KVM Forum)
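In QEMU builds of that era the data plane was experimental and enabled per device; a hedged sketch of the qemu-kvm options (the drive path and IDs are illustrative):

    # x-data-plane was the experimental switch in QEMU 1.4-era builds;
    # it was typically used with raw images, cache=none and aio=native.
    qemu-kvm ... \
      -drive if=none,id=drive0,file=/var/lib/nova/disk.raw,format=raw,cache=none,aio=native \
      -device virtio-blk-pci,drive=drive0,x-data-plane=on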
Disk Read (using hdparm)
*greater is better
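hdparm's standard timing flags are the likely source of these numbers (the deck does not show the flags; device names are illustrative):

    # -t: buffered sequential device reads; -T: cached reads
    # (memory-speed reference). /dev/vda inside a guest, /dev/sda on bare metal.
    hdparm -tT /dev/vda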
Disk Read (hadoop test)
*less is better
Network (OVS+GRE)
*greater is better
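The measurement tool is not named in the deck; VM-to-VM throughput over the OVS+GRE fabric is commonly taken with iperf, e.g.:

    # On the receiving VM:
    iperf -s
    # On the sending VM (10.0.0.5 is a hypothetical receiver address):
    iperf -c 10.0.0.5 -t 60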
CPU (hadoop test)
PI - built-in Hadoop test
Depends mostly on CPU
50 series of 10,000,000,000 probes
*less is better
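With the stock examples jar from Hadoop 1.2.1, this workload corresponds to a run like the following (jar location assumed):

    # 50 map tasks, 10,000,000,000 Monte Carlo samples per task.
    hadoop jar $HADOOP_HOME/hadoop-examples-1.2.1.jar pi 50 10000000000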
Terasort
Built-in Hadoop test
Represents a real Hadoop workload
Involves:
IO
Networking
Computation
Sorting 200,000,000 100-byte entries (20 GB)
Writeback cache enabled
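A sketch of the equivalent commands with the Hadoop 1.2.1 examples jar (HDFS paths are illustrative):

    # Generate 200,000,000 rows of 100 bytes (~20 GB) in HDFS.
    hadoop jar $HADOOP_HOME/hadoop-examples-1.2.1.jar teragen 200000000 /benchmarks/tera-in
    # Sort the generated data set.
    hadoop jar $HADOOP_HOME/hadoop-examples-1.2.1.jar terasort /benchmarks/tera-in /benchmarks/tera-out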
Terasort
*less is better
Data Locality
Hadoop can take the distance between nodes into account:
Intelligent task scheduling
Reading data from nearby data nodes
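In Hadoop 1.x, node distance is taught to the framework through a rack-awareness script configured in core-site.xml; a minimal sketch, with a hypothetical script path:

    <!-- core-site.xml: map a host/IP to a network location such as /rack1
         (or a per-hypervisor location when DataNodes run in VMs). -->
    <property>
      <name>topology.script.file.name</name>
      <value>/etc/hadoop/topology.sh</value>
    </property>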
Data Locality
*greater is better
Data Locality
Network within a host is comparable to disk speed
Allows Hadoop process isolation (one VM per process)
Test:
1 Master Node (JobTracker + NameNode)
18 DataNodes
18 TaskTrackers
TeraSort of 20 GB of data
Terasort (data locality)
*less is better
Conclusion
Only 6% performance impact on the composite test
Performance is continuously improving with upgrades to external components (QEMU, Open vSwitch)
Much more topology flexibility
Isolation at low cost:
between clusters
between nodes within a cluster
Q&A
Thank you!
Andrew Lazarev
Launchpad/GitHub/IRC: alazarev
E-Mail: alazarev@mirantis.com
