Topics
Sectoral Analysis
Big Data Analytics in Banking
Big Data Analytics in Retail
Big Data Analytics in Supply Chain
Big Data Analytics in Telecommunications
Big Data Analytics in E-governance
Big Data Analytics in Healthcare
Topics
Role of Big Data Analytics in marketing
Big data and cloud analytics
Big data analytical frameworks
Privacy issues in Big Data
Acknowledgement
Cloudera
Hortonworks
Teradata University Network
Big Data University
Data Science Central
IBM
IBM IBV/MIT Sloan Management Review Study 2011
McKinsey / Gartner / IDC reports
Taming The Big Data Tidal Wave: Finding Opportunities in Huge Data Streams with Advanced Analytics (Author: Bill Franks)
Big Data (Authors: Viktor Mayer-Schönberger and Kenneth Cukier)
Internet (for generic search results)
Business
Methodology
Tools / Technology
Steps in Analytics
Data Generation
Data Capturing
Data Storing
Data Processing
Reporting and Visualization
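The steps above can be sketched as a small pipeline. This is only an illustrative toy, assuming simulated sensor readings and an in-memory string in place of real storage; all function names are hypothetical and not taken from any tool named in these slides.

```python
import json
import random
import statistics


def generate(n, seed=0):
    """Data generation: simulate n sensor readings (made-up data)."""
    rng = random.Random(seed)
    return [{"sensor": f"s{i % 3}", "value": rng.uniform(20.0, 30.0)} for i in range(n)]


def capture(events):
    """Data capturing: keep only well-formed events."""
    return [e for e in events if "sensor" in e and isinstance(e.get("value"), float)]


def store(events):
    """Data storing: serialise to JSON lines (a string stands in for a file)."""
    return "\n".join(json.dumps(e) for e in events)


def process(stored):
    """Data processing: average value per sensor."""
    by_sensor = {}
    for line in stored.splitlines():
        event = json.loads(line)
        by_sensor.setdefault(event["sensor"], []).append(event["value"])
    return {sensor: statistics.mean(values) for sensor, values in by_sensor.items()}


def report(summary):
    """Reporting and visualization: one plain-text bar per sensor."""
    return [f"{s}: {'#' * round(avg)} {avg:.1f}" for s, avg in sorted(summary.items())]


summary = process(store(capture(generate(30))))
for line in report(summary):
    print(line)
```

In a real deployment each stage would be a separate system (for example a message queue, a distributed file system and a reporting tool); the sketch only shows how the stages hand data to each other.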
We are surrounded by DATA
3 Vs of Big Data
R is open source
Hadoop is open source
RHadoop Packages are open source
Application Areas
Sensor data from machines
Social Network data analysis for promotion of products
Trend analysis on Twitter
Other Platforms
Hortonworks Sandbox
Cloudera
SAS Data Loader (SAS Cloudera)
What is supposed to be discussed
Generation of Big Data in organisation
Processing it
Reporting / Using it for organizational performance
Memory Unit
According to Zuckerberg, 1 billion pieces of content are shared via Facebook's Open Graph daily
https://www.aabacosmallbusiness.com/advisor/big-data-biggerfacts-132520713.html
30 billion
12+ TBs of tweet data every day
? TBs of data every day
25+ TBs of log data every day
4.6 billion camera phones worldwide
100s of millions of GPS-enabled devices sold annually
2+ billion people on the Web by end 2011
76 million smart meters in 2009, 200M by 2014
According to IBM, 90% of the world's information
Financial
Healthcare
Communications
Digital Media
Real Estate
Manufacturing
Travel
Retailing
Government
Energy
140,000 to 190,000 people with deep analytical skills will be needed by 2018
1,500,000 managers and analysts will be needed to fill jobs in Big Data by 2018
There will be a shortage of talent necessary for organizations to take advantage of big data. By 2018, the United States alone could face a shortage of 140,000 to 190,000 people with deep analytical skills as well as 1.5 million managers and analysts with the know-how to use the analysis of big data to make effective decisions.
63%
2010, 2011, 2012
business initiative
BUSINESS IMPERATIVE
IQ
1.6x Revenue Growth
2.5x Stock Price Appreciation
2.0x EBITDA Growth
Finally.
Big data is similar to small data, but bigger, faster and multi-structured.
But because the data is bigger, it requires different approaches:
Techniques, tools, architecture
article, 2011
Big data refers to data sets whose size is beyond the
ability of typical database software tools to capture,
store, manage and analyze. - The McKinsey Global
Institute, 2011
Today's Decision-making
Forward-looking recommendations
Exploit all data from diverse sources
Real-time, correlated, governed
Business Optimization
Complementary Approaches for Different Use Cases
Traditional Approach: Structured, analytical, logical
Data Warehouse
Traditional Sources: Transaction Data, Internal App Data, Mainframe Data, OLTP System Data, ERP data
Structured, Repeatable, Linear
Monthly sales reports, Profitability analysis, Customer surveys
New Approach: Creative, holistic thought, intuition
Hadoop, Streams
New Sources: Web Logs, Social Data, Text Data: emails, Sensor data: images, RFID
Unstructured, Exploratory, Iterative
Brand sentiment, Product strategy, Maximum asset utilization
Enterprise Integration
Big Data
Databases
Volume
Velocity
Data Volume
Exponential increase in collected/generated data
Volume: terabytes of Tweets created daily.
Velocity: 5+ million trade events per second.
Variety: 100s of different types of data.
Veracity: Only 1 in 3 decision makers trust their information.
Mobile devices (tracking all objects all the time)
Social media and networks (all of us are generating data)
Scientific instruments (collecting all sorts of data)
Progress and innovation are no longer hindered by the ability to collect data,
but by the ability to manage, analyze, summarize, visualize, and discover
knowledge from the collected data in a timely manner and in a scalable fashion
Old Model: Few companies are generating data, all others are consuming
Bootstrap
Enrich
Adaptive Analytics Model
Forecast
Nowcast
Opportunity Cost Starts Here
360-Degree View
Organizations have talked about a 360-degree view of their customers
What is a 360-degree view?
Names & Addresses
98% of Information
Motivation1, Motivation2
Intention1, Intention2
Preference1, Preference2
Etc.
Behaviors That Can Be Captured
Sources: Web sites, Kiosks, Mobile apps, Social media
Purchases
Product views
Shopping basket additions
Watching a video
Accessing a download
Reading / writing a review
Requesting help
Forwarding a link
Posting a comment
Registering for a webinar
Executing a search
Etc.
Shopping Behaviors
How customers come to a site to begin shopping
What search engine do they use?
What specific search terms are entered?
Do they use a bookmark they created previously?
Associated with higher sales rates
Search keywords
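One way to answer the questions above is to inspect the HTTP referrer of the first page view in a visit. The sketch below is a minimal illustration, not a production classifier: the host-to-parameter table is an assumption and covers only a few engines, and an empty referrer cannot distinguish a bookmark from a typed-in address.

```python
from urllib.parse import urlparse, parse_qs

# Assumed mapping from search-engine host to the query parameter holding the search terms.
SEARCH_PARAMS = {
    "www.google.com": "q",
    "www.bing.com": "q",
    "search.yahoo.com": "p",
}


def classify_arrival(referrer):
    """Classify how a visitor arrived, based on the referrer URL of the first page view."""
    if not referrer:
        # No referrer: a bookmark or a typed URL (the two cannot be told apart here).
        return {"channel": "direct_or_bookmark", "engine": None, "terms": []}
    parts = urlparse(referrer)
    param = SEARCH_PARAMS.get(parts.netloc)
    if param is None:
        return {"channel": "referral", "engine": parts.netloc, "terms": []}
    # parse_qs decodes "+" and percent-escapes in the query string.
    query = parse_qs(parts.query).get(param, [""])[0]
    return {"channel": "search", "engine": parts.netloc, "terms": query.lower().split()}


print(classify_arrival("https://www.google.com/search?q=red+running+shoes"))
```

Joining these arrival records to order data would then show which engines and search keywords are associated with higher sales rates.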
Research Behaviors
Understanding how customers utilize the research content can lead to tremendous insights into:
How to interact with each individual customer
How different aspects of the site do or do not add value
Detailed specification
Feedback Behaviors
Some of the best information is detailed feedback on products and services
Customers in general
Each specific customer
He has four accounts: checking, savings, credit card, and a car loan
He makes five deposits and 25 withdrawals per month
He never visits a branch in person
He has a total of $50,000 in assets deposited
He owes a total of $15,000 between his credit card and car loan
Attrition Modeling
In the telecommunications industry, companies have invested massive amounts of time and effort in churn models
Provider 101's cancellation policies page
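A visit to the provider's cancellation policies page is the kind of web signal that can feed a churn model. The following is a minimal sketch under stated assumptions: the page path, the `PageView` record and the threshold are all hypothetical, and a real churn model would combine this flag with many other variables.

```python
from dataclasses import dataclass

# Hypothetical path of the cancellation policies page.
CANCELLATION_PATH = "/support/cancellation-policies"


@dataclass
class PageView:
    customer_id: str
    path: str


def at_risk_customers(page_views, min_views=1):
    """Return customers who viewed the cancellation page at least min_views times."""
    counts = {}
    for view in page_views:
        if view.path == CANCELLATION_PATH:
            counts[view.customer_id] = counts.get(view.customer_id, 0) + 1
    return sorted(cid for cid, n in counts.items() if n >= min_views)


views = [
    PageView("smith", "/plans"),
    PageView("smith", CANCELLATION_PATH),
    PageView("jones", "/plans"),
]
print(at_risk_customers(views))
```

The value of the flag is timing: it is available as soon as the page is viewed, before any cancellation request is made.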
Response Modeling
It is similar to attrition modeling
The goal is predicting a positive behavior (purchase or response) rather
than a negative behavior (churn)
Has the exact same score due to having the same value: 0.62
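When several customers receive the same response score, recent web behavior can separate them. The sketch below assumes three hypothetical customers who all score 0.62 and two illustrative browsing flags; it simply breaks ties by counting the flags and is not a fitted model.

```python
# Hypothetical customers: identical model score, different recent web behavior.
customers = [
    {"id": "c1", "score": 0.62, "viewed_product": False, "added_to_basket": False},
    {"id": "c2", "score": 0.62, "viewed_product": True, "added_to_basket": False},
    {"id": "c3", "score": 0.62, "viewed_product": True, "added_to_basket": True},
]


def rank_for_offer(customers):
    """Order customers by score, breaking ties with the number of web signals seen."""
    def sort_key(c):
        web_signal = int(c["viewed_product"]) + int(c["added_to_basket"])
        # Negate so that higher scores and stronger signals sort first.
        return (-c["score"], -web_signal, c["id"])
    return [c["id"] for c in sorted(customers, key=sort_key)]


print(rank_for_offer(customers))
```

In practice the browsing variables would be added to the response model itself so that the scores differ, rather than being applied as a tie-breaker afterwards.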
Customer Segmentation
Web data makes it possible to segment customers based upon typical browsing patterns
Dreamer
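A browsing-pattern segmentation can start from simple rules before moving to clustering. In the sketch below only the "Dreamer" label comes from the slide; its definition here (repeated basket additions with no purchase), the other two labels and all thresholds are assumptions made for illustration.

```python
def segment(profile):
    """Assign a customer to a segment from counts of browsing events."""
    views = profile["product_views"]
    adds = profile["basket_adds"]
    buys = profile["purchases"]
    if adds >= 3 and buys == 0:
        # Assumed definition: keeps filling the basket but never buys.
        return "Dreamer"
    if buys > 0 and views <= 2 * buys:
        return "Decisive buyer"  # hypothetical label
    return "Browser"  # hypothetical label


profiles = {
    "a": {"product_views": 40, "basket_adds": 5, "purchases": 0},
    "b": {"product_views": 4, "basket_adds": 2, "purchases": 2},
    "c": {"product_views": 12, "basket_adds": 1, "purchases": 0},
}
for customer_id, profile in profiles.items():
    print(customer_id, segment(profile))
```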
Bing Liu