Professional Documents
Culture Documents
Big Data
A field guide for Industry-based Big
Data Opportunities
Oracle Inc.
1st Edition
Utilities Industry.........................................................................33
Big Data Use Cases ............................................................................................34
Industry Solutions ...............................................................................................37
Data Sources .......................................................................................................38
Foreword
Data?
A simple definition would be that data becomes Big Data, or rather a Big Data Problem,
when the volume, velocity, and/or variety of the data exceeds the abilities of your current IT
systems to ingest, store, analyze, or otherwise process it.
This simple definition hints at some not-so-simple challenges. Certainly, there are solutions
for handling large quantities of data. Networking and bus technologies provide a transit system
for moving data rapidly. But, what happens when that data is messy - a mix of structured and
unstructured data - that doesnt fit neatly into defined data structures, AND its high volume,
AND it needs to be processed quickly? Think of data from millions of sensors on electricity
networks or manufacturing lines or oil rigs. Identifying deviations from past trends in this data
(and whether the deviations are safe or unsafe deviations) in real time can help avoid a
power outage, reduce waste and defects, or even avoid a catastrophic oil spill.
This type of problem is occurring in almost all industries today. The volume of data is
growing too fast for traditional analytics because the data is becoming richer (each element is
larger or more varied), more granular (time intervals decreasing from months to days or days to
minutes), or just needs to be processed much faster than it used to.
The volume, velocity and variety problems all contribute to the growing need for larger and
larger data storage and processing requirements, but the final component; value, is where
investments are made. Companies need to find value in storing and processing the data in order
to justify capturing and keeping it in the first place, and this is a fundamental shift for many
organizations. It can also be a huge competitive advantage. Much of what this book focuses on
is providing you with that trick; convincing a company that in their industry there is competitive
advantage to using this data.
Oracle commissioned a study and this was one of the surprising answers only 14% of the
interviewed executives have not been considering unstructured data as an important part of their
business. Because of this, many companies and organizations are starting to realize - their
traditional IT systems are not prepared to deal with the rapid growth of Big Data that the very
same IT systems have enabled.
We have asked each of our industry big data experts to describe for you the big data
challenges that have surfaced in each industry. Each chapter contains descriptions of use cases in
the customers language, and lists of common sources of data involved. Ask your customer
which data sources they have available; and even if they already capture it there are likely
additional uses for the data like the ones contained in this book. Weve also given guidance on
whom to speak to at the customer and the opportunity they would most likely be interested in.
This book will enable you to have a conversation about Big Data thats specific to your
customers industry, and link out to content for an executive discovery workshop.
Chapter 1
The real innovation here is that we can ask questions and get the answer
back before we have forgotten why we asked the question in the first
place.
Hilary Mason, Chief Scientist Bit.ly
The next question many people throw at this idea of providing structure to the data just in
time is why dont I do that with my whole data warehouse? The simple answer is that it is
cost prohibitive to structure all data the same way a data warehouse requires. Your customers
already know how long data warehousing projects take; and that is with data that is already
structured but needs transformation. Twitter feeds and sensor data dont have a structure to begin
with (at least not a data warehouse compatible structure). Have a few queries provide dynamic
structure to the data in a data warehouse and it will work just fine. Try to do that with thousands
or tens of thousands and it becomes cost prohibitive; so traditional data warehouses still have
their place, and in fact Oracles analytics solution handles the whole lifecycle from Information
Discovery all the way through traditional Business Intelligence. The key enabler of the big data
appliance is to reduce the data to the point that it can be put in a data warehouse with good
structure so it can be compared against the rest of the customers data.
The architecture for accomplishing this type of new data paradigm is extremely important.
Some companies are using pieces from a variety of vendors and open source components to
accomplish this new data lifecycle; but only Oracle provides an end to end solution for big data,
all the way from acquiring the data and finding new insights all the way to making repeatable
decisions and scaling the analytics out to the organization at large. This architecture is very
important to making use of big data, because the analysis of data is really on a continuum not
nicely isolated use cases. Data in your RDBMS (Oracle relational database) will be useful as
part of a big data analysis, and vice versa; so having a solution that lets data move easily between
the parts of the architecture is important. Corporate data warehouses dont become obsolete in
this model, they become more important as the business finds new analyses over time that are
new imperatives to running the business.
The last bit there is probably the best takeaway point figure out where your customer is on
their big data lifecycle. Do you need to convince them there is something worthwhile to
Explore? Do they need to be convinced of large areas to expand (like new data sources), but
already have some HADOOP or Exadata boxes that they are using data mining or similar
techniques within? Do they already have custom adapters and move big data aggregations from
a HADOOP platform into their data warehouses, and need to be shown the value of scale of
truly exploiting a common architecture and platform?
The final important point when examining this problem is that Oracle has a large number of
components that fit the big data problem, and we sell best when we sell them together. BI
applications, BI foundation, the Exalytics platform it runs on, Data mining and Exadata, the Big
Data Connectors and the Big Data appliance itself all have something to contribute to the
solution and the conversation. Dont forget about the real-time components covered by the
Fusion Middleware platform with Complex Event Processing and Real Time Decisions for
streaming analysis and recommendations, and the Exalogic platform they run on. Add in our
partners, GBU industry applications and IBU industry solutions and you have a team at your
fingertips ready to help you sell this vision.
Chapter 2
electronic trading has meant that Capital Markets firms both generate and act upon hundreds of
millions of market related messages every day. For the most part, financial services firms have
relied on relational technologies coupled with business intelligence tools to handle this everincreasing data and analytics burden.
It is however increasingly clear that while such
technologies will continue to play an integral role, new technologies many of them developed
in response to the data analytics challenges first faced in e-commerce, internet search and other
industries have a transformative role in data management within this industry.
On-Demand risk, especially at a trading desk level, is now the desired goal of global banks.
The objective is not only faster measurement and reporting of risk, but also measurement across
asset classes.
Aggregation of global positions, pricing calculations,
and VaR, all fall within the realm of Big Data. This is due
to the mounting pressure to speed these calculations up well
beyond the capacity of current systems, but also because of
the need to deal with ever growing volumes of data. While
firms have adopted the use of compute grid technologies to
enable faster risk computations, the feeding of data into
these grids has become a bottleneck. Technologies such as Oracle Coherence complement
compute grid technologies, and enable faster calculations by allowing real time access to inmemory positions data and by allowing MapReduce style parallelization. Such an architecture
helped a global bank reduce VaR calculation time from 15 simulations was increased almost 25
fold.
For Enterprise Risk Management, the added challenge is of data integration from many
disparate systems. (It is not uncommon for data to be sent from source systems as flat files e.g.)
The Oracle Big Data Appliance and Big Data Connectors enable ETL-style processes to be
parallelized on Hadoop before the data is loaded into an enterprise risk warehouse. For the risk
warehouse itself, the Oracle Exadata machine with in-database analytics virtually eliminates the
performance bottlenecks typically associated with running SQL processes on database servers.
Enterprise risk software, including Oracle Financial Services Liquidity Risk Management,
benefit from this capability. In a benchmark Oracle Financial Services Liquidity Risk
Management running on Oracle Exadata Database Machine calculated business-as-usual
liquidity gaps for 371 million cash flows across 66 million accounts in just 69 minutes. After
applying modified behavior assumptions to simulate adverse market conditions, stressed
liquidity gaps were calculated in only 10 minutes. With the ability to execute an individual stress
test run in mere minutes, institutions can refine their scenarios to simulate any impact on
business-as-usual liquidity gaps and immediately assess the effects of a given counterbalancing
strategy.
Transaction Cost Analysis, which measures actual order execution performance against
established benchmarks metrics, is another excellent example. TCA, initially adopted mainly as a
check the box tool for compliance with best execution regulations, has now found wide
acceptance outside the compliance department. TCA is now used to assess broker performance
internally, to identify outlier trades and to measure performance of algorithms sold by the sellside. A number of large global banks have implemented trade data warehouses using Oracle
Exadata technology (BNP Paribas is a reference customer). The Big Data Appliance (BDA) is a
new technology, but one that can complement the Exadata based trade warehouse. Using the
BDA will allow faster trade capture on HDFS, and faster processing (using Hadoop and R)
before processed data is loaded into Exadata for analysis.
incurred by the affected firms. Here too, a lot of data needs to be crunched from multiple,
inconsistent sources in a very dynamic way, requiring a new technical approach to the analytics
platform.
Fraud Detection
Fraud detection involving cards debit, and wholesale payments is also quickly becoming a
big data problem, in as much as correlating data from multiple, unrelated sources has the
potential to catch more fraudulent activities earlier than current methods. Consider for instance
the potential of correlating Point of Sale data (available to any credit card issuer) with web
behavior analysis (either on the bank's site or externally), and potentially with other financial
institutions or service providers such as First Data or SWIFT, to detect suspect activities.
Payment providers have developed fraud detection tools that depend on massive datasets
containing not only financial details for transactions, but IP addresses, browser information, and
other technical data that will help these companies refine models to predict, identify, and prevent
fraudulent activity. These enhance the traditional approaches
to fraud prevention, which are mostly based on sanctions lists
and pre-defined rules.
Any compliance, fraud and security department in any
financial institution should be interested applying new
technologies to enhance the current Know Your Customer
initiatives, watch list screening, and the application of
fundamental rules. Correlating heterogeneous data sets has
the potential to dramatically improve fraud detection, and
could also significantly decrease the number of false positives (e.g. using a card while traveling).
INDUSTRY SOLUTIONS
Link into the Oracle Industry Solutions database for conversation scripts and questionnaires
that can get your client thinking about their big data solutions.
The Financial Services solutions are at:
http://my.oracle.com/site/ibu/portal/IndustrySalesPlays/Industries-A-K/FinancialServices/
Solution2/index.html?ssSourceNodeId=23441&ssSourceSiteId=ibu.
The Insurance solutions are at:
http://my.oracle.com/site/ibu/portal/IndustrySalesPlays/Industries-A-K/Insurance/Solution4/
index.html?ssSourceNodeId=23456&ssSourceSiteId=ibu.
DATA SOURCES
Structured Internal Sources:
CRM Data
Marketing Plans
Competitor Information
Chapter 3
Media Industry
BIG DATA USE CASES
Digital Advertising Sales
The old adage that half of all advertising is effective, we just dont know
which half no longer applies in the digital age. A fast-growing proportion
of advertising inventory is now sold in real-time through online auctions
where media owners (publishers, web sites) trade detailed demographic
information about their users for performance based advertising. In other
words the Google model for advertising sales is appearing in mainstream
media. Data is now gold for media companies.
For example, a detailed understanding of audiences by socio-economic group,
demographics, content consumption patterns, likes and interests can be built up from account
details, content consumption logs, viewing data and social media interactions. This enables
tighter segmentation of consumers, enabling advertising sales managers to increase advertising
rates (CPMs) and win a greater share of online advertising. The data can be used directly by
advertising sales teams, or sold as an additional revenue source to advertising trading partners.
The same detailed audience data also allows advertising sales teams to offer targeted
addressable advertising to their buyers, where advertisers will pay a premium to reach known
specific demographic niches.
Advertising agencies are able to use the same types of data to create more effective content
(advertising creative), and plan more effective, cost-efficient campaigns when purchasing media
space on behalf of their brand clients.
Increasing subscribers
The same analysis of Big Data sources used to increase advertising rates can also be used to
produce and deliver more relevant content to users, target them with appropriate marketing
messages, and increase the number of users and paying subscribers using a media companys
services.
products and platforms. Informed decisions can be made about pricing, product bundling and the
most effective payment and advertising models for each service and platform.
INDUSTRY SOLUTIONS
The IBU Media Analytics and Content Personalization solution will be available shortly from
the IBU portal.
http://my.oracle.com/site/ibu/portal/IndustrySalesPlays/Industries-L-Z/MediaEntertainment/
Solution1/index.html
The executive conversation script and discovery questionnaire will allow you to work with
media execs to focus in on their Big Data priorities and position Oracle Big Data, Analytics, BI,
Discovery and Real-Time Decisions solutions.
DATA SOURCES
These are examples of the type of data we commonly see used in the Media industry. Only a
fraction of this data typically ends up in a data warehouse and is available for analysis. What
questions could be answered if all of this data could be combined and analyzed?
Cookies
Billing data
Purchase histories
Click-throughs
Order management
Device location
Mobile payments
Advertising
Content licensing
Traditional media
Content metadata
Competitor content
CRM data
Chapter 4
Healthcare Industry
BIG DATA USE CASES
Remote Patient Monitoring
With the world wide goal of reducing the costs of healthcare and
improving patient outcomes, many countries are looking to more closely
monitor patients on a constant real time basis. The monitoring can include in
home devices such as glucometers, weight scales, pedometers and others. The
Volume and Velocity of this data, as well as the real time nature of the analysis
and action necessitates a Big Data Solution.
For example, for patients suffering from a chronic
disease such as diabetes or congestive heart failure, the ability to monitor
the patient for weight gain, blood sugar levels and exercise attempts will
allow the care team to more appropriately converse with the patient. The
ability to extend the healthcare system into the home allows for a much
better quality of life for the patient, while at the same time giving more
visibility in the current health of the person.
The care team composed of a Case Manager, Physician or Nurse can proactively
contact the patient and provide suggestions to the patient to help improve the current condition of
concern, even being able to recommend that the patient report to an Emergency Room for
immediate treatment if needed.
Another example where real time in home devices can be used is for independent living. Just
because many countries are experiencing an ageing population, does not mean that the
population wants to give up the ability to live alone. But, living alone does not mean that there
are not people that are concerned about the well being of the person. Having the ability to
covertly monitor the person, with their permission, provides a level of safety to determine if
someone has fallen, not gotten out of bed, or has been missing meals.
Accountable Care Organizations (ACOs) or Service Providers will be interested
in providing the services needed to insure that their customers are living independent and healthy
lives.
Healthcare Analytics
With the healthcare industry moving from a paper based system to an on line digital system
around the world, the usage of EMR (Electronic Medical Record) systems is on the rise.
Unfortunately, much of this data is locked in a system designed to treat patients on an episodic
fashion, and may not contain the full longitudinal health record of the patient. Harvesting this
data is the current format has proven to be difficult. With the maturing of some solutions based
on Big Data architectures, the ability to unlock and analyze this information is now possible.
Having the ability to review patient outcomes with different treatment plans
has often been the want and need of the medical research community.
Solving the Volume and disparate nature of the data storage has long been
an issue in the industry.
The CMIO (Chief Medical Information Officer) or CRO (Chief Research
Officer) at many healthcare organizations is very interested in accessing the
scientific evidence to validate that the treatment plans being utilized are actually being effective,
efficient and at the best cost.
INDUSTRY SOLUTIONS
There are several Oracle Solutions to address Big Data in Healthcare
The Connected Health Solution can be utilized for Remote Patient Monitoring. This will
provide the framework needed to accept the data from each of the remote devices, and populate
the appropriate data stores.
The Translational Research Center is available - http://www.oracle.com/us/industries/healthsciences/oracle-translational-research-ds-497608.pdf - http://www.oracle.com/webapps/dialogue/
ns/dlgwelcome.jsp?p_ext=Y&p_dlg_id=11416590&src=7138239&Act=253 The conversation
around this solution can become pretty complex in short order, so the inclusion of SMExperts is
mandatory.
For the Healthcare Analytics Solution There are many Oracle products that can be
positioned for this solution, from Fusion MiddleWare for the data acquisition and database
population to the Health Sciences products with form the base of the solution. http://
www.oracle.com/us/industries/healthcare/058441.html -
DATA SOURCES
Much of the data required in Healthcare is proprietary data that is already in the possession of
the Healthcare Research Entity, or in public registries.
EMR Data
CDR
Data Warehouse
ERP
CRM
Cancer Genomics Hub
U.S. Health Data Healthdata.gov
Chapter 5
Retail Industry
Retailers are interested in solutions that help them differentiate from their competition and
maximize customer experience. Big Data capabilities enable retailers to collect and extract these
insights from transaction history, purchase frequency and web-behavior, as well as external
environments such as social media, demographics, weather and finance. The data can be
harnessed in multiple ways, from structured databases and distributed predictive analytic systems,
to mining of unstructured data.
Many companies are increasing the use of Data Discovery tools in addition to traditional BI
to tap into unmet customer demands. This new approach to analytics, often by easy-to-use, self
service analytics applications, helps the retailer to explore questions like why and what if,
and brings a new agility to BI and a wider use of analytics all over the organizations. This
chapter will focus on these use cases:
Omni-Channel Marketing how to get customers to spend with you.
Customer Satisfaction making sure the customer experience is more positive than the
competition.
Segment and Sentiment Analysis getting to know your customers from the data they
generate outside your walls.
Value Add to Customers being able to identify changes to your go to market strategy
based on customer sentiments.
For example, merging structured with unstructured content to find underlying customer
satisfaction issues allows enterprises to proactively monitor customer satisfaction levels. At
many retailers, sales and customer service still work in separate silos and customer feedback is
often not allowed to flow freely between the different operations resulting in ineffective
distribution channels.
A COO would be interested in the convergence of sales information, call center operations
and social media enables Big Data to create correlation between product sales, support and
customer voice to validate the true issues impacting on customer satisfaction and for the
targeting of new customer segments, even competitors customers can be analyzed for industry
trends to reveal customers propensity to buy certain products or services.
Another customer satisfaction issue solved by Big Data is to identify the most valuable
customers from a 360 degree view; to be able to reward them with offers and benefits relevant to
a loyalty program, and to exclude those customers that merely take advantage of discounts
without shopping at the merchants again.
Store operations, customer services and to some extent marketing would be interested in this
solution to get the most benefit from sales and promotions. The purpose of these is to keep loyal
customers by making them feel rewarded and special, and these insights allow better focus and
less waste in that effort.
by efficient integration of social media (unstructured data) such as blogs, social networks, service
centers combined with Big Data capabilities, retailers can better understand their customers, their
preferred channels, lifestyles and evolving service needs.
For example, in a retail market where margins are under constant pressure and product
duplication is almost immediate on a global market, retail leaders need the capability to swiftly
respond to changes in customer demand where integration between structured and unstructured
data provides market leaders with improved decision-making and drive faster response times to
market needs.
A lead analyst or any C-level executive would be interested in this. The analysts in retail
companies today spend a lot of time in spreadsheets and discovery tools that allow them to spend
more time on analysis and less time managing and massaging data can improve the companys
ability to make timely decisions.
INDUSTRY SOLUTIONS
Link in to the Oracle Industry Solutions database for conversation scripts and questionnaires
that can get your client thinking about their analytics and data warehouse solutions and check out
the Retail Insights sales play at the IBU Retail Industry Play Portal at HTTP://MY.ORACLE.COM/
SITE/IBU/PORTAL/INDUSTRYSALESPLAYS/INDUSTRIES-L-Z/RETAIL/SOLUTION1/INDEX.HTML
Learn more about Oracle retail solutions enhanced by Big Data capabilities on the Oracle
retail Content Portal http://contentportal.oraclecorp.com/industries/retail.html
Listen to the PodCast for Business Services with an industry Overview on the Sales and
Marketing Content Portal for Engineered Systems.
http://my.oracle.com/site/ibu/portal/ExaBusinessSolutions/SalesPlays-Industries/
BusinessServices/index.html
Learn more about Data Warehousing Big Data Sales Content at:
h t t p : / / m y. o r a c l e . c o m / s i t e / i b u / t e c h n o l o g y / Te c h P r o d u c t M k t g H o m e / D a t a b a s e /
DataWarehousing/SalesKits/index.htm
If the external data sources are the biggest topic, then check out the High Perf Demand
Signal Repositories plays at http://my.oracle.com/site/ibu/portal/ExaBusinessSolutions/
SalesPlays-Industries/Retail/index.html). The Top 5 Objections, Competitive Traps or Questions
section of the Discovery Guide is really useful for convincing your client that they shouldnt go
off on their own and there are some great podcasts here to get you up to speed fast.
DATA SOURCES
There are two main classifications of data sources in Retail; internal and external. The
internal sources are available but its often too expensive and difficult to align their hierarchies
with the other data sources in the company so it just hasnt been done to date. The other group is
external data sources whose hierarchies dont match up nicely to the product or segment
hierarchies that are used internally either. With Big Data it is possible to build hierarchies on the
fly based on rules that can be easily found using tools like Endeca, making mix and match of
data sources easier than what many retailers expect.
Examples of Structured Internal Sources
CRM Data
Sales Data
Marketing Plans
Shipments
Promotions
Retail Execution
Competitor Bench Prices and Promotions (from other Retailer public websites)
Chapter 6
Trade Data data from retailers that is probably not captured or not used effectively at
your customer.
Sales & Supply Data they have the data today but dont combine it well, and they dont
use it in real time.
Sentiment Data Consumer Goods companies are all about their brand, and theyll be
interested in solutions that help them understand the shopper better.
For another example, looking at a few years worth of Walmart data and sales data for
historical product mixes and comparing those to competitor sales can provide a minimal product
mix designed to grab as much market share as possible with the fewest products.
Products in a category cannibalize each other, and big data can be used to estimate an optimal
mix of products to steal market share away from competitors while limiting cannibalization and
maximizing profits. Category managers could use the data to optimize their product mix, and
marketing managers could also use it to maximize return on marketing dollars.
Another example is Trade Promotion Optimization. Every Consumer Goods company
pays retailers to put their product on the shelf and these payments are called trade
promotions. They are always bi-directional agreements to promote the companys products, but
a lot of money in the industry is wasted.
Key Account Managers and Sales Managers can use this data to make sure only the most
profitable promotions are run and figure out through the data if retailers actually implemented
the promotion or not. Figuring out how to spend as little as possible to get your product in the
best position on the shelf with the right displays and coupons can save a company billions.
Sentiment Analysis
Use of social media (Twitter, Facebook, etc.) - this is not just communications specific but a
good example applicable across industry. Collect/stream data from social media sites into CRM
INDUSTRY SOLUTIONS
Link in to the Oracle Industry Solutions database for conversation scripts and questionnaires
that can get your client thinking about their analytics and data warehouse solutions.
Check out the Retail Insights sales play and the rest of the Comprehensive Trade
Management solution at http://my.oracle.com/site/ibu/portal/IndustrySalesPlays/Industries-A-K/
ConsumerGoods/Solution1/index.html. This solution drove the sales of 11 Exa systems at P&G.
Download the Executive Conversation Script under the Retail Insights banner and it will walk
you through a whiteboard session about using data from partners (retailers and syndicated data
resellers like IRI/Nielsen) who are closer to the consumer.
Retail Insights covers this too, but if they are interested in making new product launches
more successful, pull down the Innovation Management (http://my.oracle.com/site/ibu/portal/
IndustrySalesPlays/Industries-A-K/ConsumerGoods/Solution3/index.html) Executive
Conversation Script for a great overview of how hard it is for the company to launch products in
the first place, and how only about 20% of new products meet their objectives. Any post-launch
assistance you can give a product should definitely be put to use.
If the external data sources are the biggest topic, then check out the High Perf Demand
Signal Repositories play (http://my.oracle.com/site/ibu/portal/ExaBusinessSolutions/SalesPlaysIndustries/ConsumerGoods/index.html). The Top 5 Objections, Competitive Traps or Questions
section of the Discover Guide is really useful convincing your client that they shouldnt go off on
their own and there are some great podcasts here to get you up to speed fast.
DATA SOURCES
There are two main classifications of data sources in Consumer Goods; internal and external.
The internal sources are available but its too expensive and difficult to align their hierarchies
with the other data sources in the company so it just hasnt been done to date. The other group
are external data sources whose hierarchies dont match up nicely to the product or segment
hierarchies that are used internally either. Big Data build hierarchies on the fly based on rules
that can be easily found using tools like Endeca, so this isnt as big of a hurdle.
Billing Data
Promotions
Retail Execution
Marketing Plans
Shipments
Chapter 7
Telecommunications Industry
BIG DATA USE CASES
Sentiment Analysis & Social Marketing
Combine social media feeds (from Twitter, Facebook, etc.) and customer demographic,
psychographic (values, attitudes, interests, or lifestyles), purchase, and network usage data to
determine importance or clout of customer and to get a better overall picture of each
customers behavior, likes, and dislikes. For example, analyzing Twitter feeds and Facebook
posts can reveal a better understanding of the service providers customer service performance
and if there are quality of service issues with in specific regions or customer groups.
This combined data can be used by marketing teams to better target campaigns and
collaborate with partners on joint campaigns (e.g. cinema companies to offer discount vouchers).
Customer care and operations teams can also leverage this information to determine the next best
action (treatment, remedy, etc.) associated with that customers social influence.
Service providers can also leverage sentiment analysis data to defend their brand image and
reputation by gaining deeper insight into overall social media impact and campaigns. They can
gauge social media sentiment on newly released products, offers, and campaigns in a costeffective manner and proactively create service requests to improve brand perception.
systems, and other sources. Ads, offers and promotions can then be tailored and delivered to the
customer when they access the website, via mobile/SMS, or when talking with a retail store rep
or call center agents.
Today, when a customer logs into a telecom website, the ads that are served up have little
correlation to a particular customers service usage, content purchases, social media activity, or
site browsing history. Capturing all of this information would allow the service provider to
feature ads and offers that reflect recently consumed services and applications more relevant to
their current interests and likelihood to spend.
With a context-sensitive, 360 view profile of the customer telecoms can recommend services
or products in real time to the customer in the context of each interaction and prior history. The
adaptive logic can be integrated across multiple channels including the web, mobile, call center,
retail associate, in-store kiosk, etc. to reflect a customer preference for how and where they want
to interact with the service provider. Ad response, service usage and location data can be
collected and analyzed in real-time using complex event processing and to determine target
segments, product profitability margins prior to offer conceptualization to improve marketing
and advertising spend.
individual customers based on network events like first time user for a certain service or
download of specific applications.
Customer Management: Which customers were impacted when this network fault occurred
and should service requests or direct communication take place to acknowledge the issue and
offer treatment
Customer Care: Prepare the Call Centers with appropriate knowledge of the issue, customers,
and services affected to better prepare agents and scale up staffing volumes as necessary
Revenue and Churn Forecasting: What is the potential revenue, profitability, and churn
impact from the outage? What is the cost or impact to revenue, brand, and other KPIs of
different actions.
INDUSTRY SOLUTIONS
Link in to the Oracle Industry Solutions portal for conversation scripts and questionnaires
that can get your client thinking about their analytics and data warehouse solutions.
Check out the Communications Industry Engineered Systems plays:
http://my.oracle.com/site/ibu/portal/ExaBusinessSolutions/SalesPlays-Industries/
Communications/index.html
Open up the Data Warehouse discovery guide and start asking your customer about their
fulfillment and SLA improvement processes. Or take them on a discovery session about world
class analytics and start figuring out how your customer can get better visibility to revenue
leakage and the causes of it or integrating cross channel commerce into a single view of the
customer.
If Cross Channel takes on a life of its own in your conversations, link into the IBU Cross
Channel Customer Experience Solution for Communications:
http://my.oracle.com/site/ibu/portal/IndustrySalesPlays/Industries-A-K/Communications/
Solution2/index.html
DATA SOURCES
These are examples of the type of data we commonly see used in the Telecommunications
industry. Perhaps there are others that are more important to your customer? Only a fraction of
this data typically ends up in a data warehouse and is available for analysis. What questions
could be answered if all of this data could be combined and analyzed in some way?
Network usage
CRM data
Location-based data
Billing data
GPS data
SMS data
System logs
Weblog data
Advertising data
Bandwidth usage
Financial system
Support logs
Communication faults
QR Code data
Portals
Device profiles
Sales data
Mobile payments
Chapter 8
Utilities Industry
It is a time of great change and transition for the utilities industryan evolving regulatory
environment, a strong push toward renewable energy sources and conservation, the advent of
smart meter and grid technologies, and the potential of competition drive uncertainty. The most
significant opportunity and risk, if not properly addressed, is presented by the coming torrent of
new data and events (Big Data) resulting from efforts to modernize utility networks and the
entire operational framework.
Big Data opportunities in utilities are also rapidly evolving. Change doesnt come easy for an
industry that operated for a hundred plus years with systems that have worked relatively well.
But the traditional systems that have served utilities well over the years were not built to handle
the frequency and volume of data emerging from smart meters, grid devices and other network
controls and sensors. As a result, utility businesses are cautiously structuring their current IT
infrastructure, systems and tools to accommodate emerging needs such as customer prepay,
demand response, self-service analytics, near-real-time operational control, distributed
generation, etc. Given the current technology landscape, utilities may be sacrificing the rapid and
dependable throughput of data that ensures efficient network performance, high reliability and
timely revenue flow.
If a network operations team implements this it can reduce network outages, limit exposure
during outages and generally improve reliability.
Demand Response
Many Energy Service Providers and Market Operators administer customer-side Demand
Response and Load Control programs to ensure grid stability and stable operation during times
of peak demand or system emergencies arising from generator outages or transmission and/or
distribution constraints. With some programs, the customer residential, commercial, or
industrial - reduces the required load upon instruction from the Energy Service Provider or
Market Operator. With other programs, the Energy Service Provider, Market Operator, or a
Curtailment Service Provider remotely reduces the load via device management.
Big Data solutions provide the technology foundation and framework enabling the analysis
of meter and event data consumption from a broad array of sources, both stored and streaming.
Utilities are able to perform continuous analytics against that data to look for anomalies, patterns
and trends that might indicate an opportunity make actionable decisions on both supply and
demand.
Marketing and Operations would be interested in this solution. It provides the ability to
integrate these Big Data analytics into other core operational systems to kick off an action based
on rules and policy; and provide robust, business-centric visualization through real-time
dashboards to customers and other key stakeholders.
Location-Based Services
This consists of any and all geo-spatial data; assets, maintenance crews, electrical network
equipment and other resources. Many organizations have geo-spatial data available from their
equipment, diagrams and vehicles.
For example this data can be used to deliver real-time analytics to pin-point maintenance
resources needs when a network is down, overloaded or reaching capacity. Analytics can also
identify patterns for when a network has the potential for reaching load constraints or when it has
extra capacity.
Integration into outage and distribution management applications allows for further
development of business capabilities such as distribution load management switching, where
protocols can be established to move customers to alternate feeders during times of over
capacity. A utilitys use of Bid Data fundamentally changes the way they can address network
capacity needs.
load profiles and capacity to more unstructured ones from city demographics, which can be used
to make smarter investment decisions.
For example, data on wealth distribution in office spaces,
commuter congestion and electric vehicle population history
combined with current load profiles and capacity can
combined to predict which buildings will have the highest
growth in electric vehicles over the next two decades. This
data can feed portfolio planning decision like deciding where
to invest in solar panels to help source cheaper and cleaner
local energy to charge those vehicles instead of transporting it
in from a remote fossil plant at high cost.
Network operations and finance groups can use this data to make the limited amount of
renewable investment be as beneficial as possible for the utility company in the long term. Most
of these decisions will otherwise be based only on current network load and capacity and not the
long term change.
Another example is wind farm investments. Traditional utility data, demographic information
and new sensor data can be combined to provide the optimal investment scenarios necessary to
meet growing renewable energy portfolio requirements. The demographic data like suburban
and urban growth and shrinkage can be also used to focus energy supply investments on long
term profitability instead of just short term views.
Asset management groups would be interested in this kind of analysis to reduce risks and
costs associated with new or replacement supply and infrastructure planning and delivery. Using
this long term plan can also maximize the long term return on investment by growing supply
resources just-in-time to meet demand instead of under or over profiling it.
INDUSTRY SOLUTIONS
Link in to the Oracle Industry Solutions database for conversation scripts and questionnaires
that can get your client thinking about their analytics and data warehouse solutions.
If youre in conversations with the network operations or customer care departments, go to
the Utilities Data Management Industry Solution portal (http://my.oracle.com/site/ibu/portal/
IndustrySalesPlays/Industries-L-Z/Utilities/Solution3/index.html).
In here youll find an
Executive Conversation Script and Discovery Questionnaire that focus on the structured and
unstructured data in the utilities space.
For the finance and asset management departments, go to the Asset Reliability &
Optimization Industry Solution (http://my.oracle.com/site/ibu/portal/IndustrySalesPlays/
Industries-L-Z/Utilities/Solution3/index.html).
The Executive Conversation Script and
Discovery Questionnaire here have information on how to talk to IT, finance and operations
about how to improve return on investment and overall revenue using Oracles asset lifecycle
management software, which are heavy analytical applications that can take advantage of big
data from meter data systems, real-time sensors and SCADA systems.
Oracle Utilities applications, technology and hardware products are engineered to work
together. Combined; these solutions process data exceptionally faster and more reliably than the
myriad of products used by traditional utility operators. Meter Data Management is a type of big
data solution in its own right.
Link into the Engineered Systems sales plays (http://
my.oracle.com/site/ibu/portal/ExaBusinessSolutions/SalesPlays-Industries/Utilities/index.html)
for more information. In order to get the customer interested pose this critical item of
information:
Did you know that running your Meter Data Management (MDM) system on Oracle
Engineered Systems delivers superfast performance and storage efficiency, drastically reducing
costs while meeting goals for your Smart Metering and Meter to Cash Process?
DATA SOURCES
Utility companies have been dealing with big data problems around smart meter
implementations using traditional approaches for years. Geo spatial and location based data has
also been available and in some cases integrated. They also run millions of sensor reads very
second through their operations systems. Utilities are used to big data problems, but very
focused ones that dont fully leverage or even store all of the data they process. Combine these
traditional utilities data sources with the ones below and you can workshop some great use cases
with your customer.
Network Usage
Portals
Asset Data
Surfing Behavior
Event Data
Order Management
Bandwidth Usage
CRM Data
Remote Control
Network Faults
Billing Data
User Profiles
Device Profiling
Social Media
Sensors
Mobile Payments
Chapter 9
Research Industry
BIG DATA USE CASES
Scientific Instruments Data Generation
VOLUME is one of the challenges of Big Data. It is about being able to ingest and manage
very large quantities of data and to cope with its exponential growth without limiting or
hindering the ability to access critical information.
Most of the Research data comes from various kinds of scientific instruments that can be
distributed or can be large, expensive centralized facilities operated at a global scale. In both
cases Research collaborations need to effectively deal with the data deluge generated by these
machines, quickly load and organize large volumes of raw data and translate it into knowledge
and information. This and other large data sets are used in both complex analytics and real time
analytics.
The volume of worldwide climate data is expanding rapidly, creating challenges for both
physical archiving and sharing, for ease of access of relevant information in a multidisciplinary
environment. Data comes from many different sources, such as satellites, temperature sensors,
ground sensors, ocean and marine sensors, weather stations, atmospheric balloons, and many
more.
Data captured from the above sources is used by researchers to monitor climate changes, to
generate weather forecasts and to support the decision-making process in case of natural
disasters. Research climate data also has a direct impact on businesses that uses climate and
weather data to make informed economic decisions, such as agriculture, real estate, law firms,
and private research institutions.
Complex Analytics
Data is an asset in Research as in any other field, if not more, and it has a high potential
VALUE if harnessed correctly. This value (another characteristic of Big Data) is in the ability to
translate raw data into information and knowledge.
Most of the researchers and their organizations are required to exploit large data sets by
storing, retrieving and using deep analytics against a wide variety of data types while
simultaneously optimizing workloads and system operations
Data Visualization
Big Data also means VARIETY. This means the ability for the eenterprise infrastructure to
quickly accommodate new data sources and to cope with a wide range of data types.
Enhancing the visualization of research information gives the ability to transform big data into
something easier to analyze, to enable new science with access to the latest investigative
methods and tools and to maximize analytic performance and achieve faster results.
INDUSTRY SOLUTIONS
Oracle has two Industry Solutions for the Research segment: Research Data Management
(http://my.oracle.com/site/ibu/portal/IndustrySalesPlays/Industries-A-K/EducationResearch/Solution2/index.html?
ssSourceNodeId=23425&ssSourceSiteId=ibu) and Research Analytics (http://my.oracle.com/site/ibu/portal/
IndustrySalesPlays/Industries-A-K/EducationResearch/Solution2/index.html?
ssSourceNodeId=23425&ssSourceSiteId=ibu). Both solutions were conceived to address the main
challenges Researchers and their organizations are facing, in line with the above use cases.
Research Data Management (RDM) focuses on the overall Research Data Lifecycle, while
Research Analytics (RA) addresses more specifically Big Data related issues. In particular:
Oracles Research Data Management solution empowers research institutions
to develop open, scalable, secure environments for knowledge development, discovery,
management, sharing and preservation.
Oracle Research Analytics helps Researchers to carry out collaborative and
high-performance analytics on large sets of structured or unstructured data in order to enable
innovative Research and reduce Time-to-Discovery
On the Research Enterprise portal, you will find for each solution (http://my.oracle.com/site/ibu/
portal/IndustrySalesPlays/Industries-A-K/EducationResearch/Solution2/index.html?
ssSourceNodeId=23425&ssSourceSiteId=ibu):
Document)Name
Audience
Goal/Descrip5on
Sales&CheatSheet
Oracle&Internal&
2&slides&on&how&to&best&posi6on&Oracle&with&
respect&of&the&Industry&challenges
Discovery&Ques6onnaire
Oracle&Internal&
Execu6ve&Conversa6on&Script
Oracle&internal&
Solu6on&Brief
Public/External&
Execu6ve&Presenta6on
Public/External&
A&document&with&all&high@yield&ques6ons&to&qualify&
the&pain&and&level&of&need&of&the&customer
A&document&to&help&prepare&for&an&introductory&
execu6ve&mee6ng,&including&&descrip6on&of&the&
target&buyers&role
2&slides&with&basic&informa6on&on&the&Industry&
challenges&and&Oracle&capabili6es
A&slide@deck&with&the&complete&story&on&the&Oracle&
value@proposi6on.&Success&Stories&are&also&
available.
DATA SOURCES
The new frontier in Research is the possibility to perform distributed, interdisciplinary,
collaborative Research that harnesses the power of data in a reliable, cost-effective way. Most of
the effort focuses on aggregation, standardization and linkage of research data from multiple
sources and in multiple formats, providing an analytic approach to reporting on this data and the
accomplishments of research initiatives. How to access & preserve over time raw data, metadata
and research results in different format and in a trusted way is also crucial.
Major data sources you will encounter talking to your customers are:
Environmental Sensors Data
Climate Data
Meteorology Data
Events data
Experiments data
Government Data
Chapter 10
Automotive Industry
Automotive OEMs are grappling to answer questions such as: What are our customers saying
about our brands? Where do they get most of the information about our products? What
advertising works? What promotions and incentives are effective?
Automotive OEMs are looking for insights they believe can be mined from a 360 degree
view of the customer. By leveraging Big Data solutions, automakers can boost marketing ROI
and lead-conversion rates, align product mixes with customer demand, and reduce warranty
costs. The industry is looking for ways to gain competitive advantage by leveraging data being
collected from the automobile, web browsing data, social media data, dealer interactions,
customer interactions with the call center as well as repair and warranty information. Insights
mined from analyzing, co-relating and mining these types of data can be categorized into broad
categories such as Customer Insights and Service and Early Warning & Vehicle Quality.
Customer Loyalty based Marketing driving effective campaigns. What marketing is most
effective?
Which incentives and promotions work? Does a one-size fit all incentive program work or
more segmented and targeted approach for incentives work?
How does this information change based on geography, region, and country? How can we
change our programs based on the local and regional effects?
How can we combine advertisement and incentive spend in a targeted to way to drive more
demand and sales?
How do we ensure dealer co-marketing programs work effectively?
Beneficiaries and users of this category of analytics include Brand Managers, Marketing,
Sales and Finance. These insights allow Marketing, Brand Management and Sales managers to
maximize the effectiveness of their marketing and incentive spend. Brand Managers focused on
customer loyalty can monetize these Customer Insights into improved customer loyalty and
higher repeat buying rates. Big Data solutions enable the conversion of customer insights into
actions that improve customer satisfaction and ultimately improved profitability.
How can we incorporate data from vehicle on-board diagnostics? With telematics and
connectivity, how can we leverage more detailed diagnostic trouble code data to get to
root causes?
How can we correlate problems on one vehicle line to others that might share common
components and parts?
How do we incorporate information from discussion forums, blogs, social media into
analysis of problems faced by consumers?
How do we incorporate voice of consumer into the overall quality improvement process?
How can we capture information from multiple sources to get a comprehensive view of
overall quality insight to improve customer and vehicle service?
Such Early Warning capabilities and Quality Analytics would be extremely interesting to
internal functions including Quality Engineering, Warranty, Product Development and
Manufacturing. Vehicle recalls are extremely expensive, and also damage the brand perception.
Therefore being able to provide early detection of quality issues could allow a rapid response,
thus avoiding potentially devastating costs and image problems.
INDUSTRY SOLUTIONS
Link in to the Oracle Industry Solutions database for conversation scripts and questionnaires
that can get your client thinking about their analytics solutions.
Check out the Automotive Insights sales play and the rest of the Automotive Sales,
Distribution & Aftermarket solution at http://my.oracle.com/site/ibu/portal/IndustrySalesPlays/
Industries-A-K/Automotive/Solution1/index.html
Download the Executive Conversation scripts for Integrated Sales & Marketing Industry
Solution as well as the After Sales Service Warranty Industry Solution.
DATA SOURCES
To gain better Consumer Insights, it is critical to link various data streams and consumer
touch points:
Link diverse data sets for different brands with specific need states
CRM interaction
Household purchases
Triggers
What is the call center and CRM data telling us (call center vs. social media)
What is the value of social media data and how can we leverage it to impact revenue?
Is the Social Media Data telling us anything we didnt already know from the data
coming in from the call center?
Can social media help us reach/target consumers that are currently not loyal to our
brand?
To ensure deeper consumer insights, automakers need to have a strategy that allows them to
bring together diverse set of data from varied data sources including:
Marketing systems
Social media sites such as Twitter, Facebook, Automotive blogs and forums,
Consumer websites, Review websites
Incentive systems
Web click-thru
Sales systems
To develop an effective Early Warning mechanism, it is critical to bring together diverse set
of data from:
Call center customer service, dealer techline, roadside assistance, customer survey
interactions
Chapter 11
Subcontractor Performance
How well will a contractor perform on a project? How well did he perform on the past
project? Do all subcontractors perform the same? Answering these questions will help you
determine what to expect from your subcontractor on your project. Unfortunately, there is little
to go on other in the E&C industry other than a companys own in house data that they have
collected over many years of performance data. This creates a risk when trying to expand into
new markets or provide additional services where they do not
have the history or data.As well a subcontractors performance
may be tied to the local talent and vary at other locations,
which introduces additional risk.
For example, imagine winning the contract to do work in
a part of the country where you dont know the
subcontractors, or you need to place your bid based upon
subcontractor performance. Your estimated price could be
too high and therefore you wont win the work. Or even
worse would be to win the work based upon pricing that is too low and your company is in the
position to lose money on the contract.
Operations executives would be interested in this type of big data usage. Often E&C
companies develop qualified subcontractors through surveys and in-house performance
databases. The personal performance success of estimators and project managers is tied directly
to the availability of data to help them do their jobs well. As a result, the performance of the
operational executives is also tied to the reliability of this information as well.
While within the walls of a construction company the data can be developed to qualify a
subcontractor, however, it does not indicate how they will perform (i.e., what is the cost per
linear foot of pipe installed). There are generally too many parameters associated with
installation that makes it impossible to create a simple table to specify the cost of installed
quantities, which in turn makes it difficult to predict the costs on a project or to compare the
performance of a subcontractor to that of another.
With a data source the rated the quality of the work performed as well as the price per unit a
qualified estimate and forecast could be developed.
Equipment Costs
How much equipment do you need to complete your project? That depends and the answer
is always changing. Should you buy, rent or lease equipment for your project? Optimization of
equipment procurement can have a significant impact on the profitability
of a project. Knowing where to obtain the equipment and what the
prevailing costs are expected to be can help in both the planning and the
execution of the project. Like the material and labor costs above, the
equipment costs impact the project in the same way, however, the market
for these resources has greater micro fluctuations. Two equipment dealers
will have different pricing models for either a purchase, a lease or rental options.
For example, imagine the ability of a contractor to be able to develop a forecast of equipment
needs (type, duration, location, etc.) and then be able to optimize the cost of that schedule
through locating the resources (pieces of equipment) and a mix of equipment with a variety of inhouse purchases, rentals and leased equipment.
An operations executive and finance would be interested in this big data solution to minimize
project impacts and avoid costly delays, but also to optimize the project costs and improve
margins.
INDUSTRY SOLUTIONS
The E&C Industry is very fragmented with many different types of companies providing
different services that need to be understood when engaging with these companies. For a better
understanding of how to position Big Data link in to the Oracle Industry Solutions database for
conversation scripts and questionnaires that can get your client thinking about their analytics and
data warehouse solutions.
Go to the Sales & Marketing content portal for industry overview documents as well as
specific materials to support our E&C Industry Solutions:
h t t p : / / m y. o r a c l e . c o m / s i t e / i b u / p o r t a l / I n d u s t r y S a l e s P l a y s / I n d u s t r i e s - A - K /
EngineeringConstruction/Solution1/index.html
Within the Industry Solution Executive Overview you will find several albeit short very
insightful paragraphs that with an understanding of that information will help in understanding
the complex nuances of this industry.
DATA SOURCES
Material and labor costs generally garner the greatest amount of interest and attention in
discussions of cost reduction because of the nature of the game and difficulties around obtaining
this information. Equipment costs, while a very large cost on a project, is discussed less because
companies have developed some solutions through some standardized methodologies. However
there is a tremendous upside to be able to further optimize equipment usages.
Engineering News Record
Material standards and provider institutes (American Concrete Institute, National Ready Mix
Concrete Association, World Steel Association, etc.)
Granger
Subcontractor performance review
Equipment sales, lease and rental companies (i.e., Caterpillar, Hertz, Sun Rental, etc.)
U.S. Bureau of Labor and Statistics
State and Local Government labor information
Payroll
Chapter 12
Seismic Processing
In the seismic processing workflow millions upon millions of earth measurements are
integrated together into a coherent model of the earths subsurface. In spite of the magnitude of
the data problem, Big Data techniques like the BDA and
Hadoop are of little use. There are specific steps in the
seismic processing workflow which would benefit from
Hadoop like the sorting of seismic traces but the BDA
would only be really useful for those steps. There was a paper
given at OOW 2011 which describes the performance of
Hadoop for this step. The number of customers who do
seismic processing is small - the seismic contractors like
WesternGeco and CGG and some large oil companies who still do processing in-house.
Downstream Retail
Oil & gas companies do have a public face with the public with their retail operations. These
may be gas stations or convenience stores or even websites. Many of the retail uses of Big Data
like Sentiment Analysis, Pricing Optimization, Customer Experience Management, etc. would
apply equally to these operations.
Safety/Environment
DATA SOURCES
Sensor data
Weather
Wave state
Earth Measurements
Lab Automation