1 Introduction
2 Big data, big impact: Dealing with the three Vs
3 Best practices: Putting data lifecycle management into action
4 The power of enterprise-scale data lifecycle management
5 Enhance data warehouse agility with IBM InfoSphere
6 Why InfoSphere?
Introduction
Organizations are eager to harness the
power of big data. But as new big data
opportunities emerge, ensuring that
information is trusted and protected
becomes exponentially more difficult.
If these challenges are not addressed
directly, end users may lose confidence
in the insights generated from their data,
which can leave them unable to act on
new opportunities or address threats.
Margins
Exponential data growth can also drive up
infrastructure and operational costs, often
consuming most of an organization's data
warehousing or big data budget. Rising
data volumes require more capacity,
and organizations often must buy more
hardware and spend more money to
maintain, monitor and administer their
expanding infrastructure. Large data
warehouses and big data environments
generally require bigger servers, appliances
and testing environments, which can also
increase software licensing costs for the
database and database tooling, not to
mention labor, power and legal costs.
Risks
Following the "let's keep it in case someone
needs it later" mandate, many organizations
already keep too much historical data.
According to the CGOC 2012 Summit
Survey, 69 percent of data has no value.
Opening the doors to excessive storage
and retention only exacerbates the situation.
75 percent of IT risks impact customer satisfaction and brand reputation.
Maintaining compliance with data retention regulations, protecting privacy and archiving
data are not just legal matters; they are essential for sustaining customer satisfaction
and brand reputation. In recent IBM surveys, respondents indicate that data theft/
cybercrime is the number-one threat to a company's reputation, a greater threat than
system failures. Sixty-four percent of respondents say their company will be focusing
more on managing and protecting their reputation than they did five years ago.1
Source: Insights from the 2012 Global Reputational Risk and IT Study.
Figure: The data lifecycle. Stages shown: Create, Use, Share, Update, Archive, Store/retain, Dispose. Management capabilities shown: Test data management, Data masking, Archiving.
The entire data lifecycle (shown as the grey circle) benefits from
good governance, but management capabilities that focus on the
use, share and archive steps have wide-ranging benefits for cost
reduction and efficiency gains.
Archiving
Retention policies are designed to keep
important data elements for reference and
for future use while deleting data that is no
longer necessary to support the legal needs
of an organization. Effective data lifecycle
management includes the intelligence not
only to archive data in its full context, which
may include information across dozens of
databases, but also to archive it based on
specific parameters or business rules, such
as the age of the data. It can also help
storage administrators develop a tiered and
automated storage strategy to archive
dormant data in a data warehouse, thereby
improving overall warehouse performance.
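
As a rough illustration of archiving "based on specific parameters or business rules, such as the age of the data," the following Python sketch splits records into an active set and an archive set using a single age threshold. It is a minimal sketch, not a description of how any IBM product works: the Order type, the split_by_age function, the 365-day threshold and the reference date are all hypothetical, and the sample rows are borrowed from the orders table used elsewhere in this document.

```python
from dataclasses import dataclass
from datetime import date, timedelta


@dataclass(frozen=True)
class Order:
    """Hypothetical record type used only for this illustration."""
    cust_id: str
    item: str
    order_date: date


def split_by_age(orders, today, max_age_days):
    """Apply one age-based rule: rows older than the cutoff are archived."""
    cutoff = today - timedelta(days=max_age_days)
    active = [o for o in orders if o.order_date >= cutoff]
    to_archive = [o for o in orders if o.order_date < cutoff]
    return active, to_archive


orders = [
    Order("27645", "80-2382", date(2004, 6, 20)),
    Order("27645", "86-4538", date(2005, 10, 10)),
]

# Assumed reference date and retention window, chosen only for the example.
active, to_archive = split_by_age(orders, today=date(2006, 1, 1), max_age_days=365)
```

A real retention policy would typically combine several such rules (legal hold, regulatory record keeping, business utility) and would archive related rows from other tables together so the data keeps its full context.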
Figure: Enterprise information
1%   Subject to legal hold
25%  Has business utility
5%   Regulatory record keeping
31%  Total of the three categories above
69%  Everything else
Original data

Customers table
Cust ID  Name           Street
08054    Alice Bennett  2 Park Blvd
19101    Carl Davis     258 Main
27645    Elliot Flynn   96 Avenue

Orders table
Cust ID  Item #   Order date
27645    80-2382  20 June 2004
27645    86-4538  10 October 2005

De-identified data

Customers table
Cust ID  Name            Street
10000    Auguste Renoir  23 Mars
10001    Claude Monet    24 Venus
10002    Pablo Picasso   25 Saturn

Orders table
Cust ID  Item #   Order date
10002    80-2382  20 June 2004
10002    86-4538  10 October 2005
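
The tables above show the key property of this kind of de-identification: the customer ID is replaced consistently in both tables, so the orders still join to the right (masked) customer. The Python sketch below reproduces that behavior on the sample data. It is an assumption-laden illustration rather than the masking method of any particular product: the de_identify function, the sequential ID scheme starting at 10000 and the fixed lists of substitute names and streets are hypothetical and are taken directly from the example values shown above.

```python
# Sample rows from the "Original data" tables.
CUSTOMERS = [
    {"cust_id": "08054", "name": "Alice Bennett", "street": "2 Park Blvd"},
    {"cust_id": "19101", "name": "Carl Davis", "street": "258 Main"},
    {"cust_id": "27645", "name": "Elliot Flynn", "street": "96 Avenue"},
]
ORDERS = [
    {"cust_id": "27645", "item": "80-2382", "order_date": "20 June 2004"},
    {"cust_id": "27645", "item": "86-4538", "order_date": "10 October 2005"},
]

# Hypothetical substitute values, copied from the "De-identified data" tables.
FAKE_NAMES = ["Auguste Renoir", "Claude Monet", "Pablo Picasso"]
FAKE_STREETS = ["23 Mars", "24 Venus", "25 Saturn"]


def de_identify(customers, orders, first_id=10000):
    """Mask customer rows and rewrite order keys with the same ID mapping."""
    id_map = {}
    masked_customers = []
    for offset, cust in enumerate(customers):
        new_id = str(first_id + offset)
        id_map[cust["cust_id"]] = new_id  # remember old -> new key
        masked_customers.append(
            {
                "cust_id": new_id,
                "name": FAKE_NAMES[offset],
                "street": FAKE_STREETS[offset],
            }
        )
    # Non-identifying columns (item, order date) are left unchanged.
    masked_orders = [{**o, "cust_id": id_map[o["cust_id"]]} for o in orders]
    return masked_customers, masked_orders


masked_customers, masked_orders = de_identify(CUSTOMERS, ORDERS)
```

Because every table is rewritten through the same id_map, referential integrity survives masking, which is what makes the de-identified data usable for testing.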
Complex IT landscapes make setting up test labs extremely costly
As volume, variety and velocity increase the
complexity of data infrastructures, scaling test
environments becomes a significant problem. It
isn't unusual for Fortune 500 companies to
spend up to USD30 million building a single test
lab, and many of these organizations have
dozens of labs. Add in rising wages, and testing
costs begin to spiral out of control.
Figure: Heterogeneous environments. Components shown: private cloud, public cloud, EJB, third-party services, business partners, messaging services, collaboration, Web/Internet, content providers, routing services, shared services, archives, portals, data warehouse, directory identity, mainframe, enterprise service bus, file systems.
Cost savings of approximately USD500,000 per year
44 percent fewer untested scenarios
41 percent less labor required over 12 months
Why InfoSphere?
As the foundation of the IBM big data platform,
InfoSphere provides market-leading
functionality across all the capabilities of
information integration and governance.
It is designed to handle the challenges of
big data by providing optimal scale and
performance for massive data volumes,
agile and rightsized integration and
governance for the increasing velocity of
data, and support for a wide variety of data
types and big data systems. InfoSphere
helps make big data and analytics projects
successful by delivering the confidence to
act on insight.
Additional resources
Ready to get started? Take a self-service
InfoSphere Optim Business Value
Assessment and show the ROI results
to your big data project owner.
To learn more about the IBM approach to information integration and governance
for big data, please contact your IBM representative or IBM Business Partner,
or visit: ibm.com/software/data/information-integration-governance
Yuhanna, Noel. Your Enterprise Data Archiving Strategy. Forrester. February 2011. ftp://ftp.boulder.ibm.com/software/data/sw-library/data-management/optim/papers/your-enterprise-data-archiving-strategy.pdf
IMM14126-USEN-00