You are on page 1of 4

IBM Analytics

Data Sheet

Middleware Integration

IBM Analytics for


Apache Spark
Deeper, richer, interactive analytics. Intelligent
applications. Frictionless and head-ache free, daily.
The Analytics operating system

Get up and running quickly and easily


and avoid massive upfront investment
or on-going administration

Apache for Spark was designed with three goals in mind: to be fast, easy
to use and flexible enough to run in many environments. An analytics
operating system, Spark exploits in-memory caching of data, avoiding
time-consuming round-trips to disk. The results are blazing fast
processing speed. Up to 100 times faster than other big data
technologies, with a simplified and unified programming language
and near real-time processing and Machine Learning capabilities,
Spark is a bright spot on the map of Big Data technologies.

Start small and elastically scale up


or down as your needs change

Apache for Spark

Work faster and smarter with built-in


notebooks and seamless integration with
other services and third party BI tools

Enjoy the benefits and extensibility of


100 percent open source Spark, backed
by IBMs commitment and decades of
enterprise experience

Highlights
IBM Analytics for Apache Spark
is a managed, enterprise-grade
Spark-as-a-service offering that
allows you to:

Apache for Spark is an open-source cluster computing framework with


in-memory processing to speed analytic applications up to 100 times
faster compared to other technologies on the market today. Expanding
on an approach pioneered by Apache Hadoop, Spark unlocks new
potential for data analytics in several ways:

A scalable framework for faster analysis of complex, large-scale data


A consistent programming model to run analytics against streams
and batch data
Unified access to data across the organization with support for
multiple programming languages and an ability to work with multiple
data sources
A simplified environment for innovative development with high-level
tools for machine learning and streaming data

IBM Analytics
Data Sheet

Middleware Integration

Introducing IBM Analytics for


Apache Spark

IBMs Managed Service


As one of only a few managed Spark services, IBMs Managed
Service offers more than other services by offering multiple
ways to access and analyze data, and an easier path to
integrate with other cloud data services and third-party tools
on the same platform so that more of your data can be hosted
in one place. IBMs Managed Service is built on the IBM
SoftLayer bare-metal cloud platform, which consistently
outperforms other virtualized cloud platforms and is backed
by decades of IBMs trusted enterprise experience, technology
leadership and with 24x7 support and commitment to open
source Spark.

IBM Analytics for Apache Spark enables interactive and


lightning-speed analytics via a managed, enterprise-grade
service. It provides all of the capabilities and rich features
found in an on-premises Spark deployment without the
cost and complexity of managing infrastructure or
complex components.

IBM Analytics for Apache Spark is:


Accessible
No long-term commitment or risk so you can begin
exploring right away
No financial hassle you can get started with a credit
card and pay by the hour
An easy to use managed service thats always on with no
IT, no assembly and no ongoing administration
Elasticity and scalability so you can pay-as-you go, start
small, and scale up or down as needed

Ideal use cases for IBM Analytics for


Apache Spark
Data Scientists, developers, data engineers and analysts in
companies of all sizes can use IBM Analytics for Apache Spark
to build smarter applications, advance their data analysis and
accelerate their work. Typical use cases could include:

Integrated
Machine Learning and other library capabilities (GraphX,
SQL and Streaming) in a single environment to enable
intelligent applications
Integrated Notebooks for a simple and powerful unified
interface to perform interactive analytics
Seamless integration with a variety of services and other
third party tools on a single platform

Powerful
100 percent open source Spark means you get all of the
benefits and extensibility of the rich, growing Apache
Spark community
Support to work in the language that works best for you,
including Python, Scala, Java or R
Speeds up to 100 times faster than existing technologies
In-memory processing to facilitate iterative, exploratory
and next generation analytic capabilities
Backed by IBMs massive commitment to Spark, and
decades of enterprise experience

Predictive and prescriptive analytics: Build and deploy rich


analytic models from one unified platform
Iterative algorithms or programs: Fast, repeated access of
the same set of large scale data
Real-time analytics: Implement near real-time stream event
processing
ETL: Build data pipelines to feed data platforms and
visualizations
Batch processing: Complete batch-oriented processing of
large amounts of data faster
Business intelligence: Interactively query large-scale data
Data mining and insight discovery: Iteratively run complex
analytics and experiment with diverse data sources

IBM Analytics
Data Sheet

Middleware Integration

Why IBM?
IBM is proven in analytics with 10,000 strategy and analytics
consultants and over 400 mathematicians, IBM has invested
more than USD26 billion in big data and analytics capabilities,
acquired more than 30 companies and built nine Analytics
Solution Centers.
IBM is proven in open source, with a long history of
participating in innovative open source projects. IBM is deeply
committed to open source with contributions to more than
120 projects, including more than USD1 billion in Linux
development, making IBM a significant force supporting open
source innovation and collaboration. The company participates
in more than 120 collaborative projects contributed to the
open source community, including Eclipse, Hadoop, Apache
Spark, Apache Derby and Apache Geronimo.

Learn more and get started today


To get started, register for a 30-day free trial on Bluemix:
https://console.ng.bluemix.net/catalog/apache-spark/

To learn more about the IBM Spark investment, visit:


ibm.com/analytics/us/en/technology/spark/

Copyright IBM Corporation 2015


IBM Corporation
Software Group
Route 100
Somers, NY 10589
Produced in the United States of America
October 2015
IBM, the IBM logo, ibm.com, Bluemix and SoftLayer are trademarks of
International Business Machines Corp., registered in many jurisdictions
worldwide.
Java and all Java-based trademarks and logos are trademarks or registered
trademarks of Oracle and/or its affiliates.
Linux is a registered trademark of Linus Torvalds in the United States,
other countries, or both.
Other product and service names might be trademarks of IBM or other
companies. A current list of IBM trademarks is available on the web at
Copyright and trademark information at
www.ibm.com/legal/copytrade.shtml.
This document is current as of the initial date of publication and may be
changed by IBM at any time. It is the users responsibility to evaluate and
verify the operation of any other products or programs with IBM products
and programs.
THE INFORMATION IN THIS DOCUMENT IS PROVIDED
AS IS WITHOUT ANY WARRANTY, EXPRESS OR IMPLIED,
INCLUDING WITHOUT ANY WARRANTIES OF MERCHANTABILITY, FITNESS FOR A PARTICULAR PURPOSE AND ANY
WARRANTY OR CONDITION OF NON INFRINGEMENT. IBM
products are warranted according to the terms and conditions of the
agreements under which they are provided.
Please Recycle

CDD12348USEN-00

You might also like