You are on page 1of 10

Types

Most of them are enterprise software. A few unusual types: Infographic : Visual.ly Small-data : StatWing Social Media Data : Datasift (twitter-certified) Public Data : enigma.io Regular Training programs: Continuum Analytics, HortonWorks Digital Marketing
Causata, a leading provider of Customer Experience Management Gravity.com : Enables websites to understand the interests of their audience

and deliver personalized recommendations to each user ; major clients like WSJ, TIME, Scribd. RocketFuel : programmatic buying platform : identifying the best location and time to place an ad . to make marketing easier and more effective for brands, and to give consumers a unique and relevant online experience. Major clients like Vonage, Pizza Hut, BMW, Lufthansa etc. SumAll : understand the effectiveness of online campaigns Metamarkets : Provides a data analysis platform for the real-time bidding (RTB) ad buying marketing.

Highly specialized programmers & data scientists : Alpine Data, Cloudera, Concurrent, MapR, SkyTree, WibiData, DataStax, RainStor, Requires good programmers: Continuum Analytics, Continuity, MemSQL,

Ease of Use

MortarData, Splice Machine, Mu-Sigma, GridGain Easy to use and non-experience users can interpret and act on the findings.
Origami Logic, which is a product built specifically for marketers that brings

together big data analytics, data science and data visualisation technologies to deliver marketing insights through a marketer-friendly user experience. Datameer : No cooding in MapReduce required; enabling any business user to integrate, analyze and visualize their data. Gravity.com, Enigma.io, Datasift Platfora Rocketfuel (No need for an extra employee) SiSense StatWing SumAll ZoomData Opera Solutions MongoDB MetaMarkets

Importance on visualization
Organizations do not have to ask right questions

anymore, but simply they will be pointed to the right answers or insights while playing with the data curiously. Make Big data applications so easy to develop and use that no data scientists are required. A format that is easy to digest, analyze and share

Focused on Visualization:
Tableau, StatWing
Platfora SiSense

SumAll
ZoomData Origami Logic Alteryx + Tableau Splice Machine + Tableau / MicroStrategy

Focused on sharability on Mobile Devices


Zoomdata ,
Tableau, SumAll

Visual.ly

Low Cost
Apache Hadoop Open-source Works on large clusters of commodity hardware Cheaper than buying tailor-made hardware Due to Open-source nature of the movement and Big

data being more routine, cost will probably reduce in the long-run Alpine Data , MemSQL, SiSense, WibiData, Opera Solutions, Guavus, RainStor

Low implementation time


Alpine Data
Mortar Data (bills itself as a company that can deliver

Hadoop in an hour.) SiSense SkyTree (low MTI Mean Time to Insights) WibiData ZoomData Opera Solutions Guavus

Real-time
Hadoop does not offer the possibility to do real-time analysis Other products offer this : Storm, which is now owned by Twitter, is a real-time distributed computation system. It works the same way as Hadoop provides batch processing as it uses a set of general primitives for performing real-time analyses. Storm is easy to use and it works with any programming language. It is very scalable and faulttolerant. Cloudera Impala (Open-source) offers the Cloudera Enterprise RTQ tools that offers real-time, interactive analytical queries of the data stored in HBase or HDFS. GridGain is an enterprise open source grid computing made for Java. It is compatible with Hadoop DFS and it offers a substitute to Hadoops MapReduce. SpaceCurve can discover underlying patterns in multidimensional geodata. Geodata is different data than normal data as mobile devices create new data really fast and not in a way traditional databases are used to. They offer a big data platform and their tool set a new world record on February 12, 2013 regarding running complex queries with tens of gigabytes per second. DataSift provides access to both real-time and historical social data to uncover insights and trends MemSQL, DataStax ZoomData

Underlying Technology

Hadoop+ MapReduce : AlpineData, HortonWorks, Mu-Sigma Hadoop + MapReduce + SQL : Cloudera Hadoop + Java : Concurrent, Continuity, GridGain Python, Numpy-based : Continuum Analytics Hadoop + NoSQL : MapR, 10gen(MongoDB), DataStax, RainStor Relational DB (ACID compliant) + SQL : MemSQL Hadoop as a service + Java, python : MortarData Hadoop+ Machine Learning algo + R/ C++/ Python: SkyTree Hadoop + NewSQL : Splice Machine Non- Hadoop : Datasift, Gravity.com, Enigma.io, Origami Logic

You might also like