You are on page 1of 11

Amazon Redshift

Fast, simple, cost-effective data warehousing.

Start Free Trial

Pricing Getting Started FAQs Partners Docs & Tools Free Trial

Amazon Redshift is a fast, fully managed data warehouse that


makes it simple and cost-effective to analyze all your data
using standard SQL and your existing Business Intelligence (BI)
tools. It allows you to run complex analytic queries against 3:22
petabytes of structured data, using sophisticated query
optimization, columnar storage on high-performance local What is Amazon Redshift?

Create PDF in your applications with the Pdfcrowd HTML to PDF API PDFCROWD
disks, and massively parallel query execution. Most results
come back in seconds. With Amazon Redshift, you can start
Want a Proof-of-Concept?
small for just $0.25 per hour with no commitments and scale
out to petabytes of data for $1,000 per terabyte per year, less Request support for your
than a tenth the cost of traditional solutions. proof-of-concept or
evaluation
Amazon Redshift also includes Redshift Spectrum, allowing
you to directly run SQL queries against exabytes of
unstructured data in Amazon S3. No loading or transformation
is required, and you can use open data formats, including Avro,
CSV, Grok, ORC, Parquet, RCFile, RegexSerDe, SequenceFile,
TextFile, and TSV. Redshift Spectrum automatically scales
query compute capacity based on the data being retrieved, so
queries against Amazon S3 run fast, regardless of data set
size. Learn more about Redshift Spectrum »

Learn and chat with experts, join our tech talk: Serverless
Analytics with Amazon Redshift Spectrum, AWS Glue, and
Amazon QuickSight Register Now>>

Request support for your proof-of-concept or


evaluation »

Create PDF in your applications with the Pdfcrowd HTML to PDF API PDFCROWD
Benefits

Fast Inexpensive Extensible

Amazon Redshift delivers fast query You only pay for what you use. You Redshift Spectrum enables you to
performance by using columnar can have unlimited number of users run queries against exabytes of data
storage technology to improve I/O doing unlimited analytics on all your in Amazon S3 as easily as you run
efficiency and by parallelizing data for just $1000 per terabyte per queries against petabytes of data
queries across multiple nodes. Data year, 1/10th the cost of traditional stored on local disks in Amazon
load speed scales linearly with data warehouse solutions. Most Redshift, using the same SQL syntax
cluster size, with integrations to customers see 3-4x reduction of data and BI tools you use today. You can
Amazon S3, Amazon DynamoDB, size after compression, reducing store highly structured, frequently
Amazon EMR, Amazon Kinesis, and their costs to $250-$333 per accessed data on Redshift local disks,
any SSH-enabled host. uncompressed terabyte per year. keep vast amounts of unstructured
data in an Amazon S3 “data lake”,
and query seamlessly across both.

Create PDF in your applications with the Pdfcrowd HTML to PDF API PDFCROWD
Simple Scalable Secure Compatible

Amazon Redshift allows You can easily resize your Security is built-in. You can Amazon Redshift supports
you to easily automate cluster up and down as encrypt data at rest and in standard SQL and provides
most of the common your performance and transit using hardware- custom JDBC and ODBC
administrative tasks to capacity needs change accelerated AES-256 and drivers that you can
manage, monitor, and with just a few clicks in the SSL, isolate your clusters download from the
scale your data warehouse. console or a simple API using Amazon VPC and console, allowing you to
By handling all these time- call. even manage your keys use a wide range of
consuming, labor-intensive using AWS Key familiar SQL clients. You
tasks, Amazon Redshift Management Service can also use standard
frees you up to focus on (KMS) and hardware PostgreSQL JDBC and
your data and business. security modules (HSMs). ODBC drivers.

Features

Optimized for Data Petabyte Scale Query your Amazon S3


Warehousing “data lake”
With a few clicks in console or a
Amazon Redshift uses a variety of simple API call, you can easily Redshift Spectrum enables you to
innovations to obtain very high change the number or type of nodes run queries against exabytes of
query performance on datasets in your data warehouse and scale up unstructured data in Amazon S3,
ranging in size from a hundred all the way to a petabyte or more of with no loading or ETL required.
gigabytes to an exabyte or more. For compressed user data. Dense When you issue a query, it goes to

Create PDF in your applications with the Pdfcrowd HTML to PDF API PDFCROWD
petabyte-scale local data, it uses Storage (DS) nodes allow you to the Amazon Redshift SQL endpoint,
columnar storage, data compression, create very large data warehouses which generates and optimizes a
and zone maps to reduce the using hard disk drives (HDDs) for a query plan. Amazon Redshift
amount of I/O needed to perform very low price point. Dense Compute determines what data is local and
queries. Amazon Redshift has a (DC) nodes allow you to create very what is in Amazon S3, generates a
massively parallel processing (MPP) high performance data warehouses plan to minimize the amount of
data warehouse architecture, using fast CPUs, large amounts of Amazon S3 data that needs to be
parallelizing and distributing SQL RAM and solid-state disks (SSDs). read, requests Amazon Redshift
operations to take advantage of all While resizing, Amazon Redshift Spectrum workers out of a shared
available resources. The underlying allows you to continue to query your resource pool to read and process
hardware is designed for high data warehouse in read-only mode data from Amazon S3, and pulls
performance data processing, using until the new cluster is fully results back into your Amazon
local attached storage to maximize provisioned and ready for use. Redshift cluster for any remaining
throughput between the CPUs and processing.
drives, and a 10GigE mesh network
to maximize throughput between
nodes. For exabyte-scale data in
Amazon S3, Amazon Redshift
generates an optimal query plan
that minimizes the amount of data
scanned and delegates the query
execution to a pool of Redshift
Spectrum instances that scales
automatically, so queries run quickly
regardless of data size.

No Up-Front Costs Fault Tolerant Automated Backups

Create PDF in your applications with the Pdfcrowd HTML to PDF API PDFCROWD
You pay only for the resources you Amazon Redshift has multiple Amazon Redshift automatically and
provision. You can choose On- features that enhance the reliability continuously backs up new data to
Demand pricing with no up-front of your data warehouse cluster. All Amazon S3. It stores your snapshots
costs or long-term commitments, or data written to a node in your cluster for a user-defined period from 1 up
obtain significantly discounted rates is automatically replicated to other to 35 days. You can take your own
with Reserved Instance pricing. On- nodes within the cluster and all data snapshots at any time, and they are
Demand pricing starts at just is continuously backed up to Amazon retained until you explicitly delete
$0.25/hour per 160GB DC1.Large S3. Amazon Redshift continuously them. Amazon Redshift can also
node or $0.85/hour per 2TB monitors the health of the cluster asynchronously replicate your
DS2.XLarge node. With Partial and automatically re-replicates data snapshots to S3 in another region
Upfront Reserved Instances, you can from failed drives and replaces for disaster recovery. Once you
lower your effective price to nodes as necessary. delete a cluster, your system
$0.10/hour per DC1.Large node snapshots are removed, but your
($5,500/TB/year) or $0.228/hour user snapshots are available until
per DS2.XLarge node you explicitly delete them.
($999/TB/year). Redshift Spectrum
queries are priced at $5/TB scanned
from S3. For more information, see
the Amazon Redshift Pricing page.

Fast Restores Encryption Network Isolation Audit and


Compliance
You can use any system or With just a couple of Amazon Redshift enables
user snapshot to restore parameter settings, you you to configure firewall Amazon Redshift
your cluster using the AWS can set up Amazon rules to control network integrates with AWS
Management Console or Redshift to use SSL to access to your data CloudTrail to enable you to
the Amazon Redshift APIs. secure data in transit and warehouse cluster. You can audit all Redshift API calls.
Your cluster is available as hardware-accelerated AES- run Amazon Redshift Amazon Redshift also logs

Create PDF in your applications with the Pdfcrowd HTML to PDF API PDFCROWD
soon as the system 256 encryption for data at inside Amazon VPC to all SQL operations,
metadata has been rest. If you choose to isolate your data including connection
restored and you can start enable encryption of data warehouse cluster in your attempts, queries and
running queries while user at rest, all data written to own virtual network and changes to your database.
data is spooled down in disk will be encrypted as connect it to your existing You can access these logs
the background. well as any backups. By IT infrastructure using using SQL queries against
default, Amazon Redshift industry-standard system tables or choose to
takes care of key encrypted IPsec VPN. have them downloaded to
management but you can a secure location on
choose to manage your Amazon S3. Amazon
keys using your own Redshift is compliant with
hardware security modules SOC1, SOC2, SOC3 and
(HSMs), AWS CloudHSM, or PCI DSS Level 1
AWS Key Management requirements. For more
Service. details, please visit AWS
Cloud Compliance.

Customer References

Create PDF in your applications with the Pdfcrowd HTML to PDF API PDFCROWD
 
For more Amazon Redshift customer stories across industries and company sizes, see the customer success page »

Featured Blog Posts


10 Best Practices for Amazon Redshift Spectrum
Amazon Redshift Spectrum allows you to extend the analytic power of
Amazon Redshift beyond data stored on local disks in your data warehouse
to query vast amounts of data in your Amazon S3 “data lake” - without
having to transform and load data. In this blog post, we have collected 10
important best practices for Amazon Redshift Spectrum by grouping them
into several different functional groups.

Read the entire post »

Amazon Redshift Spectrum Extends Data Warehousing Out to Exabytes—


No Loading Required
With Amazon Redshift Spectrum, Amazon Redshift customers can easily
query their data in Amazon S3. Redshift Spectrum operates directly on your

Create PDF in your applications with the Pdfcrowd HTML to PDF API PDFCROWD
source data in its native format (including Avro, CSV, Grok, ORC, Parquet,
RCFile, RegexSerDe, SequenceFile, TextFile, and TSV). This means that no
data loading or transformation is needed.

Read the entire post »

Amazon Redshift Spectrum – Exabyte-Scale In-Place Queries of S3 Data


In order to allow you to process your data as-is, where-is, while taking
advantage of the power and flexibility of Amazon Redshift, we are launching
Amazon Redshift Spectrum. You can use Spectrum to run complex queries on
data stored in Amazon Simple Storage Service (S3), with no need for loading
or other data prep.

Read the entire post »

Data Compression Improvements in Amazon Redshift Bring Compression


Ratios Up to 4x
Columnar compression is an important technology in Amazon Redshift. It
both helps reduce customer costs by increasing the effective storage capacity
of our nodes and improves performance by reducing I/O needed to process
SQL requests. Improving I/O efficiency is very important for data
warehousing. Last year, our I/O enhancements doubled query throughput.
Let’s talk about some of the new compression improvements we’ve recently
added to Amazon Redshift.

Read the entire post »

For a full list of blog posts related to Amazon Redshift, see the blog posts
page »

Create PDF in your applications with the Pdfcrowd HTML to PDF API PDFCROWD
Recent Announcements
Date Announcement

Nov 21 Amazon Redshift Introduces Result Caching for Sub-Second Response for Repeat Queries

Nov 20 Amazon Redshift Spectrum is Now Available in Four Additional AWS Regions, and
Enhances Query Performance in All Available AWS Regions

Nov 20 Amazon Redshift Uses Machine Learning to Accelerate Dashboards and Interactive
Analysis

For information about all the new features in Amazon Redshift, see the what's new page »

Start Using Amazon Redshift


Try Amazon Redshift for free. Get 750 free DC1.Large hours per month for 2 months. To start the free trial, create an
AWS account and sign in to the Amazon Redshift console. Then launch an Amazon Redshift cluster and select
DC1.Large for Node Type. To request support for your proof-of-concept or evaluation, click here.

Create PDF in your applications with the Pdfcrowd HTML to PDF API PDFCROWD
 

Get Started Now

GETTING STARTED WITH AMAZON KINESIS AWS FREE TIER

Learn how to launch your Gain free, hands-on experience


streaming application quickly with AWS for 12 months

Create a Free Account


  Twitter   Facebook   Google+   Twitch   AWS Blog
  What's New? RSS   Subscribe to Updates

AWS & Cloud Solutions Resources & Manage Your Amazon Web Services is Hiring.
Computing Websites & Website Training Account
Amazon Web Services (AWS) is a dynamic,
What is Cloud Hosting Developers Management
growing business unit within Amazon.com.
Computing? Java on AWS Console

Create PDF in your applications with the Pdfcrowd HTML to PDF API PDFCROWD

You might also like