The First Purpose-Built Hadoop-as-a-Service

At Altiscale, we’ve taken our experiences at Yahoo, Google, and LinkedIn to rethink how Apache Hadoop should be offered. We’ve developed a purpose-built, petabyte-scale infrastructure that delivers Apache Hadoop as a cloud service. We then back it with operational support for Hadoop itself and the jobs you run. Altiscale’s optimized solution is faster, more reliable, easier to use, and more flexible than alternatives. Whether you’re new to Hadoop or just don’t want to invest more time and resources managing Hadoop yourself, get started with Altiscale today.

Get the whole package: Hadoop-optimized infrastructure and the highest level of expertise to help you realize better outcomes.

Hadoop Dialtone

The Altiscale Data Cloud is designed for Hadoop users: HDFS is the primary storage service, and Hadoop is persistent. There is no setting up and tearing down of clusters, and no dealing with the infrastructure under Hadoop. Enterprise-level SLAs are in place so that our customers’ businesses can rely on us.

Auto-elasticity

Hadoop workloads are bursty. Large jobs arrive unexpectedly. Lots of small jobs compete for resources. The dynamic nature of Hadoop workloads, and the need for large amounts of dedicated hardware to support them, suggest that a cloud service is the best economic approach for our customers. We provide access to a large pool of resources, dynamically adding and removing resources according to customer needs.
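The economics of bursty workloads can be illustrated with a little arithmetic. The sketch below is only an illustration: the hourly demand figures and the function name `provisioning_summary` are invented for the example and do not describe any real customer workload or Altiscale pricing.

```python
# Hypothetical demand, in nodes, for each hour of one day (invented numbers).
hourly_demand = [20, 20, 20, 20, 20, 20, 40, 40, 60, 60, 200, 200,
                 60, 60, 40, 40, 40, 40, 20, 20, 20, 20, 20, 20]


def provisioning_summary(demand):
    """Compare a cluster sized for peak demand with the capacity actually used.

    Returns (peak_nodes, fixed_node_hours, used_node_hours, utilization).
    """
    peak = max(demand)                      # a dedicated cluster must cover the largest burst
    fixed_node_hours = peak * len(demand)   # peak-sized hardware runs every hour
    used_node_hours = sum(demand)           # what the jobs actually consumed
    utilization = used_node_hours / fixed_node_hours
    return peak, fixed_node_hours, used_node_hours, utilization


if __name__ == "__main__":
    peak, fixed, used, util = provisioning_summary(hourly_demand)
    print(f"peak={peak} nodes, fixed={fixed} node-hours, "
          f"used={used} node-hours, utilization={util:.0%}")
```

The larger the gap between peak and typical demand, the lower the utilization of dedicated hardware sized for the peak, which is the case a shared, elastic pool is meant to address.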

Proactive Hadoop Helpdesk

Hadoop operational support is an important part of our offering. We not only operate the hardware, but also monitor jobs, help with job tuning, and ensure the software is kept up-to-date with the latest in the fast-changing Hadoop ecosystem. Our customers can think of our team of experts as their Hadoop operations team.

Priced Right

Pay as you go on fixed-fee monthly plans. We charge by usage, not by the node. Because you pay only for the Hadoop you use, your OPEX spend is predictable and you avoid major capital expenditures.

Keeping Current

We run the latest versions of Apache Hadoop. This means we offer YARN as a service, along with an ever-expanding set of ecosystem applications that run on top of the Altiscale Data Cloud.

The Altiscale Data Cloud

Our Architecture

The Altiscale Data Cloud is centered around HDFS and YARN, which are provided as a persistent and elastic cloud service. The various components of Hadoop are preconfigured, including the setup of queues for team usage, and services like HttpFS.
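As an illustration of what a service like HttpFS offers, the sketch below builds the kind of REST URL that HttpFS accepts. HttpFS exposes the WebHDFS REST API under the `/webhdfs/v1` prefix and listens on port 14000 by default. The host name `httpfs.example.com`, the user names, and the helper `httpfs_url` are assumptions made for this example, not Altiscale endpoints or tooling.

```python
from urllib.parse import quote, urlencode

# Hypothetical gateway address; 14000 is the default HttpFS port.
HTTPFS_BASE = "http://httpfs.example.com:14000"


def httpfs_url(path, op, user, **params):
    """Build a WebHDFS-style REST URL for an HDFS path served through HttpFS."""
    if not path.startswith("/"):
        raise ValueError("HDFS path must be absolute")
    # 'op' selects the operation (for example LISTSTATUS or OPEN);
    # 'user.name' identifies the caller under pseudo authentication.
    query = {"op": op, "user.name": user}
    query.update(params)
    return f"{HTTPFS_BASE}/webhdfs/v1{quote(path)}?{urlencode(query)}"


if __name__ == "__main__":
    # List a directory, then address a file for reading from offset 0.
    print(httpfs_url("/user/alice/logs", "LISTSTATUS", "alice"))
    print(httpfs_url("/user/alice/logs/part-00000", "OPEN", "alice", offset=0))
```

The resulting URLs can be fetched with any HTTP client. A secured deployment would typically require real authentication rather than the `user.name` parameter shown here.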

This platform is accessed through Linux-based Workbench machines that are connected to Hadoop and have applications like Hive, Pig, and Oozie already installed. For developers, a full toolset and a Maven repository are available.

Getting data in and out of the Altiscale Data Cloud is an important part of the product set. Altiscale provides excellent connectivity to our customers’ other systems, whether they’re in the customers’ own datacenters or at another cloud service.

Our Hadoop Platform

Apache Hadoop

The Altiscale Data Cloud is based on Apache Hadoop, so it is compatible with the vibrant ecosystem of tools developed on top of Hadoop. It’s easy to transfer your current Hadoop applications to Altiscale, and there is no vendor lock-in.

Core Support

The core products we support are:

  • Hadoop 2
  • Apache Hive
  • Apache Pig
  • Apache Oozie
  • Apache HCatalog
  • Apache Flume
  • JDK/JRE, Python
  • HttpFS
  • FUSE
  • LZOP, Snappy, gzip

Other Applications

Customers run many ecosystem products on our offering, including:

  • Apache Mahout
  • Cascading
  • Revolution R
  • Kafka/Camus
  • Avro
  • Pentaho Kettle
  • Matlab
  • Spark

Fully compatible

Altiscale Data Cloud works alongside many other tools.

Apache Hadoop excels at storing vast amounts of unstructured, source-form data and analyzing it in stream or batch mode. It then works with data warehouses and business intelligence tools for lower-latency queries.

Because the Altiscale Data Cloud is based on Apache Hadoop, it works with the set of data transfer, processing, and analysis tools built on top of YARN. One of Altiscale’s benefits is that it lets you keep up with the pace of advancements in these tools.

Testimonials

“With Altiscale, we’re getting 4 to 5 times the performance compared to other cloud solutions, with zero startup time and a predictable budget that doesn’t break the bank.”
Satya Ramachandran, VP, MarketShare

“Altiscale’s knowledgeable and responsive team allowed us to rapidly innovate and get to market faster with new products and services.”
Joseph Benjamin, CTO, Datalogix

“OpenTable encompasses the largest network of diners and reservation-taking restaurants in the world, and Altiscale enables us to seamlessly manage terabytes of data for our data modeling without dedicated operational support. Our entire Hadoop deployment was implemented within a few days and the Altiscale team’s best-practice advice and support has been invaluable.”
Joseph Essas, CTO, OpenTable

The Total Economic Impact™ of Altiscale’s Hadoop-as-a-Service: Forrester Commissioned Report