PALO ALTO, Calif., June 28, 2016 – Altiscale, Inc., the leading provider of Big Data-as-a-Service, today announced support for Apache Spark 2.0, the upcoming major Spark release. Spark 2.0 is now available in beta on the Altiscale Data Cloud. The addition of Spark 2.0 to Altiscale’s Spark-as-a-Service offering provides improved performance, expanded SQL support, and streamlined programming APIs.
Apache Spark is a framework for fast, in-memory processing, with modules for handling SQL, streaming, machine learning, and graph analytics. As one of the most active open-source Big Data projects under development, Spark is an integral component of the Big Data ecosystem. Spark runs alongside other Big Data computing frameworks on the Altiscale Data Cloud platform and is used by a majority of Altiscale customers. In addition, users of the Altiscale Insight Cloud self-service analytics solution leverage Spark for interactive SQL and data transformation.
Spark 2.0 delivers substantial improvements over its predecessor, Spark 1.6. Through more intelligent code generation that emits optimized code at runtime and eliminates unnecessary overhead, Spark 2.0 targets a 10x performance gain. With the inclusion of a new SQL parser in Spark 2.0, Altiscale customers who use Spark SQL will benefit from Spark’s newly added SQL:2003 support, which reduces the need for users to modify existing applications that require SQL:2003 features. Spark 2.0 also introduces Structured Streaming to unify batch and streaming analytics. Structured Streaming simplifies the programming model for users, as they can now process static data and unbounded data streams using a single high-level API.
With Spark’s frequent releases—a new version approximately every three months—organizations can benefit from the rapid pace of community-driven innovation in Spark technology, but they are challenged to ensure the stability of existing applications built on prior versions. As the only provider of Spark-as-a-Service whose management extends to the application level, Altiscale concurrently supports multiple recent versions of Spark (2.0 in beta, 1.6, 1.5, 1.4) so that customers can migrate to newer versions of Spark at their own pace.
“Altiscale is committed to providing the best Big Data experience for our customers so that they can maximize their analytic productivity and the value they derive from their data,” said Raymie Stata, CEO and founder, Altiscale. “Our customers depend on us to manage their Big Data environments and keep them up to date with the latest technologies. Introducing Spark 2.0 as a component of the Altiscale Data Cloud will deliver significantly better performance and greater simplicity to our users, and is part of our ongoing effort to make the many advancements in Spark available to all of our customers.”
Altiscale customers can now have the Spark 2.0 preview release installed in their environments during the beta period. Altiscale plans to extend full support for Spark 2.0 shortly after the release is marked stable. Find out more about Altiscale’s Spark-as-a-Service offering at https://www.altiscale.com/big-data-as-a-service/altiscale-data-cloud/spark-as-a-service/.
Altiscale is a Silver Sponsor at Hadoop Summit from June 28th – June 30th in San Jose (Booth 1606). If you are interested in a private meeting at the event, please go to https://www.altiscale.com/contact-us/.
Altiscale, a provider of Big Data as a Service, helps businesses to maximize the value of their data quickly and easily, without the challenge and expense of managing complex technologies on their own. Altiscale offers “concierge cloud” services, with full Big Data operations services included for every customer. The combination of a secure, scalable big data platform, dedicated operations services, and a passion for results means that Altiscale customers experience performance that is up to 10x faster than alternatives. Altiscale customers include leading companies across financial services, media, marketing services, AdTech, and gaming.