“Hortonworks Data Platform 2.0 GA release is a huge milestone of progress for Hadoop. Through our use of the HDP 2.0 and through the efforts of both AT&T and Hortonworks, AT&T is developing some of the top Hadoop experts in the world, along with leading edge technology,” said Victor Nilson, AT&T’s senior vice president for data sciences.
With Hadoop 2, Apache Hadoop YARN serves as the Hadoop operating system, and takes Hadoop beyond simply a single-use data platform for batch processing to a multi-use platform that enables batch, interactive, online and stream processing. By acting as the primary resource manager and mediator of access to data stored in HDFS, YARN enables enterprises to store data in a single place and interact with it in multiple ways simultaneously and with consistent levels of service.
The Stinger Initiative was launched at the beginning of 2013 as a broad community-based effort to enhance the speed, scale and breadth of SQL semantics supported by Apache Hive. By including the recently released Hive 0.12 which is the culmination of phase 2 of the Stinger Initiative, HDP 2.0 represents a significant step forward for Hive, the de-facto standard for SQL access in Hadoop today and the only SQL interface designed for queries that scale from gigabytes to petabytes. Microsoft has been a critical partner in the development of HDP 2.0 and has contributed more than 6,000 engineering hours across various Apache projects, as well as porting HDP 2.0 to Windows, which will be available next month.
“The YARN based architecture of HDP 2.0 delivers on our mission to enable the modern data architecture by providing an enterprise Hadoop platform that deeply integrates with existing and future data center technologies,” said Shaun Connolly, vice president of corporate strategy, Hortonworks. “Hortonworks remains committed to delivering a tested, stable, and 100-percent open source Hadoop distribution of the most recent Apache project releases. Our approach ensures that HDP always includes the most proven community-driven innovations that are driving the enterprise deployments shaping the data architectures of tomorrow.”
HDP 2.0 is the first enterprise Hadoop platform to include the latest enterprise features delivered in Hadoop 2 and all the related Apache projects, many of which had significant GA community releases within the last few weeks. The key projects in HDP include:
· Apache Hadoop 2.2.0
· Apache Hive 0.12.0
· Apache HCatalog
· Apache Pig 0.12.0
· Apache HBase 0.96
· Apache Ambari 1.4.1
· Apache ZooKeeper 3.4.5
· Apache Oozie 4.0.0
· Apache Sqoop 1.4.4
· Apache Flume 1.4.0Apache Mahout 0.8.0