EMC WORLD, LAS VEGAS - May 22, 2012 -
At EMC World 2012, EMC (NYSE:EMC) today announced that the Greenplum Analytics Workbench - a 1,000-node cluster that will act as a lab environment for accelerating the pace of Big Data innovation - is now live. One of the primary uses of the Greenplum Analytics Workbench will be to act as an environment for running scale validation of the Apache Hadoop code base. Greenplum is actively working with the Apache Software Foundation to ensure that results from the Analytics Workbench are available to the open source community in an effort to leverage the resources of the Workbench to further accelerate the development of Hadoop as a revolutionary technology for Big Data. Technology from some of the world's leading software and hardware manufacturers is providing the infrastructure for the Greenplum Analytics Workbench.
Greenplum will use the Analytics Workbench to test the limits of scale-out infrastructure technology and also to explore the models for applying Big Data analytics. Whether that involves working with visionary academic institutions on data-intensive research studies, or collaborating with big data application developers, Greenplum has plans to provide the most innovative thinkers in the data space with access to the Analytics Workbench. Individuals or organizations interested in more information should visit http://www.greenplum.com/solutions/analytics-workbench.
The 1,000-node cluster will also be made available to members of Greenplum's training and certification classes for Hadoop. With the first publicly available courses launching this summer, Greenplum will offer a set of comprehensive Hadoop training programs designed to provide participants with the knowledge and programming skills required to leverage Hadoop. A unique aspect of Greenplum's Hadoop training program is that any individual who successfully completes the course will be granted access to the 1,000-node cluster to use as a sandbox environment.
The Greenplum Analytics Workbench is the result of several hardware and software companies coming together to collectively facilitate the development of Apache Hadoop as an important tool for Big Data analytics, including:
In addition to 1,000-plus hardware nodes (or 10,000 nodes with the addition of virtual machines), the test bed cluster consists of 24 petabytes of physical storage. This is the equivalent of nearly half of the entire written works of mankind, from the beginning of recorded history.
Chaitan Baru, Director, Center for Large-scale Data Systems research (CLDS), San Diego Supercomputer Center, UC San Diego
"The Workshop on Big Data Benchmarking organized by the Center for Large-scale Data Systems Research (CLDS), UC San Diego on May 8-9 in San Jose has generated a lot of enthusiasm in developing industry benchmark standards for big data applications. A big data benchmarking community is now self-organizing in order to make progress in this area. Access to the 1000-node cluster from Greenplum will be essential in assisting the community in making progress in this important area, which will have impact in business as well as scientific applications."
Amir Prescher, vice president of business development at Mellanox Technologies
"Mellanox is excited to be a part of the largest Hadoop test bed cluster ever built and to provide the critical elements that help enable this leading unstructured data analytics Hadoop solution. The new Analytics Workbench takes advantage of Mellanox's 10/40GbE and FDR 56Gb/s InfiniBand interconnect solutions, including its Unstructured Data Accelerator (UDA) software that extends Mellanox's interconnect capabilities of low latency, high-throughput, low CPU overhead and Remote Direct Memory Access (RDMA), to optimize big data application efficiency with up to 2X faster Hadoop job run-time."
Wally Liaw, Vice President of Sales, International, Super Micro Computer, Inc.
"Supermicro has contributed to the 1,000 data node infrastructure and integration resources behind EMC's Greenplum Analytics Workbench to accelerate innovation and the development of new applications within the Hadoop developer community. Our enterprise-class server platforms offer the highest performance, open standards and cost-effective architecture for massive-scale structured and unstructured data analytics. We are excited to offer growing support to the worldwide Hadoop community as Big Data sciences expand."
Scott Yara, Senior vice president of products and co-founder, Greenplum
'We're thrilled to announce that the Greenplum Analytics Workbench is now live. With more companies implementing Big Data analytics than ever before, Hadoop-based batch processing of data at massive scale, with continuous testing, is a key component to driving even better, faster data analytics."
More information on the Analytics Workbench can be found at http://www.greenplum.com/solutions/analytics-workbench.
Greenplum, a division of EMC, is driving the future of Big Data analytics with breakthrough products that harness the skills of data science teams to help global organizations realize the full promise of business agility and become data-driven, predictive enterprises. The division's products include Greenplum® Unified Analytics Platform, Greenplum® Data Computing Appliance, Greenplum® Database, Greenplum® Analytics Lab, Greenplum® HD and Greenplum® Chorus™. They embody the power of open systems, cloud computing, virtualization and social collaboration, enabling global organizations to gain greater insight and value from their data than ever before possible. Learn more at www.greenplum.com
, a part of , enables organizations to modernize, automate and transform their using industry-leading , servers, and data protection technologies. This provides a trusted foundation for businesses to transform IT, through the creation of a , and transform their business through the creation of cloud-native applications and solutions. Dell EMC services customers across 180 countries – including 98 percent of the Fortune 500 – with the industry’s most comprehensive and innovative portfolio from edge to core to cloud.