HOPKINTON, Mass. - March 1, 2012 -
EMC Corporation (NYSE:EMC) today announced version 4.2 of EMC Greenplum Database, bringing to the industry-leading platform for in-database analytics new levels of Big Data integration, database manageability and performance. That means customers can run massive-scale mission-critical analysis even more easily and rapidly, thus further boosting their analytic productivity, business value and business decision-making prowess. Sitting at the heart of the EMC Greenplum family of products, Greenplum Database 4.2 includes a high-performance gNet for Hadoop; language and compatibility enhancements for faster migrations to Greenplum; simpler, scalable backup with EMC Data Domain Boost; an extension framework and turnkey in-database analytics; and targeted performance optimization.
In order to expand the range of solutions that can be created for data integration and processing and to run queries for mission-critical complex analysis, customers seek the most efficient and flexible data exchange between Greenplum Database and Hadoop, in addition to the existing parallel data access. To address this, Greenplum 4.2 now enables high-performance parallel import and export of all data (compressed and uncompressed) from Hadoop using gNet for Hadoop, a parallel communications transport. This achievement represents the industry's first direct query interoperability between Greenplum Database and Hadoop.
A key new Greenplum Database feature is advanced integration with EMC Data Domain deduplication storage systems via EMC Data Domain Boost, resulting in significantly faster, more efficient backup with an average data reduction of 10 to 30x. This integration distributes parts of the deduplication process to Greenplum Database servers, enabling them to send only unique data to the Data Domain system. That dramatically increases aggregate throughput, reduces the amount of data transferred over the network and eliminates the need to create and manage virtual drives. The result is fast, inline deduplication with up to 26.3 TB/hour of throughput, backing up more than 173 TB in less than eight hours.
Addressing database manageability and performance, Greenplum Database delivers an agile, extensible platform for in-database analytics, leveraging the system's massively parallel architecture. With Release 4.2, Greenplum enables turnkey in-database analytics via Greenplum Extensions, which can be downloaded from EMC Subscribenet and installed using the new Greenplum Package Manager—a new utility that ensures automatic installation and updates of functional extensions to simplify the task of enabling and managing advanced in-database functionality across a cluster. Release 4.2 also supports dynamic partition elimination and query memory optimization, thus drastically reducing the data scanned for a query, significantly accelerating query processing and allowing for more concurrency.
EMC Greenplum Database version 4.2 and the Greenplum Command Center are available now.
Scott Yara, Senior Vice President of Products, Greenplum, a division of EMC
"The EMC Greenplum Database continues to be at the core of driving Big Data insights and decisions for our customers. As more organizations create data-driven cultures, the Greenplum Database's shared-nothing, massively parallel processing (MPP) makes business intelligence and analytical processing much faster. It is this analytic productivity that is the real benefit of the database and is something we're proud to offer."
Dell EMC, a part of Dell Technologies, enables organizations to modernize, automate and transform their data center using industry-leading converged infrastructure, servers, storage and data protection technologies. This provides a trusted foundation for businesses to transform IT, through the creation of a hybrid cloud, and transform their business through the creation of cloud-native applications and big data solutions. Dell EMC services customers across 180 countries – including 98 percent of the Fortune 500 – with the industry’s most comprehensive and innovative portfolio from edge to core to cloud.