EMC World 2011: Greenplum BI appliance gets open source boost

By Derek du Preez

11 May 2011

Comment: 1

An EMC logo

At EMC World 2011 yesterday EMC said that it plans to start using open source technology within its business intelligence offering.

Greenplum, a data analytics and warehousing company that EMC acquired in July 2010, is set to use Apache Hadoop open source software to provide a co-processing appliance that will, according to EMC, allow enterprises to seamlessly analyse "big data".

Further reading

Hadoop is an open source technology used mainly by big internet companies, such as Facebook and Yahoo. It provides a framework to support data-intensive distributed applications and is used for analysing and storing massive amounts of data.

"It is estimated that enterprise data growth will increase by 650 per cent over the next five years," said Scott Yara, co-founder of Greenplum and vice president of products, EMC Data Computing Division.

"The key aspect is that a huge amount of that data growth is going to be around unstructured data – approximately 80 per cent of it," he added.

"What we are seeing in the unstructured data world, is that the Hadoop platform is becoming an important solution to solving these unstructured data processing problems."

EMC Greenplum HD will be available in Community and Enterprise editions. The  Community edition will be 100 per cent open source certified, supporting HDFS, MapReduce, Zookeeper, Hive and HBase platforms.

The Enterprise version, meanwhile, will be 100 per cent interface-compatible with the Apache Hadoop stack.

This compatibility is set to allow large organisations to carry out data management as well as load and access data using a native network file system interface, the firm said.



The Greenplum Community, Enterprise edition and the EMC Greenplum HD Data Computing Appliance are expected to be available in the third quarter of 2011. Pricing is yet to be confirmed.

Reader comments

Using business intelligence to harness Big Data

EMC announced today that it is planning to start using open source technology within its business intelligence offering in order to allow large organisations to carry out data management. At Jaspersoft, our experience is that any organisation storing more than a few terabytes of data can benefit from Big Data technologies. With traditional database, data warehousing, and business intelligence technologies – even when applying current best practices - exceeding 10TB is difficult. The predicted data growth rates mean organisations with only two or three TB of data will soon exceed the limits of traditional technologies.

Big data technologies such as Map-reduce, Hadoop and other No SQL approaches can help any organisation seeking to dramatically reduce query times, or to complete complex analysis tasks within smaller time windows. Regardless of which Big Data technologies are employed for an analytic task, business users still need to view and understand results. Given the potentially huge output sets, business users need ad-hoc reporting and analysis tools designed to find needles in haystacks – including powerful data visualizations, summary-to-fine-grain drill-down, and dynamic filtering and sorting. These reporting, analysis and user interface capabilities must also be data format-agnostic, working seamlessly traditional RDBMSs, MPP databases, or files spread across the network.

Tom Cahill, VP EMEA Jaspersoft

Posted by: Tom Cahill  13 May 2011

Have your say on this article

All fields required. Your email address will not be displayed on the site.

By submitting a comment you agree to abide by our Terms & Conditions

  • Digg
  • Tweet

Newsletters

Sign up for our FREE newsletters

Technology Patent Wars

Large companies such as Microsoft, Facebook and Google have been hoovering up technology patents recently. Is this stifling innovation?

87 %

5 %

8 %