February 26th, 2013 — Data management
EMC Pivotal HD. Hortonworks Hadoop for Windows. And more.
And that’s the data day, today.
January 30th, 2013 — Data management
Actian acquires Pervasive. ObjectRocket has liftoff. And more.
And that’s the data day. today.
December 5th, 2012 — Data management
EMC/VMware make Pivotal move. Funding for ClearStory. And more
And that’s the Data Day, today.
November 29th, 2012 — Data management
Amazon and BitYota launch DWaaSes (DWaaSi?) Continuuity’s funding and plans. And more.
And that’s the Data Day, today.
September 24th, 2012 — Data management
Alpine Data bags EMC. Infobright delivers appliance. And more.
And that’s the Data Day, today.
September 7th, 2012 — Data management
StackIQ. Alpine Data. OpenDremel/Apache Drill. Memcachier. And more.
And that’s the Data Day, today.
March 8th, 2012 — Data management
We recently speculated that EMC Greenplum’s focus on the integration of its Greenplum HD Hadoop distribution with its Data Computing Appliance (DCA) and Isilon storage technology would mean an increasingly niche role for Greenplum MR- the Hadoop distribution based on MapR’s M5.
Two recent announcements indicate that niche might continue to be a lucrative one for MapR, however. First, Cisco released details of a reference architecture for deploying Greenplum MR on Cisco’s UCS servers. Then Informatica announced a partnership with MapR to jointly support its Data Integration Platform running on MapR’s distribution for Hadoop.
The Informatica relationship also covers bi-directional data integration with Informatica PowerCenter and Informatica PowerExchange, snapshot replication using Informatica FastClone, and data streaming into MapR’s distribution via NFS using Informatica Ultra Messaging. In addition, In addition, the free Informatica HParser Community Edition will be available for download as part of the MapR distribution.
While the partnership with Informatica is a direct one for MapR, the Cisco reference architecture announcement illustrates that the benefit MapR gains from its relationship with EMC Greenplum includes exploiting the company’s leverage with potential partners.
March 2nd, 2012 — Data management
February 3rd, 2012 — Data management
New CEO at Revolution. Pentaho goes big data. EMC Hadoop gets Isilon. And more.
An occasional series of data-related news, views and links posts on Too Much Information. You can also follow the series @thedataday.
* Revolution Analytics Names David Rich New CEO
* Pentaho Open Sources Big Data Capabilities to Further Fuel Widespread Adoption
* EMC Isilon is Industry’s First Scale-Out NAS System with Native Hadoop Support
* Actuate Reports Fourth Quarter and Fiscal Year 2011 Financial Results
* Sumo Logic Raises $15M Series B Round for Next Generation Log Management and Analytics
* Announcing Oracle R Enterprise 1.0
* Paul Cormier Joins Hortonworks’ Board of Directors
* DataStax Launches First Complete Solution for Cassandra Development on Windows and Mac
* Latest Release of Kalido Information Engine Eliminates Data Mart Migration and Consolidation Hassles
* Karmasphere Brings More Power, Collaboration, and Faster Insights to Big Data Analytics Teams on Hadoop
* Why Big Data Won’t Make You Smart, Rich, Or Pretty
* SAP HANA – slowly moving out of hype into actual projects
* For 451 Research clients
# Actuate gets ready to go shopping in the ‘big data’ mall Acquirer IQ
# Couchbase cites enterprise adoption, clarifies distributed NoSQL database strategy Impact report
# SpagoBI illuminates 2012 roadmap, takes open source model to US, Latin America Impact report
# Customer data analysis provider nPario combines big data and smart segmentation Impact report
# Tableau details 2012 growth strategy, gets semantic for visual analytics Market development report
# EMC integrates re-branded Hadoop distribution with Isilon NAS Market development report
# Quiterian seeks funding for new customer analytics in the cloud focus Market development report
# Hortonworks refines its commercial strategy for Apache Hadoop Market development report
# Digital Reasoning pledges to automate the analysis of complex data Market development report
And that’s the Data Day, today.
January 31st, 2012 — Data management
As expected, EMC has announced that it is integrating its Greenplum HD distribution of Apache Hadoop with its Isilon scale-out NAS technology. The move coincides with a re-branding of the company’s Hadoop distributions that, while slight, could prove significant.
Specifically, EMC has enabled the Hadoop Distributed File System (HDFS) as a native protocol supported on OneFS in addition to Network File System (NFS) and Common Internet File System (CIFS) support, enabling Isilon systems to provide the underlying storage layer for Hadoop processing, as well as a common storage pool for Hadoop and other systems.
EMC is talking up the benefits of combining Isilon with Greenplum HD. For the record, that’s the Hadoop distribution previously known as Greenplum HD Community Edition, based on the Apache Hadoop 0.20.1 code branch.
Greenplum HD Enterprise Edition, based on MapR Technologies’ M5 distribution, is now known as Greenplum MR, and is not supported by Isilon due to the fact that it replaces HDFS with Direct Access NFS.
EMC notes that Greenplum MR is being positioned as a high-performance Hadoop offering for customers that have failed to achieve their required performance from other distributions.
While EMC is quick to maintain its happiness with the MapR relationship and its commitment to Greenplum MR, it’s clear that tight integration with Isilon, particularly in the EMC Greenplum DCA, will result in an expanded role for Greenplum HD.
Additionally, while the company’s Greenplum Command Center provides unified management for the Greenplum Database, Greenplum HD and Greenplum Chorus as part of the recently announced Unified Analytics Platform (UAP), MapR has its own management and monitoring functionality.
Since we expect EMC to pitch the benefits of integrated software in UAP and software and hardware in DCA, it is now clear that Greenplum HD, rather than the Greenplum MR, is considered the company’s primary Hadoop distribution.
Given Greenplum HD’s starring role in the Unified Analytics Platform (UAP), Data Computing Appliance (DCA) and integration with Isilon, Greenplum MR’s role is likely to become increasingly niche.