The Data Day: March 6, 2020

Assessing the runners and riders in the automated decision intelligence race. And more.

And that’s the Data Day, today.

The Data Day: December 21, 2017

StreamSets is on the Edge. Datameer pivots to visual data preparation and exploration. And more.

And that’s the data day, today.

The Data Day: April 7, 2017

The real story turns out to be DATA and ANALYTICS! Find the leakers.

And that’s the data day, today.

The Data Day: September 30, 2016

Data platforms and analytics doesn’t look like this anymore!

And that’s the data day, today.

The Data Day: June 10, 2016

What happened in data and analytics this week will make your jaw drop.

And that’s the data day, today.

The Data Day, A few days: February 14-20, 2015

Open Data Platform launched. And more.

And that’s the data day, today.

The Data Day, A few days: October 11 – October 17 2014

Insanely large Strata-Hadoop World edition

And that’s the data day, today.

The Data Day, A few days: October 26-November 1 2013

Cloudera launches Enterprise Data Hub. And more

And that’s the data day, today.

7 Hadoop questions. Q2: Hadoop infrastructure choices

What is your preferred infrastructure for Hadoop deployments? That’s one of the primary questions being asked in the 451 Research 2013 Hadoop survey. The answer will have significant implications for the future direction of Hadoop.


While one of the primary benefits of Hadoop – low cost data storage – means that for many organisations the primary infrastructure for Hadoop has been commodity hardware, many systems and storage vendors now offer their own dedicated appliances and/or reference architecture for Hadoop.

We expect to see more of these dedicated Hadoop configurations as the incumbent infrastructure vendors look to cash-in on Hadoop adoption and try to add greater value.

We also see some companies exploring the potential for Hadoop in the cloud, as well as hosted deployments, and on virtual infrastructure – although those are arguably in the early stages of technical maturity, and adoption.


Which infrastructure configurations are most popular? That’s one of the things our survey is designed to find out. The early results perhaps unsurprisingly indicate a greater preference for Hadoop being deployed on commodity hardware. However, cloud and virtual deployments have also scored well.

Interestingly, the early results show the preference for Hadoop on cloud infrastructure is significantly higher among respondents that are still in the development and test stage with Hadoop, which supports our anecdotal evidence about the use-cases for Hadoop in the cloud.

In order to get a little more detail on deployment preferences, the survey also asks about the level of consideration, testing and adoption for dedicated Hadoop hardware and Hadoop-as-a-service offerings respectively.

Among the choices in the dedicated hardware category are offerings from DataDirect Networks, Dell, HP, Oracle, IBM, Pivotal, Teradata, Cisco and NetApp.

The choices in the Hadoop-as-a-service category include Altiscale, Amazon EMR (including MapR), MapR on Google Compute Engine, Microsoft Windows Azure HDInsight Service, Mortar Data, Qubole, Rackspace Big Data, SunGard Unified Analytics Services and Treasure Data.

To give your view on this and other questions related to the adoption of Hadoop, please take our 451 Research 2013 Hadoop survey.

The Data Day, Two days: February 25/26 2013

EMC Pivotal HD. Hortonworks Hadoop for Windows. And more.

And that’s the data day, today.