The Data Day, A few days: September 6-12 2014

Clustrix under new management. FoundationDB adds SQL. And more

And that’s the data day, today.

The Data Day, A few days: February 8-14 2014

Hortonworks and Red Hat expand Hadoop partnership. And more.

And that’s the data day, today.

The Data Day, A few days: January 25-31 2014

VoltDB targets in-memory analytics. Garantia Data becomes Redis Labs. And more.

And that’s the data day, today.

Visualizing the $1bn+ VC investment in Hadoop and NoSQL

Cumulative VC funding for Hadoop and NoSQL vendors broke through the $1bn barrier in 2013, according a Spotlight report published by 451 Research, based on data provided by The 451 M&A KnowledgeBase.

The data indicates that there was a substantial increase in funding in 2013 ($530.5m, not including RethinkDB’s $8m announced yesterday) compared to 2012 ($190.9m), thanks to major rounds for the likes of MongoDB, Pivotal, Hortonworks and DataStax.

The report includes a visualization created by 451’s Director of Data Strategy and Solutions, Barbara Peng, that illustrates the connections between the various investors and the NoSQL and Hadoop vendors in which they have invested.

A snapshot of the visualization is shown below but the the original is interactive, enabling 451 Research clients to drag the various elements around for greater emphasis, as well as isolate the NoSQL or Hadoop categories.

vc-firms

451 Research clients can also scroll over the blue circles to see the total amount of funding raised by the individual Hadoop and NoSQL vendors, and scroll over the smaller orange circles to see which investors have backed which companies.

The sample set was limited to 16 vendors for visual clarity, but the six Hadoop and 10 NoSQL providers cited account for more than 87% of funding to date (with Pivotal representing the vast majority of the remaining 13%).

This visualization illustrates that investment in Hadoop and NoSQL providers comes from a relatively small group of VC firms (52 to be specific, excluding individual seed investors), resulting in a relatively tightly clustered graph.

However, the visualization also enables us to put to the test the recent blog post by MarkLogic’s Adam Fowler in which he stated:

“Just look at the number of investors who are investing in multiple NoSQL companies. They’re hedging their bets because they’re not sure themselves which businesses will survive.”

In fact investment in multiple Hadoop and NoSQL vendors is relatively rare. Only 11 out of the 52 VC firms have invested in more than one Hadoop and/or NoSQL vendor, with seven of those picking one Hadoop vendor and one NoSQL provider. Less hedging their bets as picking a winner in each category.

Of the remaining four investment shops, two have invested in one Hadoop distributor, one NoSQL specialist and one Hadoop-as-a-service provider (MapR, DataStax and Qubole for Lightspeed Venture Partners; Cloudera, Couchbase and Altiscale for Accel Partners), while In-Q-Tel has invested in one Hadoop supplier, one NoSQL vendor and one NoSQL-as-a-service provider (Cloudera, MongoDB and Cloudant).

Only Sequoia Capital has invested in multiple NoSQL vendors (as well as Hadoop-as-a-service provider Altiscale) having invested in MongoDB, DataStax and – hold onto your hats, irony fans – MarkLogic. It should be noted however that Sequoia has not invested in DataStax since its series A round in late 2010.

The full report, Venture funding for Hadoop and NoSQL vendors tops $1bn is available now to 451 Research clients and also includes our perspective on when combined Hadoop and NoSQL revenue might begin to exceed combined Hadoop and NoSQL VC funding, as well as the potential for M&A and IPO activity in 2014.

The Data Day, A few days: October 5-11 2013

TransLattice acquires StormDB. Funding for Cirro and TempoDB. And more.

And that’s the data day, today.

7 Hadoop questions. Q2: Hadoop infrastructure choices

What is your preferred infrastructure for Hadoop deployments? That’s one of the primary questions being asked in the 451 Research 2013 Hadoop survey. The answer will have significant implications for the future direction of Hadoop.

hadoop-elephant

While one of the primary benefits of Hadoop – low cost data storage – means that for many organisations the primary infrastructure for Hadoop has been commodity hardware, many systems and storage vendors now offer their own dedicated appliances and/or reference architecture for Hadoop.

We expect to see more of these dedicated Hadoop configurations as the incumbent infrastructure vendors look to cash-in on Hadoop adoption and try to add greater value.

We also see some companies exploring the potential for Hadoop in the cloud, as well as hosted deployments, and on virtual infrastructure – although those are arguably in the early stages of technical maturity, and adoption.

survey

Which infrastructure configurations are most popular? That’s one of the things our survey is designed to find out. The early results perhaps unsurprisingly indicate a greater preference for Hadoop being deployed on commodity hardware. However, cloud and virtual deployments have also scored well.

Interestingly, the early results show the preference for Hadoop on cloud infrastructure is significantly higher among respondents that are still in the development and test stage with Hadoop, which supports our anecdotal evidence about the use-cases for Hadoop in the cloud.

In order to get a little more detail on deployment preferences, the survey also asks about the level of consideration, testing and adoption for dedicated Hadoop hardware and Hadoop-as-a-service offerings respectively.

Among the choices in the dedicated hardware category are offerings from DataDirect Networks, Dell, HP, Oracle, IBM, Pivotal, Teradata, Cisco and NetApp.

The choices in the Hadoop-as-a-service category include Altiscale, Amazon EMR (including MapR), MapR on Google Compute Engine, Microsoft Windows Azure HDInsight Service, Mortar Data, Qubole, Rackspace Big Data, SunGard Unified Analytics Services and Treasure Data.

To give your view on this and other questions related to the adoption of Hadoop, please take our 451 Research 2013 Hadoop survey.

The Data Day, A few days: June 11-25 2013

A bumper round-up of the past 14 days’ data-related news

* Cisco announced its intention to acquire Composite Software.

* Software AG acquired Apama.

* TIBCO Software acquired StreamBase Systems.

* Cloudera appointed Tom Reilly as Chief Executive Officer and Mike Olson as Chief Strategy Officer and Chairman of the Board.

* Sears Holdings named Jeff Balagna Chief Executive Officer of MetaScale

* Ex-Yahoo CTO launched Altiscale, hardcore Hadoop as a service.

* SpaceCurve raised a $10M Series B round of financing.

* Sqrrl announced general availability of Sqrrl Enterprise.

* GE launched Predictivity services, supported by supported by Proficy Historian HD.

* Datameer announced Datameer 3.0.

* Oracle announced the general availability of MySQL Cluster 7.3.

* MemSQL announced the upcoming availability of MemSQL 2.1.

* Continuuity announced the release of Weave, a new open source project that enables Java developers to rapidly build scalable, distributed applications on YARN.

* RainStor adds security, text search features to database complement for Hadoop.

* Composite Software introduced version 6.2 SP3 of its Composite Data Virtualization Platform

* TokuDB launched TokuMX.

* Terracotta announced the immediate availability of Terracotta Universal Messaging.

* HP united its data management assets under HAVEn brand.

* Hortonworks and Red Hat announced an engineering collaboration around Hadoop.

* Rackspace Hosting’s ObjectRocket Database as a Service entered into a strategic agreement with 10gen.

* Simon Phipps posted State Of The Sea Lion – June 2013.

* Netflix announced that its Genie Hadoop-aaS management software is now open source

* Storm-YARN released as open source.

* Big Data arrived at the Oxford English Dictionary

And that’s the data day, today.