Visualizing the $1bn+ VC investment in Hadoop and NoSQL

Cumulative VC funding for Hadoop and NoSQL vendors broke through the $1bn barrier in 2013, according a Spotlight report published by 451 Research, based on data provided by The 451 M&A KnowledgeBase.

The data indicates that there was a substantial increase in funding in 2013 ($530.5m, not including RethinkDB’s $8m announced yesterday) compared to 2012 ($190.9m), thanks to major rounds for the likes of MongoDB, Pivotal, Hortonworks and DataStax.

The report includes a visualization created by 451’s Director of Data Strategy and Solutions, Barbara Peng, that illustrates the connections between the various investors and the NoSQL and Hadoop vendors in which they have invested.

A snapshot of the visualization is shown below but the the original is interactive, enabling 451 Research clients to drag the various elements around for greater emphasis, as well as isolate the NoSQL or Hadoop categories.

vc-firms

451 Research clients can also scroll over the blue circles to see the total amount of funding raised by the individual Hadoop and NoSQL vendors, and scroll over the smaller orange circles to see which investors have backed which companies.

The sample set was limited to 16 vendors for visual clarity, but the six Hadoop and 10 NoSQL providers cited account for more than 87% of funding to date (with Pivotal representing the vast majority of the remaining 13%).

This visualization illustrates that investment in Hadoop and NoSQL providers comes from a relatively small group of VC firms (52 to be specific, excluding individual seed investors), resulting in a relatively tightly clustered graph.

However, the visualization also enables us to put to the test the recent blog post by MarkLogic’s Adam Fowler in which he stated:

“Just look at the number of investors who are investing in multiple NoSQL companies. They’re hedging their bets because they’re not sure themselves which businesses will survive.”

In fact investment in multiple Hadoop and NoSQL vendors is relatively rare. Only 11 out of the 52 VC firms have invested in more than one Hadoop and/or NoSQL vendor, with seven of those picking one Hadoop vendor and one NoSQL provider. Less hedging their bets as picking a winner in each category.

Of the remaining four investment shops, two have invested in one Hadoop distributor, one NoSQL specialist and one Hadoop-as-a-service provider (MapR, DataStax and Qubole for Lightspeed Venture Partners; Cloudera, Couchbase and Altiscale for Accel Partners), while In-Q-Tel has invested in one Hadoop supplier, one NoSQL vendor and one NoSQL-as-a-service provider (Cloudera, MongoDB and Cloudant).

Only Sequoia Capital has invested in multiple NoSQL vendors (as well as Hadoop-as-a-service provider Altiscale) having invested in MongoDB, DataStax and – hold onto your hats, irony fans – MarkLogic. It should be noted however that Sequoia has not invested in DataStax since its series A round in late 2010.

The full report, Venture funding for Hadoop and NoSQL vendors tops $1bn is available now to 451 Research clients and also includes our perspective on when combined Hadoop and NoSQL revenue might begin to exceed combined Hadoop and NoSQL VC funding, as well as the potential for M&A and IPO activity in 2014.

The Data Day, A few days: December 6-12 2013

Talend raises $40m. GridGain names new CEO. And more

And that’s the data day, today.

The Data Day, A few days: December 2-5 2013

Big data reconsidered, No NoSQL. And more

And that’s the data day, today.

The Data Day, A few days: November 22-28 2013

Total Data Integration. And more

And that’s the data day, today.

Forthcoming webinar: Beyond NoSQL – Distributed Databases in Production, with Basho

On Tuesday, December 10th at 10:00am PT/1:00pm ET I’ll be taking part in a webinar in association with Basho on the subject of Beyond NoSQL – Distributed Databases in Production.

I’ll be presenting a brief history of NoSQL and covering NoSQL drivers and adoption trends, as well as our perspective on the NoSQL database landscape, and the importance of scalability and distributed architecture.

I’ll also be joined by Bobby Patrick, EVP and CMO at Basho Technologies, to discuss the benefits and future of distributed systems, while Tapjoy will also discuss how they are using distributed databases to provide reliable data locality to their customers.

For full details, and to register, click here.

New 451 Research report: Total Data Integration

Data integration technologies have a critical role to play in delivering on the potential of ‘big data,’ by ensuring that emerging data processing and analysis technologies work well with each other, and also with more established approaches.

The roles of established data management techniques – such as data governance, data quality and master data management – gain even greater significance given the desire to process and analyze data from previously ignored sources.

By adopting an approach we call ‘Total Data Integration,’ a company not only gets a complete picture of its business, customers and products, but it can derive new insights that weren’t possible when only structured data was brought into the mix.

The latest long-format report from the Information Management team – Total Data Integration – outlines the key drivers shaping this sector and highlights the role that integration and governance technologies will play in enabling the analysis of structured, semi-structured and unstructured data.

It explains how established vendors and emerging startups alike are responding to the increased complexity of Total Data analytics by enabling integration of these various data sources. The report also explores new approaches to data integration, including the role of Hadoop.

The Data Day, A few days: November 15-21 2013

Storage and big data: what went wrong? And more

And that’s the data day, today.

Forthcoming webinar: Hadoop in the cloud, with Qubole

On Tuesday November 26 at 11AM PDT I’ll be taking part in a webinar, Deploying Hadoop: Bare Metal or Cloud?, in association with Qubole.

I’ll be joined by Ashish Thusoo of Qubole (founder of Apache Hive), and between us we will outline the benefits of Hadoop implementation, examine common use cases for big data projects of various size and scope, and explain how Hadoop-as-a-Service removes complexity while adding much-needed functionality.

Among the topics we’ll cover:

  • Drivers behind the adoption of Hadoop, including interest in running Hadoop in the cloud
  • Explore the cost and utilization implications of running Hadoop on the cloud versus on-premises
  • 5 best practices to consider when deploying Hadoop infrastructure in the cloud – by Ashish Thusoo, who ran Facebook’s big data infrastructure team for several years.
  • Barriers to adoption for Hadoop-as-a-service and how they are being overcome by the emergence of new managed service offerings and providers.
  • What Big Data-as-a-Service means, and how it allows you to focus on data analysis and monetization (and less on infrastructure).

For more details, and to register, click here: http://info.qubole.com/hadoop-in-the-clouds

The Data Day, A few days: November 9-14 2013

Total Data Integration. PostgreSQL on RDS. And more

And that’s the data day, today.

The Data Day, A few days: November 2-8 2013

Garantia Data almost becomes RedisDB, raises $9m. And more

And that’s the data day, today.