Search Results for 'nosql' ↓

Visualizing the $1bn+ VC investment in Hadoop and NoSQL

Cumulative VC funding for Hadoop and NoSQL vendors broke through the $1bn barrier in 2013, according a Spotlight report published by 451 Research, based on data provided by The 451 M&A KnowledgeBase.

The data indicates that there was a substantial increase in funding in 2013 ($530.5m, not including RethinkDB’s $8m announced yesterday) compared to 2012 ($190.9m), thanks to major rounds for the likes of MongoDB, Pivotal, Hortonworks and DataStax.

The report includes a visualization created by 451’s Director of Data Strategy and Solutions, Barbara Peng, that illustrates the connections between the various investors and the NoSQL and Hadoop vendors in which they have invested.

A snapshot of the visualization is shown below but the the original is interactive, enabling 451 Research clients to drag the various elements around for greater emphasis, as well as isolate the NoSQL or Hadoop categories.

vc-firms

451 Research clients can also scroll over the blue circles to see the total amount of funding raised by the individual Hadoop and NoSQL vendors, and scroll over the smaller orange circles to see which investors have backed which companies.

The sample set was limited to 16 vendors for visual clarity, but the six Hadoop and 10 NoSQL providers cited account for more than 87% of funding to date (with Pivotal representing the vast majority of the remaining 13%).

This visualization illustrates that investment in Hadoop and NoSQL providers comes from a relatively small group of VC firms (52 to be specific, excluding individual seed investors), resulting in a relatively tightly clustered graph.

However, the visualization also enables us to put to the test the recent blog post by MarkLogic’s Adam Fowler in which he stated:

“Just look at the number of investors who are investing in multiple NoSQL companies. They’re hedging their bets because they’re not sure themselves which businesses will survive.”

In fact investment in multiple Hadoop and NoSQL vendors is relatively rare. Only 11 out of the 52 VC firms have invested in more than one Hadoop and/or NoSQL vendor, with seven of those picking one Hadoop vendor and one NoSQL provider. Less hedging their bets as picking a winner in each category.

Of the remaining four investment shops, two have invested in one Hadoop distributor, one NoSQL specialist and one Hadoop-as-a-service provider (MapR, DataStax and Qubole for Lightspeed Venture Partners; Cloudera, Couchbase and Altiscale for Accel Partners), while In-Q-Tel has invested in one Hadoop supplier, one NoSQL vendor and one NoSQL-as-a-service provider (Cloudera, MongoDB and Cloudant).

Only Sequoia Capital has invested in multiple NoSQL vendors (as well as Hadoop-as-a-service provider Altiscale) having invested in MongoDB, DataStax and – hold onto your hats, irony fans – MarkLogic. It should be noted however that Sequoia has not invested in DataStax since its series A round in late 2010.

The full report, Venture funding for Hadoop and NoSQL vendors tops $1bn is available now to 451 Research clients and also includes our perspective on when combined Hadoop and NoSQL revenue might begin to exceed combined Hadoop and NoSQL VC funding, as well as the potential for M&A and IPO activity in 2014.

Forthcoming webinar: Beyond NoSQL – Distributed Databases in Production, with Basho

On Tuesday, December 10th at 10:00am PT/1:00pm ET I’ll be taking part in a webinar in association with Basho on the subject of Beyond NoSQL – Distributed Databases in Production.

I’ll be presenting a brief history of NoSQL and covering NoSQL drivers and adoption trends, as well as our perspective on the NoSQL database landscape, and the importance of scalability and distributed architecture.

I’ll also be joined by Bobby Patrick, EVP and CMO at Basho Technologies, to discuss the benefits and future of distributed systems, while Tapjoy will also discuss how they are using distributed databases to provide reliable data locality to their customers.

For full details, and to register, click here.

NoSQL LinkedIn Skills Index – September 2013

With our rebooted NoSQL LinkedIn Skills Index, based on the number of LinkedIn member profiles mentioning each of the NoSQL projects, now into its second year, I thought it was a good time to add some newer projects to the list; specifically: ArangoDB, FoundationDB, RethinkDB, and Titan.

It shouldn’t surprise anyone to find that those four new additions failed to make a dent in the top ten list of the NoSQL databases most often cited in LinkedIn profiles. However, there is still some interesting activity this quarter, with Riak leapfrogging MarkLogic (as predicted).

linkedinq31

Outside the top ten, Apache Accumulo overtook Voldemort, and saw the second fastest growth in mentions in Q3, behind only DynamoDB and ahead of Neo4j, MongoDB, and Cassandra.

That growth saw MongoDB extend its lead as the most popular NoSQL database, according to LinkedIn profile mentions. As the chart below illustrates, it now accounts for 49% of all mentions of NoSQL technologies in LinkedIn profiles, according to our sample, compared with 47% in June.

allNoSQLq3

Incidentally, adding the four new NoSQL databases to the analysis did not have a significant impact on MongoDB’s share. Without them it still registered 49%. Expect MongoDB to pass the 50% threshold in Q4, however, as well as Couchbase to overtake MarkLogic.

Of course, we would also note that this is not meant to be a comprehensive analysis, but rather a snapshot of one particular data source.

Forthcoming webinar: NoSQL Technology and Real-time, Accurate Predictive Analytics

On August 29 at 10:00 am PT I’ll be taking part in a webinar in association with Objectivity entitled “Big Data: NoSQL Technology and Real-time, Accurate Predictive Analytics”.

The webinar will provide an overview of NoSQL database technology and, in particular, the role that graph databases have in the expanding analytics market.

I’ll be joined by Leon Guzenda, Founder, Objectivity, who will provide a brief overview of Objectivity, Inc and its products Objectivity/DB and InfiniteGraph, as well as J.C. Smart, Director Global Insight Laboratory, Georgetown University, who will explain how Georgetown University is taking advantage of Objectivity’s products to develop one of the most interconnected databases today – examining information from all types of sources worldwide in real-time.

For full details, and to register, click here.

Sizing the opportunities for MySQL, NoSQL, NewSQL and DBaaS

451 Research has recently published an update to our market sizing estimates for the MySQL ecosystem, NoSQL and NewSQL sectors, adding coverage of the database-as-a-service market.

The report, Next-Generation Operational Databases: 2012-2016, can be found here and provides estimates for the size of the aggregate market and each market sector, as well as competitive landscape maps. It also includes a growth forecast for each sector, and highlights the opportunities and threats facing participating vendors.

splash

The key findings are also available in the a short, free presentation (registration required), which can be found here, and provides details of how the MySQL, NoSQL and DBaaS sectors are each expected to grow to generate revenue in excess of $1bn by 2016.

NoSQL LinkedIn Skills Index – June 2013

Four quarters have now passed since we rebooted our NoSQL LinkedIn Skills Index, based on the number of LinkedIn member profiles mentioning each of the NoSQL projects, giving us a good view of the relative growth of the various NoSQL databases in the past year.

NoSQL-Jun

A few interesting statistics to pick out: Cassandra has jumped ahead of Redis for second place, while outside the top ten, shown here, OrientDB climbed above Hypertable and DEX climbed above InfiniteGraph. Looking ahead, expect Riak to overtake MarkLogic in the next three months.

DynamoDB saw the greatest increase in terms of the number of mentions in LinkedIn profiles in the past three months, although it remains in 10th position. In terms of growth, DynamoDB was followed by OrientDB, Neo4j, Apache Accumulo and DEX.

However, MongoDB once again extended its lead as the most popular NoSQL database, according to LinkedIn profile mentions. As the chart below illustrates, it now accounts for 47% of all mentions of NoSQL technologies in LinkedIn profiles, according to our sample, compared with 46% in March.

NoSQL_Jun2

Of course, we would also note that this is not meant to be a comprehensive analysis, but rather a snapshot of one particular data source.

Another significant data source that can provide a different perspective on the NoSQL market is our market-sizing revenue estimate. Stand-by for an update on our sizing estimates for the NoSQL, NewSQL, MySQL and DBaaS sectors in the coming weeks.

NoSQL LinkedIn Skills Index – March 2013

As Q1 comes to a close its time to take another look at our NoSQL LinkedIn Skills Index, based on the number of LinkedIn member profiles mentioning each of the NoSQL projects. This is the second update since we rebooted the analysis in September 2012 to account for more products and refine our search terms.

NoSQL_Mar

A few interesting statistics to pick out: Neo4j has, as predicted, jumped ahead of MarkLogic for sixth place. No other changes of position, but outside the top ten, shown here, Apache Accumulo continues to grow well.

In fact, Apache Accumulo had the fastest rate of growth for the second quarter in succession, just ahead of DynamoDB and OrientDB -once again – followed by Apache Cassandra and MongoDB.

MongoDB’s growth means that it once again extended its lead as the most popular NoSQL database, according to LinkedIn profile mentions. As the chart below illustrates, it now accounts for 46% of all mentions of NoSQL technologies in LinkedIn profiles, according to our sample, compared with 45% in December.

NoSQL_Mar2

NoSQL on MySQL: stating the obvious

Some of the NoSQL vendors seemed to have stirred up a mild controversy with their reactions to the launch of NoSQL access to InnoDB in MySQL 5.6 and their suggestions that NoSQL access is only a part of the NoSQL story.

Mark Leith, software development senior manager at Oracle has described the criticism as laughable and Oracle’s director of MySQL product marketing, Mat Keep, accused the NoSQL vendors of “trying to stand on the shoulders of giants” (which is pretty ironic given we are talking about Oracle adding NoSQL capabilities to one of its databases).

In any case I don’t see what the fuss is all about.

Sure, Couchbase and DataStax laid it on a bit thick, but these are corporate blog posts – it goes with the territory.

Besides while it might seem churlish to criticise NoSQL access to InnoDB in MySQL 5.6 for not being a document database or for enabling masterless multi-datacenter replication, the responses are valid in the context of hyperbolic claims that “MySQL can provide the best of both worlds… You don’t have to split your data and manage two databases.”

The caveat to all these claims, and indeed probably any claim ever made in a corporate blog, is “if it suits your particular application requirement.”

Back in early 2011 when we first considered the momentum behind NoSQL development and adoption we highlighted six key drivers:

  • Scalability
  • Performance
  • Relaxed consistency
  • Agility
  • Intricacy
  • Necessity

How many of those are addressed by key value access to the InnoDB storage engine? Query performance and agility, certainly. Necessity, perhaps – but only if your application workload requires both SQL and key value access.

As we stated when Oracle first began previewing key value access to the InnoDB storage engine:

“Support for data access using the memcached API by no means alleviates the need for NoSQL alternatives, but it will provide additional flexibility and agility for existing MySQL adopters.”

I also have to agree with Couchbase that this is a point that is illustrated by the existence of Oracle’s own NoSQL Database. As we stated at the time of its launch:

“The launch of Oracle NoSQL is… a clear indication that there are trends at work here that cannot be solved by adding non-SQL querying to existing relational databases.”

And that’s really all Couchbase and DataStax are pointing out.

If you’re looking for an offering that provides direct, key value insertion and querying of data in addition to SQL-based access to relational database tables, then MySQL 5.6 is clearly a leading contender. If that’s all you’re looking for, then you could arguably forget the need to manage two databases.

That clearly doesn’t necessarily make MySQL 5.6 suitable for use as a pure key value store, let alone a document database, or wide-column store, or graph database. If those are your requirements, MySQL 5.6 isn’t the best of any world, let alone both.

NoSQL LinkedIn Skills Index – December 2012

Time again to take a look at our NoSQL LinkedIn Skills Index, based on the number of LinkedIn member profiles mentioning each of the NoSQL projects. This is the first update since we rebooted the analysis in September to account for more products and refine our search terms.

NoSQL_Dec

On the face of it not a lot has changed in the last quarter, although there are a few interesting statistics to pick out. For instance, Neo4j is now practically tied for sixth place with MarkLogic and can be expected to overtake it in Q1 2013. Outside the top ten shown above, Apache Accumulo has gained two places – overtaking Aerospike and Hypertable.

In fact, Apache Accumulo showed the fastest rate of growth in mentions between September and December, just ahead of DynamoDB and OrientDB, followed by Couchbase and MongoDB.

MongoDB’s growth means that it has cemented its place as the most popular NoSQL database, according to LinkedIn profile mentions. As the chart below illustrates, it now accounts for 45% of all mentions of NoSQL technologies in LinkedIn profiles, according to our sample, compared with 43% in September.

nosql_all_dec

The Role of NoSQL and Graphs in the Total Data Landscape

I’ll be flying over to San Francisco at the weekend to attend and present at GraphConnect, which takes place at the Hyatt Regency on November 5 and 6.

Specifically, I’ll be giving a presentation on the role of NoSQL and graphs in the total data landscape, subtitled Big Data, Total Data, NoSQL, Graph, at 11.40am on November 6.

Here’s the overview: The database market is changing rapidly with new approaches emerging that provide an alternative to the relational data model. This presentation examines the drivers behind the rise of NoSQL data stores and, in particular graph databases, focusing on their use-cases and adoption trends, and exploring where graph database fit in the world of NoSQL, NewSQL, and big data.

I’ll also be moderating a panel at 5.05pm on November 6 comprised of enterprise companies that use graph databases in production. This panel includes 3-4 technical leads from Accenture, Cisco and Telenor Norway that will discuss what it takes to put large scale graph databases into production.

GraphConnect looks like a great event for anyone with experience with, or just interest in, graph databases. Keynotes will be provided by Emil Eifrém, CEO, Neo Technology, and James Fowler, co-author of Connected: The Surprising Power of Our Social Networks and How They Shape Our Lives.

The full agenda can be found here, and it’s not too late to register, here.