Entries from May 2012 ↓

A different perspective on NoSQL vendor traction

Amid the reporting of 10gen’s $42m funding round yesterday a specific claim about 10gen’s success to date caught my eye.

“10gen says it’s got about half the NoSQL market wrapped up already. This is based on… indicators, such as how often LinkedIn profiles mention MongoDB.”

While our own analysis of LinkedIn profiles did indeed indicate that 10gen has a sizeable lead over its NoSQL rivals, this only accounts for the NoSQL market *to date*, and the NoSQL vendors have barely scratched the surface.

451 Research recently estimated that NoSQL software vendors between them generated revenue of just $20m in 2011 (less than half 10gen’s latest funding round), and that the market will grow at a CAGR of 82% to reach $215m by 2015.

10gen is well placed to capitalize on this growth given its customer and revenue traction to date. While we are not breaking out individual revenue estimates the chart below shows revenue and customer estimates for 10gen, Basho, Couchbase and DataStax, with the scale adjusted to fit on a single chart.

The chart appears to confirm 10gen’s claim to have half the NoSQL market wrapped up, at least in terms of customers. However, what this chart doesn’t address is the relative strategy stage of each vendor in terms of customer traction.

10gen has done extremely well in growing a large customer base via its focus on ease of developer adoption, and is now turning its attention to the sort of capabilities required by traditional enterprises.

Other vendors in the NoSQL space have done precisely the opposite: starting with enterprise capabilities and now turning their attention to greater ease of use and developer adoption.

We can begin to get a sense of how these strategies are playing out if we add a column for revenue per customer (again re-scaled). Here you can see that 10gen is actually doing less well than some of its rivals.

The size of the MongoDB installed base gives 10gen a big opportunity to aim at, but others are arguably ahead in terms of traction with enterprise customers. That’s why our market sizing methodology is specifically designed to take multiple (sometimes conflicting) factors into account in creating an estimate for each vendor, as well as the aggregate total.

10gen may well have about half the current NoSQL market wrapped up but this market has really only just begun.

MySQL vs. NoSQL and NewSQL – survey results

Back in January we launched a survey of database users to explore the competitive dynamic between MySQL, NoSQL and NewSQL databases, and to to discover if MySQL usage is really declining – as had been indicated by the results of a prior survey.

The publication of the associated report took longer than expected, mostly because we expanded its scope to include revenue and growth estimates for the MySQL ecosystem, NoSQL and NewSQL sectors respectively, and with that report now published I am pleased to fulfil our promise to share the survey results.

We seem to be having some random embedding issues so for now the results can be found on SlideShare, adapted from the presentation given at OSBC earlier this week. For greater context, we have also included an explanation of each slide, below:

Slide 2: Provides an overview of the associated report – MySQL vs NoSQL and NewSQL 2011:2015, which is available here.

Slide 3: Explains why we launched the report. We once described as the crown jewel of the open source database world, since its focus on Web-based applications, its lightweight architecture and fast-read capabilities, and its brand differentiated it from all of the established database vendors and made for a potentially complementary acquisition. Today, the competitive situation is very different.

Slide 4: Oracle’s MySQL business faces competition from the rest of the MySQL ecosystem, as illustrated in Slide 5, many of which have emerged following Oracle’s acquisition of Sun/MySQL.

Slide 6: The emergence of these alternatives was triggered, in part, by concern about the future of MySQL. A previous 451 survey,conducted in November 2009, showed that there was real concern about the acquisition, with only 17% of MySQL users believing Oracle should be allowed to acquire MySQL.

Slide 7: The 2009 survey also showed that while 82.1% of respondents were already using MySQL, that figure was expected to drop to 72.3% by 2014. That survey was conducted amid a climate of fear, uncertainty and doubt regarding the future of MySQL, and one of the drivers for our current report was to see if that predicted decline occurred.

Slide 8: To put this in context, we asked the current survey sample (which included 205 database users) about their reaction to the acquisition. While the vast majority of MySQL users reported that they continued to use MySQL where appropriate, 5% indicated that they were more inclined to use MySQL, and 26% said they were less inclined to use MySQL. Not surprisingly the proportion of users less inclined to use MySQL was much higher amongst those abandoning MySQL than those sticking with MySQL.

Slide 9: We also asked respondents to rate Oracle’s ownership of MySQL on a range of very good to very bad. Overall, the balance tipped in favour of a negative perception of Oracle’s track record, while there was naturally a more negative perception of Oracle amongst those abandoning MySQL compared to MySQL mainstays. However, the results showed that the percentage of respondents rating the company’s performance ‘very good’ and ‘very bad’ was actually quite similar for both abandoners and mainstays. While those abandoning MySQL are more likely to have a negative perception of Oracle, it is not necessarily safe to assume that Oracle’s actions and strategy are the cause of the abandonment. Clearly there are other competitive forces at work.

Slide 10: Not least the emergence of NoSQL, as illustrated in Slide 11, and NewSQL, as illustrated in Slide 12.

Slide 13: Based on some very high profile examples of projects migrating from MySQL to NoSQL, there is a common assumption that NoSQL and NewSQL pose a direct, immediate threat to MySQL. We believe the competitive dynamic is more complex.

Slide 14: While 49% of those survey respondents abandoning MySQL planned on retaining or adopting NoSQL databases, only 12.7% said they had actually deployed NoSQL databases as a *direct replacement* for MySQL.

Slide 15: In comparison, there is much greater overlap between NewSQL and MySQL, but of a complementary nature. 33% of respondents retaining MySQL had considered, tested or deployed NewSQL database technologies, while approximately 75% of the NewSQL revenue for 2011 is from vendors that we also consider part of the MySQL ecosystem.

Slide 16: The results of our 2012 survey show that MySQL is currently the most popular database amongst our survey sample, used by 80.5% of respondents today.

Slide 17: However, it’s popularity is again expected to decline to 2014 and 2017. This indicates an accelerated decline in the use of MySQL, compared the findings of our 2009 survey. While that survey was conducted amid a climate of fear, uncertainty and doubt regarding the future of MySQL we are not aware of any specific reason why the 2012 sample, which was self-selecting, should have a disproportionately negative attitude to MySQL or Oracle.

Slide 18: MySQL’s predicted decline of 26.4 percentage points between 2012 and 2017 compares to a predicted decline of just 9.3 percentage points for Microsoft SQL Server, and only 5.9 percentage points for Oracle Database. In comparison, MariaDB, Apache Cassandra and Apache CouchDB are expected to increase in usage by 3.0 percentage points or greater between 2011 and 2017.

Slide 19: Although alternative MySQL distributions including MariaDB, Drizzle and Percona Server are expected to see increased adoption over the next five years, they are not growing at the same rate that MySQL is declining.

Slide 20: So where are those abandoning MySQL going to? Looking specifically at the 55 MySQL users who expect to abandon it by 2017 (which is admittedly a small sample, and therefore not to be considered statistically relevant) we see that PostgreSQL is the most popular database being retained or adopted over the same period, followed by Microsoft SQL Server, Oracle, MongoDB, and MariaDB.

Slide 21: This only tells part of the story, however. Just because a company is retaining Oracle Database, for example, does not necessarily mean that Oracle Database is being used as a replacement for the abandoned MySQL. We therefore also specifically asked survey respondents which databases they had considered, tested or deployed as a direct replacement for MySQL. The response from the 55 respondents planning to abandon MySQL again saw PostgreSQL, MariaDB and MongoDB as the most popular answers, followed by Apache CouchDB and Apache HBase.

Slide 22: While NoSQL database were well-represented in this list, we saw that anyone considering NoSQL considered multiple NoSQL databases. Per respondent, NoSQL databases were the least considered of all alternatives by existing MySQL users.

Slide 23: The survey results suggest that MongoDB is the most often considered, tested or deployed as a replacement or complement for MySQL, followed by Apache CouchDB, Apache HBase, Apache Cassandra/DataStax, and Redis.

Slide 24: NewSQL technologies that improve the scalability and performance of MySQL scored well, with eight of the top 10 most considered NewSQL technologies being directly complementing MySQL. Of the other two, one (Drizzle) is a derivative of MySQL, and the other (Clustrix) can also be used in a complementary manner as part of a MySQL cluster, although in the long-term is positioned as a direct alternative.

Slide 25: MariaDB is the member of the MySQL ecosystem most often considered, tested or deployed as a replacement or complement for MySQL, followed by Continuent Tungsten, Percona Server, MySQL Cluster, and Amazon RDS.

Slide 26: More than half of all MySQL users had considered, tested or deployed another relational database as a direct replacement, while over 40% had considered, tested or deployed a caching technology to complement MySQL. The memcached caching technology was the most widely-deployed of all the technologies we asked about, followed closely by PostgreSQL, which supported anecdotal evidence that a number of MySQL users are migrating to the other major open source transactional database.

Slide 27: For the record, the survey had 205 respondents. Primary job roles among respondents included: director/manager of IT infrastructure (18.0%); architect/engineer (17.6%); developer/programmer (15.6%); database/systems administrator (14.6%); consultant (14.1%); VP level or above (13.7%); analyst (3.4%); and line-of-business manager (2.9%).

Further survey analysis and perspective on the competitive dynamic between MySQL, NoSQL and NewSQL is available in the MySQL vs NoSQL and NewSQL report, which also includes market sizing and growth predictions for the three segments.

451 Research delivers market sizing estimates for NoSQL, NewSQL and MySQL ecosystem

NoSQL and NewSQL database technologies pose a long-term competitive threat to MySQL’s position as the default database for Web applications, according to a new report published by 451 Research.

The report, MySQL vs. NoSQL and NewSQL: 2011-2015, examines the competitive dynamic between MySQL and the emerging NoSQL non-relational, and NewSQL relational database technologies.

It concludes that while the current impact of NoSQL and NewSQL database technologies on MySQL is minimal, they pose a long-term competitive threat due to their adoption for new development projects. The report includes market sizing and growth estimates, with the key findings as follows:

• NoSQL software vendors generated revenue* of $20m in 2011. NoSQL software revenue is expected to rapidly grow at a CAGR of 82% to reach $215m by 2015.

• NewSQL software vendors generated revenue* of $12m in 2011 (of which $9m is also considered MySQL ecosystem revenue). NewSQL revenue is also expected to grow rapidly at a CAGR of 75% to reach $112m by 2015 (including $56m in MySQL ecosystem revenue).

• The MySQL support ecosystem generated revenue* of $171m in 2011 (including $9m from NewSQL technologies). MySQL ecosystem revenue is expected to grow at a CAGR of 40% to reach $664m by 2015 (including $56m in NewSQL revenue).

“The MySQL ecosystem is now arguably more healthy and vibrant than it has ever been, with a strong vendor committed to the core product, and a wealth of alternative and complementary products and services on offer to maintain competitive pressure on Oracle,” commented report author Matthew Aslett, research manager, data management and analytics, 451 Research.

“However, the options for MySQL users have never been greater, and there is a significant element of the MySQL user base that is ready and willing to look elsewhere for alternatives,”

As well as revenue and growth estimates, the report also includes a survey of over 200 database administrators, developers, engineers and managers. The survey findings include:

• While the majority of MySQL users continue to use MySQL where appropriate, the use of MySQL is expected to decline from 80.5% of survey respondents today to 62.4% by 2014 and just 54.1% by 2017.

• Despite the emergence of NoSQL and NewSQL database products, the most common direct replacement for MySQL among survey respondents today is PostgreSQL, which is also the focus of a recent burst of commercial activity.

• While 49% of those survey respondents abandoning MySQL planned on retaining or adopting NoSQL databases, only 12.7% of MySQL abandoners said they had actually deployed NoSQL databases as a direct replacement for MySQL.

“While there have been some high profile example of users migrating from MySQL to NoSQL database, the huge size of MySQL installed base means that these projects are comparatively rare,” commented Aslett.

The report describes how NoSQL database technologies are largely being adopted for new projects that require additional scalability, performance, relaxed consistency and agility, while NewSQL database technologies are, at this stage, largely being adopted to improve the performance and scalability of existing databases, particularly MySQL.

“NoSQL and NewSQL have not made a significant impact on the MySQL installed base at this stage but MySQL is no longer the de facto standard for new application development projects,” said Aslett. “As a result, NoSQL and NewSQL pose a significant long-term competitive threat to MySQL’s dominance.”

MySQL vs. NoSQL and NewSQL: 2011-2015 is now available to existing 451 Research subscribers. Non-clients can apply for trial access to 451 Research’s content.

*451 Research’s analysis of MySQL, NoSQL and NewSQL revenue is based on a bottom-up analysis of each participating vendor’s current revenue and growth expectations, and includes software license and subscription support revenue only. Revenue line items not included in these figures include hardware associated with the delivery of these services, revenue related to applications deployed on these databases, traditional hosting services, or systems integration performed by the vendors or other third parties.

The revenue estimates do not take into account unpaid usage of open source licensed MySQL, NoSQL and NewSQL software, and therefore represent only a fraction of the total addressable market. Based on the above revenue figures and other analysis, 451 Research estimates that the total value of the MySQL ecosystem in terms of ‘displaced’ proprietary software might equate to $1.7bn in 2011, while the NoSQL market had a displaced value of $195.7m and the NewSQL sector a displaced value of $99.4m.

The Data Day, Today: May 18 2012

SAP expands HANA. Informatica embraces big data. Gary Bloom joins MarkLogic. And more.

An occasional series of data-related news, views and links posts on Too Much Information. You can also follow the series @thedataday.

* For 451 Research clients

# Informatica 9.5: ‘big data’ runs through the integration platform makeover Impact Report

# Lucid Imagination launches search-based ‘big data’ platform Impact Report

# Datameer updates Hadoop-based BI stack with an eye to more complex analysis Impact Report

# MarkLogic searches for operational analytics role with plans for SQL, MapReduce support Impact Report

# Infobright shines following shift to machine-generated data Impact Report

# Starcounter focuses on performance with in-memory database update Impact Report

# Guavus bears fruit with data-processing platform for communications operators Impact Report

# InsightSquared bags $4.5m series A funding and salesforce.com as an investor Impact Report

# MarkLogic names veteran exec Gary Bloom as new president and CEO Analyst note

* SAP Continues to Expand Capabilities and Scale of SAP HANA Platform and Ease Developer Adoption

* SAP HANA Offers Multi-Node Capabilities to Help Customers Scale Out

* Gary Bloom Joins MarkLogic as Chief Executive Officer

* Amazon RDS for SQL Server and .NET support for AWS Elastic Beanstalk

* Informatica 9.5 Unleashes the Power of Hadoop

* Informatica Brings Master Data Management to Big Data, Social, Cloud and Mobile Computing

* Talend Announces New Release of Enterprise Open Source Integration Platform

* Lucid Imagination Combines Search, Analytics and Big Data to Tackle the Problem of Dark Data

* Big Data Refinery Fuels Next-Generation Data Architecture

* 7 Key Drivers for the Big Data Market

* Google puts a price tag on Cloud SQL services

* Actuate and Hortonworks Collaborate to Visualize Big Data

* Hadapt and Cloudera Deliver Big Data Analytics with Apache Hadoop

* Cloudera Partners With Hadoop Managed Services Provider MetaScale to Help Large Traditional Enterprises Adopt Apache Hadoop

* Opera Solutions’ Big Analytics Tailor Made for SAP HANA: Signal Hub Technology

* Cloudant to Contribute Big Data Capabilities to Apache CouchDB Project

* Hortonworks and Kognitio Announce Technical Partnership

* Starcounter Unveils World’s Fastest Consistent Database

* XAP 9.0 – Geared for Real-Time Big Data Stream Processing

* How long before R overtakes SAS and SPSS?

* Betting big on live sports data, Perform lays €120 million on RunningBall

And that’s the Data Day, today.

Forthcoming Webinar: Real World Success from Big Data

The initial focus of ‘big data’ has been about its increasing volume, velocity and variety — the “three Vs” — with little mention of real world application. Now is the time to get down to business.

On Wednesday, May 30, at 9am PT I’ll be taking part in a webinar with Splunk to discuss real world successes with ‘big data’.

451 Research believes that in order to deliver value from ‘big data’, businesses need to look beyond the nature of the data and re-assess the technologies, processes and policies they use to engage with that data.

I will outline 451 Research’s ‘total data’ concept for delivering business value from ‘big data’, providing examples of how companies are seeking agile new data management technologies, business strategies and analytical approaches to turn the “three Vs” of data into actionable operational intelligence.

I’ll be joined by Sanjay Mehta, Vice President of Product Marketing at Splunk, which was founded specifically to focus on the opportunity of effectively getting value from massive and ever changing amounts of machine-generated data, one of the fastest growing and most complex segments of ‘big data’.

Sanjay will share big data achievements from three Splunk customers, Groupon, Intuit and CenturyLink. Using Splunk, these companies are turning massive volumes of unstructured and semi-structured machine data into powerful insights.

Register here.

The Data Day, Today: May 8 2012

IBM acquires Vivisimo. Funding for Birst, ParAccel, Metamarkets and DataSift. And more.

An occasional series of data-related news, views and links posts on Too Much Information. You can also follow the series @thedataday.

* For 451 Research clients

# IBM picks up Vivisimo to search for value in ‘big data’ Deal Analysis

# Teradata delivers on analytic cloud vision with Active Data Warehouse Private Cloud Impact Report

# The Big Blue picture for ‘big data’ analytics: IBM sheds light on BigSheets Impact Report

# Oversight Systems’ Continuous Analysis extracts actionable insight from data Impact Report

# Kalido updates MDM offering with business users, operationalizing master data in mind Impact Report

# Delphix reaps reward from agile approach to database virtualization Impact Report

# Automated Insights looks to pitch narrative, visuals and stats to enterprises Impact Report

# myDIALS eyes indirect sales in quest to be Internet access layer for analytics Impact Report

* IBM Advances Big Data Analytics with Acquisition of Vivisimo Also announces support for Cloudera.

* Teradata Announces 2012 First Quarter Results Revenue up 21% (PDF)

* Actuate Reports First Quarter 2012 Financial Results Revenue up 9% (PDF)

* Birst Secures $26 Million in Financing Led By Sequoia Capital

* ParAccel Closes Record Q1 Revenues and $20 Million Investment Round

* Metamarkets Raises $15 Million to Deliver Data Science-as-a-Service

* DataSift adds $7.2M: The story so far and focus for the future

* Teradata to Acquire eCircle (PDF)

* Google BigQuery brings Big Data analytics to all businesses

* TIBCO Spotfire Brings the Power of Data Discovery to Big Data and Extreme Information

* Jaspersoft Teams with VMware To Deliver Business Intelligence for Data-Driven Cloud Applications

* Kalido and Teradata Sign Global Reseller Agreement

* Actuate Announces Cloudera Alliance to Support Apache Hadoop and BIRT Developers in Big Data Integration

* Hortonworks and Kognitio Announce Technical Partnership Driving Apache Hadoop Adoption in Big Data Analytics Implementations

* Tokutek and PalominoDB Partner to Bring Scale, Performance to Database Deployments

* Acunu is pleased to announce v2 of the Acunu Data Platform!

* Is Yahoo really threatening memcached and Open Compute?

* Introducing Zend DBi as a MySQL Replacement on IBM i

* Zettaset and Hyve Solutions Build First Fully Integrated Enterprise OS Hadoop Solution

* Cloudera Announces New Japanese Subsidiary

* Bull Announces the Formation of Database Migration Business Unit

* Couchbase to Run Native with Key-Value API for ioMemory

* The Big Data Value Continuum

* Big Data is Business Intelligence plus Attention Deficit Disorder

* Nokia released Dempsy an open source stream data processing platform.

And that’s the Data Day, today.