Big Data and the Cloud: A Perfect Storm?

Next week at Cloud Expo Europe in London I’ll be giving a presentation – at 12.05 on January 29 to be precise – on the potential confluence of bog data and cloud computing.

Cloud computing is all about enabling frictionless adoption of low-cost, flexible compute and storage, while big data technologies such as Apache Hadoop enable low-cost, flexible data storage and processing. Hence many people seem to believe that cloud computing and big data have the potential to create a perfect storm of disruption.

However, 451 Research has been tracking the adoption of data management technologies on the cloud – and the lack of it – since relational databases became available on AWS in 2008, and the effect of the confluence of big data and the cloud would perhaps better be described as dead calm, rather than a perfect storm. Other than development and test environments, adoption has been limited.

In our presentation “Big Data and the Cloud: A Perfect Storm?” we will take a look at the factors that have restricted adoption of databases in the cloud to date, explain why we see the potential for cloud database growth in the coming years, and examine how the strategies of emerging Hadoop- and database-as-a-service providers are evolving to ensure that big data and the cloud combine to fulfil their potential to disruptive the IT landscape as we know it.

The Data Day, Two days: January 15/16 2013

Funding for Ayasdi and Zettaset. NuoDB launches cloud database. And more

And that’s the Data Day, today.

Cloud databases, or database on the cloud?

As 2012 came to a close I tweeted

NuoDB has today kicked off that debate with the launch of its Cloud Data Management System and 12 rules for a 21st century cloud database.

NuoDB’s 12 rules appear pretty sound to me – in fact you could argue they are somewhat obvious. This is actually to NuoDB’s credit in my opinion, in that they haven’t simply listed 12 differentiating aspects of their product, but 12 broader requirements.

Either way, I believe that this is the right time to be debating what constitutes a “cloud database”. Database on the cloud are nothing new, but these are existing relational database products configured to run on the cloud.

In other words, they are databases on the cloud, not databases of the cloud. There is a significant difference between spinning up a relational database in a VMI on the cloud versus deploying a database designed to take advantage of, enable, and be part of, the cloud.

To me, a true cloud database would be one designed to take advantage of and enable elastic, distributed architecture. NuoDB is one of those, but it won’t be the only one. Many NoSQL databases could also make a claim, albeit not for SQL and ACID workloads.

This isn’t a matter of SQL versus NoSQL, however. We’ve seen companies building their own next-generation database platforms deploying NoSQL and SQL technologies alongside each other for different workload and consistency requirements. Where the SQL layer falls down is the inability of existing relational databases to support elastic, geographically distributed cloud environments.

NuoDB believes it has a solution to that. So too do others including GenieDB, Translattice and VMware. Meanwhile Google’s F1 and Spanner projects have legitimized the concept of the globally-distributed SQL database.

Either way, the era of the relational cloud database – rather than the relational database on the cloud – has begun.

The Data Day, Two days: January 11/14 2013

Navigating our illustrated database landscape map. And more

And that’s the Data Day, today.

Exclusive guide to navigating our database landscape map

Anyone confused by our recently published database landscape map might be interested to know that 451 Research clients can now access an exclusive 451 Research guide to navigating the increasingly complex database landscape.

The guide steps through each of the main ‘lines’ and covers some of the issues we are seeing driving change across the the breadth of the map, as well as for individual lines. It is based on an updated January 2013 iteration of the map, which includes a number of revisions and additions based on feedback to the December 2012 version.

Note: the latest update to the map is available here.

Like any simplified illustration, the database map has its weaknesses – we certainly wouldn’t suggest that a company could use it to find an appropriate database for any given workload – but given the array of databases on the market, it is designed to help businesses identify a shortlist of potentially suitable choices based on the intersection of major functionality lines.

451 Research clients can access the guide here, while non-clients can use the same link to apply for trial access.

The Data Day, Two days: January 9/10 2013

SAP on HANA. Funding for Guavus and ScaleArc. And more

And that’s the Data Day, today.

Our 2013 Database survey is now live

451 Research’s 2013 Database survey is now live at http://bit.ly/451db13 investigating the current use of database technologies, including MySQL, NoSQL and NewSQL, as well as traditional relation and non-relational databases.

The aim of this survey is to identify trends in database usage, as well as changing attitudes to MySQL following its acquisition by Oracle, and the competitive dynamic between MySQL and other databases, including NoSQL and NewSQL technologies.

There are just 15 questions to answer, spread over five pages, and the entire survey should take less than ten minutes to complete.

All individual responses are of course confidential. The results will be published as part of a major research report due during Q2.

The full report will be available to 451 Research clients, while the results of the survey will also be made freely available via a
presentation at the Percona Live MySQL Conference and Expo in April.

Last year’s results have been viewed nearly 55,000 times on SlideShare so we are hoping for a good response to this year’s survey.

One of the most interesting aspects of a 2012 survey results was the extent to which MySQL users were testing and adopting PostgreSQL. Will that trend continue or accelerate in 2013? And what of the adoption of cloud-based database services such as Amazon RDS and Google Cloud SQL?

Are the new breed of NewSQL vendors having any impact on the relational database incumbents such as Oracle, Microsoft and IBM? And how is SAP HANA adoption driving interest in other in-memory databases such as VoltDB and MemSQL?

We will also be interested to see how well NoSQL databases fair in this year’s survey results. Last year MongoDB was the most popular, followed by Apache Cassandra/DataStax and Redis. Are these now making a bigger impact on the wider market, and what of Basho’s Riak, CouchDB, Neo4j, Couchbase et al?

Additionally, we have been tracking attitudes to Oracle’s ownership of MySQL since the deal to acquire Sun was announced. Have MySQL users’ attitudes towards Oracle improved or declined in the last 12 months, and what impact will the formation of the MariaDB Foundation have on MariaDB adoption?

We’re looking forward to analyzing the results and providing answers to these and other questions. Please help us to get the most representative result set by taking part in the survey at http://bit.ly/451db13

The Data Day, Two days: January 7/8, 2013

SAP’s HANA – a floor wax *and* a dessert topping?

And that’s the Data Day, today.

The Data Day, A few days: January 2-4, 2013

Apache Cassandra and BigTop updates. And more

And that’s the Data Day, today.

The Data Day, Two days: December 20/21 2012

Data management and analytics in 2013. And more

And that’s the Data Day, today.