The Data Day: August 30, 2016

What happened in data and analytics this week will astound you

For 451 Research clients: NewSQL databases: a definitive guide http://bit.ly/2c7wJPu

For 451 Research clients: With Atlas, MongoDB jumps into hosted NoSQL DBaaS waters http://bit.ly/2c7wXq4 By Jim Curtis

For 451 Research clients: DataStax rolls graph into DSE 5.0, highlights NoSQL multi-models http://bit.ly/2bOKF2I By Jim Curtis

For 451 Research clients: Looker illuminates its analytics business and platform strategy http://bit.ly/2c7z396 By Krishna Roy

For 451 Research clients: Arcadia looks to simplify security for Hadoop analysis, lands Rackspace as reseller http://bit.ly/2c7wwvZ By Krishna Roy

For 451 Research clients: With $30m in funding, Vena looks to expand its cloud service to an enterprise platform http://bit.ly/2c7x3Oq By Krishna Roy

For 451 Research clients: Stitch emerges from RJMetrics with ETL as a service following cloud BI sale http://bit.ly/2c7wW5m

For 451 Research clients: CMC Markets sees promising self-service results with new ‘Google-like’ BI tool http://bit.ly/2c7xPuK By Jason Stamper

Tableau appointed Adam Selipsky as new CEO http://tabsoft.co/2bONxN1

Splunk reported a net loss of $86.6m on Q2 revenue up 43% to $212.8m http://splk.it/2bON1P5

Salesforce signs agreement to acquire BeyondCore http://bit.ly/2bOOZPK

Magnitude Software acquired Simba Technologies http://mwne.ws/2bONqRI

SAP is reportedly acquiring Altiscale for over $125m http://bit.ly/2bON8dE

Syncsort acquires Cogito to enhance mainframe data access http://prn.to/2c7ynRb

Galactic Exchange closes seed financing round http://bit.ly/2bOMxZw

Teradata makes Aster Analytics available on Hadoop and Teradata Aster Analytics on Amazon Web Services http://prn.to/2c7y5Kh

JSON support is generally available in Azure SQL Database http://bit.ly/2c7xVT9

Red Hat launches Red Hat Virtualization 4 http://red.ht/2bOMJb0

SnapLogic launches Summer 2016 release of its SnapLogic Elastic Integration Platform http://bit.ly/2bOOmpq

AWS releases Amazon Kinesis Analytics http://bit.ly/2bOOwNd

AWS licenses SQLstream technology for Amazon Kinesis Analytics service http://bit.ly/2c7CZaa

Percona delivers open source in-Memory storage engine for Percona Server for MongoDB http://bit.ly/2bOPIjH

Riversand launches MDMCenter v7.8 http://bit.ly/2c7zDUl

WANdisco announces the release of WANdisco Fusion 2.9 http://bit.ly/2bOOapU

And that’s the data day, today.

The Data Day, Today: Feb 17 2012

Rob Bearden is new Hortonworks CEO. Oracle updates MySQL Cluster. And more.

An occasional series of data-related news, views and links posts on Too Much Information. You can also follow the series @thedataday.

* With a new CEO, will Hortonworks get serious about Hadoop? Rob Bearden is Hortonworks’ new chief elephant-herder.

* Oracle Oracle Announces General Availability of MySQL Cluster 7.2. Claims 1 billion queries per minute.

* Vertica Extends Manageability and Ease-of-Use for the Vertica Analytics Platform.

* NGDATA Raises Capital to Accelerate Growth Belgian data management company confirms that it recently acquired Outerthought.

* QlikTech Announces Fourth Quarter and Full Year 2011 Financial Results

* Quest Business Intelligence Studio 1.0 is now Generally Available

* Jaspersoft Delivers Big Data Integration into Jaspersoft ETL

* Hortonworks University Launches to Deliver Comprehensive Apache Hadoop Training and Certification.

* Schema in Cassandra 1.1. “as systems deployed on Cassandra grew and matured, lack of schema became a pain point”

* GigaSpaces Announces New Cloudify Free Product Edition.

* Announcing Reduced Pricing on SQL Azure and New 100MB Database Option.

* Composite Software Continues Innovation with Release of Version 6.1 of its Data Virtualization Platform.

* Multi-Tenant Cloudant in Europe.

* NuvolaBase has launched its hosted graph database offering.

* How Target Figured Out A Teen Girl Was Pregnant Before Her Father Did. Big data’s beer and diapers equivalent.

* Beyond “Big Data”. “I have a theory that buzzwords are usually helpful in general, in that they usher in new concepts before they end up as meaningless marketing fluff–and, eventually, punchlines. I think this is in the process of happening right now with the term “big data”.”

* For 451 Research clients

# Akiban prepares to launch ‘table grouping’ NewSQL database IMpact Report

# IBM sheds light on the Big Blue business of information governance Market Decelopment Report

And that’s the Data Day, today.

The Data Day, Today: Jan 10 2012

Oracle OEMs Cloudera. The future of Apache CouchDB. And more.

An occasional series of data-related news, views and links posts on Too Much Information. You can also follow the series @thedataday.

* Oracle announced the general availability of Big Data Appliance, and an OEM agreement with Cloudera for CDH and Cloudera Manager.

* The Future of Apache CouchDB Cloudant confirms intention to integrate the core capabilities of BigCouch into Apache CouchDB.

* Reinforcing Couchbase’s Commitment to Open Source and CouchDB Couchbase CEO Bob Wiederhold attempts to clear up any confusion.

* Hortonworks Appoints Shaun Connolly to Vice President of Corporate Strategy Former vice president of product strategy at VMware.

* Splunk even more data with 4.3 Introducing the latest Splunk release.

* Announcement of Percona XtraDB Cluster (alpha release) Based on Galera.

* Bringing Value of Big Data to Business: SAP’s Integrated Strategy Forbes interview with with Sanjay Poonen, President and corporate officer of SAP Global Solutions.

* New Release of Oracle Database Firewall Extends Support to MySQL and Enhances Reporting Capabilities Self-explanatory.

* Big data and the disruption curve “Many efforts are being funded by business units and not the IT department and money is increasingly being diverted from large enterprise vendors.”

* Get your SQL Server database ready for SQL Azure Microsoft “codename” SQL Azure Compatibility Assessment.

* An update on Apache Hadoop 1.0 Cloudera’s Charles Zedlewski helpfully explains Apache Hadoop branch numbering.

* Xeround and the CAP Theorem So where does Xeround fit in the CAP Theorem?

* Can Yahoo’s new CEO Thompson harness big data, analytics? Larry Dignan thinks Scott Thompson might just be the right guy for the job.

* US Companies Face Big Hurdles in ‘Big Data’ Use “21% of respondents were unsure how to best define Big Data”

* Schedule Your Agenda for 2012 NoSQL Events Alex Popescu updates his list of the year’s key NoSQL events.

* DataStax take Apache Cassandra Mainstream in 2011; Poised for Growth and Innovation in 2012 The usual momentum round-up from DataStax.

* Objectivity claimed significant growth in adoption of its graph database, InfiniteGraph and flagship object database, Objectivity/DB.

* Cloudera Connector for Teradata 1.0.0 Self-explanatory.

* For 451 Research clients

# SAS delivers in-memory analytics for Teradata and Greenplum Market Development report

# With $84m in funding, Opera sets out predictive-analytics plans Market Development report

* Google News Search outlier of the day: First Dagger Fencing Competition in the World Scheduled for January 14, 2012

And that’s the Data Day, today.

What we talk about when we talk about NewSQL

Yesterday The 451 Group published a report asking “How will the database incumbents respond to NoSQL and NewSQL?”

That prompted the pertinent question, “What do you mean by ‘NewSQL’?”

Since we are about to publish a report describing our view of the emerging database landscape, including NoSQL, NewSQL and beyond (now available), it probably is a good time to define what we mean by NewSQL (I haven’t mentioned the various NoSQL projects in this post, but they are covered extensively in the report. More on them another day).

“NewSQL” is our shorthand for the various new scalable/high performance SQL database vendors. We have previously referred to these products as ‘ScalableSQL’ to differentiate them from the incumbent relational database products. Since this implies horizontal scalability, which is not necessarily a feature of all the products, we adopted the term ‘NewSQL’ in the new report.

And to clarify, like NoSQL, NewSQL is not to be taken too literally: the new thing about the NewSQL vendors is the vendor, not the SQL.

So who would be consider to be the NewSQL vendors? Like NoSQL, NewSQL is used to describe a loosely-affiliated group of companies (ScaleBase has done a good job of identifying, some of the several NewSQL sub-types) but what they have in common is the development of new relational database products and services designed to bring the benefits of the relational model to distributed architectures, or to improve the performance of relational databases to the extent that horizontal scalability is no longer a necessity.

In the first group we would include (in no particular order) Clustrix, GenieDB, ScalArc, Schooner, VoltDB, RethinkDB, ScaleDB, Akiban, CodeFutures, ScaleBase, Translattice, and NimbusDB, as well as Drizzle, MySQL Cluster with NDB, and MySQL with HandlerSocket. The latter group includes Tokutek and JustOne DB. The associated “NewSQL-as-a-service” category includes Amazon Relational Database Service, Microsoft SQL Azure, Xeround, Database.com and FathomDB.

(Links provide access to 451 Group coverage for clients. Non-clients can also apply for trial access).

Clearly there is the potential for overlap with NoSQL. It remains to be seen whether RethinkDB will be delivered as a NoSQL key value store for memcached or a “NewSQL” storage engine for MySQL, for example. While at least one of the vendors listed above is planning to enable the use of its database as a schema-less store, we also expect to see support for SQL queries added to some NoSQL databases. We are also sure that Citrusleaf won’t be the last NoSQL vendor to claim support for ACID transactions.

NewSQL is not about attempting to re-define the database market using our own term, but it is useful to broadly categorize the various emerging database products at this particular point in time.

Another clarification: ReadWriteWeb has picked up on this post and reported on the “NewSQL Movement”. I don’t think there is a movement in that sense that we saw the various NoSQL projects/vendors come together under the NoSQL umbrella with a common purpose. Perhaps the NewSQL players will do so (VoltDB and NimbusDB have reacted positively to the term, and Tokutek has become the first that I am aware of to explicitly describe its technology as NewSQL). As Derek Stainer notes, however: ” In the end it’s just a name, a way to categorize a group of similar solutions.”

In the meantime, we have already noted the beginning for the end of NoSQL, and the lines are blurring to the point where we expect the terms NoSQL and NewSQL will become irrelevant as the focus turns to specific use cases.

The identification of specific adoption drivers and use cases is the focus of our forthcoming long-form report on NoSQL, NewSQL and beyond, from which the 451 Group reported cited above is excerpted.

The report contains an overview of the roots of NoSQL and profiles of the major NoSQL projects and vendors, as well as analysis of the drivers behind the development and adoption of NoSQL and NewSQL databases, the evolving role of data grid technologies, and associated use cases.

It will be available very soon from the Information Management and CAOS practices and we will also publish more details of the key drivers as we see them and our view of the current database landscape here.