geniedb — Too much information

The Data Day, A few days: April 11-22, 2015

April 22nd, 2015 — Data management

CenturyLink, Hortonworks and Percona acquire Orchestrate, SequenceIQ and Tokutek respectively

For @451Research clients: @CenturyLink Orchestrates a move into database as a service http://t.co/dJNAAn8AyL (With @OrchestrateIO pick-up)

— Matt Aslett (@maslett) April 22, 2015

For @451Research clients: @Hortonworks buys @SequenceIQ to boost Hadoop on public and private clouds http://t.co/go7He9jo2S

— Matt Aslett (@maslett) April 15, 2015

For @451Research clients: @Percona expands MySQL portfolio and enters MongoDB market with purchase of @Tokutek http://t.co/lHMOXq9Fxc

— Matt Aslett (@maslett) April 15, 2015

For @451Research clients: @awscloud leverages Amazon's analytics experience for machine-learning service http://t.co/SxaFFzZoMZ By @owenrog

— Matt Aslett (@maslett) April 20, 2015

For @451Research clients: @OrientDB benefits from growing interest in multi-model NoSQL databases http://t.co/DxOoaaGyKh

— Matt Aslett (@maslett) April 20, 2015

For @451Research clients: @ScaleBase sees increased momentum as its database reaches new clouds http://t.co/TxZZUkU14Q By @jasonstamper

— Matt Aslett (@maslett) April 22, 2015

For @451Research clients: @host_analytics broadens EPM cloud offering with advanced modeling engine http://t.co/jrmNLUAWGJ By Jim Curtis

— Matt Aslett (@maslett) April 20, 2015

For @451Research clients: @WANdisco launches Fusion: active-active replication across multiple Hadoop distros http://t.co/md3LZe29AU

— Matt Aslett (@maslett) April 16, 2015

For @451Research clients: Dell @Boomi extends its iPaaS with an API lifecycle management release http://t.co/sGYyqZygkM By @CarlLehmann

— Matt Aslett (@maslett) April 13, 2015

For @451Research clients: @saffrontech spices up cognitive computing with SaffronStreamline http://t.co/Zb0RVlUWc7 By @jasonstamper

— Matt Aslett (@maslett) April 13, 2015

For @451Research clients: @zltechnologies unifies data analytics with ZL Enterprise Analytics http://t.co/rdhtg2mpBd By @jasonstamper

— Matt Aslett (@maslett) April 16, 2015

For @451Research clients: Agile BI player @chartio adds Data Pipeline to drive advanced data processing http://t.co/Y2Hvnnq7ZA By Jim Curtis

— Matt Aslett (@maslett) April 16, 2015

For @451Research clients: @SimbaTech raises its profile, announces support for iOS http://t.co/h9RGS0XZGw By Jim Curtis

— Matt Aslett (@maslett) April 14, 2015

For @451Research clients: @Oracle outlines Big Data Management System for data warehouse and Hadoop coexistence http://t.co/9oTijBHFd6

— Matt Aslett (@maslett) April 21, 2015

For @451Research clients: Distributed database-as-a-service firm @GenieDB has been put back in its bottle http://t.co/K11LaFdsC1

— Matt Aslett (@maslett) April 22, 2015

CenturyLink acquires Orchestrate to enhance cloud platform with new database capabilities http://t.co/d0fAXKPJSr

— Matt Aslett (@maslett) April 21, 2015

Hortonworks acquires SequenceIQ to boost provisioning for Hadoop in the cloud, Docker containers or bare metal. http://t.co/gxFOlZlIga

— Matt Aslett (@maslett) April 13, 2015

Percona has acquired Tokutek http://t.co/L8XKNqahBz

— Matt Aslett (@maslett) April 14, 2015

Pepperdata raises more than $15m in Series B funding led by Wing Venture Capital http://t.co/9T1GwBAQ7T

— Matt Aslett (@maslett) April 16, 2015

Deep IS raises $8m series A round led by Sigma Prime Ventures and Stage 1 Ventures http://t.co/3UhJh4f79K

— Matt Aslett (@maslett) April 22, 2015

Bedrock Data launches data integration platform, raises $3.11m in Series A funding. http://t.co/BUqaXMfQ7v

— Matt Aslett (@maslett) April 15, 2015

Oracle announces Big Data Management System statement of direction for data warehousing and Hadoop coexistence. http://t.co/duAEpYc8DZ

— Matt Aslett (@maslett) April 14, 2015

SAP claims more than 6,400 HANA customers as it announces Q1 financial results. http://t.co/HW9gJMpN2A

— Matt Aslett (@maslett) April 21, 2015

Hortonworks, Pivotal and IBM announce that their Hadoop distros are now aligned on the Open Data Platform http://t.co/ec65YvO5oJ

— Matt Aslett (@maslett) April 14, 2015

Hortonworks updates HDP Hadoop distribution with Apache Spark, Ambari 2.0 http://t.co/e2cGl3kvMn

— Matt Aslett (@maslett) April 14, 2015

Teradata claims Software-Defined Warehouse, with consolidation enhancement to Teradata Database. http://t.co/gEollIkhYh

— Matt Aslett (@maslett) April 20, 2015

Teradata unveils Data Warehouse Appliance 2800. http://t.co/tE3P4qICVz

— Matt Aslett (@maslett) April 20, 2015

MarkLogic claims revenue of more than $100m in its fiscal 2015 http://t.co/ht5GSL4NGK

— Matt Aslett (@maslett) April 15, 2015

MapR’s Distribution including Hadoop is now integrated with Teradata QueryGrid. http://t.co/dVHTt1Y0Uh

— Matt Aslett (@maslett) April 20, 2015

Hortonworks hires former Teradata Labs president Scott Gnau as chief technology officer http://t.co/QZ2NplWO9e

— Matt Aslett (@maslett) April 14, 2015

Teradata’s Think Big expands into Europe http://t.co/GKpqIHGLkJ and launches the Dashboard Engine for Hadoop. http://t.co/9UgIXVhzLN

— Matt Aslett (@maslett) April 13, 2015

GoodData updates cloud analytics platform with data explorer, discovery and collaboration capabilities. http://t.co/0JTENSR0mU

— Matt Aslett (@maslett) April 22, 2015

Pivotal proposes Geode, the core of Pivotal GemFire, as an Apache incubator project http://t.co/Xt2cGBNjyb

— Matt Aslett (@maslett) April 13, 2015

GridGain launches GridGain Enterprise Edition v7.0 and GridGain Community Edition v1.0.1, based on Apache Ignite. http://t.co/To7rBrZgbU

— Matt Aslett (@maslett) April 13, 2015

RethinkDB declares production readiness with version 2.0. http://t.co/iNQMprGEVJ

— Matt Aslett (@maslett) April 15, 2015

GigaSpaces updates XAP in-memory computing software http://t.co/MrKYYQBRfP

— Matt Aslett (@maslett) April 15, 2015

And that’s the data day, today.

The Data Day, A few days: November 9-14 2013

November 14th, 2013 — Data management

Total Data Integration. PostgreSQL on RDS. And more

Our Total Data Integration report, assessing the impact of 'big data' on DI is now available to 451 Research clients http://t.co/88Yv2dNmqr

— Matt Aslett (@maslett) November 13, 2013

For 451 Research clients: Basho previews distributed database update following major deal with UK's NHS http://t.co/DY9EQUy4h8

— Matt Aslett (@maslett) November 12, 2013

For 451 Research clients: Pivotal adds in-memory transaction processing to Hadoop distribution http://t.co/hfi0H8i2qQ

— Matt Aslett (@maslett) November 11, 2013

For 451 Research clients: SiSense goes for growth in BI with an eye toward an IPO http://t.co/1OwYwmtpIW By Krishna Roy

— Matt Aslett (@maslett) November 14, 2013

For 451 clients: Tidemark bags $13m series D, seeks performance management beachhead with fresh cut http://t.co/9KnyMGPRLb By Krishna Roy

— Matt Aslett (@maslett) November 12, 2013

For 451 clients: Platfora embraces a stream of events as part of multilayered Hadoop analytics vision http://t.co/k07bKozWdH By Krishna Roy

— Matt Aslett (@maslett) November 11, 2013

For 451 Research clients: HPCC Systems expands the scope – and community – of its open source data platform http://t.co/Q5XBnz57Bc

— Matt Aslett (@maslett) November 13, 2013

For 451 Research clients: FoundationDB raises $17m series A for multi-model ACID NoSQL database http://t.co/DfpYqnhEzQ Analyst note.

— Matt Aslett (@maslett) November 13, 2013

Pivotal launches Pivotal One PaaS, including Hadoop and analytics services. http://t.co/R05fcXITpI

— Matt Aslett (@maslett) November 12, 2013

Jut raises $20 million series B financing http://t.co/xyAsMOnfqe

— Matt Aslett (@maslett) November 11, 2013

FoundationDB Raises $17 Million Series A Financing http://t.co/gsdanHX895

— Matt Aslett (@maslett) November 12, 2013

Amazon Web Services introduces Amazon RDS for PostgreSQL http://t.co/8OMYTTZJK4

— Matt Aslett (@maslett) November 14, 2013

Amazon Web Services introduces automated cross-region snapshot copy for Amazon Redshift. http://t.co/atXjynB8GZ

— Matt Aslett (@maslett) November 14, 2013

DataStax launches Enterprise 3.2 with automating repair and capacity planning functions. http://t.co/Np1Yt4OMG8

— Matt Aslett (@maslett) November 14, 2013

Oracle updates Big Data Appliance, including "the entire Cloudera Enterprise technology stack". http://t.co/8F3qoD4ilz

— Matt Aslett (@maslett) November 12, 2013

Introducing Heroku Postgres 2.0 http://t.co/l1hLxXfzPy

— Matt Aslett (@maslett) November 12, 2013

WANdisco updates Non-Stop Hadoop for Hortonworks http://t.co/WxOViBebJV

— Matt Aslett (@maslett) November 12, 2013

Actian OEMs Attivio's Active Intelligence Engine for integration with the ParAccel Big Data Analytics Platform. http://t.co/ENYIi8SG8j

— Matt Aslett (@maslett) November 13, 2013

GenieDB launches online management console for geo-distributed MySQL databases. http://t.co/OF8ctsGbmv

— Matt Aslett (@maslett) November 12, 2013

Apache Drill reaches first milestone release http://t.co/pozgmTpM1j

— Matt Aslett (@maslett) November 11, 2013

And that’s the data day, today.

The Data Day, A few days: August 1-7 2013

August 7th, 2013 — Data management

MySQL, NoSQL, NewSQL, DBaaS market sizing. And more

Free presentation (reg reqd) explaining 451 Research's #MySQL #NoSQL #NewSQL #DBaaS revenue estimates for 2012-2016 http://t.co/eaPVv4ljQw

— Matt Aslett (@maslett) August 5, 2013

For 451 Research clients: Cloudera ups the ante with expanded Hadoop portfolio, new CEO http://t.co/AE0pVWPK5W

— Matt Aslett (@maslett) August 6, 2013

For 451 Research clients: GenieDB repositions as multi-cloud database as a service following 2.0 release http://t.co/7QxO9rZBjD

— Matt Aslett (@maslett) August 7, 2013

For 451 Research clients: Revelytix Looms large for Hadoop-based data set management http://t.co/s630e1LgRY

— Matt Aslett (@maslett) August 2, 2013

For 451 Research clients: Buoyed by $10m funding, GridGain launches in-memory streaming and Hadoop accelerator http://t.co/HSdENsBXfS

— Matt Aslett (@maslett) August 5, 2013

For 451 clients: Panorama brings in-memory collaborative visual analysis to Microsoft stack, iPad http://t.co/ItQ9iw06kc By Krishna Roy

— Matt Aslett (@maslett) August 1, 2013

For 451 Research clients: I Spry with my little eye… a new Hadoop professional services entrant http://t.co/NyjujDJFhz By @drkatyring

— Matt Aslett (@maslett) August 6, 2013

Teradata reports net income of $108m on revenue up 1% to $670m in Q2. http://t.co/EQGV6EW2HD

— Matt Aslett (@maslett) August 1, 2013

Tidemark has raised $13m in new venture financing, led by Tenaya Capital. http://t.co/Aji5ULFHlX

— Matt Aslett (@maslett) August 1, 2013

Infochimps has been acquired by CSC http://t.co/JOSrWuK55m

— Matt Aslett (@maslett) August 7, 2013

VividCortex raises $2m for MySQL monitoring and analysis tools. http://t.co/c97G6ZTLf9

— Matt Aslett (@maslett) August 7, 2013

GenieDB Launches Globally Distributed MySQL-as-a-Service http://t.co/7C3m0olnqp

— Matt Aslett (@maslett) August 5, 2013

NuoDB updates NewSQL database, previews upcoming second generation features http://t.co/n8XYhCquet

— Matt Aslett (@maslett) August 6, 2013

LexisNexis Risk Solutions announced the availability of version 4.0 of HPCC Systems. http://t.co/tImPZOqEbO

— Matt Aslett (@maslett) August 1, 2013

Pentaho partners with Splunk, launches Pentaho Business Analytics for Splunk Enterprise. http://t.co/pDwKkJgvyO

— Matt Aslett (@maslett) August 7, 2013

Hortonworks confirms the departure of CTO and founding CEO Eric Baldeschwieler. http://t.co/p6THsRDjuN

— Matt Aslett (@maslett) August 7, 2013

The Data Day, A few days: June 11-25 2013

June 25th, 2013 — Data management

A bumper round-up of the past 14 days’ data-related news

For 451 Research clients: Sqrrl Data launches platform for developing secure 'big data' apps http://t.co/s2o9caqnmC

— Matt Aslett (@maslett) June 25, 2013

For 451 Research clients: Progress suddenly sells its flagship Apama CEP platform to Software AG http://t.co/4qBVaYeiN8 By @CarlLehmann1

— Matt Aslett (@maslett) June 25, 2013

For 451 Research clients: Datawatch buys Panopticon to shore up 'big data' play http://t.co/jme7Y77o5V By Krishna Roy

— Matt Aslett (@maslett) June 25, 2013

For 451 clients: 28msec emerges from stealth to enable real-time querying of any data source http://t.co/M9ZRrJwFF9 By Krishna Roy

— Matt Aslett (@maslett) June 25, 2013

For 451 Research clients: Heroku adds JavaScript support to PostgreSQL database as a service http://t.co/wTdvHx2fHb

— Matt Aslett (@maslett) June 25, 2013

For 451 Research clients: BI by the hour: Jaspersoft sheds light on its utility pricing model http://t.co/eGvAji4Qtr By Krishna Roy

— Matt Aslett (@maslett) June 25, 2013

For 451 Research clients: Continuum Analytics seeks to make Python synonymous with advanced analysis http://t.co/cLKzQbKN1k By Krishna Roy

— Matt Aslett (@maslett) June 25, 2013

For 451 Research clients: Birst opens up BI stack, embraces Redshift data warehouse service http://t.co/OLaL7Rz1K3 By Krishna Roy

— Matt Aslett (@maslett) June 25, 2013

For 451 Research clients: Starview looks to Orion to make real-time IT monitoring shine http://t.co/RezlMUQajr

— Matt Aslett (@maslett) June 25, 2013

* Cisco announced its intention to acquire Composite Software.

* Software AG acquired Apama.

* TIBCO Software acquired StreamBase Systems.

* Cloudera appointed Tom Reilly as Chief Executive Officer and Mike Olson as Chief Strategy Officer and Chairman of the Board.

* Sears Holdings named Jeff Balagna Chief Executive Officer of MetaScale.

* Ex-Yahoo CTO launched Altiscale, hardcore Hadoop as a service.

* SpaceCurve raised a $10M Series B round of financing.

* Sqrrl announced general availability of Sqrrl Enterprise.

* GE launched Predictivity services, supported by Proficy Historian HD.

* Datameer announced Datameer 3.0.

* Oracle announced the general availability of MySQL Cluster 7.3.

* MemSQL announced the upcoming availability of MemSQL 2.1.

* Continuuity announced the release of Weave, a new open source project that enables Java developers to rapidly build scalable, distributed applications on YARN.

* RainStor adds security, text search features to database complement for Hadoop.

* Composite Software introduced version 6.2 SP3 of its Composite Data Virtualization Platform.

* TokuDB launched TokuMX.

* Terracotta announced the immediate availability of Terracotta Universal Messaging.

* HP united its data management assets under HAVEn brand.

* Hortonworks and Red Hat announced an engineering collaboration around Hadoop.

* Rackspace Hosting’s ObjectRocket Database as a Service entered into a strategic agreement with 10gen.

* Simon Phipps posted State Of The Sea Lion – June 2013.

* Netflix announced that its Genie Hadoop-aaS management software is now open source.

* Storm-YARN released as open source.

* Big Data arrived at the Oxford English Dictionary.

And that’s the data day, today.

The Data Day, A few days: April 22-26 2013

April 26th, 2013 — Data management

Pivotal launches. SkySQL and Monty Program merge. And much, much more

Our report on the changes in the MySQL ecosystem is now available for 451 clients and non-clients alike at bit.ly/451mysql

— Matt Aslett (@maslett) April 25, 2013

For 451 Research clients: VMware expands Serengeti’s horizons with updated Hadoop virtualization project bit.ly/17muQFI

— Matt Aslett (@maslett) April 26, 2013

For 451 Research clients: SkySQL, Monty Program merge to support MariaDB following formation of MariaDB Foundation bit.ly/10dsdjf

— Matt Aslett (@maslett) April 24, 2013

For 451 Research clients: With two funding rounds and an acquisition, Guavus expands its addressable market bit.ly/11Fd4LH

— Matt Aslett (@maslett) April 25, 2013

For 451 Research clients: Pentaho snags Webdetails to get more visual and business user-friendly bit.ly/13VDxIW

— Matt Aslett (@maslett) April 23, 2013

For 451 Research clients: Digital Infrastructure: what it is and why you need a DI strategy bit.ly/YFI9M2 By the 451 collective

— Matt Aslett (@maslett) April 22, 2013

Pivotal announced that GE plans to make a $105m strategic investment, representing a 10% equity stake. bit.ly/10dstP6

— Matt Aslett (@maslett) April 24, 2013

Actian acquires ParAccel. bit.ly/11Fchu8

— Matt Aslett (@maslett) April 25, 2013

SAP HANA contributed €86 million to SAP’s software revenue in Q1. bit.ly/12BljcG

— Matt Aslett (@maslett) April 22, 2013

Informatica reports Q1 revenue up 9% to $214.3m. bit.ly/ZQ0o1B

— Matt Aslett (@maslett) April 25, 2013

QlikTech reports Q1 revenue up 22% to $96.5m. bit.ly/15WgOfU

— Matt Aslett (@maslett) April 26, 2013

SkySQL today announced that it has signed a merger agreement with Monty Program Ab bit.ly/17g6ygy

— Matt Aslett (@maslett) April 23, 2013

Tokutek goes open source, making the source code for TokuDB v7 freely available under the GPLv2. bit.ly/12BkOPP

— Matt Aslett (@maslett) April 22, 2013

Continuent Tungsten Replicator Is Now 100% Open Source bit.ly/ZI9OMy

— Matt Aslett (@maslett) April 22, 2013

Press Release: @qubole Closes Series A Funding, Reaches Half Petabyte of Data Processed. Read more here: qubole.com/press-releases

— qubole (@qubole) April 23, 2013

GenieDB has announced GenieDB Enterprise 2.0, including compatibility With MySQL 5.6. mwne.ws/10dsyST

— Matt Aslett (@maslett) April 24, 2013

Continuent Announces New Continuent Tungsten 2.0 bit.ly/ZKoNp3

— Matt Aslett (@maslett) April 23, 2013

MemSQL has announced the GA of the distributed version of its in-memory database and real-time analytics platform bit.ly/ZKnZAI

— Matt Aslett (@maslett) April 23, 2013

Pentaho Acquires Dashboard and UI Specialist Partner Webdetails bit.ly/12BknoW

— Matt Aslett (@maslett) April 22, 2013

Announcing the MySQL Applier for Apache Hadoop bit.ly/ZMUm0K

— Matt Aslett (@maslett) April 23, 2013

Revolution Analytics launches Revolution R Enterprise 6.2. bit.ly/10dsmTN

— Matt Aslett (@maslett) April 24, 2013

Clustrix launches Clustrix 5.0 on Amazon Web Services. mwne.ws/10dsgLX

— Matt Aslett (@maslett) April 24, 2013

Cloud databases, or database on the cloud?

January 15th, 2013 — Data management

As 2012 came to a close I tweeted:

Major talking point for 2013: the difference between databases in the cloud, and cloud databases

— Matt Aslett (@maslett) December 21, 2012

NuoDB has today kicked off that debate with the launch of its Cloud Data Management System and 12 rules for a 21st century cloud database.

NuoDB’s 12 rules appear pretty sound to me – in fact you could argue they are somewhat obvious. This is actually to NuoDB’s credit in my opinion, in that they haven’t simply listed 12 differentiating aspects of their product, but 12 broader requirements.

Either way, I believe that this is the right time to be debating what constitutes a “cloud database”. Databases on the cloud are nothing new, but these are existing relational database products configured to run on the cloud.

In other words, they are databases on the cloud, not databases of the cloud. There is a significant difference between spinning up a relational database in a VMI on the cloud versus deploying a database designed to take advantage of, enable, and be part of, the cloud.

To me, a true cloud database would be one designed to take advantage of and enable elastic, distributed architecture. NuoDB is one of those, but it won’t be the only one. Many NoSQL databases could also make a claim, albeit not for SQL and ACID workloads.

This isn’t a matter of SQL versus NoSQL, however. We’ve seen companies building their own next-generation database platforms deploying NoSQL and SQL technologies alongside each other for different workload and consistency requirements. Where the SQL layer falls down is the inability of existing relational databases to support elastic, geographically distributed cloud environments.

NuoDB believes it has a solution to that. So too do others including GenieDB, Translattice and VMware. Meanwhile Google’s F1 and Spanner projects have legitimized the concept of the globally-distributed SQL database.

Either way, the era of the relational cloud database – rather than the relational database on the cloud – has begun.

The Data Day, Two days: December 18/19 2012

December 19th, 2012 — Data management

GenieDB, Qubole, EdgeSpring, CouchDB, and more

For 451 Research clients: GenieDB launches first version of globally distributed relational database bit.ly/TXiX15

— Matt Aslett (@maslett) December 18, 2012

For 451 Research clients: Qubole prepares ‘big data’ PaaS for developers and data scientists bit.ly/TYHqTz

— Matt Aslett (@maslett) December 19, 2012

For 451 clients: EdgeSpring exits stealth mode touting visual BI stack with ‘local file structure’ twist bit.ly/TYHrqQ By Krishna Roy

— Matt Aslett (@maslett) December 19, 2012

NoSQL LinkedIn Skills Index – December 2012 bit.ly/RC06NC NoSQL is growing, but some are growing faster than others.

— Matt Aslett (@maslett) December 18, 2012

New blog post: CouchDB – sink or swim? bit.ly/T2Gzmp

— Matt Aslett (@maslett) December 17, 2012

Appfluent Technology raises $4.4m Series AA venture capital for data warehouse management software. mwne.ws/TYHAu8

— Matt Aslett (@maslett) December 19, 2012

Congress moves to support Apache #Accumulo… a nice win for #sqrrl ow.ly/gerdX

— sqrrl (@sqrrl_inc) December 19, 2012

The @vmware spinoff is probably a good thing for developers, says @451research‘s @maslett, now on @adtmag ow.ly/gaBhU.

— John K. Waters (@johnkwaters) December 17, 2012

And that’s the Data Day, today.

The Data Day, Two days: December 6/7 2012

December 7th, 2012 — Data management

Cloudera raises $65m. HP launches Hadoop AppSystem. And more

For 451 Research clients: HP launches Hadoop AppSystem to lower adoption complexity bit.ly/VNoadz

— Matt Aslett (@maslett) December 7, 2012

For 451 Research clients: BitYota launches cloud-based data warehouse as a service bit.ly/TLswAl

— Matt Aslett (@maslett) December 6, 2012

For 451 clients: Talend expands ‘big data’ integration and quality horizons… bit.ly/VNoeKz By Krishna Roy and Carl Lehmann

— Matt Aslett (@maslett) December 7, 2012

Cloudera Raises $65M to Accelerate Enterprise Growth bit.ly/VzKdbx #Hadoop #BigData

— Cloudera (@cloudera) December 6, 2012

Zettaset filed with the SEC for $4.75 million in new funding. bit.ly/VNpdu7

— Matt Aslett (@maslett) December 7, 2012

GenieDB launches version 1.0 of MySQL-compatible distributed relational database. bit.ly/VpwsGn

— Matt Aslett (@maslett) December 6, 2012

GoodData is integrating CloverETL’s data transformation into GoodData CloudConnect. bit.ly/VNpQUz

— Matt Aslett (@maslett) December 7, 2012

And that’s the Data Day, today.

What we talk about when we talk about NewSQL

April 6th, 2011 — Data management

Yesterday The 451 Group published a report asking “How will the database incumbents respond to NoSQL and NewSQL?”

That prompted the pertinent question, “What do you mean by ‘NewSQL’?”

Since we are about to publish a report describing our view of the emerging database landscape, including NoSQL, NewSQL and beyond (now available), it probably is a good time to define what we mean by NewSQL (I haven’t mentioned the various NoSQL projects in this post, but they are covered extensively in the report. More on them another day).

“NewSQL” is our shorthand for the various new scalable/high performance SQL database vendors. We have previously referred to these products as ‘ScalableSQL’ to differentiate them from the incumbent relational database products. Since that term implies horizontal scalability, which is not necessarily a feature of all the products, we adopted the term ‘NewSQL’ in the new report.

And to clarify, like NoSQL, NewSQL is not to be taken too literally: the new thing about the NewSQL vendors is the vendor, not the SQL.

So who would we consider to be the NewSQL vendors? Like NoSQL, NewSQL is used to describe a loosely-affiliated group of companies (ScaleBase has done a good job of identifying some of the several NewSQL sub-types), but what they have in common is the development of new relational database products and services designed to bring the benefits of the relational model to distributed architectures, or to improve the performance of relational databases to the extent that horizontal scalability is no longer a necessity.

In the first group we would include (in no particular order) Clustrix, GenieDB, ScalArc, Schooner, VoltDB, RethinkDB, ScaleDB, Akiban, CodeFutures, ScaleBase, Translattice, and NimbusDB, as well as Drizzle, MySQL Cluster with NDB, and MySQL with HandlerSocket. The latter group includes Tokutek and JustOne DB. The associated “NewSQL-as-a-service” category includes Amazon Relational Database Service, Microsoft SQL Azure, Xeround, Database.com and FathomDB.

(Links provide access to 451 Group coverage for clients. Non-clients can also apply for trial access).

Clearly there is the potential for overlap with NoSQL. It remains to be seen whether RethinkDB will be delivered as a NoSQL key value store for memcached or a “NewSQL” storage engine for MySQL, for example. While at least one of the vendors listed above is planning to enable the use of its database as a schema-less store, we also expect to see support for SQL queries added to some NoSQL databases. We are also sure that Citrusleaf won’t be the last NoSQL vendor to claim support for ACID transactions.

NewSQL is not about attempting to re-define the database market using our own term, but it is useful to broadly categorize the various emerging database products at this particular point in time.

Another clarification: ReadWriteWeb has picked up on this post and reported on the “NewSQL Movement”. I don’t think there is a movement in the sense that we saw the various NoSQL projects/vendors come together under the NoSQL umbrella with a common purpose. Perhaps the NewSQL players will do so (VoltDB and NimbusDB have reacted positively to the term, and Tokutek has become the first that I am aware of to explicitly describe its technology as NewSQL). As Derek Stainer notes, however: “In the end it’s just a name, a way to categorize a group of similar solutions.”

In the meantime, we have already noted the beginning of the end of NoSQL, and the lines are blurring to the point where we expect the terms NoSQL and NewSQL will become irrelevant as the focus turns to specific use cases.

The identification of specific adoption drivers and use cases is the focus of our forthcoming long-form report on NoSQL, NewSQL and beyond, from which the 451 Group report cited above is excerpted.

The report contains an overview of the roots of NoSQL and profiles of the major NoSQL projects and vendors, as well as analysis of the drivers behind the development and adoption of NoSQL and NewSQL databases, the evolving role of data grid technologies, and associated use cases.

It will be available very soon from the Information Management and CAOS practices and we will also publish more details of the key drivers as we see them and our view of the current database landscape here.

Scalable SQL: more than the mullet of the database world?

August 11th, 2010 — Data management

In the first part of our coverage on emerging database products and vendors we examined the new NoSQL databases and suggested that the incumbent database vendors would likely respond to the growing threat with a mix of in-memory and distributed caching technologies.

That is yet to happen, although it has only been a few months and the NoSQL databases have generated more noise than revenue at this stage, but in the meantime a new set of database vendors and products have emerged that could pose a more direct threat to the database incumbents while thwarting the potential of the NoSQL upstarts.

For want of a better phrase we have taken to referring to these products collectively as scalable SQL databases, and have just published a new spotlight report pulling together our various reports on the runners and riders.

Some of the vendors promise to deliver the scalability and flexibility promised by NoSQL while retaining the support for SQL queries and/or ACID (atomicity, consistency, isolation, durability). That is not an insignificant boast and it will be tough to offer the best of both worlds.

“SQL For Business, NoSQL For Partay!” is the explanation offered by MulletDB, a project that promises scalability and SQL queries. The danger is that scalable SQL ends up being the database equivalent of the celebrated mullet hairstyle or its business attire equivalent: the jacket and jeans.

One of the companies trying to avoid that problem is GenieDB (coverage). The London-based company’s GenieDB Engine is a fully replicated distributed database that combines a key-value store database with a ‘sharded’ memcached layer. Another example is Clustrix, which was founded in December 2006 to develop a new database appliance that would offer both scalability and durability in a single product.

Meanwhile VoltDB emerged earlier this summer with a transactional database management system that is designed to scale across clusters of industry-standard servers while retaining transactional integrity.

Additionally, Xeround has recently confirmed its intention to reposition its Intelligent Data Grid (IDG) technology as Xeround Data Service, a scalable SQL database with support for ACID-compliant transactional capabilities for cloud computing environments, while New Technology/enterprise’s CloudTran is designed to bring enterprise-level transaction management to GigaSpaces’ XAP in-memory data grid for on-premises deployment, and eventually any PaaS offering.

Meanwhile we are intrigued by VMware’s acquisition of distributed data management vendor GemStone and its positioning of GemFire as a next-generation data management layer for cloud applications, as well as the forthcoming introduction of SQL querying in GigaSpaces’ eXtreme Application Platform (XAP), which will enable in-memory management of relational data and initiatives.

It is very early stages for all these vendors, and they have yet to prove that they have truly solved the problem of consistency and partition tolerance. In the meantime there are plenty of other contenders waiting in line.

Akiban is promising that it has the secret to SQL scalability with an approach that pre-groups data in order to overcome latency, caching and data distribution issues. Another company currently in stealth mode is JustOne Database which is working on perfecting a new storage model in order to deliver the performance and scalability required to support transactions and analytics on the same data simultaneously.

That is also the goal of Tokutek, whose TokuDB MySQL storage engine is based on Fractal Tree indexing technology designed to reduce data-insertion times and improve the performance of MySQL for both read and write applications.

JustOne and Tokutek are part of a slightly different set of vendors we are viewing under the scalable SQL umbrella: those that promise to improve performance for appropriate workloads to the extent that the advanced scale-out capabilities promised by some NoSQL databases become irrelevant.

While we’re on the subject of existing database vendors that could be considered part of the scalable SQL set, it is also worth mentioning MarkLogic. The company has recently been associating itself with NoSQL, and while the fact that it does not support SQL makes it a better literal fit with NoSQL, the company’s support for ACID means that we would see it as an option for customers looking to improve performance without losing consistency, especially for unstructured or semi-structured data.*

As we previously noted, to some degree the rise of NoSQL has resulted from the inability of the MySQL database to scale consistently. It is therefore no surprise to see many of the scalable SQL vendors promising to improve the performance and scalability of MySQL, while others promote a clean-slate approach to address new big data management problems.

We have more details on each of the products and projects mentioned above (as well as some not mentioned), their potential use cases, how they relate to MySQL, and what potential impact they may have on the adoption of NoSQL technologies in the full report.

This is very much the start of our coverage of these vendors, however. Expect more coverage in the near future, as well as a wider perspective on the potential for alternatives to the incumbent database suppliers, into 2011.

*Additionally, since the absence of SQL is only really tangential to many of the projects and products referred to as NoSQL, it seems to me to be appropriate to have a database that does not support SQL in the scalable SQL category.
