Entries from July 2013 ↓

The Data Day, A few days: July 24-31 2013

Next-Gen DB market sizing. Total Data Integration. And more.

And that’s the data day, today.

The Data Day, A few days: July 18-23 2013

DataStax raises $45m. Actian’s post-acquisition binge strategy. And more

And that’s the data day, today.

Forthcoming webinar: How Big Data Fulfils Promises that Data Warehouse Couldn’t

For those that missed the original I’ll be taking part in a webinar on Thursday, July 25th at 1pm ET in association with NGDATA entitled How Big Data Fulfils Promises that Data Warehouse Couldn’t.

Many enterprises were persuaded to adopt enterprise data warehousing (EDW) technology to achieve a ‘single version of the truth’ for enterprise data. In reality, the promises were rarely fulfilled with many stories of failed, lengthy and over budget projects. Even if an EDW projects reached deployment, the warehouse schema is designed to answer a specific set of queries and is inflexible to change and accommodate growing variety of data. Moreover, the data often is outdated in the context of consumer’s real-time world.

Big Data technologies such as Hadoop, HBase and Solr offer next generation data management that can deliver on the promise of “single version of truth” by providing a real-time, 360° view of customers and products.

In this webinar, you will learn:

  • Why the inflexibility of EDWs failed to deliver 360° view
  • How big data technologies can finally make 360° view a reality
  • Overview of an interactive Big Data management solution
  • Best practices and success stories from leading companies

For full details, and to register, click here.

Forthcoming webinar: Building an Analytic Platform

On July 23 at 10:00am PDT, I’ll be taking part in a webinar in association with MemSQL in the subject of building a next generation analytic platform.

To compete in the modern marketplace, your organization must be data-driven. But with the sheer number of data management and analytic technologies appearing on the market, it is difficult to map your needs to the appropriate solutions. One thing is clear – your data warehouse and Hadoop cluster do not suffice for real-time analytics.

In this webinar I’ll be joined by Mark Horton, director of product marketing for MemSQL, to discuss key criteria you should consider when building a next-generation analytic platform. Topics to be covered include:

  • Flexibility and incremental scale-out to size your platform according to your volume and performance needs.
  • Agile responsiveness to act on changes to your business environment faster than traditional data warehouses allowed.
  • Integration with existing tools to leverage previous investments in your next-generation analytic environment.

Attendees will be offered a free 30-day trial of MemSQL’s real-time analytics platform and special one-year introductory pricing if they purchase MemSQL at the end of the trial.

For full details and to register, click here.

The Data Day, A few days: July 11-17 2013

FoundationDB acquires Akiban. Cloudera acquihires Myrrix. And more

And that’s the data day, today.

The Data Day, A few days: July 1-10 2013

Oracle launches Database 12c. Predixion raises $20m. And more.

And that’s the data day, today.

Forthcoming Webinar: Get Down to Serious Business with Hadoop

On Wednesday, July 17, at 11:00am ET / 8:00am PT, I’ll be taking part in a webinar in association with MarkLogic on the subject of Hadoop.

As we’ve stated a few times, we believe that the flexibility of Apache Hadoop is one of its biggest assets – enabling organizations to generate value from data that was previously considered too expensive to be stored and processed in traditional databases – but it also results in “Hadoop” meaning different things to different people.

The result is that organizations still struggle over which Hadoop ecosystem components to adopt in order to obtain the greatest value, which application workloads might be suitable for deployment on Hadoop, and how to deploy Hadoop in conjunction with existing relational and non-relational databases.

On the webinar I’ll be providing an overview of the current state of the Hadoop ecosystem, geographic adoption, use cases, while MarkLogic’s Director of Product Management Justin Makeig to will provide an introduction to complementary technology from MarkLogic that can help your organization achieve real-time analysis, transactional data updates, integrity, granular security, and full-text search.

For full details, and to register, click here.

NoSQL LinkedIn Skills Index – June 2013

Four quarters have now passed since we rebooted our NoSQL LinkedIn Skills Index, based on the number of LinkedIn member profiles mentioning each of the NoSQL projects, giving us a good view of the relative growth of the various NoSQL databases in the past year.


A few interesting statistics to pick out: Cassandra has jumped ahead of Redis for second place, while outside the top ten, shown here, OrientDB climbed above Hypertable and DEX climbed above InfiniteGraph. Looking ahead, expect Riak to overtake MarkLogic in the next three months.

DynamoDB saw the greatest increase in terms of the number of mentions in LinkedIn profiles in the past three months, although it remains in 10th position. In terms of growth, DynamoDB was followed by OrientDB, Neo4j, Apache Accumulo and DEX.

However, MongoDB once again extended its lead as the most popular NoSQL database, according to LinkedIn profile mentions. As the chart below illustrates, it now accounts for 47% of all mentions of NoSQL technologies in LinkedIn profiles, according to our sample, compared with 46% in March.


Of course, we would also note that this is not meant to be a comprehensive analysis, but rather a snapshot of one particular data source.

Another significant data source that can provide a different perspective on the NoSQL market is our market-sizing revenue estimate. Stand-by for an update on our sizing estimates for the NoSQL, NewSQL, MySQL and DBaaS sectors in the coming weeks.