The Data Day, A few days: August 8-14 2013

DBaaS drives next-generation database growth. CSC acquires Infochimps. And more.

And that’s the Data Day, today.

The Data Day, Two days: November 12/13 2012

Platfora raises $20m. IBM trumpets ‘integration anywhere’. And more.

And that’s the Data Day, today.

The Data Day, Two days: November 6/7 2012

Microsoft launches Hekaton, PolyBase. Appcelerator acquires Nodeable. And more.

And that’s the Data Day, today.

The Data Day, The week that was: October 22-26 2012

Cloudera launches Impala. Actuate snags Quiterian. Microsoft previews HDInsight.

And the rest:
– Microsoft previewed its Windows Azure HDInsight Service and Microsoft HDInsight Server for Windows.

– SAP launched a new “big data” bundle and go-to-market strategy.

– Informatica introduced Informatica PowerCenter Big Data Edition and reported its third quarter results.

– Also announcing financial results last week were QlikTech and Pervasive.

– Teradata updated its Unity suite with the addition of Unity Loader, and introduced its Unified Data Environment and the Unified Data Architecture.

– Splunk confirmed the release of Splunk Hadoop Connect and the Splunk App for HadoopOps.

– 10gen added five vice presidents to its management team.

– Rackspace partnered with Hortonworks to create OpenStack and Hadoop-based offerings for public and private cloud.

– Talend added support for Cassandra, HBase and MongoDB, and introduced big data profiling for Apache Hadoop to its integration platform.

– MarkLogic announced support for HDFS and expanded its relationship with Hortonworks.

– Kognitio adopted a free licensing model.

– Calpont launched InfiniDB 3.5.

– MetaMarkets announced that it is open sourcing its Druid streaming, real-time data store.

– YarcData updated its uRiKA Big Data appliance for graph analytics.

– Alpine Data Labs announced a global OEM partnership with QlikTech.

– Actian and Attunity announced Attunity Replicate for Actian Vectorwise.

And that’s the Data Day, today.

The Data Day, Two days: August 9/10 2012

HP’s Autonomy problem. Excel 2013. And more.

And that’s the Data Day, today.

The distribution of Hadoop and MapReduce skills, according to LinkedIn

Back in December we published an assessment of the distribution of Hadoop skills, based on LinkedIn search results.

Thanks to a temporary lull in GB’s Olympic medal-winning exploits I ran the search again the other day. The results are pretty interesting for a number of reasons.

First the headline stats:

  • There are over 22,000 people with Hadoop in their LinkedIn profiles, up from just over 9,000 in December 2011, an increase of over 144% in eight months.
  • Hadoop skills are becoming more evenly distributed: 60.4% of LinkedIn members with Hadoop in their profiles are based in the US, compared to 64.5% in December 2011.
  • Growth areas include India (11.9% from 9.7%), China (4.4% from 3.6%), and the UK (3.4% from 3.0%).
  • However, the Bay Area remains the best place to find Hadoop enthusiasts, with 24.9% of profiles (albeit down from 28.2% eight months ago).
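
As a rough check on the arithmetic above, here is a minimal Python sketch. It assumes the rounded counts of 9,000 and 22,000 (the exact search totals are not given, so the result is approximate), and the names `percent_increase`, `shares` and `changes` are purely illustrative.

```python
def percent_increase(old: float, new: float) -> float:
    """Relative growth from old to new, expressed as a percentage."""
    return (new - old) / old * 100.0


# Assumption: rounded profile counts taken from the list above.
hadoop_dec_2011 = 9_000
hadoop_aug_2012 = 22_000

growth = percent_increase(hadoop_dec_2011, hadoop_aug_2012)
print(f"Growth in Hadoop profiles: {growth:.1f}%")

# Share of all Hadoop profiles, as (December 2011, August 2012) percentages.
shares = {
    "US": (64.5, 60.4),
    "India": (9.7, 11.9),
    "China": (3.6, 4.4),
    "UK": (3.0, 3.4),
    "Bay Area": (28.2, 24.9),
}

# Change in share, in percentage points (not relative percent).
changes = {region: round(now - then, 1) for region, (then, now) in shares.items()}
for region, delta in changes.items():
    print(f"{region}: {delta:+.1f} percentage points")
```

Note that the share changes are in percentage points: the US and the Bay Area lost share even though the absolute number of profiles grew everywhere.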

This time I also ran a similar search for MapReduce skills. The headline results:

  • MapReduce is mentioned in 6,424 LinkedIn profiles.
  • MapReduce skills are more evenly distributed: 61.9% of LinkedIn members with MapReduce in their profiles are based in the US, 38.1% in the rest of the world.
  • That said, over a quarter (25.9%) of LinkedIn members with MapReduce in their profiles are based in the Bay Area.
  • Other geographic hotspots are India (8.7%), the Seattle area (5.8%), and the NYC area (4.8%).

This time I also looked at which vendors are listed as the current employers of LinkedIn members citing Hadoop and MapReduce. One really surprising result stood out:

  • Microsoft is the second largest employer of both Hadoop and MapReduce skills, according to LinkedIn member profiles.
  • Redmond employs 1.7% of all LinkedIn members with Hadoop in their profiles, and 3.0% of members with MapReduce in their profiles.
  • Yahoo is the largest employer of Hadoop skills (2.9%), with other ‘non-vendors’ also well represented: Google (1.3%), eBay (also 1.3%), Amazon (1.2%), LinkedIn (1.1%).
  • Google is by far the largest employer of MapReduce skills, as you might expect, with 7.1%. Also well represented are Yahoo (2.7%), Amazon (1.8%) and LinkedIn (1.4%).


The Data Day, Today: Apr 19 2012

Splunk goes public. SkySQL and Connotate raise funding. And more.

An occasional series of data-related news, views and links posts on Too Much Information. You can also follow the series @thedataday.

* Splunk Prices Initial Public Offering. 13,500,000 shares at $17.00 per share = $229.5m.

* Connotate Increases Momentum and Closes $7m Series B Round

* SkySQL Raises $4 million in Series A Round

* SAND Technology Announces Exploration of Potential Strategic Alternatives

* GoodData Closes out the Quarter With Increased Revenue Growth and Expanded Market Traction

* World’s Largest Telcos Adopt Graph Databases to Solve Connected Data Issues

* Gazzang Seizes Big Data Opportunity, Announces Record Quarter and Year over Year Growth

* Hadapt Adds Big Data Industry Veteran Christopher Lynch as Chairman of the Board of Directors

* PalominoDB and SkySQL Join Forces to Offer Unparalleled Remote Database Services to Leading Companies Worldwide

* Cloudant Data Layer as a Service Adds Support for Joyent Cloud

* GoGrid Introduces a High-Performance Platform for Predictive Analytics

* MongoDB Hadoop Connector Announced

* StreamBase Releases StreamBase LiveView 1.0

* Pervasive RushAnalyzer and Cloudera Eliminate Barriers to Rapid Hadoop ROI

* Pegasystems Announces Hadoop Big Data Support

* XtremeData Hires Former IBM Analytics Leader

* Lucid Imagination Announces General Availability of LucidWorks Enterprise 2.1

* Of open data and pregnant men

* Is UNQL Dead?

* MySQL in 2012: Report from Percona Live

* For 451 Research clients

# Will new offerings and price cuts encourage greater database-as-a-service adoption? Spotlight report

# Basho expands into cloud storage with Riak CS Impact Report

# SAP modernizes its application stack at the data layer and the mobile front end Impact Report

# QlikTech takes QlikView pricing out of the dark Impact Report

# Kitenga refreshes Hadoop-based content-analysis wares; finds rollouts a slow burn Impact Report

# CoreMedia looks to NoSQL to scale social experiences for its WCM platform Impact Report

# Boundary maps monitoring for ‘big data’ as its path to enterprise Impact Report

# Orchestra to add data quality notes to MDM ensemble as it continues to eye US growth Impact Report

# Columnar database provider SAND Technology puts itself up for sale M&A Insight

# Is it time for Microsoft to ditch partners for performance management and go shopping? Acquirer IQ

And that’s the Data Day, today.

The Data Day, Today: Apr 11 2012

IBM launches Galileo database update. SAP outlines database roadmap. And more.

An occasional series of data-related news, views and links posts on Too Much Information. You can also follow the series @thedataday.

* Made in IBM Labs: New IBM Software Accelerates Decision Making in the Era of Big Data. IBM launches DB2 10 and InfoSphere Warehouse 10.

* SAP Unveils Unified Strategy for Real-Time Data Management to Grow Database Market Leadership

* SAP Unveils Strategy to Gain Predictive Insights From Big Data

* TIBCO Delivers Breakthrough Software to Analyze Big Data in Motion

* TIBCO Announces Intent to Acquire LogLogic

* TIBCO Spotfire and Attivio Partner to Deliver New Levels of Integration and Discovery for Data and Content

* Mortar Data, Hadoop for the Rest of Us, Gets Seed Funding

* The coming in-memory database tipping point. Microsoft’s perspective on in-memory databases.

* Jaspersoft Extends Partnership with Talend to Deliver Big Data Integration

* Oracle to Hold MySQL Connect Conference in San Francisco September 29 and 30, 2012

* Percona XtraDB Cluster Open Source Software Provides a New Approach to High Availability MySQL

* Tokutek Brings Replication Performance to MySQL and MariaDB

* Continuent Announces Tungsten Enterprise 1.5 for Multi-Master, Multi-Region MySQL Data Services in the Amazon EC2

* SkySQL, hastexo Form Highly Available Partnership

* MySQL at Twitter. Twitter releases its MySQL modifications under BSD license.

* Percona Bundles New Relic to Provide Gold and Platinum Support Customers with Comprehensive Application Visibility

* Percona Toolkit 2.1 for MySQL Enables Schema Changes without Scheduling Downtime

* Percona XtraBackup 2.0 for MySQL and Percona Server Provides Increased Performance

* Delphix Expands Agile Data Platform to Support Oracle Exadata

* Red Hat and 10gen Create Compelling Open Source Data Platform

* Announcing Pre-Production MongoDB Subscription from 10gen

* VoltDB Announces Version 2.5

* Red Hat Storage 2.0 Beta: Partners Test Big Data, Hadoop Support

* Sungard wants to sell you Hadoop as a service

* Actian and Lenovo Team to Optimize Big Data and Business Intelligence with New Appliance

* Objectivity Expands European Management Team With Former Sones Founder Mauricio Matthesius

* expressor Expands Data Integration Platform Into Big Data

* The Apache Software Foundation Announces Apache Sqoop as a Top-Level Project

* LucidDB has left Eigenbase, moved to Apache License

* For 451 Research clients

# IBM looks to the stars with Galileo relational database update Impact Report

# Indicee eyes fresh VC as it establishes beachhead for cloud BI service using OEM sales Impact Report

# Percona launches XtraDB Cluster for MySQL database high availability Impact Report

# Tokutek targets replication performance with database update Impact Report

# ‘Big data’ in the datacenter: Vigilent secures $6.7m funding round Impact Report

And that’s the Data Day, today.

The Data Day, Today: Mar 13 2012

Drawn to Scale raises funding. Cloudera launches HBaseCon. And more.

An occasional series of data-related news, views and links posts on Too Much Information. You can also follow the series @thedataday.

* Drawn to Scale Announces Funding for Real-Time Big Data

* Cloudera Announces HBaseCon 2012, the Industry’s First Apache HBase Community Conference

* Gazzang Launches Big Data Encryption and Key Management Platform

* Jaspersoft Closes Record Fiscal Year

* Schooner Information Technology Releases Membrain 4.0

* How Project Mercury is eBay’s Big Data play. Up on the roof of eBay’s big data center.

* SAND Announces Universal Query

* Oracle has a cloud computing secret. The potential impact of metered pricing.

* Why should I consider memcached plugin? …for MySQL.

* For 451 Research clients

# Microsoft launches SQL Server 2012, with an eye on ‘big data’ Impact report

# Global IDs hones governance, MDM focus; looks to the cloud and appliances for growth Impact report

# Clarabridge ups the ante in ‘voice of the customer’ with v5.0 as the CEM space heats up Impact report

# ScaleBase launches elastic load balancing for MySQL databases Market Development report

# Dassault’s Exalead searches for a ‘big data’ role Market Development report

And that’s the Data Day, today.

The Data Day, Today: Mar 8 2012

Microsoft launches SQL Server 2012. MapR integrates with Informatica. And more.

An occasional series of data-related news, views and links posts on Too Much Information. You can also follow the series @thedataday.

* Microsoft Releases SQL Server 2012 to Help Customers Manage “Any Data, Any Size, Anywhere”

* SQL Server 2012 Released to Manufacturing

* SAS Access to Hadoop Links Leading Analytics, Big Data

* MapR And Informatica Announce Joint Support To Deliver High Performance Big Data Integration And Analysis

* Teradata Expands Integrated Analytics Portfolio

* New Teradata Platform Reshapes Business Intelligence Industry

* Microsoft’s Trinity: A graph database with web-scale potential

* KXEN Announces Availability of InfiniteInsight Version 6, a Predictive Analytics Solution with Unprecedented Agility, Productivity, and Ease of Use

* Software AG Announces its Strategy for the In-memory Management of Big Data

* Attunity and Hortonworks Announce Partnership to Simplify Big Data Integration with Apache Hadoop

* Schooner Information Technology and Ispirer Systems Partner to Deliver SQLWays for SchoonerSQL

* Big Data & Search-Based Applications

* Namenode HA Reaches a Major Milestone

* How Twitter is doing its part to democratize big data

* Dropping Prices Again – EC2, RDS, EMR and ElastiCache

* For 451 Research clients

# SAS outlines Hadoop strategy, previews Hadoop-based in-memory analytics Market Development report

# Pervasive rides the elephant into ‘big data’ predictive analytics Market Development report

# IBM makes desktop discovery and analysis play, shares business analytics priorities Market Development report

# Clustrix launches SDK to tap developer interest in new databases Market Development report

# Continuent and SkySQL team up for clustered MySQL support Analyst note

# MapR gets a boost from Cisco and Informatica Analyst note

And that’s the Data Day, today.