Entries from August 2012 ↓

What’s ‘big’ got to do with it?

August 15th, 2012 — Data management

Jaspersoft has released the results of its latest Big Data Survey and was good enough to share with us a few additional details. It makes for interesting reading.

The first thing to take into account is the sample bias. The survey was conducted with over 600 Jaspersoft community members. 63% of respondents are application developers, and 37% are in the software and Internet industry.

This already speaks volumes about the sectors with interest in big data, and it is interesting to compare the state of big data adoption with the recent results of 451 Research’s TheInfoPro storage study, which is conducted with storage professionals.

According to that study, 24% of storage respondents had already implemented solutions for big data, while 56% had no plans. As you might expect, Jaspersoft’s sample was keener, with 36% having a big data project already deployed or in development, and 38% with no plans.

That’s still a good proportion of respondents with no plans to adopt a big data analytics project, however, with the biggest reasons not to adopt being a reliance on structured data (37%) and no clear understanding of what ‘big data’ is (35%).

Sceptics might suggest that the respondents to Jaspersoft’s survey that do have plans for big data are also somewhat confused about what constitutes a big data project.

Certainly they are using some fairly traditional technologies and approaches. Looking at the most popular answers to a range of questions, we find that those with big data plans are:

creating reports (76%)
to analyze customer experience (48%)
based on data from enterprise applications (79%)
stored on relational databases (60%)
processed using ETL (59%)
running on-premises (60%)

So far, so what. The characteristics above could be used to describe many existing business intelligence projects.

It’s not even as if respondents are looking at huge volumes of data, with 38% expecting total data volume to be in the gigabytes, 40% expecting terabytes, and just 10% expecting petabytes and above.

So what makes these big data projects? It’s not until you look at the source of the data that you get any sense that the respondents with ongoing big data projects are doing anything different from those without: 68% are using machine-generated content (web logs, sensor data) as a source for their big data projects, and 46% are using human-generated text (social media, blogs).

The results do suggest that some non-traditional analytics and data processing approaches are gaining ground, with 64% citing the importance of data visualization, 54% statistical/predictive analytics, 50% search, and 45% text analytics. However, just 18% are using Hadoop HDFS at this point (behind MongoDB with 19%).


The Data Day, Two days: August 13/14 2012

August 14th, 2012 — Data management

Datomic calls time on RDBMS. Actian offers $154m for Pervasive. And more.

For 451 Research clients: Metadata Partners calls time on traditional transactional databases with Datomic bit.ly/PfLFff

— Matt Aslett (@maslett) August 14, 2012

For 451 Research clients: Cray’s YarcData claims early success for graph database appliance bit.ly/RNgedb

— Matt Aslett (@maslett) August 13, 2012

Actian Corporation Proposes to Acquire Pervasive Software for $8.50 per Share in Cash bit.ly/MXPmkl

— Matt Aslett (@maslett) August 13, 2012

Symantec partners with Hortonworks for Hadoop/CFS combo. bit.ly/Pg8bDk

— Matt Aslett (@maslett) August 14, 2012

Real-Time Big Data Startup ParStream Raises $5.6m prn.to/N0vNIb

— Matt Aslett (@maslett) August 14, 2012

Akiban Server as a “Rosetta Stone” for JSON and SQL? goo.gl/TXcKX

— Ori (@oriherrnstadt) August 14, 2012

The latest Big Data survey results from @jaspersoft: bit.ly/MXADMh. Surprisingly quick maturing of usage & high project interest.

— Brian Gentile (@BrianG_Jasper) August 14, 2012

And that’s the Data Day, today.


The Data Day, Two days: August 9/10 2012

August 10th, 2012 — Data management

HP’s Autonomy problem. Excel 2013. And more.

For 451 Research clients: HP: How do you solve a problem like Autonomy? bit.ly/Mmcm0P By @alanpelzsharpe

— Matt Aslett (@maslett) August 10, 2012

For 451 Research clients: Microsoft trumpets Excel 2013 as the one and only BI tool bit.ly/MmctJy By Krishna Roy

— Matt Aslett (@maslett) August 10, 2012

For 451 clients: ClearStory looks to ease diverse data access, exploration and analysis needs, at scale bit.ly/PGzpzd By Krishna Roy

— Matt Aslett (@maslett) August 9, 2012

For 451 Research clients: VMware covers more management and analytics terrain with Log Insight purchase bit.ly/MmcoFO By @451wendy

— Matt Aslett (@maslett) August 10, 2012

For 451 Research clients: Attivio lands key partners for search-based analytics bit.ly/MmcwVO

— Matt Aslett (@maslett) August 10, 2012

The distribution of Hadoop and MapReduce skills, according to LinkedIn bit.ly/QlzwR8 Hadoop citations up 144% in 8 months

— Matt Aslett (@maslett) August 10, 2012

For 451 Research clients: 1010data cites continued growth for hosted data warehouse bit.ly/PGztPg

— Matt Aslett (@maslett) August 9, 2012

Big data VC firm Data Collective steps out of the shadows gigaom.com/cloud/big-data… via @derrickharris

— Matt Aslett (@maslett) August 9, 2012

Lucid Imagination Changes Name to LucidWorks prn.to/OXedXr

— Matt Aslett (@maslett) August 9, 2012

And that’s the Data Day, today.


The distribution of Hadoop and MapReduce skills, according to LinkedIn

August 10th, 2012 — Data management

Back in December we published an assessment on the distribution of Hadoop skills, based on LinkedIn search results.

Thanks to a temporary lull in GB’s Olympic medal-winning exploits I ran the search again the other day. The results are pretty interesting for a number of reasons.

First the headline stats:

There are over 22,000 people with Hadoop in their LinkedIn profiles, up from just over 9,000 in December 2011, an increase of over 144% in eight months.
Hadoop skills are becoming more evenly distributed: 60.4% of LinkedIn members with Hadoop in their profiles are based in the US, compared to 64.5% in December 2011.
Growth areas include India (11.9% from 9.7%), China (4.4% from 3.6%), and the UK (3.4% from 3.0%).
However, the Bay Area remains the best place to find Hadoop enthusiasts, with 24.9% (albeit down from 28.2% eight months ago).
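
For anyone wanting to check the arithmetic behind the headline stats, here is a minimal sketch. It assumes the rounded figures of 9,000 and 22,000 profiles (the actual counts were "just over" and "over" those numbers), and the function name is purely illustrative.

```python
def percent_increase(old: float, new: float) -> float:
    """Return the relative growth from old to new, as a percentage."""
    return (new - old) / old * 100.0


# Rounded figures from the post (assumption: the exact counts were slightly higher).
hadoop_dec_2011 = 9_000
hadoop_aug_2012 = 22_000

growth = percent_increase(hadoop_dec_2011, hadoop_aug_2012)
print(f"Hadoop profiles grew by about {growth:.1f}%")

# A change in share is a difference in percentage points, not relative growth.
us_share_change = 60.4 - 64.5
print(f"US share changed by {us_share_change:+.1f} percentage points")
```

Note that the geographic figures are shares of a growing total, so a falling US share can still mean more US-based profiles in absolute terms.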

This time I also ran a similar search for MapReduce skills. The headline results:

MapReduce is mentioned in 6,424 LinkedIn profiles.
MapReduce skills are more evenly distributed: 61.9% of LinkedIn members with MapReduce in their profiles are based in the US, 38.1% in the rest of the world.
That said, over a quarter (25.9%) of LinkedIn members with MapReduce in their profiles are based in the Bay Area.
Other geographic hotspots are India (8.7%), Seattle area (5.8%), and NYC area (4.8%).

This time I also looked at which vendors are listed as the current employers of LinkedIn members citing Hadoop and MapReduce. One really surprising result stood out:

Microsoft is the second largest employer of both Hadoop and MapReduce skills, according to LinkedIn member profiles
Redmond employs 1.7% of all LinkedIn members with Hadoop in their profiles, and 3.0% of members with MapReduce in their profiles
Yahoo is the largest employer of Hadoop skills (2.9%), with other ‘non-vendors’ also well represented: Google (1.3%), eBay (also 1.3%), Amazon (1.2%), LinkedIn (1.1%).
Google is by far the largest employer of MapReduce skills, as you might expect, with 7.1%. Also well represented are Yahoo (2.7%), Amazon (1.8%) and LinkedIn (1.4%).


The Data Day, Today: August 8 2012

August 8th, 2012 — Data management

Who loves Hadoop? Who doesn’t?

Who Loves Hadoop? bit.ly/P53j5t At least we now know who is suggesting that Hadoop might need to be ‘taken out of open source’-O

— Matt Aslett (@maslett) August 8, 2012

Who doesn’t love Hadoop? bit.ly/NENgFH Responding to suggestions that Hadoop would be better off closed.

— Matt Aslett (@maslett) August 8, 2012

For 451 Research clients: LexisNexis cites benefits from open-sourcing HPCC Systems bit.ly/P536Pw

— Matt Aslett (@maslett) August 8, 2012

Apache Hadoop YARN – Background and an Overview… great explanation by @acmurthy shar.es/vw8T4

— Hortonworks (@hortonworks) August 8, 2012

Log Insight has been acquired by VMware bit.ly/O2ort0

— Matt Aslett (@maslett) August 8, 2012

Jaspersoft partners with DataStax to create analytics connector for Apache Cassandra. prn.to/NjevYT

— Matt Aslett (@maslett) August 8, 2012

Actuate and DataStax partner to deliver analytics for Apache Cassandra. bit.ly/Mj7yJw

— Matt Aslett (@maslett) August 8, 2012

Announcing Acunu Reflex with Analytics, v3 acunu.com/2/post/2012/08… #bigdata #cassandra #v3

— Acunu (@Acunu) August 8, 2012

Information. Insight. Instantly: Check Out The Latest Version Of Our Big Data Platform! bit.ly/QCimDb

— infochimps (@infochimps) August 7, 2012

And that’s the Data Day, today.


The Data Day, Two days: August 6/7 2012

August 7th, 2012 — Data management

Hadapt goes GA (quietly). Birst delivers Distributed Business Analytics

For 451 Research clients: Hadapt quietly goes GA with Hadoop/RDBMS combo bit.ly/Q22BB8

— Matt Aslett (@maslett) August 6, 2012

For 451 clients: Birst scores $26m in fourth-round funding, opens the warehouse door to business users bit.ly/Q22GVk By Krishna Roy

— Matt Aslett (@maslett) August 6, 2012

GoGrid Server Images with IBM’s BigInsights and Streams are now available. bit.ly/Q4ce2g

— Matt Aslett (@maslett) August 6, 2012

Ye Shiwen: Can statistics explain her win? bbc.in/NYDW5p

— Alan Pelz-Sharpe (@AlanPelzSharpe) August 7, 2012

Nimbula partners with MapR for private cloud Hadoop offering mwne.ws/MtJTHK

— Matt Aslett (@maslett) August 7, 2012

And that’s the Data Day, today.


The Data Day, Two days: August 2/3 2012

August 3rd, 2012 — Data management

StormDB looks to define database cloud. YARN becomes Hadoop sub-project. And more.

For 451 Research clients: StormDB seeks to define the database cloud with managed database service bit.ly/Me37e9

— Matt Aslett (@maslett) August 2, 2012

YARN (aka MapReduce 2.0/NextGen MapReduce) is now a sub-project of Apache Hadoop. bit.ly/Mnvf4L

— Matt Aslett (@maslett) August 3, 2012

Teradata reports GAAP net income of $112m in Q2 on revenue up 14% to $665m. prn.to/N4YLII

— Matt Aslett (@maslett) August 2, 2012

Actuate reported GAAP net income of $5.6m in Q2 on revenue up 7% to $36.2m. bit.ly/M7VF9d

— Matt Aslett (@maslett) August 3, 2012

Gazzang and DataStax team up for NoSQL data security. bit.ly/Me3d5L

— Matt Aslett (@maslett) August 2, 2012

And that’s the Data Day, today.


The Data Day, Today: August 1 2012

August 1st, 2012 — Data management

Rapid-I integrates with Radoop. How ‘Big Data’ Is Different. And more.

For 451 Research clients: Rapid-I stretches tentacles to US and into ‘big data’ analytics via Radoop bit.ly/Qdkdhy By Krishna Roy

— Matt Aslett (@maslett) August 1, 2012

How ‘Big Data’ Is Different bit.ly/MRgpU5 MIT Sloan Management Review

— Matt Aslett (@maslett) August 1, 2012

“The Big Big Data Freak-Out of 2012” – blog post from our CEO @kky -> bit.ly/Qazvnc #BigData #Hadoop

— Mortar (@mortardata) July 31, 2012

Rackspace launches Cloud Database services as part of OpenStack cloud push. bit.ly/OnU8Jh

— Matt Aslett (@maslett) August 1, 2012

And that’s the Data Day, today.

