Entries from November 2012 ↓

Weird Science – Darwinian theory and emerging Hadoop vendor business strategies

November 30th, 2012 — Data management

Dan Woods recently opined that Apache Hadoop has had a weird beginning thanks to its “Three Headed Open Core” model and warned that there is a danger than it will fragment – à la Unix – thanks to competing commercial forces.

There are a couple of points to address here. The first is the assumption that the vendor community developing Hadoop is in some way ‘weird’. Not for those of us that have studied the evolution of open source-related business strategies it isn’t.

In fact, Hadoop’s multi-vendor community is a prime example of the corporate-dominated development communities we saw emerging as the fourth stage of commercial open source back in 2010.

Some people still have trouble understanding, as I wrote two years ago, that

being successful is about sharing your code development with the competition via multi-vendor open source projects in order to benefit from improved code quality and lower research and development costs for non-differentiating features AND beating your competition with proprietary complementary technologies.

This isn’t weird. I firmly believe in the not-too-distant future this will be seen as entirely normal.

Another issue to address is the suggestion that these competing vendors pose a danger to the core project. In the blog linked above I argued that the contrary is true: comparing the various competing players in collaborative communities as having a similar impact on the development of a project as various competing factors – climate, habitat, existence or dearth of predators etc – do in Darwin’s evolutionary process: i.e. making it stronger.

I would be much more concerned about the potential fragmentation of Hadoop if we were looking at four or five different competing implementations of Google’s MapReduce and file system research. Instead, you could compare the differentiating features that Cloudera, Hortonworks, MapR, IBM and EMC have introduced to the result of natural selection based on a need to evolve to certain conditions.

So long as there remains a single core Apache Hadoop project upon which these differentiating features are based I believe Hadoop will not only survive, but will thrive. If I may quote myself again: “As long as they continue to collaborate on the non-differentiating code, the project should benefit from being stretched in multiple directions.”

I believe that, as with Linux, the vendors involved have learned the lessons of the Unix wars and understand that it is in their best interests – let alone everyone else’s – not to repeat them.

Another key point when we look at the Hadoop ecosystem is that we see multiple vendors building on others’ differentiating features and often supporting multiple distributions. It’s not a case of a herd of individually differentiated Hadoops, but more like a stack of Russian Hadoop dolls.

To my mind there are (currently) eight main Hadoop business strategies, each of which has the potential to build on those before it:

Hadoop distributors

e.g. Cloudera, Hortonworks, MapR, EMC, IBM

Hadoop cloud services

e.g. Amazon EMR, Google Compute Engine

Hadoop-based deployment services

e.g. Infochimps, Metascale

Hadoop-based deployment stack/appliances

e.g. Zettaset, Oracle BDA, Dell

Hadoop-based development services

e.g. Continuuity, Mortar Data

Hadoop-based application stacks

e.g. NGDATA, Guavus

Hadoop-based database stacks

e.g. Drawn to Scale, Splice Machine

Hadoop-based analytic services

e.g. Treasure Data, Qubole

Comments Off on Weird Science – Darwinian theory and emerging Hadoop vendor business strategies

The Data Day, Two days: November 28/29 2012

November 29th, 2012 — Data management

Amazon and BitYota launch DWaaSes (DWaaSi?) Continuuity’s funding and plans. And more.

For 451 Research clients: Continuuity raises $10m series A for Hadoop development PaaS bit.ly/SvjKan

— Matt Aslett (@maslett) November 29, 2012

For 451 Research clients: Drawn to Scale lives up to its name with early adopters of SQL database for Hadoop bit.ly/UbfbC2

— Matt Aslett (@maslett) November 28, 2012

For 451 Research clients: ‘Big data’ analytics: Let the M&A flurry commence bit.ly/UbfigX With Krishna Roy.

— Matt Aslett (@maslett) November 28, 2012

Amazon Web Services announces Amazon Redshift data warehouse-as-a-service bit.ly/X0II65

— Matt Aslett (@maslett) November 28, 2012

Expanding the Cloud – Announcing Amazon #Redshift, a Petabyte-scale Data Warehouse Service – #aws wv.ly/TtmtDF

— Werner Vogels (@Werner) November 28, 2012

BitYota launches Data Warehouse-as-a-Service bit.ly/SvjW9J

— Matt Aslett (@maslett) November 29, 2012

Kognitio introduces multi-node, analytical capability via Amazon Web Services. bit.ly/RlfP38

— Matt Aslett (@maslett) November 29, 2012

Sumo Logic has closed a $30 million Series C funding round led by Accel Partners. mwne.ws/QLkzyX

— Matt Aslett (@maslett) November 28, 2012

Greenplum Acquires MoreVRP, read the blog post here ow.ly/fGvoC #EMC #greenplum #database

— Greenplum (@Greenplum) November 29, 2012

Mike Lynch’s open letter to the HP board ow.ly/fCOQ2 & HP’s response ow.ly/fCOSI #Autonomy

— Nick Patience (@nickpatience) November 27, 2012

SAS Institute rumoured to have acquired rPath bit.ly/Svk5Kc ????????

— Matt Aslett (@maslett) November 29, 2012

And that’s the Data Day, today.

Comments Off on The Data Day, Two days: November 28/29 2012

Forthcoming webinar: Big Data Best Practices with NGDATA

November 29th, 2012 — Data management

On December 13 at 1pm EDT/10AM PDT I’ll be taking part in a webinar to discuss Big Data Best Practices – Realizing True Business Value from Your Big Data.

Big Data has rapidly become a transformational business trend. Most business leaders understand that not being able to tap into the power of their Big Data could mean losing business to the competition. However, most organizations are not fully aware of how to embrace it.

I’ll discuss how you can overcome these hurdles and tap into your Big Data to transform your business, while Naren Patil, SVP of Product Marketing, NGDATA will provide some real-life examples of successful deployment projects.

To register, click here.

Comments Off on Forthcoming webinar: Big Data Best Practices with NGDATA

The Data Day, A few days: November 22-27 2012

November 27th, 2012 — Data management

Actian acquires Versant. GoodData’s hosted analytics. And more.

For 451 Research clients: Versant leaves UNICOM at the altar, agrees to be acquired by Actian bit.ly/V3C6y6

— Matt Aslett (@maslett) November 27, 2012

For 451 Research clients: GoodData bags series C funding, looks to round out hosted analysis service bit.ly/V3CfRX By Krishna Roy

— Matt Aslett (@maslett) November 27, 2012

In scenes not particularly reminiscent of The Graduate, Versant has left Unicom at the altar and run off with Actian. bit.ly/ULi0YR

— Matt Aslett (@maslett) November 22, 2012

EnterpriseDB launches Postgres Plus Advanced Server 9.2 bit.ly/WVn436 and Postgres Enterprise Manager 3.0. bit.ly/WVn57t

— Matt Aslett (@maslett) November 27, 2012

Exclusive: Inside Google Spanner, the Largest Single Database on Earth | Wired Enterprise | Wired.com wired.com/wiredenterpris…

— Matt Aslett (@maslett) November 27, 2012

And that’s the Data Day, today.

Comments Off on The Data Day, A few days: November 22-27 2012

The Data Day, Today: November 21 2012

November 21st, 2012 — Data management

HP/Automomy fall-out. Behind 10gen’s strategic funding. And more

For 451 Research clients: What’s behind 10gen’s strategic investment from Red Hat and Intel Capital? bit.ly/RTbb9q

— Matt Aslett (@maslett) November 21, 2012

Autonomy Founder Mike Lynch Rejects HP Charges, Alleges Mismanagement dthin.gs/SOoHcv

— Matt Aslett (@maslett) November 21, 2012

Autonomy was reported to Serious Fraud Office (by current 451 research director @alanpelzsharpe) in 2011 ow.ly/ft7GT

— Matt Aslett (@maslett) November 21, 2012

FBI Said to Be Looking Into HP’s Allegations on Autonomy bloom.bg/Y13JOn via @bloombergnews

— Matt Aslett (@maslett) November 21, 2012

And that’s the Data Day, today.

Comments Off on The Data Day, Today: November 21 2012

The Data Day, Two days: November 19/20 2012

November 20th, 2012 — Data management

HP uncovers Autonomy irregularity. Pentaho ups big data commitment. And more.

HP internal investigation uncovers accounting improprieties, misrepresentations and disclosure failures at Autonomy bit.ly/UQXKFy

— Matt Aslett (@maslett) November 20, 2012

For 451 clients: Armed with series C funding and a BI stack refresh, Pentaho ups ‘big data’ commitment bit.ly/QoUpBX By Krishna Roy

— Matt Aslett (@maslett) November 19, 2012

Mortar Data closes $1.8M seed round for Python-wrapped Hadoop gigaom.com/data/mortar-da…

— Matt Aslett (@maslett) November 19, 2012

Amazon adds Big Data category to AWS Marketplace bit.ly/QoUyFu Includes Acunu, MongoDB, Couchbase, MapR, ScaleArc, Karmasphere, HANA

— Matt Aslett (@maslett) November 19, 2012

WANdisco has acquired AltoStor to target Hadoop high availability. bit.ly/S7K1g8

— Matt Aslett (@maslett) November 20, 2012

Treasure Data partners with Indicee for interactive analysis. mwne.ws/Q7YUiV

— Matt Aslett (@maslett) November 20, 2012

And that’s the Data Day, today.

Comments Off on The Data Day, Two days: November 19/20 2012

The Data Day, Two days: November 15/16 2012

November 16th, 2012 — Data management

Jaspersoft gets visual. MemSQL gets distributed. And more.

For 451 clients: Jaspersoft throws hat into visual exploration ring as it continues to go for growth bit.ly/QhNNFp By Krishna Roy

— Matt Aslett (@maslett) November 16, 2012

For 451 Research clients: MemSQL updates in-memory database for distributed deployments and real-time analytics bit.ly/QhNV7L

— Matt Aslett (@maslett) November 16, 2012

For 451 Research clients: Infochimps targets enterprises with stream-processing additions to ‘big data’ PaaS bit.ly/QhNTwN

— Matt Aslett (@maslett) November 16, 2012

LucidWorks Big Data is now GA. prn.to/QhOqyS

— Matt Aslett (@maslett) November 16, 2012

And that’s the Data Day, today.

Comments Off on The Data Day, Two days: November 15/16 2012

The Data Day, Today: November 14 2012

November 14th, 2012 — Data management

Funding for Continuuity and 10gen. Wibi Data launches the Kiji. And more.

For 451 Research clients: IxReveal seeks funding round, highlights uReveal brand and data-harmonization use case bit.ly/PTw7P5

— Matt Aslett (@maslett) November 14, 2012

For 451 clients: Datawatch details semi-structured data analysis strategy and roadmap following Monarch buy bit.ly/PTwclT Krishna Roy

— Matt Aslett (@maslett) November 14, 2012

10gen Announces Strategic Investment from @intelcapital and Red Hat soc.ai/2o9@redhatnews #MongoDB #NoSQL #Database #Intel

— 10gen(@10gen) November 14, 2012

Continuuity raises $10M Series A round to ignite Big Data app development within the #Hadoop ecosystem bit.ly/Qd0kKb

— Continuuity (@Continuuity) November 14, 2012

SAP positions HANA for transaction, analytics, text and predictive processing. prn.to/T2Z4Zu

— Matt Aslett (@maslett) November 14, 2012

Wibi Data launches the Kiji Project: An open source framework for building big data apps with Apache HBase bit.ly/W5VZKi

— Matt Aslett (@maslett) November 14, 2012

Hadapt and MapR partner to enable Hadapt’s Adaptive Analytical Platform to use MapR’s Distribution for Hadoop. bit.ly/PTyzVE

— Matt Aslett (@maslett) November 14, 2012

NuoDB launches release candidate, pricing and licensing for forthcoming elastic database for the cloud. bit.ly/W5Whkl

— Matt Aslett (@maslett) November 14, 2012

Actian positions Vectorwise for large data warehouse environments via OEM agreement with ScaleMP. bit.ly/W7Gmxa

— Matt Aslett (@maslett) November 14, 2012

Socrata plans open source reference implementation of its open data platform. mwne.ws/PTwnO2

— Matt Aslett (@maslett) November 14, 2012

And that’s the Data Day, today.

Comments Off on The Data Day, Today: November 14 2012

The Data Day, Two days: November 12/13 2012

November 13th, 2012 — Data management

Platfora raises $20m. IBM trumpets ‘integration anywhere’. And more

For 451 Research clients: Microsoft previews SQL Server in-memory data processing and Hadoop coexistence bit.ly/TG55su

— Matt Aslett (@maslett) November 13, 2012

For 451 Research clients: IBM trumpets ‘integration anywhere,’ moves into reference data management bit.ly/RShjOM By Krishna Roy

— Matt Aslett (@maslett) November 12, 2012

Platfora raises $20m series B to fund its in-memory BI platform for Hadoop. mwne.ws/SjpmT4

— Matt Aslett (@maslett) November 13, 2012

DataSift Raises $15M To Help Businesses Mine And Analyze Social Data ow.ly/ffzaD via @techcrunch

— DataSift (@DataSift) November 13, 2012

We are introducing Jaspersoft 5 today. Read more about our next-gen platform for data exploration ow.ly/feEtn. #bigdata #BI

— Jaspersoft Corp. (@Jaspersoft) November 13, 2012

RethinkDB reemerges with distributed document database. bit.ly/RShoSK

— Matt Aslett (@maslett) November 12, 2012

Big data visualization startup Zoomdata launches with $1.1m of seed funding. prn.to/ZCE6Sm

— Matt Aslett (@maslett) November 13, 2012

Oracle has made a strategic minority investment in Engine Yard. bit.ly/ZC49cj

— Matt Aslett (@maslett) November 13, 2012

FairCom claims SQL-NoSQL bridge with updated c-treeACE. bit.ly/Zuwa5C

— Matt Aslett (@maslett) November 12, 2012

McObject launches eXtremeDB Financial Edition mwne.ws/ZuvP2Q

— Matt Aslett (@maslett) November 12, 2012

And that’s the Data Day, today.

Comments Off on The Data Day, Two days: November 12/13 2012

The Data Day, Two days: November 8/9 2012

November 9th, 2012 — Data management

Funding for Neo, Elasticsearch and Hadapt. And more

For 451 Research clients: Elasticsearch raises $10m for search-based analytics platform bit.ly/Udk9wg

— Matt Aslett (@maslett) November 9, 2012

For 451 Research clients: IBM trumpets ‘integration anywhere,’ moves into reference data management bit.ly/RShjOM By Krishna Roy

— Matt Aslett (@maslett) November 12, 2012

For 451 Research clients: Neo Technology raises $11m series B to fund graph database adoption push bit.ly/VI3h6N

— Matt Aslett (@maslett) November 8, 2012

For 451 Research clients: Birst gets big into ‘big data’ with BI service in the cloud for Hadoop bit.ly/Udkdfk By Krishna Roy

— Matt Aslett (@maslett) November 9, 2012

Database startup Hadapt reports $6.7M fundraise bizjournals.com/boston/blog/st… via @bbjnewsroom

— Matt Aslett (@maslett) November 8, 2012

Elasticsearch raises $10m series A funding led by Benchmark along with Rod Johnson and Data Collective. bit.ly/VI3JSw

— Matt Aslett (@maslett) November 8, 2012

Revolution Analytics updates Revolution R Enterprise with big data decision trees and predictive analytics on Hadoop. bit.ly/VI3uXE

— Matt Aslett (@maslett) November 8, 2012

Tableau Software offers native Google BigQuery connector. bit.ly/UdkmPZ

— Matt Aslett (@maslett) November 9, 2012

Google updates Google Cloud SQL for better performance and more storage. Adds free trial bit.ly/SU3rG0

— Matt Aslett (@maslett) November 8, 2012

Facebook open sources Corona — a better way to do webscale Hadoop gigaom.com/data/facebook-…

— Derrick Harris (@derrickharris) November 8, 2012

And that’s the Data Day, today.

Comments Off on The Data Day, Two days: November 8/9 2012

Entries from November 2012 ↓

Weird Science – Darwinian theory and emerging Hadoop vendor business strategies

The Data Day, Two days: November 28/29 2012

Forthcoming webinar: Big Data Best Practices with NGDATA

The Data Day, A few days: November 22-27 2012

The Data Day, Today: November 21 2012

The Data Day, Two days: November 19/20 2012

The Data Day, Two days: November 15/16 2012

The Data Day, Today: November 14 2012

The Data Day, Two days: November 12/13 2012

The Data Day, Two days: November 8/9 2012

Search

Twitter: maslett

Categories

451 Group blogroll

Recent Posts

Subscribe via Email

Archives

Entries from November 2012 ↓

Search

Tags

Twitter: maslett

Categories

451 Group blogroll

Recent Posts

Subscribe via Email

Archives