Entries from February 2012 ↓
February 29th, 2012 — Data management
February 28th, 2012 — Data management
During my first trip to Oracle OpenWorld as an analyst a few years ago I asked a room full of Oracle data-warehousing users whether any of them had explored the use-cases for other Oracle data management assets, such as the TimesTen in-memory database.
The question was met with complete silence before Ken Jacobs kindly suggested that perhaps this wasn’t the right crowd for that sort of question.
It was one of those moments that really haunts you. My first industry event as an analyst and I had embarrassed myself by asking an apparently stupid question in front of a room of more experienced colleagues and potential clients.
Doesn’t seem like such a stupid question now though, does it?
I have exorcised the demons! This house is clear.
February 24th, 2012 — Data management
February 22nd, 2012 — Data management
In late 2010 I published a post discussing the problems associated with trying to size the ‘big data’ market based on a lack of clarity on the definition of the term and what technologies it applies to.
In that post we discussed a 2010 Bank of America Merrill Lynch report that estimated that ‘big data’ represented a total addressable market worth $64bn. This week Wikibon estimated that the big data market stands at just over $5bn in factory revenue growing to over $50bn by 2017, while Deloitte estimated that industry revenues will likely be in the range of $1-1.5bn this year.
To put that in perspective, Bank of America Merrill Lynch estimated that the total addressable market for ‘big data’ in 2010 was this
Wikibon estimates that the ‘big data’ market in 2012 is this
and Deloite estimates that the ‘big data’ market in 2012 is this
UPDATE – IDC has become the first of the big analyst vendors to break out its big data abacuses (abaci?). IDC thinks the ‘big data’ market in 2010 was $3.2bn. That’s this
Not surprisingly they came to their numbers by different means. BoA added up market estimates for database software, storage and servers for databases, BI and analytics software, data integration, master data management, text analytics, database-related cloud revenue, complex event processing and NoSQL databases.
Wikibon came to its estimate by adding up revenue associated with a select group of technologies and a select group of vendors, while Deloitte added up revenue estimates for database, ERP and BI software, reduced the total by 90% to reflect the proportion of data warehouses with more than five terabytes of data, and reduced that total by 80-85% to reflect the low level of current adoption.
IDC, meanwhile, went through a slightly tortuous route of defining the market based on the volume of data collected, OR deployments of ultra-high-speed messaging technology, OR rapidly growing data sets, AND the use of scale-out architecture, AND the use of two or more data types OR high-speed data sources.
There is something to be said for each of these definitions. But equally each can be easily dismissed. We previously described our issues with the all-inclusive nature of the BoA numbers, and while we find Wikibon’s process much more agreeable, some of the individual numbers they have come up with are highly questionable. Deloitte’s methodology is surreal, but defensible. IDC’s just illustrates the problem:
What this highlights is that the essential problem is the lack of definition for ‘big data’. As we stated in 2010: “The biggest problem with ‘big data’… is that the term has not been – and arguably cannot be – defined in any measurable way. How big is the ‘big data’ market? You may as well ask ‘how long is a piece of string?'”
February 17th, 2012 — Data management
February 14th, 2012 — Data management
February 9th, 2012 — Data management
Next week – February 15, 2012, at 10am PT to be precise – I’ll be taking part in a webinar with Clustrix to discuss scaling big database applications with NewSQL databases.
I’ll be providing an overview of the origins of the term NewSQL, its adoption, and a discussion of the core technologies that we see fitting into the category, as well as an explanation of the key considerations for choosing between NoSQL and NewSQL databases.
Other webinar participants include Robin Purohit, Clustrix president and CEO; and Aaron Passey, Clustrix CTO, and there will also be a presentation by Massive Media, which recently announced that it has used Clustrix to build and grow Twoo, a social networking site, to more than four million users in only six months without sharding the application and without any downtime.
Register for the event here.
February 8th, 2012 — Data management
February 3rd, 2012 — Data management
New CEO at Revolution. Pentaho goes big data. EMC Hadoop gets Isilon. And more.
An occasional series of data-related news, views and links posts on Too Much Information. You can also follow the series @thedataday.
* Revolution Analytics Names David Rich New CEO
* Pentaho Open Sources Big Data Capabilities to Further Fuel Widespread Adoption
* EMC Isilon is Industry’s First Scale-Out NAS System with Native Hadoop Support
* Actuate Reports Fourth Quarter and Fiscal Year 2011 Financial Results
* Sumo Logic Raises $15M Series B Round for Next Generation Log Management and Analytics
* Announcing Oracle R Enterprise 1.0
* Paul Cormier Joins Hortonworks’ Board of Directors
* DataStax Launches First Complete Solution for Cassandra Development on Windows and Mac
* Latest Release of Kalido Information Engine Eliminates Data Mart Migration and Consolidation Hassles
* Karmasphere Brings More Power, Collaboration, and Faster Insights to Big Data Analytics Teams on Hadoop
* Why Big Data Won’t Make You Smart, Rich, Or Pretty
* SAP HANA – slowly moving out of hype into actual projects
* For 451 Research clients
# Actuate gets ready to go shopping in the ‘big data’ mall Acquirer IQ
# Couchbase cites enterprise adoption, clarifies distributed NoSQL database strategy Impact report
# SpagoBI illuminates 2012 roadmap, takes open source model to US, Latin America Impact report
# Customer data analysis provider nPario combines big data and smart segmentation Impact report
# Tableau details 2012 growth strategy, gets semantic for visual analytics Market development report
# EMC integrates re-branded Hadoop distribution with Isilon NAS Market development report
# Quiterian seeks funding for new customer analytics in the cloud focus Market development report
# Hortonworks refines its commercial strategy for Apache Hadoop Market development report
# Digital Reasoning pledges to automate the analysis of complex data Market development report
And that’s the Data Day, today.
February 2nd, 2012 — Data management
Thanks to everyone who has already taken part in our survey exploring changing attitudes to MySQL following its acquisition by Oracle and examining the competitive dynamic between MySQL and other database technologies, including NoSQL and NewSQL.
The response has been great and even a quick look at the results makes for interesting reading, particularly in the light of our previous findings which indicated declining MySQL usage.
I am really looking forward to having the opportunity for a deep dive into the results and break out the figures to get a better understanding of the potential impact of alternative MySQL distribution and support providers, as well as NoSQL and NewSQL, on continued usage of MySQL.
The survey results will be made freely available on our blogs, as well as being included in a long format report containing our additional analysis and research related to the MySQL ecosystem and competitive dynamic.
Right now, however, is your last chance to contribute to the survey and get your voice heard. There are just 12 questions to answer, spread over four pages, and the entire survey should take no longer than five minutes to complete. All individual responses are of course confidential.
The survey will close in 24 hours.