Entries from April 2012 ↓

The Data Day, Today: Apr 25 2012

Splunk soars on IPO. VMware acquires Cetas. Vertica retains autonomy. And more.

An occasional series of data-related news, views and links posts on Too Much Information. You can also follow the series @thedataday.

* For 451 Research clients

# Splunk IPO: $3bn and counting (M&A Insight)

# VMware snaps up Cetas Software for ‘big data’ analytics (Deal Analysis)

# HP’s Vertica retains its autonomy, continues integration with Autonomy (Impact Report)

# SAP makes long-awaited predictive analytics move of its own (Impact Report)

# Sanbolic pitches data management platform for server, desktop and database consolidation (Impact Report)

* Splunk IPO kills, lives up to expectations

* VMware acquires Cetas Software for Cloud and Big Data Analytics

* Opera Solutions Acquires Procurement Analytics Tools and Services from BIQ and Lexington Analytics

* Terascala Announces $14M Series B Funding Round Led by Strategic Partner Consortium

* Ravel Acquired by W2O Group To Expand Big Data Client Services And Enrich In-House Analytics and Insights Technology

* Teradata Active Data Warehouses Provide Private Cloud Benefits

* Pentaho Introduces New Interactive Visualization and Expanded Big Data Analytics

* Teradata Unveils New Purpose-Built Appliance for SAS High-Performance Analytics

* SAP Establishes Global Managing Board to Lead Company

* Oracle to Hadoop Under OneAppliance: GridIron Introduces First All-Flash Appliance Line With Unprecedented Performance to Tackle Unified Big Data Processing

* Lucid Imagination Technology Integration with SugarCRM Lets Customers Enjoy Improved Global Search Capabilities with Apache Lucene/Solr

* The Apache Software Foundation Announces Apache Cassandra v1.1

* Miso project: how it will help you make your own Guardian-style infographics and data visualisations

And that’s the Data Day, today.

The Data Day, Today: Apr 19 2012

Splunk goes public. SkySQL and Connotate raise funding. And more.

* Splunk Prices Initial Public Offering. 13,500,000 shares at $17.00 per share = $229.5m.

* Connotate Increases Momentum and Closes $7m Series B Round

* SkySQL Raises $4 million in Series A Round

* SAND Technology Announces Exploration of Potential Strategic Alternatives

* GoodData Closes out the Quarter With Increased Revenue Growth and Expanded Market Traction

* World’s Largest Telcos Adopt Graph Databases to Solve Connected Data Issues

* Gazzang Seizes Big Data Opportunity, Announces Record Quarter and Year over Year Growth

* Hadapt Adds Big Data Industry Veteran Christopher Lynch as Chairman of the Board of Directors

* PalominoDB and SkySQL Join Forces to Offer Unparalleled Remote Database Services to Leading Companies Worldwide

* Cloudant Data Layer as a Service Adds Support for Joyent Cloud

* GoGrid Introduces a High-Performance Platform for Predictive Analytics

* MongoDB Hadoop Connector Announced

* StreamBase Releases StreamBase LiveView 1.0

* Pervasive RushAnalyzer and Cloudera Eliminate Barriers to Rapid Hadoop ROI

* Pegasystems Announces Hadoop Big Data Support

* XtremeData Hires Former IBM Analytics Leader

* Lucid Imagination Announces General Availability of LucidWorks Enterprise 2.1

* Of open data and pregnant men

* Is UNQL Dead?

* MySQL in 2012: Report from Percona Live

* For 451 Research clients

# Will new offerings and price cuts encourage greater database-as-a-service adoption? (Spotlight report)

# Basho expands into cloud storage with Riak CS (Impact Report)

# SAP modernizes its application stack at the data layer and the mobile front end (Impact Report)

# QlikTech takes QlikView pricing out of the dark (Impact Report)

# Kitenga refreshes Hadoop-based content-analysis wares; finds rollouts a slow burn (Impact Report)

# CoreMedia looks to NoSQL to scale social experiences for its WCM platform (Impact Report)

# Boundary maps monitoring for ‘big data’ as its path to enterprise (Impact Report)

# Orchestra to add data quality notes to MDM ensemble as it continues to eye US growth (Impact Report)

# Columnar database provider SAND Technology puts itself up for sale (M&A Insight)

# Is it time for Microsoft to ditch partners for performance management and go shopping? (Acquirer IQ)

And that’s the Data Day, today.

The Data Day, Today: Apr 11 2012

IBM launches Galileo database update. SAP outlines database roadmap. And more.

* Made in IBM Labs: New IBM Software Accelerates Decision Making in the Era of Big Data. IBM launches DB2 10 and InfoSphere Warehouse 10.

* SAP Unveils Unified Strategy for Real-Time Data Management to Grow Database Market Leadership

* SAP Unveils Strategy to Gain Predictive Insights From Big Data

* TIBCO Delivers Breakthrough Software to Analyze Big Data in Motion

* TIBCO Announces Intent to Acquire LogLogic

* TIBCO Spotfire and Attivio Partner to Deliver New Levels of Integration and Discovery for Data and Content

* Mortar Data, Hadoop for the Rest of Us, Gets Seed Funding

* The coming in-memory database tipping point. Microsoft’s perspective on in-memory databases.

* Jaspersoft Extends Partnership with Talend to Deliver Big Data Integration

* Oracle to Hold MySQL Connect Conference in San Francisco September 29 and 30, 2012

* Percona XtraDB Cluster Open Source Software Provides a New Approach to High Availability MySQL

* Tokutek Brings Replication Performance to MySQL and MariaDB

* Continuent Announces Tungsten Enterprise 1.5 for Multi-Master, Multi-Region MySQL Data Services in the Amazon EC2

* SkySQL, hastexo Form Highly Available Partnership

* MySQL at Twitter. Twitter releases its MySQL modifications under BSD license.

* Percona Bundles New Relic to Provide Gold and Platinum Support Customers with Comprehensive Application Visibility

* Percona Toolkit 2.1 for MySQL Enables Schema Changes without Scheduling Downtime

* Percona XtraBackup 2.0 for MySQL and Percona Server Provides Increased Performance

* Delphix Expands Agile Data Platform to Support Oracle Exadata

* Red Hat and 10gen Create Compelling Open Source Data Platform

* Announcing Pre-Production MongoDB Subscription from 10gen

* VoltDB Announces Version 2.5

* Red Hat Storage 2.0 Beta: Partners Test Big Data, Hadoop Support

* Sungard wants to sell you Hadoop as a service

* Actian and Lenovo Team to Optimize Big Data and Business Intelligence with New Appliance

* Objectivity Expands European Management Team With Former Sones Founder Mauricio Matthesius

* expressor Expands Data Integration Platform Into Big Data

* The Apache Software Foundation Announces Apache Sqoop as a Top-Level Project

* LucidDB has left Eigenbase, moved to Apache License

* For 451 Research clients

# IBM looks to the stars with Galileo relational database update (Impact Report)

# Indicee eyes fresh VC as it establishes beachhead for cloud BI service using OEM sales (Impact Report)

# Percona launches XtraDB Cluster for MySQL database high availability (Impact Report)

# Tokutek targets replication performance with database update (Impact Report)

# ‘Big data’ in the datacenter: Vigilent secures $6.7m funding round (Impact Report)

And that’s the Data Day, today.

‘Big Data’ Survival Guide: A 10-step guide to surviving the ‘big data’ deluge

Earlier today I presented a ‘Big Data’ Survival Guide at our HCTSEU event in London. The presentation was in effect a 10-step guide to surviving the ‘big data’ deluge.

Here’s a taster of what was discussed:

1. There’s no such thing as “big” data.
Or, more to the point: The problem is not “big” data – it’s more data. The increased use of interactive applications and websites – as well as sensors, meters and other data-generating machines – has increased the volume, velocity and variety of data to store and process.

2. ‘Big Data’ has the potential to revolutionize the IT industry.
Here we are talking less about the three Vs of big data and more about ‘big data’ as a concept, which describes the realization of greater business intelligence by storing, processing and analyzing that increased volume, velocity and variety of data. It can be summed up by the statement from Google’s The Unreasonable Effectiveness of Data that “Simple models and a lot of data trump more elaborate models based on less data”.

3. Never use the term ‘big data’ when ‘data’ will do.
“Big Data” is nearing/at/over the hype peak. Be cautious about how you use it. “Big Data” and technologies like Hadoop will eventually become subsumed into the fabric of the IT industry and will simply become part of the way we do business.

4. (It’s not how big it is) It’s what you do with it that counts.
Generating value from data is about more than just the volume, variety, and velocity of data. The adoption of non-traditional data processing technologies is driven not just by the nature of the data, but also by the user’s particular data processing requirements. That is the essence of our Total Data management concept, which builds on the three Vs to also assess Totality, Exploration, Frequency and Dependency. Each of these is explained in one of the next four steps:

5. All data has potential value.
Totality: The desire to process and analyze data in its entirety, rather than analyzing a sample of data and extrapolating the results.

6. You may have to search for it.
Exploration: The interest in exploratory analytic approaches, in which schema is defined in response to the nature of the query.

7. Time is of the essence.
Frequency: The desire to increase the rate of analysis to generate more accurate and timely business intelligence.

8. Make the most of what you have.
Dependency: The need to balance investment in existing technologies and skills with the adoption of new techniques.

9. Choose the right tool for the job.
There is no shortcut to determining which is the best technology to deploy for a particular workload. Several companies have developed their own approaches to solving this problem, and these do provide some general guidance.

10. If your data is “big”, the way you manage it should be “total”.
Everything I talked about in the presentation, including examples from eBay, Orbitz, Expedia, Vestas Wind Systems, and Disney (and several others) that I did not have space to address in this post, is included in our Total Data report. It examines the trends behind ‘big data’, explains the new and existing technologies used to store, process and deliver value from data, and outlines a Total Data management approach focused on selecting the most appropriate data storage and processing technology to deliver value from big data.
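
Step 6 describes exploratory analytics in which the schema is defined in response to the query. As a rough illustration only (it is not taken from the presentation or the Total Data report), the Python sketch below keeps a few raw JSON events untouched and lets each query supply its own schema at read time. The event data, the `query` helper and the field names are all hypothetical.

```python
import json

# Hypothetical raw events, stored as-is with no schema fixed up front.
RAW_EVENTS = [
    '{"user": "a", "action": "search", "ms": 120}',
    '{"user": "b", "action": "book", "price": "199.50"}',
    '{"user": "a", "action": "book", "price": "80"}',
]


def query(raw_lines, schema):
    """Apply a schema at read time (hypothetical helper).

    Keeps only the records that carry every requested field, and casts
    each field with the type given in `schema`.
    """
    rows = []
    for line in raw_lines:
        record = json.loads(line)
        if all(field in record for field in schema):
            rows.append(
                {field: cast(record[field]) for field, cast in schema.items()}
            )
    return rows


# Two different questions impose two different schemas on the same raw data.
bookings = query(RAW_EVENTS, {"user": str, "price": float})
latencies = query(RAW_EVENTS, {"action": str, "ms": int})

total_booked = sum(row["price"] for row in bookings)
```

Here the same three raw records answer a question about bookings and a question about latency without any table definition being fixed in advance. An approach that defines the schema before loading would typically need both sets of fields to be modelled first.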

The Data Day, Today: Apr 2 2012

Basho launches cloud storage play. Opera acquisitions. And more.

* Basho Unveils Riak CS, Multi-Tenant Cloud Storage Software for Public and Private Clouds

* InsightsOne Secures $4.3 Million in Series A Round of Funding Led by Norwest Venture Partners

* Opera buys Commendo to create predictive analytics powerhouse

* Opera Solutions Increases Procurement Capabilities with Acquisition of Lexington Analytics

* How federal money will spur a new breed of big data

* Another HP org change. Vertica no longer under the purview of Autonomy boss Mike Lynch?

* New SAS Visual Analytics Helps Organizations Analyze, Visualize Big Data

* Citrusleaf Delivers Real-Time NoSQL Replication

* NuoDB Launches Open Source Initiative on Github

* Actian Teams up With FlyingBinary and Tableau to Unleash Big Data Potential

* DH2i Launches and Unveils DxConsole Next Generation Virtualization Solution to Enable the Agile, Always-On Enterprise

* Acunu Analytics Ready to Preview!

* SAND Technology Announces Second Quarter Results for Fiscal Year 2012

* Idera Announces VMware Database Performance Monitoring Solution

* Idera Announces SQL Compliance Manager 3.6

* WalmartLabs is building big data tools — and will then open source them

* The three waves of opportunities in big data

* 4 Big Data Myths – Part I

* For 451 Research clients

# Drawn to Scale raises funds for Hadoop-based real-time database (Impact report)

# ParElastic brings elastic parallelism to relational databases (Impact report)

# DH2i launches with PolyServe-inspired database-virtualization software (Impact report)

# Tape industry pins future on ‘big data,’ active archiving and LTFS (Spotlight report)

# Lucid Imagination dreams up new strategy for enterprise search (Market development report)

# Pentaho identifies ‘big data’ analytics as investment priority, hooks into DataStax (Market development report)

# GridGain positions in-memory data grid for real-time analytics (Market development report)

# Having earned its stripes in HPC, Panasas heads for ‘big data’ (Market development report)

* Google News Search outlier of the day: Top 10 Dog and Cat Medical Conditions of 2011

And that’s the Data Day, today.