Entries from March 2011 ↓

451 is hiring!

As you may have seen from recent announcements, The 451 Group is expanding, both organically and inorganically.

As part of that, we’re looking for a number of new analysts. Two of the roles are in the information management and storage areas, and both are US-based.

Information governance and e-Discovery analyst

  • More details in the ad here

Senior storage analyst

  • More details in the ad here

In both cases the email to contact is careers@the451group.com.

We basically look for very smart people who can write and who are passionate about their chosen area. If you have those three attributes, you’re a long way toward becoming a 451 analyst.

MySQL NoSQL survey highlights role of polyglot persistence

The MySQL developer website is currently running a poll to gauge the adoption of NoSQL database projects by MySQL developers.

The results are interesting, particularly in relation to our research report on the emergence and adoption of NoSQL and NewSQL databases, which I am completing this week.

Our research has shown that one of the drivers of NoSQL has been performance, and in particular the failure of MySQL to provide predictable performance at scale. We do see NoSQL being deployed for applications that previously ran on MySQL, or for which MySQL would previously have been the natural choice.

For example, Facebook continues to run its core applications on MySQL, using the InnoDB storage engine and memcached. However, it also created what became Apache Cassandra to power its inbox search and, having found that MySQL was unable to deliver the performance required for large data sets, selected Apache HBase for its Messages application, which was updated in late 2010 to combine chat, email, and SMS.

Similarly, content discovery service StumbleUpon adopted HBase following problems with MySQL failover, Digg replaced its MySQL cluster with Apache Cassandra, and Wordnik replaced MySQL with MongoDB.

Clearly, however, not every MySQL application is suitable for a NoSQL database. Just because almost 80% of the MySQL survey respondents are adopting NoSQL databases does not mean they are replacing MySQL with NoSQL.

Like Facebook, many major NoSQL users also continue to use MySQL, including Twitter, which backtracked on a planned migration of its core status table to Apache Cassandra in 2010. It continues to use MySQL, but is adopting Cassandra for newer projects.

The adoption of multiple database products depending on the nature of the application is another of the six major drivers for NoSQL and NewSQL adoption highlighted by our research.

The theory of polyglot persistence rests on the fact that different data storage models have their own strengths, and on the acceptance that, while the relational model suits a large proportion of data storage requirements, there are times when a document, graph, or object database, or even a distributed file system, might be more suitable.

Facebook and Twitter are prime examples of polyglot persistence in action, and the survey of MySQL developers shows that the practice is widespread. At the time of writing 205 people have responded to the survey, providing 421 responses.

If we exclude the 42 that indicate they are not using a NoSQL database, that means that the remaining 163 people are using 379 NoSQL databases, which equates to 2.33 databases per respondent, not including their existing use of MySQL or other traditional or NewSQL databases.
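For anyone who wants to check the arithmetic, here is a minimal sketch using only the figures quoted above. It assumes, as the calculation in this post does, that each of the 42 people not using NoSQL accounts for exactly one of the 421 responses; the variable names are purely illustrative.

```python
# Survey snapshot quoted in the post (at the time of writing).
respondents = 205        # people who answered the poll
responses = 421          # total options selected across all respondents
not_using_nosql = 42     # respondents who said they use no NoSQL database

# Assumption: each "not using" respondent contributed exactly one response.
nosql_users = respondents - not_using_nosql
nosql_selections = responses - not_using_nosql

adoption_rate = nosql_users / respondents
databases_per_user = nosql_selections / nosql_users

print(f"NoSQL users: {nosql_users}")
print(f"NoSQL databases selected: {nosql_selections}")
print(f"Share of respondents using NoSQL: {adoption_rate:.1%}")
print(f"NoSQL databases per NoSQL user: {databases_per_user:.2f}")
```

The share of respondents using NoSQL is where the "almost 80%" figure comes from, and the last line reproduces the 2.33 databases per respondent.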

I’ll provide more details of the research report, including the other four adoption drivers, once the report is published. The report analyzes the drivers behind the development and adoption of NoSQL and NewSQL databases, the evolving role of data grid technologies, and the associated use cases. It will be available soon for clients of our Information Management and CAOS practices.

Webinar next week: Text-aware apps & document filters

On Tuesday March 8 I’m doing a webinar along with Isys Search Software and Sybase about text-aware applications. The full title is “Text-Aware Software Solutions: What Defines Excellence?”

‘Text-aware applications’ is a phrase we coined back in 2005 while writing a major report on the subject, in which we looked at the various application areas (CRM, ERP, BI, etc.) that could benefit from a deep understanding of unstructured data.

As the first key finding from the report said:

The future success of companies and organizations will increasingly be based on their ability to unlock hidden intelligence and value from unstructured data, and text in particular.

The webinar on March 8 looks at the role of document filters in making applications text-aware, which is something I’ve talked about here before.

It’s at 10am PT/1pm ET/6pm UK. You can register here.