Tuesday, October 9, 2012

IMDBs (In Memory Databases) gaining popularity



As hardware and memory prices continue to decline, in memory databases (IMDB) are gaining traction for real-time analytics, applications, and platforms. There are good reasons for the accelerated adoption of IMDBs for big data, with the most obvious being performance (as in very fast delivery of large data, and on the fly aggregations of multi-dimensional data delivered quickly to the end user).

From an architecture perspective there are a few key differences, looking at HANA as an example, that power such enhanced performance. 
  • Memory vs disk as primary storage (much faster performance)
  • Column and row store vs tables (no indices, materialized views, or cubes - aggregations on the fly)
  • Data compression
  • Parallelism (both on hardware and processing)
These changes are redefining the roles of data modelers, database developers and DBAs which are way more blurred in IMDB environments vs traditional databases.

Use Cases for IMDBs include: real-time analytics on operational databases, data warehousing, and predictive analysis on big data, as well as real-time applications and platforms (mobile, cloud)

Drawbacks for IMDBs include: Maturity of most product offerings, cost and availability of skills / expertise.

Enterprise IMDB Vendors and Products


  • Microsoft (updated per announcement on Oct 24th 2012)
  • Opensource
    • Project Serengeti is an open-source effort that is coming right behind Microsoft from VMWare
    • MySQL, other smaller players here
    • Feedback welcome on opensource IMDB enterprise solutions you have had success with

Expecting the adoption of IMDBs as well as product maturity to continue to accelerate and the vendor battle to heat up, as large enterprise clients climb on-board.  At this time everyone seems to be chasing SAP HANA.

Other Links


No comments:

Post a Comment