As I am planning to attend the Hadoop Summit next week in San Jose, I was reviewing what has been happening in the Hadoop and Big Data space over the last year. One of the major shifts we have seen is the introduction of SQL into the NoSQL world. All of the major Hadoop distributions now have a SQL or SQL-like story including Impala and Drill. A key driver in adding SQL to Hadoop is the requirement within the enterprise to be able to use their existing toolset with their Hadoop data. This means, enterprises want to be able to use tools like Excel, Crystal Reports, Tableau, QlikView, MicroStrategy, etc with data in a Hadoop cluster.
Over 20 years ago, this problem was solved with an open standard called ODBC. Today, ODBC is the de facto way that Hadoop and other Big Data sources are exposing the data they hold to the BI and analytics tools that enterprises use. Just like in the early 1990’s when Simba enabled companies like Microsoft, Oracle, and IBM to deliver broad-based ODBC connectivity to relational sources, Simba is now enabling products like Hive, Cassandra, MongoDB, HBase, and BigQuery to be accessed using the ODBC standard. I am looking forward to the Hadoop Summit next week to see what else is new.