Connect Tableau to MongoDB, Salesforce, Cassandra, and…Well, Just About Anything

August 27, 2014 in Big Data, Cassandra, MongoDB, ODBC, Salesforce, Simba Technologies, SQL, Tableau; tagged as Cassandra, MongoDB, ODBC, Salesforce, Tableau

The good folks at Tableau Software (@Tableau) recently released v8.2 of their flagship analytics software, extending their reach to the Mac platform. Even cooler, Tableau chose Simba’s ODBC technology to power SQL connectivity for the solution, embedding Simba code into their Mac-platform offering. (They liked Simba connectivity so much they put their … Read More

Simba continues to lead with ODBC for Spark SQL and everything else

August 25, 2014 in ADO.NET, Data Access, JDBC, ODBC

In the “Big Data” space, Simba has always been the leader in data connectivity, being the first to provide ODBC and JDBC drivers for many of the new data sources. All of the major Hadoop vendors, including Cloudera, Hortonworks, and MapR, use the Simba ODBC and JDBC drivers in their distributions. … Read More

“A Shark by any other name would run as fast (or faster)”

August 21, 2014 in Big Data, Data Access, Databricks, JDBC, ODBC, Shark, Spark, SQL, Uncategorized

What’s in a name? Pardon my paraphrase of a certain bard: “What’s in a name? that which we call a rose / By any other name would smell as sweet;” It was announced at the 2014 Spark Summit that Shark would retire. Since then, and despite Reynold’s post on the last day of Summit … Read More

OLAP for Big Data…

July 18, 2014 in Business Intelligence, Data Analytics, Multi-Dimensional Data Connectivity, OLAP, OLTP, Relational Database Connectivity, Shark, Spark, Web/Tech

Is it me, or has OLAP finally (re)emerged from the shadow of the transactional systems? I was pleasantly surprised by AtScale’s subdued emergence at June’s Hadoop Summit. (Refer to my previous summary for more.) This week, Socrata’s Evan Chan presented “Interactive OLAP Queries using Cassandra and Spark” at the Seattle Spark Meetup. Evan began work using Spark with … Read More

Google Chucks MapReduce, Databricks is in the Cloud, and Shark Meets its Maker: A Spark Summit 2014 Synopsis

July 2, 2014 in Big Data, Cloudera, Data Access, Databricks, Google, Hadoop, Hive, Hortonworks, MapR, ODBC, SAP, Shark, Spark, Web/Tech

It is a post-MapReduce world. Last week’s announcement out of Google I/O was that Google has retired MapReduce and replaced it with its home-grown cloud analytics system, Cloud Dataflow. Rhetoric aside, it is good to see this from the originator of the concept. (I recall a similar declaration from Apache … Read More

Databricks – an interesting plan for Spark, Shark, and Spark SQL

July 2, 2014 in Databricks, Hadoop, Shark, Spark, SQL

This week was the Spark Summit in San Francisco. It was a great event with a lot of interesting announcements, several of them from Databricks, the company promoting Spark and Shark. One piece of news is that they are ending development of Shark and instead focusing their efforts … Read More

All About Apache Drill Data Sources and File Types

June 26, 2014 in Big Data, Data Access, Data Terms / Applications, Hive; tagged as CSV, DFS, Drill, File Types, HBase, Hive, JSON

Storage plugin extensibility is a key feature of Apache Drill. Drill supports Hive, HBase, and its DFS file system, which encompasses the CSV, TSV, JSON, and Parquet file types. You can configure Drill sources and data types via its web interface. (More details on that in my earlier blog post here.) … Read More

Setting up Apache Drill to Query Hive and HBase: a How-to

June 19, 2014 in Big Data, Data Technologies, Hadoop, Hive, Relational Database Connectivity; tagged as Drill, HBase, Hive

The Apache Drill beta release is near! This guide will show you how to set up and use Drill in embedded mode. Apache Drill is a low-latency distributed query engine capable of querying large datasets. It stands out from other execution engines in that it allows for nested schemas, multiple … Read More