Import Hadoop Data into SQL BI Semantic Model Tabular

Originally posted on Ayad Shammout's SQL & BI Blog:
  Hadoop brings scale and flexibility that don’t exist in the traditional data warehouse. Using Hive as a data warehouse for Hadoop to facilitate easy data summarization, ad-hoc queries, and the analysis of large datasets. Although Hive supports ad-hoc queries for Hadoop through HiveQL, query performance is often prohibitive for even the most common BI scenarios. A better solution is to bring relevant Hadoop data into SQL Server Analysis Services Tabular model by using HiveQL. Analysis Services can then serve up the data for ad-hoc analysis and reporting. But, there…

Rate this:

In the context of quantum entanglement and time travel – Stargate may be more correct than Star Trek

Feature Image: Michael Bolognesi’s Diamonds in the Sky As a follow up to In the context of quantum entanglement and teleportation – Stargate may be more correct than Star Trek, I’m diving into one of SciFi’s persistent quandaries – time travel.  And before anyone gets started, I am a proud Trekkie so this is not meant as a knock on Star Trek.  In fact, I’ve already purchased my tickets for Star Trek Into Darkness and as fan of BBC’s Sherlock, I have to admit I’m sort of rooting for the villain this time around!   Image source: Benedict Cumberbatch – Star…

Rate this:

Compile and add Hive UDF via ADD JAR in HDInsight on Azure

To compile a Hive UDF and if you have the Hadoop source code, the right way to do this is to use maven with the Hive repository so you can compile your JAR using the exact version of the source code / jars that you are working against.  For more information on how to use maven, check out: http://maven.apache.org/guides/getting-started/maven-in-five-minutes.html But in situations where you do not have access to the source code, but you have all the necessary jars (like the jars within the lib folder) you can workaround this by manually compiling the Hive UDFs.  To do this, let’s…

Rate this:

Add Built-In Hive UDFs on HDInsight Azure

In the last few weeks, I have had a number of customers ping me about how to utilize various Hive UDFs.  The first ask was how to use some of the UDFs that are already built into Hive.  For example, if you wanted a generated row sequence number (i.e. an IDENTITY column), you can use the Hive UDF UDFRowSequence.  This UDF is already built and included in the hive-contrib-0.9.0.jar that is not already loaded in the distributed cache (run list jars from the Hive CLI to verify).  Below is a quick code snippet that allows you to run the generated…

Rate this: