If you have the Hadoop source code and want to compile a Hive UDF, the right way to do this is to use Maven with the Hive repository so you can … Continue reading
In the last few weeks, I have had a number of customers ping me about how to utilize various Hive UDFs. The first ask was how to use some of … Continue reading
“…to look at the stars always makes me dream, as simply as I dream over the black dots of a map representing towns and villages…” – Vincent Van Gogh Image … Continue reading
Reblogged from Ayad Shammout's SQL & BI Blog: Healthcare Compliance with Big Data and BI. Over the past few years, Denny Lee (Technical Principal Program Manager within Microsoft’s SQL Business … Continue reading
If you’ve joined the HDInsight Preview, you will notice many new changes, including the tight integration with Windows Azure and the fact that HDInsight defaults to ASV. As noted in Why … Continue reading
By Brad Sarsfield and Denny Lee. One of the questions we are commonly asked concerning HDInsight, Azure, and Azure Blob Storage is why one should store their data in Azure … Continue reading
About three and a half years ago, I had virtually joined the Yahoo! Targeting, Analytics, and Optimization (TAO) Engineering team where we embarked on an incredible journey to create the … Continue reading
“No, not Angry Bird – Elephant Bird!” – said no one. In a few of my customer projects, we started diving into using protocol buffers (protobufs) as … Continue reading
Over the past seven years, Ayad Shammout (@aashamout), Principal Business Intelligence Consultant at Beth Israel Deaconess Medical Center (a teaching hospital of Harvard Medical School), and I have worked on … Continue reading
In my post from last year, I had asked the rhetorical question What’s so BIG about “Big Data”. I had the honor of announcing the largest known Analysis Services cube … Continue reading
Recently I was asked how I could get my Pig scripts to access files stored in Azure Blob Storage through the command line prompt. While it is possible to do … Continue reading
Once you get your HadoopOnAzure.com cluster up and running, an easy way to test out Hive Dynamic Partition Insert (the ability to load data into multiple partitions without the need … Continue reading
This is going to be yet another exciting SQLPASS Summit! Lots of great sessions, focus groups, and craziness – definitely one of the funner times of the job! Over the … Continue reading
One of the odd HiveODBC error messages that I recently encountered on a project is that when I am extracting data from my Hive/Hadoop cluster using the HiveODBC driver, I … Continue reading