Denny Lee

Ramblings of a data dork: from BI and Big Data to Travel and Food

Compile and add Hive UDF via ADD JAR in HDInsight on Azure

If you have the Hadoop source code, the right way to compile a Hive UDF is to use Maven with the Hive repository so you can … Continue reading

May 9, 2013 · Leave a Comment

Add Built-In Hive UDFs on HDInsight Azure

In the last few weeks, I have had a number of customers ping me about how to utilize various Hive UDFs.  The first ask was how to use some of … Continue reading

May 6, 2013 · Leave a Comment

Optimizing Joins running on HDInsight Hive on Azure at GFS

“…to look at the stars always makes me dream, as simply as I dream over the black dots of a map representing towns and villages…” – Vincent Van Gogh Image … Continue reading

April 26, 2013 · 5 Comments

Healthcare Compliance with Big Data and BI

Reblogged from Ayad Shammout's SQL & BI Blog: Healthcare Compliance with Big Data and BI Over the past few years Denny Lee (Technical Principal Program Manager within Microsoft’s SQL Business … Continue reading

April 25, 2013 · Leave a Comment

Updated HDInsight on Azure ASV paths for multiple storage accounts

If you’ve joined the HDInsight Preview – you will notice many new changes including the tight integration with Windows Azure and that HDInsight defaults to ASV.  As noted in Why … Continue reading

March 25, 2013 · 8 Comments

Why use Blob Storage with HDInsight on Azure

By Brad Sarsfield and Denny Lee One of the questions we are commonly asked concerning HDInsight, Azure, and Azure Blob Storage is why one should store their data into Azure … Continue reading

March 18, 2013 · 5 Comments

#PASSBAC – Yahoo!, Big Data, and Microsoft BI: Bigger and Better Together

About three and a half years ago, I had virtually joined the Yahoo! Targeting, Analytics, and Optimization (TAO) Engineering team where we embarked on an incredible journey to create the … Continue reading

March 7, 2013 · 1 Comment

Getting Hadoop and protobufs up and running with Elephant Bird on Mac OSX Mountain Lion

“No, not Angry Bird – Elephant Bird!” – said no one. In a few of my customer projects, we started diving into using protocol buffers (protobufs) as … Continue reading

March 6, 2013 · Leave a Comment

#PASSBAC – Ensuring Compliance of Patient Data with Big Data and BI

Over the past seven years, Ayad Shammout (@aashamout), Principal Business Intelligence Consultant at Beth Israel Deaconess Medical Center (a teaching hospital of Harvard Medical School), and I have worked on … Continue reading

March 4, 2013 · 3 Comments

Yahoo! 24TB SSAS Big Data Case Study + Slides

In my post from last year, I had asked the rhetorical question What’s so BIG about “Big Data”.  I had the honor of announcing the largest known Analysis Services cube … Continue reading

December 8, 2012 · 3 Comments

Getting your Pig to eat ASV blobs in Windows Azure HDInsight

Recently I was asked how I could get my Pig scripts to access files stored in Azure Blob Storage through the command line prompt.  While it is possible to do … Continue reading

December 3, 2012 · Leave a Comment

An easy way to test out Hive Dynamic Partition Insert on HDInsight Azure

Once you get your HadoopOnAzure.com cluster up and running, an easy way to test out Hive Dynamic Partition Insert (the ability to load data into multiple partitions without the need … Continue reading

November 7, 2012 · 2 Comments

SQLPASS: Hadoop and BI are better together–we’ll show you how!

This is going to be yet another exciting SQLPASS Summit!  Lots of great sessions, focus groups, and craziness – definitely one of the funner times of the job! Over the … Continue reading

November 5, 2012 · Leave a Comment

HiveODBC error message “..expected data length is 334…”

One of the odd HiveODBC error messages that I recently encountered on a project is that when I am extracting data from my Hive/Hadoop cluster using the HiveODBC driver, I … Continue reading

October 11, 2012 · 2 Comments

Professional Microsoft SQL Server 2012 Analysis Services with MDX and DAX

Analysis Services Multidimensional and Tabular Reference all in one handy book!

@dennylee

Copyright

Copyright © 2012 Denny G Lee - All Rights Reserved