To compile a Hive UDF and if you have the Hadoop source code, the right way to do this is to use maven with the Hive repository so you can … Continue reading
In the last few weeks, I have had a number of customers ping me about how to utilize various Hive UDFs. The first ask was how to use some of … Continue reading
.“…to look at the stars always makes me dream, as simply as I dream over the black dots of a map representing towns and villages…” – Vincent Van Gogh Image … Continue reading
Reblogged from Ayad Shammout's SQL & BI Blog: Healthcare Compliance with Big Data and BI Over the past few years Denny Lee (Technical Principal Program Manager within Microsoft’s SQL Business … Continue reading
One of the great thing about working with the folks at 343 Industries – Halo developer – is that I get to claim my that playing Halo 4 is part … Continue reading
If you’ve joined the HDInsight Preview – you will notice many new changes including the tight integration with Windows Azure and that HDInsight defaults to ASV. As noted in Why … Continue reading
By Brad Sarsfield and Denny Lee One of the questions we are commonly asked concerning HDInsight, Azure, and Azure Blob Storage is why one should store their data into Azure … Continue reading
By Michael Wetzel, Tamir Melamed, Mark Vayman, Denny Lee Reviewed by Pedro Urbina Escos, Brad Sarsfield, Rui Martins Thanks to Krishnan Kaniappan, Che Chou, Jennifer Yi, and Rob Semsey As … Continue reading
About three and a half years ago, I had virtually joined the Yahoo! Targeting, Analytics, and Optimization (TAO) Engineering team where we embarked on an incredible journey to create the … Continue reading
. “No, not Angry Bird – Elephant Bird!” – said no one . . In a few of my customer projects, we started diving into using protocol buffers (protobufs) as … Continue reading
Over the past seven years, Ayad Shammout (@aashamout), Principal Business Intelligence Consultant at Beth Israel Deaconess Medical Center (a teaching hospital of Harvard Medical School), and I have worked on … Continue reading
Spark is an in-memory open source cluster computing system allowing for fast iterative and interactive analytics. Spark utilizes Scala – a type-safe objected oriented language with functional properties that is … Continue reading
If you’re interested in Big Data, BI, and Compliance in Healthcare; check out Ayad Shammout (@aashammout) and my 24 Hours of PASS (Spring 2013) session Ensuring Compliance of Patient Data … Continue reading
In my post from last year, I had asked the rhetorical question What’s so BIG about “Big Data”. I had the honor of announcing the largest known Analysis Services cube … Continue reading