For the 2012 Hadoop Summit, I will have the honor to co-present with Dave Mariani (@dmariani) from Klout in our session How Klout is changing the landscape of social media with Hadoop and BI. Our session is currently scheduled for June 13th at 3:35pm but it is subject to change. Check out our session and [...]
Archive for the ‘BigData’ Category
SQL BI at Hadoop Summit = Awesomesauce!
Posted in Analysis Services, BigData, SQL, tagged Hadoop, hive, Thoughts on May 30, 2012 | Leave a Comment »
Installing Hadoop on OSX Lion (10.7)
Posted in BigData, tagged Configuration, Hadoop on May 8, 2012 | Leave a Comment »
For starters, this isn’t a production setup, this is just so that I can do some quick Hadoop demos on my Macbook Air (2011). In this case, my configuration is OSX Lion, 4GB RAM, and 256GB SSD. As well, a serious shout out to the authors below whom I had referenced to create this post. [...]
A Primer on Hadoop (from the Microsoft SQL Community perspective)
Posted in BigData, tagged Architecture, Azure, Excel, Hadoop, hive, Thoughts on March 27, 2012 | 3 Comments »
For a quick primer on Hadoop (from the perspective of the Microsoft SQL Community), as well as Microsoft Hadoop on Azure and Windows, check out the SlideShare.NET presentation below. Above the cloud: Big Data and BI View more PowerPoint from Denny Lee Note, as well, there is a great end-to-end Microsoft Hadoop on Azure and [...]
Connecting Hadoop on Azure to your Amazon S3 Blob storage
Posted in BigData, Cloud, tagged amazon s3, AWS, Configuration, Connectivity, interactive javascript on March 21, 2012 | 4 Comments »
When working with Hadoop on Azure, you may be used to the idea of putting your data in the Cloud. In addition to using Azure Blob Storage, another option is connecting your Hadoop on Azure cluster to query data against Amazon S3. To configure Hadoop on Azure to connect to it, below are the steps [...]
BI and Big Data–the best of both worlds!
Posted in Analysis Services, BigData, PowerPivot, Random Thoughts, tagged Architecture, Excel, Hadoop, Javascript, Thoughts on March 1, 2012 | 1 Comment »
As part of the excitement of the Strata Conference this week, Microsoft has been talking about Big Data and Hadoop. It started off with Dave Campbell’s question: Do we have the tools we need to navigate the New World of Data?. And some of the tooling call outs specific to Microsoft include references to PowerPivot, [...]
Hadoop JavaScript– Microsoft’s VB shift for Big Data
Posted in BigData, Random Thoughts, tagged Hadoop, Javascript, Thoughts on February 17, 2012 | 1 Comment »
One of the cool things about the Hadoop on Azure CTP is its Interactive JavaScript Console – it allows users query and visualize data on top HDFS using a JavaScript framework. For example, below is a graph pie visualization within a browser generated by the Interactive JavaScript console using graph.pie function. Why is this important [...]
Connecting Power View to Hadoop on Azure–An #awesomesauce way to view Big Data in the Cloud
Posted in BigData, PowerPivot, Reporting Services, tagged Azure, Hadoop, Power View on February 10, 2012 | 3 Comments »
The post Connecting PowerPivot to Hadoop on Azure – Self Service BI to Big Data in the Cloud provided the step-by-step details on how to connect PowerPivot to your Hadoop on Azure cluster. And while this is really powerful, one of the great features as part of SQL Server 2012 is Power View (formerly known [...]
Moving data to compute or compute to data? That is the Big Data question
Posted in BigData, SQL, tagged Chemistry, Scale-Out, Thoughts on January 31, 2012 | 2 Comments »
Dorky attempts at geek Shakespere aside; as the volume, complexity, and variability of your data systems increase in … entropy …, this becomes a fundamental question in whether one scales up or scale out their data problem. Apologies for the nerdy chemistry references in advance – which starts with this picture of Dr. Arthur Grosser [...]
Cool Hadoop on Azure How To Videos by @bradoop
Posted in BigData, Cloud, tagged Azure, Hadoop on January 26, 2012 | Leave a Comment »
A big shout out to Brad Sarsfield (@bradoop) for creating these great How-To videos for Hadoop on Azure. How To: Upload Data and Use the WordCount Sample with Hadoop Services for Windows Azure (video) Run the Pi Estimator Sample on Hadoop on Windows Azure (video)
Scale Up or Scale Out your Data Problems? A Space Analogy
Posted in BigData, SQL, tagged Scale-Out, Space, Thoughts on January 24, 2012 | 7 Comments »
As I am writing more about Big Data, I’m been asked whether we need to have traditional relational or cube systems now that we have Big Data / NoSQL / Hadoop. My responses are to note that these are different systems that serve different purposes even though both are used to better understand data. But [...]

