Archive for the ‘BigData’ Category

For the 2012 Hadoop Summit, I will have the honor of co-presenting with Dave Mariani (@dmariani) from Klout in our session How Klout is changing the landscape of social media with Hadoop and BI. Our session is currently scheduled for June 13th at 3:35pm, but it is subject to change. Check out our session and [...]


For starters, this isn’t a production setup; it is just so that I can do some quick Hadoop demos on my MacBook Air (2011). In this case, my configuration is OS X Lion, 4GB RAM, and a 256GB SSD. As well, a serious shout out to the authors below whom I referenced to create this post. [...]


For a quick primer on Hadoop (from the perspective of the Microsoft SQL Community), as well as Microsoft Hadoop on Azure and Windows, check out the SlideShare.NET presentation below, Above the cloud: Big Data and BI, from Denny Lee. Note, as well, there is a great end-to-end Microsoft Hadoop on Azure and [...]


When working with Hadoop on Azure, you may be used to the idea of putting your data in the Cloud.  In addition to using Azure Blob Storage, another option is connecting your Hadoop on Azure cluster to query data against Amazon S3.  To configure Hadoop on Azure to connect to it, below are the steps [...]


As part of the excitement of the Strata Conference this week, Microsoft has been talking about Big Data and Hadoop. It started off with Dave Campbell’s question: Do we have the tools we need to navigate the New World of Data? And some of the tooling callouts specific to Microsoft include references to PowerPivot, [...]


One of the cool things about the Hadoop on Azure CTP is its Interactive JavaScript Console – it allows users to query and visualize data on top of HDFS using a JavaScript framework. For example, below is a pie chart visualization in a browser, generated by the Interactive JavaScript Console using the graph.pie function. Why is this important [...]


The post Connecting PowerPivot to Hadoop on Azure – Self Service BI to Big Data in the Cloud provided the step-by-step details on how to connect PowerPivot to your Hadoop on Azure cluster. And while this is really powerful, one of the great features of SQL Server 2012 is Power View (formerly known [...]


Dorky attempts at geek Shakespeare aside, as the volume, complexity, and variability of your data systems increase in … entropy …, this becomes a fundamental question in whether one scales up or scales out their data problem. Apologies for the nerdy chemistry references in advance – which start with this picture of Dr. Arthur Grosser [...]


A big shout out to Brad Sarsfield (@bradoop) for creating these great How-To videos for Hadoop on Azure.
How To: Upload Data and Use the WordCount Sample with Hadoop Services for Windows Azure (video)
Run the Pi Estimator Sample on Hadoop on Windows Azure (video)


As I am writing more about Big Data, I’ve been asked whether we need to have traditional relational or cube systems now that we have Big Data / NoSQL / Hadoop. My response is to note that these are different systems that serve different purposes, even though both are used to better understand data. But [...]

