Hadoop on Azure: HiveQL query against Azure Blob Storage

The posting Setup Azure Blob Store for Hadoop on Azure CTP provides a quick way to upload files to your Azure Blob storage account and connect Hadoop on Azure CTP to it.  Now that you have done that, one of the first things you may want to do is to interact with the data. To do this, let’s create a Hive table within Hadoop on Azure CTP that is connected to the files you uploaded to your Azure Blob storage account and query it.  We will be referencing the scenario noted at: Hadoop on Azure Scenario: Query a web log…

Rate this:

Setup Azure Blob Store for Hadoop on Azure CTP

One of the cool ways to run Hadoop on Azure is to have it connect to Azure Blob storage via your Windows Azure Storage account.  To setup your Azure storage account, please refer to http://windows.azure.com. The tasks below will allow you to setup your Hadoop on Azure CTP account to connect to an existing Azure Blob Storage account using the asv protocol.  For example, within Hadoop, you normally would get a listing of files within HDFS using the command line interface: hadoop fs –ls / In the case of accessing files within Azure Blob storage, you can run the command:…

Rate this:

A funky way to do Hive and Hadoop … on Azure

. Funny how you can say something in your head and it sounds fine — Doctor Who (Matt Smith in Amy’s Choice) . Recently I had posted the wiki article: Hadoop on Azure Scenario: Query a web log via HiveQL. In it described how to analyze a sample web log using HiveQL on the Hadoop on Azure CTP (HadoopOnAzure.com) using Interactive Hive console, Interactive Javascript console, Secure FTP using curl to transfer data to HDFS Creating an EXTERNAL table against compressed log files Executing some simple HiveQL queries. Screenshot of the Hadoop on Azure Interactive Hive Console executing a HiveQL…

Rate this: