Seattle Spark Meetup Kicks Off with DataBricks

I am very excited to that announce that Matei Zaharia and Pat McDonough from DataBricks will be speaking at the Seattle Spark Meetup and we’ve increased the room size to accommodate more people! Seattle Spark Meetup Kick Off with DataBricks They will come up and join us for pizzas and to talk about Apache Spark!  I highly encourage you to join the Seattle Spark Meetup for this and other exciting sessions!  Below is the abstract of their session as well as their biographies. Introduction to Apache Spark Apache Spark has quickly grown to be one of the most active projects in big data,…

Rate this:

Jump Start onto Spark 0.7.2 and Scala 2.9.3 on Mac OSX

Spark is an in-memory open source cluster computing system allowing for fast iterative and interactive analytics.  Spark utilizes Scala – a type-safe objected oriented language with functional properties that is fully interoperable with Java.  For more information about Spark, please refer to http://spark-project.org.  To test out Spark, you can install the stand-alone version on Mac OSX. This is a follow up to my previous blog post on the topic – Installing Spark 0.6.1 Standalone on OSX Mountain Lion (10.8).  Since this blog post, Spark has added some interesting features including: – Spark Streaming as part of Spark 0.7 – An associated…

Rate this:

Jump Start on Apache Kafka 0.8 with Scala 2.9.2 on Mac OSX

I have run the calculations a dozen times, the grid should be enough to bring about fusion and power the device. But, to be sure, I won’t know until I press the freaking button. — Lucas, Continuum Season 1, Episode 2, “Fast Times” — As noted by Lucas and typical of most Big Data Analytics projects, the calculations may be right but that doesn’t mean its going to work.   Fortunately, that is NOT the case for running Apache Kafka 0.8 with Scala 2.9.2 on Mac OSX Mountain Lion.  In fact this jump start guide really points to some key…

Rate this:

How Klout changed the landscape of social media with Hadoop and BI Slides Updated

One of the key themes that Dave Mariani (@dmarini) and I were talking about during this year’s Hadoop Summit was that: Hadoop and BI are better together as I have detailed this in many previous blog posts.  As noted during our session – How Klout changed the landscape of social media with Hadoop and BI – combing Hadoop and BI together is something you can do right now to accelerate your understanding of the data that you have amassed in your Hadoop cluster. For more information, we have updated our Hadoop Summit slides to include additional screenshots for your reference….

Rate this: