MS TechDays Session: Big Data in the Microsoft Platform

 

The slides from my big data session at Microsoft TechDays can be found below. During the session, we discussed a lot of the fundamental technologies of the most popular Hadopp distributions including HDFS, Map-Reduce, HBase, Hive, Pig, Sqoop and others. The session was really focused on practical examples and we spent quite a bit of time showing code :)

Towards the end of the presentation, we debated some of the limitations of Hadoop and exploring some recent research materials focused on addressing those. I particularly enjoyed the discussion around Google Dremel which is the foundation behind the amazing Google Big Query engine.

 I hope you find this material useful. If you have any feedback about MS HDInsight or would like to know more don’t hesitate to ping me via this weblog.

Published Wednesday, March 13, 2013 8:15 AM by gsusx

Comments

No Comments