The slides from my big data session at Microsoft TechDays can be found below. During the session, we discussed a lot of the fundamental technologies of the most popular Hadopp distributions including HDFS, Map-Reduce, HBase, Hive, Pig, Sqoop and others. The session was really focused on practical examples and we spent quite a bit of time showing code :)
Towards the end of the presentation, we debated some of the limitations of Hadoop and exploring some recent research materials focused on addressing those. I particularly enjoyed the discussion around Google Dremel which is the foundation behind the amazing Google Big Query engine.
I hope you find this material useful. If you have any feedback about MS HDInsight or would like to know more don’t hesitate to ping me via this weblog.