MS TechDays Session: Big Data in the Microsoft Platform
The slides from my big data session at Microsoft TechDays can be found below. During the session, we discussed many of the fundamental technologies in the most popular Hadoop distributions, including HDFS, MapReduce, HBase, Hive, Pig, Sqoop and others. The session focused on practical examples, and we spent quite a bit of time showing code :)
Towards the end of the presentation, we debated some of the limitations of Hadoop and explored some recent research focused on addressing them. I particularly enjoyed the discussion around Google Dremel, which is the foundation of the amazing Google BigQuery engine.
I hope you find this material useful.
If you have any feedback about MS HDInsight or would like to know more, don’t hesitate to ping me via this weblog.