Hadoop Distributed File System Basics, Running Example Programs and Benchmarks, Hadoop MapReduce Framework, MapReduce Programming
Essential Hadoop Tools, Hadoop YARN Applications, Managing Hadoop with Apache Ambari, Basic Hadoop Administration Procedures
Business Intelligence Concepts and Application, Data Warehousing, Data Mining, Data Visualization
Decision Trees, Regression, Artificial Neural Networks, Cluster Analysis, Association Rule Mining
Text Mining, Naïve-Bayes Analysis, Support Vector Machines, Web Mining, Social Network Analysis
Course outcomes:
The students should be able to:
Question paper pattern:
Text Books:
1. Douglas Eadline,"Hadoop 2 Quick-Start Guide: Learn the Essentials of Big Data Computing in the Apache Hadoop 2 Ecosystem", 1stEdition, Pearson Education, 2016. ISBN-13: 978-9332570351
2. Anil Maheshwari, “Data Analytics”, 1st Edition, McGraw Hill Education, 2017. ISBN-13: 978-9352604180
Reference Books:
1) Tom White, “Hadoop: The Definitive Guide”, 4th Edition, O’Reilly Media, 2015.ISBN-13: 978-9352130672
2) Boris Lublinsky, Kevin T.Smith, Alexey Yakubovich,"Professional Hadoop Solutions", 1stEdition, Wrox Press, 2014ISBN-13: 978-8126551071
3) Eric Sammer,"Hadoop Operations: A Guide for Developers and Administrators",1stEdition, O'Reilly Media, 2012.ISBN-13: 978-9350239261