COURSE INTRODUCTION
This is an ideal course package for individuals who want to understand the basic concepts of Big Data and Hadoop. On completing this course, learners will be able to interpret what goes behind the processing of huge volumes of data as the industry switches over from Excel-based analytics to real-time analytics.
COURSE OBJECTIVES
After finish the course, student will have knowledge and skills to:
	- Understand the characteristics of Big Data
- Describe the basics of Hadoop and HDFS architecture
- List the features and processes of MapReduce
- Learn the basics of Pig, Hive, and HBase
- Explore the commercial distributions of Hadoop
- Understand the key components of the Hadoop ecosystem
- Get introduced to Sqoop & ZooKeeper
AUDIENCE
This course is meant for professionals who intend to gain a basic understanding of Big Data and Hadoop. It is ideal for professionals in senior management who requires a theoretical understanding of how Hadoop can solve their Big Data problem.
COURSE CONTENTS
	
		
			|   Lesson 1.0 - Introduction to Big Data and Hadoop 
				Introduction to Big Data and HadoopObjectivesNeed for Big DataThree Characteristics of Big DataCharacteristics of Big Data TechnologyAppeal of Big Data TechnologyHandling Limitations of Big DataIntroduction to HadoopHadoop ConfigurationApache Hadoop Core ComponentsHadoop Core Components—HDFSHadoop Core Components—MapReduceHDFS ArchitectureUbuntu Server—IntroductionHadoop Installation—PrerequisitesHadoop Multi-Node Installation—PrerequisitesSingle-Node Cluster vs. Multi-Node ClusterMapReduceCharacteristics of MapReduceReal-Time Uses of MapReducePrerequisites for Hadoop Installation in Ubuntu Desktop 12.04Hadoop MapReduce—FeaturesHadoop MapReduce—ProcessesAdvanced HDFS–IntroductionAdvanced MapReduceData Types in HadoopDistributed CacheDistributed Cache (contd.)Joins in MapReduceIntroduction to PigComponents of PigData ModelPig vs. SQLPrerequisites to Set the Environment for Pig LatinSummary |   | Lesson 1.1 - Hive HBase and Hadoop Ecosystem Components 
				Hive, HBase and Hadoop Ecosystem ComponentsObjectivesHive—IntroductionHive—Characteristics5 System Architecture and Components of HiveBasics of Hive Query LanguageData Model—TablesData Types in HiveSerialization and De serializationUDF/UDAF vs. MapReduce ScriptsHBase—IntroductionCharacteristics of HBaseHBase ArchitectureHBase vs. RDBMSCloudera—IntroductionCloudera DistributionCloudera ManagerHortonworks Data PlatformMapR Data PlatformPivotal HDIntroduction to ZooKeeperFeatures of ZooKeeperGoals of ZooKeeperUses of ZooKeeperSqoop—Reasons to Use ItSqoop—Reasons to Use It (contd.)Benefits of SqoopApache Hadoop EcosystemApache OozieIntroduction to MahoutUsage of MahoutApache CassandraApache SparkApache AmbariKey Features of Apache Ambari Hadoop Security—Kerberos |