Hadoop Starter Kit

LEARN HDFS, MAPREDUCE AND INTRODUCTION TO PIG AND HIVE WITH FREE CLUSTER ACCESS


Course content:

Section 1: Introduction & Software Setup
Section 2: HDFS
Section 3: MapReduce
Section 4: Apache Pig
Section 5: Apache Hive
Section 6: Hadoop Administrator in Real World
Section 7: Hadoop Developer Course

Description:

In the first section you will learn about what is big data with examples. We will discuss the factors to consider when considering whether a problem is big data problem or not. We will talk about the challenges with existing technologies when it comes to big data computation. We will breakdown the Big Data problem in terms of storage and computation and understand how Hadoop approaches the problem and provide a solution to the problem.

In the HDFS, section you will learn about the need for another file system like HDFS. We will compare HDFS with traditional file systems and its benefits. We will also work with HDFS and discuss the architecture of HDFS.

In the MapReduce section you will learn about the basics of MapReduce and phases involved in MapReduce. We will go over each phase in detail and understand what happens in each phase. Then we will write a MapReduce program in Java to calculate the maximum closing price for stock symbols from a stock dataset.

In the next two sections, we will introduce you to Apache Pig & Hive. We will try to calculate the maximum closing price for stock symbols from a stock dataset using Pig and Hive.


Who is this course for:

  • This course is for anyone who wants to learn about Big Data technologies. 
  • No advanced programming knowledge is needed
  • This course is for anyone who wants to learn about distributed computing and Hadoop


Comments

Popular posts from this blog

Anti-Money Laundering Concepts: AML, KYC and Compliance

Microsoft Excel Masterclass for Business Managers

Complete WordPress Website Developer Course