Data Refinery with MapReduce

Apache Hadoop    |    Intermediate
  • 13 videos | 54m 55s
  • Earns a Badge
Rating 5.0 of 2 users Rating 5.0 of 2 users (2)
MapReduce is a set of classes, which abstract away the complexity of parallel processing. Learn how MapReduce can take a single compute job and run it in our super computing platform.

WHAT YOU WILL LEARN

  • Define the principle concepts of key-value pairs and list the rules for key-value pairs
    Describe how mapreduce transforms key-value pairs
    Load a large text book and then run wordcount to count the number of words in the text book
    Label all of the functions for mapreduce on a diagram
    Match the phases of mapreduce to their definitions
    Set up the classpath and test wordcount
    Build a jar file and run wordcount
  • Describe the base mapper class of the mapreduce java api and describe how to override its methods
    Describe the base reducer class of the mapreduce java api and describe how to override its methods
    Describe the function of the mapreducedriver java class
    Set up the classpath and test a mapreduce job
    Identify the concept of streaming for mapreduce
    Stream a python job

IN THIS COURSE

  • 4m 22s
    In this video, find out how to define the principle concepts of key-value pairs and list the rules for key-value pairs. FREE ACCESS
  • 2m 15s
    Upon completion of this video, you will be able to describe how MapReduce transforms key-value pairs into a list of keys and a list of values. FREE ACCESS
  • Locked
    3.  WordCount, the Hello World of Hadoop
    2m 49s
    Find out how to load a large text book and then run WordCount to count the number of words in the text book. FREE ACCESS
  • Locked
    4.  MapReduce
    9m 30s
    In this video, you will learn how to label all of the functions for MapReduce on a diagram. FREE ACCESS
  • Locked
    5.  MapReduce Step-by-Step
    5m 26s
    In this video, find out how to match the phases of MapReduce to their definitions. FREE ACCESS
  • Locked
    6.  Exploring Hadoop Classpath
    2m 50s
    In this video, you will set up the classpath and test WordCount. FREE ACCESS
  • Locked
    7.  Writing a MapReduce Job
    5m 45s
    In this video, you will build a JAR file and run a WordCount. FREE ACCESS
  • Locked
    8.  The Mapper Java API
    2m 51s
    Upon completion of this video, you will be able to describe the base mapper class of the MapReduce Java API and how to override its methods. FREE ACCESS
  • Locked
    9.  The Reducer Java API
    2m 35s
    Upon completion of this video, you will be able to describe the base Reducer class of the MapReduce Java API and how to override its methods. FREE ACCESS
  • Locked
    10.  The Driver Java API
    3m 2s
    After completing this video, you will be able to describe the function of the MapReduceDriver Java class. FREE ACCESS
  • Locked
    11.  Writing a MapReduce Job for Inventory
    5m 16s
    During this video, you will learn how to set up the classpath and test a MapReduce job. FREE ACCESS
  • Locked
    12.  Hadoop Streaming
    3m 45s
    In this video, you will learn about the concept of streaming for MapReduce. FREE ACCESS
  • Locked
    13.  Running a Streaming Job
    4m 31s
    In this video, find out how to stream a Python job. FREE ACCESS

EARN A DIGITAL BADGE WHEN YOU COMPLETE THIS COURSE

Skillsoft is providing you the opportunity to earn a digital badge upon successful completion on some of our courses, which can be shared on any social network or business platform.

Digital badges are yours to keep, forever.

YOU MIGHT ALSO LIKE

Rating 4.7 of 54 users Rating 4.7 of 54 users (54)
Channel Big Data
Rating 4.0 of 1 users Rating 4.0 of 1 users (1)