Dataproc Operations

Google Cloud    |    Intermediate
  • 10 videos | 52m 1s
  • Includes Assessment
  • Earns a Badge
Rating 4.6 of 25 users Rating 4.6 of 25 users (25)
Executing Dataproc implementations with big data can provide a variety of methods. Examine Dataproc implementations with Spark and Hadoop using the cloud shell and introduce BigQuery PySpark REPL package.

WHAT YOU WILL LEARN

  • Describe the various spark and hadoop processes that can be performed with dataproc
    Recognize the benefits of separating storage and compute services using cloud dataproc
    Recall the process of monitoring and logging dataproc jobs
    Demonstrate the process of using an ssh tunnel to connect to the master and worker nodes in a cluster
    Define the spark repl package and how it's used in linux
  • Describe the compute and storage processes and the benefits of their separation and the virtualized distribution of hadoop
    Define bigquery and its benefits for large-scale analytics
    Describe the mapreduce programming model
    Demonstrate the process of submitting multiple jobs with dataproc
    Recognize the various dataproc and cloud shell job operations and implementations

IN THIS COURSE

  • 3m 5s
    After completing this video, you will be able to describe the various Spark and Hadoop processes that can be performed with Dataproc. FREE ACCESS
  • 2m 34s
    Upon completion of this video, you will be able to recognize the benefits of separating storage and compute services using Cloud Dataproc. FREE ACCESS
  • Locked
    3.  Job Monitoring and Logging
    3m 59s
    Upon completion of this video, you will be able to recall the process of monitoring and logging Dataproc jobs. FREE ACCESS
  • Locked
    4.  SSH into Master and Worker Nodes
    6m 36s
    During this video, you will learn how to apply the process of using an SSH tunnel to connect to the master and worker nodes in a cluster. FREE ACCESS
  • Locked
    5.  Spark REPL
    3m 20s
    In this video, you will learn about the Spark REPL package and how it's used in Linux. FREE ACCESS
  • Locked
    6.  Separation of Compute and Storage
    4m 6s
    Upon completion of this video, you will be able to describe the compute and storage processes, the benefits of their separation, and the virtualized distribution of Hadoop. FREE ACCESS
  • Locked
    7.  BigQuery Features and Capabilities
    3m 53s
    In this video, you will learn how to define BigQuery and its benefits for large-scale analytics. FREE ACCESS
  • Locked
    8.  MapReduce with Big Data
    6m 11s
    After completing this video, you will be able to describe the MapReduce programming model. FREE ACCESS
  • Locked
    9.  Job Submission with Cloud Shell
    7m 39s
    During this video, you will learn how to submit multiple jobs with Dataproc. FREE ACCESS
  • Locked
    10.  Exercise: Dataproc and Cloud Shell Implementations
    10m 38s
    Upon completion of this video, you will be able to recognize the various Dataproc and Cloud Shell job operations and implementations. FREE ACCESS

EARN A DIGITAL BADGE WHEN YOU COMPLETE THIS COURSE

Skillsoft is providing you the opportunity to earn a digital badge upon successful completion on some of our courses, which can be shared on any social network or business platform.

Digital badges are yours to keep, forever.

YOU MIGHT ALSO LIKE

Channel Apache Hadoop
Rating 5.0 of 1 users Rating 5.0 of 1 users (1)

PEOPLE WHO VIEWED THIS ALSO VIEWED THESE

Rating 4.7 of 39 users Rating 4.7 of 39 users (39)
Rating 4.4 of 38 users Rating 4.4 of 38 users (38)