Introduction to the Shell for Hadoop HDFS
Apache Hadoop | Beginner
- 9 videos | 52m 36s
- Includes Assessment
- Earns a Badge
In this Skillsoft Aspire course, learners discover how to set up a Hadoop cluster on the cloud and explore two bundled web apps: the YARN Cluster Manager app and the HDFS (Hadoop Distributed File System) NameNode UI. This 9-video course assumes a good understanding of what Hadoop is and how HDFS enables parallel processing of big data by distributing large data sets across a cluster; learners should also be comfortable running commands from the Linux shell, with some fluency in basic Linux file system commands. The course opens by exploring two web applications that are packaged with Hadoop: the UI for the YARN cluster manager and the NameNode UI for HDFS. Learners then explore two shells that can be used to work with HDFS, the hadoop fs shell and the hdfs dfs shell. Next, learners explore basic commands for navigating HDFS, discuss their similarities with Linux file system commands, and discuss distributed computing. In a closing exercise, learners practice identifying the web applications used to explore and monitor Hadoop.
WHAT YOU WILL LEARN
- Provision a Hadoop cluster on the cloud using the Google Cloud Platform's Dataproc service
- Identify the various GCP services used by Dataproc when provisioning a cluster
- List the metrics available on the YARN Cluster Manager app and recognize how it can be useful to monitor job executions
- Recall the details and metrics of HDFS available on the NameNode web app and how it can be used to browse the file system
- Identify the tools of the Hadoop ecosystem which are packaged with Hadoop and recall how they can be accessed
- Configure HDFS using the hdfs-site.xml file and identify the properties which can be set from it
- Compare the hadoop fs and hdfs dfs shells and recognize their similarities to Linux shells
- Explore apps for Hadoop, configure HDFS, work with HDFS shells
IN THIS COURSE
- 2m 19s
- 9m 38s: In this video, you will learn how to provision a Hadoop cluster on the cloud using Google Cloud Platform's Dataproc service. FREE ACCESS
- 3m 56s: In this video, you will learn how to identify the various GCP services used by Dataproc when provisioning a cluster. FREE ACCESS
- 9m 3s: After completing this video, you will be able to list the metrics available on the YARN Cluster Manager app and understand how it can be useful to monitor job executions. FREE ACCESS
- 7m 3s: Upon completion of this video, you will be able to recall the details and metrics of HDFS available on the NameNode web app and how to browse the file system. FREE ACCESS
- 4m 29s: In this video, you will identify the tools of the Hadoop ecosystem that are packaged with Hadoop and recall how they can be accessed. FREE ACCESS
- 4m 48s: Find out how to configure HDFS using the hdfs-site.xml file and identify the properties that can be set from it. FREE ACCESS
- 5m 36s: In this video, you will learn how to compare the hadoop fs and hdfs dfs shells and recognize their similarities to the Linux shells. FREE ACCESS
- 5m 47s: During this video, you will learn how to explore apps for Hadoop, configure HDFS, and work with HDFS shells. FREE ACCESS
EARN A DIGITAL BADGE WHEN YOU COMPLETE THIS COURSE
Skillsoft gives you the opportunity to earn a digital badge upon successful completion of some of our courses. Badges can be shared on any social network or business platform.
Digital badges are yours to keep, forever.