Getting Started with Hive
Apache Hive 2.3.2
| Beginner
- 10 videos | 55m 47s
- Includes Assessment
- Earns a Badge
This 9-video Skillsoft Aspire course focuses solely on theory and involves no programming or query execution. Learners begin by examining what a data warehouse is, and how it differs from a relational database, important because Apache Hive is primarily a data warehouse, despite giving a SQL-like interface to query data. Hive facilitates work on very large data sets, stored as files in the Hadoop Distributed File System, and lets users perform operations in parallel on data in these files by effectively transforming Hive queries into MapReduce operations. Next, you will hear about types of data and operations which data warehouses and relational databases handle, before moving on to basic components of the Hadoop architecture. Finally, the course discusses features of Hive making it popular among data analysts. The concluding exercise recalls differences between online transaction processing and online analytical processing systems, asking learners to identify Hadoop's three major components; list Hadoop offerings on three major cloud platforms (AWS, Microsoft Azure, and Google Cloud Platform); and list benefits of Hive for data analysts.
WHAT YOU WILL LEARN
-
Define what a data warehouse is and identify its characteristicsDescribe the functions served by relational databases and the features they offerDistinguish between online transaction processing and online analytical processing and identify the specific problems they are meant to solveIdentify where hive fits in the hadoop ecosystem and how it simplifies working with hadoopDescribe the architecture of hive and the functions served by hiveserver and the metastore
-
Identify the services and features offered by aws, azure, and gcp to run hadoop and hive on their infrastructureDescribe the different primitive and complex data types available in hiveCompare managed and external tables in hive and how they relate to the underlying dataContrast oltp and olap systems, identify major components of hadoop, explore hive benefits for data analysis
IN THIS COURSE
-
2m 21s
-
4m 54sIn this video, you will learn how to define what a data warehouse is and identify its characteristics. FREE ACCESS
-
4m 49sUpon completion of this video, you will be able to describe the functions served by relational databases and the features they offer. FREE ACCESS
-
7m 3sIn this video, find out how to distinguish between Online Transaction Processing and Online Analytical Processing and identify the specific problems they are meant to solve. FREE ACCESS
-
6m 51sIn this video, you will identify where Hive fits in the Hadoop ecosystem and how it simplifies working with Hadoop. FREE ACCESS
-
7m 38sUpon completion of this video, you will be able to describe the architecture of Hive and the functions served by HiveServer and the Metastore. FREE ACCESS
-
5m 40sIn this video, you will learn how to identify the services and features offered by AWS, Azure, and GCP to run Hadoop and Hive on their infrastructure. FREE ACCESS
-
6m 19sUpon completion of this video, you will be able to describe the different primitive and complex data types available in Hive. FREE ACCESS
-
2m 46sIn this video, you will compare managed and external tables in Hive and how they relate to the data they are based on. FREE ACCESS
-
7m 26sIn this video, you will learn how to contrast OLTP and OLAP systems, identify major components of Hadoop, and explore Hive benefits for data analysis. FREE ACCESS
EARN A DIGITAL BADGE WHEN YOU COMPLETE THIS COURSE
Skillsoft is providing you the opportunity to earn a digital badge upon successful completion on some of our courses, which can be shared on any social network or business platform.
Digital badges are yours to keep, forever.