Spark for High-speed Big Data Analytics
Big Data
| Beginner
- 12 videos | 45m 51s
- Includes Assessment
- Earns a Badge
Spark is an open-source, massively parallel, in-memory solution that allows you to run big data analytics pipelines at high speed. Use this course to learn how Apache Spark works and gain an understanding of its architecture. As you progress, investigate the industry-leading examples of Uber and Alibaba to recognize how Spark can add business value to data in many industry types. Moving along, compare the functionality of Spark and Hadoop in relation to use cases, identifying when using Spark is most advantageous. Finally, explore fundamental Spark characteristics, optimization techniques, and best practices. When you've completed this course, you'll have a solid theoretical understanding of how and when to use Apache Spark for specific big data analytics tasks.
WHAT YOU WILL LEARN
-
Discover the key concepts covered in this courseRecognize how spark offers an open-source, scalable, massively parallel, in-memory solution for analytics applicationsOutline the two main components of the spark architecture: resilient distributed dataset and directed acyclic graphDescribe how spark is providing business value to uberDescribe how spark is providing business value to alibabaDescribe how spark is providing business value to the healthcare industry
-
Compare and name the main differences between spark and hadoop with respect to ease of use, latency, security, and costSpecify in which scenarios and conditions spark is a better choice than its alternativesList the main features of spark, such as loading behavior, file formats, parallelism, cache, data skewsName the most important performance optimization techniques in apache spark, such as file format selection, level of parallelism, and api selectionName simple best practices when using spark, like starting small or resolving skewnessSummarize the key concepts covered in this course
IN THIS COURSE
-
1m 55s
-
6m
-
4m 10s
-
4m 52s
-
4m 29s
-
3m 4s
-
3m 31sIn this video, you'll compare and name the main differences between Spark and Hadoop with respect to ease of use, latency, security, and cost. You'll learn that both Hadoop and Spark are popular choices in the marketplace. Here, you'll discover more about the major differences between Hadoop and Spark. FREE ACCESS
-
5m 23s
-
4m 12s
-
3m 42s
-
3m 26s
-
1m 10s
EARN A DIGITAL BADGE WHEN YOU COMPLETE THIS COURSE
Skillsoft is providing you the opportunity to earn a digital badge upon successful completion on some of our courses, which can be shared on any social network or business platform.
Digital badges are yours to keep, forever.