Processing Data: Integrating Kafka with Apache Spark
Apache Kafka
| Intermediate
- 12 videos | 1h 45m 38s
- Includes Assessment
- Earns a Badge
Flexible and Intuitive, DataFrames are a popular data structure in data analytics. In this course, build Spark applications that process data streamed to Kafka topics using DataFrames. Begin by setting up a simple Spark app that streams in messages from a Kafka topic, processes and transforms them, and publishes them to an output sink. Next, leverage the Spark DataFrame application programming interface by performing selections, projections, and aggregations on data streamed in from Kafka, while also exploring the use of SQL queries for those transformations. Finally, you will perform windowing operations - both tumbling windows, where the windows do not overlap, and sliding windows, where there is some overlapping of data.
WHAT YOU WILL LEARN
-
Discover the key concepts covered in this courseBuild a spark application that reads from a kafka topicManipulate streaming data and publish the output to the consoleSubscribe to multiple kafka topics from a spark applicationWrite an app that generates data to periodically send to a kafka topicDevelop a spark application that publishes transformed data to a kafka topic
-
Transform streaming data with spark sqlPerform aggregations on spark dataframes and order their contentsPerform group by, aggregations, and orderingDescribe what windows are in the context of spark streaming and define them using dataframesDefine operations on tumbling and sliding windowsSummarize the key concepts covered in this course
IN THIS COURSE
-
1m 5s
-
10m 12s
-
10m
-
11m 35s
-
11m
-
10m 25s
-
7m 28s
-
10m 49s
-
10m
-
12m 6s
-
8m 41s
-
2m 17s
EARN A DIGITAL BADGE WHEN YOU COMPLETE THIS COURSE
Skillsoft is providing you the opportunity to earn a digital badge upon successful completion on some of our courses, which can be shared on any social network or business platform.
Digital badges are yours to keep, forever.