Data Pipeline: Using Frameworks for Advanced Data Management
Data Pipeline
| Intermediate
- 10 videos | 32m 1s
- Includes Assessment
- Earns a Badge
Discover how to implement data pipelines using Python Luigi, integrate Spark and Tableau to manage data pipelines, use Dask arrays, and build data pipeline visualization with Python in this 10-video course. Begin by learning about features of Celery and Luigi that can be used to set up data pipelines, then how to implement Python Luigi to set up data pipelines. Next, turn to working with Dask library, after listing the essential features provided by Dask from the perspective of task scheduling and big data collections. Learn about implementation of Dask arrays to manage NumPy application programming interfaces (APIs). Explore frameworks that can be used to implement data exploration and visualization in data pipelines. Integrate Spark and Tableau to manage data pipelines. Move on to streaming data visualization with Python, using Python to build visualizations for streaming data. Then learn about the data pipeline building capabilities provided by Kafka, Spark, and PySpark. The concluding exercise involves setting up Luigi to implement data pipelines, Spark and Tableau integration, and building pipelines with Python.
WHAT YOU WILL LEARN
-
Recognize the features of celery and luigi that can be used to set up data pipelinesImplement python luigi in order to set up data pipelinesList dask task scheduling and big data collection featuresImplement dask arrays in order to manage numpy apisList frameworks that can be used to implement data exploration and visualization in data pipelines
-
Integrate spark and tableau to manage data pipelinesUse python to build visualizations for streaming dataRecognize the data pipeline building capabilities provided by kafka, spark, and pysparkSet up luigi to implement data pipelines, integrate spark and tableau for data pipeline management, and build visualizations for data pipelines using python
IN THIS COURSE
-
1m 34s
-
3m 45sAfter completing this video, you will be able to recognize the features of Celery and Luigi that can be used to set up data pipelines. FREE ACCESS
-
3m 38sIn this video, you will learn how to use Python Luigi to set up data pipelines. FREE ACCESS
-
3m 11sAfter completing this video, you will be able to list Dask's task scheduling and big data collection features. FREE ACCESS
-
3m 59sIn this video, learn how to use Dask arrays to manage NumPy APIs. FREE ACCESS
-
3m 46sUpon completion of this video, you will be able to list frameworks that can be used to implement data exploration and visualization in data pipelines. FREE ACCESS
-
2m 26sDuring this video, you will learn how to integrate Spark and Tableau to manage data pipelines. FREE ACCESS
-
2m 51sFind out how to use Python to build visualizations for data streaming. FREE ACCESS
-
3m 45sAfter completing this video, you will be able to recognize the data pipeline building capabilities provided by Kafka, Spark, and PySpark. FREE ACCESS
-
3m 7sIn this video, you will set up Luigi to implement data pipelines, integrate Spark and Tableau for data pipeline management, and build visualizations for data pipelines using Python. FREE ACCESS
EARN A DIGITAL BADGE WHEN YOU COMPLETE THIS COURSE
Skillsoft is providing you the opportunity to earn a digital badge upon successful completion on some of our courses, which can be shared on any social network or business platform.
Digital badges are yours to keep, forever.