MLOps with Data Version Control: Working with Pipelines & DVCLive
MLOps 2023
| Beginner
- 17 videos | 2h 11m 52s
- Includes Assessment
- Earns a Badge
Data Version Control (DVC) pipelines enable the construction of end-to-end data processing workflows, connecting data and code stages while maintaining version control. DVCLive is a Python library for logging machine learning metrics in simple file formats and is fully compatible with DVC. In this course, you will configure and employ pipelines in DVC and modularize and coordinate each step, while leveraging the dvc.yaml file for stage management and the dvc.lock file for project consistency. Next, you will dive into practical DVC utilization with Jupyter notebooks. You will track model parameters, metrics, and artifacts via Python code's log statements using DVCLive. Then you will explore the user-friendly Iterative Studio interface. Finally, you will leverage DVCLive for comprehensive model experimentation. By pushing experiment files to DVC and employing Git branches, you will manage parallel developments. You will pull requests to streamline merging experiment branches and register model artifacts with the Iterative Studio registry. This course will equip you with the foundational knowledge of DVC and enable you to automate the tracking of model metrics and parameters with DVCLive.
WHAT YOU WILL LEARN
-
Discover the key concepts covered in this courseSet up an ml pipeline stageAdd a stage to an ml pipelineUse the dvc.lock fileExecute a dvc pipelineSet up dvc in a jupyter notebookSet up an iterative studio projectVisualize data for mlLog plots to dvclive
-
Log and track images in dvcliveTrack ml experiments using dvclivePush experiment files to dvc and githubMerge a branch with main in githubRun and track a k-nearest neighbor (knn) regression modelLog a model as an artifactRegister a model in the iterative studio registrySummarize the key concepts covered in this course
IN THIS COURSE
-
2m 5sIn this video, we will discover the key concepts covered in this course. FREE ACCESS
-
11m 40sDuring this video, you will learn how to set up an ML pipeline stage. FREE ACCESS
-
10m 3sIn this video, find out how to add a stage to an ML pipeline. FREE ACCESS
-
3m 43sDiscover how to use the dvc.lock file. FREE ACCESS
-
7m 37sFind out how to execute a DVC pipeline. FREE ACCESS
-
7m 52sIn this video, you will learn how to set up DVC in a Jupyter Notebook. FREE ACCESS
-
9m 59sDuring this video, discover how to set up an Iterative Studio project. FREE ACCESS
-
4m 28sIn this video, find out how to visualize data for ML. FREE ACCESS
-
12m 31sLearn how to log plots to DVCLive. FREE ACCESS
-
7m 42sDuring this video, discover how to log and track images in DVCLive. FREE ACCESS
-
10m 13sIn this video, you will learn how to track ML experiments using DVCLive. FREE ACCESS
-
9m 21sFind out how to push experiment files to DVC and GitHub. FREE ACCESS
-
7m 8sDiscover how to merge a branch with main in GitHub. FREE ACCESS
-
6m 7sDuring this video, you will learn how to run and track a k-Nearest Neighbor (kNN) regression model. FREE ACCESS
-
11m 33sFind out how to log a model as an artifact. FREE ACCESS
-
7m 45sIn this video, discover how to register a model in the Iterative Studio registry. FREE ACCESS
-
2m 6sIn this video, we will summarize the key concepts covered in this course. FREE ACCESS
EARN A DIGITAL BADGE WHEN YOU COMPLETE THIS COURSE
Skillsoft is providing you the opportunity to earn a digital badge upon successful completion on some of our courses, which can be shared on any social network or business platform.
Digital badges are yours to keep, forever.