Large Language Models and Key Metrics
AI, large language models
| Beginner
- 15 videos | 1h 38m
- Earns a Badge
Large language models (LLMs) are revolutionizing the landscape of artificial intelligence by enhancing how machines understand and generate human language. Models like GPT and BERT can grasp context, meaning, and expression in ways never before possible. Use this course to deepen your understanding of evaluation metrics like HELM, BLEU, ROUGE, and F1 scores, and learn to distinguish between intrinsic and extrinsic evaluation methods. Explore practical examples of evaluating LLMs, reviewing both public and in-house models, and discover their pros and cons. You'll also learn key considerations for selecting an LLM, focusing on scalability and task-specific performance, and aligning evaluation metrics with business needs for effective model selection. Upon completion of this course, you will have a strong foundational understanding of large language models and the key metrics used to evaluate them.
WHAT YOU WILL LEARN
-
Discover the key concepts covered in this courseDescribe the architecture of large language models (llms)Define the key concepts of large language models (llms)Describe the core principles of large language models (llms)Identify the strengths and weaknesses of different llms, including in-house and public modelsRecognize key criteria to consider when selecting an llm, including computational resources, scalability, and task-specific performanceIdentify common llm evaluation metrics such as helm evaluation, eleuther’s test, perplexity, bleu score (bilingual evaluation understudy), rouge score (recall-oriented understudy for gisting evaluation), f1 score, accuracy, loss function, human evaluation, glue and superglue benchmarks, and latencyDifferentiate between intrinsic and extrinsic evaluation methods for llms
-
Outline how evaluation metrics align with different business use cases and criteria for model selectionExplore a simple llm architecture and its language processing capabilitiesIdentify the features, pros, and cons of a public large language modelIdentify the features, pros, and cons of an in-house large language modelEvaluate a pre-trained llm using the common evaluation metricsUse intrinsic metrics (e.g., perplexity) and extrinsic metrics (e.g., task accuracy) using sample tasksSummarize the key concepts covered in this course
IN THIS COURSE
-
1m 41sIn this video, we will discover the key concepts covered in this course. FREE ACCESS
-
6m 48sAfter completing this video, you will be able to describe the architecture of large language models (LLMs). FREE ACCESS
-
7m 14sUpon completion of this video, you will be able to define the key concepts of large language models (LLMs). FREE ACCESS
-
7m 6sAfter completing this video, you will be able to describe the core principles of large language models (LLMs). FREE ACCESS
-
7m 46sUpon completion of this video, you will be able to identify the strengths and weaknesses of different LLMs, including in-house and public models. FREE ACCESS
-
7m 3sAfter completing this video, you will be able to recognize key criteria to consider when selecting an LLM, including computational resources, scalability, and task-specific performance. FREE ACCESS
-
7m 18sIn this video, we will identify common LLM evaluation metrics such as HELM evaluation, Eleuther’s test, Perplexity, BLEU Score (Bilingual Evaluation Understudy), ROUGE Score (Recall-Oriented Understudy for Gisting Evaluation), F1 Score, Accuracy, Loss Function, Human Evaluation, GLUE and SuperGLUE Benchmarks, and latency. FREE ACCESS
-
6m 36sAfter completing this video, you will be able to differentiate between intrinsic and extrinsic evaluation methods for LLMs. FREE ACCESS
-
6m 48sUpon completion of this video, you will be able to outline how evaluation metrics align with different business use cases and criteria for model selection. FREE ACCESS
-
9m 51sIn this video, we will explore a simple LLM architecture and its language processing capabilities. FREE ACCESS
-
6m 14sUpon completion of this video, you will be able to identify the features, pros, and cons of a public large language model. FREE ACCESS
-
7m 12sIn this video, we will identify the features, pros, and cons of an in-house large language model. FREE ACCESS
-
7m 54sIn this video, discover how to evaluate a pre-trained LLM using the common evaluation metrics. FREE ACCESS
-
6m 50sIn this video, discover how to use intrinsic metrics (e.g., perplexity) and extrinsic metrics (e.g., task accuracy) using sample tasks. FREE ACCESS
-
1m 37sIn this video, we will summarize the key concepts covered in this course. FREE ACCESS
EARN A DIGITAL BADGE WHEN YOU COMPLETE THIS COURSE
Skillsoft is providing you the opportunity to earn a digital badge upon successful completion on some of our courses, which can be shared on any social network or business platform.
Digital badges are yours to keep, forever.