Large Language Models and Key Metrics

AI, large language models | Beginner

15 videos | 1h 38m
Includes Assessment
Earns a Badge

Large language models (LLMs) are revolutionizing the landscape of artificial intelligence by enhancing how machines understand and generate human language. Models like GPT and BERT can grasp context, meaning, and expression in ways never before possible. Use this course to deepen your understanding of evaluation metrics like HELM, BLEU, ROUGE, and F1 scores, and learn to distinguish between intrinsic and extrinsic evaluation methods. Explore practical examples of evaluating LLMs, reviewing both public and in-house models, and discover their pros and cons. You'll also learn key considerations for selecting an LLM, focusing on scalability and task-specific performance, and aligning evaluation metrics with business needs for effective model selection. Upon completion of this course, you will have a strong foundational understanding of large language models and the key metrics used to evaluate them.

WHAT YOU WILL LEARN

Discover the key concepts covered in this course

Describe the architecture of large language models (llms)

Define the key concepts of large language models (llms)

Describe the core principles of large language models (llms)

Identify the strengths and weaknesses of different llms, including in-house and public models

Recognize key criteria to consider when selecting an llm, including computational resources, scalability, and task-specific performance

Identify common llm evaluation metrics such as helm evaluation, eleuther’s test, perplexity, bleu score (bilingual evaluation understudy), rouge score (recall-oriented understudy for gisting evaluation), f1 score, accuracy, loss function, human evaluation, glue and superglue benchmarks, and latency

Differentiate between intrinsic and extrinsic evaluation methods for llms
Outline how evaluation metrics align with different business use cases and criteria for model selection

Explore a simple llm architecture and its language processing capabilities

Identify the features, pros, and cons of a public large language model

Identify the features, pros, and cons of an in-house large language model

Evaluate a pre-trained llm using the common evaluation metrics

Use intrinsic metrics (e.g., perplexity) and extrinsic metrics (e.g., task accuracy) using sample tasks

Summarize the key concepts covered in this course

IN THIS COURSE

1m 41s

In this video, we will discover the key concepts covered in this course. FREE ACCESS
6m 48s

After completing this video, you will be able to describe the architecture of large language models (LLMs). FREE ACCESS
3. Large Language Model Concepts

7m 14s

Upon completion of this video, you will be able to define the key concepts of large language models (LLMs). FREE ACCESS
4. Large Language Model Principles

7m 6s

After completing this video, you will be able to describe the core principles of large language models (LLMs). FREE ACCESS
5. Large Language Model Types

7m 46s

Upon completion of this video, you will be able to identify the strengths and weaknesses of different LLMs, including in-house and public models. FREE ACCESS
6. Large Language Model Selection Criteria

7m 3s

After completing this video, you will be able to recognize key criteria to consider when selecting an LLM, including computational resources, scalability, and task-specific performance. FREE ACCESS
7. Large Language Model Metrics

7m 18s

In this video, we will identify common LLM evaluation metrics such as HELM evaluation, Eleuther’s test, Perplexity, BLEU Score (Bilingual Evaluation Understudy), ROUGE Score (Recall-Oriented Understudy for Gisting Evaluation), F1 Score, Accuracy, Loss Function, Human Evaluation, GLUE and SuperGLUE Benchmarks, and latency. FREE ACCESS
8. Large Language Model Evaluation Methods

6m 36s

After completing this video, you will be able to differentiate between intrinsic and extrinsic evaluation methods for LLMs. FREE ACCESS
9. Large Language Model Metrics Alignment

6m 48s

Upon completion of this video, you will be able to outline how evaluation metrics align with different business use cases and criteria for model selection. FREE ACCESS
10. Exploring Large Language Model Design

9m 51s

In this video, we will explore a simple LLM architecture and its language processing capabilities. FREE ACCESS
11. Public Large Language Models

6m 14s

Upon completion of this video, you will be able to identify the features, pros, and cons of a public large language model. FREE ACCESS
12. In-House Large Language Models

7m 12s

In this video, we will identify the features, pros, and cons of an in-house large language model. FREE ACCESS
13. Evaluating Large Language Models

7m 54s

In this video, discover how to evaluate a pre-trained LLM using the common evaluation metrics. FREE ACCESS
14. Using Large Language Model Intrinsic Metrics

6m 50s

In this video, discover how to use intrinsic metrics (e.g., perplexity) and extrinsic metrics (e.g., task accuracy) using sample tasks. FREE ACCESS
15. Course Summary

1m 37s

In this video, we will summarize the key concepts covered in this course. FREE ACCESS

EARN A DIGITAL BADGE WHEN YOU COMPLETE THIS COURSE

Skillsoft is providing you the opportunity to earn a digital badge upon successful completion on some of our courses, which can be shared on any social network or business platform.

Digital badges are yours to keep, forever.

Course Generative Modeling Foundations

(1)

Course Systems Engineering Process: System Analysis & Modeling

(2)

Book Knowledge-augmented Methods for Natural Language Processing

Get Started

Sharpen your skills. Upgrade your career. Find the right learning path for you, based on your role and skills. Take part in hands-on practice, study for a certification, and much more - all personalized for you.

*Not included: Compliance, Leadership Development Program content, and Engineering books

Your content + our content + our platform = a path to learning success

Using our learning experience platform, Percipio, your learners can engage in custom learning paths that can feature curated content from all sources.

Learn More

Aspire to something bigger

Aspire Journeys are guided learning paths that set you in motion for career success.

Browse Aspire Journeys

Explore a world of live learning with Global Knowledge

Choose from convenient delivery formats to get the training you and your team need - where, when and how you want it.

Browse Live Learning

IT Skills and Salary Report

ESG Impact Report

Large Language Models and Key Metrics

WHAT YOU WILL LEARN

IN THIS COURSE

EARN A DIGITAL BADGE WHEN YOU COMPLETE THIS COURSE

YOU MIGHT ALSO LIKE