Using OpenAI APIs: Using Image & Audio APIs

Generative AI | Intermediate

9 videos | 1h 12m 52s
Includes Assessment
Earns a Badge

(1)

DALL-E and Whisper are OpenAI's image and audio-based model offerings. DALL-E, an image generation model, demonstrates the ability to create visually striking images based on textual prompts. Whisper represents a state-of-the-art automatic speech recognition (ASR) system. With its high accuracy in transcribing spoken words, Whisper finds utility in various applications, from voice assistants to transcription services. You will begin this course by generating images using OpenAI's DALL-E model. You will generate images using text prompts, create variations of existing images, and perform image inpainting using natural language. Then, you will work with the Whisper model, which caters to speech transcription and translation. You will transcribe and translate audio in different languages and accents, and you will evaluate the performance of these models.

WHAT YOU WILL LEARN

Discover the key concepts covered in this course

Generate images using dall-e

Create image variations and perform inpainting

Transcribe clips of audio

Perform translation and text-to-speech conversion
Evaluate audio transcription

Set up the whisper model locally

Interpret images with the chat application programming interface (api)

Summarize the key concepts covered in this course

IN THIS COURSE

1m 27s

In this video, we will discover the key concepts covered in this course. FREE ACCESS
10m 51s

Learn how to generate images using DALL-E. FREE ACCESS
3. Working with Image Variations and Inpainting

11m 24s

In this video, find out how to create image variations and perform inpainting. FREE ACCESS
4. Performing Audio Transcription

10m 19s

During this video, discover how to transcribe clips of audio. FREE ACCESS
5. Performing Translation and Text-to-Speech Conversion

6m 43s

In this video, you will learn how to perform translation and text-to-speech conversion. FREE ACCESS
6. Evaluating Transcribed Audio

10m 37s

Find out how to evaluate audio transcription. FREE ACCESS
7. Installing and Using the Whisper Model Locally

9m 42s

Discover how to set up the Whisper model locally. FREE ACCESS
8. Using Chat Completions to Interpret Images

9m 53s

Learn how to interpret images with the chat application programming interface (API). FREE ACCESS
9. Course Summary

1m 57s

In this video, we will summarize the key concepts covered in this course. FREE ACCESS

EARN A DIGITAL BADGE WHEN YOU COMPLETE THIS COURSE

Skillsoft is providing you the opportunity to earn a digital badge upon successful completion on some of our courses, which can be shared on any social network or business platform.

Digital badges are yours to keep, forever.

Course Using OpenAI APIs: Exploring APIs with the OpenAI Playground

(3)

Course Introduction to AI-powered Image Generation

(2)

Course Code, Relatedness, & Fine-tuning with OpenAI

(2)

Get Started

Sharpen your skills. Upgrade your career. Find the right learning path for you, based on your role and skills. Take part in hands-on practice, study for a certification, and much more - all personalized for you.

*Not included: Compliance, Leadership Development Program content, and Engineering books

Your content + our content + our platform = a path to learning success

Using our learning experience platform, Percipio, your learners can engage in custom learning paths that can feature curated content from all sources.

Learn More

Aspire to something bigger

Aspire Journeys are guided learning paths that set you in motion for career success.

Browse Aspire Journeys

Explore a world of live learning with Global Knowledge

Choose from convenient delivery formats to get the training you and your team need - where, when and how you want it.

Browse Live Learning

IT Skills and Salary Report

ESG Impact Report

Using OpenAI APIs: Using Image & Audio APIs

WHAT YOU WILL LEARN

IN THIS COURSE

EARN A DIGITAL BADGE WHEN YOU COMPLETE THIS COURSE

YOU MIGHT ALSO LIKE