Large Language Model-Based Solutions: How to Deliver Value with Cost-Effective Generative AI Applications

5h 3m
Shreyas Subramanian
John Wiley & Sons (US)
2024

Learn to build cost-effective apps using Large Language Models

In Large Language Model-Based Solutions: How to Deliver Value with Cost-Effective Generative AI Applications, Principal Data Scientist at Amazon Web Services, Shreyas Subramanian, delivers a practical guide for developers and data scientists who wish to build and deploy cost-effective large language model (LLM)-based solutions. In the book, you'll find coverage of a wide range of key topics, including how to select a model, pre- and post-processing of data, prompt engineering, and instruction fine tuning.

The author sheds light on techniques for optimizing inference, like model quantization and pruning, as well as different and affordable architectures for typical generative AI (GenAI) applications, including search systems, agent assists, and autonomous agents. You'll also find:

Effective strategies to address the challenge of the high computational cost associated with LLMs
Assistance with the complexities of building and deploying affordable generative AI apps, including tuning and inference techniques
Selection criteria for choosing a model, with particular consideration given to compact, nimble, and domain-specific models

Perfect for developers and data scientists interested in deploying foundational models, or business leaders planning to scale out their use of GenAI, Large Language Model-Based Solutions will also benefit project leaders and managers, technical support staff, and administrators with an interest or stake in the subject.

About the Author

SHREYAS SUBRAMANIAN, PhD, is a principal data scientist at AWS, one of the largest organizations building and providing large language models for enterprise use. He is currently advising both internal Amazon teams and large enterprise customers on building, tuning, and deploying Generative AI applications at scale. Shreyas runs machine learning-focused cost optimization workshops, helping them reduce the costs of machine learning applications on the cloud. Shreyas also actively participates in cutting-edge research and development of advanced training, tuning and deployment techniques for foundation models.

In this Book

Introduction
Introduction
Tuning Techniques for Cost Optimization
Inference Techniques for Cost Optimization
Model Selection and Alternatives
Infrastructure and Deployment Tuning Strategies
Conclusion

FREE ACCESS

Course Generative AI Foundations: Advanced Generative AI Techniques for IT

(8)

Course Exploring AI Builder models in Power Apps 2024

(3)

Course Generative AI on AWS: Building GenAI Models with Amazon SageMaker

(1)

Get Started

Sharpen your skills. Upgrade your career. Find the right learning path for you, based on your role and skills. Take part in hands-on practice, study for a certification, and much more - all personalized for you.

*Not included: Compliance, Leadership Development Program content, and Engineering books

Your content + our content + our platform = a path to learning success

Using our learning experience platform, Percipio, your learners can engage in custom learning paths that can feature curated content from all sources.

Learn More

Aspire to something bigger

Aspire Journeys are guided learning paths that set you in motion for career success.

Browse Aspire Journeys

Explore a world of live learning with Global Knowledge

Choose from convenient delivery formats to get the training you and your team need - where, when and how you want it.

Browse Live Learning

IT Skills and Salary Report

ESG Impact Report

Large Language Model-Based Solutions: How to Deliver Value with Cost-Effective Generative AI Applications

In this Book

YOU MIGHT ALSO LIKE