Book description
Ready to build real-world applications with large language models? With the pace of improvements over the past year, LLMs have become good enough for use in real-world applications. LLMs are also broadly accessible, allowing practitioners besides ML engineers and scientists to build intelligence into their products.
In this report, six experts in AI and machine learning present crucial, yet often neglected, ML lessons and methodologies essential for developing products based on LLMs. Awareness of these concepts can give you a competitive advantage against most others in the field.
Over the past year, authors Eugene Yan, Brian Bischof, Charles Frye, Hamel Husain, Jason Liu, and Shreya Shankar have been busy testing and refining these methodologies by building real-world applications on top of LLMs. In this report, they have distilled these lessons for the benefit of the community.
Table of contents
- Introduction
-
1. Tactics: The Emerging LLM Stack
- Prompting
- Information Retrieval/RAG
- Tuning and Optimizing Workflows
-
Evaluation and Monitoring
- Create a Few Assertion-Based Unit Tests From Real Input/Output Samples
- LLM-as-Judge Can Work (Somewhat), but It’s Not a Silver Bullet
- The “Intern Test” for Evaluating Generations
- Overemphasizing Certain Evals Can Hurt Overall Performance
- Simplify Annotation to Binary Tasks or Pairwise Comparisons
- (Reference-Free) Evals and Guardrails Can Be Used Interchangeably
- LLMs Will Return Output Even When They Shouldn’t
- Hallucinations Are a Stubborn Problem
- 2. Operations: Developing and Managing LLM Applications and the Teams That Build Them
- 3. Strategy: Building with LLMs without Getting Outmaneuvered
- About the Authors
Product information
- Title: What We Learned from a Year of Building with LLMs
- Author(s):
- Release date: July 2024
- Publisher(s): O'Reilly Media, Inc.
- ISBN: 9781098176709
You might also like
book
Building Event-Driven Microservices
Organizations today often struggle to balance business requirements with ever-increasing volumes of data. Additionally, the demand …
book
What Are AI Agents?
AI agents represent the latest milestone in humanity's computational toolbox. Powered by large language models (LLMs) …
book
What Is Generative AI?
ChatGPT, Midjourney, Stable Diffusion, and LLaMA are quickly becoming household names. These tools and many more …
book
What Is LLMOps?
Large language models (LLMs), a subcategory of generative AI, have taken the world by storm. Commonly …