Book description
Master retrieval-augmented generation architecture and fine-tune your AI stack, along with discovering real-world use cases and best practices to create powerful AI apps
Key Features
- Get to grips with the fundamentals of LLMs, vector databases, and Python frameworks
- Implement effective retrieval-augmented generation strategies with MongoDB Atlas
- Optimize AI models for performance and accuracy with model compression and deployment optimization
- Purchase of the print or Kindle book includes a free PDF eBook
Book Description
The era of generative AI is upon us, and this book serves as a roadmap to harness its full potential. With its help, you’ll learn the core components of the AI stack: large language models (LLMs), vector databases, and Python frameworks, and see how these technologies work together to create intelligent applications.
The chapters will help you discover best practices for data preparation, model selection, and fine-tuning, and teach you advanced techniques such as retrieval-augmented generation (RAG) to overcome common challenges, such as hallucinations and data leakage. You’ll get a solid understanding of vector databases, implement effective vector search strategies, refine models for accuracy, and optimize performance to achieve impactful results. You’ll also identify and address AI failures to ensure your applications deliver reliable and valuable results. By evaluating and improving the output of LLMs, you’ll be able to enhance their performance and relevance.
By the end of this book, you’ll be well-equipped to build sophisticated AI applications that deliver real-world value.
What you will learn
- Understand the architecture and components of the generative AI stack
- Explore the role of vector databases in enhancing AI applications
- Master Python frameworks for AI development
- Implement Vector Search in AI applications
- Find out how to effectively evaluate LLM output
- Overcome common failures and challenges in AI development
Who this book is for
This book is for software engineers and developers looking to build intelligent applications using generative AI. While the book is suitable for beginners, a basic understanding of Python programming is required to make the most of it.
Table of contents
- Preface
- Chapter 1: Getting Started with Generative AI
- Chapter 2: Building Blocks of Intelligent Applications
- Part 1: Foundations of AI: LLMs, Embedding Models, Vector Databases, and Application Design
- Chapter 3: Large Language Models
- Chapter 4: Embedding Models
- Chapter 5: Vector Databases
- Chapter 6: AI/ML Application Design
- Part 2: Building Your Python Application: Frameworks, Libraries, APIs, and Vector Search
- Chapter 7: Useful Frameworks, Libraries, and APIs
- Chapter 8: Implementing Vector Search in AI Applications
- Part 3: Optimizing AI Applications: Scaling, Fine-Tuning, Troubleshooting, Monitoring, and Analytics
- Chapter 9: LLM Output Evaluation
- Chapter 10: Refining the Semantic Data Model to Improve Accuracy
- Chapter 11: Common Failures of Generative AI
- Chapter 12: Correcting and Optimizing Your Generative AI Application
- Appendix: Further Reading: Index
- Other Books You May Enjoy
Product information
- Title: Building AI Intensive Python Applications
- Author(s):
- Release date: September 2024
- Publisher(s): Packt Publishing
- ISBN: 9781836207252
You might also like
book
Python Real-World Projects
Develop Python applications using an enterprise-based approach with unit and acceptance tests by following agile methods …
book
The Well-Grounded Python Developer
If you’re new to Python, it can be tough to understand when, where, and how to …
article
Run Llama-2 Models Locally with llama.cpp
Llama is Meta’s answer to the growing demand for LLMs. Unlike its well-known technological relative, ChatGPT, …
book
Machine Learning with Python Cookbook, 2nd Edition
This practical guide provides more than 200 self-contained recipes to help you solve machine learning challenges …