|
|
|||
|
||||
OverviewLarge language models are brilliant - until they need to know something they were never trained on. Ask them about your company's documents, last week's news, your customer's account, or the latest API spec, and the cracks appear: confident answers built on nothing, made-up citations, and silent failures that erode trust. This is the knowledge problem at the heart of modern AI, and it is the reason RAG exists. Retrieval-Augmented Generation is the technique that fixes the gap. Instead of hoping an LLM remembers, RAG lets it look things up - in your docs, your database, your knowledge base - and answer with grounded, current, verifiable information. It has quietly become the backbone of serious AI systems in production, from internal copilots to customer-facing assistants. If you build with LLMs, RAG is no longer optional. This book is your complete, practical guide to RAG, from first principles to production. You will learn how the full pipeline works - chunking, embeddings, vector databases, retrieval strategies, reranking, and generation - and how the pieces fit together into systems that actually perform. You will see why naive RAG breaks, what advanced patterns like hybrid search, query rewriting, and agentic retrieval solve, and how to evaluate, monitor, and harden a RAG system for real users. Safety, ethics, cost, and the hard trade-offs of shipping AI are covered too. Master the foundations: why LLMs hallucinate, how retrieval grounds them, and what makes RAG different from fine-tuning. Understand every layer: chunking strategies, embedding models, vector stores, similarity search, and rerankers - explained clearly. Build real pipelines: from a working naive RAG to advanced patterns like hybrid search, multi-query, HyDE, and agentic retrieval. Ship to production: evaluation frameworks, observability, latency and cost control, and the operational patterns that actually scale. Think clearly about safety: source grounding, citations, prompt injection, data leakage, and the ethics of retrieval-driven AI. Who this book is for: developers building LLM applications, software architects designing AI systems, technical leads evaluating RAG for their teams, and AI practitioners who want to move beyond toy demos. If you have shipped - or want to ship - an AI feature that needs to know things, this book gives you the mental model, the vocabulary, and the patterns to do it right. Full Product DetailsAuthor: Alex ConstantinPublisher: Independently Published Imprint: Independently Published Dimensions: Width: 15.20cm , Height: 1.50cm , Length: 22.90cm Weight: 0.381kg ISBN: 9798195963439Pages: 282 Publication Date: 07 May 2026 Audience: General/trade , General Format: Paperback Publisher's Status: Active Availability: Available To Order We have confirmation that this item is in stock with the supplier. It will be ordered in for you and dispatched immediately. Table of ContentsReviewsAuthor InformationTab Content 6Author Website:Countries AvailableAll regions |
||||