|
|
|||
|
||||
OverviewYou've built the prototype. The demo worked. Your team was optimistic. Then you tried to turn it into a product, and everything got harder. How much time have you spent staring at a response that was almost right - and could not explain why the next one was not? How often have you watched a retrieval system return seventeen perfect matches on Monday and six irrelevant results on Tuesday, with nothing in your code having changed? How many times have you shipped an improvement for one user and discovered, after deployment, that it broke something for another? The gap between a working prototype and a production LLM application is not a matter of polish. It is a different discipline. This is the book for engineers, architects, and technical leaders crossing that gap. Not another introduction to prompting. Not a survey of vendor APIs. A practical, opinionated guide to the stack that runs when real users send real requests - the retrieval pipeline that finds the right documents the first time, the evaluation harness that tells you whether yesterday's fix was an improvement or a regression, the observability layer that lets you debug failures you cannot reproduce, the cost controls that keep your system economically viable as traffic grows. You'll learn how to design an ingestion pipeline that handles messy sources without silently corrupting meaning, how to choose between managed and self-hosted vector databases based on actual workload characteristics, why hybrid search beats vector search alone, and when rerankers earn their keep. You'll see agent architectures with their honest failure modes, prompt engineering as a versioned discipline rather than a collection of tricks, structured outputs handled so downstream code does not break when the model improvises, evaluation that survives contact with reality, and cost engineering that turns a runaway expense into a managed line item. Every chapter is built from what actually works, not what theory suggests should work. The harder truth, which most books on this subject avoid: the LLM stack you build today is not the stack you will run in three years. The models will change. The frameworks will churn. A book that tells you to commit deeply to any specific vendor is selling you confidence, not expertise. This book gives you something more durable: the judgment to recognize which parts of your system should change, which should stay stable, and how to build so the inevitable migration is a measured decision rather than a forced evacuation. You will not find hype here. You will not find shortcuts. You will find specific prose about specific problems - where a section on retrieval names real failure modes and tells you which metric catches each one, where a section on security names the categories of prompt injection attack and the defenses that actually work against each. This is not a book for people who want to be impressed by AI. It is a book for people who need to ship. Most LLM projects fail not because the technology was not ready, but because teams treated the work as a prototype to scale up rather than a new engineering discipline to learn. The teams that succeed share a set of practices - around evaluation, versioning, deployment, non-determinism - that are unobvious when you start and essential at scale. This book is a direct transfer of those practices. You have two choices: keep improvising and hope the next prompt change fixes things, or build on a foundation that holds up as the technology evolves. If you're ready to replace guesswork with systems-and build AI products your team can rely on-this book will guide you there. Full Product DetailsAuthor: Alex CarterPublisher: Independently Published Imprint: Independently Published Dimensions: Width: 21.60cm , Height: 1.20cm , Length: 27.90cm Weight: 0.544kg ISBN: 9798195522674Pages: 230 Publication Date: 05 May 2026 Audience: General/trade , General Format: Paperback Publisher's Status: Active Availability: Available To Order We have confirmation that this item is in stock with the supplier. It will be ordered in for you and dispatched immediately. Table of ContentsReviewsAuthor InformationTab Content 6Author Website:Countries AvailableAll regions |
||||