The Architecture Handbook for Milvus Vector Database: Design and implement high-performance vector search systems with Milvus

Author:   Yudong Cai ,  Jeremy Zhu ,  Xuan Yang ,  Bang Fu
Publisher:   Packt Publishing Limited
ISBN:  

9781835881705


Pages:   502
Publication Date:   31 March 2026
Format:   Paperback
Availability:   In Print   Availability explained
This item will be ordered in for you from one of our suppliers. Upon receipt, we will promptly dispatch it out to you. For in store availability, please contact us.

Our Price $131.97 Quantity:  
Add to Cart

Share |

The Architecture Handbook for Milvus Vector Database: Design and implement high-performance vector search systems with Milvus


Overview

Co-authored by core contributors of Milvus, this book guide explores the architecture of the Milvus vector databases for GenAI solutions Free with your book: DRM-free PDF version + access to Packt's next-gen Reader* Key Features Understand the core architecture and vector indexing engine that makes Milvus ideal for AI-driven search Learn scalable deployment and performance optimization techniques Test, apply, and integrate Milvus into AI and LLM pipelines using LangChain Purchase of the print or Kindle book includes a free PDF eBook Book DescriptionThe rapid adoption of LLMs demands efficient storage and lightning-fast retrieval of unstructured data. Designed as a vector database, Milvus has earned widespread recognition in the community and support from tech giants like Apple and NVIDIA. Yet, many developers only scratch the surface of what Milvus is truly capable of. Written by the contributors of the Milvus project, this handbook gives you an insider’s perspective on its design and how it handles large-scale, high-dimensional vector data. Starting with the basics, you’ll learn about everything from service deployment and SDK usage to Milvus’ layered architecture and how its components interact. You’ll learn how the indexing, replication, compaction, and garbage collection systems work and how to apply them to real scenarios. Through practical demos and configuration exercises, you’ll learn how to monitor, scale, and secure Milvus in production and then advance to performance evaluation and scalability testing using tools like VectorDBBench. You'll also explore Milvus' integration with LangChain for use cases such as vector search and RAG-based chatbots. By the end of this book, you’ll be able to analyze Milvus internals, fine-tune for performance, ensure system stability, and integrate it into next-generation AI solutions. *Email sign-up and proof of purchase requiredWhat you will learn Deploy Milvus using Docker, Kubernetes, and Helm Configure Milvus and monitor system health with Prometheus, Grafana, and Loki Understand core components like Knowhere, indexes, time sync, compaction, and garbage collection Design and optimize schema, queries, and data modification flows Benchmark performance and simulate real-world failure recovery Scale Milvus clusters to support large datasets and high-concurrency traffic Implement different multi-tenant strategies in Milvus Build AI applications using Milvus with LangChain Who this book is forThis book is for database practitioners looking to get started with Milvus and build their expertise in vector data and vector search. It’s particularly suited for data analysts, data scientists, Milvus developers, system architects, tech enthusiasts, and researchers in vector database technologies. To get the most out of this book, you should have a foundational understanding of Go, Python, or C++, as well as a basic knowledge of database systems. Familiarity with Docker and Kubernetes is recommended.

Full Product Details

Author:   Yudong Cai ,  Jeremy Zhu ,  Xuan Yang ,  Bang Fu
Publisher:   Packt Publishing Limited
Imprint:   Packt Publishing Limited
ISBN:  

9781835881705


ISBN 10:   183588170
Pages:   502
Publication Date:   31 March 2026
Audience:   General/trade ,  General
Format:   Paperback
Publisher's Status:   Active
Availability:   In Print   Availability explained
This item will be ordered in for you from one of our suppliers. Upon receipt, we will promptly dispatch it out to you. For in store availability, please contact us.

Table of Contents

Table of Contents Introduction to Milvus Deploying Milvus in Multiple Ways Interacting with Milvus Configuring the Milvus System Understanding the Milvus Data Model and Architecture Data Modification and Maintenance in Milvus Reading Data in Milvus Compaction and Garbage Collection Exploring Milvus' Vector Engine How to Select a Vector Index Handling Complicated Search Requests Getting Started with Milvus Performance Benchmarking Stability and Reliability Evaluation for the Milvus Vector Database Scalability Evaluation for the Milvus Vector Database Getting Started with Milvus Performance Tuning Implementing Multi-Tenancy in Milvus How Milvus Works in AI

Reviews

Author Information

Yudong Cai is a senior software engineer with over 20 years of experience in large-scale system development. As one of the founding members of the Milvus project, he helped build Milvus from the ground up and has been involved in the development and iteration of every version since its initial open-source release. His key contributions include delivering the first production-grade Range Search implementation, as well as the refactoring of the entire Milvus configuration system, alongside the design and implementation of numerous other critical features. He is also the original developer and key maintainer of Knowhere, Milvus' core vector computation engine, where he designed its architecture to support multiple hardware acceleration frameworks and a wide range of vector search algorithms. Jeremy Zhu is a quality assurance engineer at Zilliz, focused on ensuring the robustness and high performance of the Milvus vector database. His core responsibilities include designing comprehensive test cases, developing automated system test pipelines for diverse scenarios, and executing rigorous stress, recovery, and performance testing. Jeremy possesses deep expertise in chaos engineering, distributed systems testing, and test automation frameworks, playing a key role in maintaining Milvus' high-quality standards. Xuan Yang is a senior software engineer at Zilliz in China, passionate about designing high-performance, scalable distributed database systems. As a core Milvus contributor, she architected the DataNode module, implemented the compaction process, and led the L0 segment design. She is the primary maintainer of PyMilvus, the official Python SDK, and VectorDBBench, an open-source benchmarking framework for vector databases. She cares deeply about system stability and performance and is always eager to collaborate with the community to push the boundaries of large-scale AI and vector data infrastructure. Bang Fu is a senior software engineer at Zilliz. With extensive experience in both Go and Python, he has actively contributed to the development of several key features for Milvus, including permission verification, request interception, incremental synchronization, and serverless metering functionalities. He is also interested in AI technology and led the development of the GPTCache project, which focuses on caching LLM responses to improve speed and reduce costs. In addition, he has participated in the development of the DeepSearcher project.

Tab Content 6

Author Website:  

Countries Available

All regions
Latest Reading Guide

April RG 26_2

 

Shopping Cart
Your cart is empty
Shopping cart
Mailing List