Local AI for Developers: Run Open-Source LLMs on Your Laptop with Ollama, llama.cpp, Quantization, and Practical RAG Workflows

Author:   Harvey Chandler
Publisher:   Independently Published
ISBN:  

9798198005006


Pages:   176
Publication Date:   21 May 2026
Format:   Paperback
Availability:   Available To Order   Availability explained
We have confirmation that this item is in stock with the supplier. It will be ordered in for you and dispatched immediately.

Our Price $52.77 Quantity:  
Add to Cart

Share |

Local AI for Developers: Run Open-Source LLMs on Your Laptop with Ollama, llama.cpp, Quantization, and Practical RAG Workflows


Overview

Local AI for Developers: Run Open-Source LLMs on Your Laptop with Ollama, llama.cpp, Quantization, and Practical RAG WorkflowsBuild private, fast, cost-controlled AI systems directly on your own machine. Tired of sending every prompt, code snippet, and internal document to a remote API? Want to experiment with open-source LLMs without unpredictable usage bills, rate limits, or cloud dependency? Local AI for Developers gives you a practical path to running, optimizing, and building with local language models using Ollama, llama.cpp, GGUF models, quantization, embeddings, and Retrieval-Augmented Generation workflows. Instead of treating local AI as a toy demo, this book shows how to turn your laptop or workstation into a usable AI development environment. You'll learn how to choose models that fit your hardware, run open-source LLMs locally, expose them through developer-friendly APIs, build private document assistants, create local embeddings, improve RAG quality, benchmark performance, and move from laptop prototypes toward repeatable local AI workflows. The manuscript covers Ollama, llama.cpp, quantization, model selection, RAG, reranking, benchmarking, and production-style local workflows. Inside, you'll gain practical skills for: Setting up a reliable local AI development stack Running and managing models with Ollama Using llama.cpp and GGUF models for deeper control Choosing quantization levels for speed and quality Building local APIs, assistants, embeddings, and RAG pipelines Evaluating latency, throughput, memory use, and response quality This book is for software developers, AI engineers, ML practitioners, technical founders, and builders who want private, offline-capable, open-source AI systems they can control. Get your copy today and start building local AI workflows that run on your hardware, protect your data, and fit real developer work.

Full Product Details

Author:   Harvey Chandler
Publisher:   Independently Published
Imprint:   Independently Published
Dimensions:   Width: 17.80cm , Height: 1.00cm , Length: 25.40cm
Weight:   0.318kg
ISBN:  

9798198005006


Pages:   176
Publication Date:   21 May 2026
Audience:   General/trade ,  General
Format:   Paperback
Publisher's Status:   Active
Availability:   Available To Order   Availability explained
We have confirmation that this item is in stock with the supplier. It will be ordered in for you and dispatched immediately.

Table of Contents

Reviews

Author Information

Tab Content 6

Author Website:  

Countries Available

All regions
Latest Reading Guide

RGJ26

 

Shopping Cart
Your cart is empty
Shopping cart
Mailing List