|
|
|||
|
||||
OverviewAI images flood feeds, yet the models behind them feel mysterious. Relying on black boxes risks bias, errors, and costly creative dead ends. You deserve hands-on skills to build, audit, and improve these generators yourself. This book starts from a blank notebook, guiding every line of Python code. Learn transformers for vision, then craft diffusion models that sharpen noise into art. Finish with a custom system generating high-resolution images from any text prompt. Vision transformer anatomy: Decode image patches and attention flows for transparent decision paths. End-to-end diffusion pipeline: Transform random noise into detailed, photorealistic pictures you can trust. Captioning and classification builds: Extend models to describe or categorize images for downstream tasks. Fine-tuning walkthroughs: Adapt pretrained networks quickly, saving compute while boosting domain accuracy. Deepfake detection skills: Differentiate authentic photos from generated fakes to safeguard projects and brands. Fully runnable notebooks: Experiment, tweak, and visualize results without configuration hassles. In Build a Text-to-Image Generator (from Scratch), the author combines clear prose, diagrams, and production-ready Python to deliver practical authority. Starting with patch tokenization, you implement a vision transformer, then pivot to diffusion. Step-by-step chapters layer theory, code, and visual outputs, ensuring concepts click before you move on. By the final page you can craft, tune, and deploy image generators that suit your data, budget, and ethical standards. You control every hyperparameter and understand every pixel produced. Ideal for data scientists and Python-savvy enthusiasts eager to master state-of-the-art image generation. Full Product DetailsAuthor: Mark LiuPublisher: Manning Publications Imprint: Manning Publications Dimensions: Width: 19.00cm , Height: 2.00cm , Length: 23.50cm Weight: 0.639kg ISBN: 9781633435421ISBN 10: 1633435423 Pages: 360 Publication Date: 23 January 2026 Audience: Professional and scholarly , Professional & Vocational Format: Hardback Publisher's Status: Active Availability: In Print This item will be ordered in for you from one of our suppliers. Upon receipt, we will promptly dispatch it out to you. For in store availability, please contact us. Table of ContentsReviewsThis book stands out for its hands-on, no-fluff approach to text-to-image generation—perfect for practitioners who want to build rather than just theorize. The clear PyTorch implementations, Colab-friendly examples, and practical exercises make even advanced concepts like Diffusion Models feel achievable. Simeon Leyzerzon, President, Excelsior Software Ltd. This book is a great hands-on intro to how text-to-image models like Stable Diffusion actually work under the hood. It explains the roles of transformers, VAEs, and denoising U-Nets in a super approachable way, with lots of code you can run yourself. If you’re curious about generative AI and want to build or tweak your own models, this is a solid place to start. Ravikumar Sanapala, Product Manager, Reality Labs, Meta Author InformationMark Liu is a professor and program director known for translating cutting-edge AI into practical curricula. With years mentoring graduate students and professionals, Mark brings clarity, rigor, and enthusiasm to every page. He distills deep generative-model expertise into step-by-step guidance that empowers readers to build powerful visual AI systems. Tab Content 6Author Website:Countries AvailableAll regions |
||||