Samhälle och politik

Build a Text-to-Image Generator (from Scratch)

av Mark Liu

Utgiven av Pearson Education

Format

Inbunden

Sidor

360 sidor

Språk

Engelska

Utgiven

jan. 2026

Jämför priser

Från 388 kr
Adlibris
Bästa pris
388 kr
Bokus
Bästa pris
388 kr
Akademibokhandeln
519 kr

Priserna uppdateras löpande från säkra och trygga butiker.

Om boken

AI images flood feeds, yet the models behind them feel mysterious. Relying on black boxes risks bias, errors, and costly creative dead ends. You deserve hands-on skills to build, audit, and improve these generators yourself. This book starts from a blank notebook, guiding every line of Python code. Learn transformers for vision, then craft diffusion models that sharpen noise into art. Finish with a custom system generating high-resolution images from any text prompt. 
  • Vision transformer anatomy: Decode image patches and attention flows for transparent decision paths. 
  • End-to-end diffusion pipeline: Transform random noise into detailed, photorealistic pictures you can trust. 
  • Captioning and classification builds: Extend models to describe or categorize images for downstream tasks. 
  • Fine-tuning walkthroughs: Adapt pretrained networks quickly, saving compute while boosting domain accuracy. 
  • Deepfake detection skills: Differentiate authentic photos from generated fakes to safeguard projects and brands. 
  • Fully runnable notebooks: Experiment, tweak, and visualize results without configuration hassles. 
In Build a Text-to-Image Generator (from Scratch), the author combines clear prose, diagrams, and production-ready Python to deliver practical authority. Starting with patch tokenization, you implement a vision transformer, then pivot to diffusion. Step-by-step chapters layer theory, code, and visual outputs, ensuring concepts click before you move on. By the final page you can craft, tune, and deploy image generators that suit your data, budget, and ethical standards. You control every hyperparameter and understand every pixel produced.  Ideal for data scientists and Python-savvy enthusiasts eager to master state-of-the-art image generation. 

Fler böcker av Mark Liu

Se alla

Boktips inom Samhälle och politik

Bästa pris388 kr
Gå till butik