🔮The Codex
Text-to-Image
AI that generates images from written text descriptions.
📖 Apprentice Explanation
Text-to-image AI creates pictures from your words. You describe what you want to see, such as 'a sunset over mountains in watercolor style', and the AI generates an image to match.
🧙 Archmage Notes
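Text-to-image models typically use diffusion architectures (Stable Diffusion, DALL-E 3) or autoregressive ones (Parti). CLIP, a contrastive model trained to place matching text and images close together in a shared embedding space, provides text-image alignment; Stable Diffusion, for example, uses a CLIP text encoder to condition generation. Key techniques include classifier-free guidance, which strengthens adherence to the prompt; negative prompts, which steer generation away from unwanted content; and ControlNet, which adds structural control from inputs such as edge maps, depth maps, or poses.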
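As a rough illustration of classifier-free guidance, the sketch below combines a prompt-conditioned noise prediction with an unconditional one using the commonly cited rule: guided = uncond + scale * (cond - uncond). The function name, the plain Python lists standing in for model tensors, and the sample numbers are all assumptions made for illustration; a real pipeline would apply the same arithmetic to the denoiser's outputs at every sampling step.

```python
def classifier_free_guidance(eps_cond, eps_uncond, guidance_scale):
    """Blend conditional and unconditional noise predictions.

    eps_cond and eps_uncond are equal-length lists of floats, used here as
    hypothetical stand-ins for the tensors a real denoiser would return.
    """
    if len(eps_cond) != len(eps_uncond):
        raise ValueError("predictions must have the same length")
    # Move from the unconditional prediction toward (and past) the
    # conditional one; scale 1.0 reproduces eps_cond exactly.
    return [u + guidance_scale * (c - u) for c, u in zip(eps_cond, eps_uncond)]


# Toy values, not real model outputs.
cond = [1.0, 2.0]
uncond = [0.0, 1.0]
print(classifier_free_guidance(cond, uncond, 2.0))  # [2.0, 3.0]
```

A scale of 0 returns the unconditional prediction and a scale of 1 returns the conditional one; larger values push further toward the prompt. In many implementations a negative prompt is applied through the same formula, by passing the prediction for the negative prompt in place of the unconditional one.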
