🔮The Codex

Text-to-Video

AI that generates video clips from text descriptions.

📖 Apprentice Explanation

Text-to-video AI creates short video clips from your written descriptions. It's like text-to-image but with motion. Tools like Runway can generate a few seconds of video from a prompt.

🧙 Archmage Notes

Text-to-video models extend diffusion architectures with temporal attention layers. Key challenges include temporal consistency, long-form generation, and compute costs. Sora, Runway Gen-3, and Kling represent the current state of the art.

Related Enchantments

🔮

Diffusion Model

🔮

Multimodal AI