🔮 The Codex

Zero-Shot Learning

An AI's ability to perform tasks it wasn't specifically trained for.

📖 Apprentice Explanation

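Zero-shot means an AI can handle a task without being shown any worked examples of it, either during training or in the prompt. If you ask ChatGPT to write a haiku about quantum physics, it can do so even though it was never specifically trained on quantum haikus: it combines what it already learned about haiku and about physics.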
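To make the idea concrete, here is a minimal sketch of how a zero-shot prompt differs from a few-shot one. The function names and the "Input:/Output:" layout are illustrative assumptions, not a standard API, and no model is actually called; the code only builds the text you would send.

```python
from typing import List, Tuple


def build_zero_shot_prompt(instruction: str, text: str) -> str:
    """Zero-shot: describe the task and give the input, with no solved examples."""
    return f"{instruction}\n\nInput: {text}\nOutput:"


def build_few_shot_prompt(
    instruction: str, examples: List[Tuple[str, str]], text: str
) -> str:
    """Few-shot (for contrast): same task, but solved examples come first."""
    demos = "\n\n".join(f"Input: {x}\nOutput: {y}" for x, y in examples)
    return f"{instruction}\n\n{demos}\n\nInput: {text}\nOutput:"


if __name__ == "__main__":
    # Hypothetical task and inputs, chosen only for illustration.
    task = "Classify the sentiment of the input as positive or negative."
    print(build_zero_shot_prompt(task, "The spell worked perfectly."))
    print("---")
    print(
        build_few_shot_prompt(
            task,
            [("I loved it.", "positive"), ("It broke at once.", "negative")],
            "The spell worked perfectly.",
        )
    )
```

In the zero-shot version the model has only the instruction to go on, which is the situation this entry describes.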

🧙 Archmage Notes

Zero-shot capabilities emerge from large-scale pre-training on diverse data. Performance tends to improve as model size grows, in line with empirical scaling laws. Instruction tuning and RLHF (reinforcement learning from human feedback) significantly improve zero-shot task performance.