🔮The Codex
Zero-Shot Learning
An AI's ability to perform tasks it wasn't specifically trained for.
📖 Apprentice Explanation
Zero-shot means an AI can handle tasks it's never seen before. If you ask ChatGPT to write a haiku about quantum physics, it can do it even though it wasn't specifically trained on quantum haikus.
🧙 Archmage Notes
Zero-shot capabilities emerge from large-scale pre-training on diverse data. Performance scales with model size (scaling laws). Instruction tuning and RLHF significantly improve zero-shot task performance.
