Claude Fable 5 is the first to break 90% on our core analytics benchmark of complex, long-running analytical tasks — a 10-point jump over Opus. On the hardest questions, it shows strong judgment and attention to nuance.
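Most people assume that AI models improve on complex reasoning tasks gradually, but the author argues that Fable 5 marks a qualitative leap, breaking straight through the key 90% threshold. This challenges the expectation that AI progress is linear and suggests a possible nonlinear pattern of development, in which crossing a capability threshold brings a marked gain in performance.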