2 Matching Annotations
  1. Apr 2026
    1. The tradeoff is that the same input can map to more tokens—roughly 1.0–1.35× depending on the content type. Second, Opus 4.7 thinks more at higher effort levels, particularly on later turns in agentic settings. This improves its reliability on hard problems, but it does mean it produces more output tokens.

      大多数人认为AI模型升级应该提高效率,减少资源消耗。但作者指出Claude Opus 4.7实际上会产生更多输出token,消耗更多计算资源。这种'效率降低'换取'可靠性提高'的权衡挑战了人们对AI发展必然带来效率提升的认知,表明在某些场景下,模型可能需要更多思考才能达到更好的结果。

    1. Engineered from the ground up for maximum compute and memory efficiency

      大多数人认为高性能AI模型必然需要大量计算资源和内存。但作者强调Gemma 4的边缘模型是'从头开始为最大计算和内存效率而设计',暗示即使在资源受限的环境中也能实现高级AI功能,这与行业对AI资源需求的普遍认知相悖。