3 Matching Annotations
  1. Last 7 days
    1. 90% of the bugs are one-shotted now, including bizarre race conditions, unexpected corner-cases, third-party integration issues, undocumented API edge cases, everything. I hardly have to intervene.

      大多数人认为调试复杂系统特别是分布式系统的能力是工程师的最后堡垒,但作者认为AI已经能够解决90%的bug,包括那些需要丰富经验才能处理的复杂问题。这与'人类在调试领域具有独特优势'的主流认知相悖。

  2. Jun 2026
    1. The model is not merely sampling more images or videos; it is debugging a visual program in a closed-loop, renderable environment.

      大多数人认为AI生成内容的改进主要依靠增加计算量和样本数量,但作者认为真正的进步在于AI能够像程序员一样调试视觉程序。这一观点将AI从内容生成者转变为问题解决者,暗示未来AI的发展方向是编程能力而非单纯的生成能力。

  3. Apr 2026
    1. Claude Opus 4.7 passed three TBench tasks that prior Claude models couldn't, and it's landing fixes our previous best model missed, including a race condition.

      解决前代模型无法处理的并发条件(race condition)问题,展示了AI在系统级理解上的深度提升,这种对复杂系统行为的理解能力是AI从代码生成向系统架构设计转变的关键标志。