Hypothesis

this behavioral shift does not compromise performance on straightforward problems, it might affect performance on more challenging tasks.

「简单题不影响，难题可能变差」——这个不对称性极为危险。它意味着我们在用简单任务验证 Agent 可靠性时，得到的是虚假的信心。而当 Agent 真正面临高风险、高复杂度的任务时，上下文累积已经悄悄关闭了它的自我验证模式，在没有任何预警的情况下退化为浅层推理。这是一种「隐性能力衰减」，比显而易见的失败更危险。

hidden-degradation hard-tasks false-confidence agent-risk

Tags

Annotators

URL