Hypothesis

2 Matching Annotations

May 2026
techcrunch.com

https://techcrunch.com/2026/05/28/rsi-is-the-new-agi-and-its-just-as-hard-to-pin-down/

1
1. fxp007 29 May 2026
  
  in Public
  
  RSI is the new AGI — and it's just as hard to pin down
  
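  The headline uses the word 'new', implying that RSI is an emerging concept, but the article gives no historical background to support that claim. This may mislead readers about how the RSI concept developed. The article should trace the concept's history rather than simply labeling it 'new'.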
  
  critique unsupported-assertion context-gap

Tags

unsupported-assertion

context-gap

critique

Annotators

fxp007

URL

techcrunch.com/2026/05/28/rsi-is-the-new-agi-and-its-just-as-hard-to-pin-down/
Apr 2026
metr.org

Task-Completion Time Horizons of Frontier AI Models

1
1. fxp007 09 Apr 2026
  
  in Public
  
  Our human task duration estimates likely overestimate how long a human expert takes to complete these tasks, as the humans (and AI agents!) have much less context for the task than professionals doing equivalent work in their day-to-day job.
  
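  METR openly acknowledges that its human baseline times may be overestimated, because the human participants, like the AI agents, work in a low-context "novice" state rather than as professionals familiar with the project. This means the human ability that a "2-hour time horizon" corresponds to is closer to that of a contractor with no background knowledge than to that of an experienced full-time engineer. The real gap between AI and "professionals with context" is much larger than the time-horizon figures suggest.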
  
  context-gap human-baseline measurement-limitation surprising

Tags

surprising

measurement-limitation

human-baseline

context-gap

Annotators

fxp007

URL

metr.org/time-horizons/
