Hypothesis

8 Matching Annotations

Jun 2026
openai.com openai.com

Untitled document

1
1. fxp007 18 Jun 2026
  
  in Public
  
  Chemists found the suggestion both surprising and interesting
  
  这是全文最值得关注的细节之一。TEMPO是温和的自由基氧化剂，通常不是有机化学家考虑偶联反应时的第一直觉。AI提出了一个人类专家觉得出人意料但合理的假设——这正是科研价值的核心：不是重新发现已知的，而是在现有知识空间中找到人类视野盲区里的连接。如果AI只是系统地重组了文献中已有的方向，这个结果就不值得发表。
  
  非共识假设 TEMPO AI创造力
Visit annotations in context

Tags

AI创造力

TEMPO

非共识假设

Annotators

fxp007

URL

openai.com/index/ai-chemist-improves-reaction/
techcrunch.com techcrunch.com

Jeff Bezos's Prometheus raises $12B to build an 'artificial general engineer' for the physical world | TechCrunch

1
1. fxp007 12 Jun 2026
  
  in Public
  
  贝索斯认为AI生产力提升将导致劳动力稀缺，与主流AI领袖预测的大规模失业观点相左。
  
  非共识观点
Visit annotations in context

Tags

非共识观点

Annotators

fxp007

URL

techcrunch.com/2026/06/11/jeff-bezoss-prometheus-raises-12b-to-build-an-artificial-general-engineer-for-the-physical-world/
www.technologyreview.com www.technologyreview.com

https://www.technologyreview.com/2026/06/11/1138794/google-deepmind-is-worried-about-what-happens-when-millions-of-agents-start-to-interact/

1
1. fxp007 12 Jun 2026
  
  in Public
  
  DeepMind认为学术界能比企业实验室看得更远，不受短期商业目标限制，这是推动多智能体安全研究的关键动力。
  
  非共识观点
Visit annotations in context

Tags

非共识观点

Annotators

fxp007

URL

technologyreview.com/2026/06/11/1138794/google-deepmind-is-worried-about-what-happens-when-millions-of-agents-start-to-interact/
www.anthropic.com www.anthropic.com

When AI builds itself

4
1. fxp007 12 Jun 2026
  
  in Public
  
  It's becoming clear that much of what advances the frontier is automatable; large-scale research progress is mostly a function of tools and resources, which dictate how fast you can run experiments, how many you can run at once, and how quickly you can get results.
  
  这是文中最具争议性的哲学主张：「大部分前沿进展是可自动化的」。反驳：Transformer、RLHF等范式级突破不是「把已知实验跑得更快」的产物，而是概念上的跳跃。作者的反驳是：这些范式突破间隔多年，中间99%的进展靠的是规模化+调试+迭代。如果Claude已经擅长后者，那「前沿」就意味着：方向设定（人类）+大规模自动执行（AI）。这个分工假设成立的前提是：下一个Transformer级别的突破何时到来，以及它是否同样可以自动化。
  
  非共识研究哲学批判性
2. fxp007 12 Jun 2026
  
  in Public
  
  our best model in November 2025 (Opus 4.5) beat the human choice 51% of the time; in April 2026 (Mythos Preview), this grew to 64%
  
  研究判断力的进化：从51%（略好于随机）到64%，6个月内提升13个百分点。但这个设计本身值得仔细审视：实验选取的是「人类做出了次优选择」的时刻（n=129），因此这不是无偏的人机对比，而是「在人类容易出错的情境下，模型犯同样错误的频率有多低」。即便如此，从51%到64%意味着：模型不只是在执行层超越人类，在判断层也开始建立优势——而判断层正是这篇文章认为「人类最后的比较优势」所在。
  
  数据研究判断非共识
3. fxp007 12 Jun 2026
  
  in Public
  
  It's becoming clear that much of what advances the frontier is automatable; large-scale research progress is mostly a function of tools and resources, which dictate how fast you can run experiments, how many you can run at once, and how quickly you can get results.
  
  这是文中最具争议性的哲学主张：「大部分前沿进展是可自动化的」。反驳：Transformer、注意力机制、RLHF等范式级突破不是「把已知实验跑得更快」的产物，而是概念上的跳跃。作者的反驳是：这些范式突破间隔多年，中间99%的进展靠的是「规模化+调试+迭代」。如果Claude已经擅长后者，那「前沿」就意味着：方向设定（人类）+大规模自动执行（AI）。这个分工假设成立的前提是：下一个Transformer级别的突破何时到来，以及它是否同样可以自动化。
  
  非共识研究哲学批判性
4. fxp007 12 Jun 2026
  
  in Public
  
  our best model in November 2025 (Opus 4.5) beat the human choice 51% of the time; in April 2026 (Mythos Preview), this grew to 64%
  
  研究判断力的进化：从51%（略好于随机）到64%，6个月内提升13个百分点。但这个设计本身值得仔细审视：实验选取的是「人类做出了次优选择」的时刻（n=129），因此这不是无偏的人机对比，而是「在人类容易出错的情境下，模型犯同样错误的频率有多低」。即便如此，从51%到64%的提升意味着：模型不只是在执行层超越人类，在判断层也开始建立优势——而判断层正是这篇文章认为「人类最后的比较优势」所在。
  
  数据研究判断非共识
Visit annotations in context

Tags

数据

非共识

研究判断

研究哲学

批判性

Annotators

fxp007

URL

anthropic.com/institute/recursive-self-improvement
sakana.ai sakana.ai

Untitled document

1
1. fxp007 12 Jun 2026
  
  in Public
  
  this dynamic adversarial process leads to the emergence of increasingly general strategies and reveals an intriguing form of convergent evolution, where different code implementations settle into similar high-performing behaviors
  
  这是全文最重要的实验结果：不同初始条件的独立演化路径，最终收敛到相似的行为策略。这与生物界鸟和蝙蝠各自独立演化出翅膀如出一辙。对 AI 研究者的启示：存在某种「最优策略的引力盆地」——无论从哪个起点出发，对抗压力会把系统推向相同的解。这意味着复杂能力的涌现可能比我们想象的更具必然性。
  
  收敛进化涌现非共识
Visit annotations in context

Tags

涌现

非共识

收敛进化

Annotators

fxp007

URL

sakana.ai/drq/

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL