Hypothesis

6 Matching Annotations

Apr 2026
epoch.ai

https://epoch.ai/blog/have-ai-capabilities-accelerated

5
1. fxp007 30 Apr 2026
  
  in Public
  
  The three metrics where we find acceleration are concentrated in programming and mathematics. These are areas that labs have explicitly targeted for improvement, and they share an important property: correctness is easy to verify automatically.
  
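  Most people may assume that AI capabilities are accelerating across all domains, but the authors point out that the acceleration is concentrated in programming and mathematics, where correctness is easy to verify automatically. This finding challenges the assumption of a general improvement in AI capabilities and suggests that the acceleration may be selective.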
  
  non-consensus domain-specific verification-ease
2. fxp007 30 Apr 2026
  
  in Public
  
  Our fourth metric, an index constructed from WeirdML V2 results, showed no sign of acceleration. A single global linear trend fit the data best.
  
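  Most people may assume that all AI capability metrics should accelerate in step, but the authors find that the WeirdML V2 metric shows no sign of acceleration; the best fit is still a simple global linear trend. This finding suggests that the acceleration of AI capabilities is not universal but specific to certain task domains.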
  
  non-consensus domain-specific benchmarking
3. fxp007 26 Apr 2026
  
  in Public
  
  The three metrics where we find acceleration are concentrated in programming and mathematics. These are areas that labs have explicitly targeted for improvement, and they share an important property: correctness is easy to verify automatically.
  
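  The mainstream view may hold that AI capabilities improve evenly across domains, but the authors point out that the acceleration is concentrated in programming and mathematics, where correctness is easy to verify automatically. This suggests that AI progress may not be universal but concentrated in specific, quantifiable domains.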
  
  non-consensus ai-benchmarks domain-specific
4. fxp007 24 Apr 2026
  
  in Public
  
  The three metrics where we find acceleration are concentrated in programming and mathematics. These are areas that labs have explicitly targeted for improvement
  
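  This observation reveals the domain-limited nature of the acceleration in AI capabilities. Programming and mathematics may be accelerating because these domains are explicitly targeted for improvement and correctness is easy to verify. This suggests that AI progress may be selective rather than across the board, which has important implications for assessing overall AI progress.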
  
  data-point statistics domain-specific
5. fxp007 24 Apr 2026
  
  in Public
  
  The three metrics where we find acceleration are concentrated in programming and mathematics.
  
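  The article states explicitly that the three metrics showing acceleration are concentrated in programming and mathematics. This is an important limitation: correctness in these domains is easy to verify automatically, which makes them natural targets for reinforcement learning. It suggests that the acceleration in AI capabilities may not apply to all domains, particularly tasks where correctness is hard to verify automatically.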
  
  data-point domain-specific limitations
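
The quoted passage in annotation 2 ("A single global linear trend fit the data best") describes a model comparison between one straight line and a trend that changes slope. The following is a minimal sketch of that kind of comparison, not the article's actual method: it assumes ordinary least squares, a two-segment fit with a free breakpoint, and BIC as the selection criterion. The function names (`fit_line`, `bic`, `compare_trends`), the parameter count of 5 for the two-segment model, and the `min_seg` setting are all illustrative assumptions.

```python
import math

def fit_line(xs, ys):
    """Ordinary least squares for y = a + b*x; returns (a, b, rss)."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    b = sxy / sxx
    a = mean_y - b * mean_x
    rss = sum((y - (a + b * x)) ** 2 for x, y in zip(xs, ys))
    return a, b, rss

def bic(rss, n, k):
    # Gaussian-error BIC up to an additive constant.
    # The floor on rss avoids log(0) when a fit is exact.
    return n * math.log(max(rss, 1e-12) / n) + k * math.log(n)

def compare_trends(xs, ys, min_seg=3):
    """Compare one global line against the best two-segment fit.

    Assumes xs is sorted ascending. The two-segment model is charged
    5 parameters (two intercepts, two slopes, one breakpoint); this
    count is an assumption of the sketch.
    """
    n = len(xs)
    _, _, rss_global = fit_line(xs, ys)
    best = {"model": "global", "bic": bic(rss_global, n, 2), "break_index": None}
    for i in range(min_seg, n - min_seg + 1):
        _, _, rss_left = fit_line(xs[:i], ys[:i])
        _, _, rss_right = fit_line(xs[i:], ys[i:])
        score = bic(rss_left + rss_right, n, 5)
        if score < best["bic"]:
            best = {"model": "piecewise", "bic": score, "break_index": i}
    return best

# Synthetic data (made up for illustration, not taken from the article).
xs = list(range(12))
noise = [0.5 * (-1) ** x for x in xs]
steady = [2 * x + e for x, e in zip(xs, noise)]
kinked = [(x if x < 6 else 6 + 4 * (x - 6)) + e for x, e in zip(xs, noise)]

print(compare_trends(xs, steady)["model"])
print(compare_trends(xs, kinked)["model"])
```

Under these assumptions, a series with a steady slope keeps the global line because the extra parameters of the two-segment model are not worth the small drop in residual error, while a series with a clear kink selects the piecewise fit.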

Tags

verification-ease

limitations

statistics

data-point

benchmarking

domain-specific

non-consensus

ai-benchmarks

Annotators

fxp007

URL

epoch.ai/blog/have-ai-capabilities-accelerated
Nov 2024
4thgenerationcivilization.substack.com

A global history of societal regulation

1
1. stopresetgo 22 Nov 2024
  
  in Public
  
  Domain-specific alliances
  
  for - adjacency - SRG planetary boundary / earth system boundaries working groups - domain specific alliances - Magisteria of the Commons
  
  adjacency - between - SRG planetary boundary / earth system boundaries working groups - domain specific alliances - Magisteria of the Commons - adjacency relationship: The domain-specific alliances of the Magisteria of the Commons are similar to the SRG idea of developing funds version divisions of wealth system boundaries
  
  adjacency - SRG planetary boundary / earth system boundaries working groups - domain specific alliances - Magisteria of the Commons

Tags

adjacency - SRG planetary boundary / earth system boundaries working groups - domain specific alliances - Magisteria of the Commons

Annotators

stopresetgo

URL

4thgenerationcivilization.substack.com/p/a-global-history-of-societal-regulation
