Hypothesis

3 Matching Annotations

Apr 2026
huggingface.co huggingface.co

Reasoning Shift: How Context Silently Shortens LLM Reasoning

1
1. fxp007 09 Apr 2026
  
  in Public
  
  the robustness of these reasoning behaviors remains underexplored
  
  「推理行为的鲁棒性尚未被充分探索」——这句话是整个推理模型研究领域的集体盲点声明。过去两年，测试时计算（test-time compute）、长思维链（CoT）、o1/R1 类推理模型吸引了巨大关注，但几乎所有评测都在「孤立问题」环境下进行。在真实 Agent 部署场景中，「能否保持推理深度」这个最基本的可靠性问题，直到这篇论文才开始被系统研究。
  
  research-gap robustness test-time-compute systematic-evaluation
Visit annotations in context

Tags

test-time-compute

research-gap

robustness

systematic-evaluation

Annotators

fxp007

URL

huggingface.co/papers/2604.01161
epoch.ai epoch.ai

Keeping up with the GPTs | Epoch AI

1
1. fxp007 09 Apr 2026
  
  in Public
  
  Just last year, Anthropic spent over ten times more on compute than Minimax and Zhipu AI combined, and the gap is even wider for OpenAI:
  
  这个数字对国内 AI 从业者而言极为刺耳：Anthropic 一家的算力投入就超过智谱 AI 和 MiniMax 合计的十倍以上，而与 OpenAI 相比差距更大。所谓「中美 AI 竞争激烈」的叙事背后，是一场体量悬殊的不对称战争——不是同一量级的竞争，而是大卫与歌利亚的对决。对智谱这样的公司，这既是警醒，也是生存战略的根本约束。
  
  Zhipu-AI MiniMax compute-gap China-US-AI surprising
Visit annotations in context

Tags

surprising

compute-gap

MiniMax

China-US-AI

Zhipu-AI

Annotators

fxp007

URL

epoch.ai/gradient-updates/keeping-up-with-the-gpts/
epoch.ai epoch.ai

https://epoch.ai/blog/introducing-the-ai-chip-owners-explorer

1
1. fxp007 08 Apr 2026
  
  in Public
  
  We estimate that as of the end of 2025, Chinese companies collectively own just over 5% of the cumulative computing power of the leading AI chips sold in recent years
  
  考虑到中国AI产业的快速发展和政府对AI的大力投资，大多数人可能认为中国拥有更大比例的全球AI计算能力，但作者认为中国公司仅拥有约5%的全球AI计算能力。这一数字远低于人们的预期，挑战了关于中国AI技术实力的普遍认知。
  
  non-consensus china-ai-capabilities compute-gap
Visit annotations in context

Tags

non-consensus

china-ai-capabilities

compute-gap

Annotators

fxp007

URL

epoch.ai/blog/introducing-the-ai-chip-owners-explorer

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL