Hypothesis

7 Matching Annotations

Jun 2026
openai.com openai.com

https://openai.com/index/gpt-5-immunology-mystery/

1
1. fxp007 26 Jun 2026
  
  in Public
  
  AI may generate an insight, but people must still evaluate its significance and plausibility.
  
  大多数人认为随着AI能力增强，人类专家的角色将逐渐被取代。但作者坚持认为专业知识仍然至关重要，人类必须评估AI见解的意义和合理性，这挑战了技术决定论和对AI取代人类的担忧，暗示人机协作而非替代才是未来方向。
  
  counterintuitive human-ai-collaboration expertise
Visit annotations in context

Tags

expertise

human-ai-collaboration

counterintuitive

Annotators

fxp007

URL

openai.com/index/gpt-5-immunology-mystery/
human-in-the-loop.bearblog.dev human-in-the-loop.bearblog.dev

https://human-in-the-loop.bearblog.dev/llms-are-eroding-my-software-engineering-career-and-i-dont-know-what-to-do/

1
1. fxp007 07 Jun 2026
  
  in Public
  
  all the knowledge I have accumulated over the years: the trade-offs between implementations, how acquiring works, how to structure idempotency to prevent double-charges, everything, was becoming useless.
  
  大多数人认为深厚的领域专业知识是软件工程师不可替代的核心竞争力，但作者认为这些知识正在变得无用，因为LLMs能够快速获取和应用这些专业知识。这与行业普遍认为的'领域专家价值会随时间增长'的观点相悖。
  
  non-consensus domain-expertise ai-impact
Visit annotations in context

Tags

ai-impact

non-consensus

domain-expertise

Annotators

fxp007

URL

human-in-the-loop.bearblog.dev/llms-are-eroding-my-software-engineering-career-and-i-dont-know-what-to-do/
www.anthropic.com www.anthropic.com

https://www.anthropic.com/research/making-claude-a-chemist

2
1. fxp007 06 Jun 2026
  
  in Public
  
  For routine data prediction Opus 4.7—a general-purpose model without chemistry-specific fine-tuning—is now as good as or better than ChemDraw and MestReNova on average
  
  大多数人认为通用AI模型在专业化学任务上必然落后于专门训练的化学软件，但作者发现Claude在没有经过化学专门微调的情况下已经能够匹敌甚至超越专业软件。这表明现代AI模型的通用能力已经足够强大，可以在特定专业领域挑战专门工具的地位，打破了AI只能作为辅助工具的传统认知。
  
  non-consensus general-purpose-ai domain-expertise
2. fxp007 06 Jun 2026
  
  in Public
  
  a general-purpose model without chemistry-specific fine-tuning—is now as good as or better than ChemDraw and MestReNova on average
  
  大多数人认为专业化学软件需要专门训练才能在专业领域表现优异，但作者认为Claude这样没有经过化学专门微调的通用模型已经能够匹敌甚至超越专业化学软件。这是因为Claude的多模态能力和推理能力使其能够直接从期刊图表或手绘结构中读取化学信息，而不依赖预处理的分子数据库，这挑战了专业软件必须领域专门化的传统认知。
  
  non-consensus ai-vs-expertise chemistry-ai
Visit annotations in context

Tags

ai-vs-expertise

non-consensus

domain-expertise

general-purpose-ai

chemistry-ai

Annotators

fxp007

URL

anthropic.com/research/making-claude-a-chemist
Apr 2026
www.anthropic.com www.anthropic.com

https://www.anthropic.com/news/anthropic-nec

1
1. fxp007 26 Apr 2026
  
  in Public
  
  NEC will establish a Center of Excellence to develop a highly skilled, AI-enabled engineering organization
  
  大多数人认为AI会使专业知识和技能贬值，但作者认为AI实际上需要更高水平的工程专业知识，因为企业正在建立专门的卓越中心来培养AI技能，这表明AI工具正在提升而非降低工程工作的专业门槛。
  
  non-consensus skills-gap ai-expertise
Visit annotations in context

Tags

ai-expertise

non-consensus

skills-gap

Annotators

fxp007

URL

anthropic.com/news/anthropic-nec
blog.vidocsecurity.com blog.vidocsecurity.com

We Reproduced Anthropic's Mythos Findings With Public Models

1
1. fxp007 24 Apr 2026
  
  in Public
  
  Public models can already spot that a security-relevant check is missing in the right code path, but they can still miss the actual invariant being violated and therefore misstate the impact.
  
  这一发现揭示了公共模型在安全分析中的一个关键局限：它们能发现缺失的安全检查，但可能无法正确理解被违反的实际不变量，从而错误陈述影响。这挑战了'AI能完全理解安全含义'的假设，强调了人类专家在解释AI发现中的不可替代性。
  
  ai-limitations security-analysis human-expertise
Visit annotations in context

Tags

ai-limitations

human-expertise

security-analysis

Annotators

fxp007

URL

blog.vidocsecurity.com/blog/we-reproduced-anthropics-mythos-findings-with-public-models
openai.com openai.com

https://openai.com/index/introducing-gpt-rosalind/

1
1. fxp007 17 Apr 2026
  
  in Public
  
  When evaluated directly in the Codex app, best-of-ten model submissions ranked above the 95th percentile of human experts on the prediction task and around the 84th percentile of human experts on the sequence generation task.
  
  这一性能指标令人震惊，表明AI在某些任务上已超越95%的人类专家。这不仅是技术进步的标志，也引发了对专业科学家角色和未来就业市场的深刻思考。
  
  ai-performance expertise-superiority
Visit annotations in context

Tags

expertise-superiority

ai-performance

Annotators

fxp007

URL

openai.com/index/introducing-gpt-rosalind/

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL