Hypothesis

5 Matching Annotations

Jun 2026
www.latent.space

The Age of Async Agents — Cognition's Walden Yan & OpenInspect's Cole Murray

1
1. fxp007 05 Jun 2026
  
  in Public
  
  the real failure mode of uncontrolled vibe coding: your codebase regressing to your worst engineer.
  
  This is the sharpest critique of naive AI coding adoption in the article. Without proper agent oversight, code review loops, and quality gates, AI doesn't raise the floor — it lowers it by enabling low-quality code to ship at machine speed. The 'worst engineer' framing implies that unconstrained agents optimize for task completion, not codebase health.
  
  vibe-coding ai-code-quality failure-modes

URL

latent.space/p/cognition
claude.com

Introducing dynamic workflows | Claude

1
1. fxp007 05 Jun 2026
  
  in Public
  
  When the cost of a wrong answer is high, a workflow gives Claude independent attempts at the problem and adversarial agents working to break the result before you see it.
  
  Adversarial self-verification is a significant architectural step beyond standard code review. Having agents actively attempt to falsify results before surfacing them mirrors formal verification approaches — but applied dynamically to any engineering problem. This could shift AI coding from 'trust then verify' to 'verify then deliver.'
  
  adversarial-agents ai-verification code-quality

URL

claude.com/blog/introducing-dynamic-workflows-in-claude-code
May 2026
www.anthropic.com

Introducing Claude Opus 4.8

1
1. fxp007 29 May 2026
  
  in Public
  
  Opus 4.8 is around four times less likely than its predecessor to allow flaws in code it has written to pass unremarked.
  
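  Most people assume AI models confidently output flawed code without realizing it, but the author argues that Opus 4.8 is markedly better at self-correction. This challenges the widespread skepticism about AI models' capacity for self-assessment and suggests AI may be more reliable on code quality than people expect.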
  
  non-consensus code-quality ai-reliability

URL

anthropic.com/news/claude-opus-4-8
Apr 2026
x.com

https://x.com/cerebras/status/2042015763201221032

1
1. fxp007 16 Apr 2026
  
  in Public
  
  Add contacts, live search, full pipeline dashboard – all unit tests passed.
  
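  Surprisingly, the AI-generated code is not only functionally complete, covering contact management, live search, and a full pipeline dashboard, but also passes every unit test, indicating that AI can not only code quickly but also ensure code quality.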
  
  surprising code-quality ai-capabilities

URL

x.com/cerebras/status/2042015763201221032
www.yanist.com

Clean code in the age of coding agents

1
1. fxp007 10 Apr 2026
  
  in Public
  
  their productivity is affected by the state of the codebase.
  
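  [Inspiration] The deeper significance of this sentence is that it places AI coding agents and human developers on the same axis of evaluation. The question is not whether AI can replace people, but whether AI is affected by code quality in the same way humans are. The answer is yes, which means the code quality practices software engineers have accumulated over decades are not made obsolete by AI's arrival; they become more important because of it. Technical debt goes from "slowly affecting people" to "immediately affecting the AI's token consumption."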
  
  inspiration codebase-quality technical-debt AI-affected-by-code

URL

yanist.com/clean-code-in-the-age-of-coding-agents/
