30 Matching Annotations
  1. Jun 2026
    1. The result is "almost-right code … the most expensive kind of wrong" because reviewers must reconstruct the missing system context for every change

      几乎正确的代码是代价最高昂的错误, 丢失了上下文

    2. The inner-loop/outer-loop framing

      内外环模型 https://www.reddit.com/r/AI_Agents/comments/1n1zjwd/the_outer_loop_vs_the_inner_loop_of_agents_a/

      The inner loop is the control cycle of a single agent that hat gets some work (human or otherwise) and tries to complete it with the assistance of an LLM. The inner loop of an agent is directed by the task it gets, the tools it exposes to the LLM, its system prompt and optionally some state to checkpoint work during the loop. In this inner loop, a developer is responsible for idempotency, compensating actions (if certain tools fails, what should happen to previous operations), and other business logic concerns that helps them build a great user experience. This is where workflow engines like Temporal excel, so we leaned on them rather than reinventing the wheel. 内环是单个智能体的控制周期,它接收某些工作(来自人类或其他来源),并尝试在 LLM 的协助下完成。智能体的内环由其接收的任务、向 LLM 开放的工具、系统提示词(system prompt)以及可选的用于在循环期间记录工作检查点的状态所引导。在内环中,开发者负责幂等性、补偿操作(如果某些工具失败,之前的操作该如何处理)以及其他有助于构建卓越用户体验的业务逻辑关注点。这正是像 Temporal 这样的工作流引擎所擅长的领域,因此我们选择依赖它们,而不是重新造轮子。

      The outer loop is the control loop to route and coordinate work between agents. Here dependencies are coarse grained, where planning and orchestration are more compact and terse. The key shift is in granularity: from fine-grained task execution inside an agent to higher-level coordination across agents. We realized this problem looks more like proxying than full-blown workflow orchestration. This is where next generation proxy infrastructure like Arch excel, so we leaned on that. 外环是用于在智能体之间路由和协调工作的控制循环。在这里,依赖关系是粗粒度的,规划和编排更加紧凑和简洁。关键的转变在于粒度:从智能体内部的细粒度任务执行,转向跨智能体的高层级协调。我们意识到这个问题看起来更像是一种代理(proxying)行为,而不是完整的工作流编排。这正是像 Arch 这样的下一代代理基础设施所擅长的领域,因此我们选择了依赖它。

    3. AI-generated UIs match target designs at 83–89% fidelity, with failures "localized and non-catastrophic" — fine-grained styling/spacing rather than structural errors.

      细粒度的样式问题而非结构性错误 -> 视觉回归测试集成到 CI -> 是否真的有用?

    4. AI accelerates the "inner loop" (coding) while the "outer loop" (review, security, compliance, deployment)

      这个生命周期的架构很适合写进 PPT

    1. CodeRabbit's Dec 2025 analysis found AI co-authored code carried ~1.7× more "major" issues. Treat technical debt and security regressions as default risks, not edge cases.

      技术债和安全性是默认风险

    2. Stage 2 — Build the harness on one team (weeks 4–12).

      正在做的

      https://www.reddit.com/r/ExperiencedDevs/comments/1rbhxsv/anyone_else_trimming_down_aigenerated/ - AI 很容易以极高的效率,生成一堆完全符合 Lint 规范、却在架构上极其短视或臃肿的“垃圾工程” - 架构设计充满了“隐性知识”与折衷(Trade-offs)

      人类定骨架,AI 填血肉: 明确定义好项目结构、API 边界、规范文件 https://martinfowler.com/articles/harness-engineering.html

      当绿地项目逐渐演变为一个“边界清晰、规范确立”的成熟 codebase 时,才是从零构建并启动 Harness 自动化控制网格的最佳时机 https://www.reddit.com/r/opencodeCLI/comments/1trz990/what_do_you_use_to_start_a_greenfield_project/

    3. LLM's "vibes" in a way "better suited for a weekend project, rather than for customer-facing applications of a publicly listed company" (CodeRabbit).

      适合 sideproject 而非面向客户的生产级应用

    4. A practical "form factor" summary: the minimal credible AI-native workflow deliverable is a Git repo containing AGENTS.md (router) + docs/ (system of record) + .claude/skills/*/SKILL.md (procedures) + Spec Kit gates + test/CI/observability hooks + a settings/permissions file. Open-source scaffolders (e.g., harness-creator skills) can generate this skeleton in minutes.

      一个 ai native workflow 脚手架可以 apply 到所有项目

    5. 5. How to build an AI-native workflow from zero

      PPT 部分, 可以介绍目前我们有的, 正在做的, 限制有哪些, 下一步做什么

    6. The convergent architectural components

      这个架构可以画到 PPT: rule | knowledge (packaged) | scaffolding (spec-driven) | harness (feedback loops)

    7. harness: "the system around the model. It decides how work gets split, which subagents spawn, what tools each one gets, how their output is verified... and when the job is actually done" (Anthropic).

      harness 的真正定义: 一个治理框架

    8. "AI-native" is distinct from "AI-assisted/AI-augmented." IBM's definition: "AI-augmented systems rely on AI as a supporting tool, whereas AI-native systems are AI-driven at their core... if the AI were to be removed, the product would not just cease to function as intended, it would cease to be useful at all."

      由 AI 驱动, 移除 AI 则无法按预期运行

    9. value comes from rearchitecting workflows, not from handing developers tools.

      核心价值来源于工作流重构而非仅仅提供更强的工具

    10. is not a prompt — it's a harness.

      不是一个提示词, 而是一个系统框架 指令 + 路由 + 打包的知识 (skill) + 开发脚手架 + 反馈循环

    11. a system where humans steer and agents execute, anchored by a reusable, version-controlled artifact (a "harness").

      最后是一个 version-controlled artifact -> harness