This turns out to be the big unlock: the latest coding agents against the ~November 2025 frontier models are remarkably effective if you can give them an existing test suite to work against. I call these conformance suites and I’ve started deliberately looking out for them—so far I’ve had success with the html5lib tests, the MicroQuickJS test suite and a not-yet-released project against the comprehensive WebAssembly spec/test collection. If you’re introducing a new protocol or even a new programming language to the world in 2026 I strongly recommend including a language-agnostic conformance suite as part of your project. I’ve seen plenty of hand-wringing that the need to be included in LLM training data means new technologies will struggle to gain adoption. My hope is that the conformance suite approach can help mitigate that problem and make it easier for new ideas of that shape to gain traction.
conformance suites. potential way to introduce new tech and see it adopted despite it not being in llm training data by def.