Field Notes
Field notes on AI-assisted development. Concepts, failure modes, and the operating model underneath.
Instructions Are Not Architecture
An instruction is one possible home for a rule, not the only one, and rarely the best one. Where every rule belongs in an autonomous system: trash and runtime are valid answers, and hidden context is drift, not doctrine.
The Real World Must Sign For It
An AI-first factory gets to re-audit the whole testing inheritance. Tests are instruments, end-to-end evals are the contract, and reality is the final approver. Oracle capture is what happens when the worker captures the contract. Deploy is not proof.
The Commit Log Is the Boring Part
Someone told me to show proof my agents ship work on their own. A screenshot is the wrong thing to ask for. It's the exhaust, not the engine. Here is the machinery that makes 'merged overnight' mean 'merged correctly,' and the honest line of what is actually autonomous.
Cages and Constitutions
The case against wrapping agents in control code is mostly right. Most of that scaffolding is a cage. But some of what looks like a rail is a constitution, the small set of boundaries that must hold no matter how capable the actor becomes. The agent can be free. The institution still needs a constitution.
Prompt Sprawl
Rules pile up in the prompt, where nothing can enforce them. After a week of sharpening one rule, I deleted the role instead. Prompt Sprawl is the anti-pattern; the fix is moving the rule out of the prompt and into the runtime.
The Supervisor Tier
Most agent systems bubble every failure up to a human supervisor at the top of the stack. Erlang solved this in 1986. EVE Online re-derived it in 2008. The agent layer is the third domain to need the pattern.
Prompt Injection is a Privilege Problem
Most current prompt-injection mitigation is security theater. Software security solved the same kind of bug class structurally in 1995. The agent industry is repeating the wrong path.
The Pixel-Pushing Trap
AI makes it easy to build more. You need to make sure you're building the right things. The factory is running. It's making the wrong thing.
B2A: Business to Agent
The third distribution model after B2B and B2C. Your customers are AI agents finding you before a human ever sees your URL. MCP is the new SEO, and the conversion funnel is a JSON schema.
The Doctrine Bifurcation
Doctrine bifurcates into two registers: compiled doctrine a runtime enforces, and interpretive doctrine agents read. Most agent frameworks have only one. The split is what makes audit-by-construction enforceable.
Doctrine Engineering
Prompt engineering scales to one agent. Doctrine engineering scales to one hundred. The next discipline, the practice that emerges when probabilistic compliance breaks at scale, and why the term-of-art slot is open right now.
Shipping Paralysis & World Domination Scope
Why AI making code free didn't make shipping easier, and the specific traps that keep a genuinely good project permanently local.
AI Debt
What happens when the framework starts serving itself instead of shipping the product. Distinct from technical debt; AI Debt lives in the prose, scaffolding, and meta-process around the code.
Humans Moving Upchain
The substrate-level shift that the entire AI moment is producing. Agents move in below. Humans relocate above.
The Cambrian Explosion of Software
Software was stuck single-celled for fifty years. AI is the oxygen. Why every software category is about to be re-founded, and how to tell which game you're playing.
The Dark Factory
A deep dive into Level 5: what it actually requires, what it looks like in practice, and what kind of person could architect one.
Five Levels of AI Programming
A working framework for the current state of AI-assisted development, and a clear map of where the industry is heading.