Field Notes

Dan Grafham

Field notes on AI-assisted development. Concepts, failure modes, and the operating model underneath.

June 2026

Instructions Are Not Architecture

An instruction is one possible home for a rule, not the only one, and rarely the best one. Where every rule belongs in an autonomous system: trash and runtime are valid answers, and hidden context is drift, not doctrine.

The Real World Must Sign For It

An AI-first factory gets to re-audit the whole testing inheritance. Tests are instruments, end-to-end evals are the contract, and reality is the final approver. Oracle capture is what happens when the worker captures the contract. Deploy is not proof.

The Commit Log Is the Boring Part

Someone told me to show proof my agents ship work on their own. A screenshot is the wrong thing to ask for. It's the exhaust, not the engine. Here is the machinery that makes 'merged overnight' mean 'merged correctly,' and the honest line of what is actually autonomous.

May 2026

Cages and Constitutions

The case against wrapping agents in control code is mostly right. Most of that scaffolding is a cage. But some of what looks like a rail is a constitution, the small set of boundaries that must hold no matter how capable the actor becomes. The agent can be free. The institution still needs a constitution.

Prompt Sprawl

Rules pile up in the prompt, where nothing can enforce them. After a week of sharpening one rule, I deleted the role instead. Prompt Sprawl is the anti-pattern; the fix is moving the rule out of the prompt and into the runtime.

The Supervisor Tier

Most agent systems bubble every failure up to a human supervisor at the top of the stack. Erlang solved this in 1986. EVE Online re-derived it in 2008. The agent layer is the third domain to need the pattern.

Prompt Injection is a Privilege Problem

Most current prompt-injection mitigation is security theater. Software security solved the same kind of bug class structurally in 1995. The agent industry is repeating the wrong path.

The Pixel-Pushing Trap

AI makes it easy to build more. You need to make sure you're building the right things. The factory is running. It's making the wrong thing.

B2A: Business to Agent

The third distribution model after B2B and B2C. Your customers are AI agents finding you before a human ever sees your URL. MCP is the new SEO, and the conversion funnel is a JSON schema.

The Doctrine Bifurcation

Doctrine bifurcates into two registers: compiled doctrine a runtime enforces, and interpretive doctrine agents read. Most agent frameworks have only one. The split is what makes audit-by-construction enforceable.

Doctrine Engineering

Prompt engineering scales to one agent. Doctrine engineering scales to one hundred. The next discipline, the practice that emerges when probabilistic compliance breaks at scale, and why the term-of-art slot is open right now.

Shipping Paralysis & World Domination Scope

Why AI making code free didn't make shipping easier, and the specific traps that keep a genuinely good project permanently local.

April 2026

AI Debt

What happens when the framework starts serving itself instead of shipping the product. Distinct from technical debt; AI Debt lives in the prose, scaffolding, and meta-process around the code.

Humans Moving Upchain

The substrate-level shift that the entire AI moment is producing. Agents move in below. Humans relocate above.

The Cambrian Explosion of Software

Software was stuck single-celled for fifty years. AI is the oxygen. Why every software category is about to be re-founded, and how to tell which game you're playing.

The Dark Factory

A deep dive into Level 5: what it actually requires, what it looks like in practice, and what kind of person could architect one.

Five Levels of AI Programming

A working framework for the current state of AI-assisted development, and a clear map of where the industry is heading.