Essay

The 18-day Claude Max burst: what compounding agentic-SWE practice unlocks

May 11, 2026 · ~10 min read

1,005 tickets merged. 1,458 processed. 52 agents deployed across 9 repos. What actually happens when you run a production multi-agent OS at full throttle for 18 consecutive days.

On May 7, 2026, Phase 6 of the Master OS went live. Over the 18 days that followed, the system processed 1,458 tickets and merged 1,005 of them — a 69% merge rate across 9 git repositories, 52 deployed agents, and 7 active business domains.

I want to explain what actually happened during those 18 days, because “I merged 1,005 tickets” is a fact that sounds impressive and doesn’t tell you anything useful.

What a ticket is

In this system, a ticket is a unit of work with a finite state machine: ready → in_progress → verifying → merged (or blocked / gate_failed for failures). Every ticket has a spec, files_predicted, and acceptance_criteria. Those three fields are the quality gate for dispatch — a ticket without all three doesn’t get picked up by an executor.

The ticket executor is a Claude Code subagent spawned with the ticket’s spec, the predicted files, and a set of hard constraints. It does the work, runs verification, spawns a verifier subagent, and merges to main if the verifier returns pass. The operator (me) sees the notification when the ticket reaches merged state.

That’s the unit. 1,005 of those, in 18 days.

What compounding looks like in practice

The first few days were slow. Not because the system was broken — because I was still calibrating the spec quality. A ticket with a vague spec produces a vague implementation. The executor doesn’t know what done looks like, so it makes decisions that may or may not align with what you actually wanted.

By day 4, I had internalized the spec discipline: files_predicted tells the executor what to touch; acceptance_criteria tells the verifier what to check. When both are precise, merge rate is high. When either is vague, you get remediation loops.

By day 8, I had built tooling to enforce spec quality at creation time — the dispatcher filters tickets missing either field. That closed the feedback loop structurally, not by reminding myself.

By day 14, the system was running 8 parallel ticket executors with intelligent slot management. When a ticket finished, the next one in the priority queue picked up the open slot. No wave barriers. No manual dispatch decisions for operational tickets. I was spending my time on spec quality, strategic prioritization, and the decisions that genuinely required human judgment.

That’s what compounding looks like: the system gets smarter about its own bottlenecks and structures them away.

The 31% that didn’t merge

Of the 1,458 processed tickets, 453 didn’t reach merged state during the burst. Most of those are not failures — they’re deferred work. Some are intentionally drafted but not dispatched (waiting for dependencies, waiting for a strategic decision, waiting for a business event that hasn’t happened yet). Some hit gate failures — the pre-merge gate rejected them — and are queued for retry.

A small number hit genuine blockers: missing credentials, a spec that required operator input, a dependency on something external. Those surfaced as blocked with a specific blocker_reason. That’s the right outcome — a system that surfaces its own blockers cleanly is more useful than one that silently fails.

The merge rate is a throughput metric, not a success rate metric. The question isn’t “why didn’t the other 453 merge?” — it’s “did the right tickets merge, and did the rest surface cleanly?” The answer to both is yes.

What I learned about operating at this speed

The operator’s job in a high-throughput agentic system is not execution. Execution is delegated. The operator’s job is:

  1. Spec quality. A precise spec is worth 10x a vague one. The executor can’t read your mind. Write the acceptance criteria like you’re writing a test.
  2. Priority management. The queue will always have more work than capacity. Decide what matters this session. The system will handle the rest.
  3. Exception handling. When a ticket hits a genuine blocker, the operator needs to resolve it quickly or the slot sits idle. Build the reflex to triage blockers fast.
  4. Strategic review. At the end of each session, scan what merged. Does it add up to the thing you were trying to build? If not, the spec quality was fine but the prioritization was off.

The 18-day burst taught me more about operating an agentic system than the 18 months of practice that preceded it. The feedback loops are faster when the system is running at full throttle.


The Master OS is documented in the case study at seantraynor.dev/case-studies/master-os.