The Sprint Contract Effect: Measuring Overrun Reduction in Solo-Operator Engineering
Abstract
Solo operators running agent-mediated engineering work routinely overrun sprint estimates by 50-200% of the originally scoped time. We measured the effect of adopting a structured sprint-contract discipline (pre-work definition of done, enumerated failure modes, gate-out acceptance criteria) on contract overrun rate across n=48 contracts shipped over 16 weeks by one operator across three ventures. Median overrun fell from 78% pre-discipline to 12% post-discipline (Wilcoxon signed-rank p < 0.001), with the largest reduction in contracts whose original definition of done was vague or testable only on completion. The discipline transferred fully across ventures (engineering, operations, legal) without per-domain tuning.
1. Introduction
Estimation accuracy is a long-studied problem in software engineering — the median project ships at roughly 1.7× its original estimate, with variance dominated by the long right tail of severely overrun projects (Jørgensen & Shepperd 2007 [3]). The reasons are well documented: planning fallacy [4], optimism bias, scope inflation, the inability to enumerate failure modes ex ante.
Solo operators running agent-mediated engineering work have all of these problems plus several novel ones. The agent reads vague briefs as license to scope expansively. “Done” without an explicit definition becomes whatever the agent renders. Mid-session detours into research that turn into multi-day rabbit holes are routine. Without explicit pre-work contracts, individual sessions ship reasonable artifacts, but the artifact is rarely the one originally requested.
We present an empirical study of one operator across three ventures, measuring the effect of adopting a structured sprint-contract discipline on contract overrun rates. The study is small (n=48 contracts, single operator, 16 weeks), but the within-subject design — same operator, same ventures, same agent stack, before vs after a single intervention — produces an unusually clean signal: median overrun fell from 78% pre-discipline to 12% post-discipline, with the largest reduction in contracts whose original definition of done was vague.
The rest of TPL-2026-002 is for subscribers.
The Sprint Contract Effect: Measuring Overrun Reduction in Solo-Operator Engineering
- Every Expert-tier lesson — diagnostic prompts, transcripts, prompt kits, full homework
- Every research paper — methodology, figures, tables, reproducibility appendices
- New Expert lessons + papers as they ship (quarterly cadence)
- Foundations + Operating lessons stay free; bundles on GitHub stay free; this tier is the deep stuff
Free while the early catalog ships. Paid tier comes later — subscribe now and you’re grandfathered in.