There is a framing problem in the market. People talk about agent harnesses as if the harness is the interesting part. It is not. The interesting part is the factory.
An agent harness is a runtime wrapper. It takes a prompt, some tools, maybe some context, and executes a task. That is table stakes. Every serious engineering team can build one. Many already have — the recent wave of companies publicly disclosing their internal agent infrastructure proves this. Block had teams building internal harnesses before they laid those teams off. Stripe has been doing similar work. Ramp, DoorDash, and dozens of others are in various stages of the same journey.
The factory is something different. The factory is the execution engine and the sandboxing system. It is the orchestration layer that handles long-running, resumable work. It is the context management that survives across sessions. It is the model translation infrastructure that lets you start a task on one model and continue it on another without losing state.
That last one sounds simple. It is brutally hard. It requires session translation systems that handle the impedance mismatch between different model APIs, different context window sizes, and different tool-calling conventions. The harness is a weekend project. The factory is a multi-year platform investment.
This is where the real defensibility lives. Not in the agent definition. Not in the prompt. In the ugly, complex infrastructure that makes agents actually work in production. I've argued elsewhere that the framework trap catches every company that mistakes the wrapper for the product. The companies building factories will quietly become the layer underneath the next decade of agent applications — and most of the customers will never know their name.
Sources
Related Essays
The Framework Trap
LangChain pivoted to LangSmith. E2B sells the sandbox. The agent harness is not the product — it is the thing you give away. Monetization lives in the infrastructure underneath.
The Agent Harness Problem
Enterprise agents need layered interfaces, real software skills, and flexible platforms. The harness around the model matters more than the model.
The Harness Is Commoditized. Everything Else Is Not
The agent harness — Claude Code, OpenCode, Goose, Aider — is a commodity. Companies migrate between them freely. The defensible layers are context, orchestration, and tools.
Key takeaways
- The agent harness is a runtime wrapper. The factory is the system that produces, deploys, observes, and maintains agents at scale.
- Block, Stripe, Ramp, and DoorDash have all built internal harnesses — and several have already discarded them. The harness is not a moat.
- Defensibility lives in session translation, sandboxing, orchestration, and the unglamorous infrastructure that makes agents work in production.
FAQ
What is the difference between a harness and a factory?
A harness takes a prompt, some tools, and runs a single task. A factory is the production line — sandboxing, orchestration, persistence, model translation, observability, and policy enforcement. The harness is one machine. The factory is the building it sits in.
What is "session translation" and why is it brutally hard?
It is the infrastructure that lets you start a task on one model and finish it on another without losing state. That requires reconciling different APIs, context window sizes, tool-calling conventions, and message formats — all while preserving the task's semantic state. Easy to describe, painful to implement.