← Back to research
·6 min read·company

Stripe Minions

Stripe's in-house coding agents produce 1,000+ merged PRs per week — fully unattended, human-reviewed code built on a Goose fork.

Key takeaways

  • 1,000+ pull requests merged per week with zero human-written code
  • Built on a fork of Block's open-source Goose agent
  • MCP server ('Toolshed') provides 400+ internal tools for context

FAQ

What is Stripe Minions?

Stripe's internal coding agent system that writes code end-to-end and ships over 1,000 merged pull requests per week, with human review but no human-written code.

How do Stripe Minions work?

Engineers invoke agents via Slack, CLI, or web. Agents run in isolated pre-warmed 'devboxes,' access 400+ internal tools via MCP, and produce PR-ready code within 2 CI rounds.

What is Stripe Minions built on?

Minions are built on a fork of Block's open-source Goose coding agent, extended with deep internal tool integrations.

Executive Summary

Stripe Minions is the most detailed public case study of enterprise in-house coding agents. The system produces over 1,000 merged pull requests per week — with humans reviewing the code but writing none of it. Built on a fork of Block's open-source Goose agent, Minions integrate deeply with Stripe's existing developer infrastructure through a central MCP server called "Toolshed" with 400+ internal tools.

AttributeValue
CompanyStripe
TypeInternal tool (not for sale)
FoundationGoose fork (Block)
Public DocumentationFebruary 2026
HeadquartersSan Francisco, CA

Product Overview

Stripe Minions are fully unattended coding agents designed for "one-shot" tasks — given a ticket, bug report, or specification, they produce a complete pull request without human intervention during the coding process. The key innovation is deep integration with Stripe's existing developer tooling, allowing agents to work with the same context and tools as human engineers.

Key Capabilities

CapabilityDescription
End-to-end codingWrites code from start to finish based on tickets/specs
Multi-surface invocationSlack (primary), CLI, web UI, internal tool integrations
MCP context400+ tools via central "Toolshed" server (docs, tickets, Sourcegraph)
Parallel executionMultiple minions run simultaneously without git conflicts

Product Surfaces

SurfaceDescriptionNotes
SlackTag agent in any thread to invokePrimary interface
CLICommand-line for power usersSecondary
WebVisibility and managementAdministrative
Internal toolsDeep integration with ticketing, feature flagsStripe-specific

Technical Architecture

Minions run on pre-warmed "devboxes" — isolated development environments that spin up in 10 seconds. These environments are identical to what human engineers use, but isolated from production and the internet for security.

Architecture Flow

Invocation (Slack/CLI/Web)
    ↓
Devbox (isolated sandbox, 10s spin-up)
    ↓
MCP Server (Toolshed: 400+ tools)
    ↓
Agent Loop (Goose fork)
    ↓
Local lint (under 5s) → CI (max 2 rounds) → Auto-fix
    ↓
Pull Request (human review required)

Key Technical Details

AspectDetail
FoundationFork of Block's Goose agent
ExecutionPre-warmed devboxes (10-second spin-up)
ContextMCP server with 400+ tools
CI StrategyLocal lint (under 5s), max 2 CI rounds, auto-apply fixes
IsolationNo production access, no internet

Strengths

  • Proven scale — 1,000+ PRs merged weekly, validated in production at one of the world's most demanding codebases
  • Deep integration — MCP server provides agents the same context human engineers have (docs, tickets, build status, Sourcegraph search)
  • Fast iteration — 10-second devbox spin-up enables rapid feedback loops
  • Leverages existing infra — Uses same environments as human engineers, reducing agent-specific edge cases
  • Parallelization — Engineers run multiple Minions simultaneously without git worktree conflicts

Cautions

  • Requires massive codebase investment — Stripe has hundreds of millions of lines of code and years of devex tooling; this isn't replicable overnight
  • Human review bottleneck — Agents write code but don't merge; humans must still review everything
  • Stack-specific patterns — Ruby/Sorbet environment; some patterns may not transfer to other stacks
  • Dedicated team required — "Leverage team" maintains the system full-time
  • Not for sale — This is internal tooling, not a product you can buy

Competitive Positioning

vs. Other In-House Agents

SystemDifferentiation
StrongDM FactoryMinions require human review; StrongDM eliminates it entirely
Ramp InspectSimilar architecture; Minions have deeper MCP integration
Coinbase ClaudebotMinions built on Goose fork; Coinbase appears Anthropic-native

When to Reference Stripe Minions

  • Reference when: Building in-house agents at 1,000+ engineer scale
  • Consider alternatives when: Team under 100 engineers, no existing devex investment
  • Buy instead when: Need agents in weeks, not quarters (consider Tembo, Claude Code, Codex)

Ideal Customer Profile

This is internal tooling, not a product for sale. However, the architecture is worth studying if you are:

Good fit for similar build:

  • Engineering organization with 1,000+ engineers
  • Existing devex team (3+ engineers) that can be redirected
  • Codebase exceeding 10M LOC with proprietary frameworks
  • Mature internal tooling (ticketing, CI, monitoring)

Poor fit:

  • Teams under 100 engineers (buy instead)
  • Standard tech stack without proprietary frameworks
  • No existing devex investment
  • Need results in weeks, not quarters

Viability Assessment

FactorAssessment
Documentation QualityExcellent (detailed public blog post)
ReplicabilityDifficult (requires massive devex investment)
Open Source FoundationYes (Goose is open source)
Architecture MaturityHigh (production-validated at scale)
TransferabilityMedium (principles transfer; implementation is Stripe-specific)

Stripe's detailed documentation makes Minions the reference architecture for enterprise in-house coding agents. While the specific implementation requires Stripe-level investment, the patterns — MCP context, isolated sandboxes, CI integration, Slack invocation — are universally applicable.


Bottom Line

Stripe Minions represent the gold standard for in-house coding agents at elite engineering organizations. The key insight: agents need the same context and tools as human engineers, not a bolted-on integration.

Key metrics: 1,000+ PRs/week, 10-second devbox spin-up, 400+ MCP tools, max 2 CI rounds.

Architecture pattern: Slack invocation → isolated sandbox → MCP context → CI loop → human review → merge.

Recommended study for: Engineering leaders evaluating build vs. buy for coding agents, devex teams designing agent infrastructure.

Not recommended for: Small teams, organizations without existing devex investment, anyone expecting a product they can purchase.

Outlook: Stripe has defined the reference architecture. Expect commercial vendors (Tembo, Devin, etc.) to offer "Stripe Minions in a box" for organizations that can't build their own.


Research by Ry Walker Research • methodology

Disclosure: Author is CEO of Tembo, which offers agent orchestration as an alternative to building in-house.