Key takeaways
- Augment Code took the semantic-indexing engine behind its AI coding assistant and shipped it as a standalone MCP server on February 6, 2026 — any MCP-compatible agent (Claude Code, Cursor, Codex, Zed, Kilo, Roo, and more) can now call it for codebase retrieval
- Vendor-run benchmarks on 300 Elasticsearch PRs (900 attempts) claim 70%+ quality improvement across agents, peaking at 80% for Claude Code + Opus 4.5 — striking numbers, but self-reported and on a single repository
- Two modes: local (the Auggie CLI as a stdio MCP server with real-time working-directory indexing) and remote (Augment-hosted engine over HTTP with GitHub App integration for cross-repo and CI context)
- The company behind it raised a $227M Series B at a $977M valuation in April 2024 ($252M total) from Sutter Hill, Index, Lightspeed, Innovation Endeavors, and Meritech — unusually deep pockets for a context tool
FAQ
What is the Augment Context Engine MCP?
It is Augment Code's proprietary semantic code-search and indexing engine, unbundled from its coding assistant and exposed as an MCP server so any MCP-compatible coding agent can query it for codebase context.
How much does the Augment Context Engine cost?
MCP usage is token-based — LLM tokens at provider rates plus a 40% service fee, averaging roughly $0.03–$0.06 per query — with 1,000 free queries at launch; Augment's broader credit plans run $20–$200/month.
How does local mode differ from remote mode?
Local mode runs the Auggie CLI on your machine as a stdio MCP server that indexes your working directory in real time; remote mode connects to an Augment-hosted engine over HTTP that auto-indexes selected repos' default branches via a GitHub App, enabling cross-repo and CI context.
How is it different from Serena or Repomix?
Serena and Repomix are free, open-source, and fully local (LSP symbols and context packing respectively), while Augment's engine is a paid, closed-source semantic index with a hosted cross-repo mode and vendor-published benchmark claims behind it.
Executive Summary
Augment Context Engine MCP is the unbundling of Augment Code's core asset: the proprietary semantic-indexing engine that powered its AI coding assistant is now a standalone MCP server that any agent — Claude Code, Cursor, Codex, GitHub Copilot, Zed, Kilo Code, Roo Code, Gemini CLI, and others — can call for codebase retrieval.[1][2] It went GA on February 6, 2026, with two modes: a local mode where the Auggie CLI runs as a stdio MCP server indexing your working directory in real time, and a hosted remote mode at api.augmentcode.com/mcp that auto-indexes selected repositories' default branches via a GitHub App for cross-repo and CI context.[1][2]
The launch pitch is a benchmark: on 300 Elasticsearch PRs across 900 attempts, Augment reports that adding the Context Engine improved agent output quality by 70%+ across agents — 80% for Claude Code + Opus 4.5, 71% for Cursor + Opus 4.5, 30% for Cursor + Composer-1 — measured across correctness, completeness, best practices, code reuse, and unsolicited documentation.[1] These numbers are vendor-run, on a single repository, and should be read as such. The strategic shift is notable regardless: a $977M-valuation assistant vendor conceding the agent layer and selling context as the durable primitive — "a weaker model with great context (Sonnet + MCP) can outperform a stronger model with poor context (Opus without MCP)," in the company's own framing.[3][1]
| Attribute | Value |
|---|---|
| Company | Augment Inc. (Augment Code), Palo Alto[3] |
| Founders | Igor Ostrovsky (ex-Pure Storage), Guy Gur-Ari (ex-Google AI researcher)[3] |
| Funding | $252M total — $227M Series B at $977M post-money (April 2024) from Sutter Hill, Index, Innovation Endeavors, Lightspeed, Meritech; $25M Series A led by Sutter Hill[3] |
| Context Engine MCP GA | February 6, 2026[1] |
| Open Source | No — proprietary engine; community wrappers exist[2][4] |
Product Overview
The product is a single MCP tool, codebase-retrieval, backed by Augment's semantic index: the agent asks a natural-language question about the codebase and gets back the relevant code context, instead of burning turns on grep-and-read exploration.[2] Augment claims agents complete tasks up to 2x faster and consume fewer tokens overall because retrieval replaces exploratory file reading — vendor-reported, like the quality numbers.[5]
Beyond code repositories, Context Connectors extend the index to GitLab, Bitbucket, documentation sites, and custom sources; the product demo shows 4,907 files indexed across 5 sources.[5] Setup is self-serve from app.augmentcode.com/mcp/configuration, with 1,000 free queries offered during launch.[5]
Key Capabilities
| Capability | Description |
|---|---|
| Semantic codebase retrieval | Natural-language queries over a semantic index via the codebase-retrieval MCP tool[2] |
| Local real-time indexing | Auggie CLI indexes the working directory with immediate file-change detection; no manual sync[2] |
| Hosted cross-repo context | Remote mode auto-indexes selected repos' default branches on push, for multi-repo questions and CI[2] |
| Context Connectors | GitHub, GitLab, Bitbucket, documentation sites, custom sources[5] |
| Broad agent support | 12 documented client integrations including Claude Code, Cursor, Codex CLI, Zed, Copilot, Kiro, Kilo, Roo, Antigravity, Gemini CLI, OpenCode, Factory Droid[2] |
Product Surfaces
| Surface | Description | Availability |
|---|---|---|
| Local MCP server | Auggie CLI over stdio, real-time working-directory index | GA[2] |
| Remote MCP server | Augment-hosted engine over HTTP with GitHub App | GA[2] |
| Context Engine SDK | Programmatic access for building on the engine | Available (docs published December 2025)[6] |
Technical Architecture
Local mode keeps indexing on your machine — "our indexer runs locally on your machine," per the product page — and is the recommended mode for active development, since edits are reflected immediately.[5][2] Remote mode trades that locality for breadth: an Augment-hosted index that updates when commits land on default branches, suited to understanding unfamiliar code, cross-repository questions, and CI environments where no working directory exists.[2] The underlying ranking and embedding stack is not publicly documented, and the engine itself is closed source — before the official launch, a community member published their own MCP wrapper around the Auggie CLI to get at the engine, an early demand signal the GA product effectively absorbed.[4]
Key Technical Details
| Aspect | Detail |
|---|---|
| Deployment | Local (Auggie CLI, stdio) or Augment-hosted remote (HTTP); no self-hosted server option[2] |
| MCP tools | codebase-retrieval[2] |
| Integrations | 12 documented agent clients; GitHub App for remote indexing; GitLab/Bitbucket/docs connectors[2][5] |
| Open Source | No — proprietary engine and index[2] |
Strengths
- The engine has an independent reputation — years before the unbundling, HN commenters consistently identified the context engine as Augment's differentiator, which makes selling it standalone credible rather than a pivot of desperation.[6]
- Works with the agents developers already use — rather than competing with Claude Code and Cursor for the agent seat, it upgrades them; 12 documented integrations at launch.[2]
- Cross-repo hosted mode is genuinely differentiated — local-only rivals (Serena, Repomix, CodeGraph-style indexers) cannot answer questions spanning dozens of repositories or serve CI; the GitHub App mode can.[2]
- Published, reproducible-in-principle benchmark methodology — 300 real Elasticsearch PRs, 3 prompts each, 5 scored dimensions is more rigor than most context-tool marketing, even if self-run.[1]
- Deep-pocketed vendor — $252M raised at a $977M valuation gives the engine more runway than the typical open-source context tool's single maintainer.[3]
Cautions
- All headline numbers are vendor-reported — the 70%+/80% improvements come from Augment's own evaluation on one repository (Elasticsearch); no independent replication exists as of June 2026.[1]
- Marketing figures have drifted — the product page now attributes the +80% Claude Code result to Opus 4.6 while the launch post said Opus 4.5, a small inconsistency that does not build confidence in precision.[5][1]
- A 40% service fee on top of LLM tokens — every query bills the underlying tokens at provider rates plus 40%, roughly $0.03–$0.06 per query; agents that retrieve aggressively will compound this.[2]
- Pricing trust deficit at the parent company — third-party analysis of r/AugmentCodeAI sentiment reports the 2025 credit-pricing migration "eroded trust," with departing users recommending flat-rate Claude Code Max instead.[7]
- Closed source with no self-hosted server — remote mode puts your cross-repo index on Augment's cloud, and there is no auditable core; free local alternatives like Serena and Repomix exist for the privacy-sensitive.[2]
- Strategic ambiguity — Augment still sells its own assistant; whether the MCP product gets first-class long-term investment or remains a funnel to the suite is unproven four months post-GA.[1]
What Developers Say
There is no dedicated HN launch thread for the Context Engine MCP as of June 2026 — the only direct story (the Context Engine SDK, December 2025) drew 2 points and zero comments. The engine itself, inside Augment's assistant, has a longer comment trail.[6]
"Augment's main value add is their context engine, and imo they do it really well." — kenforthewin on Hacker News, December 2025[6]
"The context engine is the magic sauce. It's miles ahead of all the other tools…" — evelant on Hacker News, May 2025[6]
"We used to use Augment Code at work which has a thing called Context Engine…" — parasti on Hacker News, May 2026, describing a team that has since moved to Claude Code[6]
On the critical side, one HN commenter in July 2025 dismissed the branding as "a fancy way of saying" they index code for navigation, while praising results on medium codebases but flagging reliability issues and cost; and CostBench's aggregation of r/AugmentCodeAI characterizes the consensus on the parent company's 2025 pricing migration as a rollout "handled poorly."[6][7]
Pricing & Licensing
| Tier | Price | Includes |
|---|---|---|
| Launch promo | Free | 1,000 free queries during launch[5] |
| MCP usage | Token-based | LLM tokens at provider rates + 40% service fee; ~$0.03–$0.06 per average query[2] |
| Indie | $20/month | 40,000 credits/month (Augment plan)[7] |
| Standard | $60/month | 130,000 credits/month, up to 20 users[7] |
| Max | $200/month | 450,000 credits/month[7] |
| Enterprise | Custom | Custom credit limits, unlimited users[7] |
Licensing model: Proprietary, closed source; access via Augment account and MCP configuration at app.augmentcode.com.[2]
Hidden costs: The 40% service fee rides on top of model tokens you are already paying for; credit top-ups run $15 per 24,000 credits, expire after 12 months, and monthly credits do not roll over.[2][7]
Competitive Positioning
Direct Competitors
| Competitor | Differentiation |
|---|---|
| Serena | Free, open-source, fully local LSP-based semantic toolkit; symbol-level precision but no hosted cross-repo index or published agent-improvement benchmarks |
| CodeGraph | Knowledge-graph approach — explicit structural relationships rather than semantic retrieval; open source and self-hostable where Augment is closed and managed |
| Repomix | Context packing, not retrieval — flattens a repo into one AI-friendly file; zero cost and zero server, but no query-time semantic search |
| Sourcegraph / native agent search | Claude Code and Cursor ship their own grep/embedding search; Augment's bet is that its dedicated index beats built-ins by the margins in its benchmark[1] |
When to Choose Augment Context Engine Over Alternatives
- Choose Augment Context Engine when: you work across large or multiple repositories, want one retrieval layer shared by every agent your team runs, and accept paying per query for vendor-claimed quality gains.[2]
- Choose Serena when: you want free, local, auditable symbol-level context with no per-query cost or cloud dependency.
- Choose CodeGraph when: explicit dependency and call-graph structure matters more than semantic similarity.
- Choose Repomix when: the codebase fits in the context window and packing once beats querying repeatedly.
Ideal Customer Profile
Best fit:
- Teams running coding agents against large, old, or multi-repo codebases where built-in search demonstrably fails
- Organizations already paying for Augment Code that can extend the same index to every MCP-capable agent
- CI and automation pipelines that need codebase context without a local checkout, via the hosted mode[2]
Poor fit:
- Privacy-sensitive teams unwilling to put a cross-repo semantic index on a vendor's cloud
- Small single-repo projects where Repomix or an agent's native search is free and sufficient
- Cost-sensitive workflows where a 40% token surcharge per retrieval compounds badly[2]
Viability Assessment
| Factor | Assessment |
|---|---|
| Financial Health | Strong on paper — $252M raised at a $977M valuation, though the Series B dates to April 2024[3] |
| Market Position | First-mover among well-funded vendors unbundling context as an MCP product; the engine's reputation predates the launch[1][6] |
| Innovation Pace | High — SDK (December 2025), MCP GA (February 2026), Context Connectors, 12 client integrations within months[6][2] |
| Community/Ecosystem | Thin for the MCP product specifically — no HN launch thread, sentiment mostly inherited from the assistant era, including pricing-migration resentment[6][7] |
| Long-term Outlook | Hinges on independent benchmark validation and whether agent vendors' native retrieval closes the gap[1] |
The unbundling reads as a clear-eyed strategic call: the agent layer commoditized around Claude Code and Cursor, and Augment moved its real asset to where the buyers are. The open questions are empirical — nobody outside Augment has reproduced the 70%+ numbers — and structural: every agent vendor is improving native codebase search, so the paid retrieval layer must keep outrunning free built-ins to justify a 40% token surcharge.[1][2]
Bottom Line
Augment Context Engine MCP is the most credible paid entry in the code-intelligence category: a reputation-backed semantic engine, a hosted cross-repo mode nothing open-source matches, and benchmark methodology that is at least published — but every impressive number on the page is the vendor grading its own homework, and the parent company carries community baggage from its 2025 pricing migration. Pilot it on a codebase where your agent demonstrably flails; let the free alternatives win everywhere else.
Recommended for: Teams with large or multi-repo codebases who can A/B the quality gain themselves, and existing Augment customers extending the index to other agents.
Not recommended for: Single-repo projects, privacy-sensitive organizations, or anyone unwilling to pay a per-query surcharge before independent benchmarks exist.
Outlook: Watch for third-party replication of the Elasticsearch benchmark, whether the SDK spawns an ecosystem, and whether Claude Code's and Cursor's native retrieval erases the margin the business depends on.
Research by Ry Walker Research • methodology
Sources
- [1] Augment Code Blog: Context Engine is now available for any AI coding agent
- [2] Augment Docs: Context Engine MCP Overview
- [3] Business Wire: Augment Inc. Raises $227 Million at $977 Million Valuation
- [4] GitHub: aj47/auggie-context-mcp (community wrapper)
- [5] Augment Code: Context Engine MCP Product Page
- [6] Augment context engine mentions on Hacker News (Algolia search)
- [7] CostBench: Augment Code Pricing 2026