Key takeaways
- Born from the influential MemGPT research paper at UC Berkeley, now a YC-backed startup with $10M seed funding
- Tiered memory architecture (core, archival, recall) gives agents persistent state across conversations
- Model-agnostic design supports OpenAI, Anthropic, Mistral, and local models — no vendor lock-in
- Letta Code (42.5% on Terminal-Bench, #1 model-agnostic open-source agent) is now the company's center of gravity — desktop app launched April 2026, Channels replaced LettaBot in May 2026
FAQ
What is Letta?
Letta is a platform for building stateful AI agents with advanced memory. Originally the MemGPT research project from UC Berkeley, it provides tiered memory systems that let agents remember, learn, and self-improve over time.
How much does Letta cost?
As of June 2026, Letta offers a free tier (up to 3 managed agents, BYOK, no login required for local use), a Pro plan at $20/month with up to 20 stateful agents plus a Letta Auto model quota with pay-as-you-go overage, and custom Team/Enterprise pricing. The previously listed $200/month Max tier no longer appears on the pricing page.
How is Letta different from Mem0?
Letta provides a full agent framework with built-in memory management, tool calling, and an agent development environment. Mem0 is more focused on being a standalone memory layer you add to existing agents. Letta is heavier but more integrated.
Can I self-host Letta?
Yes, the core Letta framework is open source (Apache 2.0) and can be self-hosted. The cloud platform adds hosted infrastructure, the ADE (Agent Development Environment), and managed scaling.
Executive Summary
Letta is a platform for building stateful AI agents with advanced memory systems. Originally the MemGPT research project from UC Berkeley's AI Research Lab (BAIR), it introduced the concept of virtual context management — using an OS-inspired memory hierarchy to give LLMs effectively unlimited context. The company emerged from stealth in September 2024 with $10M in seed funding led by Felicis, with participation from Founders Fund and YC.
| Attribute | Value |
|---|---|
| Company | Letta (formerly MemGPT) |
| Founded | 2023 (research), 2024 (company) |
| Funding | $10M seed (Felicis, Founders Fund, YC) |
| Founders | Charles Packer, Sarah Wooders, Vivian Myers |
| Origin | UC Berkeley BAIR Lab |
| GitHub Stars | 23.3K+ (as of June 2026) |
| License | Apache 2.0 |
How It Works
Letta's core innovation is a tiered memory architecture inspired by operating system memory management. Agents have:
- Core Memory — always in the LLM context window. Contains the agent's persona and key info about the current user. Editable by the agent in real-time.
- Archival Memory — a vector database for long-term storage. Agents can write observations and retrieve them later via search. Scales to unlimited size.
- Recall Memory — full conversation history stored in a database. Agents can search past messages rather than relying on a fixed context window.
The agent autonomously decides when to read/write across these tiers using tool calls. This creates a self-managing memory system where the LLM acts as its own memory controller — the key insight from the original MemGPT paper.
Recent Developments
Letta has expanded beyond pure memory into a full agent platform, and as of June 2026 Letta Code is clearly the company's center of gravity:
- Letta Code (Dec 2025) — a memory-first coding agent that runs in your terminal, scoring 42.5% on Terminal-Bench and ranked #1 model-agnostic open-source agent
- Letta Code app (Apr 2026) — a desktop app for "deeply personalized agents that learn over time and work locally on your machine"
- Channels (beta) — connect agents to Telegram, Slack, Discord, WhatsApp, and Signal directly from Letta Code; the standalone LettaBot project was archived in May 2026 in favor of Channels plus Remote Environments and
letta cron - Context Repositories — git-based memory for coding agents (Feb 2026)
- Context Constitution (Apr 2026) — published principles for how agents should manage context to learn from experience
- Conversations API — shared agent memory across concurrent user sessions
- ADE (Agent Development Environment) — web UI for building and managing agents
Pricing
Pricing was restructured around Letta Code in 2026 — the former $200/month Max tier no longer appears:
| Plan | Price | Key Features |
|---|---|---|
| Free | $0 | Up to 3 managed agents, BYOK, no login required for local use, limited Letta Auto usage |
| Pro | $20/mo | Up to 20 stateful agents, Letta Auto weekly + monthly quota, pay-as-you-go overage |
| Team/Enterprise | Custom | Multi-seat arrangements via support@letta.com |
BYOK (Bring Your Own Keys) works on all tiers with OpenAI, Anthropic, and OpenRouter keys — and Letta Code can also draw on existing external coding plans. Letta's own docs note that casual coders typically run ~$100/month+ in usage and power users often exceed $200/month.
Strengths
- Deep research pedigree — born from a well-cited academic paper, not hype-driven
- Model-agnostic — works with OpenAI, Anthropic, Mistral, and local models via Ollama
- Open source core — Apache 2.0 license, self-hostable, 23K+ GitHub stars as of June 2026
- Full Python and TypeScript SDKs with a proper REST API
- Memory transparency — developers can inspect and control agent memory state, unlike black-box alternatives
- Active development — regular releases, expanding into coding agents and context repositories
Cautions
- Complexity overhead — the tiered memory system adds latency and token cost from internal tool calls
- Stability concerns — some users report instability, especially with local models
- Scope creep risk — expanding from memory layer into full agent framework and coding agent puts them in competition with many more players
- Community perception — some view the pivot from pure memory research to agent platform as chasing hype, though the founders dispute this
- Local model support — while technically model-agnostic, reliability with smaller local models can be poor
What Developers Say
The Letta Code launch thread on Hacker News (December 2025, 83 points) captured both the appeal and the skepticism around memory-first agents:
"Void is the greatest ad for Letta. I'm interested to see if it's as good at coding as it is at posting." — ascorbic, Hacker News
"In my experience, 'memory' is really not that helpful in most cases. For all of my projects, I keep the documentation files and feature specs up to date... Maintaining the memory is a considerable burden." — tigranbs, Hacker News
"Context poisoning is a real problem that these memory providers only make worse." — danieltanfh95, Hacker News
Co-founder Charles Packer told Tessl the resemblance to Claude Code is deliberate: "we're not trying to innovate at the UI layer, we're innovating at the 'memory' (context engineering) layer."
Competitive Positioning
| Feature | Letta | Mem0 | LangGraph | CrewAI |
|---|---|---|---|---|
| Primary Focus | Stateful agents with memory | Memory layer | Agent orchestration | Multi-agent teams |
| Memory System | Built-in tiered (core/archival/recall) | Standalone memory API | Manual state management | Basic memory |
| Self-Hosted | Yes (Apache 2.0) | Yes (Apache 2.0) | Yes | Yes |
| Cloud Offering | Yes | Yes | LangSmith | CrewAI Enterprise |
| Coding Agent | Yes (Letta Code) | No | No | No |
| Model Agnostic | Yes | Yes | Yes | Yes |
| Best For | Long-running agents needing persistent memory | Adding memory to existing agents | Complex agent workflows | Team-based agent orchestration |
Bottom Line
Letta occupies a unique niche: it's the most serious open-source attempt at solving LLM memory as a first-class concern. The MemGPT paper was genuinely influential, and the tiered memory architecture remains the most principled approach to agent state management available.
The 2026 trajectory shows consolidation rather than sprawl: the standalone LettaBot was archived in May 2026 and folded into Letta Code as Channels, the desktop app shipped in April, and pricing was rebuilt around Letta Code. The company is converging on one product — a memory-first coding agent — rather than maintaining parallel bets.
Recommended for: Developers building agents that need to remember and learn over long time horizons — Letta remains the most mature open-source option, and the free tier with BYOK makes it low-risk to evaluate. Also worth a look if you want a Claude Code-style coding agent that accumulates project knowledge across sessions.
Not recommended for: Teams that just need a lightweight memory API for an existing agent (consider Mem0), or workflows where up-to-date docs and specs already serve as the source of truth — several developers report that self-managed memory adds burden and context-poisoning risk without clear payoff.
Outlook: The bet is that memory becomes the durable differentiator in coding agents before frontier providers ship equivalent native features. The Terminal-Bench result and steady 2026 shipping cadence are credible signals, but the memory layer — Letta's core differentiator — still risks being absorbed by the model providers themselves.
Sources
- [1] Letta Official Website
- [2] Letta GitHub Repository
- [3] TechCrunch: Letta comes out of stealth
- [4] Letta raises $10M seed led by Felicis
- [5] Reddit: Letta vs Mem0 discussion
- [6] Reddit: Benchmarking memory solutions
- [7] Letta Blog: Building the #1 open source terminal-use agent
- [8] LettaBot GitHub (archived May 2026, replaced by Letta Code Channels)
- [9] Hacker News: Letta Code discussion
- [10] Tessl: Letta Code bets on memory as the missing layer in coding agents
- [11] Letta Code Pricing Documentation