Key takeaways
- Born from the influential MemGPT research paper at UC Berkeley, now a YC-backed startup with $10M seed funding
- Tiered memory architecture (core, archival, recall) gives agents persistent state across conversations
- Model-agnostic design supports OpenAI, Anthropic, Mistral, and local models — no vendor lock-in
- Letta Code is a memory-first coding agent ranked #1 model-agnostic agent on Terminal-Bench
FAQ
What is Letta?
Letta is a platform for building stateful AI agents with advanced memory. Originally the MemGPT research project from UC Berkeley, it provides tiered memory systems that let agents remember, learn, and self-improve over time.
How much does Letta cost?
Letta offers a free tier with 3 agents and BYOK, a Pro plan at $20/month with unlimited agents and $20 in API credits, a Max plan at $200/month for power users, and custom Enterprise pricing.
How is Letta different from Mem0?
Letta provides a full agent framework with built-in memory management, tool calling, and an agent development environment. Mem0 focuses on being a standalone memory layer you add to existing agents. Letta is heavier but more integrated.
Can I self-host Letta?
Yes, the core Letta framework is open source (Apache 2.0) and can be self-hosted. The cloud platform adds hosted infrastructure, the ADE (Agent Development Environment), and managed scaling.
Executive Summary
Letta is a platform for building stateful AI agents with advanced memory systems. Originally the MemGPT research project from the Berkeley AI Research lab (BAIR) at UC Berkeley, it introduced the concept of virtual context management — using an OS-inspired memory hierarchy to give LLMs effectively unlimited context. The company emerged from stealth in September 2024 with $10M in seed funding led by Felicis, with participation from Founders Fund and YC.
| Attribute | Value |
|---|---|
| Company | Letta (formerly MemGPT) |
| Founded | 2023 (research), 2024 (company) |
| Funding | $10M seed (Felicis, Founders Fund, YC) |
| Founders | Charles Packer, Sarah Wooders, Vivian Myers |
| Origin | UC Berkeley BAIR Lab |
| GitHub Stars | 21.2K+ |
| License | Apache 2.0 |
How It Works
Letta's core innovation is a tiered memory architecture inspired by operating system memory management. Agents have:
- Core Memory — always in the LLM context window. Contains the agent's persona and key information about the current user. Editable by the agent in real time.
- Archival Memory — a vector database for long-term storage. Agents can write observations and retrieve them later via search. Scales to unlimited size.
- Recall Memory — full conversation history stored in a database. Agents can search past messages rather than relying on a fixed context window.
The agent autonomously decides when to read/write across these tiers using tool calls. This creates a self-managing memory system where the LLM acts as its own memory controller — the key insight from the original MemGPT paper.
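The interplay between the three tiers can be illustrated with a toy sketch. This is purely illustrative, not Letta's actual implementation: the class names, fields, and naive substring search are stand-ins (Letta uses vector search for archival memory, and the agent invokes these operations through LLM tool calls rather than direct method calls).

```python
# Toy model of a MemGPT-style tiered memory system (illustrative only).
# Each method stands in for a "tool" the LLM would call to manage its
# own memory: editing in-context state, persisting observations, and
# searching past conversation turns.

class TieredMemory:
    def __init__(self, persona: str):
        self.core = {"persona": persona, "human": ""}  # always in context
        self.archival: list[str] = []                  # long-term store
        self.recall: list[str] = []                    # full message log

    def core_memory_replace(self, field: str, value: str) -> None:
        self.core[field] = value                       # edit in-context state

    def archival_insert(self, fact: str) -> None:
        self.archival.append(fact)                     # persist an observation

    def archival_search(self, query: str) -> list[str]:
        # Stand-in for vector search: naive case-insensitive substring match.
        return [f for f in self.archival if query.lower() in f.lower()]

    def log_message(self, msg: str) -> None:
        self.recall.append(msg)                        # conversation history

    def recall_search(self, query: str) -> list[str]:
        return [m for m in self.recall if query.lower() in m.lower()]


mem = TieredMemory(persona="Helpful assistant")
mem.log_message("user: My dog is named Biscuit.")
mem.core_memory_replace("human", "Has a dog named Biscuit")
mem.archival_insert("User's dog is named Biscuit.")

print(mem.archival_search("biscuit"))  # ["User's dog is named Biscuit."]
print(mem.recall_search("dog"))        # ["user: My dog is named Biscuit."]
```

The point of the design is visible even in the toy: nothing outside the agent decides what to remember. The LLM itself chooses when to promote a fact from the conversation into core or archival memory, which is what the MemGPT paper means by the model acting as its own memory controller.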
Recent Developments
Letta has expanded beyond pure memory into a full agent platform:
- Letta Code — a memory-first coding agent that runs in your terminal, ranked #1 model-agnostic agent on Terminal-Bench
- Context Repositories — git-based memory for coding agents (Feb 2026)
- Conversations API — shared agent memory across concurrent user sessions
- ADE (Agent Development Environment) — web UI for building and managing agents
Pricing
| Plan | Price | Key Features |
|---|---|---|
| Free | $0 | 3 agents, BYOK, ADE access, Letta Code |
| Pro | $20/mo | Unlimited agents, frontier model access, $20 API credits |
| Max | $200/mo | Higher limits, optimized for Letta Code, early access |
| Enterprise | Custom | RBAC, SSO (SAML/OIDC), dedicated support, volume pricing |
BYOK (Bring Your Own Key) is available on all tiers. Client-side tool execution (e.g., bash tools in Letta Code) incurs no server credit cost.
Strengths
- Deep research pedigree — born from a well-cited academic paper, not hype-driven
- Model-agnostic — works with OpenAI, Anthropic, Mistral, and local models via Ollama
- Open source core — Apache 2.0 license, self-hostable, 21K+ GitHub stars
- Full Python and TypeScript SDKs with a proper REST API
- Memory transparency — developers can inspect and control agent memory state, unlike black-box alternatives
- Active development — regular releases, expanding into coding agents and context repositories
Cautions
- Complexity overhead — the tiered memory system adds latency and token cost from internal tool calls
- Stability concerns — some users report instability, especially with local models
- Scope creep risk — expanding from memory layer into full agent framework and coding agent puts them in competition with many more players
- Community perception — some view the pivot from pure memory research to agent platform as chasing hype, though the founders dispute this
- Local model support — while technically model-agnostic, reliability with smaller local models can be poor
Competitive Positioning
| Feature | Letta | Mem0 | LangGraph | CrewAI |
|---|---|---|---|---|
| Primary Focus | Stateful agents with memory | Memory layer | Agent orchestration | Multi-agent teams |
| Memory System | Built-in tiered (core/archival/recall) | Standalone memory API | Manual state management | Basic memory |
| Self-Hosted | Yes (Apache 2.0) | Yes (Apache 2.0) | Yes | Yes |
| Cloud Offering | Yes | Yes | LangSmith | CrewAI Enterprise |
| Coding Agent | Yes (Letta Code) | No | No | No |
| Model Agnostic | Yes | Yes | Yes | Yes |
| Best For | Long-running agents needing persistent memory | Adding memory to existing agents | Complex agent workflows | Team-based agent orchestration |
Bottom Line
Letta occupies a unique niche: it is the most serious open-source attempt at treating LLM memory as a first-class concern. The MemGPT paper was genuinely influential, and the tiered memory architecture remains the most principled approach to agent state management available.
The risk is strategic focus. Letta started as a memory system and is now also a coding agent, an API platform, and a cloud service. Each of these puts them in competition with well-funded incumbents. The memory layer — their core differentiator — risks being overshadowed by frontier model providers building native memory features.
For developers building agents that need to remember and learn over long time horizons, Letta is the most mature open-source option. The free tier and BYOK model make it low-risk to evaluate. Just be prepared for the additional complexity that comes with a self-managing memory system.