| Area | pumaDB | Zep |
| --- | --- | --- |
| Primary job | A small hosted JSON database for agent memory and lightweight server-side app state. | Enterprise agent memory infrastructure for building, governing, and retrieving context from temporal graphs. |
| Data model | Named JSON tables. Current account limits are 20 tables, 1,000 rows per table, 25 MB per account, and 64 KB per row. | Temporal context graphs made from entities, facts, relationships, episodes, validity windows, and source provenance. |
| Write behavior | Agents and apps explicitly add, upsert, patch, batch, or delete JSON rows. | Applications ingest messages, text, JSON, business data, documents, and events so Zep can construct and update graph context. |
| Query behavior | CRUD, simple equality filters, sorting, counting, batch writes, and short-lived viewer links for larger results. | Retrieves assembled context blocks and graph results based on relevant facts, entities, relationships, and temporal state. |
| AI layer | Stores memory as explicit JSON records. It does not extract facts, build graphs, run Graph RAG, or assemble LLM context. | Positions itself around fact extraction, temporal graph updates, context engineering, Graph RAG, and token-efficient context assembly. |
| Agent interface | Hosted Streamable HTTP MCP, local stdio MCP, REST API, and CLI are built into the product surface. | SDKs, APIs, examples, and graph or memory APIs for adding data and retrieving context for LLM calls. |
| Governance | Uses account-scoped auth, OAuth for hosted MCP, API keys, explicit delete filters, and archived row restore. | Emphasizes enterprise controls such as provenance, access control, retention, audit, trust, and governed context retrieval. |
| Operational shape | Hosted on Cloudflare with deliberately small limits and no graph, vector, or model configuration. | A managed enterprise memory platform with context graph infrastructure, Graphiti, and deployment or compliance options. |
| Recovery model | Every update and delete archives the previous row. The last 10 versions are kept for 30 days and can be restored. | Tracks changing facts over time with validity and provenance, rather than exposing row-level JSON version restore as the central primitive. |
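
The pumaDB row limits and recovery model described above can be made concrete with a small local model. The sketch below is not pumaDB's API: `Table`, `upsert`, `delete`, `restore`, and `row_size` are hypothetical names used only for illustration, and the way row size is measured (bytes of compact UTF-8 JSON, with 1 KB taken as 1,024 bytes) is an assumption. It mirrors only the figures stated in the comparison: 1,000 rows per table, 64 KB per row, and the last 10 archived versions kept for 30 days.

```python
import json
import time

MAX_ROWS_PER_TABLE = 1_000            # stated limit: 1,000 rows per table
MAX_ROW_BYTES = 64 * 1024             # stated limit: 64 KB per row (assuming 1 KB = 1024 bytes)
MAX_VERSIONS = 10                     # stated: last 10 versions are kept
RETENTION_SECONDS = 30 * 24 * 3600    # stated: versions are kept for 30 days


def row_size(row: dict) -> int:
    """Size of a row as compact UTF-8 JSON (an assumed way of measuring)."""
    return len(json.dumps(row, separators=(",", ":")).encode("utf-8"))


class Table:
    """In-memory stand-in for one named JSON table (hypothetical, not the real service)."""

    def __init__(self):
        self.rows = {}      # key -> current row
        self.archive = {}   # key -> list of (archived_at, previous_row), oldest first

    def _archive(self, key, now):
        previous = self.rows[key]  # raises KeyError if the row does not exist
        history = self.archive.get(key, []) + [(now, previous)]
        # Drop versions older than 30 days, then keep only the newest 10.
        fresh = [(t, r) for t, r in history if now - t <= RETENTION_SECONDS]
        self.archive[key] = fresh[-MAX_VERSIONS:]

    def upsert(self, key, row, now=None):
        now = time.time() if now is None else now
        if row_size(row) > MAX_ROW_BYTES:
            raise ValueError("row exceeds the 64 KB limit")
        if key not in self.rows and len(self.rows) >= MAX_ROWS_PER_TABLE:
            raise ValueError("table already holds 1,000 rows")
        if key in self.rows:
            self._archive(key, now)  # every update archives the previous row
        self.rows[key] = row

    def delete(self, key, now=None):
        now = time.time() if now is None else now
        self._archive(key, now)      # every delete archives the previous row
        del self.rows[key]

    def restore(self, key, now=None):
        """Bring back the most recently archived version that is still retained.

        Assumption: restoring overwrites the current row without archiving it.
        """
        now = time.time() if now is None else now
        history = [(t, r) for t, r in self.archive.get(key, [])
                   if now - t <= RETENTION_SECONDS]
        if not history:
            raise LookupError("no restorable version for this key")
        _, row = history.pop()
        self.archive[key] = history
        self.rows[key] = row
        return row
```

Under these assumptions, updating a row twice and then deleting it leaves two archived versions, and calling `restore` twice walks back through them from newest to oldest. The explicit `now` parameter exists only so the 30-day window can be exercised without waiting.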