What does Meaning Memory do?

Meaning Memory is the cognitive memory engine for enterprise multi-agent AI fleets. It scores every memory entry across five orthogonal STARE dimensions (Significance, Temporal, Asymmetry, Relational, Episodic), runs retrieval through a deterministic five-phase compile pipeline, and links every entry to source provenance. Operators get explicit control over ranking and decay, and what one agent learns becomes inherited context for the next agent across the fleet.

How does deterministic retrieval work in Meaning Memory?

Retrieval runs through a fixed five-phase pipeline (Drain → Dedup → Extract → Synthesize → Compile) with hash-chained audit on every state change. The same query against the same corpus produces byte-identical results on replay. This makes retrieval repeatable, reviewable, and reproducible, properties that flat vector retrieval cannot guarantee.
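As a rough illustration of what hash-chained audit over a fixed phase order can look like, here is a minimal Python sketch. It is not the engine's implementation: the function names, the genesis hash, the record layout, and the stand-in phase bodies are all assumptions made for this example. It only shows why canonical serialization plus a chained SHA-256 gives identical audit trails on replay.

```python
import hashlib
import json

# Phase names come from the pipeline described above; everything else is assumed.
PHASES = ("drain", "dedup", "extract", "synthesize", "compile")
GENESIS = "0" * 64  # hypothetical placeholder hash for the first link


def chain_hash(prev_hash: str, record: dict) -> str:
    """Hash a record together with the previous link's hash."""
    # Canonical JSON (sorted keys, fixed separators) keeps the bytes stable.
    payload = json.dumps(record, sort_keys=True, separators=(",", ":"))
    return hashlib.sha256((prev_hash + payload).encode("utf-8")).hexdigest()


def run_with_audit(query: str, corpus: list[str]) -> list[dict]:
    """Emit one audit link per phase; the phase bodies are stand-ins."""
    audit = []
    prev = GENESIS
    state = sorted(corpus)  # fixed ordering so a replay sees the same input
    for phase in PHASES:
        if phase == "dedup":
            state = sorted(set(state))  # illustrative dedup only
        record = {"phase": phase, "query": query, "state": state}
        prev = chain_hash(prev, record)
        audit.append({"phase": phase, "hash": prev})
    return audit


if __name__ == "__main__":
    first = run_with_audit("churn risk", ["note b", "note a", "note a"])
    replay = run_with_audit("churn risk", ["note a", "note b", "note a"])
    print(first == replay)  # same query and corpus give the same chain
```

Because each link folds in the previous hash, changing any earlier state changes every later hash, which is what makes a replay comparable link by link.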

What are the five STARE dimensions?

STARE is the five-dimensional scoring framework at the core of Meaning Memory. The dimensions are Significance (what mattered), Temporal (when it happened and how it decays), Asymmetry (attribution and trust gradient), Relational (graph linkage), and Episodic (narrative arc clustering). They compose at compile time (additive richness boost) and at retrieval time (attribution-aware weighting), giving operators explicit control over how memories are ranked.

Does Meaning Memory require Postgres?

No. Both backends are first-class. PostgresBackend (with pgvector and GIN indexing) is recommended for production multi-tenant fleets and for regulated industries that need a SQL-queryable audit trail. FileBackend is .md-native, agent-isolated, requires no database server, and is the right fit for air-gapped or single-process deployments. Customers choose based on deployment environment.

Can I deploy Meaning Memory in air-gapped environments?

Yes. Meaning Memory is a licensed self-hosted engine that customers deploy in their own infrastructure (Docker Compose, Kubernetes via Helm, or bare metal). No data leaves the customer perimeter. The FileBackend variant requires no database server, runs as a single process, and is purpose-built for air-gapped deployments. The wheel artifact is license-gated, and offline activation supports 365-day signed entitlement caches.

Meaning Memory | The Memory Layer for Enterprise AI Agents

The STARE Framework

Five Dimensions of Meaning

Meaning Memory scores every entry across five orthogonal cognitive primitives. Dimensions compose at compile time (additive richness boost) and at retrieval time (attribution-aware weighting), so significance, recency, attribution, relational fit, and episode membership all gate the rank, not just one weighted vector score.
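To make the two composition points concrete, the sketch below separates a compile-time additive boost from a retrieval-time attribution weight. The formula, the threshold, the per-dimension boost value, and all names are assumptions invented for illustration; the page does not publish the actual scoring function.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class StareScores:
    significance: float
    temporal: float
    asymmetry: float
    relational: float
    episodic: float


def richness_boost(scores: StareScores,
                   per_dimension_boost: float = 0.05,
                   threshold: float = 0.5) -> float:
    """Compile-time step (assumed): each strong dimension adds a fixed boost."""
    values = (scores.significance, scores.temporal, scores.asymmetry,
              scores.relational, scores.episodic)
    return per_dimension_boost * sum(1 for v in values if v >= threshold)


def rank_score(similarity: float, scores: StareScores,
               attribution_weight: float = 1.0) -> float:
    """Retrieval-time step (assumed): weight the boosted score by attribution."""
    return (similarity + richness_boost(scores)) * attribution_weight


if __name__ == "__main__":
    rich = StareScores(0.9, 0.8, 0.7, 0.6, 0.9)
    thin = StareScores(0.9, 0.1, 0.1, 0.1, 0.1)
    # Same vector similarity, different rank once the dimensions compose.
    print(rank_score(0.5, rich) > rank_score(0.5, thin))
```

The point of the sketch is only the shape: an entry that is strong on several dimensions outranks one that matches on similarity alone.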

Significance

First-class composite importance, measured twice

Significance is the foundational dimension, the one every other memory system gets approximately right and then stops. Meaning Memory treats it as a first-class composite, not a scalar.

Every memory carries two significance scores: sig_self (how important the agent thinks it is) and sig_external (how important the operator's policy thinks it is). When they diverge, the system records the divergence and the rationale. This is the encoding-compliance moat: a dual-write architecture where an extractor LLM catches the moments the agent forgot to call mm_remember, with STARE Sig arbitrating which write survives merge.

Why it matters

Probabilistic LLM agents silently fail to record what mattered. No retrieval, decay, or calibration system can recover what was never written. Significance, measured twice and arbitrated deterministically, is what closes the encoding hole.

Scenario

"A customer-service agent is told 'I'm thinking about cancelling, this is my third call about the same issue.' The agent answers helpfully but never calls mm_remember(). In every other memory system, that statement is lost. In Meaning Memory, the passive extractor catches it, scores it (sig 0.92, high), and the operator's churn-risk playbook escalates the next interaction. The agent's miss didn't cost the company the customer."

How Meaning Memory implements it

sig_self and sig_external numeric columns. divergence_rationale text column for audit. Operator-controllable sig floor per tenant and per playbook. Grounded in the Self-Memory System theory of Conway and Pleydell-Pearce (2000).
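The dual-score idea can be sketched in a few lines of Python. The field names sig_self, sig_external, and divergence_rationale come from the description above; the divergence tolerance, the helper names, and the choice to compare the floor against sig_external are assumptions for this example only.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class SignificanceRecord:
    sig_self: float                      # how important the agent thinks it is
    sig_external: float                  # how important operator policy thinks it is
    divergence_rationale: Optional[str] = None


def record_significance(sig_self: float, sig_external: float,
                        rationale: str,
                        tolerance: float = 0.2) -> SignificanceRecord:
    """Keep the rationale only when the two scores diverge (assumed tolerance)."""
    diverged = abs(sig_self - sig_external) > tolerance
    return SignificanceRecord(sig_self, sig_external,
                              rationale if diverged else None)


def passes_floor(record: SignificanceRecord, sig_floor: float) -> bool:
    """Assumed policy: the operator floor is checked against sig_external."""
    return record.sig_external >= sig_floor


if __name__ == "__main__":
    rec = record_significance(0.30, 0.92, "matches churn-risk playbook")
    print(rec.divergence_rationale, passes_floor(rec, sig_floor=0.8))
```

In this sketch an entry the agent scored low can still clear the operator's floor, and the reason for the gap stays attached to the record for audit.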

Temporal

Configurable decay curves and validity windows

Temporal is more than a timestamp. Every memory in Meaning Memory carries three time signals: when it was observed, when it becomes valid, and when it stops mattering.

Decay curves are operator-configurable per playbook. Some memories decay fast (today's traffic numbers). Some stay forever (a customer's allergy). Some don't decay but expire on a deadline (a paused deploy that resumes after security review). Meaning Memory tracks both decay and validity windows independently.

Why it matters

KV caches solve freshness with TTL. Vector stores solve recency with score boosts. Neither captures the truth that some memories fade, some expire, and some stay forever, and which is which is a governance question, not a heuristic.

Scenario

"A developer agent learns at 9am: 'The deploy is paused, security review in progress.' At 3pm, the security review closes. In a TTL cache, the memory expired hours ago. In Meaning Memory, valid_until=15:00 marked it temporally invalid the moment the review closed, and the audit trail shows exactly when, why, and which event triggered the transition."

How Meaning Memory implements it

valid_from / valid_until columns. Three decay shapes ship today: none, step, and exponential, selectable per playbook. Phase 4 compile honors both decay and validity.
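A minimal sketch of how decay and validity can be tracked independently follows. The three shape names (none, step, exponential) and the valid_from / valid_until fields come from the text above; the function names, the single scale parameter, and the half-open validity interval are assumptions made for illustration.

```python
from datetime import datetime, timedelta
from typing import Optional


def decay_factor(shape: str, age: timedelta, scale: timedelta) -> float:
    """Return a weight in [0, 1] for a memory of the given age."""
    if shape == "none":
        return 1.0
    if shape == "step":
        # Full weight until the cutoff, then nothing (scale acts as the cutoff).
        return 1.0 if age < scale else 0.0
    if shape == "exponential":
        # scale acts as the half-life: weight halves every `scale` of age.
        return 0.5 ** (age / scale)
    raise ValueError(f"unknown decay shape: {shape}")


def is_valid(now: datetime, valid_from: Optional[datetime],
             valid_until: Optional[datetime]) -> bool:
    """Validity window check, independent of decay (assumed half-open interval)."""
    if valid_from is not None and now < valid_from:
        return False
    if valid_until is not None and now >= valid_until:
        return False
    return True


def temporal_weight(now: datetime, observed_at: datetime, shape: str,
                    scale: timedelta,
                    valid_from: Optional[datetime] = None,
                    valid_until: Optional[datetime] = None) -> float:
    """An invalid memory weighs nothing; a valid one is scaled by its decay."""
    if not is_valid(now, valid_from, valid_until):
        return 0.0
    return decay_factor(shape, now - observed_at, scale)


if __name__ == "__main__":
    observed = datetime(2025, 1, 6, 9, 0)
    closes = datetime(2025, 1, 6, 15, 0)
    for hour, minute in ((14, 59), (15, 0)):
        now = datetime(2025, 1, 6, hour, minute)
        print(now.time(), temporal_weight(now, observed, "none",
                                          timedelta(hours=1),
                                          valid_until=closes))
```

With the shape set to none, the paused-deploy memory keeps full weight all day and drops to zero only when its validity window closes, which is the behavior the scenario describes.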

Asymmetry

Explicit attribution and trust gradients

Asymmetry, also called Attribution, is the dimension most memory systems treat as post-hoc filtering. In Meaning Memory, every memory carries explicit writer attribution and an attribution-register score native to the data model, with per-perceiver retrieval-policy modifiers derived from it. Corroboration-based warrant scoring is on the roadmap.

In multi-agent deployments this matters enormously. The same statement ("our churn rate is 4.2%") from the CFO and from a customer comment should not carry the same retrieval weight. Asymmetry encodes the attribution-register gradient at the schema level so register-aware retrieval is possible without bolt-on filters.

Why it matters

Enterprise multi-agent fleets are political ecosystems. Marketing's agent sees one version of "true." Engineering's sees another. Legal's sees a third. Without attribution as a first-class primitive, cross-agent memory becomes either dangerously homogenized or frustratingly siloed. Asymmetry lets you have shared facts with register-aware retrieval.

Scenario

"Three agents in the same scope group store the memory 'the bug is in payments.' One is the QA bot. One is a customer-service summary. One is the engineering on-call escalation. Attribution-register scoring lets retrieval weight the more-strongly-attributed memory higher for evidence-driven roles. Same fact, different attribution register; register-aware ranking (per-perceiver policy in validation)."

How Meaning Memory implements it

Attribution-register score and per-perceiver modifier JSONB on every entry. Writer attribution captured at write time. Register-aware retrieval ranking (per-perceiver policy, fail-closed), with activation currently in validation. Corroboration-based warrant scoring and hostile-input detection are on the roadmap.
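One way to picture register-aware ranking with a fail-closed per-perceiver policy is the sketch below. The class, the field names, the multiplication of score by modifier, and the reading of "fail-closed" as "no modifier for this perceiver means the entry is not returned" are all assumptions for this example, not the shipped behavior.

```python
from dataclasses import dataclass, field


@dataclass
class Entry:
    text: str
    writer: str                 # writer attribution captured at write time
    attribution_score: float    # attribution-register score
    perceiver_modifiers: dict = field(default_factory=dict)  # stands in for JSONB


def rank_for(perceiver: str, entries: list) -> list:
    """Rank entries for one perceiver, excluding entries with no policy for it."""
    scored = []
    for entry in entries:
        modifier = entry.perceiver_modifiers.get(perceiver)
        if modifier is None:
            continue  # fail closed (assumed meaning): no policy, no retrieval
        scored.append((entry.attribution_score * modifier, entry))
    # Highest score first; writer name breaks ties so the order is deterministic.
    scored.sort(key=lambda pair: (-pair[0], pair[1].writer))
    return [entry for _, entry in scored]


if __name__ == "__main__":
    fact = "the bug is in payments"
    entries = [
        Entry(fact, "qa-bot", 0.6, {"engineer": 1.0}),
        Entry(fact, "cs-summary", 0.4, {"engineer": 0.5}),
        Entry(fact, "oncall", 0.9, {"engineer": 1.0, "support": 1.0}),
    ]
    print([e.writer for e in rank_for("engineer", entries)])
```

The same fact appears three times, but each perceiver sees it ordered by how strongly it is attributed under that perceiver's policy, and a perceiver with no policy sees nothing.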

Relational

Typed edges, 1-hop filters, and provenance

Relational turns memories into a typed graph. Every memory can connect to other memories, agents, documents, customers, projects, people, concepts, events, or commitments. Nine bounded target types ship today, plus a custom:* namespace for domain-specific extensions.

Production R-dim today means clean schema primitives: vocabulary normalization, typed edges with provenance, predicate-gated retrieval, and 1-hop filters via mm_search(related_to=, min_r=). Vector similarity surfaces "memories that look like this." Relational edges surface "memories that matter to this."

Why it matters

Enterprise agent fleets need explicit linkage between memories, customers, and commitments, with audit-grade provenance on every edge. That is the shipped Relational contract. Multi-hop graph traversal remains a research track: LoCoMo-scale benchmarks have not yet shown a retrieval lift with the current architecture, and further benchmark validation is pending.

Scenario

"A customer escalation comes in. The agent searches with related_to= on the ticket memory and min_r= on the engineering bug link. One hop returns the bug report, the deploy memo, and the QA verification, each edge typed, each with asserted_by provenance. No hand-wired context stitching."

How Meaning Memory implements it

r_score numeric column on mm_entries. mm_relationships typed-edge table (PG) or edges.jsonl (FileBackend) with bounded target type enum. append_relationship(), get_relationships(), and 1-hop retrieval filters on both backends.
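The typed-edge model and 1-hop filter can be sketched with plain Python data structures. The nine target types, the custom:* namespace, asserted_by provenance, and the related_to / min_r filter names come from the text above; the enum spellings, the Edge fields, the edge direction, and the function names are assumptions for this illustration and do not reproduce the real schema.

```python
from dataclasses import dataclass

# Assumed spellings for the nine bounded target types listed above.
TARGET_TYPES = frozenset({
    "memory", "agent", "document", "customer", "project",
    "person", "concept", "event", "commitment",
})


@dataclass(frozen=True)
class Edge:
    source_id: str
    target_id: str
    target_type: str
    predicate: str
    asserted_by: str   # provenance: who asserted this edge
    r_score: float


def is_allowed_type(target_type: str) -> bool:
    """Accept a bounded type or anything in the custom:* namespace."""
    return target_type in TARGET_TYPES or target_type.startswith("custom:")


def one_hop(edges: list, related_to: str, min_r: float = 0.0) -> list:
    """Return edges leaving `related_to` whose r_score meets the threshold."""
    return [
        edge for edge in edges
        if edge.source_id == related_to
        and edge.r_score >= min_r
        and is_allowed_type(edge.target_type)
    ]


if __name__ == "__main__":
    edges = [
        Edge("ticket-1", "bug-9", "memory", "caused_by", "eng-agent", 0.9),
        Edge("ticket-1", "memo-2", "document", "mentions", "cs-agent", 0.3),
        Edge("ticket-2", "bug-9", "memory", "caused_by", "eng-agent", 0.95),
    ]
    for edge in one_hop(edges, related_to="ticket-1", min_r=0.5):
        print(edge.target_id, edge.predicate, edge.asserted_by)
```

Each returned edge carries its type and who asserted it, so the caller gets linked context together with provenance in a single hop.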

Research note: depth-N multi-hop traversal (mm_traverse_relationships) exists in-engine but is not promoted as a customer-facing feature. It is deferred pending benchmark validation that proves its value at fleet scale.

Episodic

Episode summaries and optional narrative clustering

Episodic groups memories into bounded arcs. A customer conversation isn't a list of facts; it's an episode with a beginning and a resolution. Most memory systems flatten that structure. Meaning Memory preserves it.

Every memory can carry an episode_id. Episode summaries (mm_episode_summary, mm_search(episode_id=)) ship stable on both backends. Multi-step narrative clustering is beta and opt-in (MM_E_CLUSTER_MODE_B_ENABLED); the deterministic narrative renderer runs only when Mode B is enabled.

Why it matters

The unit of recall in human memory is not the fact. It's the episode. Episode summaries give operators and agents a bounded working set without loading every underlying memory. Narrative clustering adds chronology when you opt in.

Scenario

"A multi-day support escalation involves 14 separate agent interactions. The on-call SRE calls mm_episode_summary on episode 7a3f and gets title, time bounds, entry count, and summary metadata, stable today. With Mode B enabled, Phase 4 compile can emit a chronologically ordered narrative outline for the same episode."

How Meaning Memory implements it

e_score numeric + episode_id UUID on every entry. mm_episodes summary table (PG) or workspace sidecar (FileBackend). Episode summaries stable; narrative clustering beta, default OFF.
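An episode summary of the kind described (title, time bounds, entry count) can be derived from entries that share an episode_id. The sketch below is a hypothetical stand-in, not the mm_episode_summary implementation: the Entry fields, the function name, and the returned dictionary keys are assumptions.

```python
from dataclasses import dataclass
from datetime import datetime
from typing import Optional


@dataclass(frozen=True)
class Entry:
    episode_id: str
    observed_at: datetime
    text: str


def summarize_episode(entries: list, episode_id: str,
                      title: str) -> Optional[dict]:
    """Build a bounded summary without returning the underlying memories."""
    members = sorted(
        (e for e in entries if e.episode_id == episode_id),
        key=lambda e: e.observed_at,
    )
    if not members:
        return None  # unknown episode
    return {
        "episode_id": episode_id,
        "title": title,
        "started_at": members[0].observed_at,
        "ended_at": members[-1].observed_at,
        "entry_count": len(members),
    }


if __name__ == "__main__":
    entries = [
        Entry("7a3f", datetime(2025, 1, 7, 10, 0), "second call"),
        Entry("7a3f", datetime(2025, 1, 6, 9, 0), "first call"),
        Entry("other", datetime(2025, 1, 6, 12, 0), "unrelated"),
    ]
    print(summarize_episode(entries, "7a3f", "Payments escalation"))
```

The summary gives an operator the working set's shape (when it started, when it ended, how many entries) without loading every memory in the episode.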

Structured Cognition for Agent Fleets

STARE dimensions compose across compile and retrieval.

The 5-Phase Deterministic Pipeline

Drain

Dedup

Extract

Synthesize

Compile

Three layers. Your stack on top and bottom.

See what your fleet remembers.

Framework-agnostic. Native where it counts.

MCP Server

Letta

CLI

REST API

Where Meaning Memory earns its keep.

One shopper, many touchpoints, one timeline.

Escalation without context amnesia.

Reconstruct what each agent knew, minute by minute.

Two editions. One engine.

MM Engine

MM Studio
