Home Features Why KAPEX Use Cases Pricing Docs Blog

The thesis·Patent pending

Every other memory system has the curve backwards.

Mem0, Zep, MemoryBank, and the published literature all assume more engagement equals stronger memory. KAPEX inverts the curve. More processing → faster decay. Unresolved content persists. Resolved content recedes. That's the patent.

The problem with current memory systems

Every existing system treats recall frequency as a positive retention signal.

User mentions their father 50 times → father memories dominate every context window. Indefinitely. This models obsession, not understanding. Four failure modes follow.

Failure 01

Topic domination

High-frequency topics crowd out everything else. The grocery-list mention from yesterday and the family-death disclosure from last month get the same retrieval treatment.

Failure 02

No model of resolution

Resolved grief and active crisis look identical in retrieval. Both have high recall counts. Both surface. The system can't tell them apart.

Failure 03

Flat salience

"I had Thai food for lunch" and "my father just died" get treated as equal-weight memory nodes. No scoring at write-time means no scoring at retrieval-time.

Failure 04

Context-window waste

Without write-time scoring, retrieval fills the budget with stale, undifferentiated nodes. The model gets verbosity instead of signal.

What KAPEX does differently

Three lines of integration. Score, decay, inject.

Send a conversation turn to KAPEX, get a scored memory context block back, inject it into your LLM prompt. The core difference: processing-modulated decay. More processing means faster decay. This isn't a parameter tweak — it's a fundamentally different model.

Step 01

Send the turn

Your app passes the user input and your LLM's response to KAPEX. KAPEX scores, stores, and updates the memory graph — without changing your model.

Step 02

Recall what matters

At query time, KAPEX returns a token-budgeted context block of the highest-salience memories for that user. Three retrieval channels compete for the token budget.

Step 03

Inject into your prompt

Paste the context into your system prompt. The model now remembers — across sessions, across products, across providers. Works with Claude, GPT, Gemini, Llama, or your own.

Architecture at a glance

Six engines, one pipeline.

The architecture is structured, not a single black-box embedding model. Each engine is independently testable, swappable, and patent-pending.

Engine 01

Scoring engine

12 signal dimensions computed at write-time. Detail →

Engine 02

Legitimacy gap detection

Detects when what someone says doesn't match how they say it — the patent-worthy signal under NDA.

Engine 03

Entity-anchored hierarchy

Domain → entity → facet. Not flat vectors. Detail →

Engine 04

Residual floors

Memories decay toward category-specific minimums. They never fully disappear.

Engine 05

Three-channel retrieval

Salience + recency + constraint, token-budgeted. Detail →

Engine 06

13-module safety layer

Crisis sentinel, anti-fabrication, PII, prompt-injection. Detail →

How to evaluate

Run the same 20-message conversation. Twice.

The fastest way to evaluate KAPEX is a controlled side-by-side against your current memory system. Here's the protocol we use with pilot prospects.

The protocol

Same conversation. Same model. Different memory layer.

Run the same 20-message conversation through KAPEX and through your current memory system. Watch what happens at message 15 when the user has clearly moved past a topic discussed heavily in messages 2–6. In other systems, messages 2–6 still dominate context. In KAPEX, they've decayed proportionally to how much the user processed them.

If you want a pre-built eval harness with our internal scenarios, ask during pilot onboarding — we'll send it under mutual NDA.

Moat metrics

The numbers behind the claim.

From our internal benchmark suite. Methodology, exact scores, and the full result set are shared under mutual NDA.

Test pair

KAPEX scored emotional content

Vs the mundane equivalent

Spread

Loss-related disclosure vs casual preference

Large

Trauma reference vs weather chatter

Large

Anger / betrayal vs scheduling

Meaningful

Decay accelerates with processing.

Base

Unprocessed memories decay slowly

Moderate

After several sessions of processing

High

After sustained processing

Processed content recedes. Unresolved content persists. The exact decay magnitudes, formula, and parameter set are protected under patent and shared on a per-pilot basis under NDA.

Experience KAPEX live

Try the free beta

Patent pending

Run the eval. See the curve.

30-day pilot, full feature set, 100 users. We'll send the eval harness under NDA.

Start free pilot Watch demos