@seanhogg/builderforce-memory-mcp
Expose @seanhogg/builderforce-memory to any MCP client. One token-saving
tool core over a pluggable MemoryBackend, three transports:
| Transport | Factory | Consumed as | Use when |
|---|---|---|---|
| In-process (Claude Agent SDK) | createMemoryMcpServer(backend) |
the returned type:"sdk" object |
the consuming product is TS and runs @anthropic-ai/claude-agent-sdk in-process — lowest latency |
| stdio | runStdio(backend) / npx @seanhogg/builderforce-memory-mcp |
{ type:"stdio", command, args } |
any language / separate process; decouples the SSM+IndexedDB deps from the consumer |
| HTTP (Streamable) | createMemoryHttpHandler(backend, { authToken }) |
{ type:"http", url, headers } |
multi-tenant / networked (builderforce.ai hosting, remote agents) |
Technical report & peer review. This transport is reviewed in the Evermind technical report's adversarial
PEER-REVIEW.md. Note before exposing the HTTP handler publicly: theauthTokencheck is a non-constant-time string compare (timing channel) and the handler shares onebackendwith no per-tenant isolation — a single token reads every tenant's facts. Despite the "multi-tenant" note above, real isolation (token→namespace mapping, constant-time compare, rate limiting) is not yet implemented. Tracked asEVM-3in the Builderforce.ai Consolidated Gap Register.
Why this saves tokens
The point of an external memory store is fetch-on-demand instead of pinning
your whole memory file into every prompt. That only pays off if recall is
selective, so the caps are enforced server-side in tools.ts:
memory_recallreturns a ranked top-K (default 5, hard-capped), never the store.- Each entry's content is truncated (default 500 chars) before it hits context.
- There is no "return everything" tool — a dump-all is more expensive than inlining.
Tool descriptions are prescriptive ("call this before answering when…") because recent Claude models reach for tools more conservatively; the trigger condition in the description is what drives should-call rate. Pair this with prompt caching (keep the tool set identical across tenants so the cached prefix survives) and context editing (prune stale recalled facts) for the full win.
The seam
Everything is written against MemoryBackend:
interface MemoryBackend {
recall(query: string, topK: number): Promise<RecallHit[]>; // semantic
get(key: string): Promise<RecallHit | undefined>; // exact
recallByTag(tag: string, limit: number): Promise<RecallHit[]>;
remember?(input: RememberInput): Promise<void>; // optional → read-only backends omit
forget?(key: string): Promise<void>;
}Ship the local MemoryStoreBackend (IndexedDB via @seanhogg/builderforce-memory) today;
drop in a networked builderforce.ai adapter later with zero changes to tools
or transports.
Quick start — in-process with the Claude Agent SDK
import { query } from "@anthropic-ai/claude-agent-sdk";
import { createMemoryMcpServer, createLocalMemoryStoreBackend } from "@seanhogg/builderforce-memory-mcp";
const backend = await createLocalMemoryStoreBackend(); // IndexedDB (fake-indexeddb in Node)
const memory = await createMemoryMcpServer(backend); // type:"sdk" config
for await (const msg of query({
prompt: "What language does this user prefer? Check memory first.",
options: {
mcpServers: { builderforce_memory: memory },
allowedTools: ["mcp__builderforce_memory__*"], // auto-approve, no prompts
},
})) {
if (msg.type === "result" && msg.subtype === "success") console.log(msg.result);
}SSM-embedding recall (the premium path)
The local backend defaults to lexical (Jaccard) recall. For SSM-embedding cosine recall, pass an SSM runtime — e.g. reuse the agent-runtime's already-loaded Evermind runtime instead of standing up a second model:
const backend = await createLocalMemoryStoreBackend({ runtime: ssmMemoryService.runtime });Recall quality then improves automatically as that model is adapted/distilled.
One-shot install into any MCP host (not just Claude)
# Auto-detect installed agents and register the memory server into each:
npx -y -p @seanhogg/builderforce-memory-mcp builderforce-memory-install
npx -y -p @seanhogg/builderforce-memory-mcp builderforce-memory-install --host=all
npx -y -p @seanhogg/builderforce-memory-mcp builderforce-memory-install --host=cursor,windsurfWrites the stdio server entry into each host's own config, idempotently and with
.bak backups. Supported hosts: Claude Code, Claude Desktop, Cursor, Windsurf,
VS Code, Cline, Gemini CLI, Codex CLI. Every agent points at one shared store
(~/.builderforce-memory/memory.json by default, override with --memory-file),
so a fact remembered in one agent is recalled in another. --readonly drops the
write tools; --local=<dist/bin/stdio.js> runs a checkout for development.
The same launch spec is exported for programmatic use:
buildServerSpec() / installMemoryServer() (see src/install/).
Claude Code: the self-driving "memory combo"
Registering the server gives Claude Code the memory tools. For Claude Code the
installer ALSO wires hooks so memory is self-driving instead of advisory (skip
with --no-hooks):
| Hook | Behaviour |
|---|---|
SessionStart |
Inject a digest of the top durable memories. |
PreCompact |
Flush durable learnings before the window is summarised. |
UserPromptSubmit |
Contextual recall — score memories against the prompt and inject the relevant ones, so a stored fact is retrieved at the decision point, not only at session start. |
Stop |
Autonomous capture — if the user's last message was a durable instruction/correction and it wasn't consolidated this turn, BLOCK the stop with a directive to memory_remember. Capture flows through the MCP write path (never racing the snapshot); a stop_hook_active guard means at most one nudge per turn. |
All four point at one generated, dependency-free hook script
(~/.claude/builderforce-memory/bfmem-hook.mjs, four modes) plus a companion
skill. Idempotent + .bak-safe. Source of truth: src/install/claude-hooks.ts
(installClaudeCombo / bfmemHookSource).
Quick start — stdio (any process / language)
npx -y @seanhogg/builderforce-memory-mcp # serves the local MemoryStore over stdiomcpServers: {
builderforce_memory: { type: "stdio", command: "npx", args: ["-y", "@seanhogg/builderforce-memory-mcp"] },
}Env: BUILDERFORCE_MEMORY_DB (db name), BUILDERFORCE_MEMORY_READONLY=1 (drop write tools).
Quick start — HTTP (multi-tenant)
BUILDERFORCE_MEMORY_TOKEN=secret PORT=8787 npx @seanhogg/builderforce-memory-mcp-httpmcpServers: {
builderforce_memory: {
type: "http",
url: "http://localhost:8787",
headers: { Authorization: "Bearer secret" },
},
}Or mount createMemoryHttpHandler(backend, { authToken }) on your own Node/Express
server against any backend (including a future remote one).
Custom / remote backend
Implement MemoryBackend and hand it to any transport:
import { createMemoryMcpServer, type MemoryBackend } from "@seanhogg/builderforce-memory-mcp";
const remote: MemoryBackend = {
recall: (q, k) => bfClient.search(q, k),
get: (key) => bfClient.get(key),
recallByTag: (t, lim) => bfClient.byTag(t, lim),
remember: (input) => bfClient.put(input),
forget: (key) => bfClient.delete(key),
};
const server = await createMemoryMcpServer(remote);Dependencies
@modelcontextprotocol/sdk + zod are hard deps. @anthropic-ai/claude-agent-sdk
(SDK transport), @seanhogg/builderforce-memory + fake-indexeddb (local backend) are
optional peers, imported indirectly — install only what your transport/backend
needs.