Your code never leaves the box
EmbeddingGemma runs on your CPU/GPU. Qdrant is a local daemon on 127.0.0.1. SQLite on disk. Nothing is sent to a paid API unless you explicitly opt in.
Persistent, semantic memory for your AI coding tools. -70.4% tokens, -31.7% cost across 150 benchmark runs. 100% local by default.
curl -fsSL https://raw.githubusercontent.com/alexandruleca/imprint-memory-layer/main/install.sh | bash
Every session starts blank. Claude Code reads chunker.py for the tenth time. Imprint gives every MCP-capable agent a semantic, local, persistent brain — searched, not grepped.
EmbeddingGemma runs on your CPU/GPU. Qdrant is a local daemon on 127.0.0.1. SQLite on disk. Nothing is sent to a paid API unless you explicitly opt in.
Debug queries alone drop by -68.3% cost because known failure modes are served from memory instead of re-read from disk.
Cross-project questions drop by -90.6% tokens. Semantic search returns the 5 chunks that matter, not the 3 files that contain them.
`imprint setup all` auto-detects Claude Code, Cursor, Codex CLI, Copilot, and Cline. No sudo. Self-managed .venv. Qdrant auto-spawned. Nothing leaks into your system install.
Canonical project name from manifests (package.json, go.mod, pyproject.toml), not path. Sync memory between your laptop and a cloud VPS and everything lines up.
A SQLite knowledge graph tracks subject → predicate → object with validity windows. Decisions, bug fixes, and patterns are recalled with the context of when they were true.
Your AI agent speaks MCP to a local Imprint server. Imprint talks to a Qdrant daemon on 127.0.0.1:6333, a SQLite fact graph, and an on-device ONNX embedding model. No API call leaves this box — unless you deliberately flip a switch.
All storage, embeddings, and tagging stay on-device. Nothing crosses the dashed line.
A single imprint config set tagger.llm true enables LLM-quality topics. Off by default.
WebSocket relay lets a laptop and a VPS share memory while keeping the data plane peer-to-peer.
Every prompt runs 5 times in each mode. Medians are reported. Raw JSON per run is checked in. See the full deep-dive or BENCHMARK.md on GitHub.
imprint setup all auto-wires every supported host that's installed. Missing tools are skipped with a warning, never a failure.
claude-code ~/.claude/settings.json cursor ~/.cursor/mcp.json + rules/imprint.mdc copilot <VSCode user>/mcp.json codex ~/.codex/config.toml cline VSCode globalStorage / ~/.cline
Self-contained .venv, auto-downloaded Qdrant binary, and every host configured in one run. imprint disable is symmetric. Prefer a direct download? Grab the latest build from the download page.
curl -fsSL https://raw.githubusercontent.com/alexandruleca/imprint-memory-layer/main/install.sh | bash imprint setup all · imprint ingest .Apache 2.0. Self-updating. Uninstall with one command. Start on your laptop, sync to a VPS when you're ready.