memory-lancedb-pro

Production-grade long-term memory system (v1.1.0-beta.8) for OpenClaw AI agents. Provides persistent, intelligent memory storage using LanceDB with hybrid vector + BM25 retrieval, LLM-powered Smart Extraction, Weibull decay lifecycle, and multi-scope isolation.

For full technical details (thresholds, formulas, database schema, source file map), see references/full-reference.md.

Applying the Optimal Config (Step-by-Step Workflow)

When the user says "help me enable the best config", "apply optimal configuration", or similar, follow this exact procedure:

Step 1 — Present configuration plans and let user choose

Present these four plans in a clear comparison, then ask the user to pick one:

Plan A — 🏆 Full Power (Best Quality)

- Embedding: Jina jina-embeddings-v5-text-small (task-aware, 1024-dim)
- Reranker: Jina jina-reranker-v3 (cross-encoder, same key)
- LLM: OpenAI gpt-4o-mini (Smart Extraction)
- Keys needed: JINA_API_KEY + OPENAI_API_KEY
- Get keys: Jina → https://jina.ai/api-key · OpenAI → https://platform.openai.com/api-keys
- Cost: Both paid (Jina has a free tier with limited quota)
- Best for: Production deployments, highest retrieval quality

Plan B — 💰 Budget (Free Reranker)

- Embedding: Jina jina-embeddings-v5-text-small
- Reranker: SiliconFlow BAAI/bge-reranker-v2-m3 (free tier available)
- LLM: OpenAI gpt-4o-mini
- Keys needed: JINA_API_KEY + SILICONFLOW_API_KEY + OPENAI_API_KEY
- Get keys: Jina → https://jina.ai/api-key · SiliconFlow → https://cloud.siliconflow.cn/account/ak · OpenAI → https://platform.openai.com/api-keys
- Cost: Jina embedding paid, SiliconFlow reranker free tier, OpenAI paid
- Best for: Cost-sensitive deployments that still want reranking

Plan C — 🟢 Simple (OpenAI Only)

- Embedding: OpenAI text-embedding-3-small
- Reranker: None (vector+BM25 fusion only, no cross-encoder)
- LLM: OpenAI gpt-4o-mini
- Keys needed: OPENAI_API_KEY only
- Get key: https://platform.openai.com/api-keys
- Cost: OpenAI paid only
- Best for: Users who already have OpenAI and want minimal setup

Plan D — 🖥️ Fully Local (Ollama, No API Keys)

- Embedding: Ollama mxbai-embed-large (1024-dim, recommended) or nomic-embed-text:v1.5 (768-dim, lighter)
- Reranker: None. Ollama has no cross-encoder reranker; retrieval uses vector+BM25 fusion only
- LLM: Ollama via OpenAI-compatible endpoint. Recommended models with reliable JSON/structured output:
  - qwen3:8b (recommended: best JSON output, native structured output, ~5.2GB)
  - qwen3:14b (better quality, ~9GB, needs 16GB VRAM)
  - llama4:scout (multimodal MoE, 10M ctx, ~12GB)
  - mistral-small3.2 (24B, 128K ctx, excellent instruction following, ~15GB)
  - mistral-nemo (12B, 128K ctx, efficient, ~7GB)
- Keys needed: None (fully local, no external API calls)
- Prerequisites:
  - Ollama installed: https://ollama.com/download
  - Models pulled (see Step 5 below)
  - Ollama running: macOS = launch the app from Applications; Linux = systemctl start ollama or ollama serve
- Cost: Free (hardware only)
- RAM requirements: mxbai-embed-large ~670MB; qwen3:8b ~5.2GB; qwen3:14b ~9GB; llama4:scout ~12GB; mistral-small3.2 ~15GB
- Trade-offs: No cross-encoder reranking means lower retrieval precision than Plans A/B. Smart Extraction quality depends on the local LLM; if extraction produces garbage, set smartExtraction: false
- Best for: Privacy-sensitive deployments, air-gapped environments, zero API cost

After user selects a plan, ask in one message:

1. Please provide the required API key(s) for your chosen plan (paste directly, or say "already set as env vars")
2. Are the env vars already set in your OpenClaw Gateway process? (If unsure, answer No)
3. Where is your openclaw.json? (Skip if you want me to find it automatically)

If the user already stated their provider/keys in context, skip asking and proceed.

Do NOT proceed to Step 3 until API keys have been collected and verified (Step 2 below).

Step 2 — Verify API Keys (MANDATORY — do not skip)

Run ALL key checks for the chosen plan before touching any config. If any check fails, STOP and tell the user which key failed and why. Do not proceed to Step 3.

Plan A / Plan B — Jina embedding check:
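curl -s -o /dev/null -w "%{http_code}" \
  https://api.jina.ai/v1/embeddings \
  -H "Authorization: Bearer $JINA_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{"model":"jina-embeddings-v5-text-small","input":["test"]}'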

Plan A / B / C — OpenAI check:
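curl -s -o /dev/null -w "%{http_code}" \
  https://api.openai.com/v1/models \
  -H "Authorization: Bearer $OPENAI_API_KEY"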

Plan B — SiliconFlow reranker check:
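curl -s -o /dev/null -w "%{http_code}" \
  https://api.siliconflow.com/v1/rerank \
  -H "Authorization: Bearer $SILICONFLOW_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{"model":"BAAI/bge-reranker-v2-m3","query":"test","documents":["test doc"]}'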

Plan D — Ollama check:
curl -s -o /dev/null -w "%{http_code}" http://localhost:11434/api/tags

Interpret results:

| HTTP code | Meaning | Action |
| --- | --- | --- |
| 200 / 201 | Key valid, quota available | ✅ Continue |
| 401 / 403 | | |

If any check fails: Tell the user exactly which provider failed, the HTTP code received, and what to fix. Do not proceed with installation until all required keys pass their checks.

If the user says keys are set as env vars in the gateway process, run checks using ${VAR_NAME} substituted inline or ask them to paste the key temporarily for verification.
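The stop-on-failure rule can be sketched in a few lines of Python. This is only an illustration: the function names are hypothetical, and treating 401/403 as a rejected key is an assumption based on standard HTTP semantics, since the table row for those codes is incomplete here.

```python
def interpret_key_check(provider: str, status: int) -> tuple[bool, str]:
    """Return (ok, message) for one provider's verification request."""
    if status in (200, 201):
        return True, f"{provider}: key accepted (HTTP {status}), continue"
    if status in (401, 403):
        # Assumption: standard HTTP meaning (unauthorized / forbidden).
        return False, f"{provider}: key rejected (HTTP {status}), ask for a valid key"
    return False, f"{provider}: unexpected HTTP {status}, stop and report it"


def failed_checks(results: dict[str, int]) -> list[str]:
    """Collect failure messages; an empty list means every check passed."""
    messages = []
    for provider, status in results.items():
        ok, message = interpret_key_check(provider, status)
        if not ok:
            messages.append(message)
    return messages
```

If `failed_checks` returns anything, report those messages to the user and do not move on to Step 3.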

Step 3 — Find openclaw.json

Check these locations in order:
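# Most common locations
ls ~/.openclaw/openclaw.json
ls ~/openclaw.json

# Ask the gateway where it reads its config from
openclaw config get --show-path 2>/dev/null || echo "not found"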

If not found, ask the user for the path.

Step 4 — Read current config

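# Read and display the current plugin config before changing anything
openclaw config get plugins.entries.memory-lancedb-pro 2>/dev/null
openclaw config get plugins.slots.memory 2>/dev/null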

Check what already exists — never blindly overwrite existing settings.

Step 5 — Build the merged config based on chosen plan

Use the config block for the chosen plan. Substitute actual API keys inline if the user provided them directly; keep ${ENV_VAR} syntax if they confirmed env vars are set in the gateway process.

Plan A config (plugins.entries.memory-lancedb-pro.config):
CODEBLOCK6

Plan B config:
CODEBLOCK7

Plan C config:
CODEBLOCK8

Plan D config (replace models as needed — qwen3:8b recommended for LLM, mxbai-embed-large for embedding):
CODEBLOCK9

Plan D prerequisites — run BEFORE applying config:
CODEBLOCK10

If Smart Extraction produces garbled/invalid output: The local LLM may not support structured JSON reliably. Try qwen3:8b first — it has native structured output support. If still failing, disable:
CODEBLOCK11

If Ollama is on a different host or Docker: Replace http://localhost:11434/v1 with the actual host, e.g. http://192.168.1.100:11434/v1. Also set OLLAMA_HOST=0.0.0.0 in the Ollama process to allow remote connections.

For the plugins.entries.memory-lancedb-pro.config block, merge into the existing openclaw.json rather than replacing the whole file. Use a targeted edit of only the memory plugin config section.
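A minimal sketch of what "merge rather than replace" means, assuming openclaw.json has already been parsed into a Python dict. The helper names `deep_merge` and `merge_plugin_config` are hypothetical and not part of OpenClaw; only the `plugins.entries.memory-lancedb-pro.config` path comes from this document.

```python
import copy


def deep_merge(base: dict, patch: dict) -> dict:
    """Return a copy of base with patch applied key by key (dicts merge recursively)."""
    out = copy.deepcopy(base)
    for key, value in patch.items():
        if isinstance(value, dict) and isinstance(out.get(key), dict):
            out[key] = deep_merge(out[key], value)
        else:
            out[key] = copy.deepcopy(value)
    return out


def merge_plugin_config(openclaw: dict, plugin_config: dict) -> dict:
    """Touch only plugins.entries.memory-lancedb-pro.config; leave everything else as is."""
    patch = {"plugins": {"entries": {"memory-lancedb-pro": {"config": plugin_config}}}}
    return deep_merge(openclaw, patch)
```

Unrelated plugins, agent settings, and any plugin fields the plan does not mention survive the merge unchanged.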

Step 6 — Apply the config

Read the current openclaw.json first, then apply a surgical edit to the plugins.entries.memory-lancedb-pro section. Use the template that matches your installation method:

Method 1 — openclaw plugins install (plugin was installed via the plugin manager):
No load.paths or allow needed — the plugin manager already registered the plugin.
CODEBLOCK12

Method 2 — git clone with manual path (workspace plugin):
Both load.paths AND allow are required — workspace plugins are disabled by default.
CODEBLOCK13

Step 7 — Validate and restart

CODEBLOCK14

Expected output confirms:

- INLINECODE51
- INLINECODE52

Step 8 — Verify

CODEBLOCK15

Then do a quick smoke test:

1. Store: call memory_store with INLINECODE54
2. Recall: call memory_recall with INLINECODE56
3. Confirm the memory is returned

Installation

Quick Install (Beginner-Friendly)

For new users, the community one-click installer handles everything automatically — path detection, schema validation, auto-update, provider selection, and rollback:

CODEBLOCK16

Options: --dry-run (preview only), --beta (include pre-release), --ref v1.2.0 (pin version), --selfcheck-only, --uninstall.

Source: https://github.com/CortexReach/toolbox/tree/main/memory-lancedb-pro-setup

Requirements

- Node.js 24 recommended (Node 22 LTS minimum, 22.16+)
- LanceDB ≥ 0.26.2
- OpenAI SDK ≥ 6.21.0
- TypeBox 0.34.48

Install Method 1 — via OpenClaw plugin manager (recommended)

CODEBLOCK17

npm vs GitHub branches: @beta installs from the npm registry (not directly from GitHub). The repo has two long-lived branches: master is the release branch (matches npm @beta), main is older/behind. Always clone master if you want code that matches the published beta.

Then bind the memory slot and add your config (see Configuration section below):
CODEBLOCK18

Restart and verify:
CODEBLOCK19

Install Method 2 — git clone with manual path (Path A for development)

⚠️ Critical: Workspace plugins (git-cloned paths) are disabled by default in OpenClaw. You MUST explicitly enable them.

CODEBLOCK20

Add to openclaw.json — the enabled: true and the allow entry are both required:
CODEBLOCK21

Validate and restart:
CODEBLOCK22

Expected log output:

- INLINECODE71
- INLINECODE72

Install Method 3 — Existing deployments (Path B)

Use absolute paths in plugins.load.paths. Add to plugins.allow. Bind memory slot: plugins.slots.memory = "memory-lancedb-pro". Set plugins.entries.memory-lancedb-pro.enabled: true.

Then restart and verify:
CODEBLOCK23

New User First-Install Checklist

After the plugin starts successfully, determine which scenario applies and run the corresponding steps:

Scenario A — Coming from built-in memory-lancedb plugin (most common upgrade path)

The old plugin stores data in LanceDB at ~/.openclaw/memory/lancedb. Use the migrate command:

CODEBLOCK24

If the old database is at a non-default path:

openclaw memory-pro migrate check --source /path/to/old/lancedb
openclaw memory-pro migrate run --source /path/to/old/lancedb

Scenario B — Existing memories exported as JSON

If you have memories in the standard JSON export format:

CODEBLOCK26

Expected JSON schema:

{
  "version": "1.0",
  "memories": [
    {
      "text": "Memory content (required)",
      "category": "preference|fact|decision|entity|other",
      "importance": 0.7,
      "timestamp": 1234567890000
    }
  ]
}

Scenario C — Memories stored in Markdown files (AGENTS.md, MEMORY.md, etc.)

There is no direct markdown import — the import command only accepts JSON. You need to convert first.

Manual conversion approach:

1. Open the markdown file(s) containing memories
2. For each memory entry, create a JSON object with text, category, INLINECODE81
3. Save as a JSON file following the schema above
4. Run INLINECODE82

Or use memory_store tool directly in the agent to store individual entries one at a time:
CODEBLOCK28

Note: Markdown-based memory files (MEMORY.md, AGENTS.md) are workspace context files, not the same as the LanceDB memory store. You only need to migrate them if you want that content searchable via memory_recall.
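The manual conversion can be scripted. The sketch below is one possible approach, not a shipped tool: it assumes each top-level markdown bullet is one memory, that all entries share one category, and that `timestamp` is in milliseconds (as the example value in the schema suggests). The function name `bullets_to_import` is hypothetical.

```python
import json
import time

ALLOWED_CATEGORIES = {"preference", "fact", "decision", "entity", "other"}


def bullets_to_import(markdown: str, category: str = "other", importance: float = 0.7) -> dict:
    """Turn "- item" / "* item" lines into the JSON import structure."""
    if category not in ALLOWED_CATEGORIES:
        raise ValueError(f"unknown category: {category}")
    now_ms = int(time.time() * 1000)  # assumption: timestamps are epoch milliseconds
    memories = []
    for line in markdown.splitlines():
        stripped = line.strip()
        if stripped.startswith(("- ", "* ")):
            text = stripped[2:].strip()
            if text:
                memories.append({
                    "text": text,
                    "category": category,
                    "importance": importance,
                    "timestamp": now_ms,
                })
    return {"version": "1.0", "memories": memories}


if __name__ == "__main__":
    sample = "# Preferences\n- Prefers dark mode\n- Uses pnpm for installs\n"
    print(json.dumps(bullets_to_import(sample, "preference"), indent=2))
```

Review the generated file by hand before importing; headings and free-form paragraphs are skipped on purpose.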

Scenario D — Fresh install, no prior memories

No migration needed. Verify the plugin is working with a quick smoke test:

openclaw memory-pro stats     # should show 0 memories

Then trigger a conversation — autoCapture will start storing memories automatically.

LanceDB Version Compatibility

No manual action required for LanceDB version changes.

The plugin requires @lancedb/lancedb ^0.26.2 as an npm dependency — this is installed automatically when you install or update the plugin. You do not need to manually install or upgrade LanceDB.

LanceDB 0.26+ changed how numeric columns are returned (Arrow BigInt type for timestamp, importance, _distance, _score). The plugin handles this transparently at runtime via internal Number(...) coercion — no migration commands are needed when moving between LanceDB versions.

TL;DR: LanceDB version compatibility is fully automatic. See the table below for when each maintenance command actually applies.

Upgrading plugin code vs. data

Command distinction (important):

| Command | When to use |
| --- | --- |
| INLINECODE93 | Update plugin code after a new release (npm-installed only) |
| INLINECODE94 | Update all npm-installed plugins at once |
| openclaw memory-pro upgrade | Enrich old memory-lancedb-pro entries that predate the smart-memory schema (missing L0/L1/L2 metadata + 6-category system). NOT related to LanceDB version |
| openclaw memory-pro migrate | One-time migration from the separate memory-lancedb built-in plugin → Pro |
| openclaw memory-pro reembed | Rebuild all embeddings after switching embedding model or provider |

When do you need memory-pro upgrade?

Run it if you installed memory-lancedb-pro before the smart-memory format was introduced (i.e., entries are missing memory_category in their metadata). Signs you need it:

- memory_recall returns results but without meaningful categories
- INLINECODE102 shows entries with no l0_abstract / l1_overview fields

Safe upgrade sequence:
CODEBLOCK30

Upgrade options:
CODEBLOCK31

Plugin management commands

CODEBLOCK32

Gateway restart required after: plugins install, plugins enable, plugins disable, plugins update, or any change to openclaw.json. Changes do not take effect until the gateway is restarted.
CODEBLOCK33

Easy-to-Miss Setup Steps

1. Gateway restart required after any change: After installing, enabling, disabling, updating, or changing config in openclaw.json, you MUST run openclaw gateway restart. Changes are NOT hot-reloaded.
2. Workspace plugins are DISABLED by default: After git clone, you MUST add plugins.allow: ["memory-lancedb-pro"] AND plugins.entries.memory-lancedb-pro.enabled: true. Without these the plugin silently does not load.
3. Env vars in gateway process: ${OPENAI_API_KEY} requires env vars set in the OpenClaw Gateway service process, not just your shell.
4. Absolute vs. relative paths: For existing deployments, always use absolute paths in plugins.load.paths.
5. baseURL not baseUrl: The embedding (and llm) config field is baseURL (capital URL), NOT baseUrl. Using the wrong casing causes a schema validation error: "must NOT have additional properties". Also note the required /v1 suffix: http://localhost:11434/v1, not http://localhost:11434. Do not confuse with agents.defaults.memorySearch.remote.baseUrl, which uses a different casing.
6. jiti cache invalidation: After modifying .ts files under plugins, run rm -rf /tmp/jiti/ BEFORE openclaw gateway restart.
7. Unknown plugin id = error: OpenClaw treats unknown ids in entries, allow, deny, or slots as validation errors. The plugin id must be discoverable before referencing it.
8. Separate LLM config: If embedding and LLM use different providers, configure the llm section separately; otherwise it falls back to the embedding key/URL.
9. Scope isolation: Multi-scope requires an explicit scopes.agentAccess mapping. Without it, agents only see global scope.
10. Session memory hook: Fires on the /new command. Test with an actual /new invocation.
11. Reranker credentials: When switching providers, update both rerankApiKey AND rerankEndpoint.
12. Config check before assuming defaults: Run openclaw config get plugins.entries.memory-lancedb-pro to verify what's actually loaded.
13. Custom config/state paths via env vars: OpenClaw respects the following environment variables for custom paths:
    - OPENCLAW_HOME: sets the root config/data directory (default: ~/.openclaw/)
    - OPENCLAW_CONFIG_PATH: absolute path to openclaw.json override
    - OPENCLAW_STATE_DIR: override for runtime state/data directory

    Set these in the OpenClaw Gateway process's environment if the default ~/.openclaw/ path is not appropriate.
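To make the path variables concrete, here is a small sketch of how a helper script might locate openclaw.json. The precedence shown (OPENCLAW_CONFIG_PATH first, then OPENCLAW_HOME, then ~/.openclaw/) is an assumption for illustration; this document does not state how OpenClaw itself orders them, and `resolve_config_path` is a hypothetical name.

```python
from pathlib import Path


def resolve_config_path(env: dict[str, str]) -> Path:
    """Guess where openclaw.json lives from a mapping of environment variables."""
    explicit = env.get("OPENCLAW_CONFIG_PATH")
    if explicit:
        # An explicit override points directly at the file.
        return Path(explicit)
    home = env.get("OPENCLAW_HOME")
    root = Path(home) if home else Path.home() / ".openclaw"
    return root / "openclaw.json"
```

Pass `dict(os.environ)` from the same environment the gateway runs in; a path resolved from your interactive shell may differ from what the service sees.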

Post-Installation Verification

CODEBLOCK34

Full smoke test checklist:

- ✅ Plugin info shows enabled: true and config loaded
- ✅ Hooks include before_agent_start, agent_end, INLINECODE148
- ✅ One memory_store → memory_recall round trip via tools
- ✅ One exact-ID search hit
- ✅ One natural-language search hit
- ✅ If session memory enabled: one real /new test

Troubleshooting — Error Message Quick Reference

Config validation tool (from CortexReach/toolbox):

# Download once
curl -fsSL https://raw.githubusercontent.com/CortexReach/toolbox/main/memory-lancedb-pro-setup/scripts/config-validate.mjs -o config-validate.mjs
# Run against your openclaw.json
node config-validate.mjs
# Or validate a specific config snippet
node config-validate.mjs --json '{"embedding":{"baseURL":"http://localhost:11434/v1","model":"bge-m3","apiKey":"ollama"}}'

Exit code 0 = pass/warn, 1 = errors found.

| Error message | Root cause | Fix |
| --- | --- | --- |
| INLINECODE152 + INLINECODE153 | Field name typo in embedding config (e.g. `baseUrl` instead of `baseURL`) | Check all field names against the schema table below; field names are case-sensitive |
| INLINECODE156 (top-level config) | Unknown top-level field in plugin config | Remove or correct the field |
| memory-lancedb-pro: plugin not found / plugin silently not loading | plugins.allow missing (git-clone install) or enabled: false | Add plugins.allow: ["memory-lancedb-pro"] and set enabled: true, then restart |
| Unknown plugin id validation error | Plugin referenced in entries/slots before it's discoverable | Install/register the plugin first, then add config references |
| ${OPENAI_API_KEY} not expanding / auth errors despite env var set | Env var not set in the gateway process environment | Set the env var in the service that runs OpenClaw gateway, not just your shell |
| Hooks (before_agent_start, agent_end) not firing | Gateway not restarted after install/config change | Run openclaw gateway restart |
| Embedding errors with Ollama | Wrong baseURL format | Must be http://localhost:11434/v1 (with /v1), field must be baseURL not baseUrl |
| memory-pro stats shows 0 entries after conversation | autoCapture false or extractMinMessages not reached | Set autoCapture: true; need at least extractMinMessages (default 2) turns |
| Memories not injected before agent replies | autoRecall is false (schema default) | Explicitly set "autoRecall": true |
| jiti cache error after editing plugin .ts files | Stale compiled cache | Run rm -rf /tmp/jiti/ then openclaw gateway restart |
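Two of the most common mistakes in the table (wrong `baseURL` casing and a missing `/v1` suffix for Ollama) are easy to catch before restarting the gateway. The sketch below is a hypothetical pre-check, not the CortexReach validator; detecting an Ollama endpoint by its default port 11434 is a heuristic assumed for this example.

```python
def find_base_url_problems(config: dict, path: str = "") -> list[str]:
    """Walk a plugin config dict and report baseURL casing and /v1 suffix issues."""
    problems = []
    for key, value in config.items():
        here = f"{path}.{key}" if path else key
        if key.lower() == "baseurl" and key != "baseURL":
            problems.append(f"{here}: rename to baseURL")
        elif (
            key == "baseURL"
            and isinstance(value, str)
            and "11434" in value  # heuristic: Ollama's default port
            and not value.rstrip("/").endswith("/v1")
        ):
            problems.append(f"{here}: Ollama endpoint is missing the /v1 suffix")
        if isinstance(value, dict):
            problems.extend(find_base_url_problems(value, here))
    return problems
```

An empty list means neither issue was found; it does not replace full schema validation.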

Configuration

Minimal Quick-Start

CODEBLOCK36

Note: autoRecall is disabled by default in the plugin schema — explicitly set it to true for new deployments.

Optimal Production Config (recommended)

Uses Jina for both embedding and reranking — best retrieval quality:

CODEBLOCK37

Why these settings excel:

- Jina embeddings: Task-aware vectors (taskQuery/taskPassage) optimized for retrieval
- Hybrid mode 0.7/0.3: Balances semantic understanding with exact keyword matching
- Jina reranker v3: Cross-encoder reranking significantly improves relevance
- candidatePoolSize: 12 + minScore: 0.6: Aggressive filtering reduces noise
- captureAssistant: false: Prevents storing agent-generated boilerplate
- sessionMemory: false: Avoids polluting retrieval with session summaries

Full Config (all options)

{
  "embedding": {
    "apiKey": "${JINA_API_KEY}",
    "model": "jina-embeddings-v5-text-small",
    "baseURL": "https://api.jina.ai/v1",
    "dimensions": 1024,
    "taskQuery": "retrieval.query",
    "taskPassage": "retrieval.passage",
    "normalized": true
  },
  "dbPath": "~/.openclaw/memory/lancedb-pro",
  "autoCapture": true,
  "autoRecall": true,
  "captureAssistant": false,
  "smartExtraction": true,
  "llm": {
    "apiKey": "${OPENAI_API_KEY}",
    "model": "gpt-4o-mini",
    "baseURL": "https://api.openai.com/v1"
  },
  "extractMinMessages": 2,
  "extractMaxChars": 8000,
  "enableManagementTools": false,
  "retrieval": {
    "mode": "hybrid",
    "vectorWeight": 0.7,
    "bm25Weight": 0.3,
    "minScore": 0.3,
    "hardMinScore": 0.35,
    "rerank": "cross-encoder",
    "rerankProvider": "jina",
    "rerankModel": "jina-reranker-v3",
    "rerankEndpoint": "https://api.jina.ai/v1/rerank",
    "rerankApiKey": "${JINA_API_KEY}",
    "candidatePoolSize": 20,
    "recencyHalfLifeDays": 14,
    "recencyWeight": 0.1,
    "filterNoise": true,
    "lengthNormAnchor": 500,
    "timeDecayHalfLifeDays": 60,
    "reinforcementFactor": 0.5,
    "maxHalfLifeMultiplier": 3
  },
  "scopes": {
    "default": "global",
    "definitions": {
      "global": { "description": "Shared knowledge" },
      "agent:discord-bot": { "description": "Discord bot private" }
    },
    "agentAccess": {
      "discord-bot": ["global", "agent:discord-bot"]
    }
  },
  "sessionStrategy": "none",
  "memoryReflection": {
    "storeToLanceDB": true,
    "injectMode": "inheritance+derived",
    "agentId": "memory-distiller",
    "messageCount": 120,
    "maxInputChars": 24000,
    "thinkLevel": "medium"
  },
  "selfImprovement": {
    "enabled": true,
    "beforeResetNote": true,
    "ensureLearningFiles": true
  },
  "mdMirror": { "enabled": false },
  "decay": {
    "recencyHalfLifeDays": 30,
    "recencyWeight": 0.4,
    "frequencyWeight": 0.3,
    "intrinsicWeight": 0.3,
    "betaCore": 0.8,
    "betaWorking": 1.0,
    "betaPeripheral": 1.3
  },
  "tier": {
    "coreAccessThreshold": 10,
    "coreCompositeThreshold": 0.7,
    "coreImportanceThreshold": 0.8,
    "workingAccessThreshold": 3,
    "workingCompositeThreshold": 0.4,
    "peripheralCompositeThreshold": 0.15,
    "peripheralAgeDays": 60
  }
}

Configuration Field Reference

Embedding

| Field | Type | Default | Description |
| --- | --- | --- | --- |
| INLINECODE193 | string | (none) | API key (supports `${ENV_VAR}`); array for multi-key failover |
| INLINECODE195 | string | (none) | Model identifier |

Top-Level

| Field | Type | Default | Description |
| --- | --- | --- | --- |
| INLINECODE206 | string | INLINECODE207 | LanceDB data directory |
| INLINECODE208 | boolean | true | Auto-extract memories after agent replies (via `agent_end` hook) |

LLM (for Smart Extraction)

| Field | Type | Default | Description |
| --- | --- | --- | --- |
| INLINECODE234 | string | falls back to INLINECODE235 | LLM API key |
| INLINECODE236 | string | `openai/gpt-oss-120b` | LLM model for extraction |

Retrieval

| Field | Type | Default | Description |
| --- | --- | --- | --- |
| INLINECODE240 | string | INLINECODE241 | INLINECODE242 / `vector` (`bm25`-only mode does not exist in schema) |
| INLINECODE245 | number | 0.7 | Weight for vector search |

Access reinforcement note: Reinforcement is whitelisted to source: "manual" only — auto-recall does NOT strengthen memories, preventing noise amplification.

Session Strategy (v1.1.0+)

Use sessionStrategy (top-level field) to configure the session pipeline:

| Value | Behavior |
| --- | --- |
| INLINECODE275 (default) | Built-in session memory (simpler) |
| INLINECODE276 | Advanced LLM-powered reflection with inheritance/derived injection |
| "none" | Session summaries disabled |

memoryReflection config (used when sessionStrategy: "memoryReflection"):

Field	Type	Default	Description
INLINECODE280	boolean	true	Persist reflections to LanceDB
INLINECODE281

memoryReflection.recall sub-object (controls which past reflections are retrieved for injection):

Field	Type	Default	Description
INLINECODE301	string	INLINECODE302	Recall mode: `fixed` / INLINECODE304
INLINECODE305

Session Memory (deprecated — legacy compat only)

⚠️ sessionMemory is a legacy compatibility shim since v1.1.0. Prefer sessionStrategy instead.

- sessionMemory.enabled: true → maps to INLINECODE318
- INLINECODE319 → maps to INLINECODE320

| Field | Type | Default | Description |
| --- | --- | --- | --- |
| INLINECODE321 | boolean | false | Legacy: enable session summaries on INLINECODE322 |
| INLINECODE323 | number | 15 | Legacy: maps to memoryReflection.messageCount |

Self-Improvement Governance

Field	Type	Default	Description
INLINECODE325	boolean	true	Enable self-improvement tools (`self_improvement_log` etc.) — on by default
INLINECODE327

Tool activation rules:

- self_improvement_log: requires selfImprovement.enabled: true (the default; active unless explicitly disabled)
- INLINECODE334 + self_improvement_review: additionally require INLINECODE336

Markdown Mirror

| Field | Type | Default | Description |
| --- | --- | --- | --- |
| INLINECODE337 | boolean | false | Mirror memory entries as `.md` files |
| INLINECODE339 | string | (none) | Directory for markdown mirror files |

Decay

| Field | Type | Default | Description |
| --- | --- | --- | --- |
| INLINECODE340 | number | 30 | Base Weibull decay half-life |
| INLINECODE341 | number | 0.4 | Weight of recency in lifecycle score (distinct from `retrieval.recencyWeight`) |

Tier Management

| Field | Type | Default | Description |
| --- | --- | --- | --- |
| INLINECODE354 | number | 10 | Access count for core promotion |
| INLINECODE355 | number | 0.7 | Lifecycle score for core promotion |

MCP Tools

Core Tools (auto-registered)

memory_recall — Search long-term memory via hybrid retrieval

Parameter	Type	Required	Default	Notes
INLINECODE362	string	yes	—	Search query
INLINECODE363

memory_store — Save information to long-term memory

Parameter	Type	Required	Default	Notes
INLINECODE368	string	yes	—	Information to remember
INLINECODE369

memory_forget — Delete memories by search or direct ID

Parameter	Type	Required	Notes
INLINECODE374	string	one of	Search query to locate memory
INLINECODE375

memory_update — Update memory (preserves original timestamp; preference/entity text updates create a new versioned row preserving history)

Parameter	Type	Required	Notes
INLINECODE380	string	yes	Full UUID or 8+ char prefix
INLINECODE381

Management Tools (enable with `enableManagementTools: true`)

memory_stats — Usage statistics

- scope (string, optional): Filter by scope

memory_list — List recent memories with filtering

- limit (number, optional, default 10, max 50), scope, category, offset (pagination)

Self-Improvement Tools

INLINECODE394 is enabled by default (selfImprovement.enabled: true). self_improvement_extract_skill and self_improvement_review additionally require enableManagementTools: true.

self_improvement_log — Log learning/error entries into LEARNINGS.md / ERRORS.md

Parameter	Type	Required	Notes
INLINECODE400	enum	yes	INLINECODE401 or INLINECODE402
INLINECODE403

self_improvement_extract_skill — Create skill scaffold from a learning entry

Parameter	Type	Required	Default	Notes
INLINECODE414	string	yes	—	Format `LRN-YYYYMMDD-001` or INLINECODE416
INLINECODE417

self_improvement_review — Summarize governance backlog (no parameters)

Smart Extraction

LLM-powered automatic memory classification and storage triggered after conversations.

Enable

CODEBLOCK39

Minimal (reuses embedding API key — no separate llm block needed):
CODEBLOCK40

Disable: INLINECODE425

6-Category Classification

Input Category	Stored As	Dedup Behavior
Profile	INLINECODE426	Always merge (auto-consolidates)
Preferences

L0/L1/L2 Layered Content per Memory

- L0 (Abstract): Single-sentence index (min 5 chars)
- L1 (Overview): Structured markdown summary
- L2 (Content): Full narrative detail

Two-Stage Deduplication

1. Vector pre-filter: Similarity ≥ 0.7 finds candidates
2. LLM decision: INLINECODE432
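The first stage can be pictured as a plain similarity filter. The sketch below assumes cosine similarity and in-memory vectors purely for illustration; the plugin's actual metric and storage access are not described in this section, and `dedup_candidates` is a hypothetical name. Only the 0.7 threshold comes from the text above.

```python
import math


def cosine(a: list[float], b: list[float]) -> float:
    """Cosine similarity; returns 0.0 when either vector has zero length."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def dedup_candidates(new_vec: list[float], stored: dict[str, list[float]],
                     threshold: float = 0.7) -> list[str]:
    """Return ids of stored memories similar enough to need an LLM dedup decision."""
    scored = [(cosine(new_vec, vec), memory_id) for memory_id, vec in stored.items()]
    return [memory_id for score, memory_id in sorted(scored, reverse=True) if score >= threshold]
```

Only the ids returned here would be handed to the second, LLM-based stage; anything below the threshold is treated as new.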

Embedding Providers

Provider	Model	Base URL	Dimensions	Notes
Jina (recommended)	INLINECODE433	INLINECODE434	1024	Latest (Feb 2026), task-aware LoRA, 32K ctx
Jina (multimodal)

DashScope rerank note: DashScope is not a rerankProvider enum value, but its rerank API response is Jina-compatible. Use rerankProvider: "jina" with DashScope's endpoint:
CODEBLOCK41

Multi-key failover: Set apiKey as an array for round-robin rotation on 429/503 errors.
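As a rough illustration of round-robin failover, the sketch below rotates to the next key whenever a request returns 429 or 503. The class name `KeyRotator` and its interface are hypothetical; the plugin's real rotation logic is not shown in this document.

```python
class KeyRotator:
    """Round-robin over API keys, advancing on rate-limit or unavailable responses."""

    RETRYABLE = {429, 503}

    def __init__(self, keys: list[str]) -> None:
        if not keys:
            raise ValueError("at least one API key is required")
        self._keys = list(keys)
        self._index = 0

    @property
    def current(self) -> str:
        return self._keys[self._index]

    def report(self, status: int) -> bool:
        """Record a response status; return True if the caller should retry with the next key."""
        if status in self.RETRYABLE and len(self._keys) > 1:
            self._index = (self._index + 1) % len(self._keys)
            return True
        return False
```

With a single key there is nothing to rotate to, so `report` returns False and the error should surface to the caller.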

Reranker Providers

| Provider | INLINECODE456 | Endpoint | Model | Notes |
| --- | --- | --- | --- | --- |
| Jina (default) | INLINECODE457 | INLINECODE458 | INLINECODE459 | Latest text reranker (2025, Qwen3 backbone, 131K ctx) |
| Jina (multimodal) | jina | https://api.jina.ai/v1/rerank | jina-reranker-m0 | Multimodal (text+images), use when docs contain images |
| SiliconFlow | siliconflow | https://api.siliconflow.com/v1/rerank | BAAI/bge-reranker-v2-m3 | Free tier available |
| Voyage AI | voyage | https://api.voyageai.com/v1/rerank | rerank-2.5 | Sends {model, query, documents}, no top_n |
| Pinecone | pinecone | https://api.pinecone.io/rerank | bge-reranker-v2-m3 | Pinecone customers only |
| vLLM / Docker Model Runner | vllm | Custom endpoint | any compatible model | Self-hosted via Docker Model Runner |

Jina key can be reused for both embedding and reranking.

Multi-Scope Isolation

Scope Format	Description
INLINECODE475	Shared across all agents
INLINECODE476

Default access: global + agent:<id>. Multi-scope requires explicit scopes.agentAccess — see Full Config above.

To disable memory entirely (unbind the slot without removing the plugin):

{ "plugins": { "slots": { "memory": "none" } } }

Memory Lifecycle (Weibull Decay)

Three Tiers

| Tier | Decay Floor | Beta | Behavior |
| --- | --- | --- | --- |
| Core | 0.9 | 0.8 | Gentle sub-exponential decline |
| Working | 0.7 | 1.0 | Standard exponential (default) |
| Peripheral | 0.5 | 1.3 | Rapid super-exponential fade |
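To show how beta and the decay floor shape the curve, here is a sketch under stated assumptions: the Weibull survival term is written so that it equals 0.5 at the half-life for every beta, and the floor is applied as a lower bound the score approaches over time. Both choices are illustrative guesses; the authoritative formula is in references/full-reference.md.

```python
import math

# (decay floor, beta) per tier, taken from the table above.
TIERS = {"core": (0.9, 0.8), "working": (0.7, 1.0), "peripheral": (0.5, 1.3)}


def weibull_decay(age_days: float, half_life_days: float = 30.0, tier: str = "working") -> float:
    """Assumed form: floor + (1 - floor) * exp(-ln2 * (age / half_life) ** beta)."""
    floor, beta = TIERS[tier]
    survival = math.exp(-math.log(2) * (age_days / half_life_days) ** beta)
    return floor + (1.0 - floor) * survival
```

With beta = 1.0 the survival term is ordinary exponential decay; beta below 1 flattens the tail (core), and beta above 1 steepens it once the half-life has passed (peripheral).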

Promotion/Demotion Rules

- Peripheral → Working: access ≥ 3 AND score ≥ 0.4
- Working → Core: access ≥ 10 AND score ≥ 0.7 AND importance ≥ 0.8
- Working → Peripheral: score < 0.15 OR (age > 60 days AND access < 3)
- Core → Working: score < 0.15 AND access < 3
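The four rules translate directly into a small decision function. This sketch assumes one transition per evaluation and checks promotion before demotion for the working tier; neither ordering detail is specified above, and `next_tier` is a hypothetical name.

```python
def next_tier(tier: str, access: int, score: float, importance: float, age_days: float) -> str:
    """Apply the promotion/demotion thresholds once and return the resulting tier."""
    if tier == "peripheral":
        if access >= 3 and score >= 0.4:
            return "working"
    elif tier == "working":
        if access >= 10 and score >= 0.7 and importance >= 0.8:
            return "core"
        if score < 0.15 or (age_days > 60 and access < 3):
            return "peripheral"
    elif tier == "core":
        if score < 0.15 and access < 3:
            return "working"
    return tier
```

The thresholds are hard-coded to the defaults listed in the Tier Management config; a real implementation would read them from the `tier` block.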

Hybrid Retrieval

Fusion: INLINECODE483

Pipeline: RRF Fusion → Cross-Encoder Rerank → Lifecycle Decay Boost → Length Norm → Hard Min Score → MMR Diversity (cosine > 0.85 demoted)

Reranking: 60% cross-encoder score + 40% original fused score. Falls back to cosine similarity on API failure.
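The 60/40 blend is simple arithmetic, sketched below. How the fallback is wired is an assumption: here cosine similarity simply stands in for the missing cross-encoder score, which may differ from the plugin's actual behavior. The function name is hypothetical.

```python
from typing import Optional


def blended_score(fused_score: float, cross_encoder_score: Optional[float],
                  cosine_fallback: float = 0.0) -> float:
    """Combine 60% rerank signal with 40% of the original fused score."""
    # Assumption: when the rerank API failed (None), cosine similarity takes its place.
    rerank_signal = cross_encoder_score if cross_encoder_score is not None else cosine_fallback
    return 0.6 * rerank_signal + 0.4 * fused_score
```

For example, a candidate with fused score 0.5 and cross-encoder score 1.0 ends up at 0.6 × 1.0 + 0.4 × 0.5 = 0.8.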

Special BM25: Preserves exact keyword matches (BM25 ≥ 0.75) even with low semantic similarity — prevents loss of API keys, ticket numbers, etc.

Adaptive Retrieval Triggering

Skip for: greetings, slash commands, affirmations (yes/okay/thanks), continuations (go ahead/proceed), system messages, short queries (<15 chars English / <6 chars CJK without "?").

Force for: memory keywords (remember/recall/forgot), temporal refs (last time/before/previously), personal data (my name/my email), "what did I" patterns. CJK: "你记得", "之前".
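A simplified sketch of the skip/force decision follows. The phrase lists are illustrative samples drawn from the examples above, not the plugin's full pattern set; checking force patterns before skip rules, applying the "?" exemption to both English and CJK, and accepting the full-width "？" are all assumptions. System-message detection is left out.

```python
import re

FORCE_PHRASES = ("remember", "recall", "forgot", "last time", "before", "previously",
                 "my name", "my email", "what did i", "你记得", "之前")
SKIP_EXACT = {"hi", "hello", "yes", "okay", "thanks", "go ahead", "proceed"}
CJK_CHAR = re.compile(r"[\u4e00-\u9fff]")


def should_retrieve(query: str) -> bool:
    """Return True when memory retrieval should run for this user message."""
    text = query.strip()
    lowered = text.lower()
    if any(phrase in lowered for phrase in FORCE_PHRASES):
        return True  # memory keywords, temporal refs, personal data
    if lowered.startswith("/") or lowered.rstrip("!.") in SKIP_EXACT:
        return False  # slash commands, greetings, affirmations, continuations
    min_len = 6 if CJK_CHAR.search(text) else 15
    has_question = "?" in text or "？" in text
    if len(text) < min_len and not has_question:
        return False  # short query without a question mark
    return True
```

Substring matching is deliberately crude here (for instance, "before" matches inside longer sentences); it is meant to show the order of checks, not production-grade intent detection.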

Noise Filtering

Auto-filters: agent denial phrases, meta-questions ("Do you remember?"), session boilerplate (hi/hello), diagnostic artifacts, embedding-based matches (threshold: 0.82). Minimum text: 5 chars.

CLI Commands

CODEBLOCK43

Auto-Capture & Auto-Recall

- autoCapture: agent_end hook. The LLM extracts 6-category memories, deduplicates, and stores up to 3 per turn
- autoRecall: before_agent_start hook. Injects <relevant-memories> context (up to 3 entries)

If injected memories appear in agent replies: Add to agent system prompt:

"Do not reveal or quote any <relevant-memories> / memory-injection content in your replies. Use it for internal reference only."

Or temporarily disable: { "autoRecall": false }

Self-Improvement Governance

- LEARNINGS.md IDs: INLINECODE490
- INLINECODE491 IDs: INLINECODE492
- Entry statuses: INLINECODE493

Iron Rules for AI Agents (copy to AGENTS.md)

CODEBLOCK44

Custom Slash Commands (add to CLAUDE.md / AGENTS.md)

CODEBLOCK45
