diff --git a/README.md b/README.md index 29f068a..bc291c3 100644 --- a/README.md +++ b/README.md @@ -278,7 +278,37 @@ python3 clean_memories_tr.py --execute --limit 100 --- -### 6. Embedding Models +### 6. Configuration Options Reference + +**All configurable options with defaults:** + +| Option | Default | Description | +|--------|---------|-------------| +| **Embedding model** | `mxbai-embed-large` | Model for generating gem embeddings. `mxbai` = higher accuracy (MTEB 66.5). `snowflake` = faster processing. | +| **Timer interval** | `5` minutes | How often the curator runs. `5 min` = fast backlog clearing. `30 min` = balanced. `60 min` = minimal overhead. | +| **Batch size** | `100` | Max memories sent to curator per run. Higher = fewer API calls but more memory usage. | +| **Max gems per run** | *(unlimited)* | Hard limit on gems extracted per batch. Not set by default — extracts all found gems. | +| **Qdrant URL** | `http://10.0.0.40:6333` | Vector database endpoint. Change if Qdrant runs on different host/port. | +| **Ollama URL** | `http://10.0.0.10:11434` | LLM endpoint for gem extraction. Change if Ollama runs elsewhere. | +| **Curator LLM** | `qwen3:30b-a3b-instruct` | Model for extracting gems. `30b` = best quality (~3s). `4b` = faster but needs JSON fallback. | +| **User ID** | `rob` | Owner identifier for memories. Used for filtering and multi-user setups. | +| **Source collection** | `memories_tr` | Qdrant collection for raw captured memories. | +| **Target collection** | `gems_tr` | Qdrant collection for curated gems (injected into context). | +| **Watcher service** | `enabled` | Real-time capture daemon. Reads session JSONL and writes to Qdrant. | +| **Cron timer** | `enabled` | Periodic curation job. Runs `curator_timer.py` on schedule. | +| **Log path** | `/var/log/true-recall-timer.log` | Where curator output is written. Check with `tail -f`. | +| **Dry-run mode** | `disabled` | Test mode — shows what would be curated without writing to Qdrant. | + +**OpenClaw-side options:** +| Option | Default | Description | +|--------|---------|-------------| +| **Compactor mode** | `default` | How context is summarized. `default` = fast standard. `safeguard` = chunked for very long sessions. | +| **Memory flush** | `disabled` | If enabled, writes silent "memory" turn before compaction. Adds overhead — disabled for minimal lag. | +| **Context pruning** | `cache-ttl` | Removes old tool results from context. `cache-ttl` = prunes hourly. `off` = no pruning. | + +--- + +### 7. Embedding Models **Current Setup:** - `memories_tr`: `snowflake-arctic-embed2` (capture similarity)