doobidoo/mcp-memory-service v10.25.0 on GitHub

🙏 Special Thanks

This release is entirely the work of @chriscoey, who contributed 5 meticulously researched and well-tested PRs in a single day. Each one identified real bugs through careful code reading — not just surface-level fixes but root-cause analysis with regression tests proving the fix. Outstanding community contribution.

What's Changed

This release consolidates 5 high-quality PRs from @chriscoey that fix critical bugs in the SQLite-vec storage backend, improve security, and add an embedding migration utility.

🆕 Added

Embedding model migration script (scripts/maintenance/migrate_embeddings.py): Migrates embeddings between any two models, including models with different dimensions (e.g., 384-dim → 768-dim). Works with any OpenAI-compatible API (Ollama, vLLM, OpenAI, TEI). Features include a --dry-run mode, automatic dimension detection, timestamped backups, service detection, cross-platform support, batched processing with progress reporting, and post-migration integrity verification. Closes #552.

🐛 Fixed

Soft-delete leaks (data correctness):

recall() — both semantic and time-based paths returned deleted memories
get_memories_by_time_range() — returned deleted memories
get_largest_memories() — returned deleted memories
get_memory_timestamps() — counted deleted memories
get_memory_connections() — tag group counts included deleted memories
get_access_patterns() — returned content hashes of deleted memories
update_memory_metadata() — could modify soft-deleted memories
update_memories_batch() — same issue for batch update path
delete() error handler — added explicit rollback to prevent dangling embedding DELETEs
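
The common thread in these fixes is that every read and update path has to exclude tombstoned rows. A minimal sketch of that pattern follows. It assumes soft deletion is recorded in a nullable `deleted_at` column; the table layout and the helper names `get_live_hashes` and `update_content` are hypothetical and are not taken from the project's code.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
# Hypothetical schema: a NULL deleted_at means the memory is live.
conn.execute(
    "CREATE TABLE memories ("
    "content_hash TEXT PRIMARY KEY, content TEXT, deleted_at REAL)"
)
conn.executemany(
    "INSERT INTO memories VALUES (?, ?, ?)",
    [("h1", "live memory", None), ("h2", "tombstoned memory", 1700000000.0)],
)
conn.commit()


def get_live_hashes(conn):
    """Return hashes of memories that have not been soft-deleted."""
    rows = conn.execute(
        "SELECT content_hash FROM memories "
        "WHERE deleted_at IS NULL ORDER BY content_hash"
    ).fetchall()
    return [row[0] for row in rows]


def update_content(conn, content_hash, new_content):
    """Update a live memory only; return True if a row was changed."""
    try:
        cur = conn.execute(
            "UPDATE memories SET content = ? "
            "WHERE content_hash = ? AND deleted_at IS NULL",
            (new_content, content_hash),
        )
        conn.commit()
        return cur.rowcount == 1
    except sqlite3.Error:
        # Explicit rollback so a failed statement leaves no partial writes.
        conn.rollback()
        raise
```

With this filter in place, `get_live_hashes(conn)` returns only `h1`, and `update_content(conn, "h2", ...)` reports that nothing was changed.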

Score formula:

recall() computed the score as 1.0 - distance, but cosine distance ∈ [0, 2], so any distance above 1 produced a negative score. Fixed to max(0.0, 1.0 - distance/2.0), which maps [0, 2] onto [0, 1].
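
A small sketch of the corrected mapping, using the formula quoted above. The function name `cosine_distance_to_score` is illustrative only.

```python
def cosine_distance_to_score(distance: float) -> float:
    """Map a cosine distance in [0, 2] to a similarity score in [0, 1]."""
    return max(0.0, 1.0 - distance / 2.0)


def old_score(distance: float) -> float:
    """The previous formula, kept here only for comparison."""
    return 1.0 - distance
```

For a distance of 1.5, the old formula gives -0.5 while the corrected one gives 0.25.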

Tag handling:

get_largest_memories() used json.loads() to parse tags, but tags are stored as comma-separated strings
get_all_memories(), count_all_memories(), retrieve(), delete_by_timeframe(), delete_before_date() used LIKE '%tag%' (substring match) instead of GLOB exact-match. A tag query for "test" incorrectly matched "testing" and "my-test-tag".
Added _escape_glob() helper to prevent GLOB wildcard injection (*, ?, [) from user-supplied tag values.
search_by_tag_chronological() LIMIT/OFFSET is now parameterized instead of f-string interpolated.
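
The sketch below shows one way exact tag matching with GLOB can work on comma-separated tag strings. It illustrates the technique and is not the project's actual query. It assumes tags are stored without spaces around the commas, and the helper names `escape_glob` and `search_by_tag` are hypothetical. Wrapping each metacharacter in a character class (`[*]`, `[?]`, `[[]`) is the standard way to match it literally in SQLite GLOB.

```python
import sqlite3


def escape_glob(value: str) -> str:
    """Wrap GLOB metacharacters in a character class so they match literally."""
    return "".join(f"[{ch}]" if ch in "*?[" else ch for ch in value)


def search_by_tag(conn, tag, limit=10, offset=0):
    """Return hashes whose comma-separated tag list contains exactly `tag`."""
    # Surrounding both the column and the pattern with commas turns
    # "contains this tag" into a whole-token match.
    pattern = f"*,{escape_glob(tag)},*"
    rows = conn.execute(
        "SELECT content_hash FROM memories "
        "WHERE (',' || tags || ',') GLOB ? "
        "ORDER BY content_hash LIMIT ? OFFSET ?",  # bound, not interpolated
        (pattern, limit, offset),
    ).fetchall()
    return [row[0] for row in rows]


conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE memories (content_hash TEXT, tags TEXT)")
conn.executemany(
    "INSERT INTO memories VALUES (?, ?)",
    [
        ("h1", "test,alpha"),
        ("h2", "testing"),
        ("h3", "my-test-tag,beta"),
        ("h4", "te*"),
    ],
)
conn.commit()
```

Here a query for `test` matches only `h1`, and a query for the literal tag `te*` matches only `h4` instead of acting as a wildcard.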

Consolidation system:

_sample_memory_pairs() materialized all combinations(memories, 2) (~50M pairs for 10k memories) just to sample 100. Now uses random index pair generation — O(max_pairs).
_get_existing_associations() filtered by memory_type=="association" but associations are stored with memory_type="observation" and tag "association". The filter never matched, so duplicate associations were never prevented. Now uses search_by_tag(["association"]).
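
To illustrate the sampling change, here is a sketch of random index pair generation that never builds the full list of combinations. The function name `sample_index_pairs` and its signature are hypothetical. The rejection loop is cheap only when `max_pairs` is much smaller than the total number of pairs, which holds for the 100-of-50M case described above.

```python
import random
from itertools import combinations


def sample_index_pairs(n, max_pairs, rng=None):
    """Return up to max_pairs distinct (i, j) index pairs with i < j."""
    if rng is None:
        rng = random.Random()
    total = n * (n - 1) // 2
    if total <= max_pairs:
        # Small input: enumerating every pair is cheap and exact.
        return list(combinations(range(n), 2))
    pairs = set()
    while len(pairs) < max_pairs:
        # Draw two distinct indices, then normalise the order so that
        # (i, j) and (j, i) count as the same pair.
        i, j = rng.sample(range(n), 2)
        pairs.add((min(i, j), max(i, j)))
    return list(pairs)
```

The indices can then be used to look up the actual memories, for example `[(memories[i], memories[j]) for i, j in sample_index_pairs(len(memories), 100)]`.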

⚡ Performance

Batch access metadata: retrieve() now persists access metadata in one executemany call per query instead of N individual UPDATE+COMMIT round-trips.
Hybrid search O(n+m) dedup: retrieve_hybrid() replaced O(n×m) nested-loop deduplication with O(n+m) dict-based merging. BM25-only memories are now batch-fetched in a single SQL query (capped at 999 to respect SQLITE_MAX_VARIABLE_NUMBER) instead of N+1 individual get_by_hash() calls.
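
The three techniques named above (a batched `executemany`, dict-based merging, and a single `IN (...)` fetch) can be sketched as follows. All function names, column names, and the score layout are assumptions made for illustration; only the 999 cap comes from the notes above.

```python
import sqlite3

MAX_SQL_VARS = 999  # cap mentioned in the release notes


def merge_results(semantic, keyword):
    """Merge two lists of (content_hash, score) in O(n+m) via a dict."""
    merged = {}
    for content_hash, score in semantic:
        merged[content_hash] = {"semantic": score, "keyword": 0.0}
    for content_hash, score in keyword:
        entry = merged.setdefault(content_hash, {"semantic": 0.0, "keyword": 0.0})
        entry["keyword"] = score
    return merged


def fetch_by_hashes(conn, hashes):
    """Fetch many rows in one query instead of one query per hash."""
    hashes = list(hashes)[:MAX_SQL_VARS]
    if not hashes:
        return {}
    placeholders = ",".join("?" * len(hashes))  # only "?" marks are interpolated
    rows = conn.execute(
        f"SELECT content_hash, content FROM memories "
        f"WHERE content_hash IN ({placeholders})",
        hashes,
    ).fetchall()
    return dict(rows)


def record_access(conn, hashes, now):
    """Persist access metadata for all hits with one executemany and one commit."""
    conn.executemany(
        "UPDATE memories SET access_count = access_count + 1, last_accessed = ? "
        "WHERE content_hash = ?",
        [(now, content_hash) for content_hash in hashes],
    )
    conn.commit()


conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE memories (content_hash TEXT PRIMARY KEY, content TEXT, "
    "access_count INTEGER DEFAULT 0, last_accessed REAL)"
)
conn.executemany(
    "INSERT INTO memories (content_hash, content) VALUES (?, ?)",
    [("a", "alpha"), ("b", "beta"), ("c", "gamma")],
)
conn.commit()
```

A hash that appears in both result lists ends up as a single entry carrying both scores, so no nested comparison between the two lists is needed.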

🧪 Tests

23 new regression tests covering all fixed methods
Total: 1,420 tests

Contributors

@chriscoey — authored all 5 PRs (#556, #557, #558, #559, #560)
