v8.75.0 - Lightweight ONNX Quality Scoring

🎯 90% Installation Size Reduction

Major achievement: Installation shrinks from 7.7GB to 805MB while maintaining full quality scoring functionality.

Problem Solved: transformers package added 6.9GB of dependencies (torch, tensorflow, etc.)
Solution: Use tokenizers package directly instead of full transformers stack
Benefits:
- 90% disk space reduction: 7.7GB → 805MB
- Faster installation: <2 min (vs 10-15 min)
- Same quality scoring performance (nvidia-quality-classifier-deberta ONNX model)
- Conditional dependency loading - only install what you use
- No runtime performance impact

Problem: Update script failed on Linux systems without lsof
Root Cause: Script used lsof exclusively, health checks only tried HTTP
Solution:
- Port detection fallback chain: lsof → ss → netstat → ps
- Health checks support both HTTP and HTTPS with automatic fallback
- Tested on Arch Linux with ss-only environment

New Users (lightweight installation):

pip install mcp-memory-service
# Lightweight ONNX setup (805MB total)
bash scripts/setup-lightweight.sh

Existing Users (upgrade):

pip install --upgrade mcp-memory-service

Full Installation (with transformers for embeddings):

pip install mcp-memory-service[full]

See CHANGELOG.md for full version history.

🚀 Upgrade now for a dramatically leaner installation!