0.7.5
- Feat:
graphify extractnow runs incrementally — auto-detects priormanifest.jsonand re-extracts only changed/new files; semantic results cached by content hash so unchanged docs cost zero LLM tokens on repeat runs (#698) - Feat: Entity deduplication pipeline runs on every build — entropy gate + MinHash/LSH blocking + Jaro-Winkler verification + same-community boost collapses near-duplicate entities (typos, spacing, plurals) before clustering
- Feat:
--dedup-llmflag forgraphify extract— optional LLM tiebreaker for ambiguous entity pairs (~$0.01 for 10k-node graphs), off by default - Fix:
graphify hook installrebuild now preserves human-readable community labels from.graphify_labels.jsoninstead of resetting to generic "Community N" names on every commit (#705) - Fix:
graphify install --platform gemininow works correctly (#706) - Deps:
datasketchandrapidfuzzadded as base dependencies