Importance Notes
To prevent weaker LLMs from inadvertently incorporating few-shot examples from the prompt into the extraction results, the official entity-relation extraction prompt has been updated in the current version by removing the few-shot examples. Empirical evaluation on the Qwen3-30B model shows no degradation in extraction quality. However, users are advised to monitor performance carefully on their specific LLMs to ensure consistent results.
To modify the system's default behavior, provide entity types and example prompt texts via env var ENTITY_TYPE_PROMPT_FILE.
What's New
- feat(query): optionally send chunk heading path to the LLM by @danielaskdd in #3211
- ✨ feat(chunker): inject split-table headers at split time, freeze split slices by @danielaskdd in #3218
- feat(extract): inject section heading breadcrumb into extraction prompts by @danielaskdd in #3225
- Add query cancellation with Stop button by @danielaskdd in #3227
- ♻️ refactor: make extraction prompt examples format-only by @danielaskdd in #3231
What's Changed
- build(deps): update redis requirement from <8.0.0,>=5.0.0 to >=5.0.0,<9.0.0 by @dependabot[bot] in #3194
- 🔧 chore(mineru): default MINERU_LOCAL_IMAGE_ANALYSIS to false by @danielaskdd in #3208
- build(deps): bump the react group in /lightrag_webui with 3 updates by @dependabot[bot] in #3199
- build(deps): bump lucide-react from 1.16.0 to 1.17.0 in /lightrag_webui in the ui-components group by @dependabot[bot] in #3200
- build(deps-dev): bump the build-tools group in /lightrag_webui with 3 updates by @dependabot[bot] in #3201
- build(deps): bump katex from 0.16.47 to 0.17.0 in /lightrag_webui in the content-rendering group across 1 directory by @dependabot[bot] in #3202
- build(deps): bump the frontend-minor-patch group across 1 directory with 2 updates by @dependabot[bot] in #3203
- ♻️ refactor(config): rename MAX_ASYNC to MAX_ASYNC_LLM with alias fallback by @danielaskdd in #3209
- fix: honor Postgres search path in table checks by @he-yufeng in #3206
- fix(parser): surface accurate error for non-ZIP .docx files by @danielaskdd in #3210
- ✨ fix(chunker): glue body-less heading blocks into their deeper child (P strategy) by @danielaskdd in #3214
- ✨ fix(chunker): land parent-path gate + bridge overlap for P strategy by @danielaskdd in #3215
- fix(mineru): preserve HTML table content by @danielaskdd in #3219
- ✨ feat(multimodal): declare table content format (html/json) in analysis prompt by @danielaskdd in #3221
- Preserve HTML table header spans in split tables by @danielaskdd in #3222
- feat(sidecar): add parent_headings to tables/drawings/equations sidecars by @danielaskdd in #3223
- fix(extract): drop "in section unknown" noise from multimodal relation descriptions by @danielaskdd in #3224
- feat(query): cap parent-heading length to match extraction breadcrumb by @danielaskdd in #3226
- Rename RetrievalTesting to RetrievalView by @danielaskdd in #3229
- fix(webui): prevent double-click on Send from aborting the query by @danielaskdd in #3230
Full Changelog: v1.5.0...v1.5.1