Fu-Jie/openwebui-extensions async-context-compression-v1.4.0 on GitHub

Overview

This release focuses on improving the structural integrity of chat history when using function-calling models and enhancing task reliability through concurrent task management. It introduces "Atomic Message Grouping" to prevent chat context corruption and a session-based locking mechanism to ensure stable background operations.

📖 README

New Features

Atomic Message Grouping: A new structure-aware logic that identifies and groups assistant-tool-tool-assistant calling sequences. This ensures that tool results are never orphaned from their calls during compression.
Tail Boundary Alignment: Automatically corrects truncation indices to ensure the recent context "tail" starts at a valid message boundary, preventing partial tool-calling sequences from being sent to the LLM.
Chat Session Locking: Implements a per-chat-id asynchronous lock to prevent multiple summary tasks from running concurrently for the same session, reducing redundant LLM calls and race conditions.
Metadata Traceability: Summarization inputs now include message IDs, participant names, and key metadata labels, allowing the summary model to maintain better traceability in its output.

Bug Fixes

Fixed "No tool call found" Errors: By enforcing atomic grouping, the filter no longer truncates the context in a way that separates tool calls from their results.
Improved Progress Calculation: Fixed an issue where summarizing messages would cause the progress tracking to drift due to shifting list indices.
Prevented Duplicate Summary Tasks: The new locking mechanism ensures that only one background summary process is active per session.

Related Issues

#56: Tool-Calling context corruption and concurrent summary tasks.

Fu-Jie/openwebui-extensions async-context-compression-v1.4.0 Async Context Compression v1.4.0 on GitHub

Overview

New Features

Bug Fixes

Related Issues

Fu-Jie/openwebui-extensions async-context-compression-v1.4.0
Async Context Compression v1.4.0

on GitHub