What's new
This is the first release since 0.4.0 with significant new features! We look forward to hearing feedback and suggestions from the community.
Chat completion model cache
One of the big missing features from 0.2 was the ability to seamlessly cache model client completions. This release adds ChatCompletionCache
which can wrap any other ChatCompletionClient
and cache completions.
There is a CacheStore
interface to allow for easy implementation of new caching backends. The currently available implementations are:
import asyncio
import tempfile
from autogen_core.models import UserMessage
from autogen_ext.models.openai import OpenAIChatCompletionClient
from autogen_ext.models.cache import ChatCompletionCache, CHAT_CACHE_VALUE_TYPE
from autogen_ext.cache_store.diskcache import DiskCacheStore
from diskcache import Cache
async def main():
with tempfile.TemporaryDirectory() as tmpdirname:
openai_model_client = OpenAIChatCompletionClient(model="gpt-4o")
cache_store = DiskCacheStore[CHAT_CACHE_VALUE_TYPE](Cache(tmpdirname))
cache_client = ChatCompletionCache(openai_model_client, cache_store)
response = await cache_client.create([UserMessage(content="Hello, how are you?", source="user")])
print(response) # Should print response from OpenAI
response = await cache_client.create([UserMessage(content="Hello, how are you?", source="user")])
print(response) # Should print cached response
asyncio.run(main())
ChatCompletionCache
is not yet supported by the declarative component config, see the issue to track progress.
GraphRAG
This releases adds support for GraphRAG as a tool agents can call. You can find a sample for how to use this integration here, and docs for LocalSearchTool
and GlobalSearchTool
.
import asyncio
from autogen_ext.models.openai import OpenAIChatCompletionClient
from autogen_agentchat.ui import Console
from autogen_ext.tools.graphrag import GlobalSearchTool
from autogen_agentchat.agents import AssistantAgent
async def main():
# Initialize the OpenAI client
openai_client = OpenAIChatCompletionClient(
model="gpt-4o-mini",
)
# Set up global search tool
global_tool = GlobalSearchTool.from_settings(settings_path="./settings.yaml")
# Create assistant agent with the global search tool
assistant_agent = AssistantAgent(
name="search_assistant",
tools=[global_tool],
model_client=openai_client,
system_message=(
"You are a tool selector AI assistant using the GraphRAG framework. "
"Your primary task is to determine the appropriate search tool to call based on the user's query. "
"For broader, abstract questions requiring a comprehensive understanding of the dataset, call the 'global_search' function."
),
)
# Run a sample query
query = "What is the overall sentiment of the community reports?"
await Console(assistant_agent.run_stream(task=query))
if __name__ == "__main__":
asyncio.run(main())
#4612 by @lspinheiro
Semantic Kernel model adapter
Semantic Kernel has an extensive collection of AI connectors. In this release we added support to adapt a Semantic Kernel AI Connector to an AutoGen ChatCompletionClient using the SKChatCompletionAdapter
.
Currently this requires passing the kernel during create, and so cannot be used with AssistantAgent
directly yet. This will be fixed in a future release (#5144).
#4851 by @lspinheiro
AutoGen to Semantic Kernel tool adapter
We also added a tool adapter, but this time to allow AutoGen tools to be added to a Kernel, called KernelFunctionFromTool
.
#4851 by @lspinheiro
Jupyter Code Executor
This release also brings forward Jupyter code executor functionality that we had in 0.2, as the JupyterCodeExecutor
.
Please note that this currently on supports local execution and should be used with caution.
Memory
It's still early on but we merged the interface for agent memory in this release. This allows agents to enrich their context from a memory store and save information to it. The interface is defined in core and AssistantAgent in agentchat accepts memory as a parameter now. There is an initial example memory implementation which simply injects all memories as system messages for the agent. The intention is for the memory interface to be able to be used for both RAG and agent memory systems going forward.
- Tutorial
- Core
Memory
interface - Existing
AssistantAgent
with new memory parameter
#4438 by @victordibia, #5053 by @ekzhu
Declarative config
We're continuing to expand support for declarative configs throughout the framework. In this release, we've added support for termination conditions and base chat agents. Once we're done with this, you'll be able to configure and entire team of agents with a single config file and have it work seamlessly with AutoGen studio. Stay tuned!
#4984, #5055 by @victordibia
Other
- Add sources field to TextMentionTermination by @Leon0402 in #5106
- Update gpt-4o model version to 2024-08-06 by @ekzhu in #5117
Bug fixes
- Retry multiple times when M1 selects an invalid agent. Make agent sel… by @afourney in #5079
- fix: normalize finish reason in CreateResult response by @ekzhu in #5085
- Pass context between AssistantAgent for handoffs by @ekzhu in #5084
- fix: ensure proper handling of structured output in OpenAI client and improve test coverage for structured output by @ekzhu in #5116
- fix: use tool_calls field to detect tool calls in OpenAI client; add integration tests for OpenAI and Gemini by @ekzhu in #5122
Other changes
- Update website for 0.4.1 by @jackgerrits in #5031
- PoC AGS dev container by @JohanForngren in #5026
- Update studio dep by @ekzhu in #5062
- Update studio dep to use version bound by @ekzhu in #5063
- Update gpt-4o model version and add new model details by @keenranger in #5056
- Improve AGS Documentation by @victordibia in #5065
- Pin uv to 0.5.18 by @jackgerrits in #5067
- Update version to 0.4.3 pre-emptively by @jackgerrits in #5066
- fix: dotnet azure pipeline (uv sync installation) by @bassmang in #5042
- docs: .NET Documentation by @lokitoth in #5039
- [Documentation] Update tools.ipynb: use system messages in the tool_agent_caller_loop session by @zysoong in #5068
- docs: enhance agents.ipynb with parallel tool calls section by @ekzhu in #5088
- Use caching to run tests and report coverage by @lspinheiro in #5086
- fix: ESPR dotnet code signing by @bassmang in #5081
- Update AGS pyproject.toml by @victordibia in #5101
- docs: update AssistantAgent documentation with a new figure, attention and warning notes by @ekzhu in #5099
- Rysweet fix integration tests and xlang by @rysweet in #5107
- docs: enhance Swarm user guide with notes on tool calling by @ekzhu in #5103
- fix a small typo by @marinator86 in #5120
New Contributors
- @lokitoth made their first contribution in #5060
- @keenranger made their first contribution in #5056
- @zysoong made their first contribution in #5068
- @marinator86 made their first contribution in #5120
Full Changelog: v0.4.1...v0.4.3