Release Notes - v0.5.4.dev3
Release Date: 2026-04-09
Changes: v0.5.8rc1 → v0.5.4.dev3-baseten
Summary
This release (0.5.4.dev3) fixes a runtime crash that occurred when reading model token limits from LiteLLM/Baseten model metadata, and improves the related debug logging. It also bumps the package development version. There are no user-facing API changes. Upgrading is recommended if you encountered crashes related to model token metadata.
Highlights
- Fix for Baseten/LiteLLM integration that prevents crashes when model metadata is missing token limits.
- Improved debug logging in the LLM utilities to make diagnosis easier.
- Development version bumped to 0.5.4.dev3.
Improvements
- get_model_max_completion_tokens now safely handles models that do not include a "max_tokens" entry in LiteLLM's model_cost, avoiding KeyError crashes.
- Added clearer debug logs for two cases: the model entry is present but has no max_tokens key, and the model is not found at all. This makes troubleshooting easier when using external model providers such as Baseten.
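The guard described above can be sketched roughly as follows. This is an illustrative sketch, not the actual code in cognee/infrastructure/llm/utils.py: the function name, the MODEL_COST stand-in dictionary, the log messages, and the choice to return None when no limit is available are all assumptions made for the example.

```python
import logging

logger = logging.getLogger(__name__)

# Hypothetical stand-in for LiteLLM's model_cost mapping
# (model name -> metadata dict). The entries are invented for illustration.
MODEL_COST = {
    "example/with-limit": {"max_tokens": 4096},
    "example/without-limit": {"input_cost_per_token": 0.0},
}


def get_max_completion_tokens(model_name, model_cost=MODEL_COST):
    """Return the model's max_tokens value, or None when it is unknown.

    Returning None is an assumption of this sketch; the release notes only
    state that the lookup no longer raises KeyError.
    """
    if model_name not in model_cost:
        # Case 1: the model is not listed at all.
        logger.debug("Model %s not found in model_cost.", model_name)
        return None

    entry = model_cost[model_name]
    if "max_tokens" not in entry:
        # Case 2: the model is listed, but its metadata has no token limit.
        logger.debug("Model %s has no max_tokens entry.", model_name)
        return None

    return entry["max_tokens"]
```

With this shape, callers can treat a None result as "limit unknown" and fall back to their own default instead of handling an exception.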
Bug Fixes
- Fixed a KeyError that occurred with Baseten models by guarding access to the "max_tokens" field in LiteLLM's model_cost. Models whose metadata does not include token limit information no longer cause a crash.
Technical Changes
- Small code change in cognee/infrastructure/llm/utils.py to check for the presence of the max_tokens key before reading it.
- Version bump in pyproject.toml and uv.lock from 0.5.4.dev1 to 0.5.4.dev3.
- No public API changes. Internal robustness and logging improvements only.
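To confirm which version is installed after upgrading, a check along these lines may help. It is a minimal sketch that assumes the distribution is published under the name "cognee" (inferred from the module path above, not stated in these notes); installed_version is a hypothetical helper, not part of the package.

```python
from importlib.metadata import PackageNotFoundError, version


def installed_version(dist_name):
    """Return the installed version string, or None if it is not installed."""
    try:
        return version(dist_name)
    except PackageNotFoundError:
        return None


# "cognee" as the distribution name is an assumption of this sketch.
print(installed_version("cognee"))
```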
Generated by Cognee Release Notes Generator on 2026-04-09