What's Changed
- [Redis] - Add ability to add client through GCP IAM Auth by @jugaldb in #13275
- [LLM Translation + Coding tools] Added litellm claude code count tokens support by @jugaldb in #13261
- [LLM Translation] claude opus 4.1 support for anthropic provider by @jugaldb in #13296
- [New model] add bedrock/us.anthropic.claude-opus-4-1-20250805-v1:0 by @ishaan-jaff in #13295
- New models - add fireworks_ai/glm-4p5 model family by @ishaan-jaff in #13297
- [Feat] Add fireworks gpt-oss models by @ishaan-jaff in #13303
- Create New Key - Make Team Field Required for Service Account by @NANDINI-star in #13302
- [LLM Translation] Fix model group on clientside auth with API calls by @jugaldb in #13293
- [Redis IAM] Change documentation by @jugaldb in #13306
- [Proxy server] Add apscheduler log suppress by @jugaldb in #13299
- Fix double slash issue in SSO login URL construction by @NANDINI-star in #13289
- [MCP Gateway] refactor mcp guardrails by @jugaldb in #13238
- [Bug]: Fix Mimetype Resolution Error in Bedrock Document Understanding by @ishaan-jaff in #13309
- [Feat] - When using custom tags on prometheus allow using wildcard patterns by @ishaan-jaff in #13316
- [Bug Fix] [Bug]: New Databricks Foundation Models databricks-gpt-oss-20b and databricks-gpt-oss-120b failed with error: litellm.APIConnectionError: 'signature' by @ishaan-jaff in #13318
- [LLM Translation] Fix model group on clientside auth with API calls by @jugaldb in #13314
- fix(streaming_handler.py): include cost in streaming usage object by @krrishdholakia in #13319
- feat(JinaAI): support multimodal embedding models by @zjx20 in #13181
- Exclude none fields on
/chat/completion
- fixes n8n bug + Allow calling/v1/models
when end user over budget by @krrishdholakia in #13320 - feat: Add logo customization for LiteLLM admin UI by @colesmcintosh in #12958
- [Feat] - New model - Add Bedrock gpt oss models -
"openai.gpt-oss-20b-1:0"
,"openai.gpt-oss-120b-1:0"
by @ishaan-jaff in #13342 - [MCP Gateway] fix auth on ui for bearer servers by @jugaldb in #13312
- Fix create, search vector store error by @yeahyung in #13285
- [Fix migration for MCP server name and alias] added new migration files by @jugaldb in #13345
- [MCP Gateway] Added route check for internal users by @jugaldb in #13350
- [UI]added token breakdown in ui by @jugaldb in #13357
- Feat - New models add groq/openai/gpt-oss by @ishaan-jaff in #13363
- Provider logos on usage page by @NANDINI-star in #13372
- feat(integrations): allow setting of braintrust callback base url by @edwarddamato in #13368
- fix: 12152 - Redacted sensitive information logged in guardrails by @AnandKhinvasara in #13356
- Add labels to migrations job template by @unique-jakub in #13343
- Update OCI docs by @breno-aumo in #13336
- [Feat] Responses API Session Handling - Multi media support by @ishaan-jaff in #13347
- Add GPT 5 models by @lowjiansheng in #13377
- feat - add
claude-opus-4-1
on cost map by @ishaan-jaff in #13384 - feat: Add GPT-5 model family with official OpenAI specifications (#13… by @ishaan-jaff in #13386
- [Feat] add azure/gpt-5 model family by @ishaan-jaff in #13385
- Revert "Fix SSO Logout | Create Unified Login Page with SSO and Usern… by @ishaan-jaff in #13387
- Fix non-root docker image for migration by @1995parham in #13379
- [Bug Fix] Mistral Tool Calling - Grammar error: at 3(11): failed to compile JSON schema by @ishaan-jaff in #13389
- [Bug Fix] OpenAI gpt-5 series does not support "max_tokens" parameter and
temperature
values that are not = 1 by @ishaan-jaff in #13390 - Add presidio MCP pre call docs by @jugaldb in #13392
New Contributors
- @zjx20 made their first contribution in #13181
- @edwarddamato made their first contribution in #13368
Full Changelog: v1.75.0-nightly...v1.75.2-nightly
Docker Run LiteLLM Proxy
docker run \
-e STORE_MODEL_IN_DB=True \
-p 4000:4000 \
ghcr.io/berriai/litellm:main-v1.75.2-nightly
Don't want to maintain your internal proxy? get in touch 🎉
Hosted Proxy Alpha: https://calendly.com/d/4mp-gd3-k5k/litellm-1-1-onboarding-chat
Load Test LiteLLM Proxy Results
Name | Status | Median Response Time (ms) | Average Response Time (ms) | Requests/s | Failures/s | Request Count | Failure Count | Min Response Time (ms) | Max Response Time (ms) |
---|---|---|---|---|---|---|---|---|---|
/chat/completions | Passed ✅ | 140.0 | 181.73824943076167 | 6.321748843124574 | 0.0 | 1892 | 0 | 112.23509100000228 | 1448.496696999996 |
Aggregated | Passed ✅ | 140.0 | 181.73824943076167 | 6.321748843124574 | 0.0 | 1892 | 0 | 112.23509100000228 | 1448.496696999996 |