What's Changed
- Correct GPT-5 token limits and price by @emerzon in #13423
- fix(access group): allow access group on mcp tool retrieval by @thiagosalvatore in #13425
- fix(proxy): add missing braintrust api base to env vars by @edwarddamato in #13412
- [Feat] Add
reasoning_effort
to OpenAIGPT5Config by @ishaan-jaff in #13434 - [User Delete from team] fix user membership issue by @jugaldb in #13433
- Display Error from Backend on the UI - Notification by @NANDINI-star in #13427
- [Bug fix] - Error creating standard logging object - can't register atexit after shutdownLitellm fixes standard logging payload by @ishaan-jaff in #13436
- LLM Translation - fix prices for oai gpt 5 by @jugaldb in #13441
- Add support for reasoning_effort minimal by @lowjiansheng in #13401
- [Bug Fix] Improve error message from - key creation permission error message by @ishaan-jaff in #13443
- [Docs] Add docs on how router / cooldowns work by @ishaan-jaff in #13444
- LiteLLM UI - Test Key Page - allow uploading images for /chat/completions and /responses by @ishaan-jaff in #13445
- Disable logging settings for non-enterprise users - Create Key by @NANDINI-star in #13431
- feat(reasoning): support 'minimal' effort type for OpenAI by @colesmcintosh in #13447
- [Feat] Working e2e flow for Responses API session management with media by @ishaan-jaff in #13456
- [Documentation ]added mcp guardrails doc in mcp.md by @jugaldb in #13452
Full Changelog: v1.75.3-nightly...v1.75.4-nightly
Docker Run LiteLLM Proxy
docker run \
-e STORE_MODEL_IN_DB=True \
-p 4000:4000 \
ghcr.io/berriai/litellm:main-v1.75.4-nightly
Don't want to maintain your internal proxy? get in touch 🎉
Hosted Proxy Alpha: https://calendly.com/d/4mp-gd3-k5k/litellm-1-1-onboarding-chat
Docker Run LiteLLM Proxy
docker run \
-e STORE_MODEL_IN_DB=True \
-p 4000:4000 \
ghcr.io/berriai/litellm:main-v1.75.4-nightly
Don't want to maintain your internal proxy? get in touch 🎉
Hosted Proxy Alpha: https://calendly.com/d/4mp-gd3-k5k/litellm-1-1-onboarding-chat
Load Test LiteLLM Proxy Results
Name | Status | Median Response Time (ms) | Average Response Time (ms) | Requests/s | Failures/s | Request Count | Failure Count | Min Response Time (ms) | Max Response Time (ms) |
---|---|---|---|---|---|---|---|---|---|
/chat/completions | Passed ✅ | 110.0 | 148.44770442887227 | 6.362853189864178 | 0.0 | 1905 | 0 | 81.61193400002276 | 1085.9850920000156 |
Aggregated | Passed ✅ | 110.0 | 148.44770442887227 | 6.362853189864178 | 0.0 | 1905 | 0 | 81.61193400002276 | 1085.9850920000156 |