What's Changed
- [Proxy] LiteLLM mock test fix by @jugaldb in #13635
- [Proxy] Litellm add DB metrics to prometheus by @jugaldb in #13626
- [LLM Translation] Fix Realtime API endpoint for no intent by @jugaldb in #13476
- [MCP Gateway] LiteLLM Fix MCP gateway key auth by @jugaldb in #13630
- [Fix] Ensure /messages works when using `bedrock/converse/ with LiteLLM by @ishaan-jaff in #13627
- UI - Fix image overflow in LiteLLM model by @ishaan-jaff in #13639
- [Bug Fix] /messages endpoint - ensure tool use arguments are returned for non-anthropic models by @ishaan-jaff in #13638
Full Changelog: v1.75.6-nightly...v1.75.7-nightly
Docker Run LiteLLM Proxy
docker run \
-e STORE_MODEL_IN_DB=True \
-p 4000:4000 \
ghcr.io/berriai/litellm:main-v1.75.7-nightly
Don't want to maintain your internal proxy? get in touch 🎉
Hosted Proxy Alpha: https://calendly.com/d/4mp-gd3-k5k/litellm-1-1-onboarding-chat
Load Test LiteLLM Proxy Results
Name | Status | Median Response Time (ms) | Average Response Time (ms) | Requests/s | Failures/s | Request Count | Failure Count | Min Response Time (ms) | Max Response Time (ms) |
---|---|---|---|---|---|---|---|---|---|
/chat/completions | Passed ✅ | 140.0 | 182.49004836898376 | 6.247441314306712 | 0.0 | 1870 | 0 | 114.26430999995318 | 2038.3160259999613 |
Aggregated | Passed ✅ | 140.0 | 182.49004836898376 | 6.247441314306712 | 0.0 | 1870 | 0 | 114.26430999995318 | 2038.3160259999613 |