What's Changed
- feat(proxy_server.py): return litellm version in response headers by @krrishdholakia in #3535
- [Fix]
litellm.completion_cost(model="bedrock/anthropic.claude-instant-v1"..)
by @ishaan-jaff in #3534 - [UI] show
End-User
Usage on Usage Tab by @ishaan-jaff in #3530 - Add support for async streaming to watsonx provider by @simonsanvil in #3479
- feat(proxy_server.py): add CRUD endpoints for 'end_user' management by @krrishdholakia in #3536
- Revert "Add support for async streaming to watsonx provider " by @krrishdholakia in #3546
- [Feat] support
stream_options
param for OpenAI by @ishaan-jaff in #3537 - Fix/error on get user role by @powerhouseofthecell in #3551
- Globally filtering pydantic conflict warnings by @CyanideByte in #3555
- [Feat] support
stream_options
onlitellm.text_completion
by @ishaan-jaff in #3547 - feat(predibase.py): add support for predibase provider by @krrishdholakia in #3552
- Expand access for other jwt algorithms by @duckboy81 in #3378
New Contributors
- @powerhouseofthecell made their first contribution in #3551
- @duckboy81 made their first contribution in #3378
Full Changelog: v1.37.0.dev2_completion_cost...v1.37.2
Docker Run LiteLLM Proxy
docker run \
-e STORE_MODEL_IN_DB=True \
-p 4000:4000 \
ghcr.io/berriai/litellm:main-v1.37.2
Don't want to maintain your internal proxy? get in touch 🎉
Hosted Proxy Alpha: https://calendly.com/d/4mp-gd3-k5k/litellm-1-1-onboarding-chat
Load Test LiteLLM Proxy Results
Name | Status | Median Response Time (ms) | Average Response Time (ms) | Requests/s | Failures/s | Request Count | Failure Count | Min Response Time (ms) | Max Response Time (ms) |
---|---|---|---|---|---|---|---|---|---|
/chat/completions | Failed ❌ | 24 | 28.59593037362605 | 1.5197959088318929 | 1.5197959088318929 | 455 | 455 | 22.671621000029063 | 184.80915000003506 |
/health/liveliness | Failed ❌ | 23 | 27.673046850246536 | 15.568722485858137 | 15.568722485858137 | 4661 | 4661 | 21.451024999976198 | 1771.8764150000084 |
/health/readiness | Failed ❌ | 23 | 28.361425038412307 | 15.652227755574176 | 15.652227755574176 | 4686 | 4686 | 21.433796999986043 | 1998.6570389999656 |
Aggregated | Failed ❌ | 23 | 28.044976272087183 | 32.74074615026421 | 32.74074615026421 | 9802 | 9802 | 21.433796999986043 | 1998.6570389999656 |