What's Changed
- feat: hash prompt when caching by @prd-tuong-nguyen in #5105
- feat: set max_internal_budget for user w/ sso by @krrishdholakia in #5120
- Litellm sso team member add by @krrishdholakia in #5129
- [Feat] Add pricing for cohere embedding models by @ishaan-jaff in #5137
- [Feat] v2 prometheus deployment outage, healthy, partial outage alerting by @ishaan-jaff in #5134
- ui allow adding cohere models by @ishaan-jaff in #5136
New Contributors
- @prd-tuong-nguyen made their first contribution in #5105
Full Changelog: v1.43.4...v1.43.4.dev5
Docker Run LiteLLM Proxy
docker run \
-e STORE_MODEL_IN_DB=True \
-p 4000:4000 \
ghcr.io/berriai/litellm:main-v1.43.4.dev5
Don't want to maintain your internal proxy? get in touch 🎉
Hosted Proxy Alpha: https://calendly.com/d/4mp-gd3-k5k/litellm-1-1-onboarding-chat
Load Test LiteLLM Proxy Results
Name | Status | Median Response Time (ms) | Average Response Time (ms) | Requests/s | Failures/s | Request Count | Failure Count | Min Response Time (ms) | Max Response Time (ms) |
---|---|---|---|---|---|---|---|---|---|
/chat/completions | Passed ✅ | 130.0 | 155.9265892127544 | 6.39414481763279 | 0.0 | 1913 | 0 | 104.33059599995431 | 2708.094066000001 |
Aggregated | Passed ✅ | 130.0 | 155.9265892127544 | 6.39414481763279 | 0.0 | 1913 | 0 | 104.33059599995431 | 2708.094066000001 |