What's Changed
- Add cost tracking for rerank via bedrock + jina ai by @krrishdholakia in #8691
- add bedrock llama vision support + cohere / infinity rerank - 'return_documents' support by @krrishdholakia in #8684
- Correct spelling in user_management_heirarchy.md by @oaustegard in #8716
- (Feat) - UI, Allow sorting models by Created_At and all other columns on the UI by @ishaan-jaff in #8725
- (UI) Edit Model flow improvements by @ishaan-jaff in #8729
- Support arize phoenix on litellm proxy (#7756) by @krrishdholakia in #8715
- fix(amazon_deepseek_transformation.py): remove from stream o… by @krrishdholakia in #8717
- Add cohere v2/rerank support (#8421) by @krrishdholakia in #8605
- fix(proxy/_types.py): fixes issue where internal user able to escalat… by @krrishdholakia in #8740
New Contributors
- @oaustegard made their first contribution in #8716
Full Changelog: v1.61.13-nightly...v1.61.15-nightly
## Docker Run LiteLLM Proxy
```
docker run \
-e STORE_MODEL_IN_DB=True \
-p 4000:4000 \
ghcr.io/berriai/litellm:main-v1.61.15-nightly
Don't want to maintain your internal proxy? get in touch 🎉
Hosted Proxy Alpha: https://calendly.com/d/4mp-gd3-k5k/litellm-1-1-onboarding-chat
Load Test LiteLLM Proxy Results
Name | Status | Median Response Time (ms) | Average Response Time (ms) | Requests/s | Failures/s | Request Count | Failure Count | Min Response Time (ms) | Max Response Time (ms) |
---|---|---|---|---|---|---|---|---|---|
/chat/completions | Failed ❌ | 140.0 | 149.21232461729608 | 6.452882828983183 | 6.452882828983183 | 1931 | 1931 | 114.22628599996187 | 662.278525000005 |
Aggregated | Failed ❌ | 140.0 | 149.21232461729608 | 6.452882828983183 | 6.452882828983183 | 1931 | 1931 | 114.22628599996187 | 662.278525000005 |