What's Changed
- fix(types/utils.py): Support deepseek prompt caching by @krrishdholakia in #5019
- build(ui): allow admin_viewer to view teams tab by @krrishdholakia in #5027
- [Fix] Whisper Caching - Use correct cache keys for checking request in cache by @ishaan-jaff in #5026
- fix(utils.py): Fix adding azure models on ui by @krrishdholakia in #5029
- Allow Bedrock to set custom STS endpoint for OIDC flow by @Manouchehri in #4982
- [Feat] Add support for Vertex AI fine tuning endpoints by @ishaan-jaff in #5028
- [Feat] Add support for Vertex AI Fine tuning on LiteLLM Proxy by @ishaan-jaff in #5030
- [Feat] Vertex AI fine tuning - support translating hyperparameters by @ishaan-jaff in #5034
- Docs - Add example of Vertex AI fine tuning API by @ishaan-jaff in #5035
- [Feat] support all native vertex ai endpoints - Gemini API, Embeddings API, Imagen API, Batch prediction API, Tuning API, CountTokens API by @ishaan-jaff in #5037
Full Changelog: v1.42.11...v1.42.12
Docker Run LiteLLM Proxy
docker run \
-e STORE_MODEL_IN_DB=True \
-p 4000:4000 \
ghcr.io/berriai/litellm:main-v1.42.12
Don't want to maintain your internal proxy? get in touch 🎉
Hosted Proxy Alpha: https://calendly.com/d/4mp-gd3-k5k/litellm-1-1-onboarding-chat
Docker Run LiteLLM Proxy
docker run \
-e STORE_MODEL_IN_DB=True \
-p 4000:4000 \
ghcr.io/berriai/litellm:main-v1.42.12
Don't want to maintain your internal proxy? get in touch 🎉
Hosted Proxy Alpha: https://calendly.com/d/4mp-gd3-k5k/litellm-1-1-onboarding-chat
Load Test LiteLLM Proxy Results
Name | Status | Median Response Time (ms) | Average Response Time (ms) | Requests/s | Failures/s | Request Count | Failure Count | Min Response Time (ms) | Max Response Time (ms) |
---|---|---|---|---|---|---|---|---|---|
/chat/completions | Passed ✅ | 120.0 | 147.37461002147643 | 6.379597709990025 | 0.0033418531744316532 | 1909 | 1 | 27.773200999973824 | 2754.675483000028 |
Aggregated | Passed ✅ | 120.0 | 147.37461002147643 | 6.379597709990025 | 0.0033418531744316532 | 1909 | 1 | 27.773200999973824 | 2754.675483000028 |