github BerriAI/litellm v1.42.12

latest releases: v1.52.0-stable, v1.52.2-dev1, v1.52.3...
3 months ago

What's Changed

  • fix(types/utils.py): Support deepseek prompt caching by @krrishdholakia in #5019
  • build(ui): allow admin_viewer to view teams tab by @krrishdholakia in #5027
  • [Fix] Whisper Caching - Use correct cache keys for checking request in cache by @ishaan-jaff in #5026
  • fix(utils.py): Fix adding azure models on ui by @krrishdholakia in #5029
  • Allow Bedrock to set custom STS endpoint for OIDC flow by @Manouchehri in #4982
  • [Feat] Add support for Vertex AI fine tuning endpoints by @ishaan-jaff in #5028
  • [Feat] Add support for Vertex AI Fine tuning on LiteLLM Proxy by @ishaan-jaff in #5030
  • [Feat] Vertex AI fine tuning - support translating hyperparameters by @ishaan-jaff in #5034
  • Docs - Add example of Vertex AI fine tuning API by @ishaan-jaff in #5035
  • [Feat] support all native vertex ai endpoints - Gemini API, Embeddings API, Imagen API, Batch prediction API, Tuning API, CountTokens API by @ishaan-jaff in #5037

Full Changelog: v1.42.11...v1.42.12

Docker Run LiteLLM Proxy

docker run \
-e STORE_MODEL_IN_DB=True \
-p 4000:4000 \
ghcr.io/berriai/litellm:main-v1.42.12

Don't want to maintain your internal proxy? get in touch 🎉

Hosted Proxy Alpha: https://calendly.com/d/4mp-gd3-k5k/litellm-1-1-onboarding-chat

Docker Run LiteLLM Proxy

docker run \
-e STORE_MODEL_IN_DB=True \
-p 4000:4000 \
ghcr.io/berriai/litellm:main-v1.42.12

Don't want to maintain your internal proxy? get in touch 🎉

Hosted Proxy Alpha: https://calendly.com/d/4mp-gd3-k5k/litellm-1-1-onboarding-chat

Load Test LiteLLM Proxy Results

Name Status Median Response Time (ms) Average Response Time (ms) Requests/s Failures/s Request Count Failure Count Min Response Time (ms) Max Response Time (ms)
/chat/completions Passed ✅ 120.0 147.37461002147643 6.379597709990025 0.0033418531744316532 1909 1 27.773200999973824 2754.675483000028
Aggregated Passed ✅ 120.0 147.37461002147643 6.379597709990025 0.0033418531744316532 1909 1 27.773200999973824 2754.675483000028

Don't miss a new litellm release

NewReleases is sending notifications on new releases.