BerriAI/litellm v1.43.4


✨ Today we're launching support for Gemini Context Caching on the LiteLLM Proxy. Start here: https://docs.litellm.ai/docs/providers/vertex#context-caching (a usage sketch follows the highlights below)

🔥 Fix UI - Easily add Groq models

⚡️ Admin UI - Azure OpenAI models no longer require an API version when adding a model

📈 UI - providers are now sorted alphabetically on the Models page

🛠️ Bug fix: Whisper transcription was not working

📈 Fix: handle the case where the service logger has no prometheusService attribute
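For reference, here is a minimal sketch of what a context-caching request through the proxy could look like: an OpenAI-compatible /chat/completions call that references a pre-created Vertex AI cache. The model alias, master key, and the cached_content field are illustrative assumptions; the docs linked above define the exact interface.

# Sketch only: "cached_content", the model alias, and the key below are
# assumptions - see https://docs.litellm.ai/docs/providers/vertex#context-caching
curl http://0.0.0.0:4000/v1/chat/completions \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer sk-1234" \
  -d '{
    "model": "vertex_ai/gemini-1.5-pro-001",
    "messages": [{"role": "user", "content": "Summarize the cached document"}],
    "cached_content": "projects/<project>/locations/us-central1/cachedContents/<id>"
  }'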


What's Changed

Full Changelog: v1.43.3...v1.43.4

Docker Run LiteLLM Proxy

docker run \
  -e STORE_MODEL_IN_DB=True \
  -p 4000:4000 \
  ghcr.io/berriai/litellm:main-v1.43.4
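
Once the container is up, a quick smoke test might look like the following (a sketch: the health endpoint is per the LiteLLM proxy docs, and the model name is a placeholder for one you have added):

# Liveness check - returns 200 once the proxy has booted
curl http://0.0.0.0:4000/health/liveliness

# Example request through the proxy (model name is a placeholder;
# add a model first via the Admin UI or a config file)
curl http://0.0.0.0:4000/v1/chat/completions \
  -H "Content-Type: application/json" \
  -d '{"model": "gpt-3.5-turbo", "messages": [{"role": "user", "content": "hi"}]}'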

Don't want to maintain your internal proxy? Get in touch 🎉

Hosted Proxy Alpha: https://calendly.com/d/4mp-gd3-k5k/litellm-1-1-onboarding-chat

Load Test LiteLLM Proxy Results

| Name | Status | Median Response Time (ms) | Average Response Time (ms) | Requests/s | Failures/s | Request Count | Failure Count | Min Response Time (ms) | Max Response Time (ms) |
|---|---|---|---|---|---|---|---|---|---|
| /chat/completions | Passed ✅ | 140.0 | 161.51749219841636 | 6.333549395955978 | 0.23729921219676753 | 1895 | 71 | 102.82188400003633 | 956.2377719999517 |
| Aggregated | Passed ✅ | 140.0 | 161.51749219841636 | 6.333549395955978 | 0.23729921219676753 | 1895 | 71 | 102.82188400003633 | 956.2377719999517 |
