github BerriAI/litellm v1.40.25

latest releases: v1.48.5-stable, v1.48.5, v1.48.4-stable...
3 months ago

What's Changed

  • feat(dynamic_rate_limiter.py): Dynamic tpm quota (multiple projects) by @krrishdholakia in #4349
  • fix(router.py): Content Policy Fallbacks for Azure 'content_filter' responses by @krrishdholakia in #4365
  • Disable message redaction in logs via request header by @msabramo in #4352

Full Changelog: v1.40.24...v1.40.25

Docker Run LiteLLM Proxy

docker run \
-e STORE_MODEL_IN_DB=True \
-p 4000:4000 \
ghcr.io/berriai/litellm:main-v1.40.25

Don't want to maintain your internal proxy? get in touch 🎉

Hosted Proxy Alpha: https://calendly.com/d/4mp-gd3-k5k/litellm-1-1-onboarding-chat

Load Test LiteLLM Proxy Results

Name Status Median Response Time (ms) Average Response Time (ms) Requests/s Failures/s Request Count Failure Count Min Response Time (ms) Max Response Time (ms)
/chat/completions Passed ✅ 120.0 137.1328113917474 6.561515229902106 0.0 1963 0 98.4713450000072 1831.7410280000104
Aggregated Passed ✅ 120.0 137.1328113917474 6.561515229902106 0.0 1963 0 98.4713450000072 1831.7410280000104

Don't miss a new litellm release

NewReleases is sending notifications on new releases.