github BerriAI/litellm v1.42.5

latest releases: v1.52.0-stable, v1.52.2-dev1, v1.52.3...
3 months ago

🔥 We're launching filtering LLMs by provider, max_tokens on https://models.litellm.ai 👉 View cost, max_tokens for 200+ LLMs (@LiteLLM)

litellm_filters

🔭 [Feat] - log writing BatchSpendUpdate events on OTEL

🔑 Proxy Enterprise - security - check max request size

🛡️ [Feat Enterprise] - check max response size

✅ Feat Enterprise - set max request / response size UI

What's Changed

Full Changelog: v1.42.4...v1.42.5

Docker Run LiteLLM Proxy

docker run \
-e STORE_MODEL_IN_DB=True \
-p 4000:4000 \
ghcr.io/berriai/litellm:main-v1.42.5

Don't want to maintain your internal proxy? get in touch 🎉

Hosted Proxy Alpha: https://calendly.com/d/4mp-gd3-k5k/litellm-1-1-onboarding-chat

Load Test LiteLLM Proxy Results

Name Status Median Response Time (ms) Average Response Time (ms) Requests/s Failures/s Request Count Failure Count Min Response Time (ms) Max Response Time (ms)
/chat/completions Passed ✅ 130.0 149.07872144345131 6.351580011280877 0.0 1901 0 107.79980099999875 1698.2656079999856
Aggregated Passed ✅ 130.0 149.07872144345131 6.351580011280877 0.0 1901 0 107.79980099999875 1698.2656079999856

Don't miss a new litellm release

NewReleases is sending notifications on new releases.