What's Changed
- Fix Datadog JSON serialization by @idris in #4920
- [Fix] using airgapped license for Enterprise by @ishaan-jaff in #4921
- [Feat] - log writing BatchSpendUpdate events on OTEL by @ishaan-jaff in #4924
- Fix Canary error with `docusaurus start` by @yujonglee in #4919
- [Feature]: Allow using custom and on-demand models in Fireworks AI + update data to model_prices_and_context_window.json by @danielbichuetti in #4730
- Proxy Enterprise - security - check max request size by @ishaan-jaff in #4926
- [Feat Enterprise] - check max response size by @ishaan-jaff in #4928
- Feat Enterprise - set max request / response size UI by @ishaan-jaff in #4927
Full Changelog: v1.42.4-stable...v1.42.5-stable
Docker Run LiteLLM Proxy
docker run \
-e STORE_MODEL_IN_DB=True \
-p 4000:4000 \
ghcr.io/berriai/litellm:main-v1.42.5-stable
Don't want to maintain your internal proxy? Get in touch 🎉
Hosted Proxy Alpha: https://calendly.com/d/4mp-gd3-k5k/litellm-1-1-onboarding-chat
Load Test LiteLLM Proxy Results
Name | Status | Median Response Time (ms) | Average Response Time (ms) | Requests/s | Failures/s | Request Count | Failure Count | Min Response Time (ms) | Max Response Time (ms) |
---|---|---|---|---|---|---|---|---|---|
/chat/completions | Passed ✅ | 130.0 | 161.68211911395028 | 6.545409328079534 | 0.0 | 1957 | 0 | 110.24948799996537 | 2273.3069750000254 |
Aggregated | Passed ✅ | 130.0 | 161.68211911395028 | 6.545409328079534 | 0.0 | 1957 | 0 | 110.24948799996537 | 2273.3069750000254 |
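
The table reports throughput and request counts but not how long the run lasted. As a rough sketch, the duration and failure rate can be derived from the `/chat/completions` row, assuming (this is not stated in the release notes) that Requests/s is an average over the whole run:

```shell
# Values copied from the /chat/completions row of the table above.
request_count=1957
requests_per_second=6.545409328079534
failure_count=0

# Approximate run length in seconds: total requests divided by average throughput.
# Assumption: Requests/s is averaged over the full run.
duration=$(awk -v n="$request_count" -v r="$requests_per_second" \
  'BEGIN { printf "%.0f", n / r }')

# Failure rate as a percentage of all requests.
failure_rate=$(awk -v f="$failure_count" -v n="$request_count" \
  'BEGIN { printf "%.2f", 100 * f / n }')

echo "approx duration: ${duration}s, failure rate: ${failure_rate}%"
# prints: approx duration: 299s, failure rate: 0.00%
```

Under that assumption the run covers roughly five minutes of traffic with no failed requests.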