What's Changed
- (feat) add
gpt-4o-audio-preview
models to model cost map by @ishaan-jaff in #6306 - (code quality) add ruff check PLR0915 for
too-many-statements
by @ishaan-jaff in #6309 - (doc) fix typo on Turn on / off caching per Key. by @ishaan-jaff in #6297
- (feat) Support
audio
,modalities
params by @ishaan-jaff in #6304 - (feat) Support audio param in responses streaming by @ishaan-jaff in #6312
- (feat) - allow using os.environ/ vars for any value on config.yaml by @ishaan-jaff in #6276
Full Changelog: v1.49.7...v1.50.0
Docker Run LiteLLM Proxy
docker run \
-e STORE_MODEL_IN_DB=True \
-p 4000:4000 \
ghcr.io/berriai/litellm:main-v1.50.0
Don't want to maintain your internal proxy? get in touch 🎉
Hosted Proxy Alpha: https://calendly.com/d/4mp-gd3-k5k/litellm-1-1-onboarding-chat
Load Test LiteLLM Proxy Results
Name | Status | Median Response Time (ms) | Average Response Time (ms) | Requests/s | Failures/s | Request Count | Failure Count | Min Response Time (ms) | Max Response Time (ms) |
---|---|---|---|---|---|---|---|---|---|
/chat/completions | Passed ✅ | 240.0 | 266.05337712404867 | 6.142852534799847 | 0.0 | 1838 | 0 | 211.22095199996238 | 1541.6589870000053 |
Aggregated | Passed ✅ | 240.0 | 266.05337712404867 | 6.142852534799847 | 0.0 | 1838 | 0 | 211.22095199996238 | 1541.6589870000053 |