What's Changed
- (fix) get_response_headers for Azure OpenAI by @ishaan-jaff in #6344
- fix(litellm-helm): correctly use dbReadyImage and dbReadyTag values by @Hexoplon in #6336
- fix(proxy_server.py): add 'admin' user to db by @krrishdholakia in #6223
- refactor(redis_cache.py): use a default cache value when writing to r… by @krrishdholakia in #6358
- LiteLLM Minor Fixes & Improvements (10/21/2024) by @krrishdholakia in #6352
- Refactor: apply early return by @Haknt in #6369
- (refactor) remove berrispendLogger - unused logging integration by @ishaan-jaff in #6363
- (fix) standard logging metadata + add unit testing by @ishaan-jaff in #6366
- Revert "(fix) standard logging metadata + add unit testing " by @ishaan-jaff in #6381
- Add new Claude 3.5 sonnet model card by @lowjiansheng in #6378
- Add claude 3 5 sonnet 20241022 models for all provides by @Manouchehri in #6380
New Contributors
Full Changelog: v1.50.1...v1.50.2-stable
Docker Run LiteLLM Proxy
docker run \
-e STORE_MODEL_IN_DB=True \
-p 4000:4000 \
ghcr.io/berriai/litellm:main-v1.50.2-stable
Don't want to maintain your internal proxy? get in touch 🎉
Hosted Proxy Alpha: https://calendly.com/d/4mp-gd3-k5k/litellm-1-1-onboarding-chat
Docker Run LiteLLM Proxy
docker run \
-e STORE_MODEL_IN_DB=True \
-p 4000:4000 \
ghcr.io/berriai/litellm:main-v1.50.2-stable
Don't want to maintain your internal proxy? get in touch 🎉
Hosted Proxy Alpha: https://calendly.com/d/4mp-gd3-k5k/litellm-1-1-onboarding-chat
Load Test LiteLLM Proxy Results
Name | Status | Median Response Time (ms) | Average Response Time (ms) | Requests/s | Failures/s | Request Count | Failure Count | Min Response Time (ms) | Max Response Time (ms) |
---|---|---|---|---|---|---|---|---|---|
/chat/completions | Passed ✅ | 190.0 | 210.17145067557462 | 6.253172473880384 | 0.0 | 1871 | 0 | 177.3328190000143 | 1816.3144349999811 |
Aggregated | Passed ✅ | 190.0 | 210.17145067557462 | 6.253172473880384 | 0.0 | 1871 | 0 | 177.3328190000143 | 1816.3144349999811 |