What's Changed
- Update Azure o3 pricing to match OpenAI pricing ($2/$8 per 1M tokens) by @ervwalter in #11937
- [BugFix] Ollama response_format not working by @ThakeeNathees in #11880
- fix aws bedrock claude tool call index by @jnhyperion in #11842
- fix(acompletion): allow dict for tool_choice argument by @Jannchie in #11860
- [Chore] Check team counts on license when creating new team by @ishaan-jaff in #11943
- [Docs] [Pre-Release] v1.73.0-stable by @ishaan-jaff in #11950
- Show user all models they can call (Across teams) on UI by @krrishdholakia in #11948
New Contributors
- @ervwalter made their first contribution in #11937
- @ThakeeNathees made their first contribution in #11880
- @jnhyperion made their first contribution in #11842
- @Jannchie made their first contribution in #11860
Full Changelog: v1.72.9-nightly...v1.73.0-nightly
Docker Run LiteLLM Proxy
docker run \
-e STORE_MODEL_IN_DB=True \
-p 4000:4000 \
ghcr.io/berriai/litellm:main-v1.73.0-nightly
Don't want to maintain your internal proxy? get in touch 🎉
Hosted Proxy Alpha: https://calendly.com/d/4mp-gd3-k5k/litellm-1-1-onboarding-chat
Docker Run LiteLLM Proxy
docker run \
-e STORE_MODEL_IN_DB=True \
-p 4000:4000 \
ghcr.io/berriai/litellm:main-v1.73.0-nightly
Don't want to maintain your internal proxy? get in touch 🎉
Hosted Proxy Alpha: https://calendly.com/d/4mp-gd3-k5k/litellm-1-1-onboarding-chat
Load Test LiteLLM Proxy Results
Name | Status | Median Response Time (ms) | Average Response Time (ms) | Requests/s | Failures/s | Request Count | Failure Count | Min Response Time (ms) | Max Response Time (ms) |
---|---|---|---|---|---|---|---|---|---|
/chat/completions | Passed ✅ | 230.0 | 252.55233764162196 | 6.182759830384375 | 0.0 | 1850 | 0 | 208.6453730000244 | 1743.1928639999796 |
Aggregated | Passed ✅ | 230.0 | 252.55233764162196 | 6.182759830384375 | 0.0 | 1850 | 0 | 208.6453730000244 | 1743.1928639999796 |