What's Changed
- Add mistral medium latest to model prices by @dragosMC91 in #2562
- fix(anthropic): tool calling detection by @lucasmrdt in #2558
- Fixed azure ad token not being processed properly in embedding models by @vilmar-hillow in #2142
- [FEAT] Litellm admin UI cleanup by @ishaan-jaff in #2588
- [Admin UI] Show models when creating teams by @ishaan-jaff in #2589
- build(deps): bump follow-redirects from 1.15.4 to 1.15.6 in /docs/my-website by @dependabot in #2521
- feat(handle_jwt.py): support authenticating admins into the proxy via jwt's by @krrishdholakia in #2592
- [Feat] /metrics endpoint for Prometheus, Grafana by @ishaan-jaff in #2591
- fix(proxy/utils.py): fix reset budget logic by @krrishdholakia in #2593
New Contributors
- @dragosMC91 made their first contribution in #2562
- @lucasmrdt made their first contribution in #2558
- @vilmar-hillow made their first contribution in #2142
Full Changelog: v1.32.4...v1.32.7
Load Test LiteLLM Proxy Results
Name | Status | Median Response Time (ms) | Average Response Time (ms) | Requests/s | Failures/s | Request Count | Failure Count | Min Response Time (ms) | Max Response Time (ms) |
---|---|---|---|---|---|---|---|---|---|
/chat/completions | Passed ✅ | 89 | 96.38442711304144 | 1.5365464658154921 | 0.0 | 460 | 0 | 82.7262380000775 | 600.5486389999533 |
/health/liveliness | Passed ✅ | 66 | 69.08479451166716 | 15.45899357346543 | 0.0 | 4628 | 0 | 63.394482999910906 | 1327.8773580000234 |
/health/readiness | Passed ✅ | 66 | 68.6079558634594 | 15.485716120697004 | 0.003340318403946722 | 4636 | 1 | 63.522563999868 | 1238.7321420001172 |
Aggregated | Passed ✅ | 66 | 70.14888408628039 | 32.481256159977924 | 0.003340318403946722 | 9724 | 1 | 63.394482999910906 | 1327.8773580000234 |