What's Changed
- UI - sort models by latency by @ishaan-jaff in #3104
- UI - move model usage to usage tab by @ishaan-jaff in #3103
- [Proxy] Add PROXY_BASE_URL in slack alerts by @ishaan-jaff in #3108
- add mixtral 8x22 by @themrzmaster in #3109
- fix streaming special character flushing logic by @krrishdholakia in #3111
Full Changelog: v1.35.10...v1.35.11
Load Test LiteLLM Proxy Results
Name | Status | Median Response Time (ms) | Average Response Time (ms) | Requests/s | Failures/s | Request Count | Failure Count | Min Response Time (ms) | Max Response Time (ms) |
---|---|---|---|---|---|---|---|---|---|
/chat/completions | Passed ✅ | 94 | 100.52679200638227 | 1.5700175715047135 | 0.0 | 470 | 0 | 86.20277200003557 | 1024.0533010000377 |
/health/liveliness | Passed ✅ | 78 | 81.20203275317154 | 15.008699891001443 | 0.0066809258361902706 | 4493 | 2 | 73.74093499998935 | 2260.400893999986 |
/health/readiness | Passed ✅ | 78 | 81.4637248292686 | 15.476364699534763 | 0.0 | 4633 | 0 | 73.4900209999978 | 1558.3156839999788 |
Aggregated | Passed ✅ | 78 | 82.27488146488147 | 32.05508216204092 | 0.0066809258361902706 | 9596 | 2 | 73.4900209999978 | 2260.400893999986 |