## What's Changed
- fix: claude haiku cache read pricing per token by @hewliyang in #9834
- Add service annotations to litellm-helm chart by @mlhynfield in #9840
- Reflect key and team update in UI by @crisshaker in #9825
- Add user alias to API endpoint by @Jacobh2 in #9859
- Update Azure Phi-4 pricing by @emerzon in #9862
- feat: add enterpriseWebSearch tool for vertex-ai by @qvalentin in #9856
- VertexAI non-jsonl file storage support by @krrishdholakia in #9781
- [Bug Fix] Add support for UploadFile on LLM Pass through endpoints (OpenAI, Azure etc) by @ishaan-jaff in #9853
- [Feat SSO] Debug route - allow admins to debug SSO JWT fields by @ishaan-jaff in #9835
## New Contributors
- @hewliyang made their first contribution in #9834
- @mlhynfield made their first contribution in #9840
- @crisshaker made their first contribution in #9825
- @qvalentin made their first contribution in #9856
Full Changelog: v1.65.4.dev6...v1.65.4.dev8
## Docker Run LiteLLM Proxy

```shell
docker run \
  -e STORE_MODEL_IN_DB=True \
  -p 4000:4000 \
  ghcr.io/berriai/litellm:main-v1.65.4.dev8
```
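Once the container is up, a quick smoke test can be run against the proxy's `/chat/completions` endpoint. This is a minimal sketch assuming the proxy is reachable on `localhost:4000`; the model name `gpt-4o` is a placeholder for whatever model you have configured, and an `Authorization` header is only needed if a master key is set:

```shell
# Sketch of a test request to the locally running proxy.
# "gpt-4o" is an assumed model name; replace with a model you have added
# via the UI or config. Add -H "Authorization: Bearer <your-key>" if a
# master key is configured.
curl -X POST http://localhost:4000/chat/completions \
  -H "Content-Type: application/json" \
  -d '{
    "model": "gpt-4o",
    "messages": [{"role": "user", "content": "Hello from LiteLLM"}]
  }'
```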
Don't want to maintain your internal proxy? Get in touch 🎉
Hosted Proxy Alpha: https://calendly.com/d/4mp-gd3-k5k/litellm-1-1-onboarding-chat
## Load Test LiteLLM Proxy Results
| Name | Status | Median Response Time (ms) | Average Response Time (ms) | Requests/s | Failures/s | Request Count | Failure Count | Min Response Time (ms) | Max Response Time (ms) |
|---|---|---|---|---|---|---|---|---|---|
| /chat/completions | Passed ✅ | 240.0 | 271.95 | 6.12 | 0.0 | 1829 | 0 | 215.49 | 3681.3 |
| Aggregated | Passed ✅ | 240.0 | 271.95 | 6.12 | 0.0 | 1829 | 0 | 215.49 | 3681.3 |