## What's Changed
- fix: claude haiku cache read pricing per token by @hewliyang in #9834
- Add service annotations to litellm-helm chart by @mlhynfield in #9840
- Reflect key and team update in UI by @crisshaker in #9825
- Add user alias to API endpoint by @Jacobh2 in #9859
- Update Azure Phi-4 pricing by @emerzon in #9862
- feat: add enterpriseWebSearch tool for vertex-ai by @qvalentin in #9856
- VertexAI non-jsonl file storage support by @krrishdholakia in #9781
- [Bug Fix] Add support for UploadFile on LLM Pass through endpoints (OpenAI, Azure etc) by @ishaan-jaff in #9853
- [Feat SSO] Debug route - allow admins to debug SSO JWT fields by @ishaan-jaff in #9835
## New Contributors
- @hewliyang made their first contribution in #9834
- @mlhynfield made their first contribution in #9840
- @crisshaker made their first contribution in #9825
- @qvalentin made their first contribution in #9856
Full Changelog: v1.65.4.dev6...v1.65.4.dev8
## Docker Run LiteLLM Proxy

```shell
docker run \
  -e STORE_MODEL_IN_DB=True \
  -p 4000:4000 \
  ghcr.io/berriai/litellm:main-v1.65.4.dev8
```
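Once the container is up, a quick smoke test can be run against the proxy's `/chat/completions` endpoint. This is a minimal sketch assuming the proxy is reachable on `localhost:4000`; the model name `gpt-4o` is a placeholder for whatever model you have configured, and an `Authorization` header is only needed if a master key is set:

```shell
# Sketch of a test request to the locally running proxy.
# "gpt-4o" is an assumed model name; replace with a model you have added
# via the UI or config. Add -H "Authorization: Bearer <your-key>" if a
# master key is configured.
curl -X POST http://localhost:4000/chat/completions \
  -H "Content-Type: application/json" \
  -d '{
    "model": "gpt-4o",
    "messages": [{"role": "user", "content": "Hello from LiteLLM"}]
  }'
```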
Don't want to maintain your internal proxy? Get in touch 🎉
Hosted Proxy Alpha: https://calendly.com/d/4mp-gd3-k5k/litellm-1-1-onboarding-chat
## Load Test LiteLLM Proxy Results
| Name | Status | Median Response Time (ms) | Average Response Time (ms) | Requests/s | Failures/s | Request Count | Failure Count | Min Response Time (ms) | Max Response Time (ms) |
|---|---|---|---|---|---|---|---|---|---|
| /chat/completions | Passed ✅ | 240.0 | 271.95 | 6.12 | 0.0 | 1829 | 0 | 215.49 | 3681.3 |
| Aggregated | Passed ✅ | 240.0 | 271.95 | 6.12 | 0.0 | 1829 | 0 | 215.49 | 3681.3 |