What's Changed
- fix(vertex_httpx.py): support tool calling w/ streaming for vertex ai + gemini by @krrishdholakia in #4579
- fix(router.py): fix setting httpx mounts by @krrishdholakia in #4434
- Fix bugs with watsonx embedding/async endpoints by @simonsanvil in #4586
- fix - setting rpm/tpm on proxy through admin ui by @ishaan-jaff in #4599
- Update helm chart version by @lowjiansheng in #4590
- [Enterprise-Feature: Proxy] Track user-ip address in requests & in LiteLLM_SpendLogs by @ishaan-jaff in #4603
**Full Changelog**: https://github.com/BerriAI/litellm/compare/v1.41.11...v1.41.12
Docker Run LiteLLM Proxy
```shell
docker run \
-e STORE_MODEL_IN_DB=True \
-p 4000:4000 \
ghcr.io/berriai/litellm:main-v1.41.12
```
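Once the container is up, you can verify the proxy with a request to its OpenAI-compatible `/chat/completions` endpoint (the same endpoint exercised in the load test below). A minimal sketch, assuming a master key of `sk-1234` and a configured model named `gpt-3.5-turbo`; both are placeholders, so substitute your own values:

```shell
# Placeholder assumptions: master key sk-1234 and a model named
# gpt-3.5-turbo configured on the proxy; replace with your own values.
curl http://localhost:4000/chat/completions \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer sk-1234" \
  -d '{
    "model": "gpt-3.5-turbo",
    "messages": [{"role": "user", "content": "Hello!"}]
  }'
```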
Don't want to maintain your internal proxy? Get in touch 🎉
Hosted Proxy Alpha: https://calendly.com/d/4mp-gd3-k5k/litellm-1-1-onboarding-chat
Load Test LiteLLM Proxy Results
| Name | Status | Median Response Time (ms) | Average Response Time (ms) | Requests/s | Failures/s | Request Count | Failure Count | Min Response Time (ms) | Max Response Time (ms) |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| /chat/completions | Passed ✅ | 140.0 | 174.21 | 6.30 | 0.0 | 1884 | 0 | 119.43 | 1664.44 |
| Aggregated | Passed ✅ | 140.0 | 174.21 | 6.30 | 0.0 | 1884 | 0 | 119.43 | 1664.44 |