What's Changed
- [Fix] Proxy Thread creation using the Assistants API by @ishaan-jaff in #4260
- Add a Dependabot config to auto-update GitHub action versions by @kurtmckee in #4261
- [Fix-Bug] LiteLLM returns 500 when quota is exceeded for anthropic-claude-3-haiku by @ishaan-jaff in #4263
- [Docs] DeepInfra Llama 3 by @ishaan-jaff in #4265
- [Fix-Bug] Async streaming mock is different from sync streaming mock by @ishaan-jaff in #4268
- fix(proxy_server.py): Fix JWT-Auth team spend tracking by @krrishdholakia in #4269
- fix: add more type hints to init methods by @nejch in #4258
- [Fix] Use Langfuse prompt object with LiteLLM Proxy by @ishaan-jaff in #4275
- fix(vertex_httpx.py): Correctly handle Vertex content policy violation error by @krrishdholakia in #4271
New Contributors
- @kurtmckee made their first contribution in #4261
- @nejch made their first contribution in #4258
Full Changelog: v1.40.16...v1.40.17
Docker Run LiteLLM Proxy
```shell
docker run \
  -e STORE_MODEL_IN_DB=True \
  -p 4000:4000 \
  ghcr.io/berriai/litellm:main-v1.40.17
```
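Once the container is up, the proxy exposes an OpenAI-compatible API on port 4000. A minimal smoke test with curl is sketched below; the model name is a placeholder, so substitute one configured on your deployment, and add an `Authorization: Bearer <your-key>` header if your proxy has a master key set:

```shell
# Minimal sketch: send a chat completion request to the local proxy.
# "gpt-3.5-turbo" is a placeholder; use a model configured on your proxy.
curl http://localhost:4000/chat/completions \
  -H "Content-Type: application/json" \
  -d '{
        "model": "gpt-3.5-turbo",
        "messages": [{"role": "user", "content": "Hello from LiteLLM"}]
      }'
```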
Don't want to maintain your internal proxy? Get in touch 🎉
Hosted Proxy Alpha: https://calendly.com/d/4mp-gd3-k5k/litellm-1-1-onboarding-chat
Load Test LiteLLM Proxy Results
| Name | Status | Median Response Time (ms) | Average Response Time (ms) | Requests/s | Failures/s | Request Count | Failure Count | Min Response Time (ms) | Max Response Time (ms) |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| /chat/completions | Passed ✅ | 140.0 | 166.01 | 6.32 | 0.0 | 1890 | 0 | 109.84 | 2170.28 |
| Aggregated | Passed ✅ | 140.0 | 166.01 | 6.32 | 0.0 | 1890 | 0 | 109.84 | 2170.28 |