BerriAI/litellm v1.41.8 on GitHub

🛠️ fix - raise report Anthropic streaming errors (thanks David Manouchehri)

✨ [Fix] Add nvidia nim param mapping based on model passed

What's Changed

fix(anthropic.py): add index to streaming tool use by @igor-drozdov in #4554
(fix) fixed bug with the watsonx embedding endpoint by @simonsanvil in #4540
Revert "(fix) fixed bug with the watsonx embedding endpoint" by @krrishdholakia in #4561
[docs] add example Grafana Dashboard by @ishaan-jaff in #4563
build(deps): bump certifi from 2023.7.22 to 2024.7.4 by @dependabot in #4568
fix(proxy/utils.py): support logging rejected requests to langfuse, etc. by @krrishdholakia in #4564
[Feat] Add Galileo Logging Callback by @ishaan-jaff in #4567
[Fix] Add nvidia nim param mapping based on model by @ishaan-jaff in #4565
fix - raise report Anthropic streaming errors by @ishaan-jaff in #4566
feat - control guardrails per api key by @ishaan-jaff in #4569

Full Changelog: v1.41.7...v1.41.8

docker run \
-e STORE_MODEL_IN_DB=True \
-p 4000:4000 \
ghcr.io/berriai/litellm:main-v1.41.8

Name	Status	Median Response Time (ms)	Average Response Time (ms)	Requests/s	Failures/s	Request Count	Failure Count	Min Response Time (ms)	Max Response Time (ms)
/chat/completions	Passed ✅	120.0	148.48763956993193	6.382118352365276	0.0	1909	0	109.10986900000808	1689.413720999994
Aggregated	Passed ✅	120.0	148.48763956993193	6.382118352365276	0.0	1909	0	109.10986900000808	1689.413720999994