BerriAI/litellm v1.44.14-stable on GitHub

What's Changed

anthropic prompt caching cost tracking by @krrishdholakia in #5453
[Feat-Proxy] track spend logs for vertex pass through endpoints by @ishaan-jaff in #5457
[Feat] New Provider - Add Cerebras AI API by @ishaan-jaff in #5461
[Feat - Prometheus] - Track error_code, model metric by @ishaan-jaff in #5463
Minor LiteLLM Fixes and Improvements by @krrishdholakia in #5456

Full Changelog: v1.44.13...v1.44.14-stable

docker run \
-e STORE_MODEL_IN_DB=True \
-p 4000:4000 \
ghcr.io/berriai/litellm:main-v1.44.14-stable

Name	Status	Median Response Time (ms)	Average Response Time (ms)	Requests/s	Failures/s	Request Count	Failure Count	Min Response Time (ms)	Max Response Time (ms)
/chat/completions	Passed ✅	140.0	164.65214474238385	6.252754675195834	0.0	1871	0	115.70125900004768	2147.0894060000205
Aggregated	Passed ✅	140.0	164.65214474238385	6.252754675195834	0.0	1871	0	115.70125900004768	2147.0894060000205