What's Changed
- Anthropic prompt caching cost tracking by @krrishdholakia in #5453 (usage sketch below)
- [Feat-Proxy] Track spend logs for Vertex AI pass-through endpoints by @ishaan-jaff in #5457
- [Feat] New Provider - Add Cerebras AI API by @ishaan-jaff in #5461 (usage sketch below)
- [Feat - Prometheus] Track `error_code` and `model` metrics by @ishaan-jaff in #5463
- Minor LiteLLM Fixes and Improvements by @krrishdholakia in #5456
Full Changelog: v1.44.13...v1.44.14
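A minimal sketch of what the prompt caching cost tracking (#5453) covers, using Anthropic's `cache_control` content-block format; the model id and prompt content here are illustrative, not taken from the release:

```python
import litellm

# Mark a large, reusable block for Anthropic's prompt cache; cached input
# tokens are billed at a different rate, which the cost tracking accounts for.
response = litellm.completion(
    model="anthropic/claude-3-5-sonnet-20240620",  # illustrative model id
    messages=[
        {
            "role": "system",
            "content": [
                {
                    "type": "text",
                    "text": "You are a helpful assistant. <large reusable context here>",
                    "cache_control": {"type": "ephemeral"},
                }
            ],
        },
        {"role": "user", "content": "Summarize the context above."},
    ],
)

# Compute the cost of this request, including cached-token pricing.
print(litellm.completion_cost(completion_response=response))
```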
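And a quick sketch of calling the new Cerebras provider (#5461); the `cerebras/llama3.1-8b` model id and the `CEREBRAS_API_KEY` variable follow LiteLLM's usual provider conventions and are assumptions here, so check the provider docs:

```python
import os
import litellm

os.environ["CEREBRAS_API_KEY"] = "your-api-key"  # assumed env var name

response = litellm.completion(
    model="cerebras/llama3.1-8b",  # provider-prefixed model id
    messages=[{"role": "user", "content": "Hello from Cerebras!"}],
)
print(response.choices[0].message.content)
```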
Docker Run LiteLLM Proxy
```bash
docker run \
  -e STORE_MODEL_IN_DB=True \
  -p 4000:4000 \
  ghcr.io/berriai/litellm:main-v1.44.14
```
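Once the container is up, the proxy exposes an OpenAI-compatible API on port 4000. A minimal sketch using the `openai` client; `sk-1234` stands in for whatever master key you configure, and the model name assumes a model has been added to the proxy:

```python
from openai import OpenAI

# Point the standard OpenAI client at the local LiteLLM proxy.
client = OpenAI(base_url="http://localhost:4000", api_key="sk-1234")

response = client.chat.completions.create(
    model="gpt-3.5-turbo",  # any model configured on the proxy
    messages=[{"role": "user", "content": "ping"}],
)
print(response.choices[0].message.content)
```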
Don't want to maintain your internal proxy? Get in touch 🎉
Hosted Proxy Alpha: https://calendly.com/d/4mp-gd3-k5k/litellm-1-1-onboarding-chat
Load Test LiteLLM Proxy Results
| Name | Status | Median Response Time (ms) | Average Response Time (ms) | Requests/s | Failures/s | Request Count | Failure Count | Min Response Time (ms) | Max Response Time (ms) |
|---|---|---|---|---|---|---|---|---|---|
| /chat/completions | Passed ✅ | 140.0 | 174.82 | 6.33 | 0.0 | 1895 | 0 | 108.72 | 5381.37 |
| Aggregated | Passed ✅ | 140.0 | 174.82 | 6.33 | 0.0 | 1895 | 0 | 108.72 | 5381.37 |