What's Changed
- Fix anthropic prompt caching cost calc + trim logged message in db by @krrishdholakia in #9838
- feat(realtime/): add token tracking + log usage object in spend logs … by @krrishdholakia in #9843
- fix(cost_calculator.py): handle custom pricing at deployment level fo… by @krrishdholakia in #9855
Full Changelog: v1.65.5-nightly...v1.65.6-nightly
Docker Run LiteLLM Proxy
docker run \
-e STORE_MODEL_IN_DB=True \
-p 4000:4000 \
ghcr.io/berriai/litellm:main-v1.65.6-nightly
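
Once the container is up, the proxy serves an OpenAI-compatible API on port 4000 (the same /chat/completions route exercised in the load test below). A minimal request sketch — the model name and the Authorization key are placeholders; substitute whatever you have configured on your proxy:

curl http://localhost:4000/chat/completions \
-H "Content-Type: application/json" \
-H "Authorization: Bearer sk-1234" \
-d '{"model": "gpt-4o", "messages": [{"role": "user", "content": "Hello"}]}'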
Don't want to maintain your internal proxy? Get in touch 🎉
Hosted Proxy Alpha: https://calendly.com/d/4mp-gd3-k5k/litellm-1-1-onboarding-chat
Load Test LiteLLM Proxy Results
| Name | Status | Median Response Time (ms) | Average Response Time (ms) | Requests/s | Failures/s | Request Count | Failure Count | Min Response Time (ms) | Max Response Time (ms) |
|---|---|---|---|---|---|---|---|---|---|
| /chat/completions | Passed ✅ | 190.0 | 209.99 | 6.19 | 0.0 | 1852 | 0 | 167.33 | 4428.40 |
| Aggregated | Passed ✅ | 190.0 | 209.99 | 6.19 | 0.0 | 1852 | 0 | 167.33 | 4428.40 |