BerriAI/litellm v1.48.3 on GitHub

What's Changed

Add Llama 3.2 90b model on Vertex AI. by @Manouchehri in #5908
Update litellm helm envconfigmap by @Pit-Storm in #5872
LiteLLM Minor Fixes & Improvements (09/24/2024) by @krrishdholakia in #5880
LiteLLM Minor Fixes & Improvements (09/25/2024) by @krrishdholakia in #5893
[feat-Prometheus] Track api key alias and api key hash for remaining tokens metric by @ishaan-jaff in #5924
[Fix proxy perf] Use correct cache key when reading from redis cache by @ishaan-jaff in #5928
[Fix] Perf use only async functions for get cache by @ishaan-jaff in #5930
[docs] updated langfuse integration guide by @jannikmaierhoefer in #5921
Upgrade dependencies in dockerfile by @Jacobh2 in #5862
[Fix Azure AI Studio] drop_params_from_unprocessable_entity_error by @ishaan-jaff in #5936

Full Changelog: v1.48.2...v1.48.3

docker run \
-e STORE_MODEL_IN_DB=True \
-p 4000:4000 \
ghcr.io/berriai/litellm:main-v1.48.3

Name	Status	Median Response Time (ms)	Average Response Time (ms)	Requests/s	Failures/s	Request Count	Failure Count	Min Response Time (ms)	Max Response Time (ms)
/chat/completions	Passed ✅	100.0	118.27303725401755	6.443801071488233	0.0	1929	0	76.61121600000342	2505.7243389999826
Aggregated	Passed ✅	100.0	118.27303725401755	6.443801071488233	0.0	1929	0	76.61121600000342	2505.7243389999826