BerriAI/litellm v1.48.8 on GitHub

What's Changed

Fixed minor typo in bash command to prevent overwriting .env file by @sdaoudi in #5902
(docs) fix health check documentation language problems by @ishaan-jaff in #5984
(docs) add example using Azure OpenAI entrata id, client_id, tenant_id with litellm by @ishaan-jaff in #5985
(docs) prometheus metrics document all prometheus metrics by @ishaan-jaff in #5989
[Bug] Skip slack alert if there was no spend by @pazcuturi in #5998
(feat proxy slack alerting) - allow opting in to getting key / internal user alerts by @ishaan-jaff in #5990
(performance improvement - vertex embeddings) ~111.11% faster by @ishaan-jaff in #6000

Full Changelog: v1.48.7...v1.48.8

docker run \
-e STORE_MODEL_IN_DB=True \
-p 4000:4000 \
ghcr.io/berriai/litellm:main-v1.48.8

Name	Status	Median Response Time (ms)	Average Response Time (ms)	Requests/s	Failures/s	Request Count	Failure Count	Min Response Time (ms)	Max Response Time (ms)
/chat/completions	Passed ✅	95	117.17931074485615	6.495544582359362	0.0	1944	0	73.97817899999382	3032.9931830000305
Aggregated	Passed ✅	95	117.17931074485615	6.495544582359362	0.0	1944	0	73.97817899999382	3032.9931830000305