BerriAI/litellm v1.43.19.dev2 on GitHub

What's Changed

[Fix Router] - Don't cooldown Default Provider deployment by @ishaan-jaff in #5302
[Fix] Router - Do not retry on 404 errors from LLM API providers by @ishaan-jaff in #5298
[Fix Router] Don't retry errors when healthy_deployments=0 by @ishaan-jaff in #5299
[Fix] Router - don't recursively use the same fallback by @ishaan-jaff in #5301

Full Changelog: v1.43.19.dev1...v1.43.19.dev2

docker run \
-e STORE_MODEL_IN_DB=True \
-p 4000:4000 \
ghcr.io/berriai/litellm:main-v1.43.19.dev2

docker run \
-e STORE_MODEL_IN_DB=True \
-p 4000:4000 \
ghcr.io/berriai/litellm:main-v1.43.19.dev2

Name	Status	Median Response Time (ms)	Average Response Time (ms)	Requests/s	Failures/s	Request Count	Failure Count	Min Response Time (ms)	Max Response Time (ms)
/chat/completions	Passed ✅	140.0	155.707955254718	6.374453265792973	0.0	1908	0	109.84177599999612	1317.8180300000122
Aggregated	Passed ✅	140.0	155.707955254718	6.374453265792973	0.0	1908	0	109.84177599999612	1317.8180300000122