BerriAI/litellm v1.72.1.dev1 on GitHub

What's Changed

Support returning virtual key in custom auth + Handle provider-specific optional params for embedding calls by @krrishdholakia in #11346
Doc : Nvidia embedding models by @AnilAren in #11352
feat: add cerebras/qwen-3-32b model pricing and context window by @colesmcintosh in #11373
Fix Google/Vertex AI Gemini module linting errors - Remove unused imports by @colesmcintosh in #11374

Full Changelog: v1.72.0.dev3...v1.72.1.dev1

docker run \
-e STORE_MODEL_IN_DB=True \
-p 4000:4000 \
ghcr.io/berriai/litellm:main-v1.72.1.dev1

docker run \
-e STORE_MODEL_IN_DB=True \
-p 4000:4000 \
ghcr.io/berriai/litellm:main-v1.72.1.dev1

Name	Status	Median Response Time (ms)	Average Response Time (ms)	Requests/s	Failures/s	Request Count	Failure Count	Min Response Time (ms)	Max Response Time (ms)
/chat/completions	Passed ✅	220.0	240.21992879346166	6.133456637850369	0.0	1835	0	194.88736400001017	1476.116710000042
Aggregated	Passed ✅	220.0	240.21992879346166	6.133456637850369	0.0	1835	0	194.88736400001017	1476.116710000042