What's Changed
- Litellm release notes 07 12 2025 by @krrishdholakia in #12563
- Add Bytez to the list of providers in the docs by @inf3rnus in #12588
- [Feat] New LLM API Integration - Add Moonshot API (Kimi) (#12551) by @ishaan-jaff in #12592
- [Feat] Add ai21/jamba-1.7 model family pricing by @ishaan-jaff in #12593
- fix: add implicit caching cost calculation for Gemini 2.x models by @colesmcintosh in #12585
- Updated release notes by @krrishdholakia in #12594
- [Feat] Vector Stores - Add Vertex RAG Engine API as a provider by @ishaan-jaff in #12595
- Wildcard model filter by @NANDINI-star in #12597
- [Bug fix] [Bug]: Verbose log is enabled by default by @ishaan-jaff in #12596
- Control Plane + Data Plane support by @krrishdholakia in #12601
- Claude 4 Bedrock /invoke route support + Bedrock application inference profile tool choice support by @krrishdholakia in #12599
- refactor(prisma_migration.py): refactor to support use_prisma_migrate - for helm hook by @krrishdholakia in #12600
- feat: Add envVars and extraEnvVars support to Helm migrations job by @AntonioKL in #12591
- feat(gemini): Add custom TTL support for context caching (#9810) by @marcelodiaz558 in #12541
- fix(anthropic): fix streaming + response_format + tools bug by @dmcaulay in #12463
New Contributors
- @AntonioKL made their first contribution in #12591
- @marcelodiaz558 made their first contribution in #12541
- @dmcaulay made their first contribution in #12463
Full Changelog: v1.74.3.rc.1...v1.74.3.dev2
Docker Run LiteLLM Proxy
docker run \
-e STORE_MODEL_IN_DB=True \
-p 4000:4000 \
ghcr.io/berriai/litellm:main-v1.74.3.dev2
Don't want to maintain your internal proxy? get in touch 🎉
Hosted Proxy Alpha: https://calendly.com/d/4mp-gd3-k5k/litellm-1-1-onboarding-chat
Docker Run LiteLLM Proxy
docker run \
-e STORE_MODEL_IN_DB=True \
-p 4000:4000 \
ghcr.io/berriai/litellm:main-v1.74.3.dev2
Don't want to maintain your internal proxy? get in touch 🎉
Hosted Proxy Alpha: https://calendly.com/d/4mp-gd3-k5k/litellm-1-1-onboarding-chat
Load Test LiteLLM Proxy Results
Name | Status | Median Response Time (ms) | Average Response Time (ms) | Requests/s | Failures/s | Request Count | Failure Count | Min Response Time (ms) | Max Response Time (ms) |
---|---|---|---|---|---|---|---|---|---|
/chat/completions | Passed ✅ | 190.0 | 206.89425634609142 | 6.19933434941609 | 0.0 | 1855 | 0 | 168.97698900004343 | 1646.9904610000299 |
Aggregated | Passed ✅ | 190.0 | 206.89425634609142 | 6.19933434941609 | 0.0 | 1855 | 0 | 168.97698900004343 | 1646.9904610000299 |