What's Changed
- LiteLLM Minor Fixes & Improvements (12/13/2024) pt.1 by @krrishdholakia in #7219
- (feat - Router / Proxy ) Allow setting budget limits per LLM deployment by @ishaan-jaff in #7220
- build(deps): bump nanoid from 3.3.7 to 3.3.8 in /ui/litellm-dashboard by @dependabot in #7216
- Litellm add router to base llm testing by @ishaan-jaff in #7202
- fix(main.py): fix retries being multiplied when using openai sdk by @krrishdholakia in #7221
- (proxy) - Auth fix, ensure re-using safe request body for checking
model
field by @ishaan-jaff in #7222 - (UI fix) - Allow editing Key Metadata by @ishaan-jaff in #7230
- (UI) Fix Usage Tab - Don't make expensive UI queries after SpendLogs crosses 1M Rows by @ishaan-jaff in #7229
- (code quality) Add ruff check to ban
print
in repo by @ishaan-jaff in #7233 - (UI QA) - stop making expensive UI queries when 1M + spendLogs in DB by @ishaan-jaff in #7234
- Fix vllm import by @ivanvykopal in #7224
- Add new Gemini 2.0 Flash model to Vertex AI. by @Manouchehri in #7193
- Litellm remove circular imports by @krrishdholakia in #7232
- (feat) Add Tag-based budgets on litellm router / proxy by @ishaan-jaff in #7236
- Litellm dev 12 14 2024 p1 by @krrishdholakia in #7231
New Contributors
- @ivanvykopal made their first contribution in #7224
Full Changelog: v1.55.2...v1.55.3
Docker Run LiteLLM Proxy
docker run \
-e STORE_MODEL_IN_DB=True \
-p 4000:4000 \
ghcr.io/berriai/litellm:main-v1.55.3
Don't want to maintain your internal proxy? get in touch 🎉
Hosted Proxy Alpha: https://calendly.com/d/4mp-gd3-k5k/litellm-1-1-onboarding-chat
Load Test LiteLLM Proxy Results
Name | Status | Median Response Time (ms) | Average Response Time (ms) | Requests/s | Failures/s | Request Count | Failure Count | Min Response Time (ms) | Max Response Time (ms) |
---|---|---|---|---|---|---|---|---|---|
/chat/completions | Passed ✅ | 250.0 | 281.1265187306242 | 6.1657490001280255 | 0.0033418693767631575 | 1845 | 1 | 119.36488499998177 | 3755.8482019999815 |
Aggregated | Passed ✅ | 250.0 | 281.1265187306242 | 6.1657490001280255 | 0.0033418693767631575 | 1845 | 1 | 119.36488499998177 | 3755.8482019999815 |