What's Changed
- [Bug Fix] - Allow using reasoning_effortfor gpt-5 model family andreasoningfor Responses API by @ishaan-jaff in #13475
- [Bug Fix]: Azure OpenAI GPT-5 max_tokens + reasoningparam support by @ishaan-jaff in #13510
- [Draft] [LLM Translation] Add model id check by @jugaldb in #13507
Full Changelog: v1.75.5.rc.1...litellm_v1.75.5-dev_memory_fix
Docker Run LiteLLM Proxy
docker run \
-e STORE_MODEL_IN_DB=True \
-p 4000:4000 \
ghcr.io/berriai/litellm:main-litellm_v1.75.5-dev_memory_fix
Don't want to maintain your internal proxy? get in touch 🎉
Hosted Proxy Alpha: https://calendly.com/d/4mp-gd3-k5k/litellm-1-1-onboarding-chat
Docker Run LiteLLM Proxy
docker run \
-e STORE_MODEL_IN_DB=True \
-p 4000:4000 \
ghcr.io/berriai/litellm:main-litellm_v1.75.5-dev_memory_fix
Don't want to maintain your internal proxy? get in touch 🎉
Hosted Proxy Alpha: https://calendly.com/d/4mp-gd3-k5k/litellm-1-1-onboarding-chat
Load Test LiteLLM Proxy Results
| Name | Status | Median Response Time (ms) | Average Response Time (ms) | Requests/s | Failures/s | Request Count | Failure Count | Min Response Time (ms) | Max Response Time (ms) | 
|---|---|---|---|---|---|---|---|---|---|
| /chat/completions | Passed ✅ | 120.0 | 153.71541160805808 | 6.384305831136473 | 0.0 | 1911 | 0 | 80.34984599999007 | 1251.9617030000063 | 
| Aggregated | Passed ✅ | 120.0 | 153.71541160805808 | 6.384305831136473 | 0.0 | 1911 | 0 | 80.34984599999007 | 1251.9617030000063 |