BerriAI/litellm litellm_v1.75.5-dev_memory_fix

What's Changed

[Bug Fix] - Allow using reasoning_effort for gpt-5 model family and reasoning for Responses API by @ishaan-jaff in #13475
[Bug Fix]: Azure OpenAI GPT-5 max_tokens + reasoning param support by @ishaan-jaff in #13510
[Draft] [LLM Translation] Add model id check by @jugaldb in #13507

Full Changelog: v1.75.5.rc.1...litellm_v1.75.5-dev_memory_fix_1

docker run \
-e STORE_MODEL_IN_DB=True \
-p 4000:4000 \
ghcr.io/berriai/litellm:main-litellm_v1.75.5-dev_memory_fix_1

Name	Status	Median Response Time (ms)	Average Response Time (ms)	Requests/s	Failures/s	Request Count	Failure Count	Min Response Time (ms)	Max Response Time (ms)
/chat/completions	Passed ✅	180.0	211.1796445664915	6.382692480357281	0.0	1910	0	132.33785100001683	1892.3347159999935
Aggregated	Passed ✅	180.0	211.1796445664915	6.382692480357281	0.0	1910	0	132.33785100001683	1892.3347159999935