What's Changed
- fix logfile and pidfile of supervisor for non root environment by @dotmobo in #17267
- Fix audio cost per second override by @dushyantzz in #19158
- fix: add openai/dall-e base pricing entries by @obod-mpw in #19133
- fix(tools): prevent OOM with nested $defs in tool schemas by @rsp2k in #19112
- chore: resolve ModuleNotFoundError for Microsoft Foundry Agents by @Harshit28j in #18991
- [Docs] Deleted Key and Team Table by @yuneng-jiang in #19299
- docs: add performance improvement section by @AlexsanderHamir in #19300
- [Infra] UI - Build Release UI Script by @yuneng-jiang in #19302
- Add in-product nudge for claude code feedback survey + new learning centre by @krrishdholakia in #19303
- [docs] add cursor with mcp by @uc4w6c in #19292
- Google WIF documentation by @krrishdholakia in #19320
- [fix] responses api non OpenAI models by @uc4w6c in #19317
- [test] mcp integration test by @uc4w6c in #19319
- Fix: _handle_failure method getting called 2 times by @Sameerlite in #19326
- Fix: upload pdfs for file endpoint by @Sameerlite in #19327
- Add managed files support when load_balancing is True by @Sameerlite in #19338
- Fix: anthropic-beta is getting overridden and set to anthropic-beta by @Sameerlite in #19343
- Fix Output None for replicate handler by @Sameerlite in #19347
- Fix: update the doc by @Sameerlite in #19349
- fix(router): prevent retrying 4xx client errors by @Harshit28j in #19275
- fix(agentcore): simplify agentcore streaming by @busla in #17141
- Revert "Fix audio cost per second override" by @Sameerlite in #19352
- fix(proxy): add /a2a/{agent_id}/.well-known/agent-card.json to agent_… by @Harshit28j in #19277
- Add ChatGPT subscription support and responses bridge by @msexxeta in #19030
- Fix: vector store sync issues by @Sameerlite in #19329
- fix(responses): streaming with tool_choice allowed_tools by @Chesars in #17205
- fix(logging): prevent duplicate StandardLoggingPayload logs by @Harshit28j in #19325
- fix(langfuse_otel): ignore service logs and fix callback shadowing by @Harshit28j in #19298
- feat: Add Redis-based migration lock with bug fixes by @Harshit28j in #19261
- fix(utils.py): correctly extract messages from google genai contents by @Harshit28j in #19156
- Litellm staging 01 16 2026 by @krrishdholakia in #19234
- [Fix] Bedrock stability model usage issues by @Sameerlite in #19323
- fix/bedrock-inconsistent-postcall-hook by @Harshit28j in #19151
- docs: update UI contributing guide by @Chesars in #19353
- Fix HTML entity in survey description text by @superpoussin22 in #19307
- Fix : test_responses_streaming_failure_triggers_failure_handlers by @Sameerlite in #19354
- Deployment limits note by @krrishdholakia in #19367
- Add health check scripts and parallel execution support by @AlexsanderHamir in #19295
- [Fix] Claude Code x Bedrock Invoke fails with advanced-tool-use-2025-11-20 by @ishaan-jaff in #19373
- feat: add retry_delay, exponential_backoff, and jitter to completion() by @Harshit28j in #19371
- docs: fix bad examples from sdk by @Harshit28j in #19322
- fix(realtime): disable SSL for ws:// WebSocket connections by @Chesars in #19345
- feat: add support for keda in helm chart by @rsicart in #19337
- fix: correct Groq gpt-oss pricing and add cache pricing by @cluebbehusen in #19311
- feat(azure): add support for Azure OpenAI v1 API by @Chesars in #19313
- feat(gemini): use responseJsonSchema for Gemini 2.0+ models by @Chesars in #19314
- fix(gcs_bucket): prevent unbounded queue growth due to slow API calls by @AlexsanderHamir in #19297
- fix #19254 - [Bug]: litellm_params ignored by get_llm_provider function in completion() definition by @Lucky-Lodhi2004 in #19315
- fix(bedrock): deduplicate tool calls in assistant history (#15178) by @binbandit in #19324
- [Feat] - Add self hosted Claude Code Plugin Marketplace by @ishaan-jaff in #19378
- Simplify comments on helper file by @AlexsanderHamir in #19382
- [Fix] Fix Pass through routes to work with server root path by @ishaan-jaff in #19383
- Fix #19357 - [Bug]: Tool call fails when using Ollama backend by @Lucky-Lodhi2004 in #19369
- fix: correct us.anthropic.claude-opus-4-5 In-region pricing by @cluebbehusen in #19310
- Fix queue persistence to Redis by @emerzon in #19304
- feat (volcengine) : Support Volcengine responses api by @LingXuanYin in #18508
- fix: HTTP client memory leaks in Presidio, OpenAI, and Gemini by @rsp2k in #19190
- [Feat] UI - Allow Adding Claude Code Plugins by @ishaan-jaff in #19387
- Fix: bedrock invoke claude 4 optional params #19318 by @flex-myeonghyeon in #19381
- docs: fix vertex_region to vertex_location in Vertex AI pass-through docs by @wwwillchen in #19380
- Field-Existence Checks to Type Classes to Prevent Attribute Errors by @Lrakotoson in #18321
- [feat] mcp version up by @uc4w6c in #19379
- fix(bedrock): handle thinking with tool calls for Claude 4 models by @bensi94 in #18787
- fix(responses): stream tool call events in completion bridge by @victorigualada in #19368
- docs: Migrate Pillar Security to Generic Guardrail API by @eagle-p in #19364
- fix: preserve tool output ordering for gemini in responses bridge by @victorigualada in #19360
- Fix extract_cacheable_prefix to handle string content with message-level cache_control by @VedantMadane in #19266
- fix: drop_params not dropping prompt_cache_key for non-OpenAI providers by @Chesars in #19346
- merge main in 19/1 staging by @Sameerlite in #19400
- Merge main 01 20 2026 by @Sameerlite in #19401
- Revert "feat: add retry_delay, exponential_backoff, and jitter to completion()" by @Sameerlite in #19402
- Litellm staging 01 20 2026 fix by @Sameerlite in #19404
- Revert "fix #19254 - [Bug]: litellm_params ignored by get_llm_provider function in completion() definition" by @Sameerlite in #19405
- Revert "feat: Add Redis-based migration lock with bug fixes" by @Sameerlite in #19409
- Litellm staging 01 20 2026 by @krrishdholakia in #19386
- Add input_cost_per_video_per_second in ModelInfoBase by @Sameerlite in #19398
- Add gemini-2.5-computer-use-preview-10-2025 model for vertex ai provider by @Sameerlite in #19397
- Fix for Prometheus Metric Cardinality Issue with /responses Endpoint by @Sameerlite in #19396
- Litellm staging 01 19 2026 by @krrishdholakia in #19355
- Fix: ID mismatch between text-start and text-delta by @Sameerlite in #19390
- fix: remove count tokens optional param before request is sent to vertex by @otaviofbrito in #19359
- Fix: Handle PostgreSQL cached plan errors during rolling deployments by @AlexsanderHamir in #19424
- Fix in-flight request termination on SIGTERM when health-check runs in a separate process by @AlexsanderHamir in #19427
- fix: UI dashboard respects custom authentication header override by @stiyyagura0901 in #19276
- [Infra] UI - Fixing UI Build by @yuneng-jiang in #19431
- [Internal Tooling] Add custom auth header support and increase default prompt size to 100k chars by @AlexsanderHamir in #19436
- fix: sync Helm chart version with LiteLLM release version by @Chesars in #19438
- [Feature] UI - Playground: Button to Fill Custom API Base by @yuneng-jiang in #19440
- fix: add google-cloud-aiplatform as optional dependency with clear error message by @Chesars in #19437
- docs: clarify Gemini and Vertex AI model prefix in json file by @Chesars in #19443
- [Infra] Fixing LiteLLM Proxy Extras by @yuneng-jiang in #19446
- [Infra] Fixing dynamic_router_retry_policy CI by @yuneng-jiang in #19448
- [Infra] Use mock db for claude code marketplace tests by @yuneng-jiang in #19451
- [Infra] Fix test_route_checks by @yuneng-jiang in #19450
- [Infra] Increase Time to Wait for Spend Accuracy Tests by @yuneng-jiang in #19452
- [Infra] Changing Google Tests to use Gemini 3 Flash Preview by @yuneng-jiang in #19456
- [Feat] Add Support for Claude Code Max/OAuth 2 on LiteLLM AI Gateway by @ishaan-jaff in #19453
- [Fix] Claude Code - /messages/token_counter - ensure it works for Anthropic, Azure AI Anthropic on AI Gateway by @ishaan-jaff in #19432
- Revert "[Infra] Changing Google Tests to use Gemini 3 Flash Preview" by @yuneng-jiang in #19457
- bump: version 1.81.0 → 1.81.1 by @yuneng-jiang in #19460
- Fixes test_aaabasic_gcs_logger by @Sameerlite in #19463
- Litellm cc docs max by @ishaan-jaff in #19466
New Contributors
- @dushyantzz made their first contribution in #19158
- @obod-mpw made their first contribution in #19133
- @msexxeta made their first contribution in #19030
- @rsicart made their first contribution in #19337
- @binbandit made their first contribution in #19324
- @flex-myeonghyeon made their first contribution in #19381
- @Lrakotoson made their first contribution in #18321
- @bensi94 made their first contribution in #18787
- @victorigualada made their first contribution in #19368
- @VedantMadane made their first contribution in #19266
- @stiyyagura0901 made their first contribution in #19276
Full Changelog: v1.81.0-nightly...v1.81.1-nightly