What's Changed
- merge main by @Sameerlite in #17408
- Guardrail API - support tool call checks on OpenAI /chat/completions, OpenAI /responses, Anthropic /v1/messages by @krrishdholakia in #17459
- docs: document responses and embedding api for github copilot by @codgician in #17456
- fix(openai): add clear error message for empty LLM endpoint responses by @DevajMody in #17445
- Fix: Skip empty text blocks in Anthropic system messages by @HaiyiMei in #17442
- feat: Basic Weave OTEL integration by @andrewtruong in #17439
- fix(ocr): add missing ocr and aocr to CallTypes enum by @Chesars in #17435
- fix(groq): check supports_response_schema before using json_tool_call workaround by @Chesars in #17438
- fix(pricing): add missing Moonshot turbo models and fix incorrect pricing by @Chesars in #17432
- fix: make bedrock converse messages respect modify_params as expected by @danielmklein in #17427
- Update docs to link agent hub by @krrishdholakia in #17462
- [Feature] Add Auto Redirect to SSO to New Login Page by @yuneng-jiang in #17451
- [Infra] UI - Change Credentials to use React Query by @yuneng-jiang in #17465
- Map output_tokens_details of responses API to completion_tokens_details by @Sameerlite in #17458
- Add support for bedrock qwen 2 imported model by @Sameerlite in #17461
- Add support for file content download for bedrock batches by @Sameerlite in #17470
- fix: test_default_api_base for ragflow by @Sameerlite in #17485
- feat - new model amazon.titan-image-generator-v2:0 by @ishaan-jaff in #17488
- Fix aim guardrail tests by @hxyannay in #17499
- fix: resolve ruff lint errors by @AnasAbdelR in #17490
- 🆕 feat: support routing to only websearch supported deployments by @raghav-stripe in #17500
- [Management endpoints] - Allow admin viewer to access global tag usage by @ishaan-jaff in #17501
- [Feat] Agent Access Control - Enforce Allowed agents by key, team + add agent access groups on backend by @ishaan-jaff in #17502
- [Feat] UI - Agent Gateway - set allowed agents by key, team by @ishaan-jaff in #17511
- [Docs] A2a - Permission management by @ishaan-jaff in #17515
- [Feat] Allow adding OpenAI compatible chat providers using .json + add public ai provider by @ishaan-jaff in #17448
- chore: add stack trace to mcp error message by @uc4w6c in #17269
- (fix): empty response + vllm streaming by @dominicfeliton in #17516
- fix(guardrails): mask all matching keywords in content filter by @DevajMody in #17521
- Enable detailed debugging for reference by @krrishdholakia in #17508
- fix(sql): Optimize SpendLogs queries to use timestamp filtering for index usage by @kristianmitk in #17504
- fix(gemini): handle partial JSON chunks after first valid chunk by @Chesars in #17496
- fix(unified_guardrail.py): support during_call event type for unified guardrails by @krrishdholakia in #17514
- Guardrails API - new structured_messages param by @krrishdholakia in #17518
- [Feat] VertexAI - Add BGE Embeddings support by @Sameerlite in #17362
- [Fix] Select in Edit Membership Modal by @yuneng-jiang in #17524
- fix: code quality issues from ruff linter by @AlexsanderHamir in #17536
- Fix: Presidio guardrail test TypeError and license base64 decoding error by @AlexsanderHamir in #17538
- [fix] parse budget:thinking blocks for opus 4.5 by @colinlin-stripe in #17534
- Fix vector store configuration synchronization failure by @Sameerlite in #17525
- Add support for cursor BYOK with its own configuration by @Sameerlite in #17519
- Add background health checks to db by @Sameerlite in #17528
- Fix: Lack of None value checks & update publicai_chat_transformation tests by @AlexsanderHamir in #17539
- Remove hardcoded api key from test file by @krrishdholakia in #17540
- Make sure that media resolution is only for gemini 3 model by @Sameerlite in #17137
- [Bug fix] WatsonX audio transcriptions, don't force content type in request headers by @ishaan-jaff in #17546
- [Infra] Bump LiteLLM Enterprise Version by @yuneng-jiang in #17549
- Fix budget update to allow null max_budget by @ishaan-jaff in #17545
- fix failing vertex tests by @Sameerlite in #17542
- [Feature] Show all callbacks on UI by @yuneng-jiang in #16335
- fix(github_copilot): preserve encrypted_content in reasoning items for multi-turn conversations by @rgshr in #17130
- feat: Add gpt-5.1-codex-max model pricing and configuration by @Chesars in #17541
- Fix: apply_guardrail method and improve test isolation by @AlexsanderHamir in #17555
- Fix: MLflow streaming spans for Anthropic passthrough by @uc4w6c in #17288
- [Fixes] Dynamic Rate Limiter - Dynamic rate limiting token count increases/decreases by 1 instead of actual count + Redis TTL by @ishaan-jaff in #17558
- [Bug fix] Secret Managers Integration - Make email and secret manager operations independent in key management hooks by @ishaan-jaff in #17551
- feat: add Mistral Large 3 model support by @Chesars in #17547
- fix(team): use organization.members instead of deprecated organizatio… by @DevajMody in #17557
- [Feature] Customer (end user) Usage by @yuneng-jiang in #17498
- [Feat] Allow using dynamic rate limit/priority reservation on teams by @ishaan-jaff in #17061
- build(deps): bump jws from 3.2.2 to 3.2.3 in /ui/litellm-dashboard by @dependabot[bot] in #17494
- fix(presidio): handle empty content and error dict responses in guardrails by @dominicfallows in #17489
- fix(bedrock): anthropics beta headers for qwen by @irfansofyana in #17467
- fix(gemini): fix reasoning_content showing duplicate content in streaming responses by @Chesars in #17266
- Fixes #13652 - auth not working with ollama.com by @GusBricker in #17191
- fix(docs): remove source .env by @utsumi-fj in #17466
- fix(streaming): add length validation for empty tool_calls in delta by @DevajMody in #17523
- feat(oci): Enable Oracle Cloud Infrastructure configuration via UI by @jatorre in #17159
- Fix: Gemini image_tokens treated as text tokens in cost calculation by @Chesars in #17554
- fix(responses): Add image generation support for Responses API by @Chesars in #16586
- fix: Handle global location for Vertex AI image generation endpoint by @OlivverX in #17255
- Feat/mcp preserve tool metadata calltoolresult by @uc4w6c in #17561
- [Feature] Customer Usage UI by @yuneng-jiang in #17506
- [Fix] Mock server_root_path for v2/login test by @yuneng-jiang in #17563
- [Fix] CI/CD - Adding end user and org to service types by @yuneng-jiang in #17564
- build(deps): bump mdast-util-to-hast from 13.2.0 to 13.2.1 in /ui/litellm-dashboard by @dependabot[bot] in #17444
- [Feature] Support Images in Compare UI by @yuneng-jiang in #17562
- fix: mcp test by @uc4w6c in #17570
- [Fix] Flaky UI Test by @yuneng-jiang in #17569
- [Fix] Change useAuthorized Hook to redirect to new Login Page by @yuneng-jiang in #17553
- Fix: remove merge conflict markdown by @AlexsanderHamir in #17586
- Fix Langfuse logger test mock setup by @AlexsanderHamir in #17588
- fix: preserve usage from JSON response for OpenAI provider in Bedrock by @AlexsanderHamir in #17589
- Revert "Fix Langfuse logger test mock setup (#17588)" by @AlexsanderHamir in #17590
- Fix: Langfuse logger test mock setup by @AlexsanderHamir in #17591
- Fix flaky test: test_logging_non_streaming_request by @AlexsanderHamir in #17592
- Fix: Ensure guardrail metadata is preserved in request_data by @AlexsanderHamir in #17593
- Fix: Resolve flakiness in three integration tests by @AlexsanderHamir in #17594
- fix: add retry logic for flaky Azure image generation health check test by @AlexsanderHamir in #17595
- Add retry logic to apk package installation in Dockerfile.non_root by @AlexsanderHamir in #17596
- fix: make test_watsonx_gpt_oss_prompt_transformation deterministic by @AlexsanderHamir in #17597
- [Feature] UI - Add Info Banner for Customer Usage by @yuneng-jiang in #17598
- fix: make test_log_langfuse_v2_handles_null_usage_values more robust by @AlexsanderHamir in #17599
- [Infra] Building UI for release by @yuneng-jiang in #17600
- Fix security vulnerability: update mdast-util-to-hast to 13.2.1 by @AlexsanderHamir in #17601
- fix: mark test_caching_with_reasoning_content as flaky and handle InternalServerError by @AlexsanderHamir in #17603
- docs: add gpt-5.1-codex-max to OpenAI provider documentation by @Chesars in #17602
- feat: Add xhigh reasoning effort for gpt-5.1-codex-max by @withsmilo in #17585
- 1.80.8 RC docs by @ishaan-jaff in #17605
- feat: add script to create branches with litellm_ prefix by @AlexsanderHamir in #17606
- [Fix] UI - MCP Test Tool Fix by @yuneng-jiang in #17609
- [Infra] Building UI To Include Fix by @yuneng-jiang in #17610
- Fix: test_guardrails_with_api_key_controls by @AlexsanderHamir in #17612
- refactor: extract helper methods from pre_call_hook to fix PLR0915 by @AlexsanderHamir in #17613
- [Docs] Customer Usage by @yuneng-jiang in #17615
- docs: add Microsoft GraphRAG to projects using LiteLLM by @Chesars in #17616
- Helm Chart: Add possibility to override command, args and add deployment labels by @LukasdeBoer in #17535
- [Docs] Update sidebar for customer usage by @yuneng-jiang in #17617
- [New Model] Add Amazon Nova as first party provider for chat completions by @anilkumarkodali in #17351
- [Docs] Fixing path to image by @yuneng-jiang in #17618
- Add beta header in vertex request for claude websearch request by @Sameerlite in #16269
- fix(vertex_llm_base.py): Add missing scopes to identity pool credentials by @krrishdholakia in #17560
- [Feature] Loading states for Edit Membership modal by @yuneng-jiang in #17621
New Contributors
- @DevajMody made their first contribution in #17445
- @andrewtruong made their first contribution in #17439
- @AnasAbdelR made their first contribution in #17490
- @dominicfeliton made their first contribution in #17516
- @kristianmitk made their first contribution in #17504
- @rgshr made their first contribution in #17130
- @dominicfallows made their first contribution in #17489
- @irfansofyana made their first contribution in #17467
- @GusBricker made their first contribution in #17191
- @OlivverX made their first contribution in #17255
- @withsmilo made their first contribution in #17585
- @LukasdeBoer made their first contribution in #17535
- @anilkumarkodali made their first contribution in #17351
Full Changelog: v1.80.7.dev.3...v1.80.8-nightly