github BerriAI/litellm litellm_pro-mgmnt-dev-v1.81.13

3 hours ago

What's Changed

  • Prompt Management API - new API to interact with Prompt Management integrations (no PR required) by @krrishdholakia in #17800
  • fix(deps): add pytest-postgresql for db schema migration tests by @jquinter in #21280
  • fix(test): replace caplog with custom handler for parallel execution by @jquinter in #21282
  • fix(test): correct async mock for video generation logging test by @jquinter in #21283
  • fix(test): add cleanup fixture and no_parallel mark for MCP tests by @jquinter in #21284
  • Litellm anthropic doc beta header by @Sameerlite in #21320
  • Generic Guardrails: Add a configurable fallback to handle generic guardrail endpoint connection failures by @itayov in #21245
  • Fix: Exclude tool params for models without function calling support (#21125) by @AtharvaJaiswal005 in #21244
  • fix: preserve metadata for custom callbacks on codex/responses path (… by @saneroen in #21243
  • fix(proxy): handle missing DATABASE_URL in append_query_params by @vincentkoc in #21239
  • fix(mcp): revert StreamableHTTPSessionManager to stateless mode by @michelligabriele in #21323
  • fix: prevent double-counting of litellm_proxy_total_requests_metric by @shivamrawat1 in #21159
  • UI - Content Filters, help edit/view categories and 1-click add categories + go to next page by @krrishdholakia in #21223
  • fix(responses-api): return finish_reason='tool_calls' when response.completed contains function_call items by @felixti in #19745
  • Fix OCI Grok output pricing by @ishaan-jaff in #21329
  • [Infra] Bumping proxy extras version by @yuneng-jiang in #21332
  • docs: add Semgrep & OOM fixes section to v1.81.12 release notes by @AlexsanderHamir in #21334
  • Fix au.anthropic.claude opus 4 6 v1 by @anttttti in #20731
  • Feat/playground test fallbacks by @atapia27 in #21007
  • fix(proxy): fix master key rotation Prisma validation errors by @michelligabriele in #21330
  • Add GDPR Art. 32 EU PII Protection Policy Template by @ishaan-jaff in #21340
  • feat: EU AI Act Article 5 policy template for prohibited practices detection by @ishaan-jaff in #21342
  • [Feature] UI - Usage: Allow Filtering by User by @yuneng-jiang in #21351
  • fix: Make vector stores migration idempotent by @milan-berri in #21325
  • feat: guardrail tracing UI - policy, detection method, match details by @ishaan-jaff in #21349
  • feat(bedrock): support native structured outputs API (outputConfig.textFormat) by @ndgigliotti in #21222
  • Fix: Add blog as incident report by @Sameerlite in #21356
  • feat(models): add github_copilot/gpt-5.3-codex and github_copilot/claude-opus-4.6-fast by @Chesars in #21316
  • fix(proxy): preserve and forward OAuth Authorization headers through proxy layer by @iamadamreed in #19912
  • feat: Add IBM watsonx.ai rerank support by @MateuszOssGit in #21303
  • fix: make PodLockManager.release_lock atomic compare-and-delete by @emerzon in #21226
  • [Infra] v1.81.13-nightly Change Copy to main by @yuneng-jiang in #21357
  • [Infra] Add Server Root Test to GitHub Actions by @yuneng-jiang in #21353
  • fix: preserve provider_specific_fields from proxy responses by @sahukanishka in #21220
  • perf(router): remove quadratic deployment scan in usage-based routing v2 by @emerzon in #21211
  • perf(router): avoid O(n^2) membership scans in team deployment filter by @emerzon in #21210
  • fix: add store to OPENAI_CHAT_COMPLETION_PARAMS by @namabile in #21195
  • Fix Bedrock service_tier cost propagation by @emerzon in #21172
  • fix: add missing OpenAI chat completion params to OPENAI_CHAT_COMPLETION_PARAMS by @shin-bot-litellm in #21360
  • perf: increase default LRU cache size to reduce multi-model thrash by @emerzon in #21139
  • fix(router): avoid O(n) alias scan for non-alias get_model_list lookups by @emerzon in #21136
  • [Fix] Key Expiry Default Duration by @yuneng-jiang in #21362
  • Add Databricks to supported LLM providers for response schema by @TomeHirata in #21368
  • Update poetry.lock by @Sameerlite in #21383
  • [feat] Add support for Openai Evals API by @Sameerlite in #21375
  • Add vllm e2e test for embedding by @Sameerlite in #21382
  • fix(lint): suppress PLR0915 too many statements in route_request by @jquinter in #21390
  • Add Claude Sonnet 4.6 pricing by @ishaan-jaff in #21395
  • add default version for opus 4.6 by @superpoussin22 in #21397
  • Day 0 Support: Claude Sonnet 4.6 by @ishaan-jaff in #21401
  • fix(ci): reduce parallelism and add retry logic to improve test stability by @jquinter in #21394
  • fix(tests): improve conftest isolation and remove deprecation warnings by @jquinter in #21396
  • Add EU AI Act Article 5 template to policy templates UI by @ishaan-jaff in #21414
  • fix: remove unused asyncio imports (linting errors) by @jquinter in #21412
  • fix(deps): regenerate poetry.lock after pyproject.toml changes by @jquinter in #21418
  • fix(tests): resolve test isolation issue in http_handler tests by @jquinter in #21388
  • fix(test): prevent flaky failure in test_log_langfuse_v2_handles_null_usage_values by @jquinter in #21419
  • fix(token-counter): normalize encode() return type and handle HF tokenizer fallback by @jquinter in #21416
  • fix(tests): mock prisma.Prisma in backoff retry tests to avoid 'prisma generate' by @jquinter in #21421
  • fix(lakera-guardrail): avoid KeyError on missing LAKERA_API_KEY during initialization by @jquinter in #21422
  • Fix EU AI Act template: add missing category_file path by @ishaan-jaff in #21424
  • [Fix] /v1/models returning wildcard instead of expanded models for BYOK team keys by @shivamrawat1 in #21408
  • fix: remove importlib.reload calls causing cross-test class-reference staleness by @jquinter in #21425
  • Add French language support for EU AI Act Article 5 guardrail by @ishaan-jaff in #21427
  • fix(token-counter): fix test isolation and encode() return type normalization by @jquinter in #21423
  • fix(tests): use class-level AsyncHTTPHandler mock in vertex GPT-OSS tests by @jquinter in #21428
  • fix(tests): restore disable_aiohttp_transport and force_ipv4 in isolate_litellm_state by @jquinter in #21431
  • fix(test): mock enterprise license check in JWT test by @jquinter in #21285
  • fix: improve test isolation for parallel execution by @jquinter in #20595
  • improve(ci): enhance test stability with better isolation and distribution by @jquinter in #21277
  • fix: session grouping broken for dict rows from query_raw by @ishaan-jaff in #21435
  • fix: restore sys.modules after stub injection in langfuse otel test by @jquinter in #21434
  • feat(ui): add guardrail jump link in log detail view by @ishaan-jaff in #21437
  • move e2e to llm translation by @Sameerlite in #21387
  • Add compliance checker endpoints + UI panel by @ishaan-jaff in #21432
  • fix(bedrock): broaden Nova 2 model detection to support all nova-2-* variants by @ryanh-ai in #21358
  • Add prompt injection detection policy template + guardrails by @ishaan-jaff in #21452
  • feat: split EU AI Act Article 5 into 5 dedicated sub-guardrails by @ishaan-jaff in #21453
  • Add MCP Security guardrail to block unregistered MCP servers by @ishaan-jaff in #21429
  • End users - Allow giving end users access to specific mcp servers by @krrishdholakia in #21411
  • Revert "End users - Allow giving end users access to specific mcp servers " by @krrishdholakia in #21461
  • Add support for devstral 2512 model aliases by @stronk7 in #21372
  • feat(bedrock): support nova/ and nova-2/ spec prefixes for custom imported models by @ryanh-ai in #21359
  • Add native Responses API support for Databricks GPT models by @TomeHirata in #21460
  • Litellm prompt registry fix by @Harshit28j in #21402
  • Prompt Management API - allow integrating with LiteLLM prompt management without a PR by @krrishdholakia in #17946
  • Revert "fix: make PodLockManager.release_lock atomic compare-and-delete" by @Sameerlite in #21469
  • Litellm oss staging 02 16 2026 by @krrishdholakia in #21326
  • Litellm oss staging 02 17 2026 by @krrishdholakia in #21361
  • [Chore]Add remaining beta tests2 by @Sameerlite in #21299
  • Add mapping for websearch from v1/messages to chat/completions by @Sameerlite in #21465
  • Add 'reasoning' field to 'reasoning_content' field in delta by @Sameerlite in #21468
  • [Feat] Add duckduckgo as search tool by @Sameerlite in #21467
  • Litellm sanitise anthropic mesages 2 by @Sameerlite in #21464
  • Add File deletion criteria with batch references by @Sameerlite in #21456
  • Incident Report: vLLM Embeddings Broken by encoding_format Parameter by @Sameerlite in #21474
  • [Feat]Add day 0 claude sonnet 4.6 feat support by @Sameerlite in #21448
  • Fix mock test by @Sameerlite in #21475
  • fix(tests): restore proxy_server module attrs after test_proxy_admin_expired_key_from_cache by @jquinter in #21473
  • fix(ci): add prisma generate step to matrix CI workflow by @jquinter in #21436
  • feat(datadog): add 'team' tag to logs, metrics, and cost management by @Harshit28j in #21449
  • fix(tests): resolve merge conflict in test_vertex_ai_rerank_transformation.py by @jquinter in #21478
  • fix(proxy): use prisma.Json for JSON fields in _rotate_master_key create_many() by @jquinter in #21479
  • fix(tests): add inference_geo to model prices JSON schema validator by @jquinter in #21477
  • Add deployment affinity routing callback by @emerzon in #19143
  • [Refactor] UI - Keys: Change Key Type Label by @yuneng-jiang in #21364
  • Add version in claude-code-beta-headers-incident by @Sameerlite in #21485
  • fix: guard against None metadata in prometheus metrics by @ishaan-jaff in #21489
  • fix(tests): restore litellm.model_cost after reload endpoint test by @jquinter in #21499
  • [Infra] Change Server Root Path GitHub action test to non root image by @yuneng-jiang in #21495
  • fix(ci): force-reinstall enterprise package to override PyPI version by @jquinter in #21481
  • fix(tests): resolve MCP test isolation failures in parallel execution by @jquinter in #21484
  • fix(tests): restore default_internal_user_params instead of delattr-ing it by @jquinter in #21483
  • fix: improve streaming proxy throughput by fixing middleware and logging bottlenecks by @ishaan-jaff in #21501
  • fix(ci): install enterprise package into main project venv, not enterprise's own venv by @jquinter in #21506
  • [Bug] Allow internal_user_viewer to access RAG endpoints; restrict ingest to existing vector stores by @shivamrawat1 in #21508
  • fix(sso): preserve SSO role regardless of role_mappings config by @yuneng-jiang in #21503
  • [Feature] Allow store_model_in_db to be set via database by @yuneng-jiang in #21511
  • fix: CI failures - missing env key doc + streaming test by @ishaan-jaff in #21510
  • Add aviation and UAE policy templates with tag-based filtering by @ishaan-jaff in #21518
  • Mcp user permissions by @krrishdholakia in #21462
  • feat(ui): add CSV dataset upload to compliance playground by @ishaan-jaff in #21526
  • Litellm cicd 190226 by @Sameerlite in #21531
  • Add supoort for context-1m-2025-08-07 by @Sameerlite in #21534
  • fix: prevent sys.modules["langfuse"] import failures in langfuse unit tests by @jquinter in #21440
  • fix(types): add = None defaults to Optional[str] fields in managed table models by @jquinter in #21500
  • [Feature] UI - Models & Endpoints: Add Model Settings Modal by @yuneng-jiang in #21516
  • fix(tests): restore litellm.model_cost after TestPriceDataReloadIntegration tests by @jquinter in #21505
  • fix(tests): update MCP tests broken by user permissions commit (#21462) by @jquinter in #21536
  • fix(mypy): resolve type errors from MCP user permissions commit by @jquinter in #21535
  • fix(test): restore default_internal_user_params to None instead of delattr by @jquinter in #21439
  • fix(tests): use record.getMessage() instead of record.message for LogRecord by @jquinter in #21476
  • fix(ui): remove duplicate URL in tagsSpendLogsCall query string by @jquinter in #20909
  • Competitor guardrails: streaming discovery, variations, pre/post split by @ishaan-jaff in #21533
  • [Feature] Allow team members to view entire team usage by @yuneng-jiang in #21537
  • Litellm project management apis by @Harshit28j in #21078

New Contributors

Full Changelog: litellm_sso-dev-v1.81.13...litellm_pro-mgmnt-dev-v1.81.13

Don't miss a new litellm release

NewReleases is sending notifications on new releases.