What's Changed
- feat(tracing): add chat token detail OTEL span attributes by @jerkeyray in #1975
- feat: hide deleted virtual keys from filter options by @jerkeyray in #1952
- helm: add client.enforceAuthOnInference to bifrost.config by @nirzOps in #1946
- fix: add Bearer token authentication to VLLM provider by @Radheshg04 in #1963
- fix(logging): avoid loading raw_request/raw_response in log list queries by @Vaibhav701161 in #1934
- nix: add NixOS module and flake packages for bifrost by @ReStranger in #1858
- feat: add large payload streaming context keys and provider utilities by @danpiths in #1768
- feat: add large payload transport hooks and response streaming by @danpiths in #1769
- feat: add large payload request passthrough for simple providers by @danpiths in #1770
- feat: add large payload support for OpenAI and Azure providers by @danpiths in #1771
- feat: add large payload support for Anthropic, Gemini, Vertex, and Cohere by @danpiths in #1772
- feat: add large payload awareness to plugins and logstore by @danpiths in #1773
- feat: add large payload config UI and log display by @danpiths in #1774
- bifrost cli by @akshaydeo in #1990
- fixed cli publish flows by @akshaydeo in #2001
- fixed cli actions condition by @akshaydeo in #2002
- mode fixes by @akshaydeo in #2003
- adds changelog for cli by @akshaydeo in #2004
- docs(bedrock): how to use arns and deployment mapping by @TejasGhatte in #2005
- fixed discord link by @17jmumford in #2018
- chore: update flake.lock to bump nix go version to 1.26.1 by @danpiths in #2020
- fix: makefile shell compatibility and gzip reader panic by @danpiths in #2022
- adds missing logprob mapping for gemini models (vertex and gemini proivder) by @akshaydeo in #2009
- virtual key search/filtering and pagination by @akshaydeo in #2012
- remove extra headers from restricted keys in BifrostContext by @akshaydeo in #2013
- feat: add API key selection by ID with priority over name selection by @Pratham-Mishra04 in #1941
- feat: add a prompt repository with folder, prompt, version, and session schemas, along with the backend. by @impoiler in #1966
- feat: add prompt repository HTTP handlers with CRUD operations for folders, prompts, versions, and sessions by @impoiler in #1967
- feat: add core UI components for the prompt repository by @impoiler in #1968
- feat: add a prompt repository UI, custom utilities, and helpers by @impoiler in #1969
- feat: add a prompt management UI with a playground, versioning, and deployment features by @impoiler in #1970
- feat: Add a model parameters table and an API endpoint with in-memory caching by @impoiler in #2000
- helm fixes and helm test fixes by @akshaydeo in #2023
- fix: check OAuth status before treating popup closure as cancellation by @Pratham-Mishra04 in #2027
- feat: add TLS configuration support for all providers by @Radheshg04 in #1996
- feat: added passthrough support by @TejasGhatte in #1867
- feat: session stickiness in key selection by @TejasGhatte in #2021
- fix: tool call indexes in chat completions stream by @TejasGhatte in #1982
- feat: extend e2e ui tests by @Radheshg04 in #1839
- feat: enhance e2e API tests with DB verification and new endpoints by @Radheshg04 in #1899
- fix: reduce logstore serialization overhead and batch cost updates by @jerkeyray in #2025
- fix: map chat finish_reason to responses status and preserve terminal stream semantics by @jerkeyray in #1995
- cli-editor-docs-improvement by @akshaydeo in #1980
- added cursor docs by @akshaydeo in #1985
- feat: add image size and aspect ratio configuration for Gemini provider by @Radheshg04 in #2035
- docs: enhance CLI agents documentation by @Radheshg04 in #1991
- feat: add weighted routing targets for probabilistic routing rules with key selection support by @Pratham-Mishra04 in #2015
- feat: add Enabled field to ChatReasoning struct by @mango766 in #2040
- fix: set responses input messages in gen_ai.input.messages closes #1974 by @TejasGhatte in #2033
- fix: prevent SQLite CASCADE deletion during routing targets migration by @Pratham-Mishra04 in #2045
- async log write improvements by @akshaydeo in #2042
- remove in-memory storage of model parameters by @akshaydeo in #2043
- feat: add virtual key limit resets by @jerkeyray in #2047
- feat: refactor model catalog pricing engine with unified cost calculation by @Pratham-Mishra04 in #1800
- fix: remove resolution parameter from image generation and add MCP client config fields by @Pratham-Mishra04 in #2041
- fix: video pricing fixes by @TejasGhatte in #2044
- fix: test fixes and gemini tool name by @TejasGhatte in #2038
- fix: handle request body passthrough for count tokens endpoint for Anthropic and Vertex providers by @Pratham-Mishra04 in #2051
- fix: toolChoice silently dropped on Bedrock /converse and /converse-stream endpoints by @Pratham-Mishra04 in #2052
- refactor: replace BodyUncompressed() with streaming decoders for request decompression by @jerkeyray in #1973
- feat: add threshold-gated streaming request decompression with pooled readers by @danpiths in #2036
- cli with tabs, install flow, self-update flow by @akshaydeo in #2032
- cli test fixes by @akshaydeo in #2058
- feat: add Jinja2 variable support and API key selector to prompt UI by @impoiler in #2007
- feat: improve prompt UI with smaller loading dots, click-to-edit system messages, and provider/model validation by @impoiler in #2053
- go upgrade to 1.26.1 by @akshaydeo in #2060
- Add data-testid attributes to prompt UI components for testing by @impoiler in #2059
- fixed windows build issue by @akshaydeo in #2061
- feat: add quality-based image pricing by @Radheshg04 in #2055
- feat: anthropic passthrough by @TejasGhatte in #2057
- fix: improve MCP client reconnection with exponential backoff and connection timeout handling by @Pratham-Mishra04 in #2056
- adds edit sesion, fixes cursor missing in main tab for cli by @akshaydeo in #2063
- adds pii redactor OSS UI by @akshaydeo in #2066
- feat: improve prompt UI with version badges, change indicators, and refined diff detection by @impoiler in #2065
- responses on websockets and realtime api support by @akshaydeo in #1748
- feat: add size-to-resolution conversion for Replicate image models by @Radheshg04 in #2064
- Update documentation link in NotAvailableBanner to new quickstart URL by @impoiler in #2068
- fix: merge conflict by @TejasGhatte in #2069
- fix CORS issue with allowing * by @akshaydeo in #2067
- feat(providers): allow storing raw request/response without returning them to clients by @Vaibhav701161 in #1981
- fix: skip all body building in case of large payload flow in
getRequestBodyFor...functions by @danpiths in #2071 - compilation fix by @akshaydeo in #2072
- cli autoupdate and resize fixes by @akshaydeo in #2075
- fix: chooser update shortcut test phase by @akshaydeo in #2077
- fix(anthropic): deterministic tool schema serialization for prompt caching by @Edward-Upton in #2082
- rbac for prompt repository by @akshaydeo in #2083
- v1.4.12 release cut by @akshaydeo in #2084
- build fixes for 1.4.12 by @akshaydeo in #2085
- helm chart test fixes by @akshaydeo in #2086
- fix: framework configstore - avoid postgres cached-plan failure in migration backfill by @dannyball710 in #2087
- fixed race condition for test-bifrost-http by @akshaydeo in #2088
- fixed default value of store raw request response by @akshaydeo in #2089
- adds missing endpoints, timeout increase for http tests by @akshaydeo in #2090
New Contributors
- @nirzOps made their first contribution in #1946
- @Vaibhav701161 made their first contribution in #1934
- @ReStranger made their first contribution in #1858
- @17jmumford made their first contribution in #2018
- @mango766 made their first contribution in #2040
- @dannyball710 made their first contribution in #2087
Full Changelog: ent-v1.3.8-base...ent-v1.3.9-base