github livekit/agents livekit-agents@1.5.2

9 hours ago

Note

livekit-agents 1.5 introduced many new features. You can check out the changelog here.

What's Changed

  • Update Phonic generate_reply timeout to 10 seconds by @qionghuang6 in #5205
  • fix: pass prometheus_multiproc_dir in from_server_options initialization by @ivanbalingit in #5195
  • feat(mistralai): upgrade to SDK v2 by @Pauldevillers in #5163
  • (deepgram sttv2): validate eager_eot_threshold value by @tinalenguyen in #5216
  • Add WebSocket streaming support to Baseten TTS plugin by @iancarrasco-b10 in #4741
  • fix: allow codec format specification via the user for Sarvam TTS by @pUrGe12 in #5209
  • emit agent handoff from conversation_item_added by @tinalenguyen in #5218
  • fix(llm): surface validation error details to LLM on function call argument failures by @Lyt060814 in #5193
  • fix(cli): update log level width in console mode by @chenghao-mou in #5224
  • fix(utils): preserve type annotations in deprecate_params by @longcw in #5200
  • fix(test): replace oai with deepgram and fix broken tests by @chenghao-mou in #5225
  • feat(voice): reuse STT connection across agent handoffs by @longcw in #5093
  • feat(google): add VertexRAGRetrieval provider tool by @youpesh in #5222
  • fix: ensure MCP client enter/exit run in the same task by @longcw in #5223
  • feat(assemblyai): add domain parameter for Medical Mode by @m-ods in #5208
  • fix: Nova Sonic interactive context bugs and dynamic tool support by @prettyprettyprettygood in #5220
  • (google realtime): add gemini-3.1-flash-live-preview model by @tinalenguyen in #5233
  • fix(utils): improve type annotation for deprecate_params decorator by @longcw in #5244
  • fix: expose endpointing_opts in AgentSession.update_options() by @longcw in #5243
  • Fix/stt fallback adapter propagate aligned transcript by @miladmnasr in #5237
  • feat(mistral): add voxtral TTS support by @jeanprbt in #5245
  • feat(anthropic): support strict tool use schema by @roshan-shaik-ml in #5259
  • Baseten Plugin Update: fix metadata schema, add chain_id support, and improve response parsing by @jiegong-fde in #4889
  • feat(upliftai): add support for phrase replacement config id by @zaidqureshi2 in #5261
  • feat(soniox): expose max_endpoint_delay_ms option by @pstrav in #5214
  • fix: prevent TTS retry after partial audio and replay input on retry by @longcw in #5242
  • fix: only start session host when it's primary session by @longcw in #5241
  • fix: prevent CancelledError from propagating to unrelated Tee peers by @longcw in #5273
  • fix: prevent AttributeError in ThreadJobExecutor.logging_extra() by @longcw in #5277
  • fix(openai): close current generation channels on realtime reconnect by @longcw in #5276
  • fix(recorder): guard against empty agent speech frames by @chenghao-mou in #5279
  • fix(stt): reset VAD when STT sends EOT by @chenghao-mou in #5095
  • feat(anam): add avatarModel config support by @sr-anam in #5272
  • fix: catch TimeoutError from drain() so aclose() always runs by @seglo in #5282
  • (gemini-3.1-flash-live-preview): add warning for generate_reply by @tinalenguyen in #5286
  • feat(mistralai): add ref_audio support to Voxtral TTS for zero-shot voice cloning by @EtienneLescot in #5278
  • fix(core): reset user state to listening when audio is disabled by @chenghao-mou in #5198
  • append generate_reply instructions as system msg and convert it to user msg if unsupported by @longcw in #5287
  • add AsyncToolset by @longcw in #5127
  • fix(core): fix BackgroundAudioPlayer.play() hanging indefinitely by @theomonnom in #5299
  • fix(cli): prevent api_key/api_secret from leaking in tracebacks by @theomonnom in #5300
  • (phonic) Update languages fields by @qionghuang6 in #5285
  • fix(core): reduce TTS output buffering latency by @theomonnom in #5292
  • add session_end_timeout and gracefully cancel entrypoint on shutdown by @theomonnom in #4580
  • feat: OTEL metrics for latencies, usage, and connection timing by @theomonnom in #4891
  • evals: custom judges, tag metadata, and OTEL improvements by @theomonnom in #5306
  • fix is_context_type for generic RunContext types by @theomonnom in #5307
  • add 7-day uv cooldown by @chenghao-mou in #5290
  • fix(openai realtime): support per-response tool_choice in realtime sessions by @longcw in #5211
  • use delta aggregation temporality for otel metrics by @paulwe in #5314
  • (phonic) Add min_words_to_interrupt to Phonic plugin options by @qionghuang6 in #5304
  • add tag field to evaluation OTEL log records by @theomonnom in #5315
  • docs: add example agent replies to AsyncToolset by @longcw in #5313
  • fix(cartesia): handle flush_done message in TTS _recv_task by @Panmax in #5321
  • fix(voice): make function call history preservation configurable in AgentTask by @GopalGB in #5288
  • fix: convert oneOf to anyOf in strict schema for discriminated unions by @longcw in #5324
  • (gemini realtime): add warnings in update_chat_ctx and update_instructions by @tinalenguyen in #5332
  • fix: wait_for_participant waits until participant is fully active by @davidzhao in #5271
  • feat: answering machine detection by @chenghao-mou in #4906
  • feat: expose service_tier in CompletionUsage from OpenAI Responses API by @piyush-gambhir in #5341
  • fix: add PARTICIPANT_KIND_CONNECTOR to default participant kinds by @anunaym14 in #5339
  • feat/sarvam-llm-openai-compatible-integration by @dhruvladia-sarvam in #5069
  • feat(azure-stt): Possibility to change segmentation options during a call by @rafallezanko in #5323
  • fix(sarvam): sync missing API params, fix value ranges, and update models by @Namit1867 in #5347
  • (xai tts): update fields and ws setup by @tinalenguyen in #5350
  • fix(smallestai): add lightning-v3.1 endpoint routing by @sg-siddhant in #5330
  • feat(inference): add debug/identification headers to inference requests by @adrian-cowham in #5337
  • Move community plugins to livekit-plugins/community/ by @theomonnom in #5250
  • feat: support per-response tools in generate_reply by @longcw in #5310
  • fix xAI realtime update chat ctx by @longcw in #5320
  • Fix RoomIO teardown listener cleanup by @sindarknave in #5357
  • feat(mistral): support voxtral realtime streaming stt & modernize mistral plugin by @jeanprbt in #5289
  • fix: say() with missing audio file hangs forever and blocks speech queue by @theomonnom in #5358
  • add prompt_cache_retention chat completion option to inference by @s-hamdananwar in #5370
  • Add Murf as optional dep by @royalfig in #5334
  • feat(core): Support multiple provider keys in extra_content serialization by @adrian-cowham in #5374
  • ci: add PyPI publish workflow with trusted publishing by @theomonnom in #5379
  • feat: Add D-ID avatar plugin by @osimhi213 in #5232
  • ci: fix tag checkout and discover glob by @theomonnom in #5381
  • feat(rime): add mistv3 model support by @mcullan in #5298
  • ci: fix update_versions.py invocation by @theomonnom in #5382
  • ci: remove release label from publish workflow by @theomonnom in #5384
  • require livekit-protocol>=1.1.5, implement get_framework_info by @theomonnom in #5385
  • ci: fix build permissions and tag format by @theomonnom in #5386
  • ci: fix version read in publish workflow by @theomonnom in #5388
  • ci: use livekit-agents@version for release PR title by @theomonnom in #5390
  • fix: minimax optional dep not bumped by update_versions.py by @theomonnom in #5392
  • livekit-agents@1.5.2 by @github-actions[bot] in #5391

New Contributors

Full Changelog: https://github.com/livekit/agents/compare/livekit-agents@1.5.1...livekit-agents@1.5.2

Don't miss a new agents release

NewReleases is sending notifications on new releases.