What's Changed
- Add spitch optional dependency by @temibabs in #2559
- add Cartesia STT usage event by @ChenghaoMou in #2565
- use the cgroup cpu_count for the inference thread pool by @theomonnom in #2572
- avoid possible contention on concurrent inference executions by @theomonnom in #2575
- use onnx dynamic_block_base by @theomonnom in #2578
- add vad for stt FallbackAdapter by @longcw in #2582
- Don't require sarvam api key param for TTS by @bcherry in #2579
- Remove unnecessary model param from baseten tts by @bcherry in #2568
- Fix baseten STT api key lookup by @bcherry in #2576
- fix stt fallback adapter imports by @longcw in #2590
- Replace the office-ambience sound file by @bcherry in #2588
- chore(deepgram,cartesia): removed AudioEnergyFilter by @davidzhao in #2594
- unit tests for agent session by @longcw in #2518
- fix unknown energy filter parameter by @theomonnom in #2599
- fix type check by @longcw in #2596
- wait for final transcript in manual turn detection by @longcw in #2597
- add volume gain option by @jmugicagonz in #2603
- increase audio frame size by @theomonnom in #2610
- Add SSML support for Google TTS by @kechako in #2608
- fix OpenAI Realtime connect timeout by @theomonnom in #2612
- fix OpenAI Realtime tool_choice by @theomonnom in #2613
- add transcript_confidence to ChatMessage by @theomonnom in #2611
- fix(turn-detector): improve accuracy by combining adjacent turns by @davidzhao in #2595
- fix transcription delay when VAD false negative by @longcw in #2620
- Hume plugin fixes by @zgreathouse in #2591
- Updating metrics for cached tokens for Realtime model (OpenAI) by @tg-bomze in #2621
- Disable ensure_ascii by @tg-bomze in #2622
- add timeout for agent session tests by @longcw in #2624
- add error log when llm fallback adapter failed because chunk_sent by @longcw in #2626
- fix ChatContext.insert type check by @theomonnom in #2635
- Removes the split_utterances option from Hume TTS plugin by @zgreathouse in #2638
- wait for video track from avatar plugins by @longcw in #2627
- add http_options for gemini LLM and realtime model by @longcw in #2640
- correctly passing speaking_rate to StreamingAudioConfig by @david-rodriguez in #2631
- Fix : Increase audio mixer timeout by @CyprienRicqueB2L in #2646
- handling multiple audio chunk output by @raghavjaistra in #2641
- Fix Hume TTS by @bcherry in #2639
- Update sarvam defaults, add 2.5 by @bcherry in #2618
- fix tracing param in openai realtime by @longcw in #2652
- raise error from gladia stt for fallback adapter and retry by @longcw in #2653
- fix await tasks groups never return by @longcw in #2654
- chore: add note for job_context.api usage by @davidzhao in #2655
- fix(google): update dependency versions by @davidzhao in #2658
- feat(baseten): add LLM module by @davidzhao in #2657
- cleanup tee in agent activity by @longcw in #2660
- fix duplicated audio on flush by @theomonnom in #2663
- fix transcription sync warning when gemini no text output by @longcw in #2661
- livekit-agents v1.1.2 by @theomonnom in #2664
New Contributors
- @tg-bomze made their first contribution in #2621
- @david-rodriguez made their first contribution in #2631
- @CyprienRicqueB2L made their first contribution in #2646
- @raghavjaistra made their first contribution in #2641
Full Changelog: https://github.com/livekit/agents/compare/livekit-agents@1.1.0...livekit-agents@1.1.2