livekit/agents livekit-agents@1.1.2 on GitHub

What's Changed

Add spitch optional dependency by @temibabs in #2559
add Cartesia STT usage event by @ChenghaoMou in #2565
use the cgroup cpu_count for the inference thread pool by @theomonnom in #2572
avoid possible contention on concurrent inference executions by @theomonnom in #2575
use onnx dynamic_block_base by @theomonnom in #2578
add vad for stt FallbackAdapter by @longcw in #2582
Don't require sarvam api key param for TTS by @bcherry in #2579
Remove unnecessary model param from baseten tts by @bcherry in #2568
Fix baseten STT api key lookup by @bcherry in #2576
fix stt fallback adapter imports by @longcw in #2590
Replace the office-ambience sound file by @bcherry in #2588
chore(deepgram,cartesia): removed AudioEnergyFilter by @davidzhao in #2594
unit tests for agent session by @longcw in #2518
fix unknown energy filter parameter by @theomonnom in #2599
fix type check by @longcw in #2596
wait for final transcript in manual turn detection by @longcw in #2597
add volume gain option by @jmugicagonz in #2603
increase audio frame size by @theomonnom in #2610
Add SSML support for Google TTS by @kechako in #2608
fix OpenAI Realtime connect timeout by @theomonnom in #2612
fix OpenAI Realtime tool_choice by @theomonnom in #2613
add transcript_confidence to ChatMessage by @theomonnom in #2611
fix(turn-detector): improve accuracy by combining adjacent turns by @davidzhao in #2595
fix transcription delay when VAD false negative by @longcw in #2620
Hume plugin fixes by @zgreathouse in #2591
Updating metrics for cached tokens for Realtime model (OpenAI) by @tg-bomze in #2621
Disable ensure_ascii by @tg-bomze in #2622
add timeout for agent session tests by @longcw in #2624
add error log when llm fallback adapter failed because chunk_sent by @longcw in #2626
fix ChatContext.insert type check by @theomonnom in #2635
Removes the split_utterances option from Hume TTS plugin by @zgreathouse in #2638
wait for video track from avatar plugins by @longcw in #2627
add http_options for gemini LLM and realtime model by @longcw in #2640
correctly passing speaking_rate to StreamingAudioConfig by @david-rodriguez in #2631
Fix : Increase audio mixer timeout by @CyprienRicqueB2L in #2646
handling multiple audio chunk output by @raghavjaistra in #2641
Fix Hume TTS by @bcherry in #2639
Update sarvam defaults, add 2.5 by @bcherry in #2618
fix tracing param in openai realtime by @longcw in #2652
raise error from gladia stt for fallback adapter and retry by @longcw in #2653
fix await tasks groups never return by @longcw in #2654
chore: add note for job_context.api usage by @davidzhao in #2655
fix(google): update dependency versions by @davidzhao in #2658
feat(baseten): add LLM module by @davidzhao in #2657
cleanup tee in agent activity by @longcw in #2660
fix duplicated audio on flush by @theomonnom in #2663
fix transcription sync warning when gemini no text output by @longcw in #2661
livekit-agents v1.1.2 by @theomonnom in #2664

New Contributors

@tg-bomze made their first contribution in #2621
@david-rodriguez made their first contribution in #2631
@CyprienRicqueB2L made their first contribution in #2646
@raghavjaistra made their first contribution in #2641

Full Changelog: https://github.com/livekit/agents/compare/livekit-agents@1.1.0...livekit-agents@1.1.2