Added
-
Added
RTVIProcessorwhich implements the RTVI-AI standard.
See https://github.com/rtvi-ai -
Added
BotInterruptionFramewhich allows interrupting the bot while talking. -
Added
LLMMessagesAppendFramewhich allows appending messages to the current LLM context. -
Added
LLMMessagesUpdateFramewhich allows changing the LLM context for the one provided in this new frame. -
Added
LLMModelUpdateFramewhich allows updating the LLM model. -
Added
TTSSpeakFramewhich causes the bot say some text. This text will not be part of the LLM context. -
Added
TTSVoiceUpdateFramewhich allows updating the TTS voice.
Removed
- We remove the
LLMResponseStartFrameandLLMResponseEndFrameframes. These were added in the past to properly handle interruptions for theLLMAssistantContextAggregator. But theLLMContextAggregatoris now based onLLMResponseAggregatorwhich handles interruptions properly by just processing theStartInterruptionFrame, so there's no need for these extra frames any more.
Fixed
-
Fixed an issue with
StatelessTextTransformerwhere it was pushing a string instead of aTextFrame. -
TTSServiceend of sentence detection has been improved. It now works with acronyms, numbers, hours and others. -
Fixed an issue in
TTSServicethat would not properly flush the current aggregated sentence if anLLMFullResponseEndFramewas found.
Performance
CartesiaTTSServicenow uses websockets which improves speed. It also leverages the new Cartesia contexts which maintains generated audio prosody when multiple inputs are sent, therefore improving audio quality a lot.