Added
- Added support for the OpenAI Realtime API with the new `OpenAILLMServiceRealtimeBeta` processor. (see https://platform.openai.com/docs/guides/realtime/overview)
- Added `RTVIBotTranscriptionProcessor`, which sends the RTVI `bot-transcription` protocol message. These messages contain TTS text aggregated into sentences.
- Added new input params to the `MarkdownTextFilter` utility. You can set `filter_code` to filter code from text and `filter_tables` to filter tables from text.
- Added `CanonicalMetricsService`. This processor uses the new `AudioBufferProcessor` to capture conversation audio and later send it to Canonical AI. (see https://canonical.chat/)
- Added `AudioBufferProcessor`. This processor can be used to buffer mixed user and bot audio, which can later be saved to an audio file or processed by an audio analyzer.
- Added `on_first_participant_joined` event to `LiveKitTransport`.
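
To give a rough idea of what the `filter_code` and `filter_tables` options are meant to do, here is a minimal standard-library sketch. It is not Pipecat's implementation: `filter_markdown` and both regular expressions are hypothetical, and real Markdown has edge cases (indented code, tables without outer pipes) that this ignores.

```python
import re

# Hypothetical helper, NOT part of Pipecat. It only mimics the intent of the
# filter_code / filter_tables options described above.
FENCE = "`" * 3  # a Markdown code fence, built indirectly to keep this block readable

# A fenced code block: an opening fence at line start through the closing fence.
FENCED_BLOCK_RE = re.compile(
    rf"^{FENCE}.*?^{FENCE}[ \t]*\n?", re.DOTALL | re.MULTILINE
)
# A table row: a line that starts and ends with a pipe character.
TABLE_ROW_RE = re.compile(r"^[ \t]*\|.*\|[ \t]*\n?", re.MULTILINE)


def filter_markdown(
    text: str, filter_code: bool = False, filter_tables: bool = False
) -> str:
    """Drop fenced code blocks and/or pipe tables from Markdown text."""
    if filter_code:
        text = FENCED_BLOCK_RE.sub("", text)
    if filter_tables:
        text = TABLE_ROW_RE.sub("", text)
    return text
```

With both flags set, only the surrounding prose is kept, which is usually what you want before handing LLM output to a TTS service.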
Changed
- LLM text responses are now logged properly as Unicode characters.
- `UserStartedSpeakingFrame`, `UserStoppedSpeakingFrame`, `BotStartedSpeakingFrame`, `BotStoppedSpeakingFrame`, `BotSpeakingFrame` and `UserImageRequestFrame` are now based on `SystemFrame`.
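
One practical effect of the base-class change is that code dispatching on frame type with `isinstance` will now treat these frames as system frames. The sketch below uses stand-in classes to show the idea; they are not Pipecat's real definitions, and `OtherFrame` and `is_system_frame` are hypothetical names.

```python
# Stand-in classes for illustration only. Pipecat's real frame classes are
# defined in the library itself and carry fields that are omitted here.
class Frame:
    pass


class SystemFrame(Frame):
    pass


class UserStartedSpeakingFrame(SystemFrame):
    """After this change, speaking frames derive from SystemFrame."""


class OtherFrame(Frame):
    """Hypothetical frame that does not derive from SystemFrame."""


def is_system_frame(frame: Frame) -> bool:
    # Any check like this now also matches the frames listed above.
    return isinstance(frame, SystemFrame)
```

If a custom processor branches on `SystemFrame`, it may be worth checking that the branch still does what you expect for the speaking and image-request frames.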
Fixed
- Merged `RTVIBotLLMProcessor`/`RTVIBotLLMTextProcessor` and `RTVIBotTTSProcessor`/`RTVIBotTTSTextProcessor` to avoid out-of-order issues.
- Fixed an issue in the RTVI protocol that could cause a `bot-llm-stopped` or `bot-tts-stopped` message to be sent before a `bot-llm-text` or `bot-tts-text` message.
- Fixed `DeepgramSTTService` constructor settings not being merged with the default ones.
- Fixed an issue in the Daily transport that would cause tasks to hang if urgent transport messages were sent from a transport event handler.
- Fixed an issue in `BaseOutputTransport` that would cause `EndFrame` to be pushed downstream too early and call `FrameProcessor.cleanup()` before letting the transport stop properly.
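
For the `DeepgramSTTService` settings fix, the expected behavior is that values passed to the constructor override the defaults while unspecified defaults are kept. A minimal sketch of that merge pattern follows. The `merge_settings` helper and the default values are assumptions made for illustration, not the service's actual code or defaults.

```python
from typing import Any, Dict, Optional

# Illustrative values only; these are NOT DeepgramSTTService's real defaults.
DEFAULT_SETTINGS: Dict[str, Any] = {
    "language": "en",
    "interim_results": True,
    "punctuate": True,
}


def merge_settings(overrides: Optional[Dict[str, Any]] = None) -> Dict[str, Any]:
    """Return the defaults with caller-supplied values layered on top."""
    merged = dict(DEFAULT_SETTINGS)  # copy so the defaults are never mutated
    merged.update(overrides or {})   # caller values win on key conflicts
    return merged
```

Calling `merge_settings({"language": "es"})` changes only `language` and leaves the other defaults in place, which is the behavior the fix describes.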