github murtaza-nasir/speakr v0.8.0-alpha
v0.8.0 - Connector Architecture & REST API

latest releases: v0.8.2-alpha, v0.8.1-alpha
2 days ago

This release introduces a modular connector-based architecture for transcription and a full-featured REST API.

Connector Architecture

  • Modular Transcription - New connector-based system with auto-detection
  • Simplified Configuration - Fewer env vars; auto-detects from ASR_BASE_URL or TRANSCRIPTION_MODEL
  • OpenAI Diarization - Use gpt-4o-transcribe-diarize for speaker identification without self-hosting
  • Data-Driven UI - Features automatically appear based on connector capabilities
  • Connector-Aware Chunking - Chunking handled internally by connectors that support it

Available Connectors:

Connector Use Case
asr_endpoint Self-hosted WhisperX/Whisper ASR services
openai_transcribe OpenAI gpt-4o-transcribe models (with diarization option)
openai_whisper Legacy Whisper API (whisper-1)

Deprecated Variables:

  • USE_ASR_ENDPOINT=true → Just set ASR_BASE_URL instead
  • WHISPER_MODEL → Use TRANSCRIPTION_MODEL instead

REST API v1

  • Complete API - Full CRUD for recordings, tags, speakers, processing
  • Swagger UI - Interactive docs at /api/v1/docs
  • Stats Endpoint - Dashboard-compatible for gethomepage.dev
  • Batch Operations - Bulk update, delete, transcribe
  • Chat & Events API - Programmatic AI chat and calendar event access
  • Audio Download - Stream or download audio files

Documentation

  • Migration Guide - Update your configuration
  • API Reference - Complete endpoint documentation
  • Updated env examples in config/env.transcription.example

Compatibility

Fully backwards compatible. Existing configurations continue to work with deprecation warnings in logs.

Don't miss a new speakr release

NewReleases is sending notifications on new releases.