altunenes/parakeet-rs v0.2.1 on GitHub

What's Changed

Major Update

Sortformer (end-to-end neural network):

Audio → NN → 4 speaker probabilities (0.0-1.0) → threshold at > 0.5 → model outputs (fixed 4 channels)
The model runs very fast on the CPU (tested on my Mac M3 so far), but additional processing may be required for very long audio files to avoid RAM bottlenecks.
See the example (diarization + TDT): examples/diarization.rs
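
To illustrate the thresholding step in the pipeline above, here is a minimal, standalone sketch. It does not use the parakeet-rs API: `Segment`, `probs_to_segments`, and the `frame_sec` parameter are hypothetical names made up for this example, and the per-frame layout (`[f32; 4]`, one probability per speaker) is an assumption based on the "fixed 4 channels" description. See examples/diarization.rs for the crate's actual usage.

```rust
/// Number of speaker channels (fixed at 4, per the release notes).
const NUM_SPEAKERS: usize = 4;

/// Hypothetical type for this sketch: a span during which one speaker is active.
#[derive(Debug, Clone, PartialEq)]
pub struct Segment {
    pub speaker: usize,
    pub start_sec: f32,
    pub end_sec: f32,
}

/// Hypothetical helper: turns per-frame speaker probabilities into segments.
/// A speaker counts as active in a frame when its probability is strictly
/// greater than `threshold`. `frame_sec` is the assumed duration of one frame.
pub fn probs_to_segments(
    frames: &[[f32; NUM_SPEAKERS]],
    frame_sec: f32,
    threshold: f32,
) -> Vec<Segment> {
    let mut segments = Vec::new();

    for speaker in 0..NUM_SPEAKERS {
        // Index of the frame where the current active run began, if any.
        let mut run_start: Option<usize> = None;

        for (i, frame) in frames.iter().enumerate() {
            let active = frame[speaker] > threshold;
            match (active, run_start) {
                // Speaker just became active: open a run.
                (true, None) => run_start = Some(i),
                // Speaker just went silent: close the run at this frame.
                (false, Some(start)) => {
                    segments.push(Segment {
                        speaker,
                        start_sec: start as f32 * frame_sec,
                        end_sec: i as f32 * frame_sec,
                    });
                    run_start = None;
                }
                _ => {}
            }
        }

        // Close a run that is still open at the end of the audio.
        if let Some(start) = run_start {
            segments.push(Segment {
                speaker,
                start_sec: start as f32 * frame_sec,
                end_sec: frames.len() as f32 * frame_sec,
            });
        }
    }

    // Order segments by start time; overlapping speech stays as separate segments.
    segments.sort_by(|a, b| a.start_sec.total_cmp(&b.start_sec));
    segments
}
```

Because each speaker channel is thresholded independently, overlapping speech simply produces overlapping segments, with no clustering step involved.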

Note: Sortformer is currently available only behind a feature flag:

parakeet-rs = { version = "0.2.1", features = ["sortformer"] }

Full Changelog: v0.1.8...v0.2.1