github coqui-ai/TTS v0.0.13

latest releases: v0.22.0, v0.21.3, v0.21.2...
3 years ago

🐸 v0.0.13

🐞Bug Fixes

πŸ’Ύ Code updates

  • SpeakerManager class for handling multi-speaker model management and interfacing speaker.json file.
  • Enabling multi-speaker models with tts and tts-server endpoints. (πŸ‘‘ @kirianguiller )
  • Allow choosing a different noise scale for GlowTTS at inference.
  • Glow-TTS updates to import SC-Glow Models.
  • Fixing windows support (πŸ‘‘ @WeberJulian )

πŸšΆβ€β™€οΈ Operational Updates

  • Refactoring 🐸 TTS installation and allow selecting different scopes (all, tf, notebooks)for installation depending on the specific needs.

πŸ… Model implementations

πŸš€ New Pre-Trained Model Releases

  • SC-GlowTTS multi-speaker English model from our work https://arxiv.org/abs/2104.05557 (πŸ‘‘ @Edresson )
  • HiFiGAN vocoder finetuned for the above model.
  • Tacotron DDC Non-Binary English model using Accenture's Sam dataset.
  • HiFiGAN vocoder trained for the models above.

Released Models

πŸ’‘ All the models below are available by tts or tts-server endpoints on CLI as explained here.

Models with ✨️ below are new with this release.

  • SC-GlowTTS model is from our latest paper in a collaboration with @Edresson and @mueller91.
  • The new non-binary TTS model is trained using the SAM dataset from Accenture Labs. Check out their blog post
Language Dataset Model Name Model Type TTS version Download
✨ English (non-binary) sam (acccenture) Tacotron2-DDC tts πŸ˜„ v0.0.13 πŸ’Ύ
✨ English (multi-speaker) VCTK SC-GlowTTS tts πŸ˜„ v0.0.13 πŸ’Ύ
English LJSpeech Tacotron-DDC tts v0.0.12 πŸ’Ύ
German Thorsten-DE Tacotron-DCA tts v0.0.11 πŸ’Ύ
German Thorsten-DE Wavegrad vocoder v0.0.11 πŸ’Ύ
English LJSpeech SpeedySpeech tts v0.0.10 πŸ’Ύ
English EK1 Tacotron2 tts v0.0.10 πŸ’Ύ
Dutch MAI TacotronDDC tts v0.0.10 πŸ’Ύ
Chinese Baker TacotronDDC-GST tts v0.0.10 πŸ’Ύ
English LJSpeech TacotronDCA tts v0.0.9 πŸ’Ύ
English LJSpeech Glow-TTS tts v0.0.9 πŸ’Ύ
Spanish M-AILabs TacotronDDC tts v0.0.9 πŸ’Ύ
French M_AILabs TacotronDDC tts v0.0.9 πŸ’Ύ
Dutch MAI TacotronDDC tts v0.0.10 πŸ’Ύ
✨ English sam (accenture) HiFiGAN vocoder πŸ˜„ v0.0.13 πŸ’Ύ
✨ English VCTK HiFiGAN vocoder πŸ˜„ v0.0.13 πŸ’Ύ
English LJSpeech HiFiGAN vocoder v0.0.12 πŸ’Ύ
English EK1 WaveGrad vocoder v0.0.10 πŸ’Ύ
Dutch MAI ParallelWaveGAN vocoder v0.0.10 πŸ’Ύ
English LJSpeech MB-MelGAN vocoder v0.0.9 πŸ’Ύ
🌍 Multi-Lang LibriTTS FullBand-MelGAN vocoder v0.0.9 πŸ’Ύ
🌍 Multi-Lang LibriTTS WaveGrad vocoder v0.0.9 πŸ’Ύ

Update Jun 7 2021: Ruslan (Russian) model has been removed due to the license conflict.

Don't miss a new TTS release

NewReleases is sending notifications on new releases.