pypi TTS 0.2.0
v0.2.0

latest releases: 0.22.0, 0.21.3, 0.21.2...
3 years ago

๐Ÿธ v0.2.0

๐ŸžBug Fixes

  • Fix phoneme pre-compute issue.
  • Fix multi-speaker setup in Tacotron models.
  • Fix small issues in the Trainer regarding multi-optimizer training.

๐Ÿ’พ Code updates

  • W&B integration for model logging and experiment tracking, (๐Ÿ‘‘ @AyushExel)
    Code uses the Tensorboard by default. For W&B, you need to set log_dashboard option in the config and define project_name and wandb_entity.
  • Use ffsspec for model saving/loading (๐Ÿ‘‘ @agrinh)
  • Allow models to define their own symbol list with in-class make_symbols()
  • Allow choosing after epoch or after step LR scheduler update with scheduler_after_epoch.
  • Make converting spectrogram from amplitude to DB optional with do_amp_to_db_linear and do_amp_to_db_linear options.

๐Ÿ—’๏ธ Docs updates

  • Add GlowTTS and VITS docs.

๐Ÿค– Model implementations

๐Ÿš€ Model releases

  • vocoder_models--ja--kokoro--hifigan_v1 (๐Ÿ‘‘ @kaiidams)

    HiFiGAN model trained on Kokoro dataset to complement the existing Japanese model.

    Try it out:

    tts --model_name tts_models/ja/kokoro/tacotron2-DDC --text "ใ“ใ‚“ใซใกใฏใ€ไปŠๆ—ฅใฏใ„ใ„ๅคฉๆฐ—ใงใ™ใ‹๏ผŸ"
  • tts_models--en--ljspeech--tacotronDDC_ph

    TacotronDDC with phonemes trained on LJSpeech. It is to fix the pronunciation errors caused by the raw text
    in the released TacotronDDC model.

    Try it out:

    tts --model_name tts_models/en/ljspeech/tacotronDDC_ph --text "hello, how are you today?"
  • tts_models--en--ljspeech--vits

    VITS model trained on LJSpeech.

    Try it out:

    tts --model_name tts_models/en/ljspeech/vits --text "hello, how are you today?"
  • tts_models--en--vctk--vits

    VITS model trained on VCTK with multi-speaker support.

    Try it out:

    tts-server --model_name tts_models/en/vctk/vits     
  • vocoder_models--en--ljspeech--univnet

    UnivNet model trained on LJSpeech to complement the TacotronDDC model above.

    Try it out:

    tts --model_name tts_models/en/ljspeech/tacotronDDC_ph --text "hello, how are you today?"

Don't miss a new TTS release

NewReleases is sending notifications on new releases.