pypi torchaudio 0.5.0

4 years ago

Highlights

torchaudio includes new transforms (e.g. Griffin-Lim and inverse Mel scale), new filters (e.g. all pass, fade, band pass/reject, band, treble, deemph, riaa), and datasets (LJ Speech and SpeechCommands).

Backwards Incompatible Changes

  • torchaudio no longer supports Python 2. We removed future and six imports and added inline typing. (#413, #478, #479, #482, #486)
  • We fixed the CommonVoice dataset download and updated it to the latest version. (#498)
  • We now skip data points with missing data in the VCTK dataset. (#484)
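
For readers migrating their own code, the Python 2 removal can be pictured with the sketch below. The function frame_count is a hypothetical helper, not part of torchaudio, and the commented "before" form is only an assumption about what Python 2 compatible code typically looked like.

```python
# Before (Python 2/3 compatible style), kept here as comments only:
#   from __future__ import absolute_import, division, print_function
#   import six
#
#   def frame_count(num_samples, hop_length):
#       # type: (int, int) -> int
#       ...


# After: Python 3 only, with inline type annotations.
def frame_count(num_samples: int, hop_length: int) -> int:
    """Hypothetical helper: number of hop positions that fit in a signal."""
    if hop_length <= 0:
        raise ValueError("hop_length must be positive")
    # Integer floor division needs no __future__ import on Python 3.
    return num_samples // hop_length + 1
```

Code that still has to run on Python 2 would need to stay on an earlier torchaudio release.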

New Features

  • We now have the Vol transform and DB_to_amplitude. (#468, #469)
  • We now have the InverseMelScale transform. (#448)
  • We now have the Griffin-Lim functional. (#365)
  • We now support the allpass, fade, bandpass, bandreject, band, treble, deemph, and riaa filters. (#444, #449, #464, #470, #508)
  • We now offer LJSpeech and SpeechCommands datasets. (#439, #437)
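
As a rough illustration of what DB_to_amplitude computes, the sketch below applies the usual decibel inversion to a single Python float. It assumes the formula ref * (10 ** (0.1 * x)) ** power; the real functional operates on tensors, and the default values for ref and power used here are assumptions made for the example.

```python
def db_to_amplitude(x_db: float, ref: float = 1.0, power: float = 0.5) -> float:
    """Scalar sketch of converting a decibel value back to a linear scale.

    With power=1.0 the result is on the power scale; with power=0.5 it is
    on the amplitude scale. The defaults are assumptions for this example.
    """
    return ref * (10.0 ** (0.1 * x_db)) ** power


# 20 dB corresponds to a factor of 100 in power, or 10 in amplitude.
print(db_to_amplitude(20.0, power=1.0))
print(db_to_amplitude(20.0, power=0.5))
```

Check the torchaudio documentation for the exact argument order and tensor semantics before relying on this.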

Improvements

  • We added inline typing to SoxEffects and Kaldi compliance. (#490, #497)
  • We refactored the tests. (#480, #485, #496, #491, #501, #502, #503, #506, #507, #509)
  • We now run tests with sox only when sox is available. (#419)
  • We extended batch support to MelScale, MelSpectrogram, MFCC, Resample. (#391, #435)
  • The speed of torchaudio.functional.istft was improved. (#471)
  • We now have transform and functional tests for AmplitudeToDB. (#463)
  • We now ignore PyCharm and OS X files in git. (#461)
  • TimeStretch now has a batch test. (#459)
  • Docstrings in transforms were polished. (#442)
  • TimeStretch and AmplitudeToDB are now torch.nn.Module. (#456)
  • Resample is now jitable. (#441)
  • We now support Python 3.8. (#397)
  • We added a CUDA test for complex norm. (#421)
  • Dither is jitable with the latest version of PyTorch. (#417)
  • Batching uses view instead of reshape. (#409)
  • We refactored the jitability test. (#395)
  • In .circleci, we removed a conditional block that wasn't doing anything. (#399)
  • We now have Windows CI for building. (#394, #398)
  • We corrected the use of standard variable names in code. (#393)
  • We adopted the native-Python code generation convention. (#378)
  • torchaudio.istft creates tensors directly on device. (#377)
  • torchaudio.compliance.kaldi.resample_waveform is now jitable. (#362)
  • The runtime of torchaudio.functional.lfilter was decreased. (#374)

Bug Fixes

  • We fixed flake8 errors. (#504, #505)
  • We fixed the Windows test by testing only with CPU-only binaries. (#489)
  • We corrected spelling in the docstrings for transforms.FrequencyMasking and transforms.TimeMasking. (#474)
  • In .circleci, we switched to using a token for conda uploads. (#460)
  • The default value of the dither parameter was changed. (#453)
  • TimeStretch moves device correctly. (#457)
  • We added the dev-other option to the LibriSpeech dataset. (#433)
  • In the build script, we now install the correct version of PyTorch for pip. (#412)
  • We upgraded the dataset DeprecationWarning to a UserWarning so that users actually see the warning. (#402)
  • We made the power of the spectrogram a float so that it works with complex norm. (#392)
  • We fixed the random seed for the flaky test_griffinlim test. (#388)
  • We applied the 'nightly' branch filter to binary uploads. (#385)
  • We fixed build errors by explicitly adding utf8 decoration, adding an explicit utf_8_encoder definition when one is not available, and explicitly casting to int. (#380)
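
The warning change in #402 matters because Python's default filters hide DeprecationWarning unless it is triggered directly by code in __main__, while UserWarning is shown. The sketch below emulates that filter setup with the standard warnings module; load_legacy_dataset is a hypothetical stand-in, not a torchaudio function.

```python
import warnings


def load_legacy_dataset(category):
    # Hypothetical stand-in for a deprecated dataset entry point.
    warnings.warn("this dataset interface is deprecated", category)


def messages_seen(category):
    """Return the warning messages that get past a default-like filter set."""
    with warnings.catch_warnings(record=True) as caught:
        warnings.resetwarnings()
        warnings.simplefilter("always")
        # Emulate the default behaviour for library code, which hides
        # DeprecationWarning from end users.
        warnings.simplefilter("ignore", DeprecationWarning)
        load_legacy_dataset(category)
    return [str(w.message) for w in caught]


print(messages_seen(DeprecationWarning))  # filtered out
print(messages_seen(UserWarning))         # reaches the user
```

Users who prefer silence can still filter UserWarning themselves with warnings.filterwarnings.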

Deprecations

  • None
