Highlights

torchaudio includes new transforms (e.g. Griffin-Lim and inverse Mel scale), new filters (e.g. allpass, fade, bandpass, bandreject, band, treble, deemph, riaa), and new datasets (LJSpeech and SpeechCommands).

Backwards Incompatible Changes

torchaudio no longer supports Python 2. We removed the future and six imports and added inline typing. (#413, #478, #479, #482, #486)
We fixed the CommonVoice dataset download and updated the dataset to the latest version. (#498)
We now skip data points with missing data in the VCTK dataset. (#484)
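
As a small illustration of what inline typing looks like once Python 2 compatibility is dropped, here is a minimal sketch. The function `scale_samples` is a hypothetical helper invented for this example; it is not part of torchaudio.

```python
from typing import List, Optional


# Hypothetical helper, not a torchaudio function: annotations live directly
# in the signature instead of in Python 2 style "# type:" comments.
def scale_samples(samples: List[float], gain: Optional[float] = None) -> List[float]:
    """Multiply every sample by gain; a missing gain leaves samples unchanged."""
    factor = 1.0 if gain is None else gain
    return [s * factor for s in samples]


scaled = scale_samples([0.5, -0.25], gain=2.0)
```

Inline annotations are only valid syntax on Python 3, which is why this style goes hand in hand with dropping Python 2.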

New Features

We now have the Vol transform and DB_to_amplitude. (#468, #469)
We now have the InverseMelScale transform. (#448)
We now have the Griffin-Lim functional. (#365)
We now support allpass, fade, bandpass, bandreject, band, treble, deemph, riaa. (#444, #449, #464, #470, #508)
We now offer LJSpeech and SpeechCommands datasets. (#439, #437)
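
For readers unfamiliar with the decibel conversion that DB_to_amplitude performs, the underlying relation can be sketched in plain Python. The helper name `db_to_amplitude_scalar` and its scalar signature are assumptions made for this illustration; the library function operates on tensors and its exact signature should be checked in the torchaudio documentation.

```python
# Hypothetical scalar helper illustrating the conventional decibel relation;
# this is not torchaudio's implementation.
def db_to_amplitude_scalar(x_db: float, ref: float = 1.0, power: float = 1.0) -> float:
    """Convert a decibel value back to a linear scale.

    With power=1.0 the input is treated as a power-scale dB value
    (10 * log10); with power=0.5 the result is the matching amplitude.
    """
    return ref * (10.0 ** (0.1 * x_db)) ** power


linear_power = db_to_amplitude_scalar(20.0)                  # power ratio
linear_amplitude = db_to_amplitude_scalar(20.0, power=0.5)   # amplitude ratio
```

Under this convention, 20 dB corresponds to a power ratio of about 100 and an amplitude ratio of about 10.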

Improvements

We added inline typing to SoxEffects and Kaldi compliance. (#490, #497)
We refactored the tests. (#480, #485, #496, #491, #501, #502, #503, #506, #507, #509)
We now run tests with sox only when sox is available. (#419)
We extended batch support to MelScale, MelSpectrogram, MFCC, Resample. (#391, #435)
The speed of torchaudio.functional.istft was improved. (#471)
We now have transform and functional tests for AmplitudeToDB. (#463)
We now ignore PyCharm and OS X files in git. (#461)
TimeStretch now has a batch test. (#459)
Docstrings in transforms were polished. (#442)
TimeStretch and AmplitudeToDB are now torch.nn.Module. (#456)
Resample is now jitable. (#441)
We now support Python 3.8. (#397)
We added a CUDA test for complex norm. (#421)
Dither is jitable with the latest version of PyTorch. (#417)
Batching uses view instead of reshape. (#409)
We refactored the jitability test. (#395)
In .circleci, we removed a conditional block that wasn't doing anything. (#399)
We now have Windows CI for building. (#394, #398)
We corrected the use of standard variable names in code. (#393)
We adopted native-Python code generation convention. (#378)
torchaudio.functional.istft now creates tensors directly on the device. (#377)
torchaudio.compliance.kaldi.resample_waveform is now jitable. (#362)
The runtime of torchaudio.functional.lfilter was decreased. (#374)
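
To make the filtering items concrete: an IIR filter such as lfilter evaluates a difference equation, and biquad filters like allpass are second-order instances of it. The following is a minimal pure-Python sketch of that idea, not torchaudio's implementation. The function names are hypothetical, and the coefficient formulas follow the widely used audio EQ cookbook convention, which is assumed here rather than taken from these notes.

```python
import math


def allpass_coefficients(sample_rate, central_freq, q=0.707):
    """Cookbook-style all-pass biquad coefficients as ((b0, b1, b2), (a0, a1, a2))."""
    w0 = 2.0 * math.pi * central_freq / sample_rate
    alpha = math.sin(w0) / (2.0 * q)
    b = (1.0 - alpha, -2.0 * math.cos(w0), 1.0 + alpha)
    a = (1.0 + alpha, -2.0 * math.cos(w0), 1.0 - alpha)
    return b, a


def biquad_filter(samples, b, a):
    """Evaluate the second-order difference equation sample by sample (direct form I)."""
    x1 = x2 = y1 = y2 = 0.0  # previous inputs and outputs, initially silent
    out = []
    for x0 in samples:
        y0 = (b[0] * x0 + b[1] * x1 + b[2] * x2 - a[1] * y1 - a[2] * y2) / a[0]
        out.append(y0)
        x2, x1 = x1, x0
        y2, y1 = y1, y0
    return out


b, a = allpass_coefficients(16000, 1000.0)
impulse = [1.0] + [0.0] * 511
response = biquad_filter(impulse, b, a)
# An all-pass filter changes phase but not magnitude, so the impulse
# response should carry (almost exactly) unit energy.
energy = sum(v * v for v in response)
```

A per-sample Python loop like this is slow for long signals, which is the kind of cost that runtime work on a filter routine aims to reduce.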

Bug Fixes

We fixed flake8 errors. (#504, #505)
We fixed Windows test by only testing with cpu-only binaries. (#489)
We corrected spelling in the docstrings for transforms.FrequencyMasking and transforms.TimeMasking. (#474)
In .circleci, we switched to use token for conda uploads. (#460)
The default value of the dither parameter was changed. (#453)
TimeStretch moves device correctly. (#457)
We added the dev-other option to the LibriSpeech dataset. (#433)
In build script, we install the correct version of pytorch for pip. (#412)
We upgraded the dataset DeprecationWarning to a UserWarning so that the user actually sees the warning. (#402)
We made the power of the spectrogram a float so that it works with complex norm. (#392)
We fixed the random seed for the flaky test_griffinlim test. (#388)
We applied the 'nightly' branch filter to binary uploads. (#385)
We fixed build errors: added explicit utf8 decoration, added an explicit utf_8_encoder definition if not available, and explicitly cast to int. (#380)
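
The warning-category change above matters because Python's default filters hide DeprecationWarning raised from library code, while UserWarning is shown. A minimal sketch of that behaviour follows; the function `categories_shown` and the message text are made up for illustration, and the filter is set explicitly to mimic the default treatment of library code.

```python
import warnings


def categories_shown(category):
    """Return the warning categories that get through an 'ignore DeprecationWarning' filter."""
    with warnings.catch_warnings(record=True) as caught:
        warnings.resetwarnings()
        # Mimic Python's default behaviour for warnings raised outside __main__.
        warnings.simplefilter("ignore", DeprecationWarning)
        warnings.warn("this dataset is deprecated", category)
    return [w.category for w in caught]


hidden = categories_shown(DeprecationWarning)   # filtered out
visible = categories_shown(UserWarning)         # reaches the user
```

With this filter in place, the first call records nothing and the second records a single UserWarning.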

Deprecations

None

torchaudio 0.5.0 v0.5.0 on Python PyPI