WaveRNN

PyTorch implementation of DeepMind's WaveRNN model from Efficient Neural Audio Synthesis.

Implementation Details

Currently, there are two models in this repo. The first is WaveRNN; however, it is quite slow to train (~7 days).

The good news is that I came up with another model that trains much faster and can handle the noise in predicted features from Tacotron and similar models. The sound quality is not as good as WaveNet, but it's not far off. You can listen to the samples here and judge for yourself.

Notebooks 1-3 are self-contained; notebooks 4a and 4b, however, need to be run sequentially. You can stop and close notebook 4b (training) whenever you like, and it will pick up where you left off.
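For readers curious how this kind of stop-and-resume training is usually done in PyTorch, here is a minimal sketch. It is not taken from notebook 4b: the checkpoint path, the function names `save_checkpoint` and `load_checkpoint`, and the stand-in `nn.Linear` model are all assumptions made for illustration. The general idea is to save the model weights, optimizer state, and step counter together, and to reload them at startup if a checkpoint file exists.

```python
import os

import torch
import torch.nn as nn

# Hypothetical checkpoint location; the notebook may use a different path.
CHECKPOINT_PATH = "checkpoints/model_latest.pt"


def save_checkpoint(model, optimizer, step, path=CHECKPOINT_PATH):
    """Write model weights, optimizer state, and the step counter to disk."""
    directory = os.path.dirname(path)
    if directory:
        os.makedirs(directory, exist_ok=True)
    torch.save(
        {
            "model": model.state_dict(),
            "optimizer": optimizer.state_dict(),
            "step": step,
        },
        path,
    )


def load_checkpoint(model, optimizer, path=CHECKPOINT_PATH):
    """Restore state if a checkpoint exists; return the step to resume from."""
    if not os.path.exists(path):
        return 0  # nothing saved yet, so training starts from scratch
    checkpoint = torch.load(path)
    model.load_state_dict(checkpoint["model"])
    optimizer.load_state_dict(checkpoint["optimizer"])
    return checkpoint["step"]


# Stand-in model for illustration; a real run would build the actual network.
model = nn.Linear(4, 1)
optimizer = torch.optim.Adam(model.parameters(), lr=1e-3)
start_step = load_checkpoint(model, optimizer)
```

A training loop would then start counting from `start_step` and call `save_checkpoint` every so often, so that interrupting the run loses at most the work done since the last save.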

Dependencies

Python 3
PyTorch v0.4
Librosa
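A quick way to check whether these dependencies are present before opening the notebooks is sketched below. It uses only the standard library; the helper name `missing_packages` is made up for this example, and `torch` and `librosa` are the usual import names for PyTorch and Librosa. It does not check the installed PyTorch version.

```python
import importlib.util
import sys

# Import names for the packages listed above (PyTorch is imported as "torch").
REQUIRED = ["torch", "librosa"]


def missing_packages(names):
    """Return the top-level module names that cannot be located."""
    return [name for name in names if importlib.util.find_spec(name) is None]


if sys.version_info[0] < 3:
    print("Python 3 is required")

missing = missing_packages(REQUIRED)
print("Missing packages:", ", ".join(missing) if missing else "none")
```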

Disclaimer: I do not represent or work for DeepMind/Google.

