Text-to-Speech-using-Tacotron2

Converting text to audio and applying audio augmentation

Topic

In this notebook I will experiment with speech synthesis from text using Tacotron2 which is a deep neural network that uses sequence to sequence architecture and which produces a mel spectromgram out of a text and then converts it to audio using Vocoder. After extracting the audio I will then use a series of audio augmentation techniques to make the sound more natural. So let's get started !

Objectives

Convert text to audio
Use audio augmentation to enhance the extracted audio

Summary

Importing libraries
Text to speech
Adding white noise
Time stretching
Pitch scaling
Inverting polarity
Random gain
Conclusion

Libraries

Torchaudio
Numpy
Ipython
Librosa
Matplotlib

Name		Name	Last commit message	Last commit date
Latest commit History 13 Commits
README.md		README.md
Text-to-Speech using Tacotron2.html		Text-to-Speech using Tacotron2.html
Text-to-Speech using Tacotron2.ipynb		Text-to-Speech using Tacotron2.ipynb
exp_1.wav		exp_1.wav
gained.wav		gained.wav
inverted.wav		inverted.wav
noised.wav		noised.wav
pitch_scaled.wav		pitch_scaled.wav
stretched.wav		stretched.wav

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Text-to-Speech-using-Tacotron2

Topic

Objectives

Summary

Libraries

About

Releases

Packages

Languages

imane-ayouni/Text-to-Speech-using-Tacotron2

Folders and files

Latest commit

History

Repository files navigation

Text-to-Speech-using-Tacotron2

Topic

Objectives

Summary

Libraries

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages