Go to file
zhaokexin01 ca809df71f Update README.md 2020-03-05 08:47:07 +08:00
examples update train.py cli argument, load a wavenet model 2020-02-28 04:07:32 +00:00
images Add logo parakeet 2020-02-06 12:42:00 +08:00
notebooks fix import errors 2019-11-25 08:53:11 +00:00
parakeet Merge branch 'master' of upstream 2020-02-29 03:26:59 +00:00
tests add license 2020-02-26 21:03:51 +08:00
tools add license 2020-02-26 21:03:51 +08:00
.gitignore move data.py and rename some files 2020-02-16 17:54:11 +00:00
.pre-commit-config.yaml add license 2020-02-26 21:03:51 +08:00
LICENSE Update README 2020-02-27 04:21:52 +00:00
README.md Update README.md 2020-03-05 08:47:07 +08:00
setup.py add license 2020-02-26 21:03:51 +08:00

README.md

Parakeet

Parakeet aims to provide a flexible, efficient and state-of-the-art text-to-speech toolkit for the open-source community. It is built on PaddlePaddle Fluid dynamic graph and includes many influential TTS models proposed by Baidu Research and other research groups.

In particular, it features the latest [WaveFlow] (https://arxiv.org/abs/1912.01219) model proposed by Baidu Research. WaveFlow is a small-footprint generative flow for raw audio, which is directly trained with maximum likelihood. It generates high-fidelity speech as WaveNet, while synthesizing serval orders of magnitude faster as it only requires a few sequential steps to generate very long waveforms. Furthermore, it can significantly reduce the likelihood gap that has existed between autoregressive models and flow-based models for efficient synthesis. Finally, our small-footprint WaveFlow has only 5.9M parameters, which is 15 times smaller than WaveGlow. It can generate 22.05 kHz high-fidelity audio around 40 times faster than real-time on a V100 GPU without engineered inference kernels.


Setup

Make sure the library libsndfile1 is installed, e.g., on Ubuntu.

sudo apt-get install libsndfile1

Install PaddlePaddle

See install for more details. This repo requires paddlepaddle 1.7 or above.

Install Parakeet

# git clone this repo first
cd Parakeet
pip install -e .

Install CMUdict for nltk

CMUdict from nltk is used to transform text into phonemes.

import nltk
nltk.download("punkt")
nltk.download("cmudict")

Examples

Parakeet is provided under the Apache-2.0 license.