# Parakeet
Parakeet aims to provide a flexible, efficient, and state-of-the-art text-to-speech toolkit for the open-source community. It is built on the Paddle Fluid dynamic graph and supports many influential TTS models proposed by [Baidu Research](http://research.baidu.com) and other academic institutions.
<div align="center">
<img src="images/logo.png" width=450 /><br>
</div>

## Setup
Make sure the `libsndfile1` library is installed. For example, on Ubuntu:
```bash
sudo apt-get install libsndfile1
```
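
To confirm that the library is visible after installation, a quick check from Python can help. This is a minimal sketch using only the standard library; the helper name `libsndfile_available` is hypothetical and not part of Parakeet.

```python
from ctypes.util import find_library


def libsndfile_available():
    """Return True if ctypes can locate a libsndfile shared library."""
    # find_library returns the library name if found, otherwise None.
    return find_library("sndfile") is not None


if __name__ == "__main__":
    print("libsndfile found:", libsndfile_available())
```

If this prints `False`, the package is probably missing or installed in a location the system's library search does not cover.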
### Install PaddlePaddle
See the [PaddlePaddle installation guide](https://www.paddlepaddle.org.cn/install/quick) for more details. This repo requires a PaddlePaddle version above 1.7.
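
The version requirement can be checked from Python once PaddlePaddle is installed. The sketch below is illustrative only: `parse_version` and `meets_minimum` are hypothetical helpers, and it assumes that "above 1.7" means 1.7 or any newer release.

```python
import re


def parse_version(version):
    """Turn a string such as '1.7.2' into a tuple of its leading integers."""
    parts = []
    for piece in version.split("."):
        match = re.match(r"\d+", piece)
        if match is None:
            break  # stop at the first non-numeric component
        parts.append(int(match.group()))
    return tuple(parts)


def meets_minimum(version, minimum=(1, 7)):
    """Assumption: the requirement is satisfied by 1.7 and anything newer."""
    return parse_version(version) >= minimum


if __name__ == "__main__":
    try:
        import paddle
    except ImportError:
        print("PaddlePaddle is not installed")
    else:
        print(paddle.__version__, "OK" if meets_minimum(paddle.__version__) else "too old")
```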
### Install Parakeet
```bash
# git clone this repo first
cd Parakeet
pip install -e .
```
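
After the editable install, it can be useful to verify that the package is importable from the current environment. This is a small sketch, assuming the importable package is named `parakeet` (the README does not state the module name); `is_installed` is a hypothetical helper.

```python
from importlib.util import find_spec


def is_installed(module_name):
    """Return True if a top-level module with this name can be found."""
    return find_spec(module_name) is not None


if __name__ == "__main__":
    # "parakeet" is an assumed module name, not confirmed by this README.
    print("parakeet importable:", is_installed("parakeet"))
```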
### Install CMUdict for nltk
CMUdict from nltk is used to transform text into phonemes.
```python
import nltk
nltk.download("punkt")
nltk.download("cmudict")
```
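
To illustrate what the downloaded data provides, the sketch below looks up words in a CMUdict-style mapping (word to a list of pronunciations, each a list of phoneme symbols). It is not necessarily how Parakeet performs the conversion internally; `words_to_phonemes` is a hypothetical helper, and falling back to the raw word for out-of-vocabulary entries is an assumption made for this example.

```python
def words_to_phonemes(words, pronunciations):
    """Replace each known word with its first listed pronunciation."""
    phonemes = []
    for word in words:
        entries = pronunciations.get(word.lower())
        if entries:
            phonemes.extend(entries[0])  # take the first pronunciation
        else:
            phonemes.append(word)  # assumption: keep unknown words as-is
    return phonemes


if __name__ == "__main__":
    try:
        import nltk
        from nltk.corpus import cmudict

        # word_tokenize relies on the "punkt" data downloaded above.
        words = nltk.word_tokenize("Parakeet synthesizes speech")
        print(words_to_phonemes(words, cmudict.dict()))
    except (ImportError, LookupError) as error:
        print("nltk or its data is unavailable:", error)
```

`cmudict.dict()` loads the whole dictionary into memory, so in practice it is worth calling it once and reusing the result.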
## Related Research
- [Deep Voice 3: Scaling Text-to-Speech with Convolutional Sequence Learning](https://arxiv.org/abs/1710.07654)
- [Neural Speech Synthesis with Transformer Network](https://arxiv.org/abs/1809.08895)
- [FastSpeech: Fast, Robust and Controllable Text to Speech](https://arxiv.org/abs/1905.09263)
- [WaveFlow: A Compact Flow-based Model for Raw Audio](https://arxiv.org/abs/1912.01219)
## Examples
- [Train a DeepVoice3 model on the LJSpeech dataset](./examples/deepvoice3)
- [Train a TransformerTTS model on the LJSpeech dataset](./examples/transformer_tts)
- [Train a FastSpeech model on the LJSpeech dataset](./examples/fastspeech)
- [Train a WaveFlow model on the LJSpeech dataset](./examples/waveflow)
## Copyright and License
Parakeet is provided under the [Apache-2.0 license](LICENSE).