diff --git a/README.md b/README.md
index 181f78c..90f446e 100644
--- a/README.md
+++ b/README.md
@@ -20,18 +20,30 @@ cd Parakeet
 pip install -e .
 ```
 
+### Setup
+Make sure libsndfile1 installed:
+```bash
+sudo apt-get install libsndfile1
+```
+
 ### Install CMUdict for nltk
 CMUdict from nltk is used to transform text into phonemes.
-
 ```python
 import nltk
+nltk.download("punkt")
 nltk.download("cmudict")
 ```
 
+
+
 ## Supported models
 
 - [Deep Voice 3: Scaling Text-to-Speech with Convolutional Sequence Learning](https://arxiv.org/abs/1710.07654)
+- [Neural Speech Synthesis with Transformer Network](https://arxiv.org/abs/1809.08895)
+- [FastSpeech: Fast, Robust and Controllable Text to Speech](https://arxiv.org/abs/1905.09263).
 
 ## Examples
 
 - [Train a deepvoice 3 model with ljspeech dataset](./parakeet/examples/deepvoice3)
+- [Train a transformer_tts model with ljspeech dataset](./parakeet/examples/transformer_tts)
+- [Train a fastspeech model with ljspeech dataset](./parakeet/examples/fastspeech)
diff --git a/examples/fastspeech/README.md b/examples/fastspeech/README.md
index bd7c4d5..007b6b2 100644
--- a/examples/fastspeech/README.md
+++ b/examples/fastspeech/README.md
@@ -1,47 +1,6 @@
 # Fastspeech
 Paddle fluid implementation of Fastspeech, a feed-forward network based on Transformer. The implementation is based on [FastSpeech: Fast, Robust and Controllable Text to Speech](https://arxiv.org/abs/1905.09263).
 
-We implement Fastspeech model in paddle fluid with dynamic graph, which is convenient for flexible network architectures.
-
-## Installation
-
-### Install paddlepaddle
-This implementation requires the latest develop version of paddlepaddle. You can either download the compiled package or build paddle from source.
-1. Install the compiled package, via pip, conda or docker. See [**Installation Mannuals**](https://www.paddlepaddle.org.cn/documentation/docs/en/beginners_guide/install/index_en.html) for more details.
-
-2. Build paddlepaddle from source. See [**Compile From Source Code**](https://www.paddlepaddle.org.cn/documentation/docs/en/beginners_guide/install/compile/fromsource_en.html) for more details. Note that if you want to enable data parallel training for multiple GPUs, you should set `-DWITH_DISTRIBUTE=ON` with cmake.
-
-### Install parakeet
-You can choose to install via pypi or clone the repository and install manually.
-
-1. Install via pypi.
-    ```bash
-    pip install parakeet
-    ```
-
-2. Install manually.
-    ```bash
-    git clone
-    cd Parakeet/
-    pip install -e .
-
-### Download cmudict for nltk
-You also need to download cmudict for nltk, because convert text into phonemes with `cmudict`.
-
-```python
-import nltk
-nltk.download("punkt")
-nltk.download("cmudict")
-```
-
-If you have completed all the above installations, but still report an error at runtime:
-
-``` OSError: sndfile library not found ```
-
-You need to install ```libsndfile``` using your distribution’s package manager. e.g. install via:
-
-``` sudo apt-get install libsndfile1 ```
-
 ## Dataset
 
 We experiment with the LJSpeech dataset. Download and unzip [LJSpeech](https://keithito.com/LJ-Speech-Dataset/).
diff --git a/examples/transformer_tts/README.md b/examples/transformer_tts/README.md
index 475161d..afdfdd2 100644
--- a/examples/transformer_tts/README.md
+++ b/examples/transformer_tts/README.md
@@ -1,47 +1,6 @@
 # TransformerTTS
 Paddle fluid implementation of TransformerTTS, a neural TTS with Transformer. The implementation is based on [Neural Speech Synthesis with Transformer Network](https://arxiv.org/abs/1809.08895).
 
-We implement TransformerTTS model in paddle fluid with dynamic graph, which is convenient for flexible network architectures.
-
-## Installation
-
-### Install paddlepaddle
-This implementation requires the latest develop version of paddlepaddle. You can either download the compiled package or build paddle from source.
-1. Install the compiled package, via pip, conda or docker. See [**Installation Mannuals**](https://www.paddlepaddle.org.cn/documentation/docs/en/beginners_guide/install/index_en.html) for more details.
-
-2. Build paddlepaddle from source. See [**Compile From Source Code**](https://www.paddlepaddle.org.cn/documentation/docs/en/beginners_guide/install/compile/fromsource_en.html) for more details. Note that if you want to enable data parallel training for multiple GPUs, you should set `-DWITH_DISTRIBUTE=ON` with cmake.
-
-### Install parakeet
-You can choose to install via pypi or clone the repository and install manually.
-
-1. Install via pypi.
-    ```bash
-    pip install parakeet
-    ```
-
-2. Install manually.
-    ```bash
-    git clone
-    cd Parakeet/
-    pip install -e .
-
-### Download cmudict for nltk
-You also need to download cmudict for nltk, because convert text into phonemes with `cmudict`.
-
-```python
-import nltk
-nltk.download("punkt")
-nltk.download("cmudict")
-```
-
-If you have completed all the above installations, but still report an error at runtime:
-
-``` OSError: sndfile library not found ```
-
-You need to install ```libsndfile``` using your distribution’s package manager. e.g. install via:
-
-``` sudo apt-get install libsndfile1 ```
-
 ## Dataset
 
 We experiment with the LJSpeech dataset. Download and unzip [LJSpeech](https://keithito.com/LJ-Speech-Dataset/).
diff --git a/setup.py b/setup.py
index ca630fc..1cd6e8a 100644
--- a/setup.py
+++ b/setup.py
@@ -36,7 +36,7 @@ setup_info = dict(
     install_requires=[
         'numpy', 'nltk', 'inflect', 'librosa', 'unidecode', 'numba',
         'tqdm', 'matplotlib', 'tensorboardX', 'tensorboard', 'scipy',
-        'ruamel.yaml', 'pandas', 'sox',
+        'ruamel.yaml', 'pandas', 'sox', 'soundfile',
     ],
 
     # Package info