Merge branch 'fix' into 'master'

Modified installation related content in README. See merge request !18
2020-02-19 12:23:33 +08:00 · 2020-02-19 12:23:33 +08:00 · 6428ce5439
parent 4dd9a27361 034b4fdbab
commit 6428ce5439
4 changed files with 14 additions and 84 deletions
--- a/README.md
+++ b/README.md
@ -20,18 +20,30 @@ cd Parakeet
 pip install -e .
 ```

+### Setup
+Make sure libsndfile1 installed:
+```bash 
+sudo apt-get install libsndfile1 
+```
+
 ### Install CMUdict for nltk

 CMUdict from nltk is used to transform text into phonemes.
-
 ```python
 import nltk
+nltk.download("punkt")
 nltk.download("cmudict")
 ```
+
+
 ## Supported models

 - [Deep Voice 3: Scaling Text-to-Speech with Convolutional Sequence Learning](https://arxiv.org/abs/1710.07654)
+- [Neural Speech Synthesis with Transformer Network](https://arxiv.org/abs/1809.08895)
+- [FastSpeech: Fast, Robust and Controllable Text to Speech](https://arxiv.org/abs/1905.09263).

 ## Examples

 - [Train a deepvoice 3 model with ljspeech dataset](./parakeet/examples/deepvoice3) 
+- [Train a transformer_tts model with ljspeech dataset](./parakeet/examples/transformer_tts) 
+- [Train a fastspeech model with ljspeech dataset](./parakeet/examples/fastspeech) 
--- a/examples/fastspeech/README.md
+++ b/examples/fastspeech/README.md
@ -1,47 +1,6 @@
 # Fastspeech
 Paddle fluid implementation of Fastspeech, a feed-forward network based on Transformer. The implementation is based on [FastSpeech: Fast, Robust and Controllable Text to Speech](https://arxiv.org/abs/1905.09263).

-We implement Fastspeech model in paddle fluid with dynamic graph, which is convenient for flexible network architectures.
-
-## Installation
-
-### Install paddlepaddle
-This implementation requires the latest develop version of paddlepaddle. You can either download the compiled package or build paddle from source.
-1. Install the compiled package, via pip, conda or docker. See [**Installation Mannuals**](https://www.paddlepaddle.org.cn/documentation/docs/en/beginners_guide/install/index_en.html) for more details.
-
-2. Build paddlepaddle from source. See [**Compile From Source Code**](https://www.paddlepaddle.org.cn/documentation/docs/en/beginners_guide/install/compile/fromsource_en.html) for more details. Note that if you want to enable data parallel training for multiple GPUs, you should set `-DWITH_DISTRIBUTE=ON` with cmake.
-
-### Install parakeet
-You can choose to install via pypi or clone the repository and install manually.
-
-1. Install via pypi.
-   ```bash
-   pip install parakeet
-   ```
-
-2. Install manually.
-   ```bash
-   git clone <url>
-   cd Parakeet/
-   pip install -e .
-
-### Download cmudict for nltk
-You also need to download cmudict for nltk, because convert text into phonemes with `cmudict`.
-
-```python
-import nltk
-nltk.download("punkt")
-nltk.download("cmudict")
-```
-
-If you have completed all the above installations, but still report an error at runtime：
-
-``` OSError: sndfile library not found ```
-
-You need to install ```libsndfile``` using your distribution’s package manager. e.g. install via:
-
-``` sudo apt-get install libsndfile1 ```
-
 ## Dataset

 We experiment with the LJSpeech dataset. Download and unzip [LJSpeech](https://keithito.com/LJ-Speech-Dataset/).
--- a/examples/transformer_tts/README.md
+++ b/examples/transformer_tts/README.md
@ -1,47 +1,6 @@
 # TransformerTTS
 Paddle fluid implementation of TransformerTTS, a neural TTS with Transformer. The implementation is based on [Neural Speech Synthesis with Transformer Network](https://arxiv.org/abs/1809.08895).

-We implement TransformerTTS model in paddle fluid with dynamic graph, which is convenient for flexible network architectures.
-
-## Installation
-
-### Install paddlepaddle
-This implementation requires the latest develop version of paddlepaddle. You can either download the compiled package or build paddle from source.
-1. Install the compiled package, via pip, conda or docker. See [**Installation Mannuals**](https://www.paddlepaddle.org.cn/documentation/docs/en/beginners_guide/install/index_en.html) for more details.
-
-2. Build paddlepaddle from source. See [**Compile From Source Code**](https://www.paddlepaddle.org.cn/documentation/docs/en/beginners_guide/install/compile/fromsource_en.html) for more details. Note that if you want to enable data parallel training for multiple GPUs, you should set `-DWITH_DISTRIBUTE=ON` with cmake.
-
-### Install parakeet
-You can choose to install via pypi or clone the repository and install manually.
-
-1. Install via pypi.
-   ```bash
-   pip install parakeet
-   ```
-
-2. Install manually.
-   ```bash
-   git clone <url>
-   cd Parakeet/
-   pip install -e .
-
-### Download cmudict for nltk
-You also need to download cmudict for nltk, because convert text into phonemes with `cmudict`.
-
-```python
-import nltk
-nltk.download("punkt")
-nltk.download("cmudict")
-```
-
-If you have completed all the above installations, but still report an error at runtime：
-
-``` OSError: sndfile library not found ```
-
-You need to install ```libsndfile``` using your distribution’s package manager. e.g. install via:
-
-``` sudo apt-get install libsndfile1 ```
-
 ## Dataset

 We experiment with the LJSpeech dataset. Download and unzip [LJSpeech](https://keithito.com/LJ-Speech-Dataset/).
--- a/setup.py
+++ b/setup.py
@ -36,7 +36,7 @@ setup_info = dict(
    install_requires=[
        'numpy', 'nltk', 'inflect', 'librosa', 'unidecode', 'numba', 
        'tqdm', 'matplotlib', 'tensorboardX', 'tensorboard', 'scipy',
-        'ruamel.yaml', 'pandas', 'sox',  
+        'ruamel.yaml', 'pandas', 'sox', 'soundfile',  
    ],

    # Package info