Merge branch 'fix' into 'master'
Modified installation related content in README. See merge request !18
commit
6428ce5439
14
README.md
|
@ -20,18 +20,30 @@ cd Parakeet
|
||||||
pip install -e .
|
pip install -e .
|
||||||
```
|
```
|
||||||
|
|
||||||
|
### Setup
|
||||||
|
Make sure libsndfile1 is installed:
|
||||||
|
```bash
|
||||||
|
sudo apt-get install libsndfile1
|
||||||
|
```
|
||||||
|
|
||||||
### Install CMUdict for nltk
|
### Install CMUdict for nltk
|
||||||
|
|
||||||
CMUdict from nltk is used to transform text into phonemes.
|
CMUdict from nltk is used to transform text into phonemes.
|
||||||
|
|
||||||
```python
|
```python
|
||||||
import nltk
|
import nltk
|
||||||
|
nltk.download("punkt")
|
||||||
nltk.download("cmudict")
|
nltk.download("cmudict")
|
||||||
```
|
```
|
||||||
|
|
||||||
|
|
||||||
## Supported models
|
## Supported models
|
||||||
|
|
||||||
- [Deep Voice 3: Scaling Text-to-Speech with Convolutional Sequence Learning](https://arxiv.org/abs/1710.07654)
|
- [Deep Voice 3: Scaling Text-to-Speech with Convolutional Sequence Learning](https://arxiv.org/abs/1710.07654)
|
||||||
|
- [Neural Speech Synthesis with Transformer Network](https://arxiv.org/abs/1809.08895)
|
||||||
|
- [FastSpeech: Fast, Robust and Controllable Text to Speech](https://arxiv.org/abs/1905.09263).
|
||||||
|
|
||||||
## Examples
|
## Examples
|
||||||
|
|
||||||
- [Train a deepvoice 3 model with ljspeech dataset](./parakeet/examples/deepvoice3)
|
- [Train a deepvoice 3 model with ljspeech dataset](./parakeet/examples/deepvoice3)
|
||||||
|
- [Train a transformer_tts model with ljspeech dataset](./parakeet/examples/transformer_tts)
|
||||||
|
- [Train a fastspeech model with ljspeech dataset](./parakeet/examples/fastspeech)
|
||||||
|
|
|
@ -1,47 +1,6 @@
|
||||||
# Fastspeech
|
# Fastspeech
|
||||||
Paddle fluid implementation of Fastspeech, a feed-forward network based on Transformer. The implementation is based on [FastSpeech: Fast, Robust and Controllable Text to Speech](https://arxiv.org/abs/1905.09263).
|
Paddle fluid implementation of Fastspeech, a feed-forward network based on Transformer. The implementation is based on [FastSpeech: Fast, Robust and Controllable Text to Speech](https://arxiv.org/abs/1905.09263).
|
||||||
|
|
||||||
We implement Fastspeech model in paddle fluid with dynamic graph, which is convenient for flexible network architectures.
|
|
||||||
|
|
||||||
## Installation
|
|
||||||
|
|
||||||
### Install paddlepaddle
|
|
||||||
This implementation requires the latest develop version of paddlepaddle. You can either download the compiled package or build paddle from source.
|
|
||||||
1. Install the compiled package, via pip, conda or docker. See [**Installation Mannuals**](https://www.paddlepaddle.org.cn/documentation/docs/en/beginners_guide/install/index_en.html) for more details.
|
|
||||||
|
|
||||||
2. Build paddlepaddle from source. See [**Compile From Source Code**](https://www.paddlepaddle.org.cn/documentation/docs/en/beginners_guide/install/compile/fromsource_en.html) for more details. Note that if you want to enable data parallel training for multiple GPUs, you should set `-DWITH_DISTRIBUTE=ON` with cmake.
|
|
||||||
|
|
||||||
### Install parakeet
|
|
||||||
You can choose to install via pypi or clone the repository and install manually.
|
|
||||||
|
|
||||||
1. Install via pypi.
|
|
||||||
```bash
|
|
||||||
pip install parakeet
|
|
||||||
```
|
|
||||||
|
|
||||||
2. Install manually.
|
|
||||||
```bash
|
|
||||||
git clone <url>
|
|
||||||
cd Parakeet/
|
|
||||||
pip install -e .
|
|
||||||
|
|
||||||
### Download cmudict for nltk
|
|
||||||
You also need to download cmudict for nltk, because convert text into phonemes with `cmudict`.
|
|
||||||
|
|
||||||
```python
|
|
||||||
import nltk
|
|
||||||
nltk.download("punkt")
|
|
||||||
nltk.download("cmudict")
|
|
||||||
```
|
|
||||||
|
|
||||||
If you have completed all the above installations, but still report an error at runtime:
|
|
||||||
|
|
||||||
``` OSError: sndfile library not found ```
|
|
||||||
|
|
||||||
You need to install ```libsndfile``` using your distribution’s package manager. e.g. install via:
|
|
||||||
|
|
||||||
``` sudo apt-get install libsndfile1 ```
|
|
||||||
|
|
||||||
## Dataset
|
## Dataset
|
||||||
|
|
||||||
We experiment with the LJSpeech dataset. Download and unzip [LJSpeech](https://keithito.com/LJ-Speech-Dataset/).
|
We experiment with the LJSpeech dataset. Download and unzip [LJSpeech](https://keithito.com/LJ-Speech-Dataset/).
|
||||||
|
|
|
@ -1,47 +1,6 @@
|
||||||
# TransformerTTS
|
# TransformerTTS
|
||||||
Paddle fluid implementation of TransformerTTS, a neural TTS with Transformer. The implementation is based on [Neural Speech Synthesis with Transformer Network](https://arxiv.org/abs/1809.08895).
|
Paddle fluid implementation of TransformerTTS, a neural TTS with Transformer. The implementation is based on [Neural Speech Synthesis with Transformer Network](https://arxiv.org/abs/1809.08895).
|
||||||
|
|
||||||
We implement TransformerTTS model in paddle fluid with dynamic graph, which is convenient for flexible network architectures.
|
|
||||||
|
|
||||||
## Installation
|
|
||||||
|
|
||||||
### Install paddlepaddle
|
|
||||||
This implementation requires the latest develop version of paddlepaddle. You can either download the compiled package or build paddle from source.
|
|
||||||
1. Install the compiled package, via pip, conda or docker. See [**Installation Mannuals**](https://www.paddlepaddle.org.cn/documentation/docs/en/beginners_guide/install/index_en.html) for more details.
|
|
||||||
|
|
||||||
2. Build paddlepaddle from source. See [**Compile From Source Code**](https://www.paddlepaddle.org.cn/documentation/docs/en/beginners_guide/install/compile/fromsource_en.html) for more details. Note that if you want to enable data parallel training for multiple GPUs, you should set `-DWITH_DISTRIBUTE=ON` with cmake.
|
|
||||||
|
|
||||||
### Install parakeet
|
|
||||||
You can choose to install via pypi or clone the repository and install manually.
|
|
||||||
|
|
||||||
1. Install via pypi.
|
|
||||||
```bash
|
|
||||||
pip install parakeet
|
|
||||||
```
|
|
||||||
|
|
||||||
2. Install manually.
|
|
||||||
```bash
|
|
||||||
git clone <url>
|
|
||||||
cd Parakeet/
|
|
||||||
pip install -e .
|
|
||||||
|
|
||||||
### Download cmudict for nltk
|
|
||||||
You also need to download cmudict for nltk, because convert text into phonemes with `cmudict`.
|
|
||||||
|
|
||||||
```python
|
|
||||||
import nltk
|
|
||||||
nltk.download("punkt")
|
|
||||||
nltk.download("cmudict")
|
|
||||||
```
|
|
||||||
|
|
||||||
If you have completed all the above installations, but still report an error at runtime:
|
|
||||||
|
|
||||||
``` OSError: sndfile library not found ```
|
|
||||||
|
|
||||||
You need to install ```libsndfile``` using your distribution’s package manager. e.g. install via:
|
|
||||||
|
|
||||||
``` sudo apt-get install libsndfile1 ```
|
|
||||||
|
|
||||||
## Dataset
|
## Dataset
|
||||||
|
|
||||||
We experiment with the LJSpeech dataset. Download and unzip [LJSpeech](https://keithito.com/LJ-Speech-Dataset/).
|
We experiment with the LJSpeech dataset. Download and unzip [LJSpeech](https://keithito.com/LJ-Speech-Dataset/).
|
||||||
|
|
2
setup.py
|
@ -36,7 +36,7 @@ setup_info = dict(
|
||||||
install_requires=[
|
install_requires=[
|
||||||
'numpy', 'nltk', 'inflect', 'librosa', 'unidecode', 'numba',
|
'numpy', 'nltk', 'inflect', 'librosa', 'unidecode', 'numba',
|
||||||
'tqdm', 'matplotlib', 'tensorboardX', 'tensorboard', 'scipy',
|
'tqdm', 'matplotlib', 'tensorboardX', 'tensorboard', 'scipy',
|
||||||
'ruamel.yaml', 'pandas', 'sox',
|
'ruamel.yaml', 'pandas', 'sox', 'soundfile',
|
||||||
],
|
],
|
||||||
|
|
||||||
# Package info
|
# Package info
|
||||||
|
|
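The setup steps touched by this merge request (the `libsndfile1` system package, the new `soundfile` entry in `install_requires`, and the nltk `punkt` and `cmudict` downloads) can be sanity-checked with a short script. This is a minimal sketch, not part of Parakeet: `check_setup` is a hypothetical helper name, and the nltk resource paths `corpora/cmudict` and `tokenizers/punkt` are assumed to be where `nltk.download` places the data.

```python
import ctypes.util
import importlib.util


def check_setup():
    """Report which pieces of the README setup appear to be present.

    Hypothetical helper for illustration; not part of Parakeet.
    """
    report = {
        # System library installed via `sudo apt-get install libsndfile1`.
        "libsndfile": ctypes.util.find_library("sndfile") is not None,
        # Python packages listed in setup.py's install_requires.
        "soundfile": importlib.util.find_spec("soundfile") is not None,
        "nltk": importlib.util.find_spec("nltk") is not None,
        # nltk data fetched with nltk.download(...); checked below.
        "cmudict": False,
        "punkt": False,
    }

    if report["nltk"]:
        import nltk

        # Assumed resource locations inside the nltk data directory.
        resources = (("cmudict", "corpora/cmudict"), ("punkt", "tokenizers/punkt"))
        for key, resource in resources:
            try:
                nltk.data.find(resource)
                report[key] = True
            except LookupError:
                # nltk raises LookupError when the resource is not downloaded.
                pass

    return report


if __name__ == "__main__":
    for name, ok in check_setup().items():
        print(f"{name}: {'ok' if ok else 'MISSING'}")
```

A `MISSING` entry for `libsndfile` is only a hint: some `soundfile` wheels bundle their own copy of the library, in which case the system lookup can fail while `import soundfile` still works.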