Merge branch 'master' into 'master'
README and requirement update See merge request !16
This commit is contained in:
commit
147f7be251
24
README.md
24
README.md
|
@ -8,10 +8,30 @@ Parakeet aims to provide a flexible, efficient and state-of-the-art text-to-spee
|
|||
|
||||
## Installation
|
||||
|
||||
### Install Paddlepaddle
|
||||
|
||||
See [install](https://www.paddlepaddle.org.cn/install/quick) for more details. This repo requires paddlepaddle's version is above 1.7.
|
||||
|
||||
### Install Parakeet
|
||||
|
||||
```bash
|
||||
pip install parakeet
|
||||
# git clone this repo first
|
||||
cd Parakeet
|
||||
pip install -e .
|
||||
```
|
||||
|
||||
### Install CMUdict for nltk
|
||||
|
||||
CMUdict from nltk is used to transform text into phonemes.
|
||||
|
||||
```python
|
||||
import nltk
|
||||
nltk.download("cmudict")
|
||||
```
|
||||
## Supported models
|
||||
|
||||
- [Deep Voice 3: Scaling Text-to-Speech with Convolutional Sequence Learning](./deepvoice3)
|
||||
- [Deep Voice 3: Scaling Text-to-Speech with Convolutional Sequence Learning](https://arxiv.org/abs/1710.07654)
|
||||
|
||||
## Examples
|
||||
|
||||
- [Train a deepvoice 3 model with ljspeech dataset](./parakeet/examples/deepvoice3)
|
||||
|
|
|
@ -4,39 +4,6 @@ Paddle implementation of deepvoice 3 in dynamic graph, a convolutional network b
|
|||
|
||||
We implement Deepvoice 3 in paddle fluid with dynamic graph, which is convenient for flexible network architectures.
|
||||
|
||||
## Installation
|
||||
|
||||
### Install paddlepaddle.
|
||||
This implementation requires the latest develop version of paddlepaddle. You can either download the compiled package or build paddle from source.
|
||||
|
||||
1. Install the compiled package, via pip, conda or docker. See [**Installation Mannuals**](https://www.paddlepaddle.org.cn/documentation/docs/en/beginners_guide/install/index_en.html) for more details.
|
||||
|
||||
2. Build paddlepaddle from source. See [**Compile From Source Code**](https://www.paddlepaddle.org.cn/documentation/docs/en/beginners_guide/install/compile/fromsource_en.html) for more details. Note that if you want to enable data parallel training for multiple GPUs, you should set `-DWITH_DISTRIBUTE=ON` with cmake.
|
||||
|
||||
### Install parakeet
|
||||
You can choose to install via pypi or clone the repository and install manually.
|
||||
|
||||
1. Install via pypi.
|
||||
```bash
|
||||
pip install paddle-parakeet
|
||||
```
|
||||
|
||||
2. Install manually.
|
||||
```bash
|
||||
git clone <url>
|
||||
cd Parakeet/
|
||||
pip install -e .
|
||||
```
|
||||
|
||||
### Download cmudict for nltk
|
||||
You also need to download cmudict for nltk, because convert text into phonemes with `cmudict`.
|
||||
|
||||
```python
|
||||
import nltk
|
||||
nltk.download("punkt")
|
||||
nltk.download("cmudict")
|
||||
```
|
||||
|
||||
## Dataset
|
||||
|
||||
We experiment with the LJSpeech dataset. Download and unzip [LJSpeech](https://keithito.com/LJ-Speech-Dataset/).
|
||||
|
|
|
@ -1,14 +0,0 @@
|
|||
numba==0.45.1
|
||||
numpy==1.16.4
|
||||
nltk==3.4.4
|
||||
scipy
|
||||
unidecode==1.1.1
|
||||
inflect==2.1.0
|
||||
librosa==0.7.0
|
||||
tqdm==4.35.0
|
||||
tensorboardX==1.8
|
||||
matplotlib
|
||||
requests==2.22.0
|
||||
lws==1.2.4
|
||||
nnmnkwii
|
||||
tensorboard
|
Loading…
Reference in New Issue