From d73776c45669443f32f943a3c53bddc22c29752a Mon Sep 17 00:00:00 2001 From: chenfeiyu Date: Tue, 18 Feb 2020 02:29:54 +0000 Subject: [PATCH 1/3] update README for parakeet and examples/deepvoice3 --- README.md | 18 ++++++++++++++++-- examples/deepvoice3/README.md | 33 --------------------------------- 2 files changed, 16 insertions(+), 35 deletions(-) diff --git a/README.md b/README.md index d57e1ac..8ba02b7 100644 --- a/README.md +++ b/README.md @@ -9,9 +9,23 @@ Parakeet aims to provide a flexible, efficient and state-of-the-art text-to-spee ## Installation ```bash -pip install parakeet +# git clone this repo first +cd Parakeet +pip install -e . ``` +### Install CMUdict for nltk + +CMUdict from nltk is used to transform text into phonemes. + +```python +import nltk +nltk.download("cmudict") +``` ## Supported models -- [Deep Voice 3: Scaling Text-to-Speech with Convolutional Sequence Learning](./deepvoice3) +- [Deep Voice 3: Scaling Text-to-Speech with Convolutional Sequence Learning](./parakeet/models/deepvoice3) + +## Examples + +- [Train a deepvoice 3 model with ljspeech dataset](./parakeet/examples/deepvoice3) diff --git a/examples/deepvoice3/README.md b/examples/deepvoice3/README.md index 5a0909e..43e1939 100644 --- a/examples/deepvoice3/README.md +++ b/examples/deepvoice3/README.md @@ -4,39 +4,6 @@ Paddle implementation of deepvoice 3 in dynamic graph, a convolutional network b We implement Deepvoice 3 in paddle fluid with dynamic graph, which is convenient for flexible network architectures. -## Installation - -### Install paddlepaddle. -This implementation requires the latest develop version of paddlepaddle. You can either download the compiled package or build paddle from source. - -1. Install the compiled package, via pip, conda or docker. See [**Installation Mannuals**](https://www.paddlepaddle.org.cn/documentation/docs/en/beginners_guide/install/index_en.html) for more details. - -2. Build paddlepaddle from source. See [**Compile From Source Code**](https://www.paddlepaddle.org.cn/documentation/docs/en/beginners_guide/install/compile/fromsource_en.html) for more details. Note that if you want to enable data parallel training for multiple GPUs, you should set `-DWITH_DISTRIBUTE=ON` with cmake. - -### Install parakeet -You can choose to install via pypi or clone the repository and install manually. - -1. Install via pypi. - ```bash - pip install paddle-parakeet - ``` - -2. Install manually. - ```bash - git clone - cd Parakeet/ - pip install -e . - ``` - -### Download cmudict for nltk -You also need to download cmudict for nltk, because convert text into phonemes with `cmudict`. - -```python -import nltk -nltk.download("punkt") -nltk.download("cmudict") -``` - ## Dataset We experiment with the LJSpeech dataset. Download and unzip [LJSpeech](https://keithito.com/LJ-Speech-Dataset/). From fed950a9a31d2116dd254e7426fa0ae4300860a1 Mon Sep 17 00:00:00 2001 From: chenfeiyu Date: Tue, 18 Feb 2020 02:29:54 +0000 Subject: [PATCH 2/3] update paddle version requirement to 1.7, remove requirements.txt --- README.md | 6 ++++++ requirements.txt | 14 -------------- 2 files changed, 6 insertions(+), 14 deletions(-) delete mode 100644 requirements.txt diff --git a/README.md b/README.md index 8ba02b7..80c8dfa 100644 --- a/README.md +++ b/README.md @@ -8,6 +8,12 @@ Parakeet aims to provide a flexible, efficient and state-of-the-art text-to-spee ## Installation +### Install Paddlepaddle + +See [install](https://www.paddlepaddle.org.cn/install/quick) for more details. This repo requires paddlepaddle's version is above 1.7. + +### Install Parakeet + ```bash # git clone this repo first cd Parakeet diff --git a/requirements.txt b/requirements.txt deleted file mode 100644 index 50a1da6..0000000 --- a/requirements.txt +++ /dev/null @@ -1,14 +0,0 @@ -numba==0.45.1 -numpy==1.16.4 -nltk==3.4.4 -scipy -unidecode==1.1.1 -inflect==2.1.0 -librosa==0.7.0 -tqdm==4.35.0 -tensorboardX==1.8 -matplotlib -requests==2.22.0 -lws==1.2.4 -nnmnkwii -tensorboard From ddb5b826b7a43d8566f4e6ecfc6374b6b5044333 Mon Sep 17 00:00:00 2001 From: chenfeiyu Date: Tue, 18 Feb 2020 03:32:14 +0000 Subject: [PATCH 3/3] update links for models in README --- README.md | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/README.md b/README.md index 80c8dfa..181f78c 100644 --- a/README.md +++ b/README.md @@ -30,7 +30,7 @@ nltk.download("cmudict") ``` ## Supported models -- [Deep Voice 3: Scaling Text-to-Speech with Convolutional Sequence Learning](./parakeet/models/deepvoice3) +- [Deep Voice 3: Scaling Text-to-Speech with Convolutional Sequence Learning](https://arxiv.org/abs/1710.07654) ## Examples