update README for examples/ge2e

This commit is contained in:
iclementine 2021-04-29 17:15:18 +08:00
parent b1304cb449
commit 42092f1f5b
2 changed files with 10 additions and 10 deletions

View File

@ -28,23 +28,23 @@ Currently supported datasets are Librispeech-other-500, VoxCeleb, VoxCeleb2,ai-
1. Librispeech/train-other-500
An English multispeaker datasetURL https://www.openslr.org/resources/12/train-other-500.tar.gzonly the `train-other-500` subset is used.
An English multispeaker dataset[URL](https://www.openslr.org/resources/12/train-other-500.tar.gz)only the `train-other-500` subset is used.
1. VoxCeleb1
An English multispeaker datasetURL https://www.robots.ox.ac.uk/~vgg/data/voxceleb/vox1.html , Audio Files from Dev A to Dev D should be downloaded, combined and extracted.
An English multispeaker dataset[URL](https://www.robots.ox.ac.uk/~vgg/data/voxceleb/vox1.html) , Audio Files from Dev A to Dev D should be downloaded, combined and extracted.
2. VoxCeleb2
An English multispeaker datasetURL https://www.robots.ox.ac.uk/~vgg/data/voxceleb/vox1.html , Audio Files from Dev A to Dev H should be downloaded, combined and extracted.
An English multispeaker dataset[URL](https://www.robots.ox.ac.uk/~vgg/data/voxceleb/vox1.html) , Audio Files from Dev A to Dev H should be downloaded, combined and extracted.
3. Aidatatang-200zh
A Mandarin Chinese multispeaker dataset URL https://www.openslr.org/62/ .
A Mandarin Chinese multispeaker dataset [URL](https://www.openslr.org/62/) .
4. magicdata
A Mandarin Chinese multispeaker dataset URL https://www.openslr.org/68/ .
A Mandarin Chinese multispeaker dataset [URL](https://www.openslr.org/68/) .
If you want to use other datasets, you can also download and preprocess it as long as it meets the needs described below.

View File

@ -27,23 +27,23 @@ ge2e
1. Librispeech/train-other-500
英文多说话人数据集,下载链接 https://www.openslr.org/resources/12/train-other-500.tar.gz我们的实验中仅用到了 train-other-500 这个子集。
英文多说话人数据集,[下载链接](https://www.openslr.org/resources/12/train-other-500.tar.gz),我们的实验中仅用到了 train-other-500 这个子集。
1. VoxCeleb1
英文多说话人数据集,下载链接 https://www.robots.ox.ac.uk/~vgg/data/voxceleb/vox1.html需要下载其中的 Audio Files 中的 Dev A 到 Dev D 四个压缩文件并合并解压。
英文多说话人数据集,[下载链接](https://www.robots.ox.ac.uk/~vgg/data/voxceleb/vox1.html),需要下载其中的 Audio Files 中的 Dev A 到 Dev D 四个压缩文件并合并解压。
2. VoxCeleb2
英文多说话人数据集,下载链接 https://www.robots.ox.ac.uk/~vgg/data/voxceleb/vox2.html需要下载其中的 Audio Files 中的 Dev A 到 Dev H 八个压缩文件并合并解压。
英文多说话人数据集,[下载链接](https://www.robots.ox.ac.uk/~vgg/data/voxceleb/vox2.html),需要下载其中的 Audio Files 中的 Dev A 到 Dev H 八个压缩文件并合并解压。
3. Aidatatang-200zh
中文多说话人数据集,下载链接 https://www.openslr.org/62/ .
中文多说话人数据集,[下载链接](https://www.openslr.org/62/) .
4. magicdata
中文多说话人数据集,下载链接 https://www.openslr.org/68/ .
中文多说话人数据集,[下载链接](https://www.openslr.org/68/) .
如果用户需要使用其他的数据集,也可以自行下载并进行数据处理,只要符合如下的要求。