update README for examples/ge2e

This commit is contained in:
iclementine 2021-04-29 17:15:18 +08:00
parent b1304cb449
commit 42092f1f5b
2 changed files with 10 additions and 10 deletions

View File

@ -28,23 +28,23 @@ Currently supported datasets are Librispeech-other-500, VoxCeleb, VoxCeleb2,ai-
1. Librispeech/train-other-500 1. Librispeech/train-other-500
An English multispeaker datasetURL https://www.openslr.org/resources/12/train-other-500.tar.gzonly the `train-other-500` subset is used. An English multispeaker dataset[URL](https://www.openslr.org/resources/12/train-other-500.tar.gz)only the `train-other-500` subset is used.
1. VoxCeleb1 1. VoxCeleb1
An English multispeaker datasetURL https://www.robots.ox.ac.uk/~vgg/data/voxceleb/vox1.html , Audio Files from Dev A to Dev D should be downloaded, combined and extracted. An English multispeaker dataset[URL](https://www.robots.ox.ac.uk/~vgg/data/voxceleb/vox1.html) , Audio Files from Dev A to Dev D should be downloaded, combined and extracted.
2. VoxCeleb2 2. VoxCeleb2
An English multispeaker datasetURL https://www.robots.ox.ac.uk/~vgg/data/voxceleb/vox1.html , Audio Files from Dev A to Dev H should be downloaded, combined and extracted. An English multispeaker dataset[URL](https://www.robots.ox.ac.uk/~vgg/data/voxceleb/vox1.html) , Audio Files from Dev A to Dev H should be downloaded, combined and extracted.
3. Aidatatang-200zh 3. Aidatatang-200zh
A Mandarin Chinese multispeaker dataset URL https://www.openslr.org/62/ . A Mandarin Chinese multispeaker dataset [URL](https://www.openslr.org/62/) .
4. magicdata 4. magicdata
A Mandarin Chinese multispeaker dataset URL https://www.openslr.org/68/ . A Mandarin Chinese multispeaker dataset [URL](https://www.openslr.org/68/) .
If you want to use other datasets, you can also download and preprocess it as long as it meets the needs described below. If you want to use other datasets, you can also download and preprocess it as long as it meets the needs described below.

View File

@ -27,23 +27,23 @@ ge2e
1. Librispeech/train-other-500 1. Librispeech/train-other-500
英文多说话人数据集,下载链接 https://www.openslr.org/resources/12/train-other-500.tar.gz我们的实验中仅用到了 train-other-500 这个子集。 英文多说话人数据集,[下载链接](https://www.openslr.org/resources/12/train-other-500.tar.gz),我们的实验中仅用到了 train-other-500 这个子集。
1. VoxCeleb1 1. VoxCeleb1
英文多说话人数据集,下载链接 https://www.robots.ox.ac.uk/~vgg/data/voxceleb/vox1.html需要下载其中的 Audio Files 中的 Dev A 到 Dev D 四个压缩文件并合并解压。 英文多说话人数据集,[下载链接](https://www.robots.ox.ac.uk/~vgg/data/voxceleb/vox1.html),需要下载其中的 Audio Files 中的 Dev A 到 Dev D 四个压缩文件并合并解压。
2. VoxCeleb2 2. VoxCeleb2
英文多说话人数据集,下载链接 https://www.robots.ox.ac.uk/~vgg/data/voxceleb/vox2.html需要下载其中的 Audio Files 中的 Dev A 到 Dev H 八个压缩文件并合并解压。 英文多说话人数据集,[下载链接](https://www.robots.ox.ac.uk/~vgg/data/voxceleb/vox2.html),需要下载其中的 Audio Files 中的 Dev A 到 Dev H 八个压缩文件并合并解压。
3. Aidatatang-200zh 3. Aidatatang-200zh
中文多说话人数据集,下载链接 https://www.openslr.org/62/ . 中文多说话人数据集,[下载链接](https://www.openslr.org/62/) .
4. magicdata 4. magicdata
中文多说话人数据集,下载链接 https://www.openslr.org/68/ . 中文多说话人数据集,[下载链接](https://www.openslr.org/68/) .
如果用户需要使用其他的数据集,也可以自行下载并进行数据处理,只要符合如下的要求。 如果用户需要使用其他的数据集,也可以自行下载并进行数据处理,只要符合如下的要求。