399 Commits

Author SHA1 Message Date
chenfeiyu d3761683e1 add an adaptive loss to balance stop prediction classes 2020-12-05 14:12:30 +08:00
chenfeiyu a4a0bd8c98 add last bn for the decoder postnet, switch back to weighted mean 2020-12-05 14:00:08 +08:00
chenfeiyu c57e8e7350 fix transformer_tts' stop condition 2020-12-04 02:11:02 +08:00
Feiyu Chan a6806389f9 Merge pull request #44 from iclementine/reborn
hide models that are not updated yet
2020-12-03 19:06:06 +08:00
chenfeiyu e87bfb7d05 hide fastspeech, deepvoice3, clarinet temporarily till they are updated 2020-12-03 18:54:17 +08:00
chenfeiyu 3ca037453e remove conf and use yacs instead 2020-12-03 18:42:36 +08:00
Feiyu Chan a29c74d036 Merge pull request #43 from iclementine/reborn
update models
2020-12-03 17:03:55 +08:00
chenfeiyu 4df5ad42f6 remove the last layer from decoder prenet 2020-12-03 15:55:07 +08:00
chenfeiyu 810f979dba switch to keras style sample_weight in losses 2020-12-03 15:37:43 +08:00
chenfeiyu 6edc7d8474 switch back to standard implementation of positional encoding 2020-12-03 14:54:32 +08:00
chenfeiyu 404add2caa temporary fix for memory leak 2020-12-03 14:51:25 +08:00
chenfeiyu 9cb5c03069 transformer_tts, miscellaneous fixes 2020-12-01 18:13:30 +08:00
chenfeiyu 598d813908 fix a bug in config 2020-11-23 13:24:03 +08:00
iclementine 2ed26d3416 do not expand the last layer of lists 2020-11-20 16:17:24 +08:00
iclementine ce29ac68b3 use yaml instead of ruamel.yaml 2020-11-20 16:13:05 +08:00
iclementine d190ce8d7f use dict comprehension to exclude unspecified options 2020-11-20 15:21:06 +08:00
iclementine 6101c6ac86 fix typos 2020-11-20 15:18:53 +08:00
iclementine 5e11ce0dcd remove options not specified via CLI before merging args 2020-11-20 15:17:35 +08:00
iclementine 73a2cadc36 remove default values when adding config options to a parser 2020-11-20 15:13:24 +08:00
iclementine 8af831ae3c add --config by default when adding config options to a parser 2020-11-20 15:06:07 +08:00
iclementine 5b5eaaadac add a tool for configuration 2020-11-20 14:33:56 +08:00
iclementine fb49c1e77d fix typos 2020-11-19 22:20:31 +08:00
iclementine 2dce0887b3 add schedulers 2020-11-19 22:17:50 +08:00
iclementine 49231ca8e5 move datasets 2020-11-19 22:04:25 +08:00
iclementine db7598c702 add datasets 2020-11-19 20:43:03 +08:00
iclementine abee3ecdd4 move datasets into parakeet.datasets 2020-11-19 20:31:21 +08:00
iclementine b65cc4d8dc add Unit normalizer 2020-11-19 20:17:42 +08:00
iclementine a01200e437 add a CLI for cloning examples 2020-11-19 18:08:11 +08:00
iclementine c7e5aaa540 remove old examples 2020-11-19 15:47:57 +08:00
iclementine 0e35119453 add more docs in Chinese 2020-11-19 10:41:37 +08:00
iclementine c8622b4699 update experiment guide 2020-11-17 16:33:13 +08:00
iclementine e470cda881 add Chinese docs 2020-11-17 10:48:02 +08:00
iclementine 01f30d7cc8 switch to markdown 2020-11-12 17:28:07 +08:00
iclementine 7822a89fec add doctree 2020-11-12 17:17:02 +08:00
iclementine 098d3795c2 add documentation for installation. 2020-11-12 17:07:03 +08:00
chenfeiyu a9177cd6c2 waveflow: explicitly call forward hook before calling a method other than forward when needed. 2020-11-09 15:46:27 +08:00
chenfeiyu af4da7dd9e 1. update code for waveflow's probability density estimation and sampling;
2. add WaveFlowLoss.
2020-11-04 23:22:45 +08:00
chenfeiyu e07441c193 waveflow refactor: add prediction functionalities 2020-11-04 19:31:36 +08:00
chenfeiyu 8094578f6d update waveflow to 2.0 APIs 2020-11-04 01:37:49 +08:00
chenfeiyu 0cdad602e2 fix a bug for changing reduction factor in transformer_tts 2020-11-03 11:18:46 +08:00
chenfeiyu 1f71f65c28 Merge branch 'reborn' into exp 2020-10-30 21:44:06 +08:00
chenfeiyu 68f5e1de15 add utility to pack attention weights 2020-10-30 21:36:11 +08:00
chenfeiyu 45d6f3b99d specify a U(-.05, .05) initializer for Embedding 2020-10-30 17:42:06 +08:00
chenfeiyu 57d820f055 add support for channel last in batch_spec, and Conv1dBatchNorm 2020-10-30 15:13:57 +08:00
chenfeiyu 36cc543348 minor fixes to TransformerTTS 2020-10-28 11:05:47 +08:00
chenfeiyu c43216ae9b 1. API renaming Conv1d -> Conv1D, BatchNorm1d -> BatchNorm1D;
2. add losses in parakeet/modules;
3. fix a bug in phonetics;
4. TransformerTTS update: encoder dim can be different from decoder dim;
5. MultiHeadAttention in TransformerTTS: add k_input_dim & v_input_dim in __init__ to allow different feature sizes for k and v.
2020-10-22 05:04:45 +00:00
iclementine 2a764d9a10 add opencc, g2p_en, g2pm into requirements 2020-10-20 16:08:45 +08:00
iclementine 580655f33f add phonetics & vocab & punctuation 2020-10-20 16:06:11 +08:00
iclementine c1e0aecdde 1. import models into parakeet.models;
2. add predict for TransformerTTS and test its io.
2020-10-16 13:51:56 +08:00
iclementine 6aa7af1aa4 add AudioFolderDataset 2020-10-15 23:15:27 +08:00