* hacky thing, add tone support for acoustic model
* fix experiments for waveflow and wavenet, only write visual log in rank-0
* use emb add in tacotron2
* 1. remove space from numericalized representation;
2. fix decoder paddign mask's unsqueeze dim.
* remove bn in postnet
* refactoring code
* add an option to normalize volume when loading audio.
* add an embedding layer.
* 1. change the default min value of LogMagnitude to 1e-5;
2. remove stop logit prediction from tacotron2 model.
* WIP: baker
* add ge2e
* fix lstm speaker encoder
* fix lstm speaker encoder
* fix speaker encoder and add support for 2 more datasets
* simplify visualization code
* add a simple strategy to support multispeaker for tacotron.
* add vctk example for refactored tacotron
* fix indentation
* fix class name
* fix visualizer
* fix root path
* fix root path
* fix root path
* fix typos
* fix bugs
* fix text log extention name
* add example for baker and aishell3
* update experiment and display
* format code for tacotron_vctk, add plot_waveform to display
* add new trainer
* minor fix
* add global condition support for tacotron2
* add gst layer
* add 2 frontend
* fix fmax for example/waveflow
* update collate function, data loader not does not convert nested list into numpy array.
* WIP: add hifigan
* WIP:update hifigan
* change stft to use conv1d
* add audio datasets
* change batch_text_id, batch_spec, batch_wav to include valid lengths in the returned value
* change wavenet to use on-the-fly prepeocessing
* fix typos
* resolve conflict
* remove imports that are removed
* remove files not included in this release
* remove imports to deleted modules
* move tacotron2_msp
* clean code
* fix argument order
* fix argument name
* clean code for data processing
* WIP: add README
* add more details to thr README, fix some preprocess scripts
* add voice cloning notebook
* add an optional to alter the loss and model structure of tacotron2, add an alternative config
* add plot_multiple_attentions and update visualization code in transformer_tts
* format code
* remove tacotron2_msp
* update tacotron2 from_pretrained, update setup.py
* update tacotron2
* update tacotron_aishell3's README
* add images for exampels/tacotron2_aishell3's README
* update README for examples/ge2e
* add STFT back
* add extra_config keys into the default config of tacotron
* fix typos and docs
* update README and doc
* update docstrings for tacotron
* update doc
* update README
* add links to downlaod pretrained models
* refine READMEs and clean code
* add praatio into requirements for running the experiments
* format code with pre-commit
* simplify text processing code and update notebook
2. add losses in parakeet/modules;
3. fix a bug in phonetics;
4. TransformerTTS update: encoder dim can be different from decoder dim;
5. MultiHeadAttention in TransformerTTS: add k_input_dim & v_input_dim in __init__ to allow differemt feature sizes for k and v.