c43216ae9b
2. add losses in parakeet/modules; 3. fix a bug in phonetics; 4. TransformerTTS update: encoder dim can be different from decoder dim; 5. MultiHeadAttention in TransformerTTS: add k_input_dim & v_input_dim in __init__ to allow differemt feature sizes for k and v. |
||
---|---|---|
.. | ||
audio | ||
data | ||
frontend | ||
models | ||
modules | ||
utils | ||
__init__.py |