WaveGRU-Text-To-Speech / wavegru.yaml
NTT123
a slow but working model
df1ad02
raw
history blame
No virus
257 Bytes
## dsp
sample_rate : 24000
window_length: 50.0 # ms
hop_length: 12.5 # ms
mel_min: 1.0e-5 ## need .0 to make it a float
mel_dim: 80
n_fft: 2048
## wavegru
embed_dim: 32
rnn_dim: 512
frames_per_sequence: 67
num_pad_frames: 62
upsample_factors: [5, 4, 3, 5]