Spaces:

ntt123
/

WaveGRU-Text-To-Speech

Running

WaveGRU-Text-To-Speech / wavegru.yaml

NTT123

a slow but working model

df1ad02 over 2 years ago

No virus

257 Bytes

	## dsp
	sample_rate : 24000
	window_length: 50.0 # ms
	hop_length: 12.5 # ms
	mel_min: 1.0e-5 ## need .0 to make it a float
	mel_dim: 80
	n_fft: 2048

	## wavegru
	embed_dim: 32
	rnn_dim: 512
	frames_per_sequence: 67
	num_pad_frames: 62
	upsample_factors: [5, 4, 3, 5]