Fhrozen's picture
add files
23d0252
|
raw
history blame
748 Bytes
metadata
tags:
  - espnet
  - audio
  - audio-to-audio
  - vocoder
language:
  - multilingual
datasets:
  - libritts
  - csj
  - css10
  - aishell3
  - jvs
  - jsss
  - jsut
license: cc-by-4.0

Vocoder model - HifiGAN - Multilingual

No support given.

Details

batch_size: 64
discriminator_params:
  follow_official_norm: true
  period_discriminator_params:
    bias: true
    channels: 32
    downsample_scales:
    - 3
    - 3
    - 3
    - 3
    - 1
    in_channels: 1
    kernel_sizes:
    - 5
    - 3
    max_downsample_channels: 1024
    nonlinear_activation: LeakyReLU
    nonlinear_activation_params:
      negative_slope: 0.1
    out_channels: 1
    use_spectral_norm: false
    use_weight_norm: true
  periods:
  - 2
  - 3
  - 5
  - 7
  - 11