README.md · MaverickAlex/R-FLAV-B-1-LS at main

metadata

license: cc-by-nc-4.0
tags:
  - audio-to-video
library_name: diffusers

Models of R-FLAV trained on Landscape and AIST++ for 400k iterations.

For more info, please refer to the Github repository at https://github.com/ErgastiAlex/R-FLAV

To download the ckpts directly in the code you can do

from huggingface_hub import hf_hub_download
import torch
from models import FLAV

model = FLAV.from_pretrained(args.model_ckpt)

hf_hub_download(repo_id="MaverickAlex/R-FLAV-B-1-LS", filename="vocoder/config.json")
vocoder_path = hf_hub_download(repo_id="MaverickAlex/R-FLAV-B-1-LS", filename="vocoder/vocoder.pt")

vocoder_path = vocoder_path.replace("vocoder.pt", "")
vocoder = Generator.from_pretrained(vocoder_path)