How to run inference in FP8?

#38
by codewithRiz - opened

We have tested on an A100 as well as an RTX 4070 locally, serving through a Flask API. On average, a 1-minute video takes about 1.5 minutes of inference time. I even tried uploading the video to the server ahead of time and passing just a video ID, so that upload time is removed from the request. I also tried torch.compile, but I'm not sure how to reduce inference time further.
Any tips?