PY007 commited on
Commit
91950d8
·
verified ·
1 Parent(s): ef073d4

Upload folder using huggingface_hub

Browse files
Files changed (2) hide show
  1. README.md +36 -1
  2. assets/logo.jpg +0 -0
README.md CHANGED
@@ -5,5 +5,40 @@ license_name: tencent-hunyuan-community
5
  license_link: LICENSE
6
  ---
7
 
8
- <!-- ## **FastHunyuan** -->
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
9
 
 
5
  license_link: LICENSE
6
  ---
7
 
8
+ <p align="center">
9
+ <img src="assets/logo.jpg" height=100>
10
+ </p>
11
+
12
+ # FastHunyuan Model Card
13
+
14
+ ## Model Details
15
+
16
+ FastHunyuan is an accelerated [HunyuanVideo](https://huggingface.co/tencent/HunyuanVideo) model. It can sample high quality videos with 6 diffusion steps. That brings around 8X speed up compared to the original HunyuanVideo with 50 steps.
17
+
18
+ - **Developed by** [Hao AI Lab](https://hao-ai-lab.github.io/)
19
+ - **License** tencent-hunyuan-community
20
+ - **Distilled from** [HunyuanVideo](https://huggingface.co/tencent/HunyuanVideo)
21
+
22
+
23
+ ## [Github Repository](https://github.com/hao-ai-lab/FastVideo)
24
+
25
+ ## Usage
26
+
27
+ - Clone [Fastvideo](https://github.com/hao-ai-lab/FastVideo) repository and follow the inference instructions in the README.
28
+ - Alternatively, you can inference FastHunyuan using the official [Hunyuan Video repository](https://github.com/Tencent/HunyuanVideo) by **setting the shift to 17 and steps to 6**.
29
+
30
+ ## Training details
31
+
32
+ FastHunyuan is consistency distillated on the [MixKit](https://huggingface.co/datasets/LanguageBind/Open-Sora-Plan-v1.1.0/tree/main) dataset with the following hyperparamters:
33
+ - Batch size: 16
34
+ - Resulotion: 720x1280
35
+ - Num of frames: 125
36
+ - Train steps: 320
37
+ - GPUs: 32
38
+ - LR: 1e-6
39
+ - Loss: huber
40
+
41
+ ## Evaluation
42
+ We provide some qualitative comparison between FastHunyuan 6 step inference v.s. the original Hunyuan with 6 step inference:
43
+
44
 
assets/logo.jpg ADDED