Commit: f7c5396
Parent(s): 5aca2b0
update

Changed files:
- README.md (+67 -20)
- assets/class-level/bear.gif (+3 -0)
- assets/class-level/car-1.gif (+3 -0)
- assets/class-level/husky.gif (+3 -0)
- assets/class-level/pig.gif (+3 -0)
- assets/class-level/posche.gif (+3 -0)
- assets/class-level/tennis.gif (+3 -0)
- assets/class-level/tennis_1cls.gif (+3 -0)
- assets/class-level/tennis_3cls.gif (+3 -0)
- assets/class-level/tiger.gif (+3 -0)
- assets/class-level/wolf.gif (+3 -0)
- assets/{bear_weight.gif → vis/bear_weight.gif} (+0 -0)
- config/part_level/adding_new_object/run_two_man/{running_spider_polar_sunglass.yaml → spider_polar_sunglass.yaml} (+0 -0)
- test.sh (+1 -1)
README.md
CHANGED
@@ -108,32 +108,20 @@ python image_util/sample_video2frames.py --video_path 'your video path' --output
 We segment videos using our ReLER lab's [SAM-Track](https://github.com/z-x-yang/Segment-and-Track-Anything). We suggest using `app.py` in SAM-Track in `gradio` mode to manually select which region of the video you want to edit. We also provide a script `image_util/process_webui_mask.py` to convert the masks from the SAM-Track output path to the VideoGrain input path.


-## 
+## 🔥🔥🔥 VideoGrain Editing

-### Inference
-
-**🔛prepare your config**
-
-VideoGrain is a training-free framework. To run VideoGrain on your video, modify `./config/demo_config.yaml` based on your needs:
-
-1. Replace your pretrained model path and controlnet path in your config. you can change the control_type to `dwpose` or `depth_zoe` or `depth`(midas).
-2. Prepare your video frames and layout masks (edit regions) using SAM-Track or SAM2 in dataset config.
-3. Change the `prompt`, and extract each `local prompt` in the editing prompts. the local prompt order should be same as layout masks order.
-4. Your can change flatten resolution with 1->64, 2->16, 4->8. (commonly, flatten at 64 worked best)
-5. To ensure temporal consistency, you can set `use_pnp: True` and `inject_step:5/10`. (Note: pnp>10 steps will be bad for multi-regions editing)
-6. If you want to visualize the cross attn weight, set `vis_cross_attn: True`
-7. If you want to cluster DDIM Inversion spatial temporal video feature, set `cluster_inversion_feature: True`
-
-**😍Editing your video**
+### 🎨 Inference
+You can reproduce the instance + part-level results in our teaser by running:

 ```bash
 bash test.sh
 #or
-CUDA_VISIBLE_DEVICES=0 accelerate launch test.py --config
+CUDA_VISIBLE_DEVICES=0 accelerate launch test.py --config config/part_level/adding_new_object/run_two_man/spider_polar_sunglass.yaml
 ```

-
+For the other instance-, part-, and class-level results on the VideoGrain project page and in the teaser, we provide all the data (video frames and layout masks) and the corresponding configs; the results are shown in [🚀Multi-Grained Video Editing Results](#multi-grained-video-editing-results).

+<details><summary>The result is saved to `./result`. (Click for the directory structure)</summary>
 ```
 result
 ├── run_two_man
@@ -150,6 +138,28 @@ result
 ```
 </details>

+
+## Editing guidance for YOUR Video
+### 🔛 Prepare your config
+
+VideoGrain is a training-free framework. To run VideoGrain on your own video, modify `./config/demo_config.yaml` to your needs:
+
+1. Replace the pretrained model path and ControlNet path in your config. You can set `control_type` to `dwpose`, `depth_zoe`, or `depth` (MiDaS).
+2. Prepare your video frames and layout masks (edit regions) with SAM-Track or SAM2, and set them in the dataset config.
+3. Change the `prompt` and extract each `local prompt` from the editing prompt. The local prompt order must match the layout mask order.
+4. You can change the flatten resolution: 1 -> 64, 2 -> 16, 4 -> 8 (flattening at 64 usually works best).
+5. To improve temporal consistency, you can set `use_pnp: True` and `inject_step: 5` or `10`. (Note: more than 10 PnP steps hurts multi-region editing.)
+6. To visualize the cross-attention weights, set `vis_cross_attn: True`.
+7. To cluster the DDIM inversion spatial-temporal video features, set `cluster_inversion_feature: True`.
+
+### 😍 Editing your video
+
+```bash
+bash test.sh
+#or
+CUDA_VISIBLE_DEVICES=0 accelerate launch test.py --config /path/to/the/config
+```
+
 ## 🚀Multi-Grained Video Editing Results

 ### 🌈 Multi-Grained Definition
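For reference, here is a minimal sketch of what a config covering steps 1-7 above might look like. It is only an illustration: the key names, nesting, and example values (e.g. `pretrained_model_path`, `controlnet_path`, the `dataset` block, `flatten_resolution`) are assumptions, so check them against `./config/demo_config.yaml` before use.

```yaml
# Hypothetical VideoGrain config sketch assembled from the options above.
# Key names and nesting are assumptions; ./config/demo_config.yaml is authoritative.
pretrained_model_path: ./ckpt/stable-diffusion-v1-5   # assumed checkpoint location
controlnet_path: ./ckpt/controlnet                    # assumed ControlNet location
control_type: dwpose              # or depth_zoe / depth (MiDaS)

dataset:                          # assumed block: where your frames and masks live
  video_path: ./data/run_two_man                      # extracted video frames
  layout_mask_dir: ./data/run_two_man/layout_masks    # edit-region masks from SAM-Track / SAM2

prompt: "a Spider-Man and a polar bear are running in the forest"   # illustrative editing prompt
local_prompts:                    # one per layout mask, in the same order as the masks
  - "a Spider-Man"
  - "a polar bear"

flatten_resolution: 1             # assumed key; 1 -> 64, 2 -> 16, 4 -> 8 (64 usually works best)
use_pnp: True                     # plug-and-play injection for temporal consistency
inject_step: 5                    # keep at 5-10; more than 10 hurts multi-region editing
vis_cross_attn: False             # True to visualize cross-attention weights
cluster_inversion_feature: False  # True to cluster DDIM inversion features
```

With such a file in place, the `bash test.sh` / `accelerate launch test.py --config ...` commands above pick all of these options up from a single YAML file.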
@@ -207,7 +217,7 @@ CUDA_VISIBLE_DEVICES=0 accelerate launch test.py --config config/instance_level
 </tr>
 </table>

-## 🕺
+## 🕺 Part-level Video Editing
 You can get part-level video editing results using the following command:
 ```bash
 CUDA_VISIBLE_DEVICES=0 accelerate launch test.py --config config/part_level/modification/man_text_message/blue_shirt.yaml
@@ -246,6 +256,43 @@ CUDA_VISIBLE_DEVICES=0 accelerate launch test.py --config config/part_level/modi
 <td width=15% style="text-align:center;">superman</td>
 <td width=15% style="text-align:center;">superman + sunglasses</td>
 </tr>
+</table>
+
+## 🥳 Class-level Video Editing
+You can get class-level video editing results using the following command:
+```bash
+CUDA_VISIBLE_DEVICES=0 accelerate launch test.py --config config/class_level/wolf/wolf.yaml
+```
+
+<table class="center">
+<tr>
+<td><img src="assets/class-level/wolf.gif"></td>
+<td><img src="assets/class-level/pig.gif"></td>
+<td><img src="assets/class-level/husky.gif"></td>
+<td><img src="assets/class-level/bear.gif"></td>
+<td><img src="assets/class-level/tiger.gif"></td>
+</tr>
+<tr>
+<td width=15% style="text-align:center;">input</td>
+<td width=15% style="text-align:center;">pig</td>
+<td width=15% style="text-align:center;">husky</td>
+<td width=15% style="text-align:center;">bear</td>
+<td width=15% style="text-align:center;">tiger</td>
+</tr>
+<tr>
+<td><img src="assets/class-level/tennis.gif"></td>
+<td><img src="assets/class-level/tennis_1cls.gif"></td>
+<td><img src="assets/class-level/tennis_3cls.gif"></td>
+<td><img src="assets/class-level/car-1.gif"></td>
+<td><img src="assets/class-level/posche.gif"></td>
+</tr>
+<tr>
+<td width=15% style="text-align:center;">input</td>
+<td width=15% style="text-align:center;">iron man</td>
+<td width=15% style="text-align:center;">Batman + snow court + iced wall</td>
+<td width=15% style="text-align:center;">input</td>
+<td width=15% style="text-align:center;">Porsche</td>
+</tr>
 </table>


@@ -284,7 +331,7 @@ CUDA_VISIBLE_DEVICES=0 accelerate launch test.py --config config/instance_level/
 <td><img src="assets/soely_edit/input.gif"></td>
 <td><img src="assets/vis/edit.gif"></td>
 <td><img src="assets/vis/spiderman_weight.gif"></td>
-<td><img src="assets/bear_weight.gif"></td>
+<td><img src="assets/vis/bear_weight.gif"></td>
 <td><img src="/assets/vis/cherry_weight.gif"></td>
 </tr>
 <tr>
assets/class-level/bear.gif
ADDED (Git LFS)

assets/class-level/car-1.gif
ADDED (Git LFS)

assets/class-level/husky.gif
ADDED (Git LFS)

assets/class-level/pig.gif
ADDED (Git LFS)

assets/class-level/posche.gif
ADDED (Git LFS)

assets/class-level/tennis.gif
ADDED (Git LFS)

assets/class-level/tennis_1cls.gif
ADDED (Git LFS)

assets/class-level/tennis_3cls.gif
ADDED (Git LFS)

assets/class-level/tiger.gif
ADDED (Git LFS)

assets/class-level/wolf.gif
ADDED (Git LFS)
assets/{bear_weight.gif → vis/bear_weight.gif}
RENAMED (file unchanged)

config/part_level/adding_new_object/run_two_man/{running_spider_polar_sunglass.yaml → spider_polar_sunglass.yaml}
RENAMED (file unchanged)
test.sh
CHANGED
@@ -1,2 +1,2 @@
 export CUDA_VISIBLE_DEVICES=0
-accelerate launch test.py --config config/part_level/adding_new_object/run_two_man/
+accelerate launch test.py --config config/part_level/adding_new_object/run_two_man/spider_polar_sunglass.yaml