jaehong31
/

RACCooN

Model card Files Files and versions Community

RACCooN / README.md

jaehong31

cards

7a0f027 8 months ago

|

1.23 kB

	---
	license: openrail
	---

	# This repo contains the pretrained checkpoints for Ctrl-Adapter


	CTRL-Adapter is an efficient and versatile framework for adding diverse
	spatial controls to any image or video diffusion model. It supports a variety of useful
	applications, including video control, video control with multiple conditions, video control with
	sparse frame conditions, image control, zero-shot transfer to unseen conditions, and video editing.

	See also: https://github.com/jaehong31/RACCooN


	<!-- <br>
	<img width="800" src="assets/raccoon_teaser.png"/>
	<br> -->

	<br>
	<br>

	# Description of VPLM Dataset

	Multi-Objects Description
	- Train: RACCooN/VPLM/gt_train.json
	- Test: RACCooN/VPLM/gt_test.json

	Single-Object Layout Prediction
	- Train: RACCooN/VPLM/gt_train_layouts.json
	- Test: RACCooN/VPLM/gt_test_layouts.json


	# Description of Model Checkpoints

	## V2P

	Multi-Objects Description
	- RACCooN/mllm_finetuned/multi_obj_projector.bin

	Single-Object Description
	- RACCooN/mllm_finetuned/single_obj_projector.bin

	Single-Object Layout Prediction
	- RACCooN/mllm_finetuned/layout_pred_projector.bin


	## P2V
	- RACCooN/unet_finetuned/diffusion_pytorch_model.safetensors