File size: 2,350 Bytes

b11e796
 
adbb738
 
 
 
fe48386
4ac1cdc
fe48386
 
b11e796
adbb738
 
f832baf
 
adbb738
 
 
4195d2a
 
3d1822c
adbb738
 
3d1822c
adbb738

---
license: mit
tags:
- image-to-image
- text-to-image
- segmentation-to-image
- controlnet
- stable-diffusion
language:
- en
---

# PAIR-Diffusion Model Card - Stable Diffusion 1.5 finetuned with COCO-Stuff
[PAIR Diffusion](https://arxiv.org/abs/2303.17546) models an image as composition of multiple objects and aim to control structural and appearance properties of these objects. It allows reference image-guided appearance manipulation and structure editing of an image at an object level. Describing object appearances using text can be challenging and ambiguous, PAIR Diffusion  enables a user to control the appearance of an object using images. Having fine-grained control over appearance and structure at object level can be beneficial for future works in video and 3D beside image editing, 
            where we need to have consistent appearance across time in case of video or across various viewing positions in case of 3D.

For more information please refer to [GitHub](https://github.com/Picsart-AI-Research/PAIR-Diffusion/), [arXiv](https://arxiv.org/abs/2303.17546)

![Main Diagram](Main_Diag.png)&nbsp;&nbsp;


This model card applies the method to Stable Diffusion 1.5 (SDv1.5). We used COCO-Stuff dataset to finetune SDv1.5 using ControlNet due to its efficiency. The model can be tested using the publicly available demo here [![Hugging Face Spaces](https://img.shields.io/badge/%F0%9F%A4%97%20Hugging%20Face-Spaces-blue)](https://huggingface.co/spaces/PAIR/PAIR-Diffusion)

&nbsp;&nbsp;

## Model Details

- **Developed by:** Vidit Goel, Elia Peruzzo, Yifan Jiang, Dejia Xu, Nicu Sebe, Trevor, Darrell, Atlas Wang, Humphrey Shi
- **Model type:** Object Level Editing using Diffusion Models
- **Language(s):** English
- **License:** MIT
- **Resources for more information:** [GitHub Repository](https://github.com/Picsart-AI-Research/PAIR-Diffusion/), [Paper](https://arxiv.org/abs/2303.17546).
- **Cite as:**
```
      @article{goel2023pair,
            title={PAIR-Diffusion: Object-Level Image Editing with Structure-and-Appearance Paired Diffusion Models},
            author={Goel, Vidit and Peruzzo, Elia and Jiang, Yifan and Xu, Dejia and Sebe, Nicu and Darrell, Trevor and 
            Wang, Zhangyang and Shi, Humphrey},
            journal={arXiv preprint arXiv:2303.17546},
            year={2023} 
            }
```