---
license: mit
tags:
- image-to-image
- text-to-image
- segmentation-to-image
- controlnet
- stable-diffusion
language:
- en
---
# PAIR-Diffusion Model Card - Stable Diffusion 1.5 finetuned with COCO-Stuff
[PAIR Diffusion](https://arxiv.org/abs/2303.17546) models an image as a composition of multiple objects and aims to control the structural and appearance properties of each object. It allows reference image-guided appearance manipulation and structure editing of an image at the object level. Because describing object appearance with text can be challenging and ambiguous, PAIR Diffusion lets a user control the appearance of an object using images. Fine-grained control over appearance and structure at the object level can also benefit future work in video and 3D, beyond image editing, where appearance must stay consistent across time (video) or across viewing positions (3D).
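
The idea of treating an image as a set of objects, each with its own structure and appearance, can be illustrated with a small sketch. This is not the authors' implementation: it assumes structure is a per-object mask taken from a segmentation map and appearance is the mean of some per-pixel feature vector inside that mask. The names `ObjectRep`, `decompose`, and `swap_appearance` are hypothetical, and the toy features stand in for whatever feature extractor the real model uses.

```python
from dataclasses import dataclass


@dataclass
class ObjectRep:
    label: int        # segment id taken from the segmentation map
    mask: list        # structure: H x W booleans marking the object's pixels
    appearance: list  # appearance: mean feature vector over the masked pixels


def decompose(seg, feats):
    """Split an image into per-object (structure, appearance) pairs.

    seg   : H x W nested list of integer segment ids (assumed input)
    feats : H x W nested list of C-dim feature vectors (assumed input)
    """
    sums, counts = {}, {}
    for y, row in enumerate(seg):
        for x, label in enumerate(row):
            vec = feats[y][x]
            acc = sums.setdefault(label, [0.0] * len(vec))
            for c, v in enumerate(vec):
                acc[c] += v
            counts[label] = counts.get(label, 0) + 1

    objects = {}
    for label, acc in sums.items():
        mask = [[s == label for s in row] for row in seg]
        mean = [a / counts[label] for a in acc]
        objects[label] = ObjectRep(label, mask, mean)
    return objects


def swap_appearance(objects, label, reference):
    """Give one object the appearance of a reference object, keeping its mask."""
    edited = dict(objects)
    edited[label] = ObjectRep(label, objects[label].mask, list(reference.appearance))
    return edited


if __name__ == "__main__":
    seg = [[0, 0], [1, 1]]
    feats = [[[1.0, 0.0], [3.0, 0.0]], [[0.0, 2.0], [0.0, 4.0]]]
    objs = decompose(seg, feats)
    edited = swap_appearance(objs, 0, objs[1])
    print(edited[0].appearance)
```

In this toy setting, appearance editing changes only the object's feature vector and structure editing would change only its mask, which is the kind of separation the description above refers to.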
For more information, please refer to the [GitHub repository](https://github.com/Picsart-AI-Research/PAIR-Diffusion/) and the [arXiv paper](https://arxiv.org/abs/2303.17546).
![Main Diagram](Main_Diag.png)
This model card applies the method to Stable Diffusion 1.5 (SDv1.5). We finetuned SDv1.5 on the COCO-Stuff dataset using ControlNet, chosen for its efficiency. The model can be tested in the publicly available demo: [![Hugging Face Spaces](https://img.shields.io/badge/%F0%9F%A4%97%20Hugging%20Face-Spaces-blue)](https://huggingface.co/spaces/PAIR/PAIR-Diffusion)
## Model Details
- **Developed by:** Vidit Goel, Elia Peruzzo, Yifan Jiang, Dejia Xu, Nicu Sebe, Trevor Darrell, Atlas Wang, Humphrey Shi
- **Model type:** Object Level Editing using Diffusion Models
- **Language(s):** English
- **License:** MIT
- **Resources for more information:** [GitHub Repository](https://github.com/Picsart-AI-Research/PAIR-Diffusion/), [Paper](https://arxiv.org/abs/2303.17546).
- **Cite as:**
```
@article{goel2023pair,
title={PAIR-Diffusion: Object-Level Image Editing with Structure-and-Appearance Paired Diffusion Models},
author={Goel, Vidit and Peruzzo, Elia and Jiang, Yifan and Xu, Dejia and Sebe, Nicu and Darrell, Trevor and
Wang, Zhangyang and Shi, Humphrey},
journal={arXiv preprint arXiv:2303.17546},
year={2023}
}
```