---
license: afl-3.0
language:
- en
base_model:
- black-forest-labs/FLUX.1-dev
- stabilityai/stable-diffusion-xl-base-1.0
- stable-diffusion-v1-5/stable-diffusion-v1-5
pipeline_tag: text-to-image
tags:
- art
- image
---

# 3DIS: DEPTH-DRIVEN DECOUPLED INSTANCE SYNTHESIS FOR TEXT-TO-IMAGE GENERATION

- ## 🔓 Code is now open source!

- ## 🌐 Project Website: https://limuloo.github.io/3DIS/

- ## 🏠 GitHub Repository: https://github.com/limuloo/3DIS

- ## 📄 3DIS Paper: https://arxiv.org/abs/2410.12669

- ## 📄 3DIS-FLUX Tech Report: https://arxiv.org/abs/2501.05131

![image/png](https://cdn-uploads.huggingface.co/production/uploads/64e99fc07e2ec711a7138262/1eJLZFN88B1qpp8iJGy8U.png)

## Usage

This is the space for the pretrained-weights of 3DIS. The inference code can be found in our [GitHub Repository](https://github.com/limuloo/3DIS).

## Citation
If you find this repository useful, please use the following BibTeX entry for citation.
```
@article{zhou20243dis,
  title={3dis: Depth-driven decoupled instance synthesis for text-to-image generation},
  author={Zhou, Dewei and Xie, Ji and Yang, Zongxin and Yang, Yi},
  journal={arXiv preprint arXiv:2410.12669},
  year={2024}
}

@article{zhou20253disflux,
  title={3DIS-FLUX: simple and efficient multi-instance generation with DiT rendering},
  author={Zhou, Dewei and Xie, Ji and Yang, Zongxin and Yang, Yi},
  journal={arXiv preprint arXiv:2501.05131},
  year={2025}
}
```