---
license: afl-3.0
language:
- en
base_model:
- black-forest-labs/FLUX.1-dev
- stabilityai/stable-diffusion-xl-base-1.0
- stable-diffusion-v1-5/stable-diffusion-v1-5
pipeline_tag: text-to-image
tags:
- art
- image
---
# 3DIS: DEPTH-DRIVEN DECOUPLED INSTANCE SYNTHESIS FOR TEXT-TO-IMAGE GENERATION

- ## 🔓 Code is now open source!
- ## 🌐 Project Website: https://limuloo.github.io/3DIS/
- ## 🏠 GitHub Repository: https://github.com/limuloo/3DIS
- ## 📄 3DIS Paper: https://arxiv.org/abs/2410.12669
- ## 📄 3DIS-FLUX Tech Report: https://arxiv.org/abs/2501.05131

![image/png](https://cdn-uploads.huggingface.co/production/uploads/64e99fc07e2ec711a7138262/1eJLZFN88B1qpp8iJGy8U.png)

## Usage
This repository hosts the pretrained weights of 3DIS. The inference code is available in our [GitHub Repository](https://github.com/limuloo/3DIS).

## Citation
If you find this repository useful, please cite our work using the following BibTeX entries.
```
@article{zhou20243dis,
  title={3DIS: Depth-driven decoupled instance synthesis for text-to-image generation},
  author={Zhou, Dewei and Xie, Ji and Yang, Zongxin and Yang, Yi},
  journal={arXiv preprint arXiv:2410.12669},
  year={2024}
}

@article{zhou20253disflux,
  title={3DIS-FLUX: simple and efficient multi-instance generation with DiT rendering},
  author={Zhou, Dewei and Xie, Ji and Yang, Zongxin and Yang, Yi},
  journal={arXiv preprint arXiv:2501.05131},
  year={2025}
}
```