metadata
license: openrail
pipeline_tag: text-to-image
datasets:
- HuggingFaceTB/everyday-conversations-llama3.1-2k
language:
- ab
metrics:
- bertscore
base_model: microsoft/Phi-3.5-vision-instruct
library_name: espnet
tags:
- art
ImageDream Model Card
This model card covers the models released with the ImageDream paper.
For the code base, see https://github.com/ByteDance/ImageDream.
Description of Files
sd-v2.1-base-4view-ipmv.pt
- the ImageDream-Pixel diffusion model fine-tuned from MVDream v2.1
sd-v2.1-base-4view-ipmv-local.pt
- the ImageDream diffusion model without the pixel controller, fine-tuned from MVDream v2.1
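As a rough illustration of how the two files above might be selected and opened, here is a minimal sketch. The variant labels `pixel` and `local`, the helper names `checkpoint_path` and `load_state_dict`, and the assumption that the weights may sit under a `state_dict` key are all hypothetical and not taken from the ImageDream code base; only the file names come from this card. Loading requires PyTorch, which is imported lazily so the path helper works without it.

```python
from pathlib import Path

# File names as listed in this card; the variant labels are illustrative only.
CHECKPOINTS = {
    "pixel": "sd-v2.1-base-4view-ipmv.pt",        # ImageDream-Pixel
    "local": "sd-v2.1-base-4view-ipmv-local.pt",  # without the pixel controller
}


def checkpoint_path(variant: str, root: str = ".") -> Path:
    """Return the expected on-disk path of a checkpoint variant under `root`."""
    if variant not in CHECKPOINTS:
        raise ValueError(
            f"unknown variant {variant!r}; expected one of {sorted(CHECKPOINTS)}"
        )
    return Path(root) / CHECKPOINTS[variant]


def load_state_dict(variant: str, root: str = "."):
    """Load a checkpoint on CPU and return its weights (requires PyTorch)."""
    import torch  # imported lazily so checkpoint_path() works without PyTorch

    ckpt = torch.load(checkpoint_path(variant, root), map_location="cpu")
    # Assumption: some checkpoints nest the weights under "state_dict".
    if isinstance(ckpt, dict) and "state_dict" in ckpt:
        return ckpt["state_dict"]
    return ckpt
```

For example, `checkpoint_path("local", "checkpoints")` resolves to the second file inside a `checkpoints` directory. For actual inference, the scripts in the linked code base are the authoritative reference.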
Citation
@article{wang2023imagedream,
title={ImageDream: Image-Prompt Multi-view Diffusion for 3D Generation},
author={Wang, Peng and Shi, Yichun},
journal={arXiv preprint arXiv:2312.02201},
year={2023}
}
Misuse, Malicious Use, and Out-of-Scope Use
The model should not be used to intentionally create or disseminate images that create hostile or alienating environments for people. This includes generating images that people would foreseeably find disturbing, distressing, or offensive; or content that propagates historical or current stereotypes.