Peng-Wang/ImageDream

	---
	license: openrail
	pipeline_tag: text-to-image
	datasets:
	- HuggingFaceTB/everyday-conversations-llama3.1-2k
	language:
	- ab
	metrics:
	- bertscore
	base_model: microsoft/Phi-3.5-vision-instruct
	library_name: espnet
	tags:
	- art
	---
	# ImageDream Model Card

	This model card describes the model released with the [ImageDream paper](https://image-dream.github.io/).

	The code base is available at https://github.com/ByteDance/ImageDream.

	## Description of Files

	`sd-v2.1-base-4view-ipmv.pt`

	- The ImageDream-Pixel diffusion model, fine-tuned from [MVDream v2.1](https://huggingface.co/MVDream/MVDream).

	`sd-v2.1-base-4view-ipmv-local.pt`

	- The ImageDream diffusion model without the pixel controller, fine-tuned from [MVDream v2.1](https://huggingface.co/MVDream/MVDream).
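
As a rough illustration of how these files might be fetched, the sketch below maps each checkpoint to a short variant name and downloads it with `hf_hub_download` from the `huggingface_hub` library. The repository ID `Peng-Wang/ImageDream` is assumed from this page's header, and the variant names `pixel` and `local` as well as the helper functions are hypothetical labels introduced only for this example; they are not part of the ImageDream code base.

```python
# Assumption: the repository ID is taken from this model card's page header.
REPO_ID = "Peng-Wang/ImageDream"

# Checkpoint files listed above. The keys are hypothetical labels for this sketch.
CHECKPOINTS = {
    "pixel": "sd-v2.1-base-4view-ipmv.pt",        # ImageDream-Pixel
    "local": "sd-v2.1-base-4view-ipmv-local.pt",  # without the pixel controller
}


def checkpoint_filename(variant: str) -> str:
    """Return the checkpoint filename for a variant, or raise ValueError."""
    if variant not in CHECKPOINTS:
        raise ValueError(
            f"unknown variant {variant!r}; expected one of {sorted(CHECKPOINTS)}"
        )
    return CHECKPOINTS[variant]


def download_checkpoint(variant: str) -> str:
    """Download one checkpoint into the local Hugging Face cache and return its path."""
    # Imported lazily so the lookup helper works without huggingface_hub installed.
    from huggingface_hub import hf_hub_download

    return hf_hub_download(repo_id=REPO_ID, filename=checkpoint_filename(variant))


# Example usage (requires network access and `pip install huggingface_hub`):
#   path = download_checkpoint("pixel")
```

The returned path points into the local Hugging Face cache. How the checkpoint is then loaded depends on the ImageDream code base and is not covered here.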

	## Citation

	```
	@article{wang2023imagedream,
	title={ImageDream: Image-Prompt Multi-view Diffusion for 3D Generation},
	author={Wang, Peng and Shi, Yichun},
	journal={arXiv preprint arXiv:2312.02201},
	year={2023}
	}
	```

	## Misuse, Malicious Use, and Out-of-Scope Use

	The model should not be used to intentionally create or disseminate images that create hostile or alienating environments for people. This includes generating images that people would foreseeably find disturbing, distressing, or offensive; or content that propagates historical or current stereotypes.