ppaine-landscape / README.md

Add a condor statue image

65e6931 almost 2 years ago

6.25 kB

	---
	license: creativeml-openrail-m
	tags:
	- pytorch
	- diffusers
	- stable-diffusion
	- text-to-image
	- diffusion-models-class
	- dreambooth-hackathon
	- landscape
	widget:
	- text: a photo of ppaine landscape at night, NIKON Z FX
	---

	# Dreambooth Hackaton 23': How can we use a text-to-image generative model to explore the cinematographic appeal of Torres del Paine 🇨🇱?

	> _Torres del Paine National Park is a national park encompassing mountains, glaciers, lakes, and rivers in southern Chilean Patagonia._
	> _It is also part of the End of the World Route, a tourist scenic route. [Wikipedia](https://en.wikipedia.org/wiki/Torres_del_Paine_National_Park)_

	- Reddit post: [Dreambooth Hackaton: How can we use a text-to-image model to explore the cinematographic appeal of Torres del Paine 🇨🇱?](https://www.reddit.com/r/StableDiffusion/comments/109fjdu/dreambooth_hackaton_how_can_we_use_a_texttoimage/)

	## Description

	DreamBooth model for the ppaine concept trained by alkzar90 on the alkzar90/torres-del-paine dataset.

	This is a Stable Diffusion model fine-tuned on the ppaine concept with DreamBooth. It can be used by modifying the `instance_prompt`: a photo of ppaine landscape

	This model was created as part of the DreamBooth Hackathon 🔥. Visit the [organisation page](https://huggingface.co/dreambooth-hackathon) for instructions on how to take part!

	This is a Stable Diffusion model fine-tuned on `landscape` images for the landscape theme.


	## Cinematographics rendering & Object/Artifacts insertion

	<figure>
	<img src="https://huggingface.co/alkzar90/ppaine-landscape/resolve/main/assets/dreambooth-hackaton-patagonia-cinematographics.png" alt="Patagonia Landscape Model - Cinematographic Renderings/Artifacts Inmersion">
	<figcaption>Figure 1: <b>Cinematographics renderings and object/artifacts insertions in the Chilean Torres del Paine national park</b>. Text prompts for generated images up-to-down rows and left-to-right; (i) <i>"The ppaine landscape in the middle earth, cinematic light, lord of the ring style, epic"</i>,
	(ii) <i>"The ppaine landscape in the middle earth, a visible dragon skeleton bones, cinematic light, lord of the ring style, epic"</i>,
	(iii) <i>"A long branches forest in the ppaine landscape, mountain peaks at the background, cinematic light, realistic, lord of the ring style, epic"</i>,
	(iv) <i>"A futuristic jeep riding in ppaine landscape, cinematic light, technology</i>,
	(v) <i>"A futuristic tensor airship flying over the ppaine landscape at night, NIKON-Z-FX"</i>,
	(vi) <i>"A huge tensor bridge in the ppaine landscape, cinematic light, majestic, architecture"</i>.
	</figcaption>
	</figure>

	### Animal Statues

	<figure>
	<img src="https://huggingface.co/alkzar90/ppaine-landscape/resolve/main/assets/animal-statues/a-photo-of-an-ancient-stone-condor-statue-in-the-ppaine-landscape%2C-michaelangelo%2C-majestic%2C-NIKON-Z-FX%2C-28mm.png" alt="Condor statue in Torres del Paine landscape">
	<figcaption>Figure 2: <b>Animal statues in the Chilean Torres del Paine national park</b>. Text prompts for the image: <i>"A photo of an ancient stone condor statue in the ppaine landscape, michaelangelo, majestic, NIKON-Z-FX, 28mm"</i>,
	</figcaption>
	</figure>

	## Director's eye view

	What does the director's cut concept mean? The definition by the [Merriam-Webster dictionary](https://www.merriam-webster.com/dictionary/director%27s%20cut#:~:text=noun,version%20created%20for%20general%20distribution) is: _"a version of a motion picture that is edited according to the director's wishes and that usually includes scenes cut from the version created for general distribution"_.

	<figure>
	<img src="https://huggingface.co/alkzar90/ppaine-landscape/resolve/main/assets/dreambooth-hackaton-patagonia-wes-anderson-cut.png" alt="Patagonia Landscape Model - Wes Anderson's cut">
	<figcaption>Figure 3: <b>Illustration of the director cuts of the Chilean Torres del Paine national park, in Wes Anderson's eyes</b>. Text prompts for generated images left-to-right;
	(i) <i>"The ppaine landscape, Wes Anderson style, cinematic light"</i>,
	(ii) <i>"The ppaine landscape with a small house in the middle, Wes Anderson style, fish eye"</i>,
	(iii) <i>"The ppaine landscape with a small house in the middle, Wes Anderson style, fish eye"</i>.
	</figcaption>
	</figure>


	## Artistic Style Transfer

	<figure>
	<img src="https://huggingface.co/alkzar90/ppaine-landscape/resolve/main/assets/dreambooth-hackaton-patagonia-landscape-painting.png" alt="Patagonia Landscape Model - Artist Style Painting">
	<figcaption>Figure 4: <b>Artistic renderings of the Chilean Torres del Paine national park in the style of famous painters</b>. Text prompts for generated images up-to-down rows and left-to-right;
	(i) <i>"A painting of the ppaine landscape, Vincent Van Gogh style"</i>,
	(ii) <i>"A painting of the ppaine landscape, Michelangelo style"</i>,
	(iii) <i>"A painting of the ppaine landscape, Botero style"</i>,
	(iv) <i>"A painting of the ppaine landscape, Pierre-Auguste Renoir style"</i>,
	(v) <i>"A painting of the ppaine landscape, Leonardo Da Vinci style"</i>,
	(vi) <i>"A painting of the ppaine landscape, Rembrandt style"</i>.
	</figcaption>
	</figure>

	## Usage

	```python
	from diffusers import StableDiffusionPipeline

	pipeline = StableDiffusionPipeline.from_pretrained('alkzar90/ppaine-landscape')
	image = pipeline().images[0]
	image
	```

	## References

	* [DreamBooth: Fine Tuning Text-to-Image Diffusion Models for Subject-Driven Generation (Ruiz et al. 2022)](https://arxiv.org/abs/2208.12242})
	* [High-Resolution Image Synthesis with Latent Diffusion Models (Rombach et al., 2022 )](https://arxiv.org/abs/2112.10752)
	* [Training Stable Diffusion with Dreambooth using 🧨 Diffusers (Post)](https://huggingface.co/blog/dreambooth)
	* [Hugging Face DreamBooth Hackathon](https://github.com/huggingface/diffusion-models-class/tree/main/hackathon)



	## Thanks to John Whitaker and Lewis Tunstall

	Thanks to [John Whitaker](https://github.com/johnowhitaker) and [Lewis Tunstall](https://github.com/lewtun) for writing out and describing the initial hackathon parameters at https://huggingface.co/dreambooth-hackathon.