File size: 5,398 Bytes
e8c52a1 9851dcb e8c52a1 213e242 e8c52a1 9851dcb e8c52a1 2b5fb3d 64f1c96 c1955a6 64f1c96 213e242 2b5fb3d b0d4b29 213e242 2b5fb3d 73b750e b0d4b29 73b750e 213e242 e8c52a1 84fe89a b6e103c ca7b976 b6e103c 84fe89a |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 |
---
license: creativeml-openrail-m
tags:
- pytorch
- diffusers
- stable-diffusion
- text-to-image
- diffusion-models-class
- dreambooth-hackathon
- landscape
widget:
- text: a photo of ppaine landscape at night, NIKON Z FX
---
# Dreambooth Hackaton 23': How can we use a text-to-image generative model to explore the cinematographic appeal of Torres del Paine 🇨🇱?
> _Torres del Paine National Park is a national park encompassing mountains, glaciers, lakes, and rivers in southern Chilean Patagonia._
> _It is also part of the End of the World Route, a tourist scenic route. [Wikipedia](https://en.wikipedia.org/wiki/Torres_del_Paine_National_Park)_
## Description
DreamBooth model for the ppaine concept trained by alkzar90 on the alkzar90/torres-del-paine dataset.
This is a Stable Diffusion model fine-tuned on the ppaine concept with DreamBooth. It can be used by modifying the `instance_prompt`: **a photo of ppaine landscape**
This model was created as part of the DreamBooth Hackathon 🔥. Visit the [organisation page](https://huggingface.co/dreambooth-hackathon) for instructions on how to take part!
This is a Stable Diffusion model fine-tuned on `landscape` images for the landscape theme.
## Cinematographics rendering
<figure>
<img src="https://huggingface.co/alkzar90/ppaine-landscape/resolve/main/assets/dreambooth-hackaton-patagonia-cinematographics.png" alt="Patagonia Landscape Model - Cinematographic Renderings/Artifacts Inmersion">
<figcaption>Figure 1: <b>Cinematographics renderings and object/artifacts insertions in the Chilean Torres del Paine national park</b>. Text prompts for generated images up-to-down rows and left-to-right; (i) <i>"The ppaine landscape in the middle earth, cinematic light, lord of the ring style, epic"</i>,
(ii) <i>"The ppaine landscape in the middle earth, a visible dragon skeleton bones, cinematic light, lord of the ring style, epic"</i>,
(iii) <i>"A long branches forest in the ppaine landscape, mountain peaks at the background, cinematic light, realistic, lord of the ring style, epic"</i>,
(iv) <i>"A futuristic jeep riding in ppaine landscape, cinematic light, technology</i>,
(v) <i>"A futuristic tensor airship flying over the ppaine landscape at night, NIKON-Z-FX"</i>,
(vi) <i>"A huge tensor bridge in the ppaine landscape, cinematic light, majestic, architecture"</i>.
</figcaption>
</figure>
## Director's eye view
What does the director's cut concept mean? The definition by the [Merriam-Webster dictionary](https://www.merriam-webster.com/dictionary/director%27s%20cut#:~:text=noun,version%20created%20for%20general%20distribution): _"a version of a motion picture that is edited according to the director's wishes and that usually includes scenes cut from the version created for general distribution"_.
<figure>
<img src="https://huggingface.co/alkzar90/ppaine-landscape/resolve/main/assets/dreambooth-hackaton-patagonia-wes-anderson-cut.png" alt="Patagonia Landscape Model - Wes Anderson's cut">
<figcaption>Figure 2: <b>Illustration of the director cuts of the Chilean Torres del Paine national park, in Wes Anderson's eyes</b>. Text prompts for generated images left-to-right;
(i) <i>"The ppaine landscape, Wes Anderson style, cinematic light"</i>,
(ii) <i>"The ppaine landscape with a small house in the middle, Wes Anderson style, fish eye"</i>,
(iii) <i>"The ppaine landscape with a small house in the middle, Wes Anderson style, fish eye"</i>.
</figcaption>
</figure>
## Artistic Style Transfer
<figure>
<img src="https://huggingface.co/alkzar90/ppaine-landscape/resolve/main/assets/dreambooth-hackaton-patagonia-landscape-painting.png" alt="Patagonia Landscape Model - Artist Style Painting">
<figcaption>Figure 3: <b>Artistic renderings of the Chilean Torres del Paine national park in the style of famous painters</b>. Text prompts for generated images up-to-down rows and left-to-right;
(i) <i>"A painting of the ppaine landscape, Vincent Van Gogh style"</i>,
(ii) <i>"A painting of the ppaine landscape, Michelangelo style"</i>,
(iii) <i>"A painting of the ppaine landscape, Botero style"</i>,
(iv) <i>"A painting of the ppaine landscape, Pierre-Auguste Renoir style"</i>,
(v) <i>"A painting of the ppaine landscape, Leonardo Da Vinci style"</i>,
(vi) <i>"A painting of the ppaine landscape, Rembrandt style"</i>.
</figcaption>
</figure>
## Usage
```python
from diffusers import StableDiffusionPipeline
pipeline = StableDiffusionPipeline.from_pretrained('alkzar90/ppaine-landscape')
image = pipeline().images[0]
image
```
## References
* [DreamBooth: Fine Tuning Text-to-Image Diffusion Models for Subject-Driven Generation (Ruiz et al. 2022)](https://arxiv.org/abs/2208.12242})
* [High-Resolution Image Synthesis with Latent Diffusion Models (Rombach et al., 2022 )](https://arxiv.org/abs/2112.10752)
* [Training Stable Diffusion with Dreambooth using 🧨 Diffusers (Post)](https://huggingface.co/blog/dreambooth)
* [Hugging Face DreamBooth Hackathon](https://github.com/huggingface/diffusion-models-class/tree/main/hackathon)
## Thanks to John Whitaker and Lewis Tunstall
Thanks to [John Whitaker](https://github.com/johnowhitaker) and [Lewis Tunstall](https://github.com/lewtun) for writing out and describing the initial hackathon parameters at https://huggingface.co/dreambooth-hackathon. |