File size: 7,096 Bytes
e8c52a1 cefa125 e8c52a1 9851dcb e8c52a1 213e242 cefa125 45a74ab e8c52a1 9851dcb e8c52a1 2249847 64f1c96 cefa125 c1955a6 64f1c96 213e242 65e6931 cefa125 65e6931 2b5fb3d 1712fd4 2b5fb3d b0d4b29 cefa125 65e6931 b0d4b29 213e242 2b5fb3d 73b750e cefa125 65e6931 73b750e 213e242 e8c52a1 84fe89a b6e103c ca7b976 b6e103c 84fe89a |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 111 112 113 114 115 116 117 |
---
license: creativeml-openrail-m
tags:
- pytorch
- diffusers
- stable-diffusion
- text-to-image
- diffusion-models-class
- dreambooth-hackathon
- landscape
widget:
- text: a photo of ppaine landscape, NIKON Z FX, cinematic light, galaxy sky
---
# Dreambooth Hackaton 23': How can we use a text-to-image generative model to explore the cinematographic appeal of Torres del Paine 🇨🇱?
> _Torres del Paine National Park is a national park encompassing mountains, glaciers, lakes, and rivers in southern Chilean Patagonia._
> _It is also part of the End of the World Route, a tourist scenic route. [Wikipedia](https://en.wikipedia.org/wiki/Torres_del_Paine_National_Park)_
<figure>
<img src="https://huggingface.co/alkzar90/ppaine-landscape/resolve/main/assets/snowscat-H3oXiq7_bII-unsplash.jpg" alt="Torres del Paine Snowcatt photo, Unsplash">
<figcaption>Portrait: Snowscat photo, source Unsplash.
</figcaption>
</figure>
- Reddit post: [Dreambooth Hackaton: How can we use a text-to-image model to explore the cinematographic appeal of Torres del Paine 🇨🇱?](https://www.reddit.com/r/StableDiffusion/comments/109fjdu/dreambooth_hackaton_how_can_we_use_a_texttoimage/)
## Description
DreamBooth model for the ppaine concept trained by alkzar90 on the alkzar90/torres-del-paine dataset.
This is a Stable Diffusion model fine-tuned on the ppaine concept with DreamBooth. It can be used by modifying the `instance_prompt`: **a photo of ppaine landscape**
This model was created as part of the DreamBooth Hackathon 🔥. Visit the [organisation page](https://huggingface.co/dreambooth-hackathon) for instructions on how to take part!
This is a Stable Diffusion model fine-tuned on `landscape` images for the landscape theme.
## Cinematographics rendering & Object/Artifacts insertion
<figure>
<img src="https://huggingface.co/alkzar90/ppaine-landscape/resolve/main/assets/dreambooth-hackaton-patagonia-cinematographics.png" alt="Torres del Paine Landscape Model - Cinematographic Renderings/Artifacts Inmersion">
<figcaption>Figure 1: <b>Cinematographics renderings and object/artifacts insertions in the Chilean Torres del Paine national park</b>. Text prompts for generated images up-to-down rows and left-to-right; (i) <i>"The ppaine landscape in the middle earth, cinematic light, lord of the ring style, epic"</i>,
(ii) <i>"The ppaine landscape in the middle earth, a visible dragon skeleton bones, cinematic light, lord of the ring style, epic"</i>,
(iii) <i>"A long branches forest in the ppaine landscape, mountain peaks at the background, cinematic light, realistic, lord of the ring style, epic"</i>,
(iv) <i>"A futuristic jeep riding in ppaine landscape, cinematic light, technology</i>,
(v) <i>"A futuristic tensor airship flying over the ppaine landscape at night, NIKON-Z-FX"</i>,
(vi) <i>"A huge tensor bridge in the ppaine landscape, cinematic light, majestic, architecture"</i>.
</figcaption>
</figure>
### Animal Statues
<figure>
<img src="https://huggingface.co/alkzar90/ppaine-landscape/resolve/main/assets/animal-statues/a-photo-of-an-ancient-stone-condor-statue-in-the-ppaine-landscape%2C-michaelangelo%2C-majestic%2C-NIKON-Z-FX%2C-28mm.png" alt="Condor statue in Torres del Paine landscape">
<figcaption>Figure 2-a: <b>Animal statues in the Chilean Torres del Paine national park</b>. Text prompts for the image: <i>"A photo of an ancient stone condor statue in the ppaine landscape, michaelangelo, majestic, NIKON-Z-FX, 28mm"</i>,
</figcaption>
</figure>
<figure>
<img src="https://huggingface.co/alkzar90/ppaine-landscape/resolve/main/assets/animal-statues/a-photo-of-a-marble-huemul-statue-in-the-ppaine-landscape%2C-majestic%2C-michaelangelo%2C-NIKON-Z-FX%2C-28mm%20(1).png" alt="Huemul marble statue in Torres del Paine landscape">
<figcaption>Figure 2-b: <b>Animal statues in the Chilean Torres del Paine national park</b>. Text prompts for the image: <i>"A photo of a marble huemul stattue in the ppaine landscape, majestic, michaelangelo, NIKON-Z-FX, 28mm"</i>,
</figcaption>
</figure>
## Director's eye view
What does the director's cut concept mean? The definition by the [Merriam-Webster dictionary](https://www.merriam-webster.com/dictionary/director%27s%20cut#:~:text=noun,version%20created%20for%20general%20distribution) is: _"a version of a motion picture that is edited according to the director's wishes and that usually includes scenes cut from the version created for general distribution"_.
<figure>
<img src="https://huggingface.co/alkzar90/ppaine-landscape/resolve/main/assets/dreambooth-hackaton-patagonia-wes-anderson-cut.png" alt="Torres del Paine Landscape Model - Wes Anderson's cut">
<figcaption>Figure 3: <b>Illustration of the director cuts of the Chilean Torres del Paine national park, in Wes Anderson's eyes</b>. Text prompts for generated images left-to-right;
(i) <i>"The ppaine landscape, Wes Anderson style, cinematic light"</i>,
(ii) <i>"The ppaine landscape with a small house in the middle, Wes Anderson style, fish eye"</i>,
(iii) <i>"The ppaine landscape with a small house in the middle, Wes Anderson style, fish eye"</i>.
</figcaption>
</figure>
## Artistic Style Transfer
<figure>
<img src="https://huggingface.co/alkzar90/ppaine-landscape/resolve/main/assets/dreambooth-hackaton-patagonia-landscape-painting.png" alt="Torres del Paine Landscape Model - Artist Style Painting">
<figcaption>Figure 4: <b>Artistic renderings of the Chilean Torres del Paine national park in the style of famous painters</b>. Text prompts for generated images up-to-down rows and left-to-right;
(i) <i>"A painting of the ppaine landscape, Vincent Van Gogh style"</i>,
(ii) <i>"A painting of the ppaine landscape, Michelangelo style"</i>,
(iii) <i>"A painting of the ppaine landscape, Botero style"</i>,
(iv) <i>"A painting of the ppaine landscape, Pierre-Auguste Renoir style"</i>,
(v) <i>"A painting of the ppaine landscape, Leonardo Da Vinci style"</i>,
(vi) <i>"A painting of the ppaine landscape, Rembrandt style"</i>.
</figcaption>
</figure>
## Usage
```python
from diffusers import StableDiffusionPipeline
pipeline = StableDiffusionPipeline.from_pretrained('alkzar90/ppaine-landscape')
image = pipeline().images[0]
image
```
## References
* [DreamBooth: Fine Tuning Text-to-Image Diffusion Models for Subject-Driven Generation (Ruiz et al. 2022)](https://arxiv.org/abs/2208.12242})
* [High-Resolution Image Synthesis with Latent Diffusion Models (Rombach et al., 2022 )](https://arxiv.org/abs/2112.10752)
* [Training Stable Diffusion with Dreambooth using 🧨 Diffusers (Post)](https://huggingface.co/blog/dreambooth)
* [Hugging Face DreamBooth Hackathon](https://github.com/huggingface/diffusion-models-class/tree/main/hackathon)
## Thanks to John Whitaker and Lewis Tunstall
Thanks to [John Whitaker](https://github.com/johnowhitaker) and [Lewis Tunstall](https://github.com/lewtun) for writing out and describing the initial hackathon parameters at https://huggingface.co/dreambooth-hackathon. |