File size: 5,398 Bytes
e8c52a1
 
 
 
 
 
 
 
 
 
 
 
 
 
9851dcb
e8c52a1
213e242
 
 
e8c52a1
 
 
9851dcb
 
 
 
 
e8c52a1
 
 
 
2b5fb3d
64f1c96
 
 
c1955a6
64f1c96
 
 
 
 
 
 
213e242
2b5fb3d
 
 
 
 
b0d4b29
 
 
 
 
 
 
 
213e242
2b5fb3d
 
 
73b750e
 
b0d4b29
73b750e
 
 
 
 
 
 
 
213e242
e8c52a1
 
 
 
 
 
 
 
 
84fe89a
b6e103c
 
 
ca7b976
b6e103c
 
 
 
 
84fe89a
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
---
license: creativeml-openrail-m
tags:
- pytorch
- diffusers
- stable-diffusion
- text-to-image
- diffusion-models-class
- dreambooth-hackathon
- landscape
widget:
- text: a photo of ppaine landscape at night, NIKON Z FX
---

# Dreambooth Hackaton 23': How can we use a text-to-image generative model to explore the cinematographic appeal of Torres del Paine 🇨🇱?

> _Torres del Paine National Park is a national park encompassing mountains, glaciers, lakes, and rivers in southern Chilean Patagonia._
> _It is also part of the End of the World Route, a tourist scenic route. [Wikipedia](https://en.wikipedia.org/wiki/Torres_del_Paine_National_Park)_


## Description

DreamBooth model for the ppaine concept trained by alkzar90 on the alkzar90/torres-del-paine dataset.

This is a Stable Diffusion model fine-tuned on the ppaine concept with DreamBooth. It can be used by modifying the `instance_prompt`: **a photo of ppaine landscape**

This model was created as part of the DreamBooth Hackathon 🔥. Visit the [organisation page](https://huggingface.co/dreambooth-hackathon) for instructions on how to take part!

This is a Stable Diffusion model fine-tuned on `landscape` images for the landscape theme.


## Cinematographics rendering

<figure>
  <img src="https://huggingface.co/alkzar90/ppaine-landscape/resolve/main/assets/dreambooth-hackaton-patagonia-cinematographics.png" alt="Patagonia Landscape Model - Cinematographic Renderings/Artifacts Inmersion">
  <figcaption>Figure 1: <b>Cinematographics  renderings and object/artifacts insertions in the Chilean Torres del Paine national park</b>. Text prompts for generated images up-to-down rows and left-to-right; (i) <i>"The ppaine landscape in the middle earth, cinematic light, lord of the ring style, epic"</i>,
    (ii) <i>"The ppaine landscape in the middle earth, a visible dragon skeleton bones, cinematic light, lord of the ring style, epic"</i>, 
    (iii) <i>"A long branches forest in the ppaine landscape, mountain peaks at the background, cinematic light, realistic, lord of the ring style, epic"</i>, 
    (iv) <i>"A futuristic jeep riding in ppaine landscape, cinematic light, technology</i>, 
    (v) <i>"A futuristic tensor airship flying over the ppaine landscape at night, NIKON-Z-FX"</i>, 
    (vi) <i>"A huge tensor bridge in the ppaine landscape, cinematic light, majestic, architecture"</i>.
  </figcaption>
</figure>


## Director's eye view

What does the director's cut concept mean? The definition by the [Merriam-Webster dictionary](https://www.merriam-webster.com/dictionary/director%27s%20cut#:~:text=noun,version%20created%20for%20general%20distribution): _"a version of a motion picture that is edited according to the director's wishes and that usually includes scenes cut from the version created for general distribution"_.

<figure>
  <img src="https://huggingface.co/alkzar90/ppaine-landscape/resolve/main/assets/dreambooth-hackaton-patagonia-wes-anderson-cut.png" alt="Patagonia Landscape Model - Wes Anderson's cut">
  <figcaption>Figure 2: <b>Illustration of the director cuts of the Chilean Torres del Paine national park, in Wes Anderson's eyes</b>. Text prompts for generated images left-to-right; 
    (i) <i>"The ppaine landscape, Wes Anderson style, cinematic light"</i>,
    (ii) <i>"The ppaine landscape with a small house in the middle, Wes Anderson style, fish eye"</i>, 
    (iii) <i>"The ppaine landscape with a small house in the middle, Wes Anderson style, fish eye"</i>.
  </figcaption>
</figure>


## Artistic Style Transfer

<figure>
  <img src="https://huggingface.co/alkzar90/ppaine-landscape/resolve/main/assets/dreambooth-hackaton-patagonia-landscape-painting.png" alt="Patagonia Landscape Model - Artist Style Painting">
  <figcaption>Figure 3: <b>Artistic renderings of the Chilean Torres del Paine national park in the style of famous painters</b>. Text prompts for generated images up-to-down rows and left-to-right; 
    (i) <i>"A painting of the ppaine landscape, Vincent Van Gogh style"</i>,
    (ii) <i>"A painting of the ppaine landscape, Michelangelo style"</i>, 
    (iii) <i>"A painting of the ppaine landscape, Botero style"</i>, 
    (iv) <i>"A painting of the ppaine landscape, Pierre-Auguste Renoir style"</i>, 
    (v) <i>"A painting of the ppaine landscape, Leonardo Da Vinci style"</i>, 
    (vi) <i>"A painting of the ppaine landscape, Rembrandt style"</i>.
  </figcaption>
</figure>

## Usage

```python
from diffusers import StableDiffusionPipeline

pipeline = StableDiffusionPipeline.from_pretrained('alkzar90/ppaine-landscape')
image = pipeline().images[0]
image
```

## References

* [DreamBooth: Fine Tuning Text-to-Image Diffusion Models for Subject-Driven Generation (Ruiz et al. 2022)](https://arxiv.org/abs/2208.12242})
* [High-Resolution Image Synthesis with Latent Diffusion Models (Rombach et al., 2022 )](https://arxiv.org/abs/2112.10752)
* [Training Stable Diffusion with Dreambooth using 🧨 Diffusers (Post)](https://huggingface.co/blog/dreambooth)
* [Hugging Face DreamBooth Hackathon](https://github.com/huggingface/diffusion-models-class/tree/main/hackathon)



## Thanks to John Whitaker and Lewis Tunstall

Thanks to [John Whitaker](https://github.com/johnowhitaker) and [Lewis Tunstall](https://github.com/lewtun) for writing out and describing the initial hackathon parameters at https://huggingface.co/dreambooth-hackathon.