---
license: creativeml-openrail-m
language:
- en
tags:
- stable-diffusion
- text-to-image
- diffusers
widget:
- text: bagan, by Vincent van gogh, highly detailed, highly illustration
  example_title: Example Prompt 1
- text: >-
    Establishing shot of a bagan, an epic fantasy, dramatic lighting, cinematic,
    extremely high detail, photorealistic, cinematic lighting, matte painting,
    artstation, by simon stalenhag, uncharted 4: a thief's end
  example_title: Example Prompt 2
- text: >-
    hyper realistic water color painting, transparent, myanmar bagan ancient
    city, after raining sense, beautiful cloud, ancient pagoda, some trees, with
    water splash infront of pagoda, lovely cloud, beautiful golden ratio
    composition, neutral color, moody image, lots of grey, golden ratio
    composition, grey and moody, more grey, rule of third, --ar 5:3  --q 0.5 
    --v 5
  example_title: Example Prompt 3
base_model: runwayml/stable-diffusion-v1-5
metrics:
- code_eval
library_name: diffusers
pipeline_tag: text-to-image
---

# enchanted-bagan-small
![image/png](https://cdn-uploads.huggingface.co/production/uploads/6598b82502c4796342239a35/2z5Xa8Ba1ViaAolxSCaBl.png)
Enchanted-bagan-small is a latent text-to-image diffusion model designed to generate images of Bagan from text input. The quality of the generated images depends heavily on the input prompt.
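
A minimal sketch of how the model might be loaded with the `diffusers` library. It assumes the weights are published under the repository id `Simbolo-Servicio/enchanted-bagan-small` (taken from the citation URL in this card) in the standard Stable Diffusion pipeline layout. The helper name `generate_bagan_image` and the sampler settings are illustrative choices, not values recommended by the authors.

```python
MODEL_ID = "Simbolo-Servicio/enchanted-bagan-small"  # assumed from the citation URL


def generate_bagan_image(prompt, steps=30, guidance_scale=7.5, seed=0):
    """Generate one image for `prompt` (hypothetical helper, not part of the model repo)."""
    # Imported lazily so this module can be read without torch/diffusers installed.
    import torch
    from diffusers import StableDiffusionPipeline

    # Use half precision on a GPU when one is available, full precision on CPU.
    device = "cuda" if torch.cuda.is_available() else "cpu"
    dtype = torch.float16 if device == "cuda" else torch.float32

    pipe = StableDiffusionPipeline.from_pretrained(MODEL_ID, torch_dtype=dtype)
    pipe = pipe.to(device)

    # A fixed seed makes the output reproducible for a given prompt and settings.
    generator = torch.Generator(device=device).manual_seed(seed)
    result = pipe(
        prompt,
        num_inference_steps=steps,
        guidance_scale=guidance_scale,
        generator=generator,
    )
    return result.images[0]  # a PIL image
```

Under these assumptions, `generate_bagan_image("bagan, by Vincent van gogh, highly detailed, highly illustration").save("bagan.png")` would download the weights on first use and write one image to disk.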

### What is Bagan?
Bagan is an ancient city located in the Mandalay Region of Myanmar (formerly Burma). It was the capital of the Kingdom of Pagan from the 9th to the 13th centuries. Bagan is renowned for its vast archaeological site, which features over 2,000 well-preserved Buddhist temples, pagodas, and monasteries spread across the area. These structures were built during the height of the Kingdom of Pagan's power when it was a center of Theravada Buddhism and a major cultural and religious hub in Southeast Asia.

The temples and pagodas in Bagan are notable for their architectural beauty, intricate designs, and historical significance. They range from small, simple structures to towering monuments adorned with elaborate carvings and artwork. Bagan's landscape, with its numerous temples dotting the horizon, is particularly stunning during sunrise and sunset, drawing visitors from around the world to witness the breathtaking views.

In 2019, Bagan was designated as a UNESCO World Heritage Site, recognizing its outstanding universal value and cultural significance. Despite facing challenges such as natural disasters and modern development pressures, Bagan remains one of Myanmar's most iconic and cherished historical destinations.

### Why did we choose to do this?

When we prompted the base Stable Diffusion model to generate an image of Bagan, it produced an image of a pagoda from Thailand instead.
We therefore fine-tuned Stable Diffusion on a large set of Bagan photos so that the model produces clearer, more faithful images of Bagan.


### How to create prompts:
When writing a prompt for Bagan, consider six keyword categories: Subject, Medium, Style, Art-sharing website, Resolution, and Additional details.

Subject -> The subject is what you want to see in the picture. A common mistake is describing the subject in too little detail.

Medium -> The medium is the material the artist works with, such as illustration, oil painting, 3D rendering, or photography. The medium has a strong effect: a single keyword can change the style significantly.

Style -> The style is the artistic style of the image, such as pop art, impressionist, or surrealist.

Art-sharing website -> Specialist art sites such as DeviantArt and ArtStation host large numbers of images across many genres. Naming one of them in the prompt is a reliable way to steer the image toward the styles found there.

Resolution -> Resolution describes how sharp and detailed the image is.

Additional details -> Additional details are modifiers that sweeten the image. For example, adding sci-fi and dystopian elements gives the image a more dystopian, sci-fi feel.

An example prompt for general Bagan is: "bagan, a creepy and eery Halloween setting, with Jack o lanterns on the street and shadow figures lurking about, dynamic lighting, photorealistic fantasy concept art, stunning visuals, creative, cinematic, ultra detailed, trending on art station, spooky vibe". This prompt produces a Halloween-themed image.
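
The six categories above can be assembled programmatically. The following is a small sketch, assuming a plain comma-separated prompt is what the model expects. The names `PromptParts` and `build_prompt` are hypothetical helpers introduced for illustration, and the way the Halloween keywords are sorted into categories is our own reading, not the authors'.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class PromptParts:
    """One field per keyword category (hypothetical container)."""
    subject: str
    medium: str = ""
    style: str = ""
    art_site: str = ""
    resolution: str = ""
    details: List[str] = field(default_factory=list)


def build_prompt(parts: PromptParts) -> str:
    """Join the non-empty categories into one comma-separated prompt."""
    if not parts.subject.strip():
        raise ValueError("subject is required")
    pieces = [
        parts.subject,
        parts.medium,
        parts.style,
        parts.art_site,
        parts.resolution,
        *parts.details,
    ]
    # Skip categories that were left empty so no stray commas appear.
    return ", ".join(p.strip() for p in pieces if p and p.strip())


halloween = PromptParts(
    subject="bagan, a creepy Halloween setting with shadow figures lurking about",
    medium="concept art",
    style="photorealistic fantasy",
    art_site="trending on artstation",
    resolution="ultra detailed",
    details=["dynamic lighting", "cinematic", "spooky vibe"],
)
print(build_prompt(halloween))
```

Keeping the subject mandatory and every other category optional mirrors the advice above: the subject carries most of the meaning, while the remaining keywords refine the result.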

### Contributors:
Main Contributor: [Ye Bhone Lin](https://github.com/Ye-Bhone-Lin)

Supervisor: [Sa Phyo Thu Htet](https://github.com/SaPhyoThuHtet)

Contributors: Thant Htoo San, Min Phone Thit



### Limitation:
The model cannot generate photos of humans.

### Other Work:
Note: These other works are not included in this version.

In our exploration of image generation, we have also worked on Myanmar's architectural landmarks, including Ananda, Shwezigon, Bupaya, Thatbyinnyu, and Mraukoo. Each structure reflects the rich cultural and historical heritage of the region, as captured by our text-to-image generator, General Bagan.

![image/png](https://cdn-uploads.huggingface.co/production/uploads/6598b82502c4796342239a35/MwR8pZ8xd6IXrNrvNL5ru.png)

![image/png](https://cdn-uploads.huggingface.co/production/uploads/6598b82502c4796342239a35/w-7_MOhc0dMt6uEcdPoay.png)
![image/png](https://cdn-uploads.huggingface.co/production/uploads/6598b82502c4796342239a35/TpLTtrQBFLFQmbIvzdF5V.png)

### Cite As:
```bibtex
@misc{enchanted-bagan-small,
  author = {{Ye Bhone Lin} and {Sa Phyo Thu Htet}},
  title = {enchanted-bagan-small},
  url = {https://huggingface.co/Simbolo-Servicio/enchanted-bagan-small},
  urldate = {2024-01-25},
  date = {2024-01-25}
}
```

### References:
Wikipedia (2022). Stable Diffusion. Retrieved From: https://en.wikipedia.org/wiki/Stable_Diffusion

Rombach, R., Blattmann, A., Lorenz, D., Esser, P., & Ommer, B. (2022). High-Resolution Image Synthesis with Latent Diffusion Models. Retrieved From: https://arxiv.org/abs/2112.10752

Naomi Brown (2022). What is Stable Diffusion and How to Use it. Retrieved From: https://www.fotor.com/blog/what-is-stable-diffusion

Mishra, O. (June, 9). Stable Diffusion Explained. Medium. https://medium.com/@onkarmishra/stable-diffusion-explained-1f101284484d