Text encoder is actually llama 2, is it not?
#2 by justheuristic - opened
README.md CHANGED
@@ -7,10 +7,10 @@
 
 Lumina-T2I is a model that generates images based on text conditions, supporting various text encoders and models of different parameter sizes. With minimal training costs, it achieves high-quality image generation by training from scratch. Additionally, it offers usage through CLI console programs and Web Demo displays.
 
-Our generative model has `LargeDiT` as the backbone, the text encoder is the `
+Our generative model has `LargeDiT` as the backbone, the text encoder is the `Llama 2` 7B model, and the VAE uses a version of `sdxl` fine-tuned by stabilityai.
 
 - Generation Model: Large-DiT
-- Text Encoder:
+- Text Encoder: Llama 2 7B
 - VAE: stabilityai/sd-vae-ft-sdxl
 
 ## 📰 News
 
@@ -23,7 +23,7 @@ More checkpoints of our model will be released soon~
 
 | Resolution | Flag-DiT Parameter| Text Encoder | Prediction | Download URL |
 | ---------- | ----------------------- | ------------ | -----------|-------------- |
-| 1024 | 5B |
+| 1024 | 5B | Llama 2 7B | Rectified Flow | [hugging face](https://huggingface.co/Alpha-VLLM/Lumina-T2I) |
 
 ## Installation
 