Update README.md
Browse files
README.md
CHANGED
@@ -14,14 +14,25 @@ tags:
|
|
14 |
## Network Arichitecture
|
15 |
![images](./images/ControlNet++.png)
|
16 |
|
17 |
-
|
18 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
19 |
conditions using the same network parameter. 2 Support multiple conditions input without increasing computation offload, which is especially important for designers
|
20 |
who want to edit image in detail, different conditions use the same condition encoder, without adding extra computations or parameters. We do thoroughly experiments
|
21 |
on SDXL and achieve superior performance both in control ability and aesthetic score. We release the method and the model to the open source community to make everyone
|
22 |
can enjoy it.
|
23 |
|
24 |
-
|
25 |
|
26 |
**If you find it useful, please give me a star, thank you very much**
|
27 |
|
|
|
14 |
## Network Arichitecture
|
15 |
![images](./images/ControlNet++.png)
|
16 |
|
17 |
+
## Advantages about the model
|
18 |
+
- Use bucket training like novelai, can generate high resolutions images of any aspect ratio
|
19 |
+
- Use large amount of high quality data(over 10000000 images), the dataset covers a diversity of situation
|
20 |
+
- Use re-captioned prompt like DALLE.3, use CogVLM to generate detailed description, good prompt following ability
|
21 |
+
- Use many useful tricks during training. Including but not limited to date augmentation, mutiple loss, multi resolution
|
22 |
+
- Use almost the same parameter compared with original ControlNet. No obvious increase in network parameter or computation.
|
23 |
+
- Support 10+ control conditions, no obvious performance drop on any single condition compared with training independently
|
24 |
+
- Support multi condition generation, condition fusion is learned during training. No need to set hyperparameter or design prompts.
|
25 |
+
- Compatible with other opensource SDXL models, such as BluePencilXL, CounterfeitXL. Compatible with other Lora models.
|
26 |
+
|
27 |
+
|
28 |
+
***We design a new architecture that can support 10+ control types in condition text-to-image generation and can generate high resolution images visually comparable with
|
29 |
+
midjourney***. The network is based on the original ControlNet architecture, we propose two new modules to: 1 Extend the original ControlNet to support different image
|
30 |
conditions using the same network parameter. 2 Support multiple conditions input without increasing computation offload, which is especially important for designers
|
31 |
who want to edit image in detail, different conditions use the same condition encoder, without adding extra computations or parameters. We do thoroughly experiments
|
32 |
on SDXL and achieve superior performance both in control ability and aesthetic score. We release the method and the model to the open source community to make everyone
|
33 |
can enjoy it.
|
34 |
|
35 |
+
Inference scripts and more details can found: https://github.com/xinsir6/ControlNetPlus/tree/main
|
36 |
|
37 |
**If you find it useful, please give me a star, thank you very much**
|
38 |
|