---
tags:
- text-to-image
- stable-diffusion
license: apache-2.0
language:
- en
library_name: diffusers
---
# StyleShot Model Card
<div align="center">
[**Project Page**](https://styleshot.github.io) **|** [**Paper (ArXiv)**](https://arxiv.org/abs/) **|** [**Code**](https://github.com/tencent-ailab/IP-Adapter)
</div>
---
## Introduction
We present StyleShot, a generalized plug-and-play style transfer method that generates high-quality stylized images matching the desired style of any reference image, without test-time style tuning. To the best of our knowledge, StyleShot is the first work to design a style-aware encoder based on Stable Diffusion, which extracts features rich in style expression from the reference image. StyleShot generalizes not only to other custom models fine-tuned from the same base model, but also to controllable generation with existing controllable-generation tools.
![arch](./fig1.png)
## Models
### StyleShot for SD 1.5
- [ip-adapter_sd15.bin](https://huggingface.co/h94/IP-Adapter/blob/main/models/ip-adapter_sd15.bin): uses the global image embedding from OpenCLIP-ViT-H-14 as the condition
- [ip-adapter_sd15_light.bin](https://huggingface.co/h94/IP-Adapter/blob/main/models/ip-adapter_sd15_light.bin): same as ip-adapter_sd15, but more compatible with text prompts
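
The checkpoints listed above can also be fetched programmatically. The following is a minimal sketch, not an official loading recipe: it assumes the `huggingface_hub` package is installed, and the helper names `parse_hf_blob_url` and `download_checkpoint` are hypothetical, introduced here only for illustration. It splits one of the listed `blob` URLs into a repository ID, revision, and file path, then passes them to `hf_hub_download`.

```python
from urllib.parse import urlparse


def parse_hf_blob_url(url: str) -> tuple[str, str, str]:
    """Split a Hugging Face 'blob' URL into (repo_id, revision, filename).

    Hypothetical helper for illustration; not part of any library.
    """
    parts = urlparse(url).path.strip("/").split("/")
    # Expected layout: <owner>/<repo>/blob/<revision>/<path inside the repo>
    if len(parts) < 5 or parts[2] != "blob":
        raise ValueError(f"not a Hugging Face blob URL: {url}")
    return "/".join(parts[:2]), parts[3], "/".join(parts[4:])


def download_checkpoint(url: str) -> str:
    """Download the file behind a blob URL and return its local cache path.

    Hypothetical helper; assumes network access and `huggingface_hub`.
    """
    # Imported lazily so the URL helper works without huggingface_hub installed.
    from huggingface_hub import hf_hub_download

    repo_id, revision, filename = parse_hf_blob_url(url)
    return hf_hub_download(repo_id=repo_id, filename=filename, revision=revision)
```

Calling `download_checkpoint("https://huggingface.co/h94/IP-Adapter/blob/main/models/ip-adapter_sd15.bin")` should return the path of the cached file. How the weights are then attached to a Stable Diffusion pipeline depends on the code release linked at the top of this card.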