segmind
/

small-sd

StableDiffusionPipeline

stable-diffusion

stable-diffusion-diffusers

Inference Endpoints

Model card Files Files and versions Community

small-sd / README.md

Warlord-K's picture

Initial Commit

4194411 about 1 year ago

|

No virus

1.3 kB


	---
	license: creativeml-openrail-m
	base_model: SG161222/Realistic_Vision_V4.0
	datasets:
	- recastai/LAION-art-EN-improved-captions
	tags:
	- bksdm
	- bksdm-base
	- stable-diffusion
	- stable-diffusion-diffusers
	- text-to-image
	- diffusers
	inference: true
	---

	# Text-to-image Distillation - Warlord-K/BKSDM-Base-95K

	This pipeline was distilled from SG161222/Realistic_Vision_V4.0 on a Subset of recastai/LAION-art-EN-improved-captions dataset. Below are some example images generated with the finetuned pipeline using the following prompts: ['Portrait of a pretty girl']:

	![val_imgs_grid](./val_imgs_grid.png)


	This Pipeline is based upon [the paper](https://arxiv.org/pdf/2305.15798.pdf). Training Code can be found [here](https://github.com/segmind/BKSDM).

	## Pipeline usage

	You can use the pipeline like so:

	```python
	from diffusers import DiffusionPipeline
	import torch

	pipeline = DiffusionPipeline.from_pretrained("Warlord-K/BKSDM-Base-95K", torch_dtype=torch.float16)
	prompt = "Portrait of a pretty girl"
	image = pipeline(prompt).images[0]
	image.save("my_image.png")
	```

	## Training info

	These are the key hyperparameters used during training:

	* Steps: 95000
	* Learning rate: 1e-4
	* Batch size: 32
	* Gradient accumulation steps: 4
	* Image resolution: 512
	* Mixed-precision: fp16