license: apache-2.0 | |
tags: | |
- trl | |
- ddpo | |
- diffusers | |
- reinforcement-learning | |
- text-to-image | |
- stable-diffusion | |
# TRL DDPO Model | |
This is a diffusion model that has been fine-tuned with reinforcement learning to | |
guide the model outputs according to a value, function, or human feedback. The model can be used for image generation conditioned with text. | |