my_DDPO / README.md

Nguyen17

Push model using huggingface_hub.

cc2cea5 verified 5 months ago

preview code

raw

history blame contribute delete

363 Bytes

metadata

license: apache-2.0
tags:
  - trl
  - ddpo
  - diffusers
  - reinforcement-learning
  - text-to-image
  - stable-diffusion

TRL DDPO Model

This is a diffusion model that has been fine-tuned with reinforcement learning to guide the model outputs according to a value, function, or human feedback. The model can be used for image generation conditioned with text.