Awesome reward models - a HuggingFaceH4 Collection

updated Apr 12

A curated collection of reward models for use with techniques such as rejection sampling and RLHF/RLAIF.
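
As a rough illustration of how a reward model fits into rejection sampling, the sketch below shows the common best-of-n variant: score several candidate responses to a prompt and keep the highest-scoring one. The names `best_of_n` and `toy_reward` are hypothetical, and `toy_reward` is only a stand-in word-overlap heuristic, not any model from this collection; in practice it would be replaced by a call to a real reward model.

```python
from typing import Callable, List, Tuple


def best_of_n(
    prompt: str,
    candidates: List[str],
    reward_fn: Callable[[str, str], float],
) -> Tuple[str, float]:
    """Return the highest-reward candidate together with its score."""
    if not candidates:
        raise ValueError("need at least one candidate")
    # Score every candidate, then keep the one with the largest reward.
    scored = [(reward_fn(prompt, c), c) for c in candidates]
    best_score, best = max(scored, key=lambda pair: pair[0])
    return best, best_score


def toy_reward(prompt: str, response: str) -> float:
    # Hypothetical stand-in for a reward model (assumption, for illustration):
    # counts how many words of the response also appear in the prompt.
    prompt_words = set(prompt.lower().split())
    return float(sum(w in prompt_words for w in response.lower().split()))


prompt = "explain reward models"
candidates = ["no idea", "reward models score responses", "models"]
best, score = best_of_n(prompt, candidates, toy_reward)
print(best, score)
```

Passing the scorer in as `reward_fn` keeps the selection logic independent of any particular model, so the same loop could be reused with whichever reward model is chosen.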