POTION Collection These are the flagship POTION models. Load them and use them with model2vec (https://github.com/MinishLab/model2vec) or sentence-transformers • 5 items • Updated Feb 3 • 10
REINFORCE++: A Simple and Efficient Approach for Aligning Large Language Models Paper • 2501.03262 • Published Jan 4 • 92
Reward Bench Collection Datasets, spaces, and models for the reward model benchmark! • 5 items • Updated 27 days ago • 9
view article Article Accelerated Inference with Optimum and Transformers Pipelines May 10, 2022 • 2