REBEL Cornell-AGI/REBEL-OpenChat-3.5 Text Generation • Updated about 1 month ago • 84 • 1 Cornell-AGI/REBEL-Llama-3 Text Generation • Updated about 1 month ago • 57 • 1 Cornell-AGI/REBEL-Llama-3-epoch_2 Text Generation • Updated about 1 month ago • 24 • 3 REBEL: Reinforcement Learning via Regressing Relative Rewards Paper • 2404.16767 • Published Apr 25 • 1
REBEL: Reinforcement Learning via Regressing Relative Rewards Paper • 2404.16767 • Published Apr 25 • 1