RLHFlow MATH Process Reward Model Collection This is a collection of datasets and models of process reward modeling. • 15 items • Updated Nov 9, 2024 • 8
FactAlign Collection Models and datasets of our EMNLP 2024 paper "FactAlign: Long-form Factuality Alignment of Large Language Models" • 7 items • Updated Oct 7, 2024 • 1