Generative Verifiers: Reward Modeling as Next-Token Prediction Paper โข 2408.15240 โข Published Aug 27, 2024 โข 13 โข 2