DogeRM - a miulab Collection

updated Oct 8

Models trained/used in the paper "DogeRM: Equipping Reward Models with Domain Knowledge through Model Merging" (https://arxiv.org/abs/2407.01470)