miulab/llama2-7b-oss-instruct
Text Generation
Models trained/used in the paper "DogeRM: Equipping Reward Models with Domain Knowledge through Model Merging" (https://arxiv.org/abs/2407.01470)