Transformers
PyTorch
English
llama
reward model
RLHF
RLAIF
text-generation-inference
evan-nexusflow
Create config.json
6c6b4d5 verified