How can i do RLHF with this model with smaller dataset ?
#3
by
himasai9711
- opened
Hi @chenyh7 , So I'm trying to use this model for document data extraction , so for continuous improvement in the model output, i wanna introduce RLHF , where human will give the correct JSON for the feedback, help with resources for the model.