How can i do RLHF with this model with smaller dataset ?

#3
by himasai9711 - opened

Hi @chenyh7 , So I'm trying to use this model for document data extraction , so for continuous improvement in the model output, i wanna introduce RLHF , where human will give the correct JSON for the feedback, help with resources for the model.

Sign up or log in to comment