RLHFlow

university
Activity Feed

AI & ML interests

Workflow of Reinforcement Learning from Human Feedback (RLHF). Blog: https://rlhflow.github.io/

Recent Activity

RLHFlow's activity

hendrydong 
in RLHFlow/LLaMA3.2-1B-SFT about 1 month ago

the training data for this model?

1
#1 opened about 1 month ago by
AIR-hl