Jingcheng Hu
reign12
AI & ML interests
Foundation models and alignment
Organizations
reign12's activity
Add paper link
#3 opened 27 days ago
by
AdinaY
![](https://cdn-avatars.huggingface.co/v1/production/uploads/63a369d98c0c89dcae3b8329/6OUJ7Hc9T1jXynYH3FGaf.png)
33B when?
2
#8 opened 8 months ago
by
nova434431
Question about evaluating this reward model on Anthropic/hh-rlhf
1
#4 opened about 1 year ago
by
songff
More details on training data for reward model
#2 opened 10 months ago
by
reign12
![](https://cdn-avatars.huggingface.co/v1/production/uploads/625026b7d2d191ac43320c5e/K-Fn3v2KwNyg9QzhKB4vH.jpeg)
How is this dataset filtered?
#1 opened 11 months ago
by
reign12
![](https://cdn-avatars.huggingface.co/v1/production/uploads/625026b7d2d191ac43320c5e/K-Fn3v2KwNyg9QzhKB4vH.jpeg)
大神是怎么收集这么多高质量的数据的啊
3
#1 opened about 1 year ago
by
leonall