Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
2
1
6
Zhaolin Gao
GitBag
Follow
kirankc's profile picture
dark-pen's profile picture
2 followers
·
0 following
https://zhaolingao.github.io/
AI & ML interests
Reinforcement Learning from Human Feedback
Recent Activity
updated
a model
2 days ago
GitBag/reasoning_rebel_meta_general_1024_1024_eta_1e4_lr_3e-7_1734672146
updated
a model
2 days ago
GitBag/reasoning_rebel_meta_general_1024_1024_eta_1e6_lr_3e-7_1734682709
updated
a model
2 days ago
GitBag/reasoning_rebel_meta_general_1024_1024_eta_1e5_lr_3e-7_1734677447
View all activity
Articles
RLHF 101: A Technical Dive into RLHF
11 days ago
•
4
Organizations
GitBag
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
liked
3 models
3 months ago
Cornell-AGI/REBEL-Llama-3-Armo-iter_1
Updated
Sep 2
•
11
•
1
Cornell-AGI/REBEL-Llama-3-Armo-iter_2
Updated
Sep 2
•
11
•
2
Cornell-AGI/REBEL-Llama-3-Armo-iter_3
Updated
Sep 2
•
7
•
2
liked
a model
6 months ago
Cornell-AGI/REBEL-Llama-3-epoch_2
Text Generation
•
Updated
Sep 1
•
9
•
3
liked
2 models
7 months ago
Cornell-AGI/REBEL-OpenChat-3.5
Text Generation
•
Updated
Sep 1
•
11
•
1
Cornell-AGI/REBEL-Llama-3
Text Generation
•
Updated
Sep 1
•
20
•
1