Steven Liu
stvnl
0 followers · 1 following
AI & ML interests
None yet
Recent Activity
liked a model 28 days ago
deepseek-ai/DeepSeek-R1
updated a model 4 months ago
stvnl/msc_rm_zh
updated a model 4 months ago
stvnl/msc_rm_en
Organizations
None yet
stvnl's activity
liked a model 28 days ago
deepseek-ai/DeepSeek-R1 • Text Generation • Updated 13 days ago • 4.4M • 9.92k
updated 6 models 4 months ago
stvnl/msc_rm_zh • Updated Nov 1, 2024
stvnl/msc_rm_en • Updated Nov 1, 2024
stvnl/msc_rm_zh_margin • Updated Oct 31, 2024
stvnl/msc_rm_en_margin • Updated Oct 31, 2024
stvnl/msc_ppo_zh • Reinforcement Learning • Updated Oct 31, 2024 • 2
stvnl/msc_ppo_en • Reinforcement Learning • Updated Oct 31, 2024 • 2
liked a dataset 4 months ago
CarperAI/openai_summarize_comparisons • Viewer • Updated Feb 27, 2023 • 260k • 2.46k • 40
updated 4 models 4 months ago
stvnl/rm_gpt2_en_20241021_0101 • Updated Oct 21, 2024
stvnl/rm_gpt2_en_20241021_0602 • Updated Oct 21, 2024
stvnl/rm_gpt2_en_20241021_0553 • Updated Oct 21, 2024
stvnl/rm_gpt2_en_20241021_0340 • Updated Oct 21, 2024
liked a model 4 months ago
fnlp/moss-rlhf-reward-model-7B-en • Updated Jul 13, 2023 • 9
updated 5 models 4 months ago
stvnl/rm_bloomz_en_20241019_1853 • Updated Oct 19, 2024
stvnl/rm_bloomz_en_20241019_1547 • Updated Oct 19, 2024
stvnl/rm_bloomz_en_20241019_1531 • Updated Oct 19, 2024
stvnl/RM-Bloomz-EN • Updated Oct 13, 2024
stvnl/rm_bloomz_en_20241012_1153 • Updated Oct 12, 2024
liked a dataset 5 months ago
fnlp/hh-rlhf-strength-cleaned • Viewer • Updated Jan 31, 2024 • 168k • 311 • 23
liked a dataset 10 months ago
llamafactory/DPO-En-Zh-20k • Viewer • Updated Jun 7, 2024 • 20k • 295 • 90