Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
Jixuan Leng
Sean123321
Follow
AI & ML interests
None yet
Recent Activity
updated
a collection
about 9 hours ago
VLM
updated
a collection
4 days ago
VLM
updated
a collection
6 days ago
VLM
View all activity
Organizations
Sean123321
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
updated
a collection
about 9 hours ago
VLM
Collection
6 items
•
Updated
about 9 hours ago
updated
a collection
4 days ago
VLM
Collection
6 items
•
Updated
about 9 hours ago
updated
a collection
6 days ago
VLM
Collection
6 items
•
Updated
about 9 hours ago
updated
a collection
12 days ago
VLM
Collection
6 items
•
Updated
about 9 hours ago
updated
a collection
about 2 months ago
VLM
Collection
6 items
•
Updated
about 9 hours ago
authored
a paper
2 months ago
Taming Overconfidence in LLMs: Reward Calibration in RLHF
Paper
•
2410.09724
•
Published
Oct 13
•
2
updated
6 models
2 months ago
HINT-lab/llama3-8b-dpo-v0.2
Text Generation
•
Updated
Oct 12
•
15
HINT-lab/llama3-8b-cdpo-v0.2
Text Generation
•
Updated
Oct 12
•
15
HINT-lab/mistral-7b-ppo-hermes-v0.3
Text Generation
•
Updated
Oct 12
•
9
•
1
HINT-lab/mistral-7b-ppo-clean-hermes
Text Generation
•
Updated
Oct 12
•
13
HINT-lab/llama3-8b-final-ppo-v0.3
Text Generation
•
Updated
Oct 12
•
14
HINT-lab/llama3-8b-final-ppo-clean-v0.1
Text Generation
•
Updated
Oct 12
•
67
updated
a dataset
2 months ago
HINT-lab/prompt-collections-final-v0.3
Viewer
•
Updated
Oct 11
•
20.5k
•
38
updated
a model
2 months ago
HINT-lab/mistral-7b-hermes-rm-skywork
Updated
Oct 11
•
2
updated
a model
3 months ago
HINT-lab/mistral-7b-hermes-dpo-v0.2
Text Generation
•
Updated
Oct 10
•
11
updated
a dataset
3 months ago
HINT-lab/calibration_preference_mixture_final-v0.1
Viewer
•
Updated
Oct 10
•
25.5k
•
44
updated
a model
3 months ago
HINT-lab/mistral-7b-hermes-cdpo-v0.2
Text Generation
•
Updated
Oct 10
•
13
Load more