Mingyu Chen
MYC081
AI & ML interests
theory
Recent Activity
Upvoted a paper 27 days ago: Exploratory Preference Optimization: Harnessing Implicit Q*-Approximation for Sample-Efficient RLHF
Updated a model about 2 months ago: MYC081/Qwen2.5-3B-WPO-bf16-1
Updated a model about 2 months ago: MYC081/Qwen2-0.5B-WPO-bf16-1
Organizations
None yet
Models (8)
MYC081/Qwen2.5-3B-WPO-bf16-1 • Text Generation • Updated Nov 15, 2024 • 18
MYC081/Qwen2.5-3B-WPO-bf16-1-test • Updated Nov 14, 2024
MYC081/Qwen2.5-1.5B-WPO-bf16-1 • Updated Nov 14, 2024
MYC081/Qwen2-0.5B-WPO-bf16-1 • Updated Nov 14, 2024 • 8
MYC081/pythia-1b-tldr-xpo • Updated Nov 13, 2024 • 11
MYC081/pythia-6.9b-deduped-tldr-online-dpo • Updated Nov 11, 2024
MYC081/Qwen2.5-0.5B-Online-DPO-PairRM • Updated Nov 5, 2024
MYC081/pythia-2.8b-deduped-tldr-online-dpo • Updated Nov 5, 2024
Datasets
None public yet