Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
lv12
's Collections
Representation Learning
Preference Optimization
Information Retrieval
Preference Optimization
updated
18 days ago
x
Upvote
-
A Roadmap to Pluralistic Alignment
Paper
•
2402.05070
•
Published
Feb 7
Self-Rewarding Language Models
Paper
•
2401.10020
•
Published
Jan 18
•
135
SakanaAI/DiscoPOP-zephyr-7b-gemma
Text Generation
•
Updated
19 days ago
•
874
•
25
Upvote
-
Share collection
View history
Collection guide
Browse collections