Hugging Face
lv12's Collections
Representation Learning
Preference Optimization
Information Retrieval
Preference Optimization
Updated Jun 14
Upvote
1
A Roadmap to Pluralistic Alignment
Paper • 2402.05070 • Published Feb 7
Self-Rewarding Language Models
Paper • 2401.10020 • Published Jan 18 • 145
SakanaAI/DiscoPOP-zephyr-7b-gemma
Text Generation • Updated Jun 13 • 6.84k • 36