ankner's Collections
Critique-out-Loud Reward Models
Updated Sep 5, 2024
Paper: https://arxiv.org/abs/2408.11791 | Code: https://github.com/zankner/CLoud
ankner/Llama3-8B-CLoud-RM • Updated Oct 16, 2024 • 26
ankner/Llama3-8B-Classic-RM • Updated Oct 17, 2024 • 22
ankner/Llama3-70B-CLoud-RM • Updated Oct 18, 2024 • 6 • 1
ankner/Llama3-70B-Classic-RM • Updated Oct 18, 2024 • 8
ankner/Llama3-8b-ultra-oracle • Updated Sep 5, 2024 • 124k • 61
ankner/Llama3-8b-ultra-self-gen-8b • Updated Sep 5, 2024 • 124k • 72
ankner/Llama3-8b-ultra-self-gen-70b • Updated Sep 5, 2024 • 124k • 71