princeton-nlp's Collections
SimPO
Updated about 18 hours ago
This collection contains a list of SimPO and baseline models.
princeton-nlp/gemma-2-9b-it-SimPO
Text Generation
•
Updated
Aug 2
•
101k
•
118
princeton-nlp/gemma-2-9b-it-DPO
Text Generation
•
Updated
Jul 18
•
2.49k
•
5
princeton-nlp/Llama-3-Base-8B-SFT-IPO
Text Generation
•
Updated
Jun 17
•
2.54k
princeton-nlp/Llama-3-Base-8B-SFT-DPO
Text Generation
•
Updated
Jun 17
•
5.6k
princeton-nlp/Llama-3-Base-8B-SFT-KTO
Text Generation
•
Updated
Jun 17
•
5.01k
princeton-nlp/Llama-3-Base-8B-SFT-ORPO
Text Generation
•
Updated
Jun 17
•
5.01k
princeton-nlp/Llama-3-Base-8B-SFT-RDPO
Text Generation
•
Updated
Jun 17
•
5.38k
princeton-nlp/Llama-3-Base-8B-SFT-SimPO
Text Generation
•
Updated
May 24
•
2.91k
princeton-nlp/Llama-3-Base-8B-SFT
Text Generation
•
Updated
Jun 17
•
9.75k
•
1
princeton-nlp/Llama-3-Instruct-8B-SimPO
Text Generation
•
Updated
Jun 17
•
2.77k
•
55
princeton-nlp/Llama-3-Instruct-8B-IPO
Text Generation
•
Updated
Jun 17
•
2.44k
princeton-nlp/Llama-3-Instruct-8B-KTO
Text Generation
•
Updated
Jun 17
•
4.86k
princeton-nlp/Llama-3-Instruct-8B-ORPO
Text Generation
•
Updated
Jun 17
•
4.9k
princeton-nlp/Llama-3-Instruct-8B-RDPO
Text Generation
•
Updated
Jun 17
•
4.87k
princeton-nlp/Llama-3-Instruct-8B-DPO
Text Generation
•
Updated
Jun 17
•
4.98k
princeton-nlp/Mistral-7B-Instruct-RDPO
Text Generation
•
Updated
Jun 17
•
2.82k
princeton-nlp/Mistral-7B-Instruct-DPO
Text Generation
•
Updated
Jun 17
•
2.83k
princeton-nlp/Mistral-7B-Instruct-IPO
Text Generation
•
Updated
Jun 17
•
2.84k
princeton-nlp/Mistral-7B-Instruct-KTO
Text Generation
•
Updated
Jun 17
•
2.82k
princeton-nlp/Mistral-7B-Instruct-SimPO
Text Generation
•
Updated
Jun 17
•
2.87k
•
1
princeton-nlp/Mistral-7B-Instruct-ORPO
Text Generation
•
Updated
Jun 17
•
2.82k
princeton-nlp/Mistral-7B-Base-SFT-IPO
Text Generation
•
Updated
Jun 17
•
2.93k
princeton-nlp/Mistral-7B-Base-SFT-KTO
Text Generation
•
Updated
Jun 17
•
2.93k
princeton-nlp/Mistral-7B-Base-SFT-DPO
Text Generation
•
Updated
Jun 17
•
2.57k
princeton-nlp/Mistral-7B-Base-SFT-RDPO
Text Generation
•
Updated
Jun 17
•
2.94k
princeton-nlp/Mistral-7B-Base-SFT-SimPO
Text Generation
•
Updated
Jun 17
•
5.05k
princeton-nlp/llama3-ultrafeedback
Viewer
•
Updated
Jul 18
•
61.8k
•
468
•
15
princeton-nlp/Mistral-7B-Base-SFT-CPO
Text Generation
•
Updated
Sep 30
•
2.93k
princeton-nlp/Mistral-7B-Base-SFT-RRHF
Text Generation
•
Updated
Sep 30
•
2.96k
princeton-nlp/Mistral-7B-Base-SFT-SLiC-HF
Text Generation
•
Updated
Jul 7
•
2.91k
princeton-nlp/Mistral-7B-Instruct-CPO
Text Generation
•
Updated
Jul 7
•
2.8k
princeton-nlp/Mistral-7B-Instruct-RRHF
Text Generation
•
Updated
Jul 7
•
2.8k
princeton-nlp/Mistral-7B-Instruct-SLiC-HF
Text Generation
•
Updated
Jul 7
•
2.8k
princeton-nlp/Llama-3-Base-8B-SFT-CPO
Text Generation
•
Updated
Jul 7
•
5k
princeton-nlp/Llama-3-Base-8B-SFT-RRHF
Text Generation
•
Updated
Jul 7
•
2.44k
princeton-nlp/Llama-3-Base-8B-SFT-SLiC-HF
Text Generation
•
Updated
Jul 7
•
2.44k
princeton-nlp/Llama-3-Instruct-8B-CPO
Text Generation
•
Updated
Jul 7
•
4.88k
princeton-nlp/Llama-3-Instruct-8B-RRHF
Text Generation
•
Updated
Jul 7
•
2.44k
princeton-nlp/Llama-3-Instruct-8B-SLiC-HF
Text Generation
•
Updated
Jul 7
•
2.46k
princeton-nlp/Llama-3-Instruct-8B-RRHF-v0.2
Text Generation
•
Updated
Jul 7
•
2.44k
princeton-nlp/Llama-3-Instruct-8B-SLiC-HF-v0.2
Text Generation
•
Updated
Jul 7
•
2.47k
princeton-nlp/Llama-3-Instruct-8B-DPO-v0.2
Text Generation
•
Updated
Jul 7
•
4.94k
princeton-nlp/Llama-3-Instruct-8B-IPO-v0.2
Text Generation
•
Updated
Jul 7
•
2.48k
princeton-nlp/Llama-3-Instruct-8B-CPO-v0.2
Text Generation
•
Updated
Jul 7
•
4.91k
princeton-nlp/Llama-3-Instruct-8B-KTO-v0.2
Text Generation
•
Updated
Jul 7
•
4.91k
princeton-nlp/Llama-3-Instruct-8B-ORPO-v0.2
Text Generation
•
Updated
Jul 7
•
6.12k
princeton-nlp/Llama-3-Instruct-8B-RDPO-v0.2
Text Generation
•
Updated
Jul 7
•
2.44k
princeton-nlp/Llama-3-Instruct-8B-SimPO-v0.2
Text Generation
•
Updated
Jul 7
•
3.02k
•
5
princeton-nlp/llama3-ultrafeedback-armorm
Viewer
•
Updated
Jul 18
•
61.8k
•
788
•
14