Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
Edit Models filters
Tasks
Libraries
Datasets
Languages
Licenses
Other
1
Inference status
Reset Inference status
Warm
Cold
Frozen
Misc
Reset Misc
dpo
Inference Endpoints
AutoTrain Compatible
text-generation-inference
custom_code
4-bit precision
Merge
Eval Results
8-bit precision
Mixture of Experts
Carbon Emissions
Misc with no match
text-embeddings-inference
Apply filters
Models
3,298
Full-text search
Edit filters
Sort: Trending
Active filters:
dpo
Clear all
TTTXXX01/DPO_shift2-Negative-zephyr-7b-sft-full
Text Generation
•
Updated
Jun 8
•
4
skymizer/llama2-7b-sft-chat-no-template-dpo
Text Generation
•
Updated
Jun 8
•
4
TTTXXX01/DPO005-Baseline-zephyr-7b-sft-full
Text Generation
•
Updated
Jun 8
•
5
martimfasantos/tinyllama-1.1b-sum-dpo-full_LR2e-7_3epochs
Text Generation
•
Updated
Jun 9
•
5
mradermacher/Phoenix-i1-GGUF
Updated
Aug 2
•
79
TTTXXX01/IL_DPOAll-zephyr-7b-sft-full
Text Generation
•
Updated
Jun 8
•
5
ksw1/DPO-3-1k
Text Generation
•
Updated
Jun 9
•
3
TTTXXX01/DPOAll-zephyr-7b-sft-full
Text Generation
•
Updated
Jun 9
•
5
ksw1/DPO-3-1k-checkpoints
Text Generation
•
Updated
Jun 8
•
4
TTTXXX01/IL_BERAll-zephyr-7b-sft-full
Text Generation
•
Updated
Jun 9
•
4
TTTXXX01/IL_LSAll-7b-sft-full
Text Generation
•
Updated
Jun 9
•
3
mlxha/mnlp-openaint-phi3-mini-mcq-dpo
Text Generation
•
Updated
Jun 9
•
8
TTTXXX01/IL_SquareAll-7b-sft-full
Text Generation
•
Updated
Jun 9
•
2
TTTXXX01/IL_ExponentialAll-7b-sft-full
Text Generation
•
Updated
Jun 9
•
5
martimfasantos/tinyllama-1.1b-sum-dpo-full_LR1e-7_3epochs
Text Generation
•
Updated
Jun 10
•
7
TTTXXX01/IL_BrierAll-7b-sft-full
Text Generation
•
Updated
Jun 15
•
3
kyryl-opens-ml/doplhin-dpo
Updated
Jun 10
•
2
ywangmy/ma-plus-gemini-f20k-full-z3-0.08-2e-6
Text Generation
•
Updated
Jun 10
TTTXXX01/IL_DPO_ChatALL-zephyr-7b-sft-full
Text Generation
•
Updated
Jun 10
•
4
ywangmy/ma-plus-mixed-all-full-z3-0.08-2e-6
Text Generation
•
Updated
Jun 10
tanliboy/zephyr-qwen2-7b-dpo
Text Generation
•
Updated
Jun 20
•
48
tsavage68/UTI2_L3_50steps_1e6rate_03beta_CSFTDPO
Text Generation
•
Updated
Jun 10
•
4
mNLP-project/gpt2-dpo-from_base_gpt2
Text Generation
•
Updated
Jun 10
•
10
tsavage68/UTI2_M2_1000steps_1e7rate_CSFTDPO
Text Generation
•
Updated
Jun 10
•
5
tsavage68/UTI2_L3_250steps_1e7rate_05beta_CSFTDPO
Text Generation
•
Updated
Jun 10
•
2
tsavage68/UTI2_L3_1000steps_1e8rate_05beta_CSFTDPO
Text Generation
•
Updated
Jun 10
•
4
haidermasood99/openhermes-mistral-dpo-gptq
Updated
Jun 10
tsavage68/UTI2_L3_50steps_1e6rate_05beta_CSFTDPO
Text Generation
•
Updated
Jun 10
•
2
kyryl-opens-ml/doplhin-dpo-1-epoch
Updated
Jun 11
TTTXXX01/DPO_Chat-zephyr-7b-sft-full
Text Generation
•
Updated
Jun 11
•
5
Previous
1
...
73
74
75
76
77
...
110
Next