Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
Edit Models filters
Tasks
Libraries
Datasets
Languages
Licenses
Other
1
Inference status
Reset Inference status
Warm
Cold
Frozen
Misc
Reset Misc
dpo
Inference Endpoints
AutoTrain Compatible
text-generation-inference
custom_code
4-bit precision
Merge
Eval Results
8-bit precision
Mixture of Experts
Carbon Emissions
Misc with no match
text-embeddings-inference
Apply filters
Models
3,293
Full-text search
Edit filters
Sort: Trending
Active filters:
dpo
Clear all
TTTXXX01/All_like_DPO_Chat-zephyr-7b-sft-full
Text Generation
•
Updated
Jun 11
•
4
martimfasantos/tinyllama-1.1b-sum-dpo-full_LR2e-7_3epochs_old
Text Generation
•
Updated
Jun 12
•
2
underactuated/mistral_dpo
Updated
Jun 24
skymizer/Llama2-7b-sft-chat-custom-template-dpo
Text Generation
•
Updated
Jun 11
•
4
TTTXXX01/zephyr-7b-DPO-full
Text Generation
•
Updated
Jun 12
•
3
UnbeT/dpo
Text2Text Generation
•
Updated
Jun 12
•
2
Minbyul/biomistral-7b-wo-kqa_golden-iter-dpo-step2
Text Generation
•
Updated
Jun 12
•
2
martimfasantos/tinyllama-1.1b-sum-dpo-full_LR1e-7_3epochs_old
Text Generation
•
Updated
Jun 14
•
3
VAGOsolutions/SauerkrautLM-1.5b
Text Generation
•
Updated
Jun 13
•
373
•
11
1t4chi/zephyr-7b-DPOBS128-full
Text Generation
•
Updated
Jun 13
•
3
TTTXXX01/zephyr-7b-DPOBS128-full
Text Generation
•
Updated
Jun 13
•
2
nvidia/Llama3-70B-DPO-Chat
Updated
Jun 14
•
11
•
2
AmberYifan/dpo-v-trans
Text Generation
•
Updated
Jun 13
•
5
•
1
NikolayKozloff/SauerkrautLM-1.5b-Q4_0-GGUF
Updated
Jun 13
•
9
•
1
NikolayKozloff/SauerkrautLM-1.5b-Q5_0-GGUF
Updated
Jun 13
•
9
•
1
mradermacher/SauerkrautLM-1.5b-GGUF
Updated
Jun 13
•
214
mradermacher/SauerkrautLM-1.5b-i1-GGUF
Updated
Aug 2
•
99
TTTXXX01/zephyr-7b-DPOBS48-full
Text Generation
•
Updated
Jun 13
•
6
wyan/NeuralDaredevil-8B-abliterated-Q4_K_M-GGUF
Updated
Jun 13
•
7
wyan/NeuralDaredevil-8B-abliterated-Q8_0-GGUF
Updated
Jun 13
•
14
mradermacher/zephyr-7b-DPOBS128-full-GGUF
Updated
Jun 14
•
42
TTTXXX01/All_like128-zephyr-7b-sft-full
Text Generation
•
Updated
Jun 14
•
6
narekvslife/quantized
Updated
Jun 14
TTTXXX01/All_like48-zephyr-7b-sft-full
Text Generation
•
Updated
Jun 14
•
4
QuantFactory/notus-7b-v1-GGUF
Text Generation
•
Updated
Jun 18
•
156
TTTXXX01/IL_DPO48-zephyr-7b-sft-full
Text Generation
•
Updated
Jun 14
•
4
martimfasantos/tinyllama-1.1b-sum-dpo-full_LR1e-7_2epochs_old
Text Generation
•
Updated
Jun 15
•
5
TTTXXX01/IL_DPO128-zephyr-7b-sft-full
Text Generation
•
Updated
Jun 14
•
5
chrisswillss98/dpo_mcqa_quantizedBitsAndBytes
Updated
Jun 14
chrisswillss98/dpo_mcqa_v1.01
Text Generation
•
Updated
Jun 14
•
2
Previous
1
...
74
75
76
77
78
...
110
Next