Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
Edit Models filters
Tasks
Libraries
Datasets
Languages
Licenses
Other
1
Inference status
Reset Inference status
Warm
Cold
Frozen
Misc
Reset Misc
dpo
Inference Endpoints
AutoTrain Compatible
text-generation-inference
custom_code
4-bit precision
Eval Results
Merge
8-bit precision
Mixture of Experts
Carbon Emissions
Misc with no match
text-embeddings-inference
Apply filters
Models
3,800
Full-text search
Edit filters
Sort: Trending
Active filters:
dpo
Clear all
mradermacher/Lama-DPOlphin-8B-GGUF
Updated
Sep 4
•
312
•
1
tsavage68/Na_L3_1000steps_1e7rate_01beta_cSFTDPO
Text Generation
•
Updated
Sep 4
•
5
tsavage68/Na_L3_1000steps_1e7rate_03beta_cSFTDPO
Text Generation
•
Updated
Sep 4
•
5
tsavage68/Na_L3_350steps_1e7rate_01beta_cSFTDPO
Text Generation
•
Updated
Sep 4
•
7
tsavage68/Na_L3_250steps_1e7rate_03beta_cSFTDPO
Text Generation
•
Updated
Sep 4
•
7
tsavage68/Na_L3_1000steps_1e7rate_05beta_cSFTDPO
Text Generation
•
Updated
Sep 4
•
5
tsavage68/Na_L3_350steps_1e7rate_05beta_cSFTDPO
Text Generation
•
Updated
Sep 4
•
5
mradermacher/Lama-DPOlphin-8B-i1-GGUF
Updated
Sep 4
•
990
•
1
tsavage68/Na_M2_1000steps_1e7rate_01beta_cSFTDPO
Text Generation
•
Updated
Sep 4
•
9
tsavage68/Na_M2_1000steps_1e7rate_03beta_cSFTDPO
Text Generation
•
Updated
Sep 4
•
6
tsavage68/Na_M2_1000steps_1e7rate_05beta_cSFTDPO
Text Generation
•
Updated
Sep 4
•
5
tsavage68/Na_M2_200steps_1e6rate_01beta_cSFTDPO
Text Generation
•
Updated
Sep 5
•
7
tsavage68/Na_M2_100steps_1e7rate_03beta_cSFTDPO
Text Generation
•
Updated
Sep 4
•
5
SongTonyLi/SFT_D1chosenThenDPO_D2a_Instruct_argilla_math_results
Text Generation
•
Updated
Sep 4
•
9
Jatin313/tiny-chatbot-dpo
Updated
Sep 4
•
1
NicholasCorrado/zephyr-7b-uf-dpo-2e
Text Generation
•
Updated
Sep 10
•
5
bartowski/TwinLlama-3.1-8B-DPO3-GGUF
Text Generation
•
Updated
Sep 5
•
338
nomadrp/tq-aya101-6langs
Updated
Sep 5
•
4
NicholasCorrado/rlced-conifer-zephyr-7b-dpo-2e
Text Generation
•
Updated
Sep 8
•
5
tsavage68/Na_M2_1000steps_1e8rate_03beta_cSFTDPO
Text Generation
•
Updated
Sep 5
•
5
tsavage68/Na_M2_1000steps_1e6rate_05beta_cSFTDPO
Text Generation
•
Updated
Sep 5
•
5
tsavage68/Na_M2_1000steps_1e8rate_01beta_cSFTDPO
Text Generation
•
Updated
Sep 5
•
5
tsavage68/Na_M2_350steps_1e8rate_03beta_cSFTDPO
Text Generation
•
Updated
Sep 5
•
5
tsavage68/Na_M2_1000steps_1e8rate_05beta_cSFTDPO
Text Generation
•
Updated
Sep 5
•
5
tsavage68/Na_M2_300steps_1e8rate_01beta_cSFTDPO
Text Generation
•
Updated
Sep 5
•
5
NicholasCorrado/zephyr-7b-uf-rlced-conifer-group-dpo-2e
Text Generation
•
Updated
Sep 7
•
5
KoNqUeRoR3891/HW2-dpo
Text Generation
•
Updated
Sep 6
•
6
nomadrp/tq-aya101-gt2
Updated
Sep 6
•
5
nomadrp/tq-llama3.1-gt3
Updated
Sep 6
•
4
NicholasCorrado/zephyr-7b-uf-rlced-conifer-1e2e-group-dpo-2e
Text Generation
•
Updated
Sep 7
•
12
Previous
1
...
96
97
98
99
100
Next