-
-
-
-
-
-
Inference Providers
Active filters:
dpo
tsavage68/Na_L3_100steps_1e6rate_03beta_cSFTDPO
Text Generation
•
Updated
•
5
NicholasCorrado/zephyr-7b-uf-rlced-conifer-dpo-2e
Text Generation
•
Updated
•
6
tsavage68/Na_L3_1000steps_1e6rate_05beta_cSFTDPO
Text Generation
•
Updated
•
6
tsavage68/Na_L3_100steps_1e6rate_05beta_cSFTDPO
Text Generation
•
Updated
•
5
CultriX/Lama-DPOlphin-8B-Q3_K_S-GGUF
Text Generation
•
Updated
•
8
•
1
CultriX/Lama-DPOlphin-8B-Q4_K_S-GGUF
Text Generation
•
Updated
•
7
•
1
QuantFactory/Fireball-3.1-8B-ORPO-GGUF
Text Generation
•
Updated
•
33
•
2
mradermacher/Lama-DPOlphin-8B-GGUF
Updated
•
181
•
1
tsavage68/Na_L3_1000steps_1e7rate_01beta_cSFTDPO
Text Generation
•
Updated
•
5
tsavage68/Na_L3_1000steps_1e7rate_03beta_cSFTDPO
Text Generation
•
Updated
•
5
tsavage68/Na_L3_350steps_1e7rate_01beta_cSFTDPO
Text Generation
•
Updated
•
8
tsavage68/Na_L3_250steps_1e7rate_03beta_cSFTDPO
Text Generation
•
Updated
•
5
tsavage68/Na_L3_1000steps_1e7rate_05beta_cSFTDPO
Text Generation
•
Updated
•
5
tsavage68/Na_L3_350steps_1e7rate_05beta_cSFTDPO
Text Generation
•
Updated
•
5
mradermacher/Lama-DPOlphin-8B-i1-GGUF
Updated
•
278
•
1
tsavage68/Na_M2_1000steps_1e7rate_01beta_cSFTDPO
Text Generation
•
Updated
•
5
tsavage68/Na_M2_1000steps_1e7rate_03beta_cSFTDPO
Text Generation
•
Updated
•
5
tsavage68/Na_M2_1000steps_1e7rate_05beta_cSFTDPO
Text Generation
•
Updated
•
5
tsavage68/Na_M2_200steps_1e6rate_01beta_cSFTDPO
Text Generation
•
Updated
•
5
tsavage68/Na_M2_100steps_1e7rate_03beta_cSFTDPO
Text Generation
•
Updated
•
5
SongTonyLi/SFT_D1chosenThenDPO_D2a_Instruct_argilla_math_results
Text Generation
•
Updated
•
5
Jatin313/tiny-chatbot-dpo
NicholasCorrado/zephyr-7b-uf-dpo-2e
Text Generation
•
Updated
•
23
bartowski/TwinLlama-3.1-8B-DPO3-GGUF
Text Generation
•
Updated
•
54
nomadrp/tq-aya101-6langs
NicholasCorrado/rlced-conifer-zephyr-7b-dpo-2e
Text Generation
•
Updated
•
24
tsavage68/Na_M2_1000steps_1e8rate_03beta_cSFTDPO
Text Generation
•
Updated
•
4
tsavage68/Na_M2_1000steps_1e6rate_05beta_cSFTDPO
Text Generation
•
Updated
•
5
tsavage68/Na_M2_1000steps_1e8rate_01beta_cSFTDPO
Text Generation
•
Updated
•
5
tsavage68/Na_M2_350steps_1e8rate_03beta_cSFTDPO
Text Generation
•
Updated
•
5