Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
Edit Models filters
Tasks
Libraries
Datasets
Languages
Licenses
Other
1
Inference status
Reset Inference status
Warm
Cold
Frozen
Misc
Reset Misc
Inference Endpoints
AutoTrain Compatible
text-generation-inference
custom_code
4-bit precision
Eval Results
dataset:HuggingFaceH4/ultrafeedback_binarized
Merge
8-bit precision
Mixture of Experts
Misc with no match
text-embeddings-inference
Carbon Emissions
Apply filters
Models
999
Full-text search
Edit filters
Sort: Trending
Active filters:
HuggingFaceH4/ultrafeedback_binarized
Clear all
Minbyul/meditron-7b-dpo-full-sft-wo-kqa_golden
Text Generation
•
Updated
Apr 30
•
10
Minbyul/selfbiorag-7b-dpo-full-sft-wo-kqa_golden
Text Generation
•
Updated
Apr 30
•
5
Minbyul/biomistral-7b-dpo-full-sft-wo-kqa_silver_wogold
Text Generation
•
Updated
Apr 30
mradermacher/meditron-7b-dpo-full-sft-wo-kqa_golden-GGUF
Updated
May 5
•
41
Minbyul/mistral-7b-dpo-full-sft-wo-kqa_silver_wogold
Text Generation
•
Updated
Apr 30
•
10
weqweasdas/zephyr-7b-dpo-full
Text Generation
•
Updated
May 3
•
17
Minbyul/llama2-7b-dpo-full-sft-wo-kqa_silver_wogold
Text Generation
•
Updated
Apr 30
•
11
Minbyul/meditron-7b-dpo-full-sft-wo-kqa_silver_wogold
Text Generation
•
Updated
Apr 30
•
12
Minbyul/selfbiorag-7b-dpo-full-sft-wo-kqa_silver_wogold
Text Generation
•
Updated
Apr 30
•
5
DUAL-GPO/zephyr-7b-gpo-v0-i1
Updated
May 3
ShenaoZ/0.0001_withdpo_3iters_bs256_531lr_iter_1
Text Generation
•
Updated
May 2
•
9
ShenaoZ/0.0001_withdpo_3iters_bs256_511lr_iter_1
Text Generation
•
Updated
May 2
•
9
ShenaoZ/0.0001_withdpo_3iters_bs256_551lr_iter_1
Text Generation
•
Updated
May 3
•
7
DUAL-GPO/phi-2-gpo-renew2-b0.001-vllm-i1
Updated
May 3
•
9
newsletter/zephyr-7b-beta-Q6_K-GGUF
Text Generation
•
Updated
Aug 17
•
25
•
1
DUAL-GPO/zephyr-7b-gpo-log-i1
Updated
May 4
ShenaoZ/0.0001_withdpo_3iters_bs256_551lr_misit_iter_1
Text Generation
•
Updated
May 3
ShenaoZ/0.001_withdpo_4iters_bs256_5102lr_misit_iter_1
Text Generation
•
Updated
May 4
ShenaoZ/0.01_withdpo_4iters_bs256_5102lr_misit_iter_1
Text Generation
•
Updated
May 3
ShenaoZ/0.0_withdpo_4iters_bs256_5102lr_misit_iter_1
Text Generation
•
Updated
May 3
ShenaoZ/0.0001_withdpo_4iters_bs256_5102lr_misit_correct_iter_1
Text Generation
•
Updated
May 4
fenguhao/zephyr-7b-dpo-full
Text Generation
•
Updated
13 days ago
•
6
DUAL-GPO/zephyr-7b-gpo-v1-i0
Updated
May 5
DUAL-GPO/zephyr-7b-gpo-log-v1-i0
Updated
May 5
•
4
DUAL-GPO/phi-2-gpo-renew2-b0.001-0.5ultrafeedback-i1
Updated
May 5
•
7
ShenaoZ/0.0005_withdpo_4iters_bs256_555lr_iter_1
Text Generation
•
Updated
May 5
•
2
DUAL-GPO/phi-2-gpo-renew2-b0.001-0.5ultrafeedback-lowLr-i1
Updated
May 5
•
2
ShenaoZ/0.0001_withdpo_4iters_bs256_3333lr_iter_1
Text Generation
•
Updated
May 5
ShenaoZ/0.0001_withdpo_4iters_bs256_4444lr_iter_1
Text Generation
•
Updated
May 5
ShenaoZ/0.00005_withdpo_4iters_bs256_555lr_iter_1
Text Generation
•
Updated
May 5
Previous
1
...
13
14
15
16
17
...
34
Next