Hugging Face
Models
999
Sort: Trending
Active filters:
HuggingFaceH4/ultrafeedback_binarized
DUAL-GPO-2/phi-2-ipo-renew1
Updated
Apr 19
•
11
DUAL-GPO/zephyr-7b-gpo-log-i0
Updated
Apr 21
DavidAU/juanako-7b-UNA-Q6_K-GGUF
Updated
Apr 19
•
3
just1nseo/zephyr-7b-dpo-full-accumulation4
Text Generation
•
Updated
Apr 20
DavidAU/TinyLlama-1.1B-Remix-V.2-Q8_0-GGUF
Updated
Apr 20
•
2
ShenaoZ/0.001_ablation_declr_4iters_iter_1
Text Generation
•
Updated
Apr 20
DUAL-GPO-2/zephyr-7b-gpo-log-i0
Updated
Apr 24
•
12
skuma307/OrpoLlama3-8B-FT
Text Generation
•
Updated
Apr 21
•
9
ShenaoZ/0.001_ablation_4iters_bs256_iter_1
Text Generation
•
Updated
Apr 21
ShenaoZ/0.001_ablation_5iters_bs256_iter_1
Text Generation
•
Updated
Apr 21
•
2
DUAL-GPO/phi-2-gpo-renew2-b0.001-i1
Updated
Apr 22
•
4
ShenaoZ/0.0001_ablation_4iters_bs256_iter_1
Text Generation
•
Updated
Apr 21
•
5
DUAL-GPO-2/phi-2-gpo-renew2-b0.001-v2-i1
Updated
Apr 22
•
4
DUAL-GPO/phi-2-gpo-renew2-b0.001-extra-i1
Updated
Apr 23
•
5
DUAL-GPO/phi-2-gpo-renew2-b0.001-log-i0
Updated
Apr 23
•
5
ShenaoZ/0.001_ablation_4iters_bs512_iter_1
Text Generation
•
Updated
Apr 22
•
3
ShenaoZ/0.01_ablation_4iters_bs256_iter_1
Text Generation
•
Updated
Apr 22
ShenaoZ/0.001_ablation_4iters_bs256_declr_iter_1
Text Generation
•
Updated
Apr 22
•
3
ShenaoZ/0.001_ablation_4iters_bs256_decalpha_iter_1
Text Generation
•
Updated
Apr 22
•
2
DUAL-GPO/phi-2-gpo-renew2-b0.01-log-i0
Updated
Apr 23
•
1
ShenaoZ/0.001_ablation_4iters_bs256_sample2_iter_1
Text Generation
•
Updated
Apr 23
•
2
martimfasantos/tinyllama-1.1b-chat-dpo-qlora
Updated
Apr 24
•
4
DUAL-GPO-2/phi-2-gpo-renew2-b0.001-extra-v2-i1
Updated
Apr 24
•
5
bwuzhang/test_5
Text Generation
•
Updated
Apr 24
HabaAndrei/model_tiny_llama
Text Generation
•
Updated
Apr 24
ShenaoZhang/0.001_ablation_5iters_bs128_iter_1
Text Generation
•
Updated
Apr 24
•
1
ShenaoZ/0.001_ablation_5iters_bs256_useresponse_iter_1
Text Generation
•
Updated
Apr 25
junweiliao/zephyr-7b-dpo-qlora
Updated
Apr 26
•
2
Beanpow/zephyr-7b-dpo-full
Text Generation
•
Updated
Apr 26
lole25/zephyr-7b-gpo-gen-i1
Updated
Apr 25
•
4