Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
Edit Models filters
Tasks
Libraries
Datasets
Languages
Licenses
Other
1
Inference status
Reset Inference status
Warm
Cold
Frozen
Misc
Reset Misc
trl
Inference Endpoints
text-generation-inference
AutoTrain Compatible
4-bit precision
custom_code
Eval Results
8-bit precision
Merge
Mixture of Experts
Misc with no match
text-embeddings-inference
Carbon Emissions
Apply filters
Models
34,502
Full-text search
Edit filters
Sort: Trending
Active filters:
trl
Clear all
ichetandhembre/chat_lora_unsloth
Updated
Mar 18
santhoshmlops/gemma-2b-unsloth-SFT
Updated
Mar 18
SleepyGorilla/mistral-finetune
Updated
Mar 18
•
8
rjindal/rohit-rlhf_demo
Text2Text Generation
•
Updated
Mar 18
•
3
crimsonjoo/Luxia-21.4B-16bit
Text Generation
•
Updated
Mar 18
•
8
pmarmik/bolna-lead-qualification
Text Generation
•
Updated
Mar 18
•
3
•
3
SleepyGorilla/test2
Updated
Mar 18
shivam9980/mistral-news-7B-cnn
Updated
Mar 18
ogdanneedham/mistral-sf-0.1-lora
Updated
Mar 18
shenxq/zephyr-7b-dpo-lora-pairrm
Updated
Mar 19
•
19
a-h-m-e-d/mistral_7b_v2_finetuned-spectral_rules
Text Generation
•
Updated
Mar 18
•
3
shenxq/zephyr-7b-dpo-qlora-pairrm
Updated
Mar 19
•
4
shivam9980/mistral-7b-news-cnn-merged
Text2Text Generation
•
Updated
Sep 15
AshJem/phi-1_5-finetuned-dialogstudio
Updated
Mar 18
lole25/phi-2-gpo-lora-ultrafeedback-test
Updated
Mar 18
•
8
lole25/phi-2-gpo-lora-ultrafeedback-test-1
Updated
Mar 18
•
1
RayBoustany/phi-2-role-play
Updated
Mar 18
•
2
shivam9980/mistral-news-7B-inshorts-hindi
Updated
Mar 18
shivam9980/mistral-7b-news-inshorts-merged-hindi
Updated
Mar 18
ftuncc/Llama-2-7b-doctorchat-tr-finetune
Text Generation
•
Updated
Mar 18
santhoshmlops/gemma-2b-it-Unsloth-SFT
Updated
Mar 18
•
2
kazuma313/gemma-dokter-ft2
Updated
Mar 27
•
41
AvizCICD/ncp-base-mix
Text Generation
•
Updated
Mar 18
•
2
lole25/phi-2-gpo-test-iter-2
Updated
Mar 18
unslothai/lora_model
Updated
Mar 18
unslothai/model
Text Generation
•
Updated
Mar 18
•
12
danielhanchen/model
Text Generation
•
Updated
Jun 19
•
7
lole25/phi-2-gpo-test-iter-0
Updated
Mar 18
•
5
ENERGY-DRINK-LOVE/deepnoid_DPOv3
Text Generation
•
Updated
Mar 18
•
3.75k
lole25/phi-2-gpo-test-iter-1
Updated
Mar 18
•
2
Previous
1
...
97
98
99
100
Next