-
-
-
-
-
-
Inference status
Active filters:
rlhf
AdamG012/chat-opt-1.3b-rlhf-critic-deepspeed
Text Generation
•
Updated
•
19
•
3
AdamG012/chat-opt-1.3b-rlhf-actor-ema-deepspeed
Text Generation
•
Updated
•
12
•
8
sileod/mdeberta-v3-base-tasksource-nli
Zero-Shot Classification
•
Updated
•
81
•
15
agi-css/socially-good-lm
Text Generation
•
Updated
•
6
•
5
agi-css/hh-rlhf-sft
Text Generation
•
Updated
•
7
•
3
agi-css/better-base
Text Generation
•
Updated
•
5
•
5
argilla/roberta-base-reward-model-falcon-dolly
Text Classification
•
Updated
•
23
•
4
merve/peft-copy-test
Text Generation
•
Updated
lyogavin/Anima33B-DPO-Belle-1k
Text Generation
•
Updated
•
1
lyogavin/Anima33B-DPO-Belle-1k-merged
Text Generation
•
Updated
•
7
•
12
PKU-Alignment/beaver-7b-v1.0-reward
Reinforcement Learning
•
Updated
•
247
•
16
PKU-Alignment/beaver-dam-7b
Updated
•
529
•
6
Ablustrund/moss-rlhf-reward-model-7B-zh
Updated
•
2
•
23
fnlp/moss-rlhf-reward-model-7B-en
fnlp/moss-rlhf-sft-model-7B-en
fnlp/moss-rlhf-policy-model-7B-en
lightonai/alfred-40b-0723
Text Generation
•
Updated
•
25
•
45
kashif/stack-llama-2
Text Generation
•
Updated
•
1.48k
•
15
barnybug/stack-llama-2-ggml
vwxyzjn/starcoderbase-triviaqa
Text Generation
•
Updated
•
7
lvwerra/starcoderbase-gsm8k
Text Generation
•
Updated
•
11
ContextualAI/archangel_sft_pythia1-4b
Text Generation
•
Updated
•
10
ContextualAI/archangel_sft_pythia2-8b
Text Generation
•
Updated
•
14
•
1
ContextualAI/archangel_sft_pythia6-9b
Text Generation
•
Updated
•
13
ContextualAI/archangel_sft_pythia12-0b
Text Generation
•
Updated
•
8
ContextualAI/archangel_sft_llama7b
Text Generation
•
Updated
•
973
•
1
ContextualAI/archangel_sft_llama13b
Text Generation
•
Updated
•
39
ContextualAI/archangel_sft_llama30b
Text Generation
•
Updated
•
47
ContextualAI/archangel_slic_llama30b
Text Generation
•
Updated
•
10
ContextualAI/archangel_slic_pythia1-4b
Text Generation
•
Updated
•
8