Edit Models filters

Inference status

Misc

Inference Endpoints

AutoTrain Compatible

text-generation-inference

4-bit precision

8-bit precision

Misc with no match

text-embeddings-inference

Carbon Emissions

Mixture of Experts

Models

310

Full-text search

Active filters: rlhf

AdamG012/chat-opt-1.3b-rlhf-critic-deepspeed

Text Generation • Updated Apr 25, 2023 • 19 • 3

AdamG012/chat-opt-1.3b-rlhf-actor-ema-deepspeed

Text Generation • Updated Apr 25, 2023 • 12 • 8

sileod/mdeberta-v3-base-tasksource-nli

Zero-Shot Classification • Updated Oct 19, 2023 • 81 • 15

agi-css/socially-good-lm

Text Generation • Updated May 29, 2023 • 6 • 5

agi-css/hh-rlhf-sft

Text Generation • Updated Jun 1, 2023 • 7 • 3

agi-css/better-base

Text Generation • Updated Jun 1, 2023 • 5 • 5

argilla/roberta-base-reward-model-falcon-dolly

Text Classification • Updated Jun 16, 2023 • 23 • 4

merve/peft-copy-test

Text Generation • Updated Jun 14, 2023

lyogavin/Anima33B-DPO-Belle-1k

Text Generation • Updated Jul 2, 2023 • 1

lyogavin/Anima33B-DPO-Belle-1k-merged

Text Generation • Updated Jul 2, 2023 • 7 • 12

PKU-Alignment/beaver-7b-v1.0-reward

Reinforcement Learning • Updated Apr 20 • 247 • 16

PKU-Alignment/beaver-dam-7b

Updated Jul 10, 2023 • 529 • 6

Ablustrund/moss-rlhf-reward-model-7B-zh

Updated Jul 13, 2023 • 2 • 23

fnlp/moss-rlhf-reward-model-7B-en

Updated Jul 13, 2023 • 9

fnlp/moss-rlhf-sft-model-7B-en

Updated Jul 14, 2023 • 2

fnlp/moss-rlhf-policy-model-7B-en

Updated Jul 17, 2023 • 1

lightonai/alfred-40b-0723

Text Generation • Updated Aug 11, 2023 • 25 • 45

kashif/stack-llama-2

Text Generation • Updated Aug 8, 2023 • 1.48k • 15

barnybug/stack-llama-2-ggml

Updated Aug 10, 2023 • 2

vwxyzjn/starcoderbase-triviaqa

Text Generation • Updated Aug 29, 2023 • 7

lvwerra/starcoderbase-gsm8k

Text Generation • Updated Aug 30, 2023 • 11

ContextualAI/archangel_sft_pythia1-4b

Text Generation • Updated Jan 11 • 10

ContextualAI/archangel_sft_pythia2-8b

Text Generation • Updated Jan 11 • 14 • 1

ContextualAI/archangel_sft_pythia6-9b

Text Generation • Updated Jan 11 • 13

ContextualAI/archangel_sft_pythia12-0b

Text Generation • Updated Jan 11 • 8

ContextualAI/archangel_sft_llama7b

Text Generation • Updated Jan 11 • 973 • 1

ContextualAI/archangel_sft_llama13b

Text Generation • Updated Jan 11 • 39

ContextualAI/archangel_sft_llama30b

Text Generation • Updated Jan 11 • 47

ContextualAI/archangel_slic_llama30b

Text Generation • Updated Jan 11 • 10

ContextualAI/archangel_slic_pythia1-4b

Text Generation • Updated Jan 11 • 8