chujiezheng/Llama-3-Instruct-8B-SimPO-ExPO

chujiezheng committed on May 26

Commit c5b0116, 1 parent: 5b49826

Update README.md

Files changed (1)

README.md +1 -1

@@ -4,7 +4,7 @@ language:
 license: llama3
 ---
-# LLaMA3-iterative-DPO-final-ExPO
+# Llama-3-Instruct-8B-SimPO-ExPO
 The extrapolated (ExPO) model based on [`princeton-nlp/Mistral-7B-Instruct-SimPO`](https://huggingface.co/RLHFlow/LLaMA3-iterative-DPO-final) and [`meta-llama/Meta-Llama-3-8B-Instruct`](https://huggingface.co/meta-llama/Meta-Llama-3-8B-Instruct), as in the "[Weak-to-Strong Extrapolation Expedites Alignment](https://arxiv.org/abs/2404.16792)" paper.
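
For readers unfamiliar with the term "extrapolated", the following is a minimal sketch of the idea, assuming the update is a linear step from the aligned checkpoint away from the SFT checkpoint (aligned + alpha * (aligned - sft)), as I understand the cited paper. The function name `extrapolate`, the `alpha` value, and the toy weights are hypothetical and are not taken from this repository.

```python
from typing import Dict, List

# Toy stand-in for a checkpoint: parameter name -> flat list of values.
Weights = Dict[str, List[float]]


def extrapolate(sft: Weights, aligned: Weights, alpha: float) -> Weights:
    """Return aligned + alpha * (aligned - sft) for every parameter.

    Assumption: this linear rule is only an illustration of weight
    extrapolation; it is not the code used to build the released model.
    """
    if sft.keys() != aligned.keys():
        raise ValueError("checkpoints must share the same parameter names")
    result: Weights = {}
    for name, aligned_values in aligned.items():
        sft_values = sft[name]
        if len(sft_values) != len(aligned_values):
            raise ValueError(f"shape mismatch for parameter {name!r}")
        # Move further along the direction from the SFT weights to the aligned weights.
        result[name] = [
            a + alpha * (a - s) for a, s in zip(aligned_values, sft_values)
        ]
    return result


# Hypothetical toy values, chosen only to make the arithmetic easy to follow.
sft_weights = {"layer.weight": [1.0, 2.0]}
aligned_weights = {"layer.weight": [1.5, 1.0]}
expo_weights = extrapolate(sft_weights, aligned_weights, alpha=0.5)
print(expo_weights)
```

With `alpha=0` the function returns the aligned weights unchanged, so `alpha` controls how far past the aligned checkpoint the result moves.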