Update README.md
README.md CHANGED

@@ -20,6 +20,8 @@ TruthfulQA: 70.73
 Winogrande: 80.98
 GSM8K: 62.77
 
+# Edit 2:
+EDIT: Still waiting for the Open-LLM benchmark results to come back in, but...: According to the few tests I ran on it myself the new "CultriX/MistralTrix-SLERP" should beat this model at only 7.42B!
 # Edit/Disclaimer:
 Currently the #1 ranked 7B LLM on the LLM Leaderboards, woah!
 I did not expect that result at all and am in no way a professional when it comes to LLM's or computer science in general,
@@ -105,8 +107,4 @@ dpo_trainer = DPOTrainer(
 beta=0.1,
 max_prompt_length=1024,
 max_length=1536,
-)
-
-
-EDIT: Still waiting for the Open-LLM benchmark results to come back in, but...:
-According to the few tests I ran on it myself the new "CultriX/MistralTrix-SLERP" should beat this model at only 7.42B!
+)
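The second hunk trims the tail of a `DPOTrainer(...)` call configured with `beta=0.1`, `max_prompt_length=1024`, and `max_length=1536`. For context on what `beta` controls there, below is a minimal, dependency-free sketch of the per-preference-pair DPO loss that TRL's `DPOTrainer` optimizes. The function name and the example log-probabilities are illustrative, not taken from this repository; only the `beta=0.1` value comes from the diff above.

```python
import math

def dpo_loss(policy_chosen_logp, policy_rejected_logp,
             ref_chosen_logp, ref_rejected_logp, beta=0.1):
    """Direct Preference Optimization loss for one preference pair.

    Each argument is the summed log-probability of the chosen or
    rejected completion under the trained policy or the frozen
    reference model. beta=0.1 matches the DPOTrainer setting in
    the diff above; larger beta penalizes drift from the reference
    model more strongly.
    """
    chosen_logratio = policy_chosen_logp - ref_chosen_logp
    rejected_logratio = policy_rejected_logp - ref_rejected_logp
    logits = beta * (chosen_logratio - rejected_logratio)
    # Loss is -log(sigmoid(logits)); log1p form avoids overflow for
    # moderately negative logits, and the linear branch handles the rest.
    return math.log1p(math.exp(-logits)) if logits > -30 else -logits

# When the policy favors the chosen answer more than the reference
# does, the loss falls below log(2); when it favors the rejected
# answer instead, the loss rises above log(2).
low = dpo_loss(-5.0, -9.0, -6.0, -8.0)   # policy margin improved
high = dpo_loss(-9.0, -5.0, -8.0, -6.0)  # policy margin worsened
print(low < math.log(2) < high)  # True
```

At a neutral starting point (policy identical to the reference) the loss is exactly `log(2)`, so the gradient pushes the policy to widen the chosen-vs-rejected margin relative to the reference model.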