TimeMobius committed
Commit deeb947 • 1 Parent(s): 6659471
Update README.md
README.md CHANGED
@@ -10,7 +10,7 @@ language:
 inference: false
 ---
 # Model Card for Mobius-12B-base-m1
-The Mobius-12B-base-m1 Large Language Model (LLM) is a pretrained model based on RWKV v5 arch. We use
+The Mobius-12B-base-m1 Large Language Model (LLM) is a pretrained model based on RWKV v5 arch. We use 0.01B tokens to post train this model.
 
 
 ## Warning
@@ -56,9 +56,9 @@ The Mobius base m1 is the base model can be easily fine-tuned to achieve compell
 | lambda ppl | 3.41 |
 | lambda | 0.72 |
 | piqa | 0.78 |
-| hellaswag
+| hellaswag 10 shots | 0.72 |
 | winogrande | 0.68 |
-| arc_challenge
+| arc_challenge 25shots | 0.47 |
 | arc_easy | 0.73 |
 | openbookqa | 0.40 |
 | sciq | 0.93 |
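For readers of the updated model card: a minimal inference sketch, assuming the checkpoint is run with the `rwkv` pip package (the ChatRWKV runtime) and its bundled world-tokenizer vocab. The checkpoint path, GPU strategy string, and sampling parameters below are illustrative assumptions, not taken from this commit.

```python
# Minimal sketch (not from this repo): load an RWKV v5 checkpoint with the `rwkv`
# pip package and generate a short continuation. Path, strategy, and sampling
# settings are illustrative assumptions.
from rwkv.model import RWKV
from rwkv.utils import PIPELINE, PIPELINE_ARGS

# Path to the downloaded .pth checkpoint (conventionally given without the extension
# in rwkv examples); "cuda fp16" keeps all layers on one GPU in half precision.
model = RWKV(model="Mobius-12B-base-m1", strategy="cuda fp16")

# World-tokenizer vocab bundled with the rwkv package; assumed here for this v5 model.
pipeline = PIPELINE(model, "rwkv_vocab_v20230424")

args = PIPELINE_ARGS(
    temperature=1.0,
    top_p=0.3,
    alpha_frequency=0.25,  # frequency penalty
    alpha_presence=0.25,   # presence penalty
    token_ban=[0],         # never sample token 0
    token_stop=[],         # no explicit stop tokens
)

prompt = "The Eiffel Tower is located in"
print(pipeline.generate(prompt, token_count=64, args=args))
```

A 12B checkpoint in fp16 needs roughly 24 GB of VRAM; if memory is tight, the rwkv runtime also accepts layered strategies such as `"cuda fp16 *20 -> cpu fp32"` to split layers between GPU and CPU.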