Update README.md
README.md
CHANGED
@@ -17,15 +17,12 @@ note that this is a zero-shot setting as opposed to open llm leaderboard's few-
 Model            | ARC_C  | HellaSwag | PIQA   | Winogrande | Average |
 palmer-001       | 0.2807 | 0.5524    | 0.7106 | 0.5896     | 0.5333  |
 palmer-003-turbo | 0.3106 | 0.5806    | 0.7247 | 0.5951     | 0.5527  |
-p-003-turbo-2401 |
+p-003-turbo-2401 | 0.3114 | ~~~~~~    | 0.7258 | 0.5959     | ~~~~~~  | (this)
 palmer-002       | 0.3242 | 0.5956    | 0.7345 | 0.5888     | 0.5607  |
 ```
 
 This model is as good as tinyllama base while being half the size.
 
-### training 🦾
-Training took 1.5 rtx 2060 gpu hours. It was trained on 15,000 shuffled gpt-4 samples. palmer was fine-tuned using lower learning rates, ensuring it keeps as much general knowledge as possible.
-
 ### prompt
 ```
 no prompt
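For reference, zero-shot scores like the ones in the table above can be reproduced with EleutherAI's lm-evaluation-harness; a minimal sketch, assuming the `lm_eval` Python API (v0.4+) and a hypothetical repo id:

```python
# Minimal sketch: zero-shot eval on the four benchmarks from the table.
# Assumes lm-evaluation-harness v0.4+; the repo id below is hypothetical.
import lm_eval

results = lm_eval.simple_evaluate(
    model="hf",
    model_args="pretrained=appvoid/palmer-003-turbo-2401",  # hypothetical id
    tasks=["arc_challenge", "hellaswag", "piqa", "winogrande"],
    num_fewshot=0,  # zero-shot, unlike the open llm leaderboard's few-shot setup
)
print(results["results"])
```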
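And since there is no prompt format, inference is plain text completion; a minimal sketch with transformers (the repo id is again hypothetical):

```python
# Minimal sketch: no prompt template, just raw text completion.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "appvoid/palmer-003-turbo-2401"  # hypothetical repo id
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id)

# Pass plain text and let the model complete it.
inputs = tokenizer("The meaning of life is", return_tensors="pt")
outputs = model.generate(**inputs, max_new_tokens=32)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```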