garage-bAInd
/

Stable-Platypus2-13B

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

arielnlee commited on Jan 3

Commit

97e9bf8

•

1 Parent(s): 44fd282

Update README.md

Files changed (1) hide show

README.md +0 -12

README.md CHANGED Viewed

@@ -12,18 +12,6 @@ Stable-Platypus-13B is a merge of [`garage-bAInd/Platypus2-13B`](https://hugging
 ![Platty](./Best_Platty_small.jpeg)
-### Benchmark Metrics
-| Metric                | Value |
-|-----------------------|-------|
-| MMLU (5-shot)         |   58.30   |
-| ARC (25-shot)         |   62.71   |
-| HellaSwag (10-shot)   |   82.29   |
-| TruthfulQA (0-shot)   |   52.52   |
-| Avg.                  |   63.96   |
-We use state-of-the-art [Language Model Evaluation Harness](https://github.com/EleutherAI/lm-evaluation-harness) to run the benchmark tests above, using the same version as the HuggingFace LLM Leaderboard. Please see below for detailed instructions on reproducing benchmark results.
 ### Model Details
 * **Trained by**: **Platypus2-13B** trained by Cole Hunter & Ariel Lee; **StableBeluga-13B** trained by StabilityAI

 ![Platty](./Best_Platty_small.jpeg)
 ### Model Details
 * **Trained by**: **Platypus2-13B** trained by Cole Hunter & Ariel Lee; **StableBeluga-13B** trained by StabilityAI