chargoddard
/

llama3-42b-v0

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

chargoddard commited on Apr 21, 2024

Commit

ae96f79

·

verified ·

1 Parent(s): da911da

Update README.md

Files changed (1) hide show

README.md +7 -1

README.md CHANGED Viewed

@@ -4,6 +4,10 @@ datasets:
 - JeanKaddour/minipile
 language:
 - en
 ---
 Meta's Llama 3 70B pruned to 42B parameters using the methodology described in [The Unreasonable Ineffectiveness of the Deeper Layers](https://arxiv.org/abs/2403.17887). Post-pruning trained using QLoRA for ~100M tokens from [JeanKaddour/minipile](https://huggingface.co/datasets/JeanKaddour/minipile).
@@ -29,4 +33,6 @@ Still evaluating, don't get too excited! Might be incredibly dumb. Check out the
 | - humanities     |N/A    |none  |     5|acc   |0.7296|±  |0.0062|
 | - other          |N/A    |none  |     5|acc   |0.8101|±  |0.0067|
 | - social_sciences|N/A    |none  |     5|acc   |0.8668|±  |0.0060|
-| - stem           |N/A    |none  |     5|acc   |0.6825|±  |0.0079|

 - JeanKaddour/minipile
 language:
 - en
+tags:
+- axolotl
+- mergekit
+- llama
 ---
 Meta's Llama 3 70B pruned to 42B parameters using the methodology described in [The Unreasonable Ineffectiveness of the Deeper Layers](https://arxiv.org/abs/2403.17887). Post-pruning trained using QLoRA for ~100M tokens from [JeanKaddour/minipile](https://huggingface.co/datasets/JeanKaddour/minipile).
 | - humanities     |N/A    |none  |     5|acc   |0.7296|±  |0.0062|
 | - other          |N/A    |none  |     5|acc   |0.8101|±  |0.0067|
 | - social_sciences|N/A    |none  |     5|acc   |0.8668|±  |0.0060|
+| - stem           |N/A    |none  |     5|acc   |0.6825|±  |0.0079|
+[<img src="https://raw.githubusercontent.com/OpenAccess-AI-Collective/axolotl/main/image/axolotl-badge-web.png" alt="Built with Axolotl" width="200" height="32"/>](https://github.com/OpenAccess-AI-Collective/axolotl)