gradientai
/

Llama-3-8B-Instruct-Gradient-1048k

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

leo-pekelis-gradient commited on Apr 29

Commit

1c075c4

•

1 Parent(s): 69e6264

Update README.md

Files changed (1) hide show

README.md +1 -1

README.md CHANGED Viewed

@@ -38,7 +38,7 @@ For training data, we generate long contexts by augmenting [SlimPajama](https://
 |                        | 65K       | 262K      | 524k      | 1048k     |
 |------------------------|-----------|-----------|-----------|-----------|
-| Initialize From        | LLaMA-3 7B| 65K       | 262K      | 524k      |
 | Sequence Length 2^N    | 16        | 18        | 19        | 20        |
 | RoPE theta             | 15.3 M    | 207.1 M   | 1.06B     | 2.80B     |
 | Batch Size             | 1         | 1         | 16         | 16         |

 |                        | 65K       | 262K      | 524k      | 1048k     |
 |------------------------|-----------|-----------|-----------|-----------|
+| Initialize From        | LLaMA-3 8B| 65K       | 262K      | 524k      |
 | Sequence Length 2^N    | 16        | 18        | 19        | 20        |
 | RoPE theta             | 15.3 M    | 207.1 M   | 1.06B     | 2.80B     |
 | Batch Size             | 1         | 1         | 16         | 16         |