Update README to add title and Gradient logo (#7)
- Update README to add title and Gradient logo (7ae8bfa8cfd177d5b3f9477b3cb48e8f10138943)
README.md
CHANGED
@@ -6,6 +6,9 @@ tags:
 - meta
 - llama-3
 ---
+<img src="https://cdn-uploads.huggingface.co/production/uploads/655bb613e8a8971e89944f3e/TSa3V8YpoVagnTYgxiLaO.png" width="200"/>
+
+# Llama-3 8B Instruct 262k
 
 ![image/png](https://cdn-uploads.huggingface.co/production/uploads/6585dc9be92bc5f258156bd6/hiHWva3CbsrnPvZTp5-lu.png)
 
@@ -40,6 +43,14 @@ For training data, we generate long contexts by augmenting [SlimPajama](https://
 | # GPUs | 8 | 8 |
 | GPU Type | NVIDIA L40S| NVIDIA L40S|
 
+## The Gradient AI Team
+
+Gradient is accelerating AI transformation across industries. https://gradient.ai/
+
+## Contact Us
+
+Drop an email to [contact@gradient.ai](mailto:contact@gradient.ai)
+
 ## References
 
 [1] Peng, Bowen, et al. "Yarn: Efficient context window extension of large language models." arXiv preprint arXiv:2309.00071 (2023).
@@ -48,13 +59,6 @@ For training data, we generate long contexts by augmenting [SlimPajama](https://
 
 [3] https://github.com/jzhang38/EasyContext
 
-## The Gradient AI Team
-
-Gradient is accelerating AI transformation across industries. https://gradient.ai/
-
-## Contact Us
-
-Drop an email to [contact@gradient.ai](mailto:contact@gradient.ai)
 
 ----
 