viethoangtranduong
commited on
Commit
•
d76b839
1
Parent(s):
d5ef612
Update README.md
Browse files
README.md
CHANGED
@@ -75,6 +75,30 @@ Alan Mathison Turing (23 June 1912 7 June 1954) was an English computer scienti
|
|
75 |
|
76 |
To do GPU Inference in Int8 or CPU inference, please refer to the `togethercomputer/RedPajama-INCITE-7B-Chat` documentation.
|
77 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
78 |
# Uses
|
79 |
|
80 |
## Direct Use
|
@@ -114,21 +138,6 @@ Using the model to generate content that is cruel to individuals is a misuse of
|
|
114 |
For example, the model may not always provide accurate or relevant answers, particularly for questions that are complex, ambiguous, or outside of its training data.
|
115 |
We therefore welcome contributions from individuals and organizations, and encourage collaboration towards creating a more robust and inclusive chatbot.
|
116 |
|
117 |
-
## Training
|
118 |
-
|
119 |
-
**Training Data**
|
120 |
-
|
121 |
-
Please refer to [snorkelai/snorkel-curated-instruction-tuning](https://huggingface.co/datasets/snorkelai/snorkel-curated-instruction-tuning)
|
122 |
-
|
123 |
-
**Training Procedure**
|
124 |
-
|
125 |
-
- **Hardware:** 8 A100
|
126 |
-
- **Optimizer:** Adam
|
127 |
-
- **Gradient Accumulations**: 1
|
128 |
-
- **Num of Tokens:** 3.8M tokens
|
129 |
-
- **Learning rate:** 1e-5
|
130 |
-
- **Batch size:** 64
|
131 |
-
|
132 |
## Community
|
133 |
|
134 |
Join us on [Snorkel AI Slack](snorkel.ai/slack)
|
|
|
75 |
|
76 |
To do GPU Inference in Int8 or CPU inference, please refer to the `togethercomputer/RedPajama-INCITE-7B-Chat` documentation.
|
77 |
|
78 |
+
|
79 |
+
## Training
|
80 |
+
|
81 |
+
**Training Data**
|
82 |
+
|
83 |
+
Please refer to [snorkelai/snorkel-curated-instruction-tuning](https://huggingface.co/datasets/snorkelai/snorkel-curated-instruction-tuning)
|
84 |
+
|
85 |
+
**Training Procedure**
|
86 |
+
|
87 |
+
- **Hardware:** 8 A100
|
88 |
+
- **Optimizer:** Adam
|
89 |
+
- **Gradient Accumulations**: 1
|
90 |
+
- **Num of Tokens:** 3.8M tokens
|
91 |
+
- **Learning rate:** 1e-5
|
92 |
+
- **Batch size:** 64
|
93 |
+
|
94 |
+
## License/Attribution
|
95 |
+
**Copyright (2023) Snorkel AI, Inc.** This model was developed at [Snorkel AI](https://snorkel.ai/) and its use is subject to the Apache 2.0 license.
|
96 |
+
|
97 |
+
This work comes with the collaboration with Together Computer.
|
98 |
+
|
99 |
+
|
100 |
+
Please further licensing, please refer to the dataset licenses: [snorkelai/snorkel-curated-instruction-tuning](https://huggingface.co/datasets/snorkelai/snorkel-curated-instruction-tuning/)
|
101 |
+
|
102 |
# Uses
|
103 |
|
104 |
## Direct Use
|
|
|
138 |
For example, the model may not always provide accurate or relevant answers, particularly for questions that are complex, ambiguous, or outside of its training data.
|
139 |
We therefore welcome contributions from individuals and organizations, and encourage collaboration towards creating a more robust and inclusive chatbot.
|
140 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
141 |
## Community
|
142 |
|
143 |
Join us on [Snorkel AI Slack](snorkel.ai/slack)
|