ajibawa-2023
commited on
Commit
•
30b4c56
1
Parent(s):
506f4a2
Update README.md
Browse files
README.md
CHANGED
@@ -16,7 +16,7 @@ This data was generated using GPT-3.5, GPT-4 etc. This conversation is in Vicuna
|
|
16 |
I have released the [data](https://huggingface.co/datasets/ajibawa-2023/Python-Code-23k-ShareGPT).
|
17 |
|
18 |
**Training:**
|
19 |
-
Entire dataset was trained on Azure 4 x A100 80GB. For 3 epoch, training took 42 hours. DeepSpeed codebase was used for training purpose. This was trained on Llama-
|
20 |
|
21 |
|
22 |
**GPTQ GGML & AWQ**
|
|
|
16 |
I have released the [data](https://huggingface.co/datasets/ajibawa-2023/Python-Code-23k-ShareGPT).
|
17 |
|
18 |
**Training:**
|
19 |
+
Entire dataset was trained on Azure 4 x A100 80GB. For 3 epoch, training took 42 hours. DeepSpeed codebase was used for training purpose. This was trained on Llama-1 by Meta.
|
20 |
|
21 |
|
22 |
**GPTQ GGML & AWQ**
|