LoneStriker
commited on
Commit
•
53b516c
1
Parent(s):
b709f89
Upload folder using huggingface_hub
Browse files
.gitattributes
CHANGED
@@ -1,35 +1,5 @@
|
|
1 |
-
|
2 |
-
|
3 |
-
|
4 |
-
|
5 |
-
|
6 |
-
*.ftz filter=lfs diff=lfs merge=lfs -text
|
7 |
-
*.gz filter=lfs diff=lfs merge=lfs -text
|
8 |
-
*.h5 filter=lfs diff=lfs merge=lfs -text
|
9 |
-
*.joblib filter=lfs diff=lfs merge=lfs -text
|
10 |
-
*.lfs.* filter=lfs diff=lfs merge=lfs -text
|
11 |
-
*.mlmodel filter=lfs diff=lfs merge=lfs -text
|
12 |
-
*.model filter=lfs diff=lfs merge=lfs -text
|
13 |
-
*.msgpack filter=lfs diff=lfs merge=lfs -text
|
14 |
-
*.npy filter=lfs diff=lfs merge=lfs -text
|
15 |
-
*.npz filter=lfs diff=lfs merge=lfs -text
|
16 |
-
*.onnx filter=lfs diff=lfs merge=lfs -text
|
17 |
-
*.ot filter=lfs diff=lfs merge=lfs -text
|
18 |
-
*.parquet filter=lfs diff=lfs merge=lfs -text
|
19 |
-
*.pb filter=lfs diff=lfs merge=lfs -text
|
20 |
-
*.pickle filter=lfs diff=lfs merge=lfs -text
|
21 |
-
*.pkl filter=lfs diff=lfs merge=lfs -text
|
22 |
-
*.pt filter=lfs diff=lfs merge=lfs -text
|
23 |
-
*.pth filter=lfs diff=lfs merge=lfs -text
|
24 |
-
*.rar filter=lfs diff=lfs merge=lfs -text
|
25 |
-
*.safetensors filter=lfs diff=lfs merge=lfs -text
|
26 |
-
saved_model/**/* filter=lfs diff=lfs merge=lfs -text
|
27 |
-
*.tar.* filter=lfs diff=lfs merge=lfs -text
|
28 |
-
*.tar filter=lfs diff=lfs merge=lfs -text
|
29 |
-
*.tflite filter=lfs diff=lfs merge=lfs -text
|
30 |
-
*.tgz filter=lfs diff=lfs merge=lfs -text
|
31 |
-
*.wasm filter=lfs diff=lfs merge=lfs -text
|
32 |
-
*.xz filter=lfs diff=lfs merge=lfs -text
|
33 |
-
*.zip filter=lfs diff=lfs merge=lfs -text
|
34 |
-
*.zst filter=lfs diff=lfs merge=lfs -text
|
35 |
-
*tfevents* filter=lfs diff=lfs merge=lfs -text
|
|
|
1 |
+
OpenHermes-2.5-Code-290k-13B-Q3_K_L.gguf filter=lfs diff=lfs merge=lfs -text
|
2 |
+
OpenHermes-2.5-Code-290k-13B-Q4_K_M.gguf filter=lfs diff=lfs merge=lfs -text
|
3 |
+
OpenHermes-2.5-Code-290k-13B-Q5_K_M.gguf filter=lfs diff=lfs merge=lfs -text
|
4 |
+
OpenHermes-2.5-Code-290k-13B-Q6_K.gguf filter=lfs diff=lfs merge=lfs -text
|
5 |
+
OpenHermes-2.5-Code-290k-13B-Q8_0.gguf filter=lfs diff=lfs merge=lfs -text
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
OpenHermes-2.5-Code-290k-13B-Q3_K_L.gguf
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:637d177b19a450d66fd6a372db13bb81a942daf39aa6346e9f39c78ba730ad35
|
3 |
+
size 6929559552
|
OpenHermes-2.5-Code-290k-13B-Q4_K_M.gguf
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:7b313692b575a14f3fc8588714b1e9c97821fe5b60f67e49acca2c300f5c6868
|
3 |
+
size 7865956352
|
OpenHermes-2.5-Code-290k-13B-Q5_K_M.gguf
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:b8bc8de958ad06992b9bbb7c1707faaf8ab28893ad4fa70895ffb43d946d49f7
|
3 |
+
size 9229924352
|
OpenHermes-2.5-Code-290k-13B-Q6_K.gguf
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:b9d95634e8967d34087c75de1f1c1e93a2597b130c217bdcdf6f803454902b0e
|
3 |
+
size 10679140352
|
OpenHermes-2.5-Code-290k-13B-Q8_0.gguf
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:f9f6955c1df25f8a15c82426163a0104dd10e849e87934c32c0a826c8a708e70
|
3 |
+
size 13831319552
|
README.md
ADDED
@@ -0,0 +1,65 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
+
---
|
2 |
+
license: apache-2.0
|
3 |
+
datasets:
|
4 |
+
- ajibawa-2023/OpenHermes-2.5-Code-290k
|
5 |
+
language:
|
6 |
+
- en
|
7 |
+
tags:
|
8 |
+
- code
|
9 |
+
- finetune
|
10 |
+
- synthetic data
|
11 |
+
- text-generation-inference
|
12 |
+
- conversational
|
13 |
+
---
|
14 |
+
|
15 |
+
**OpenHermes-2.5-Code-290k-13B**
|
16 |
+
|
17 |
+
OpenHermes-2.5-Code-290k-13B is a state of the art Llama-2 Fine-tune, which is trained on additional code dataset.
|
18 |
+
This model is trained on my existing dataset [OpenHermes-2.5-Code-290k](https://huggingface.co/datasets/ajibawa-2023/OpenHermes-2.5-Code-290k).
|
19 |
+
This dataset is amalgamation of two datasets. I have used [OpenHermes-2.5](https://huggingface.co/datasets/teknium/OpenHermes-2.5) a super quality dataset made avaliable by teknium. Other datset is my own [Code-290k-ShareGPT](https://huggingface.co/datasets/ajibawa-2023/Code-290k-ShareGPT).
|
20 |
+
Dataset is in Vicuna/ShareGPT format. There are around **1.29 million** set of conversations. I have cleaned the dataset provided by Teknium and removed metadata such as "source" & "category" etc. This dataset has primarily synthetically generated instruction and chat samples.
|
21 |
+
|
22 |
+
This model has enhanced coding capabilities besides other capabilities such as **Blogging, story generation, Q&A and many more**.
|
23 |
+
|
24 |
+
**Training:**
|
25 |
+
|
26 |
+
Entire model was trained on 4 x A100 80GB. For 2 epoch, training took **21 Days**. Fschat & DeepSpeed codebase was used for training purpose. This was trained on Llama-2 by Meta.
|
27 |
+
|
28 |
+
|
29 |
+
This is a full fine tuned model. Links for quantized models will be updated soon.
|
30 |
+
|
31 |
+
|
32 |
+
**GPTQ, GGUF, AWQ & Exllama**
|
33 |
+
|
34 |
+
GPTQ: TBA
|
35 |
+
|
36 |
+
GGUF: TBA
|
37 |
+
|
38 |
+
AWQ: TBA
|
39 |
+
|
40 |
+
Exllama v2: TBA
|
41 |
+
|
42 |
+
|
43 |
+
|
44 |
+
|
45 |
+
|
46 |
+
**Example Prompt:**
|
47 |
+
```
|
48 |
+
This is a conversation with your helpful AI assistant. AI assistant can generate Code in various Programming Languages along with necessary explanation. It can generate Story, Blogs .....
|
49 |
+
|
50 |
+
Context
|
51 |
+
You are a helpful AI assistant.
|
52 |
+
|
53 |
+
USER: <prompt>
|
54 |
+
ASSISTANT:
|
55 |
+
```
|
56 |
+
|
57 |
+
You can modify above Prompt as per your requirement. I have used ShareGPT/Vicuna format v1.1 .
|
58 |
+
|
59 |
+
I want to say special Thanks to the Open Source community for helping & guiding me to better understand the AI/Model development.
|
60 |
+
|
61 |
+
Thank you for your love & support.
|
62 |
+
|
63 |
+
**Example Output**
|
64 |
+
|
65 |
+
I will update soon.
|