zoeywin sequelbox commited on
Commit
8274e28
1 Parent(s): f062a15

Upload folder using huggingface_hub (#5)

Browse files

- 278bbb9c1984a9662cb757bad0a482bbc42d57f7a1dcb1cf6fc2f4e4b4ff3ee9 (cf6a17908672b40b9c9bfa1d576ed055eda38be0)
- af7edc79a82a2076142ec9eb1a7728a456c6b0a75cad97db011837257c6fdd89 (6a25582a358fd3b93ab3bd175bb9e2aad6666621)
- 7df24d748aa0058f5dc623c255241c704826f88929af13e08acf5c11d17240b3 (c14d9571bed431fad64ba8eceba0e65096902d17)
- 19531b187e6991fde11622092df30d89c60d3c393f2e1167cf2d5b937868ee62 (704ae94ec3852ae44d4eb7a6d10fc93fa459eaa4)
- ab9191993d62eac7d565e05ba0bb7fb3e4d4a183dfa1033838be7647a08106fd (033cce5b18ab5df97ed5f6f080cc0549dc90efef)
- f15d8edd84ae43cc5ff6e2cd0eb5ce8eb5856d5cee9060bad69cd99757751582 (5e49be042825248b967c5e6f2ba7a6b19b1f5e7b)
- 46c59c884b6859208f322326f73ca9d97d08eda4563d679b6fe3c5212285aff6 (5ab53f999100bb7dbb73d44629db27f075058645)
- model card (47d2134d6483482dc4a50485856c54c8d2ef322f)


Co-authored-by: scott <sequelbox@users.noreply.huggingface.co>

README.md CHANGED
@@ -23,20 +23,24 @@ tags:
23
  base_model: meta-llama/Meta-Llama-3.1-8B-Instruct
24
  datasets:
25
  - sequelbox/Tachibana
26
- - LDJnr/Pure-Dove
27
  model_type: llama
28
  license: llama3.1
29
  ---
30
 
31
 
 
 
 
32
  Enigma is a code-instruct model built on Llama 3.1 8b.
33
  - High quality code instruct performance within the Llama 3 Instruct chat format
34
  - Finetuned on synthetic code-instruct data generated with Llama 3.1 405b. [Find the current version of the dataset here!](https://huggingface.co/datasets/sequelbox/Tachibana)
 
35
 
36
 
37
  ## Version
38
 
39
- This is the **2024-08-10** release of Enigma for Llama 3.1 8b.
40
 
41
  Help us and recommend Enigma to your friends! We're excited for more Enigma releases in the future.
42
 
@@ -73,9 +77,9 @@ print(outputs[0]["generated_text"][-1])
73
  ```
74
 
75
  ## The Model
76
- Enigma is built on top of Llama 3.1 8b Instruct, using code-instruct data to supplement code-instruct performance using Llama 3.1 Instruct prompt style.
77
 
78
- Our current version of the Enigma code-instruct dataset is [sequelbox/Tachibana](https://huggingface.co/datasets/sequelbox/Tachibana), supplemented with a small selection of data from [LDJnr/Pure-Dove](https://huggingface.co/datasets/LDJnr/Pure-Dove) for general chat consistency.
79
 
80
 
81
  ![image/jpeg](https://cdn-uploads.huggingface.co/production/uploads/63444f2687964b331809eb55/VCJ8Fmefd8cdVhXSSxJiD.jpeg)
 
23
  base_model: meta-llama/Meta-Llama-3.1-8B-Instruct
24
  datasets:
25
  - sequelbox/Tachibana
26
+ - sequelbox/Supernova
27
  model_type: llama
28
  license: llama3.1
29
  ---
30
 
31
 
32
+ ![image/jpeg](https://cdn-uploads.huggingface.co/production/uploads/64f267a8a4f79a118e0fcc89/it7MY5MyLCLpFQev5dUis.jpeg)
33
+
34
+
35
  Enigma is a code-instruct model built on Llama 3.1 8b.
36
  - High quality code instruct performance within the Llama 3 Instruct chat format
37
  - Finetuned on synthetic code-instruct data generated with Llama 3.1 405b. [Find the current version of the dataset here!](https://huggingface.co/datasets/sequelbox/Tachibana)
38
+ - Overall chat performance supplemented with [generalist synthetic data.](https://huggingface.co/datasets/sequelbox/Supernova)
39
 
40
 
41
  ## Version
42
 
43
+ This is the **2024-09-04** release of Enigma for Llama 3.1 8b, enhancing code-instruct and general chat capabilities.
44
 
45
  Help us and recommend Enigma to your friends! We're excited for more Enigma releases in the future.
46
 
 
77
  ```
78
 
79
  ## The Model
80
+ Enigma is built on top of Llama 3.1 8b Instruct, using high quality code-instruct data and general chat data in Llama 3.1 Instruct prompt style to supplement overall performance.
81
 
82
+ Our current version of Enigma is trained on code-instruct data from [sequelbox/Tachibana](https://huggingface.co/datasets/sequelbox/Tachibana) and general chat data from [sequelbox/Supernova.](https://huggingface.co/datasets/sequelbox/Supernova)
83
 
84
 
85
  ![image/jpeg](https://cdn-uploads.huggingface.co/production/uploads/63444f2687964b331809eb55/VCJ8Fmefd8cdVhXSSxJiD.jpeg)
config.json CHANGED
@@ -33,7 +33,7 @@
33
  "rope_theta": 500000.0,
34
  "tie_word_embeddings": false,
35
  "torch_dtype": "float32",
36
- "transformers_version": "4.44.0",
37
  "use_cache": true,
38
  "vocab_size": 128256
39
  }
 
33
  "rope_theta": 500000.0,
34
  "tie_word_embeddings": false,
35
  "torch_dtype": "float32",
36
+ "transformers_version": "4.44.2",
37
  "use_cache": true,
38
  "vocab_size": 128256
39
  }
generation_config.json CHANGED
@@ -8,5 +8,5 @@
8
  ],
9
  "temperature": 0.6,
10
  "top_p": 0.9,
11
- "transformers_version": "4.44.0"
12
  }
 
8
  ],
9
  "temperature": 0.6,
10
  "top_p": 0.9,
11
+ "transformers_version": "4.44.2"
12
  }
model-00001-of-00007.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:7e9e5664bfc422dbb7cca97eb819fa479b586b328be26d51971633ebba245ca0
3
  size 4886466168
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:08dff18399f4082cd4af329673d8e5f05ba976529cd4b2fd3eaa8a198ad48a0c
3
  size 4886466168
model-00002-of-00007.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:a0b7fad220c567c9c95008ac7d0757ca2993ae7b83646ca524332d1eaa21f469
3
  size 4832007448
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:93734a9e3f00cbded5bb06644d1a5a8a247c14f383e61ad49ad0c671e350f262
3
  size 4832007448
model-00003-of-00007.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:c7a1e5696beaed309e06ae8c093ee5641f502b373b414afa6de929b706df1341
3
  size 4999813112
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:d58a3870696db28abed0917760f77a5cf11322674209263462ff08935f87ea7e
3
  size 4999813112
model-00004-of-00007.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:f9d982d47c65e3a0bf8136ab05cd82dab3a5029855837c14d599c02a0cf6dffe
3
  size 4999813128
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:52bddb21602fc8f2ce0f79d011b6f6280dcf65f659142766b84f9a375524d364
3
  size 4999813128
model-00005-of-00007.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:1d9b103e8b6a9e2d283eabb59871ba708dd5d51f530e81e765f1ee6227ae9722
3
  size 4832007496
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:6f67914bd40d6b748669032e06fd2c36eb83c5354da989052174a97270c3dd1b
3
  size 4832007496
model-00006-of-00007.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:c3ae2952e9f2312e2ec29c0797392c799ca3a1250c99db6d99df0b1c4a7b768d
3
  size 4999813120
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:74a7c7fc0cc3074b7eb787ef4ede238dd0733565edadbb137c497615122e8080
3
  size 4999813120
tokenizer.json CHANGED
@@ -1,11 +1,6 @@
1
  {
2
  "version": "1.0",
3
- "truncation": {
4
- "direction": "Right",
5
- "max_length": 5450,
6
- "strategy": "LongestFirst",
7
- "stride": 0
8
- },
9
  "padding": null,
10
  "added_tokens": [
11
  {
 
1
  {
2
  "version": "1.0",
3
+ "truncation": null,
 
 
 
 
 
4
  "padding": null,
5
  "added_tokens": [
6
  {