ValiantLabs
/

Llama3.1-8B-Enigma

Model card Files Files and versions Community

zoeywin

sequelbox commited on Sep 4

Commit

8274e28

•

1 Parent(s): f062a15

Upload folder using huggingface_hub (#5)

Browse files

- 278bbb9c1984a9662cb757bad0a482bbc42d57f7a1dcb1cf6fc2f4e4b4ff3ee9 (cf6a17908672b40b9c9bfa1d576ed055eda38be0)
- af7edc79a82a2076142ec9eb1a7728a456c6b0a75cad97db011837257c6fdd89 (6a25582a358fd3b93ab3bd175bb9e2aad6666621)
- 7df24d748aa0058f5dc623c255241c704826f88929af13e08acf5c11d17240b3 (c14d9571bed431fad64ba8eceba0e65096902d17)
- 19531b187e6991fde11622092df30d89c60d3c393f2e1167cf2d5b937868ee62 (704ae94ec3852ae44d4eb7a6d10fc93fa459eaa4)
- ab9191993d62eac7d565e05ba0bb7fb3e4d4a183dfa1033838be7647a08106fd (033cce5b18ab5df97ed5f6f080cc0549dc90efef)
- f15d8edd84ae43cc5ff6e2cd0eb5ce8eb5856d5cee9060bad69cd99757751582 (5e49be042825248b967c5e6f2ba7a6b19b1f5e7b)
- 46c59c884b6859208f322326f73ca9d97d08eda4563d679b6fe3c5212285aff6 (5ab53f999100bb7dbb73d44629db27f075058645)
- model card (47d2134d6483482dc4a50485856c54c8d2ef322f)

Co-authored-by: scott <sequelbox@users.noreply.huggingface.co>

Files changed (10) hide show

README.md +8 -4
config.json +1 -1
generation_config.json +1 -1
model-00001-of-00007.safetensors +1 -1
model-00002-of-00007.safetensors +1 -1
model-00003-of-00007.safetensors +1 -1
model-00004-of-00007.safetensors +1 -1
model-00005-of-00007.safetensors +1 -1
model-00006-of-00007.safetensors +1 -1
tokenizer.json +1 -6

README.md CHANGED Viewed

@@ -23,20 +23,24 @@ tags:
 base_model: meta-llama/Meta-Llama-3.1-8B-Instruct
 datasets:
 - sequelbox/Tachibana
-- LDJnr/Pure-Dove
 model_type: llama
 license: llama3.1
 ---
 Enigma is a code-instruct model built on Llama 3.1 8b.
 - High quality code instruct performance within the Llama 3 Instruct chat format
 - Finetuned on synthetic code-instruct data generated with Llama 3.1 405b. [Find the current version of the dataset here!](https://huggingface.co/datasets/sequelbox/Tachibana)
 ## Version
-This is the **2024-08-10** release of Enigma for Llama 3.1 8b.
 Help us and recommend Enigma to your friends! We're excited for more Enigma releases in the future.
@@ -73,9 +77,9 @@ print(outputs[0]["generated_text"][-1])
 ```
 ## The Model
-Enigma is built on top of Llama 3.1 8b Instruct, using code-instruct data to supplement code-instruct performance using Llama 3.1 Instruct prompt style.
-Our current version of the Enigma code-instruct dataset is [sequelbox/Tachibana](https://huggingface.co/datasets/sequelbox/Tachibana), supplemented with a small selection of data from [LDJnr/Pure-Dove](https://huggingface.co/datasets/LDJnr/Pure-Dove) for general chat consistency.
 ![image/jpeg](https://cdn-uploads.huggingface.co/production/uploads/63444f2687964b331809eb55/VCJ8Fmefd8cdVhXSSxJiD.jpeg)

 base_model: meta-llama/Meta-Llama-3.1-8B-Instruct
 datasets:
 - sequelbox/Tachibana
+- sequelbox/Supernova
 model_type: llama
 license: llama3.1
 ---
+![image/jpeg](https://cdn-uploads.huggingface.co/production/uploads/64f267a8a4f79a118e0fcc89/it7MY5MyLCLpFQev5dUis.jpeg)
 Enigma is a code-instruct model built on Llama 3.1 8b.
 - High quality code instruct performance within the Llama 3 Instruct chat format
 - Finetuned on synthetic code-instruct data generated with Llama 3.1 405b. [Find the current version of the dataset here!](https://huggingface.co/datasets/sequelbox/Tachibana)
+- Overall chat performance supplemented with [generalist synthetic data.](https://huggingface.co/datasets/sequelbox/Supernova)
 ## Version
+This is the **2024-09-04** release of Enigma for Llama 3.1 8b, enhancing code-instruct and general chat capabilities.
 Help us and recommend Enigma to your friends! We're excited for more Enigma releases in the future.
 ```
 ## The Model
+Enigma is built on top of Llama 3.1 8b Instruct, using high quality code-instruct data and general chat data in Llama 3.1 Instruct prompt style to supplement overall performance.
+Our current version of Enigma is trained on code-instruct data from [sequelbox/Tachibana](https://huggingface.co/datasets/sequelbox/Tachibana) and general chat data from [sequelbox/Supernova.](https://huggingface.co/datasets/sequelbox/Supernova)
 ![image/jpeg](https://cdn-uploads.huggingface.co/production/uploads/63444f2687964b331809eb55/VCJ8Fmefd8cdVhXSSxJiD.jpeg)

config.json CHANGED Viewed

@@ -33,7 +33,7 @@
   "rope_theta": 500000.0,
   "tie_word_embeddings": false,
   "torch_dtype": "float32",
-  "transformers_version": "4.44.0",
   "use_cache": true,
   "vocab_size": 128256
 }

   "rope_theta": 500000.0,
   "tie_word_embeddings": false,
   "torch_dtype": "float32",
+  "transformers_version": "4.44.2",
   "use_cache": true,
   "vocab_size": 128256
 }

generation_config.json CHANGED Viewed

@@ -8,5 +8,5 @@
   ],
   "temperature": 0.6,
   "top_p": 0.9,
-  "transformers_version": "4.44.0"
 }

   ],
   "temperature": 0.6,
   "top_p": 0.9,
+  "transformers_version": "4.44.2"
 }

model-00001-of-00007.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:7e9e5664bfc422dbb7cca97eb819fa479b586b328be26d51971633ebba245ca0
 size 4886466168

 version https://git-lfs.github.com/spec/v1
+oid sha256:08dff18399f4082cd4af329673d8e5f05ba976529cd4b2fd3eaa8a198ad48a0c
 size 4886466168

model-00002-of-00007.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:a0b7fad220c567c9c95008ac7d0757ca2993ae7b83646ca524332d1eaa21f469
 size 4832007448

 version https://git-lfs.github.com/spec/v1
+oid sha256:93734a9e3f00cbded5bb06644d1a5a8a247c14f383e61ad49ad0c671e350f262
 size 4832007448

model-00003-of-00007.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:c7a1e5696beaed309e06ae8c093ee5641f502b373b414afa6de929b706df1341
 size 4999813112

 version https://git-lfs.github.com/spec/v1
+oid sha256:d58a3870696db28abed0917760f77a5cf11322674209263462ff08935f87ea7e
 size 4999813112

model-00004-of-00007.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:f9d982d47c65e3a0bf8136ab05cd82dab3a5029855837c14d599c02a0cf6dffe
 size 4999813128

 version https://git-lfs.github.com/spec/v1
+oid sha256:52bddb21602fc8f2ce0f79d011b6f6280dcf65f659142766b84f9a375524d364
 size 4999813128

model-00005-of-00007.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:1d9b103e8b6a9e2d283eabb59871ba708dd5d51f530e81e765f1ee6227ae9722
 size 4832007496

 version https://git-lfs.github.com/spec/v1
+oid sha256:6f67914bd40d6b748669032e06fd2c36eb83c5354da989052174a97270c3dd1b
 size 4832007496

model-00006-of-00007.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:c3ae2952e9f2312e2ec29c0797392c799ca3a1250c99db6d99df0b1c4a7b768d
 size 4999813120

 version https://git-lfs.github.com/spec/v1
+oid sha256:74a7c7fc0cc3074b7eb787ef4ede238dd0733565edadbb137c497615122e8080
 size 4999813120

tokenizer.json CHANGED Viewed

@@ -1,11 +1,6 @@
 {
   "version": "1.0",
-  "truncation": {
-    "direction": "Right",
-    "max_length": 5450,
-    "strategy": "LongestFirst",
-    "stride": 0
-  },
   "padding": null,
   "added_tokens": [
     {

 {
   "version": "1.0",
+  "truncation": null,
   "padding": null,
   "added_tokens": [
     {