nisten/BigCodeLlama-169b-GGUF

nisten committed on Jan 29

Commit 098405a (parent: 7a1bb49)

Update README.md

Files changed (1)

README.md +56 -0

@@ -1,3 +1,59 @@
 ---
 license: mit
+tags:
+- merge
 ---
+---
+base_model: [nisten/BigCodeLlama-169b]
+tags:
+- mergekit
+- merge
+---
+# Quantizations of BigCodeLLama LFG 🚀
+## Experimental CodeLlaMA frankenstein to see how it benchmarks
+### Models Merged
+The following models were included in the merge:
+* ../CodeLlama-70b-hf
+* ../CodeLlama-70b-Instruct-hf
+* ../CodeLlama-70b-Python-hf
+### Configuration
+The following YAML configuration was used to produce this model:
+```yaml
+dtype: bfloat16
+merge_method: passthrough
+slices:
+- sources:
+  - layer_range: [0, 69]
+    model:
+      model:
+        path: ../CodeLlama-70b-hf
+- sources:
+  - layer_range: [66, 76]
+    model:
+      model:
+        path: ../CodeLlama-70b-Instruct-hf
+- sources:
+  - layer_range: [42, 66]
+    model:
+      model:
+        path: ../CodeLlama-70b-hf
+- sources:
+  - layer_range: [13, 37]
+    model:
+      model:
+        path: ../CodeLlama-70b-Python-hf
+- sources:
+  - layer_range: [10, 80]
+    model:
+      model:
+        path: ../CodeLlama-70b-Instruct-hf
+```
+### Stay tuned for GGUFs quants
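
A passthrough merge stacks the listed slices end to end, so the depth of the merged model is the sum of the slice lengths. The sketch below tallies the layers in the configuration from this commit. It assumes `layer_range` is half-open, `[start, end)`, so that `[0, 69]` contributes 69 layers; that reading is an assumption here and should be checked against the mergekit version in use. The helper names (`slice_length`, `total_layers`, `layers_per_model`) are hypothetical and not part of mergekit.

```python
# Slices copied from the YAML configuration in this commit.
# Each entry is (source model, (start, end)); the range is ASSUMED to be
# half-open, i.e. it covers layers start .. end - 1.
SLICES = [
    ("CodeLlama-70b-hf", (0, 69)),
    ("CodeLlama-70b-Instruct-hf", (66, 76)),
    ("CodeLlama-70b-hf", (42, 66)),
    ("CodeLlama-70b-Python-hf", (13, 37)),
    ("CodeLlama-70b-Instruct-hf", (10, 80)),
]


def slice_length(layer_range):
    """Number of layers in one half-open [start, end) range."""
    start, end = layer_range
    if end < start:
        raise ValueError(f"invalid layer range: {layer_range}")
    return end - start


def total_layers(slices):
    """Depth of the stacked model: the sum of all slice lengths."""
    return sum(slice_length(layer_range) for _, layer_range in slices)


def layers_per_model(slices):
    """How many layers each source model contributes to the stack."""
    counts = {}
    for model, layer_range in slices:
        counts[model] = counts.get(model, 0) + slice_length(layer_range)
    return counts


if __name__ == "__main__":
    for model, layer_range in SLICES:
        print(f"{model:28s} {layer_range} -> {slice_length(layer_range)} layers")
    print("total layers:", total_layers(SLICES))
    print("per model:", layers_per_model(SLICES))
```

Under that assumption the stack comes to 197 layers: 93 from the base model, 80 from Instruct, and 24 from Python. If each source model has 80 layers (consistent with the last slice ending at 80, but not stated in the commit), the merge is roughly 2.5 times as deep as any single source.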