Anthonyg5005/L3-8B-Stheno-v3.1-int8-ct2

Anthonyg5005 committed on Jun 1

Commit 0fb5ca8 • 1 Parent(s): 9eb7400

Create README.md

Files changed (1)

README.md +26 -0

README.md ADDED

	@@ -0,0 +1,26 @@

+---
+license: cc-by-nc-4.0
+language:
+- en
+library_name: CTranslate2
+pipeline_tag: text-generation
+tags:
+  - facebook
+  - meta
+  - llama
+  - llama-3
+  - ct2
+  - quantized model
+  - int8
+---
+# CTranslate2 int8 version of L3-8B-Stheno-v3.1
+This is an int8_float16 quantization of [L3-8B-Stheno-v3.1](https://huggingface.co/Sao10K/L3-8B-Stheno-v3.1)\
+See more on CTranslate2: [Docs](https://opennmt.net/CTranslate2/index.html) | [GitHub](https://github.com/OpenNMT/CTranslate2)\
+This model was converted to ct2 format using the following command:
+```
+ct2-transformers-converter --model Sao10K/L3-8B-Stheno-v3.1 --output_dir L3-8B-Stheno-v3.1-ct2 --quantization int8_bfloat16 --low_cpu_mem_usage
+```
+***No conversion is needed when using the model from this repository, as it is already in ct2 format.***
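
Since the README says the files are already in ct2 format, loading them might look roughly like the sketch below. It is not part of this commit. It assumes the repository has been downloaded to a local folder named `L3-8B-Stheno-v3.1-int8-ct2`, that the tokenizer is taken from the original `Sao10K/L3-8B-Stheno-v3.1` repository, and that the `ctranslate2` and `transformers` packages are installed. The `generate` helper, its defaults, and the sampling values are illustrative choices, not settings recommended by the author.

```python
from pathlib import Path

# Assumption: this repository was downloaded to a local folder with this name.
MODEL_DIR = Path("L3-8B-Stheno-v3.1-int8-ct2")
# Assumption: the tokenizer is loaded from the original model repository.
TOKENIZER_ID = "Sao10K/L3-8B-Stheno-v3.1"


def generate(prompt, model_dir=MODEL_DIR, max_new_tokens=128):
    """Return a sampled continuation of `prompt` (hypothetical helper)."""
    # Imported lazily so this module can be imported without the heavy dependencies.
    import ctranslate2
    import transformers

    tokenizer = transformers.AutoTokenizer.from_pretrained(TOKENIZER_ID)
    # device="auto" lets CTranslate2 pick a GPU when one is available.
    generator = ctranslate2.Generator(str(model_dir), device="auto")

    # CTranslate2 expects token strings rather than token ids.
    tokens = tokenizer.convert_ids_to_tokens(tokenizer.encode(prompt))
    results = generator.generate_batch(
        [tokens],
        max_length=max_new_tokens,       # cap on the number of generated tokens
        sampling_topk=40,                # illustrative sampling settings
        sampling_temperature=0.8,
        include_prompt_in_result=False,  # return only the continuation
    )
    return tokenizer.decode(results[0].sequences_ids[0])
```

Calling `generate("Once upon a time")` would then return a plain-text continuation. The prompt is passed as raw text here; applying the Llama 3 chat template first is likely a better fit for chat-style use.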