Anthonyg5005's picture
Update README.md
9907a3b verified
|
raw
history blame contribute delete
No virus
830 Bytes
---
license: cc-by-nc-4.0
language:
- en
library_name: CTranslate2
pipeline_tag: text-generation
tags:
- facebook
- meta
- llama
- llama-3
- ct2
- quantized model
- int8
base_model: Sao10K/L3-8B-Stheno-v3.1
---
# CTranslate2 int8 version of L3-8B-Stheno-v3.1
This is a int8_bfloat16 quantization of [L3-8B-Stheno-v3.1](https://huggingface.co/Sao10K/L3-8B-Stheno-v3.1)\
See more on CTranslate2: [Docs](https://opennmt.net/CTranslate2/index.html) | [Github](https://github.com/OpenNMT/CTranslate2)
This model was converted to ct2 format using the following commnd:
```
ct2-transformers-converter --model Sao10K/L3-8B-Stheno-v3.1 --output_dir L3-8B-Stheno-v3.1-ct2 --quantization int8_bfloat16 --low_cpu_mem_usage
```
***no converstion needed using the model from this repository as it is already in ct2 format.***