Commit 7882b2d by cicdatopea (parent: e9042f5): Update README.md

README.md (CHANGED)
@@ -5,7 +5,7 @@ license: llama3.1
 ---
 ## Model Card Details
 
-This model is an int4 model with group_size 128 and asymmetric quantization of [meta-llama/Meta-Llama-3.1-70B-Instruct](https://huggingface.co/meta-llama/Meta-Llama-3.1-70B-Instruct) generated by [intel/auto-round](https://github.com/intel/auto-round), auto-round is needed to run this model
+This model is an int4 model with group_size 128 and asymmetric quantization of [meta-llama/Meta-Llama-3.1-70B-Instruct](https://huggingface.co/meta-llama/Meta-Llama-3.1-70B-Instruct) generated by [intel/auto-round](https://github.com/intel/auto-round); auto-round is needed to run this model. The [symmetric model](https://huggingface.co/OPEA/Meta-Llama-3.1-70B-Instruct-int4-sym-inc) is recommended for better performance.
 
 ## Inference on CPU/HPU/CUDA
 
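The description names the quantization scheme: int4 with group_size 128 and asymmetric (min/max affine) quantization. As a rough illustration of what per-group asymmetric int4 quantization means, here is a toy NumPy sketch; this is not auto-round's actual algorithm (which tunes rounding via sign-gradient descent), the function names are hypothetical, and the float zero-point is a simplification (real int4 kernels typically round it to an integer):

```python
import numpy as np

def quantize_int4_asym(x, group_size=128):
    """Toy per-group asymmetric int4 quantization (illustrative only;
    not auto-round's actual algorithm). Uses a float zero-point for
    simplicity; real int4 kernels typically round it to an integer."""
    g = x.reshape(-1, group_size)
    xmin = g.min(axis=1, keepdims=True)
    xmax = g.max(axis=1, keepdims=True)
    scale = np.maximum((xmax - xmin) / 15.0, 1e-8)  # int4 -> 16 levels (0..15)
    zp = -xmin / scale                              # maps xmin to level 0
    q = np.round(g / scale + zp)                    # values land in [0, 15]
    return q.astype(np.uint8), scale, zp

def dequantize_int4_asym(q, scale, zp):
    return (q.astype(np.float32) - zp) * scale

# Round-trip a random weight vector; the max error is bounded by scale / 2.
rng = np.random.default_rng(0)
w = rng.standard_normal(512).astype(np.float32)
q, scale, zp = quantize_int4_asym(w)
w_hat = dequantize_int4_asym(q, scale, zp).reshape(-1)
print("max abs error:", float(np.abs(w - w_hat).max()))
```

The asymmetric (min/max) variant spends all 16 levels on the observed range of each 128-weight group; a symmetric scheme instead centers the grid at zero, which is what the recommended sym repository uses.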