llmware
/

llama-3.2-3b-instruct-ov

Model card Files Files and versions Community

doberst commited on Oct 8, 2024

Commit

a9df077

•

1 Parent(s): d5315dd

Update README.md

Files changed (1) hide show

README.md +7 -6

README.md CHANGED Viewed

@@ -3,16 +3,17 @@ license: llama3.2
 inference: false
 tags:
 - green
-- p1
 - llmware-chat
 - ov
 ---
-# llama-3.2-1b-instruct-ov
-**llama-3.2-1b-instruct-ov** is an OpenVino int4 quantized version of Llama 3.2 1B Instruct, providing a very small, very fast inference implementation, optimized for AI PCs using Intel GPU, CPU and NPU.
-[**llama-3.2-1b-instruct**](https://huggingface.co/meta-llama/Llama-3.2-1B-Instruct) is a new 1B chat foundation model from Meta.
 ### Model Description
@@ -20,8 +21,8 @@ tags:
 - **Developed by:** meta-llama
 - **Quantized by:** llmware
 - **Model type:** llama-3.2
-- **Parameters:** 1 billion
-- **Model Parent:** meta-llama/Meta-Llama-3.2-1B-Instruct
 - **Language(s) (NLP):** English
 - **License:** Llama 3.2 Community License
 - **Uses:** General chat use cases

 inference: false
 tags:
 - green
+- p3
 - llmware-chat
 - ov
+- emerald
 ---
+# llama-3.2-3b-instruct-ov
+**llama-3.2-3b-instruct-ov** is an OpenVino int4 quantized version of Llama 3.2 3B Instruct, providing a very small, very fast inference implementation, optimized for AI PCs using Intel GPU, CPU and NPU.
+[**llama-3.2-3b-instruct**](https://huggingface.co/meta-llama/Llama-3.2-3B-Instruct) is a new 3B chat foundation model from Meta.
 ### Model Description
 - **Developed by:** meta-llama
 - **Quantized by:** llmware
 - **Model type:** llama-3.2
+- **Parameters:** 3 billion
+- **Model Parent:** meta-llama/Meta-Llama-3.2-3B-Instruct
 - **Language(s) (NLP):** English
 - **License:** Llama 3.2 Community License
 - **Uses:** General chat use cases