internlm
/

internlm2_5-20b-chat

Text Generation

feature-extraction

Model card Files Files and versions Community

RangiLyu commited on Aug 6, 2024

Commit

ef17bde

·

verified ·

1 Parent(s): 35f8c15

Update README.md

Files changed (1) hide show

README.md +4 -0

README.md CHANGED Viewed

@@ -95,6 +95,10 @@ for response, history in model.stream_chat(tokenizer, "Hello", history=[]):
 ## Deployment
 ### LMDeploy
 LMDeploy is a toolkit for compressing, deploying, and serving LLM, developed by the MMRazor and MMDeploy teams.

 ## Deployment
+### llama.cpp
+[internlm/internlm2_5-20b-chat-gguf](https://huggingface.co/internlm/internlm2_5-20b-chat-gguf) offers `internlm2_5-20b-chat` models in GGUF format in both half precision and various low-bit quantized versions, including `q5_0`, `q5_k_m`, `q6_k`, and `q8_0`.
 ### LMDeploy
 LMDeploy is a toolkit for compressing, deploying, and serving LLM, developed by the MMRazor and MMDeploy teams.