JustinLin610 committed • Commit d448a78 • Parent(s): 25e938c
Update README.md

README.md CHANGED
@@ -39,10 +39,6 @@ Cloning the repo may be inefficient, and thus you can manually download the GGUF
 huggingface-cli download Qwen/Qwen2-0.5B-Instruct-GGUF qwen2-0.5b-instruct-q5_k_m.gguf --local-dir . --local-dir-use-symlinks False
 ```
 
-With the upgrade of APIs of llama.cpp, `llama-gguf-split` is equivalent to the previous `gguf-split`.
-For the arguments of this command, the first is the path to the first split GGUF file, and the second is the path to the output GGUF file.
-
-
 To run Qwen2, you can use `llama-cli` (the previous `main`) or `llama-server` (the previous `server`).
 We recommend using the `llama-server` as it is simple and compatible with OpenAI API. For example:
 
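The `llama-server` line in the hunk above refers to llama.cpp's OpenAI-compatible HTTP server. As a minimal sketch, this is roughly the chat-completions payload such a server accepts; the port, endpoint path, and model name here are assumptions for illustration, not part of the commit:

```python
import json

# Assumed endpoint for llama-server's OpenAI-compatible API; adjust host,
# port, and path to your own setup.
BASE_URL = "http://localhost:8080/v1/chat/completions"

# OpenAI-style chat payload; the model name is hypothetical.
payload = {
    "model": "qwen2-0.5b-instruct",
    "messages": [
        {"role": "system", "content": "You are a helpful assistant."},
        {"role": "user", "content": "Hello!"},
    ],
    "temperature": 0.7,
}

# Serialize to the JSON request body that would be POSTed to BASE_URL.
body = json.dumps(payload)
print(body)
```

In practice this body would be sent with any HTTP client (or the OpenAI SDK pointed at the local base URL), which is what makes the server "compatible with OpenAI API" as the README says.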