Update README.md
README.md (CHANGED):
````diff
@@ -63,12 +63,13 @@ wget https://huggingface.co/xtuner/llava-phi-3-mini-gguf/resolve/main/llava-phi-
 
 # int4 llm
 wget https://huggingface.co/xtuner/llava-phi-3-mini-gguf/resolve/main/llava-phi-3-mini-int4.gguf
-```
 
-
+# (optional) ollama fp16 modelfile
+wget https://huggingface.co/xtuner/llava-phi-3-mini-gguf/resolve/main/OLLAMA_MODELFILE_F16
 
-
-
+# (optional) ollama int4 modelfile
+wget https://huggingface.co/xtuner/llava-phi-3-mini-gguf/resolve/main/OLLAMA_MODELFILE_INT4
+```
 
 ### Chat by `ollama`
 
````
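The new `OLLAMA_MODELFILE_F16` / `OLLAMA_MODELFILE_INT4` downloads are meant to feed `ollama create`. A minimal sketch of that workflow is below; the model names `llava-phi3-f16` and `llava-phi3-int4` are assumptions taken from the `ollama run llava-phi3-int4` context line, not something this change specifies.

```bash
# Register the models with ollama from the downloaded modelfiles
# (the model names here are assumptions; any names work as long as `ollama run` matches).
ollama create llava-phi3-f16 -f ./OLLAMA_MODELFILE_F16
ollama create llava-phi3-int4 -f ./OLLAMA_MODELFILE_INT4

# Then chat as the README shows, e.g.
ollama run llava-phi3-int4 "xx.png Describe this image"
```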
````diff
@@ -86,6 +87,9 @@ ollama run llava-phi3-int4 "xx.png Describe this image"
 
 ### Chat by `./llava-cli`
 
+1. Build [llama.cpp](https://github.com/ggerganov/llama.cpp) ([docs](https://github.com/ggerganov/llama.cpp?tab=readme-ov-file#usage)).
+2. Build `./llava-cli` ([docs](https://github.com/ggerganov/llama.cpp/tree/master/examples/llava#usage)).
+
 Note: llava-phi-3-mini uses the `Phi-3-instruct` chat template.
 
 ```bash
````
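For reference, a `./llava-cli` call using the downloaded files with the Phi-3-instruct chat template would look roughly like the sketch below; the flags are standard llama.cpp `llava-cli` options, but the mmproj filename, context size, and prompt wording are assumptions rather than content of this change.

```bash
# Sketch of a llava-cli invocation (filenames and prompt are assumptions)
./llava-cli \
    -m ./llava-phi-3-mini-int4.gguf \
    --mmproj ./llava-phi-3-mini-mmproj-f16.gguf \
    --image ./xx.png \
    -c 4096 \
    -e -p "<|user|>\n<image>\nDescribe this image.<|end|>\n<|assistant|>\n"
```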
|