georgesung committed on
Commit
b42db76
1 Parent(s): a890433

Update README.md

  1. README.md +27 -0
README.md CHANGED
@@ -40,3 +40,30 @@ python train.py configs/llama3_8b_chat_uncensored.yaml
 
  # Fine-tuning guide
  https://georgesung.github.io/ai/qlora-ift/
+
+ # Ollama inference
+ First, install [Ollama](https://ollama.com/). Based on instructions [here](https://github.com/ollama/ollama/blob/main/README.md#import-from-gguf), run the following:
+ ```
+ cd $MODEL_DIR_OF_CHOICE
+ wget https://huggingface.co/georgesung/llama3_8b_chat_uncensored/resolve/main/llama3_8b_chat_uncensored_q4_0.gguf
+ ```
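
A quick sanity check after the download can catch a truncated file or an HTML error page saved under the `.gguf` name. The following is a minimal sketch, assuming a shell where `head -c` is available. The function name `is_gguf` is hypothetical and not part of the commit; the only fact it relies on is that GGUF files begin with the four ASCII bytes `GGUF`.

```shell
# Hypothetical helper (not from the README): succeed only if the file
# exists and starts with the GGUF magic bytes.
is_gguf() {
    # Read the first four bytes; a GGUF file begins with the ASCII string "GGUF".
    [ -f "$1" ] && [ "$(head -c 4 "$1" 2>/dev/null)" = "GGUF" ]
}
```

For example, `is_gguf llama3_8b_chat_uncensored_q4_0.gguf && echo "looks like a GGUF file"` prints the message only when the header matches. This checks the file header only, not the integrity of the whole file.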
+
+ Create a file called `llama3-uncensored.modelfile` with the following:
+ ```
+ FROM ./llama3_8b_chat_uncensored_q4_0.gguf
+ TEMPLATE """{{ .System }}
+
+ ### HUMAN:
+ {{ .Prompt }}
+
+ ### RESPONSE:
+ """
+ PARAMETER stop "### HUMAN:"
+ PARAMETER stop "### RESPONSE:"
+ ```
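
To make the `TEMPLATE` above concrete, the sketch below prints the text the model would see for a given system message and user prompt. It is an approximation written in plain shell, not Ollama's own template engine, and the function name `render_prompt` is hypothetical. Edge cases such as an empty system message may be handled differently by Ollama.

```shell
# Hypothetical helper (not from the README): approximate the prompt text
# produced by the TEMPLATE in llama3-uncensored.modelfile.
render_prompt() {
    # $1 = system message, $2 = user prompt
    printf '%s\n\n### HUMAN:\n%s\n\n### RESPONSE:\n' "$1" "$2"
}
```

Calling `render_prompt "Be brief." "Hi"` prints the system message, a blank line, the `### HUMAN:` block with the prompt, and a trailing `### RESPONSE:` header for the model to complete. The two `PARAMETER stop` lines end generation as soon as the model emits either header itself, which keeps it from writing further turns of the conversation on its own.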
+
+ Then run:
+ ```
+ ollama create llama3-uncensored -f llama3-uncensored.modelfile
+ ollama run llama3-uncensored
+ ```
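
Besides the interactive `ollama run` session, the created model can also be queried over Ollama's local HTTP API. The following is a rough sketch, assuming the Ollama server is running at its default address `localhost:11434` and that `curl` is installed. The helper names `generate_payload` and `ask_model` are hypothetical, and the payload builder does no JSON escaping.

```shell
# Hypothetical helper: build a request body for Ollama's /api/generate endpoint.
# Assumption: the prompt contains no double quotes, backslashes, or newlines,
# because this sketch does not escape JSON.
generate_payload() {
    # $1 = model name, $2 = prompt
    printf '{"model": "%s", "prompt": "%s", "stream": false}' "$1" "$2"
}

# Hypothetical helper: send one prompt to the model created above.
# Assumption: a local Ollama server is listening on the default port 11434.
ask_model() {
    generate_payload "llama3-uncensored" "$1" |
        curl -s http://localhost:11434/api/generate -d @-
}
```

With the server running, `ask_model "Why is the sky blue?"` returns a single JSON object whose `response` field holds the generated text. Setting `"stream": false` requests one complete reply instead of a sequence of partial chunks.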