jartine committed
Commit de1686b
1 Parent(s): 3048f7d

Update README.md

Files changed (1):
  README.md +2 -1
README.md CHANGED
@@ -265,7 +265,8 @@ Command template:
  The maximum context size of this model is 8192 tokens. These llamafiles
  use a default context size of 512 tokens. Whenever you need the maximum
  context size to be available with llamafile for any given model, you can
- pass the `-c 0` flag.
+ pass the `-c 0` flag. The default temperature for these llamafiles is 0.
+ It can be changed, e.g. `--temp 0.8`.

  ## About Quantization
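
As a quick illustration of the two flags this change documents, an invocation might look like the sketch below, assuming the llama.cpp-style CLI that llamafiles expose. The filename is a placeholder for whichever granite llamafile you downloaded, and `-p` (a standard llamafile prompt flag) is included only to make the example self-contained:

```sh
# Sketch: run a llamafile with the model's full context window (-c 0)
# and a higher sampling temperature (--temp 0.8).
# "granite.llamafile" is a placeholder filename, not the actual release name.
chmod +x granite.llamafile
./granite.llamafile -c 0 --temp 0.8 -p "Write a hello world program in C."
```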