Mozilla/Meta-Llama-3.1-405B-Instruct-llamafile

jartine committed on Jul 27, 2024

Commit 9d3fb81 • 1 Parent(s): eb8b832

Update README.md

Files changed (1)

README.md +10 -0

@@ -70,6 +70,16 @@ full 128k size. See our
 repository for llamafiles that are known to work with a 128kb context
 size.
+On Windows there's a 4GB limit on executable sizes. You can work around
+that by downloading the [official llamafile
+release](https://github.com/Mozilla-Ocho/llamafile/releases) binary,
+renaming it to have a .exe extension, and then passing the llamafiles in
+this repo via the `-m` flag as though they were GGUF weights, e.g.
+```
+.\llamafile-0.8.11.exe -m Meta-Llama-3.1-405B.Q2_K.llamafile
+```
 On GPUs with sufficient RAM, the `-ngl 999` flag may be passed to use
 the system's NVIDIA or AMD GPU(s). On Windows, only the graphics card
 driver needs to be installed. If the prebuilt DSOs should fail, the CUDA
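
The Windows workaround added in this commit can be sketched as a small helper that assembles the command line. This is only an illustration, not part of llamafile: the function names `exceeds_windows_limit` and `build_command` are hypothetical, and "4GB" is assumed here to mean 4 GiB, which the README does not specify. The `-m` and `-ngl` flags are the ones named in the README text.

```python
from typing import List, Optional

# Assumption: the README's "4GB" is read as 4 GiB (4 * 1024**3 bytes).
WINDOWS_EXE_LIMIT = 4 * 1024**3


def exceeds_windows_limit(size_bytes: int) -> bool:
    """Return True if a file this large is over the assumed Windows executable limit."""
    return size_bytes > WINDOWS_EXE_LIMIT


def build_command(runner: str, weights: str, gpu_layers: Optional[int] = None) -> List[str]:
    """Build an argv list that passes a llamafile to the release binary via -m.

    runner:     file name of the official release binary (hypothetical parameter name)
    weights:    a llamafile from this repo, treated as though it were GGUF weights
    gpu_layers: optional value for -ngl, e.g. 999 as suggested in the README
    """
    # The release binary must carry a .exe extension to run on Windows.
    exe = runner if runner.lower().endswith(".exe") else runner + ".exe"
    argv = [exe, "-m", weights]
    if gpu_layers is not None:
        argv += ["-ngl", str(gpu_layers)]
    return argv
```

For example, `build_command("llamafile-0.8.11", "Meta-Llama-3.1-405B.Q2_K.llamafile")` yields the same three arguments as the command shown in the added README lines, and the resulting list could be handed to `subprocess.run` on a Windows machine.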