reach-vb HF staff commited on
Commit
327f3aa
·
verified ·
1 Parent(s): 098f871

Update app.py

Browse files
Files changed (1) hide show
  1. app.py +0 -42
app.py CHANGED
@@ -86,48 +86,6 @@ def process_model(model_id, q_method, private_repo, oauth_token: gr.OAuthToken |
86
  card.text = dedent(
87
  f"""
88
  # {new_repo_id}
89
- This model was converted to GGUF format from [`{model_id}`](https://huggingface.co/{model_id}) using llama.cpp via the ggml.ai's [GGUF-my-repo](https://huggingface.co/spaces/ggml-org/gguf-my-repo) space.
90
- Refer to the [original model card](https://huggingface.co/{model_id}) for more details on the model.
91
-
92
- ## Use with llama.cpp
93
- Install llama.cpp through brew (works on Mac and Linux)
94
-
95
- ```bash
96
- brew install llama.cpp
97
-
98
- ```
99
- Invoke the llama.cpp server or the CLI.
100
-
101
- ### CLI:
102
- ```bash
103
- llama-cli --hf-repo {new_repo_id} --hf-file {quantized_gguf_name} -p "The meaning to life and the universe is"
104
- ```
105
-
106
- ### Server:
107
- ```bash
108
- llama-server --hf-repo {new_repo_id} --hf-file {quantized_gguf_name} -c 2048
109
- ```
110
-
111
- Note: You can also use this checkpoint directly through the [usage steps](https://github.com/ggerganov/llama.cpp?tab=readme-ov-file#usage) listed in the Llama.cpp repo as well.
112
-
113
- Step 1: Clone llama.cpp from GitHub.
114
- ```
115
- git clone https://github.com/ggerganov/llama.cpp
116
- ```
117
-
118
- Step 2: Move into the llama.cpp folder and build it with `LLAMA_CURL=1` flag along with other hardware-specific flags (for ex: LLAMA_CUDA=1 for Nvidia GPUs on Linux).
119
- ```
120
- cd llama.cpp && LLAMA_CURL=1 make
121
- ```
122
-
123
- Step 3: Run inference through the main binary.
124
- ```
125
- ./llama-cli --hf-repo {new_repo_id} --hf-file {quantized_gguf_name} -p "The meaning to life and the universe is"
126
- ```
127
- or
128
- ```
129
- ./llama-server --hf-repo {new_repo_id} --hf-file {quantized_gguf_name} -c 2048
130
- ```
131
  """
132
  )
133
  card.save(f"README.md")
 
86
  card.text = dedent(
87
  f"""
88
  # {new_repo_id}
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
89
  """
90
  )
91
  card.save(f"README.md")