limcheekin commited on
Commit
877e768
1 Parent(s): 55fe9af

feat: updated to notus-7B-v1-GGUF

Browse files
Files changed (4) hide show
  1. Dockerfile +1 -1
  2. README.md +5 -5
  3. index.html +9 -17
  4. zephyr-7b.ipynb +0 -0
Dockerfile CHANGED
@@ -15,7 +15,7 @@ RUN pip install -U pip setuptools wheel && \
15
 
16
  # Download model
17
  RUN mkdir model && \
18
- curl -L https://huggingface.co/TheBloke/zephyr-7B-beta-GGUF/resolve/main/zephyr-7b-beta.Q4_K_M.gguf -o model/gguf-model.bin
19
 
20
  COPY ./start_server.sh ./
21
  COPY ./main.py ./
 
15
 
16
  # Download model
17
  RUN mkdir model && \
18
+ curl -L https://huggingface.co/TheBloke/notus-7B-v1-GGUF/resolve/main/notus-7b-v1.Q6_K.gguf -o model/gguf-model.bin
19
 
20
  COPY ./start_server.sh ./
21
  COPY ./main.py ./
README.md CHANGED
@@ -1,20 +1,20 @@
1
  ---
2
- title: zephyr-7B-beta-GGUF (Q4_K_M)
3
  colorFrom: purple
4
  colorTo: blue
5
  sdk: docker
6
  models:
7
- - HuggingFaceH4/zephyr-7b-beta
8
- - TheBloke/zephyr-7B-beta-GGUF
9
  tags:
10
  - inference api
11
  - openai-api compatible
12
  - llama-cpp-python
13
- - zephyr-7B-beta-GGUF
14
  - gguf
15
  pinned: false
16
  ---
17
 
18
- # zephyr-7B-beta-GGUF (Q4_K_M)
19
 
20
  Please refer to the [index.html](index.html) for more information.
 
1
  ---
2
+ title: notus-7B-v1-GGUF (Q6_K)
3
  colorFrom: purple
4
  colorTo: blue
5
  sdk: docker
6
  models:
7
+ - argilla/notus-7b-v1
8
+ - TheBloke/notus-7B-v1-GGUF
9
  tags:
10
  - inference api
11
  - openai-api compatible
12
  - llama-cpp-python
13
+ - notus-7B-v1-GGUF
14
  - gguf
15
  pinned: false
16
  ---
17
 
18
+ # notus-7B-v1-GGUF (Q6_K)
19
 
20
  Please refer to the [index.html](index.html) for more information.
index.html CHANGED
@@ -1,13 +1,15 @@
1
  <!DOCTYPE html>
2
  <html>
3
  <head>
4
- <title>zephyr-7B-beta-GGUF (Q4_K_M)</title>
5
  </head>
6
  <body>
7
- <h1>zephyr-7B-beta-GGUF (Q4_K_M)</h1>
8
  <p>
9
  With the utilization of the
10
- <a href="https://github.com/abetlen/llama-cpp-python">llama-cpp-python</a>
 
 
11
  package, we are excited to introduce the GGUF model hosted in the Hugging
12
  Face Docker Spaces, made accessible through an OpenAI-compatible API. This
13
  space includes comprehensive API documentation to facilitate seamless
@@ -16,27 +18,17 @@
16
  <ul>
17
  <li>
18
  The API endpoint:
19
- <a href="https://limcheekin-zephyr-7b-beta-gguf.hf.space/v1"
20
- >https://limcheekin-zephyr-7b-beta-gguf.hf.space/v1</a
21
  >
22
  </li>
23
  <li>
24
  The API doc:
25
- <a href="https://limcheekin-zephyr-7b-beta-gguf.hf.space/docs"
26
- >https://limcheekin-zephyr-7b-beta-gguf.hf.space/docs</a
27
  >
28
  </li>
29
  </ul>
30
- <p>
31
- Go ahead and try it out the API endpoint yourself with the
32
- <a
33
- href="https://huggingface.co/spaces/limcheekin/zephyr-7B-beta-GGUF/blob/main/zephyr-7b.ipynb"
34
- target="_blank"
35
- >
36
- zephyr-7b.ipynb</a
37
- >
38
- jupyter notebook.
39
- </p>
40
  <p>
41
  If you find this resource valuable, your support in the form of starring
42
  the space would be greatly appreciated. Your engagement plays a vital role
 
1
  <!DOCTYPE html>
2
  <html>
3
  <head>
4
+ <title>notus-7B-v1-GGUF (Q6_K)</title>
5
  </head>
6
  <body>
7
+ <h1>notus-7B-v1-GGUF (Q6_K)</h1>
8
  <p>
9
  With the utilization of the
10
+ <a href="https://github.com/abetlen/llama-cpp-python" target="_blank"
11
+ >llama-cpp-python</a
12
+ >
13
  package, we are excited to introduce the GGUF model hosted in the Hugging
14
  Face Docker Spaces, made accessible through an OpenAI-compatible API. This
15
  space includes comprehensive API documentation to facilitate seamless
 
18
  <ul>
19
  <li>
20
  The API endpoint:
21
+ <a href="https://limcheekin-notus-7b-v1-gguf.hf.space/v1"
22
+ >https://limcheekin-notus-7b-v1-gguf.hf.space/v1</a
23
  >
24
  </li>
25
  <li>
26
  The API doc:
27
+ <a href="https://limcheekin-notus-7b-v1-gguf.hf.space/docs"
28
+ >https://limcheekin-notus-7b-v1-gguf.hf.space/docs</a
29
  >
30
  </li>
31
  </ul>
 
 
 
 
 
 
 
 
 
 
32
  <p>
33
  If you find this resource valuable, your support in the form of starring
34
  the space would be greatly appreciated. Your engagement plays a vital role
zephyr-7b.ipynb DELETED
The diff for this file is too large to render. See raw diff