Starling-LM-7B-alpha-GGUF

Runtime error

limcheekin commited on Dec 10, 2023

Commit

a6fdbf5

1 Parent(s): 7fca658

feat: updated to Starling-LM-7B-alpha-GGUF model

Files changed (3) hide show

Dockerfile CHANGED Viewed

@@ -15,7 +15,7 @@ RUN pip install -U pip setuptools wheel && \
 # Download model
 RUN mkdir model && \
-    curl -L https://huggingface.co/TheBloke/openchat_3.5-GGUF/resolve/main/openchat_3.5.Q4_K_M.gguf -o model/gguf-model.bin
 COPY ./start_server.sh ./
 COPY ./main.py ./

 # Download model
 RUN mkdir model && \
+    curl -L https://huggingface.co/TheBloke/Starling-LM-7B-alpha-GGUF/resolve/main/starling-lm-7b-alpha.Q6_K.gguf -o model/gguf-model.bin
 COPY ./start_server.sh ./
 COPY ./main.py ./

README.md CHANGED Viewed

@@ -1,20 +1,20 @@
 ---
-title: openchat_3.5-GGUF (Q4_K_M)
 colorFrom: purple
 colorTo: blue
 sdk: docker
 models:
-  - openchat/openchat_3.5
-  - TheBloke/openchat_3.5-GGUF
 tags:
   - inference api
   - openai-api compatible
   - llama-cpp-python
-  - openchat_3.5-GGUF
   - gguf
 pinned: false
 ---
-# openchat_3.5-GGUF (Q4_K_M)
 Please refer to the [index.html](index.html) for more information.

 ---
+title: Starling-LM-7B-alpha-GGUF (Q6_K)
 colorFrom: purple
 colorTo: blue
 sdk: docker
 models:
+  - berkeley-nest/Starling-LM-7B-alpha
+  - TheBloke/Starling-LM-7B-alpha-GGUF
 tags:
   - inference api
   - openai-api compatible
   - llama-cpp-python
+  - Starling-LM-7B-alpha-GGUF
   - gguf
 pinned: false
 ---
+# Starling-LM-7B-alpha-GGUF (Q6_K)
 Please refer to the [index.html](index.html) for more information.

index.html CHANGED Viewed

@@ -1,10 +1,10 @@
 <!DOCTYPE html>
 <html>
   <head>
-    <title>openchat_3.5-GGUF (Q4_K_M)</title>
   </head>
   <body>
-    <h1>openchat_3.5-GGUF (Q4_K_M)</h1>
     <p>
       With the utilization of the
       <a href="https://github.com/abetlen/llama-cpp-python">llama-cpp-python</a>
@@ -16,14 +16,14 @@
     <ul>
       <li>
         The API endpoint:
-        <a href="https://limcheekin-openchat-3-5-gguf.hf.space/v1"
-          >https://limcheekin-openchat-3-5-gguf.hf.space/v1</a
         >
       </li>
       <li>
         The API doc:
-        <a href="https://limcheekin-openchat-3-5-gguf.hf.space/docs"
-          >https://limcheekin-openchat-3-5-gguf.hf.space/docs</a
         >
       </li>
     </ul>

 <!DOCTYPE html>
 <html>
   <head>
+    <title>Starling-LM-7B-alpha-GGUF (Q6_K)</title>
   </head>
   <body>
+    <h1>Starling-LM-7B-alpha-GGUF (Q6_K)</h1>
     <p>
       With the utilization of the
       <a href="https://github.com/abetlen/llama-cpp-python">llama-cpp-python</a>
     <ul>
       <li>
         The API endpoint:
+        <a href="https://limcheekin-starling-lm-7b-alpha-gguf.hf.space/v1"
+          >https://limcheekin-starling-lm-7b-alpha-gguf.hf.space/v1</a
         >
       </li>
       <li>
         The API doc:
+        <a href="https://limcheekin-starling-lm-7b-alpha-gguf.hf.space/docs"
+          >https://limcheekin-starling-lm-7b-alpha-gguf.hf.space/docs</a
         >
       </li>
     </ul>