limcheekin's picture
chore: updated models and Q8_0 to F16
a525bd4
raw
history blame
1.29 kB
<!DOCTYPE html>
<html>
<head>
<title>replit-code-v1_5-3b-GGUF (F16)</title>
</head>
<body>
<h1>replit-code-v1_5-3b-GGUF (F16)</h1>
<p>
With the utilization of the
<a href="https://github.com/abetlen/llama-cpp-python">llama-cpp-python</a>
package, we are excited to introduce the GGUF model hosted in the Hugging
Face Docker Spaces, made accessible through an OpenAI-compatible API. This
space includes comprehensive API documentation to facilitate seamless
integration.
</p>
<ul>
<li>
The API endpoint:
<a href="https://limcheekin-replit-code-v1-5-3b-gguf.hf.space/v1"
>https://limcheekin-replit-code-v1-5-3b-gguf.hf.space/v1</a
>
</li>
<li>
The API doc:
<a href="https://limcheekin-replit-code-v1-5-3b-gguf.hf.space/docs"
>https://limcheekin-replit-code-v1-5-3b-gguf.hf.space/docs</a
>
</li>
</ul>
<p>
If you find this resource valuable, your support in the form of starring
the space would be greatly appreciated. Your engagement plays a vital role
in furthering the application for a community GPU grant, ultimately
enhancing the capabilities and accessibility of this space.
</p>
</body>
</html>