Refact-1.6B-fim-GGUF

Model creator: Small Magellanic Cloud AI
Original model: Refact-1.6B

Description

This repository contains quantized GGUF format model files for Refact-1.6B.

Prompt: fill in the middle

<fim_prefix>def print_hello_world():\n    """<fim_suffix>\n    print("Hello world!")<fim_middle>
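The template above can also be assembled programmatically. The following is a minimal sketch, assuming the three special tokens are concatenated exactly as shown in the example prompt; the helper name `build_fim_prompt` is hypothetical and not part of the model or of llama.cpp.

```python
# Special tokens taken from the fill-in-the-middle prompt shown in this card.
FIM_PREFIX = "<fim_prefix>"
FIM_SUFFIX = "<fim_suffix>"
FIM_MIDDLE = "<fim_middle>"


def build_fim_prompt(prefix: str, suffix: str) -> str:
    """Hypothetical helper: wrap the code before and after the gap in FIM tokens."""
    return f"{FIM_PREFIX}{prefix}{FIM_SUFFIX}{suffix}{FIM_MIDDLE}"


# Same example as in the card, with real newlines instead of the literal "\n".
prompt = build_fim_prompt(
    'def print_hello_world():\n    """',
    '\n    print("Hello world!")',
)
print(prompt)
```

The model is then expected to generate the text that belongs between the prefix and the suffix, here the docstring body.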

Prompt: chat (experimental)

<empty_output>SYSTEM You are a programming assistant
<empty_output>USER How do I sort a list in Python?
<empty_output>ASSISTANT
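
A chat prompt in this layout could be built from a list of turns. This is a sketch only: it assumes each turn is `<empty_output>`, an upper-case role name, a space, and the message, with turns separated by newlines as in the example above. The card marks the chat format as experimental, and the function name `build_chat_prompt` is hypothetical.

```python
def build_chat_prompt(messages):
    """Hypothetical helper: format (role, content) pairs in the chat layout above.

    Assumes one turn per line and a trailing open ASSISTANT turn that the
    model is expected to complete.
    """
    lines = [f"<empty_output>{role.upper()} {content}" for role, content in messages]
    lines.append("<empty_output>ASSISTANT")
    return "\n".join(lines)


chat_prompt = build_chat_prompt([
    ("system", "You are a programming assistant"),
    ("user", "How do I sort a list in Python?"),
])
print(chat_prompt)
```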

Example `llama.cpp` command

./main -m refact-1_6b-Q4_K_M.gguf -c 4096 -n -1 -p '<fim_prefix>{prefix}<fim_suffix>{suffix}<fim_middle>'

For other parameters and how to use them, refer to the llama.cpp documentation.
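
When the command is launched from a script, passing the prompt as a separate argument avoids shell-quoting problems with quotes and newlines in the code. The sketch below only builds the argument list and uses only the flags that appear in the example command (`-m`, `-c`, `-n`, `-p`); the function name `llama_cpp_args` and the default binary path `./main` are assumptions for illustration.

```python
def llama_cpp_args(model_path, prompt, ctx_size=4096, n_predict=-1, binary="./main"):
    """Hypothetical helper: build the argv list for the example command above."""
    return [
        binary,
        "-m", model_path,       # GGUF model file
        "-c", str(ctx_size),    # context size, as in the example command
        "-n", str(n_predict),   # number of tokens to predict (-1 in the example)
        "-p", prompt,           # prompt passed verbatim, no shell quoting needed
    ]


args = llama_cpp_args(
    "refact-1_6b-Q4_K_M.gguf",
    "<fim_prefix>def add(a, b):\n    <fim_suffix>\n<fim_middle>",
)
print(args)
```

The resulting list can be handed to `subprocess.run(args, check=True)`, assuming a built llama.cpp binary is present at the given path.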

Format: GGUF
Model size: 1.59B params
Architecture: refact
Quantizations: 2-bit, 3-bit, 4-bit, 5-bit, 6-bit, 8-bit

Model tree for oblivious/Refact-1.6B-fim-GGUF

Base model: smallcloudai/Refact-1_6B-fim
Quantized (5): this model
