Meta-DeepHermes-3-Llama-3-8B-Preview-GGUF

Original Model

NousResearch/DeepHermes-3-Llama-3-8B-Preview

Run with Gaianet

Prompt template:

IMPORTANT: To toggle REASONING ON, you must use the following system prompt:

You are a deep thinking AI, you may use extremely long chains of thought to deeply consider the problem and deliberate with yourself via systematic reasoning processes to help come to a correct solution prior to answering. You should enclose your thoughts and internal monologue inside <think> </think> tags, and then provide your solution or response to the problem.

prompt template: llama-3-chat

Context size:

chat_ctx_size: 128000

Run with GaiaNet:

Quantized with llama.cpp b4743

Downloads last month
499
GGUF
Model size
8.03B params
Architecture
llama

2-bit

3-bit

4-bit

5-bit

6-bit

8-bit

16-bit

Inference Providers NEW
This model is not currently available via any of the supported Inference Providers.

Model tree for gaianet/DeepHermes-3-Llama-3-8B-Preview-GGUF

Quantized
(28)
this model