QuantFactory/L3-Rhaenys-8B-GGUF

This is a quantized version of tannedbum/L3-Rhaenys-8B, created using llama.cpp.
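
For quick local testing outside SillyTavern, here is a minimal sketch, assuming llama-cpp-python and huggingface_hub are installed; the quant filename is a placeholder, so pick the actual .gguf you want from the repository's file list.

# Minimal sketch: download one quant from this repo and load it locally.
# Assumptions: llama-cpp-python and huggingface_hub are installed, and the
# filename below is hypothetical -- substitute a real .gguf from the repo.
from huggingface_hub import hf_hub_download
from llama_cpp import Llama

gguf_path = hf_hub_download(
    repo_id="QuantFactory/L3-Rhaenys-8B-GGUF",
    filename="L3-Rhaenys-8B.Q4_K_M.gguf",  # placeholder quant name
)

llm = Llama(model_path=gguf_path, n_ctx=8192)
print(llm("Hello", max_tokens=16)["choices"][0]["text"])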

Original Model Card

This is my farewell model for L3.0. Next, I'm going to wait for Sao10K to break the bank again with a new 3.1 RP base.

SillyTavern

Text Completion presets

temp 0.9
top_k 30
top_p 0.75
min_p 0.2
rep_pen 1.1
smooth_factor 0.25
smooth_curve 1
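
For local backends, the preset above maps roughly onto the usual sampler arguments. Below is a sketch using llama-cpp-python (an assumed backend, not something the card prescribes); note that smooth_factor and smooth_curve are SillyTavern's smoothing sampler and have no direct equivalent in this call.

# Sketch: the Text Completion preset expressed as llama-cpp-python
# sampling arguments (assumes a recent llama-cpp-python that exposes min_p).
# smooth_factor / smooth_curve are applied on the SillyTavern side and are
# not passed through here.
from llama_cpp import Llama

llm = Llama(model_path="L3-Rhaenys-8B.Q4_K_M.gguf", n_ctx=8192)  # placeholder path

out = llm(
    "Write a short in-character reply.",
    max_tokens=256,
    temperature=0.9,
    top_k=30,
    top_p=0.75,
    min_p=0.2,
    repeat_penalty=1.1,
)
print(out["choices"][0]["text"])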

Advanced Formatting

Context & Instruct preset by Virt-io

Instruct Mode: Enabled
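
With the Virt-io presets, Instruct Mode formats each turn with the standard Llama 3 Instruct template that L3-based merges like this expect. For reference, here is a sketch of that template (the stock Llama 3 format, not text extracted from the preset files):

# Sketch of the stock Llama 3 Instruct prompt format that Instruct Mode
# targets for L3-based models. The system/user strings below are only
# illustrative examples.
def llama3_prompt(system: str, user: str) -> str:
    return (
        "<|begin_of_text|><|start_header_id|>system<|end_header_id|>\n\n"
        f"{system}<|eot_id|>"
        "<|start_header_id|>user<|end_header_id|>\n\n"
        f"{user}<|eot_id|>"
        "<|start_header_id|>assistant<|end_header_id|>\n\n"
    )

print(llama3_prompt("You are a roleplay assistant.", "Introduce yourself."))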

Merge

This is a merge of pre-trained language models created using mergekit.

This model was merged using the slerp merge method.
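
In a slerp merge, each pair of corresponding weight tensors is blended by spherical linear interpolation with a factor t (set per layer group in the configurations below). Here is a simplified sketch of the idea; mergekit's real implementation additionally handles per-layer t schedules and other edge cases.

# Simplified sketch of spherical linear interpolation (slerp) between two
# weight tensors, the core idea behind the slerp merge method. This is an
# illustration, not mergekit's actual code.
import numpy as np

def slerp(t: float, a: np.ndarray, b: np.ndarray, eps: float = 1e-8) -> np.ndarray:
    a_flat, b_flat = a.ravel(), b.ravel()
    a_n = a_flat / (np.linalg.norm(a_flat) + eps)
    b_n = b_flat / (np.linalg.norm(b_flat) + eps)
    dot = np.clip(np.dot(a_n, b_n), -1.0, 1.0)
    theta = np.arccos(dot)
    if theta < 1e-4:                 # nearly parallel: fall back to plain lerp
        return (1 - t) * a + t * b
    s = np.sin(theta)
    w = (np.sin((1 - t) * theta) / s) * a_flat + (np.sin(t * theta) / s) * b_flat
    return w.reshape(a.shape)

# Example: blend two toy "weight" tensors at t = 0.4.
blended = slerp(0.4, np.ones((2, 2)), np.full((2, 2), 2.0))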

Models Merged

The following models were included in the merge (per the configurations below):

Sao10K/L3-8B-Niitama-v1
Sao10K/L3-8B-Stheno-v3.2
princeton-nlp/Llama-3-Instruct-8B-SimPO-v0.2

with tannedbum/L3-Niitama-Stheno-8B as the intermediate base for the second stage.

Configuration

The following YAML configurations were used to produce this model. The first merge combines Niitama and Stheno into the intermediate tannedbum/L3-Niitama-Stheno-8B; the second merges that intermediate with SimPO:


slices:
  - sources:
      - model: Sao10K/L3-8B-Niitama-v1
        layer_range: [0, 32]
      - model: Sao10K/L3-8B-Stheno-v3.2
        layer_range: [0, 32]
merge_method: slerp
base_model: Sao10K/L3-8B-Niitama-v1
parameters:
  t:
    - filter: self_attn
      value: [0.2, 0.4, 0.6, 0.2, 0.4]
    - filter: mlp
      value: [0.8, 0.6, 0.4, 0.8, 0.6]
    - value: 0.4
dtype: bfloat16


slices:
  - sources:
      - model: tannedbum/L3-Niitama-Stheno-8B
        layer_range: [0, 32]
      - model: princeton-nlp/Llama-3-Instruct-8B-SimPO-v0.2
        layer_range: [0, 32]
merge_method: slerp
base_model: tannedbum/L3-Niitama-Stheno-8B
parameters:
  t:
    - filter: self_attn
      value: [0.2, 0.4, 0.6, 0.2, 0.4]
    - filter: mlp
      value: [0.8, 0.6, 0.4, 0.8, 0.6]
    - value: 0.4
dtype: bfloat16
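
To reproduce a merge like this, each YAML above would be saved to a file and run through mergekit. Below is a sketch assuming mergekit's Python entry point as used in its example notebook; the file and output names are placeholders, and the mergekit-yaml CLI is equivalent.

# Sketch of running one of the configs above with mergekit, assuming its
# Python entry point (MergeConfiguration / run_merge) as shown in the
# project's example notebook. File and output names are placeholders;
# the equivalent CLI would be: mergekit-yaml rhaenys-stage2.yml ./L3-Rhaenys-8B
import yaml

from mergekit.config import MergeConfiguration
from mergekit.merge import MergeOptions, run_merge

with open("rhaenys-stage2.yml") as f:           # second-stage YAML from above
    merge_config = MergeConfiguration.model_validate(yaml.safe_load(f))

run_merge(
    merge_config,
    "./L3-Rhaenys-8B",                          # output directory
    options=MergeOptions(
        cuda=False,             # set True if a GPU is available
        copy_tokenizer=True,
        lazy_unpickle=True,
        low_cpu_memory=True,
    ),
)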

Want to support my work? My Ko-fi page: https://ko-fi.com/tannedbum

GGUF
Model size: 8.03B params
Architecture: llama
Available quantizations: 2-bit, 3-bit, 4-bit, 5-bit, 6-bit, 8-bit
