aashish1904's picture
Upload README.md with huggingface_hub
1880798 verified
|
raw
history blame
1.69 kB
metadata
base_model:
  - nbeerbower/mistral-nemo-gutenberg-12B-v4
  - NeverSleep/Lumimaid-v0.2-12B
library_name: transformers
tags:
  - mergekit
  - merge

QuantFactory Banner

QuantFactory/NarraThinker12B-GGUF

This is quantized version of ClaudioItaly/NarraThinker12B created using llama.cpp

Original Model Card

merge

This is a merge of pre-trained language models created using mergekit.

Merge Details

Merge Method

This model was merged using the SLERP merge method.

Models Merged

The following models were included in the merge:

Configuration

The following YAML configuration was used to produce this model:

models:
  - model: NeverSleep/Lumimaid-v0.2-12B
  - model: nbeerbower/mistral-nemo-gutenberg-12B-v4
merge_method: slerp
base_model: nbeerbower/mistral-nemo-gutenberg-12B-v4
dtype: bfloat16
parameters:
  t: [0.1, 0.1, 0.1, 0.1, 0.1, 0.1, 0.2, 0.2, 0.2, 0.3, 0.3, 0.4, 0.5, 0.6, 0.7, 0.8, 0.9, 0.9, 0.9, 0.9, 0.9, 0.9, 0.9]
  layers: [0, 5, 10, 15, 20, 25, 30, 35, 40, 45, 50, 55, 60, 65, 70, 75, 80, 85, 90, 95, 100, 105, 110]
tokenizer_merge_method: slerp
tokenizer_parameters:
  t: 0.2