Edit model card

bigmix

This is a merge of pre-trained language models created using mergekit.

Merge Details

Merge Method

This model was merged using the task arithmetic merge method using jeiku/Rosa_v1_3B as a base.

Models Merged

The following models were included in the merge:

Configuration

The following YAML configuration was used to produce this model:

merge_method: task_arithmetic
base_model: jeiku/Rosa_v1_3B
parameters:
  normalize: true
models:
  - model: jeiku/Rosa_v1_3B+jeiku/No_Robots_Alpaca_StableLM
    parameters:
      weight: 0.5
  - model: jeiku/Rosa_v1_3B+jeiku/Toxic_DPO_StableLM
    parameters:
      weight: 0.5
  - model: jeiku/Rosa_v1_3B+jeiku/Alpaca_128_StableLM
    parameters:
      weight: 0.4
  - model: jeiku/Rosa_v1_3B+jeiku/Everything_v3_128_StableLM
    parameters:
      weight: 0.4
  - model: jeiku/Rosa_v1_3B+jeiku/Gnosis_256_StableLM
    parameters:
      weight: 1
  - model: jeiku/Rosa_v1_3B+jeiku/Theory_of_Mind_128_StableLM
    parameters:
      weight: 0.8
  - model: jeiku/Rosa_v1_3B+jeiku/PIPPA_128_StableLM
    parameters:
      weight: 0.4
  - model: jeiku/Rosa_v1_3B+jeiku/LimaRP_StableLM
    parameters:
      weight: 0.7
  - model: jeiku/Rosa_v1_3B+jeiku/Theory_of_Mind_RP_128_StableLM
    parameters:
      weight: 0.6
  - model: jeiku/Rosa_v1_3B+jeiku/Bluemoon_cleaned_StableLM
    parameters:
      weight: 0.8
  - model: jeiku/Rosa_v1_3B+jeiku/RPGPT_StableLM
    parameters:
      weight: 0.4
dtype: float16
Downloads last month
49
GGUF
Model size
2.8B params
Architecture
stablelm

2-bit

3-bit

4-bit

5-bit

6-bit

16-bit

Inference API
Unable to determine this model's library. Check the docs .

Model tree for jeiku/Tofu_3B_GGUF