Information

Attempt at extending context window for an older Mistral-v0.1 model.

It seems to work fine at 16K.

ChatML and Alpaca work.

Irene-RP-v4-7B

This is a merge of pre-trained language models created using mergekit.

Merge Details

Merge Method

This model was merged using the SLERP merge method.

Models Merged

The following models were included in the merge:

Configuration

The following YAML configuration was used to produce this model:

slices:
  - sources:
      - model: Virt-io/Helen-v1_7B
        layer_range: [0, 32]
      - model: Virt-io/Irene-RP-v3-7B
        layer_range: [0, 32]
merge_method: slerp
base_model: Virt-io/Helen-v1_7B
parameters:
  t:
    - filter: self_attn
      value: [0.25, 0.45, 0.50, 0.20, 0.25]
    - filter: mlp
      value: [0.35, 0.45, 0.55, 0.20, 0.25]
    - value: 0.25 # fallback for rest of tensors
dtype: float16
Downloads last month
4
Safetensors
Model size
7.24B params
Tensor type
FP16
·
Inference Providers NEW
This model is not currently available via any of the supported third-party Inference Providers, and the model is not deployed on the HF Inference API.

Model tree for Virt-io/Irene-RP-v4-7B

Quantizations
1 model