--- base_model: - princeton-nlp/gemma-2-9b-it-SimPO - TheDrummer/Gemmasutra-9B-v1 library_name: transformers tags: - mergekit - merge --- # Ellaria-9B This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit). ## Merge Details ### Merge Method This model was merged using the SLERP merge method. ### Models Merged The following models were included in the merge: * [princeton-nlp/gemma-2-9b-it-SimPO](https://huggingface.co/princeton-nlp/gemma-2-9b-it-SimPO) * [TheDrummer/Gemmasutra-9B-v1](https://huggingface.co/TheDrummer/Gemmasutra-9B-v1) ### Configuration The following YAML configuration was used to produce this model: ```yaml slices: - sources: - model: TheDrummer/Gemmasutra-9B-v1 layer_range: [0, 42] - model: princeton-nlp/gemma-2-9b-it-SimPO layer_range: [0, 42] merge_method: slerp base_model: TheDrummer/Gemmasutra-9B-v1 parameters: t: - filter: self_attn value: [0.2, 0.4, 0.6, 0.2, 0.4] - filter: mlp value: [0.8, 0.6, 0.4, 0.8, 0.6] - value: 0.4 dtype: bfloat16 ```