lucyknada's picture
Upload ./README.md with huggingface_hub
345462d verified
|
raw
history blame
No virus
1.58 kB
---
base_model:
- anthracite-core/magnum-v3-27b-kto-r3
- anthracite-core/magnum-v3-27b-KTO-e1-r2
- anthracite-core/magnum-v3-27b-KTO-e0.25-r1
- IntervitensInc/gemma-2-27b-chatml
library_name: transformers
tags:
- mergekit
- merge
---
### exl2 quant (measurement.json in main branch)
---
### check revisions for quants
---
# output8
This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
## Merge Details
### Merge Method
This model was merged using the [task arithmetic](https://arxiv.org/abs/2212.04089) merge method using [IntervitensInc/gemma-2-27b-chatml](https://huggingface.co/IntervitensInc/gemma-2-27b-chatml) as a base.
### Models Merged
The following models were included in the merge:
* [anthracite-core/magnum-v3-27b-kto-r3](https://huggingface.co/anthracite-core/magnum-v3-27b-kto-r3)
* [anthracite-core/magnum-v3-27b-KTO-e1-r2](https://huggingface.co/anthracite-core/magnum-v3-27b-KTO-e1-r2)
* [anthracite-core/magnum-v3-27b-KTO-e0.25-r1](https://huggingface.co/anthracite-core/magnum-v3-27b-KTO-e0.25-r1)
### Configuration
The following YAML configuration was used to produce this model:
```yaml
base_model: IntervitensInc/gemma-2-27b-chatml
dtype: float32
merge_method: task_arithmetic
models:
- model: IntervitensInc/gemma-2-27b-chatml
- model: anthracite-core/magnum-v3-27b-KTO-e0.25-r1
parameters:
weight: 0.5
- model: anthracite-core/magnum-v3-27b-KTO-e1-r2
parameters:
weight: 0.1
- model: anthracite-core/magnum-v3-27b-kto-r3
parameters:
weight: 0.4
```