---
base_model: []
library_name: transformers
tags:
- mergekit
- merge
---
# Nemomix-v0.4-12B

This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).

## Merge Details

### Merge Method

This model was merged using the della_linear merge method using F:\mergekit\mistralaiMistral-Nemo-Base-2407 as a base.

### Models Merged

The following models were included in the merge:

* F:\mergekit\intervitens_mini-magnum-12b-v1.1
* F:\mergekit\mistralaiMistral-Nemo-Instruct-2407
* F:\mergekit\invisietch_Atlantis-v0.1-12B
* F:\mergekit\NeverSleepHistorical_lumi-nemo-e2.0

### Configuration

The following YAML configuration was used to produce this model:

```yaml
models:
  - model: F:\mergekit\invisietch_Atlantis-v0.1-12B
    parameters:
      weight: 0.16
      density: 0.4
  - model: F:\mergekit\mistralaiMistral-Nemo-Instruct-2407
    parameters:
      weight: 0.23
      density: 0.5
  - model: F:\mergekit\NeverSleepHistorical_lumi-nemo-e2.0
    parameters:
      weight: 0.27
      density: 0.6
  - model: F:\mergekit\intervitens_mini-magnum-12b-v1.1
    parameters:
      weight: 0.34
      density: 0.8
merge_method: della_linear
base_model: F:\mergekit\mistralaiMistral-Nemo-Base-2407
parameters:
  epsilon: 0.05
  lambda: 1
  int8_mask: true
dtype: bfloat16
```
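
As a rough illustration of how the `weight` and `lambda` values in the configuration interact, the sketch below copies the numbers from the YAML above and applies a plain weighted sum of per-model differences from the base. It is a simplified toy, not mergekit's implementation: it deliberately omits the magnitude-based pruning and rescaling that `density` and `epsilon` control in della_linear, and the names `merge_inputs`, `total_weight`, and `combine` are hypothetical helpers introduced only for this example.

```python
# Values copied from the YAML configuration above. The short keys are labels
# chosen for this sketch, not identifiers used by mergekit.
merge_inputs = {
    "Atlantis-v0.1-12B": {"weight": 0.16, "density": 0.4},
    "Mistral-Nemo-Instruct-2407": {"weight": 0.23, "density": 0.5},
    "lumi-nemo-e2.0": {"weight": 0.27, "density": 0.6},
    "mini-magnum-12b-v1.1": {"weight": 0.34, "density": 0.8},
}
LAMBDA = 1.0  # the `lambda` value from the configuration


def total_weight(inputs):
    """Sum the per-model weights."""
    return sum(params["weight"] for params in inputs.values())


def combine(base, deltas, inputs, lam):
    """Toy linear merge on flat lists of floats.

    Computes base + lam * sum_i(weight_i * delta_i) elementwise, where
    delta_i stands for (fine-tuned model - base). The density/epsilon
    pruning step of della_linear is NOT modelled here.
    """
    merged = list(base)
    for name, params in inputs.items():
        for i, value in enumerate(deltas[name]):
            merged[i] += lam * params["weight"] * value
    return merged


if __name__ == "__main__":
    print("sum of weights:", total_weight(merge_inputs))

    # Made-up two-element "tensors", purely for illustration.
    toy_base = [0.5, -0.25]
    toy_deltas = {name: [0.1, -0.2] for name in merge_inputs}
    print("toy merged values:", combine(toy_base, toy_deltas, merge_inputs, LAMBDA))
```

The four weights in this configuration add up to 1 (within floating-point rounding), so in this simplified picture the merged difference from the base is a weighted average of the four models' differences, with mini-magnum contributing the largest share.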