---
base_model:
- Qwen/Qwen2.5-14B-Instruct-1M
- Lunzima/NQLSG-Qwen2.5-14B-MegaFusion-v8
- suayptalha/Lamarckvergence-14B
- Qwen/Qwen2.5-14B-Instruct
- sometimesanotion/LamarckInfusion-14B-v1
- deepseek-ai/DeepSeek-R1-Distill-Qwen-14B
- prithivMLmods/Equuleus-Opus-14B-Exp
- sometimesanotion/Lamarck-14B-v0.7-Fusion
- Lunzima/NQLSG-Qwen2.5-14B-MegaFusion-v8.7
- Qwen/Qwen2.5-Coder-14B-Instruct
library_name: transformers
tags:
- mergekit
- merge
---

# merge

This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).

## Merge Details

### Merge Method

This model was merged using the [Model Stock](https://arxiv.org/abs/2403.19522) merge method using [Lunzima/NQLSG-Qwen2.5-14B-MegaFusion-v8](https://huggingface.co/Lunzima/NQLSG-Qwen2.5-14B-MegaFusion-v8) as a base.

### Models Merged

The following models were included in the merge:

* [Qwen/Qwen2.5-14B-Instruct-1M](https://huggingface.co/Qwen/Qwen2.5-14B-Instruct-1M)
* [suayptalha/Lamarckvergence-14B](https://huggingface.co/suayptalha/Lamarckvergence-14B)
* [Qwen/Qwen2.5-14B-Instruct](https://huggingface.co/Qwen/Qwen2.5-14B-Instruct)
* [sometimesanotion/LamarckInfusion-14B-v1](https://huggingface.co/sometimesanotion/LamarckInfusion-14B-v1)
* [deepseek-ai/DeepSeek-R1-Distill-Qwen-14B](https://huggingface.co/deepseek-ai/DeepSeek-R1-Distill-Qwen-14B)
* [prithivMLmods/Equuleus-Opus-14B-Exp](https://huggingface.co/prithivMLmods/Equuleus-Opus-14B-Exp)
* [sometimesanotion/Lamarck-14B-v0.7-Fusion](https://huggingface.co/sometimesanotion/Lamarck-14B-v0.7-Fusion)
* [Lunzima/NQLSG-Qwen2.5-14B-MegaFusion-v8.7](https://huggingface.co/Lunzima/NQLSG-Qwen2.5-14B-MegaFusion-v8.7)
* [Qwen/Qwen2.5-Coder-14B-Instruct](https://huggingface.co/Qwen/Qwen2.5-Coder-14B-Instruct)

### Configuration

The following YAML configuration was used to produce this model:

```yaml
models:
  - model: Lunzima/NQLSG-Qwen2.5-14B-MegaFusion-v8
  - model: Lunzima/NQLSG-Qwen2.5-14B-MegaFusion-v8.7
  - model: deepseek-ai/DeepSeek-R1-Distill-Qwen-14B
  - model: Qwen/Qwen2.5-14B-Instruct
  - model: Qwen/Qwen2.5-14B-Instruct-1M
  - model: Qwen/Qwen2.5-Coder-14B-Instruct
  - model: prithivMLmods/Equuleus-Opus-14B-Exp
  - model: sometimesanotion/Lamarck-14B-v0.7-Fusion
  - model: sometimesanotion/LamarckInfusion-14B-v1
  - model: suayptalha/Lamarckvergence-14B
base_model: Lunzima/NQLSG-Qwen2.5-14B-MegaFusion-v8
chat_template: auto
dtype: bfloat16
merge_method: model_stock
parameters:
  int8_mask: true
tokenizer:
  source: base
```
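
For readers unfamiliar with the Model Stock method named above, the following is a minimal sketch of its interpolation rule for a single flattened weight tensor, written in plain Python. It is an illustration under stated assumptions, not mergekit's implementation: the function name `model_stock_merge` and the toy inputs are hypothetical, and the angle between fine-tuned models is estimated here as the mean pairwise cosine similarity of their offsets from the base. The real merge operates tensor by tensor on the models listed in the configuration and may differ in detail.

```python
import math
from itertools import combinations


def model_stock_merge(base, finetuned):
    """Sketch of Model Stock for one flattened weight tensor (hypothetical helper).

    base:      list of floats, the base model's weights
    finetuned: list of lists of floats, the same tensor from each fine-tuned model
    Returns (merged_weights, t), where t is the interpolation ratio.
    """
    n = len(finetuned)
    if n < 2:
        raise ValueError("need at least two fine-tuned models")

    # Offsets ("task vectors") of each fine-tuned model from the base.
    deltas = [[w - b for w, b in zip(ft, base)] for ft in finetuned]

    def cosine(u, v):
        dot = sum(a * b for a, b in zip(u, v))
        norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
        return dot / norm if norm > 0 else 0.0

    # Assumption: estimate cos(theta) as the mean pairwise cosine similarity.
    pairs = list(combinations(deltas, 2))
    cos_theta = sum(cosine(u, v) for u, v in pairs) / len(pairs)

    # Interpolation ratio from the Model Stock paper:
    #   t = N * cos(theta) / (1 + (N - 1) * cos(theta))
    # Note: the degenerate case where the denominator is zero is not handled.
    t = n * cos_theta / (1 + (n - 1) * cos_theta)

    # Move from the base toward the average of the fine-tuned weights by t.
    avg = [sum(col) / n for col in zip(*finetuned)]
    merged = [t * a + (1 - t) * b for a, b in zip(avg, base)]
    return merged, t


if __name__ == "__main__":
    # Toy example: two "models" whose offsets from the base point the same way.
    merged, t = model_stock_merge([0.0, 0.0], [[1.0, 0.0], [1.0, 0.0]])
    print(merged, t)
```

When the fine-tuned offsets agree in direction, `t` approaches 1 and the result is close to their plain average; when they are nearly orthogonal, `t` approaches 0 and the result stays close to the base model.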