--- language: - en license: apache-2.0 library_name: transformers tags: - mergekit - merge base_model: - arcee-ai/Virtuoso-Small - Qwen2.5-14B-Qwenvergence-model_stock - huihui-ai/Qwen2.5-14B-Instruct-abliterated-v2 - sometimesanotion/Qwen2.5-14B-Qwenvergence-model_stock metrics: - accuracy pipeline_tag: text-generation --- ![Lamarck.webp](https://huggingface.co/sometimesanotion/Lamarck-14B-v0.4-Qwenvergence/resolve/main/Lamarck.webp) --- Lamarck 14B v0.4 Qwenvergence: it's a big step up for Lamarck in terms of output quality, reasoning, and prose quality. All the same ingredients are involved as in previous releases of Lamarck; they are more effectively combined. This model features decent wit and stronger reasoning than 0.3. ## Merge Details This model was initialized from model_stock, and refined from there. No fine-tuning, use of models apart from those listed or the contents of Qwenvergence, wild parties, or sacrifices to the unnamed deities were involved. Contrary to default CO2 emissions reports, this is merging from models already made, helping to upcycle and extend the life of the compute work. It's merged on a single workstation running on nearly 50% renewable electricity. It was finalized using the [TIES](https://arxiv.org/abs/2306.01708) merge method using sometimesanotion/lamarck-14b-base as a base, per @rombodawg's continuous fine-tuning method. ### Models Merged **Top influences:** These ancestors are base models and present in the Qwenvergence model_stock, reinforced in later steps: - **[arcee-ai/Virtuoso-Small](https://huggingface.co/arcee-ai/Virtuoso-Small)** - A brand new model from Arcee, refined from the notable cross-architecture Llama-to-Qwen distillation [arcee-ai/SuperNova-Medius](https://huggingface.co/arcee-ai/SuperNova-Medius). The first two layers are nearly exclusively from Virtuoso. It has proven to be a well-rounded performer, and contributes a noticeable boost to the model's prose quality. - **[CultriX/SeQwence-14B-EvolMerge](http://huggingface.co/CultriX/SeQwence-14B-EvolMerge)** - A top contender on reasoning benchmarks. - **[VAGOsolutions/SauerkrautLM-v2-14b-DPO](https://huggingface.co/VAGOsolutions/SauerkrautLM-v2-14b-DPO)** - This model's influence is understated, but aids BBH and coding capability. - **[v000000/Qwen2.5-Lumen-14B](https://huggingface.co/v000000/Qwen2.5-Lumen-14B)** - A leading influence for prose quality. **Prose added:** The prose quality has taken a leap, no doubt also to the way [EVA-UNIT-01/EVA-Qwen2.5-14B-v0.2](https://huggingface.co/EVA-UNIT-01/EVA-Qwen2.5-14B-v0.2), [sthenno-com/miscii-14b-1028](https://huggingface.co/sthenno-com/miscii-14b-1028), [oxyapi/oxy-1-small](https://huggingface.co/oxyapi/oxy-1-small), and [underwoods/medius-erebus-magnum-14b](https://huggingface.co/underwoods/medius-erebus-magnum-14b) were applied. ### Configuration The following YAML configuration was used to finalize this model: ```yaml name: Lamarck-14B-v0.4-Qwenvergence merge_method: ties base_model: sometimesanotion/lamarck-14b-base tokenizer_source: base parameters: density: 1.00 weight: 1.00 int8_mask: true normalize: true rescale: false models: - model: merges/Qwen2.5-14B-Qwenvergence-slerp parameters: weight: 1.00 density: 1.00 - model: arcee-ai/Virtuoso-Small parameters: weight: 1.00 density: 1.00 ```