Lunzima
/

NQLSG-Qwen2.5-14B-OriginalFusion

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

NQLSG-Qwen2.5-14B-OriginalFusion / README.md

Lunzima's picture

Upload folder using huggingface_hub

d07dd0e verified 5 days ago

|

history blame contribute delete

2.52 kB

	---
	base_model:
	- Qwen/Qwen2.5-14B-Instruct-1M
	- Lunzima/NQLSG-Qwen2.5-14B-MegaFusion-v8
	- suayptalha/Lamarckvergence-14B
	- Qwen/Qwen2.5-14B-Instruct
	- sometimesanotion/LamarckInfusion-14B-v1
	- deepseek-ai/DeepSeek-R1-Distill-Qwen-14B
	- prithivMLmods/Equuleus-Opus-14B-Exp
	- sometimesanotion/Lamarck-14B-v0.7-Fusion
	- Lunzima/NQLSG-Qwen2.5-14B-MegaFusion-v8.7
	- Qwen/Qwen2.5-Coder-14B-Instruct
	library_name: transformers
	tags:
	- mergekit
	- merge

	---
	# merge

	This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).

	## Merge Details
	### Merge Method

	This model was merged using the [Model Stock](https://arxiv.org/abs/2403.19522) merge method using [Lunzima/NQLSG-Qwen2.5-14B-MegaFusion-v8](https://huggingface.co/Lunzima/NQLSG-Qwen2.5-14B-MegaFusion-v8) as a base.

	### Models Merged

	The following models were included in the merge:
	* [Qwen/Qwen2.5-14B-Instruct-1M](https://huggingface.co/Qwen/Qwen2.5-14B-Instruct-1M)
	* [suayptalha/Lamarckvergence-14B](https://huggingface.co/suayptalha/Lamarckvergence-14B)
	* [Qwen/Qwen2.5-14B-Instruct](https://huggingface.co/Qwen/Qwen2.5-14B-Instruct)
	* [sometimesanotion/LamarckInfusion-14B-v1](https://huggingface.co/sometimesanotion/LamarckInfusion-14B-v1)
	* [deepseek-ai/DeepSeek-R1-Distill-Qwen-14B](https://huggingface.co/deepseek-ai/DeepSeek-R1-Distill-Qwen-14B)
	* [prithivMLmods/Equuleus-Opus-14B-Exp](https://huggingface.co/prithivMLmods/Equuleus-Opus-14B-Exp)
	* [sometimesanotion/Lamarck-14B-v0.7-Fusion](https://huggingface.co/sometimesanotion/Lamarck-14B-v0.7-Fusion)
	* [Lunzima/NQLSG-Qwen2.5-14B-MegaFusion-v8.7](https://huggingface.co/Lunzima/NQLSG-Qwen2.5-14B-MegaFusion-v8.7)
	* [Qwen/Qwen2.5-Coder-14B-Instruct](https://huggingface.co/Qwen/Qwen2.5-Coder-14B-Instruct)

	### Configuration

	The following YAML configuration was used to produce this model:

	```yaml
	models:
	- model: Lunzima/NQLSG-Qwen2.5-14B-MegaFusion-v8
	- model: Lunzima/NQLSG-Qwen2.5-14B-MegaFusion-v8.7
	- model: deepseek-ai/DeepSeek-R1-Distill-Qwen-14B
	- model: Qwen/Qwen2.5-14B-Instruct
	- model: Qwen/Qwen2.5-14B-Instruct-1M
	- model: Qwen/Qwen2.5-Coder-14B-Instruct
	- model: prithivMLmods/Equuleus-Opus-14B-Exp
	- model: sometimesanotion/Lamarck-14B-v0.7-Fusion
	- model: sometimesanotion/LamarckInfusion-14B-v1
	- model: suayptalha/Lamarckvergence-14B
	base_model: Lunzima/NQLSG-Qwen2.5-14B-MegaFusion-v8
	chat_template: auto
	dtype: bfloat16
	merge_method: model_stock
	parameters:
	int8_mask: true
	tokenizer:
	source: base

	```