mayflowergmbh
/

occiglot-7b-de-es-fr-it-en-instruct-taskarithmetic

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

occiglot-7b-de-es-fr-it-en-instruct-taskarithmetic / README.md

johannhartmann's picture

Upload folder using huggingface_hub

c6feba0 verified 10 months ago

|

history blame contribute delete

1.86 kB

	---
	base_model:
	- occiglot/occiglot-7b-de-en-instruct
	- occiglot/occiglot-7b-it-en-instruct
	- occiglot/occiglot-7b-fr-en-instruct
	- occiglot/occiglot-7b-es-en-instruct
	- mistralai/Mistral-7B-v0.1
	library_name: transformers
	tags:
	- mergekit
	- merge

	---
	# occitaskarithmetic

	This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).

	## Merge Details
	### Merge Method

	This model was merged using the [task arithmetic](https://arxiv.org/abs/2212.04089) merge method using [mistralai/Mistral-7B-v0.1](https://huggingface.co/mistralai/Mistral-7B-v0.1) as a base.

	### Models Merged

	The following models were included in the merge:
	* [occiglot/occiglot-7b-de-en-instruct](https://huggingface.co/occiglot/occiglot-7b-de-en-instruct)
	* [occiglot/occiglot-7b-it-en-instruct](https://huggingface.co/occiglot/occiglot-7b-it-en-instruct)
	* [occiglot/occiglot-7b-fr-en-instruct](https://huggingface.co/occiglot/occiglot-7b-fr-en-instruct)
	* [occiglot/occiglot-7b-es-en-instruct](https://huggingface.co/occiglot/occiglot-7b-es-en-instruct)

	### Configuration

	The following YAML configuration was used to produce this model:

	```yaml
	models:
	- model: mistralai/Mistral-7B-v0.1
	# no parameters necessary for base model
	- model: occiglot/occiglot-7b-de-en-instruct
	parameters:
	density: 0.6
	weight: 0.25
	- model: occiglot/occiglot-7b-it-en-instruct
	parameters:
	density: 0.6
	weight: 0.25
	- model: occiglot/occiglot-7b-fr-en-instruct
	parameters:
	density: 0.6
	weight: 0.25
	- model: occiglot/occiglot-7b-es-en-instruct
	parameters:
	density: 0.6
	weight: 0.25
	merge_method: task_arithmetic
	base_model: mistralai/Mistral-7B-v0.1
	parameters:
	int8_mask: true
	dtype: bfloat16
	random_seed: 0
	tokenizer_source: model:occiglot/occiglot-7b-de-en-instruct

	```