--- base_model: - ibm/merlinite-7b - l3utterfly/mistral-7b-v0.1-layla-v4 - Undi95/Toppy-M-7B - senseable/WestLake-7B-v2 - jondurbin/bagel-dpo-7b-v0.4 library_name: transformers tags: - mergekit - merge --- ![](mistressmaid.png) # Franken-Mistress-7B-v2 This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit). ## Merge Details So far of the Franken merges, this one does very well using the Min-P and Noromaid settings in SillyTavern 2. This one seems even better then the 10.5B version of this model. ### Merge Method This model was merged using the [TIES](https://arxiv.org/abs/2306.01708) merge method using Franken-Maid as a base. ### Models Merged The following models were included in the merge: * [ibm/merlinite-7b](https://huggingface.co/ibm/merlinite-7b) * [l3utterfly/mistral-7b-v0.1-layla-v4](https://huggingface.co/l3utterfly/mistral-7b-v0.1-layla-v4) * [Undi95/Toppy-M-7B](https://huggingface.co/Undi95/Toppy-M-7B) * [senseable/WestLake-7B-v2](https://huggingface.co/senseable/WestLake-7B-v2) * [jondurbin/bagel-dpo-7b-v0.4](https://huggingface.co/jondurbin/bagel-dpo-7b-v0.4) ### Configuration The following YAML configuration was used to produce this model: ```yaml models: - model: ibm/merlinite-7b parameters: weight: 1 density: 1 - model: Undi95/Toppy-M-7B parameters: weight: 0.3 - model: jondurbin/bagel-dpo-7b-v0.4 parameters: weight: 0.2 - model: senseable/WestLake-7B-v2 parameters: weight: 0.2 - model: l3utterfly/mistral-7b-v0.1-layla-v4 parameters: weight: 0.2 merge_method: ties base_model: Franken-Maid parameters: density: 0.4 int8_mask: true normalize: true dtype: bfloat16 ```