---
base_model:
- coldint/10.5B_v1
library_name: transformers
tags:
- mergekit
- merge
---
# BestLlamaSN29

This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).

## Merge Details
### Merge Method

This model was merged using the passthrough merge method.

### Models Merged

The following models were included in the merge:
* [coldint/10.5B_v1](https://huggingface.co/coldint/10.5B_v1)

### Configuration

The following YAML configuration was used to produce this model:

```yaml
slices:
- sources:
  - model: coldint/10.5B_v1
    layer_range: [0, 36]
- sources: # add middle layers with residuals scaled to zero
  - model: coldint/10.5B_v1
    layer_range: [34, 36]
    parameters:
      scale:
      - filter: o_proj
        value: 0.0
      - filter: down_proj
        value: 0.0
      - value: 1.0
# - sources: # add middle layers with residuals scaled to zero
#   - model: upstage/SOLAR-10.7B-v1.0
#     layer_range: [14, 24]
#     parameters:
#       scale:
#       - filter: o_proj
#         value: 0.0
#       - filter: down_proj
#         value: 0.0
#       - value: 1.0
- sources:
  - model: coldint/10.5B_v1
    layer_range: [36, 43]
merge_method: passthrough
dtype: bfloat16
```
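To make the layer layout implied by the configuration above easier to follow, here is a minimal Python sketch that expands the three active slices into a per-layer mapping. It does not call mergekit; the `SLICES` structure and the `layer_layout` helper are hypothetical names introduced for illustration, and the sketch assumes `layer_range` is half-open (`[start, end)`). Under that assumption, the merged model stacks 36 + 2 + 7 layers, with source layers 34 and 35 repeated once and their `o_proj` and `down_proj` weights scaled to zero. In a Llama-style residual block without biases on those projections, that scaling should make the repeated layers add nothing to the residual stream at initialization.

```python
# Slices copied from the configuration above (the commented-out slice is skipped).
# Assumption: layer_range is half-open, i.e. [start, end).
SLICES = [
    {"layer_range": (0, 36), "zeroed": ()},
    {"layer_range": (34, 36), "zeroed": ("o_proj", "down_proj")},
    {"layer_range": (36, 43), "zeroed": ()},
]


def layer_layout(slices):
    """Return one (output_index, source_index, zeroed_filters) tuple per output layer."""
    layout = []
    for s in slices:
        start, end = s["layer_range"]
        for src in range(start, end):
            # The output index is simply the position in the stacked result.
            layout.append((len(layout), src, s["zeroed"]))
    return layout


layout = layer_layout(SLICES)
total_layers = len(layout)
# Output layers whose projections are scaled to zero (the repeated ones).
duplicated = [(out, src) for out, src, zeroed in layout if zeroed]

print(f"output layers: {total_layers}")
for out, src in duplicated:
    print(f"output layer {out} <- source layer {src} (o_proj, down_proj scaled to 0)")
```

The mapping is only a reading aid for the config; the authoritative behavior is whatever mergekit does with the YAML.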