license: other | |
license_name: yi-34b | |
license_link: https://huggingface.co/01-ai/Yi-34B/blob/main/LICENSE | |
## Kaiju-57B | |
I made this model as an experiment for /r/LocalLlama, who've all wanted a Yi graft like Goliath. | |
I took the goliath-120B template and used the same proportions to blend Tess-M-v1.3 and Tess-M-v1.2. The mergekit yaml is in the repo. | |
I chose these two as there are still precious few Yi-200K tunes and merging models with different ideas of positional encoding did not work well. | |
Thanks to Meta for Llama which kickstarted open weight models, thanks to Yi for the base model, thanks migtissera and the others who have fine-tuned Yi. Special shoutout to chargoddard for mergekit and the original frankenllama. | |
# Prompt Format: | |
``` | |
SYSTEM: <ANY SYSTEM CONTEXT> | |
USER: | |
ASSISTANT: | |
``` | |