Article
Merge Large Language Models with mergekit
By
•
•
91Awesome article. It seems to me that only models with identical architecture (e.g., same number of layers, hidden dimensions, attention heads) can be merged with this approach. Is that correct? How do you know which models have identical architectures?