Anton Kratz

akratz

AI & ML interests

None yet

Recent Activity

Organizations

None yet

akratz's activity

upvoted an article 7 days ago
view article
Article

Merge Large Language Models with mergekit

By mlabonne •
• 91
view reply

Awesome article. It seems to me that only models with identical architecture (e.g., same number of layers, hidden dimensions, attention heads) can be merged with this approach. Is that correct? How do you know which models have identical architectures?

New activity in ctheodoris/Geneformer over 1 year ago
New activity in ctheodoris/Genecorpus-30M over 1 year ago

nonzero median

1
#1 opened over 1 year ago by
akratz
New activity in bigscience/bloom-book almost 2 years ago

🚩 Report : Not working

2
#9 opened almost 2 years ago by
TempoNaoTenho
New activity in bigscience/bloom almost 2 years ago