Collection of relevant papers about model merging
-
Qualitatively characterizing neural network optimization problems
Paper โข 1412.6544 โข Published โข 4 -
Averaging Weights Leads to Wider Optima and Better Generalization
Paper โข 1803.05407 โข Published โข 2 -
Merging Models with Fisher-Weighted Averaging
Paper โข 2111.09832 โข Published โข 1 -
Model soups: averaging weights of multiple fine-tuned models improves accuracy without increasing inference time
Paper โข 2203.05482 โข Published โข 6