Core Concepts
Alternating Tuning and Merging (ATM) is a novel iterative approach to model merging that surpasses one-shot methods by gradually integrating task-specific knowledge, leading to improved multi-task learning performance.
Zhou, L., Solombrino, D., Crisostomi, D., Bucarelli, M.S., Silvestri, F., & Rodolà, E. (2024). ATM: Improving Model Merging by Alternating Tuning and Merging. arXiv preprint arXiv:2411.03055.
This paper investigates the limitations of existing one-shot model merging techniques, particularly task arithmetic, and proposes a novel iterative approach called Alternating Tuning and Merging (ATM) to enhance multi-task learning performance.