TRAM integrates sharpness-aware minimization with trust region optimization to improve out-of-domain generalization by minimizing parameter sharpness and representation curvature.
TRAM integrates sharpness-aware minimization with trust region optimization to improve out-of-domain generalization during fine-tuning.