Automating Dataset Updates for Reliable and Timely Evaluation
The author proposes two strategies, mimicking and extending, to automate dataset updates for reliable and timely evaluation by addressing data leakage issues and controlling sample difficulty.