The article discusses the challenges of distributional shifts in machine learning problems and introduces new splitting criteria for decision trees to address out-of-distribution generalization. Traditional methods assume i.i.d. data, but real-life scenarios exhibit shifts over time or locations. The new criteria focus on linear models and neural networks, enhancing tree-based models like GBDTs. By incorporating era-wise information, the new criteria aim to optimize across disjoint eras rather than the entire dataset. Experimental results show improved performance in synthetic and real-world applications, such as financial markets and health domains.
לשפה אחרת
מתוכן המקור
arxiv.org
תובנות מפתח מזוקקות מ:
by Timothy DeLi... ב- arxiv.org 03-15-2024
https://arxiv.org/pdf/2309.14496.pdfשאלות מעמיקות