The article discusses the challenges of distributional shifts in machine learning problems and introduces new splitting criteria for decision trees to address out-of-distribution generalization. Traditional methods assume i.i.d. data, but real-life scenarios exhibit shifts over time or locations. The new criteria focus on linear models and neural networks, enhancing tree-based models like GBDTs. By incorporating era-wise information, the new criteria aim to optimize across disjoint eras rather than the entire dataset. Experimental results show improved performance in synthetic and real-world applications, such as financial markets and health domains.
In eine andere Sprache
aus dem Quellinhalt
arxiv.org
Wichtige Erkenntnisse aus
by Timothy DeLi... um arxiv.org 03-15-2024
https://arxiv.org/pdf/2309.14496.pdfTiefere Fragen