The article discusses the challenges of distributional shifts in machine learning problems and introduces new splitting criteria for decision trees to address out-of-distribution generalization. Traditional methods assume i.i.d. data, but real-life scenarios exhibit shifts over time or locations. The new criteria focus on linear models and neural networks, enhancing tree-based models like GBDTs. By incorporating era-wise information, the new criteria aim to optimize across disjoint eras rather than the entire dataset. Experimental results show improved performance in synthetic and real-world applications, such as financial markets and health domains.
Sang ngôn ngữ khác
từ nội dung nguồn
arxiv.org
Thông tin chi tiết chính được chắt lọc từ
by Timothy DeLi... lúc arxiv.org 03-15-2024
https://arxiv.org/pdf/2309.14496.pdfYêu cầu sâu hơn