von Kleist, H., Zamanian, A., Shpitser, I., & Ahmidi, N. (2024). Evaluation of Active Feature Acquisition Methods for Time-varying Feature Settings. arXiv preprint arXiv:2312.01530v3.
This paper investigates the challenges of evaluating active feature acquisition (AFA) agents in healthcare, focusing on estimating the performance of these agents when deployed in real-world settings where their acquisition decisions may differ from those reflected in retrospective datasets.
The authors frame the problem of active feature acquisition performance evaluation (AFAPE) as estimating expected counterfactual acquisition and misclassification costs using retrospective data. They analyze AFAPE under various assumptions, including the "no direct effect" (NDE) assumption, where feature acquisitions don't affect underlying feature values, and the "no unobserved confounding" (NUC) assumption, where retrospective feature acquisition decisions are based solely on observed features. The paper explores three main viewpoints for addressing AFAPE: offline reinforcement learning (assuming NUC), missing data analysis (assuming NDE), and a novel semi-offline reinforcement learning framework (assuming both NUC and NDE).
The research highlights that standard evaluation methods in AFA can lead to biased results due to the distribution shift caused by the AFA agent's distinct acquisition policy. The authors demonstrate that leveraging the NDE assumption transforms the AFAPE problem into a missing data problem, allowing the application of established missing data techniques. Furthermore, they introduce a novel semi-offline reinforcement learning framework that combines aspects of both offline RL and missing data analysis, offering improved data efficiency and relaxed positivity assumptions compared to existing methods.
The study emphasizes the importance of considering the distribution shift inherent in deploying AFA agents and proposes a novel semi-offline reinforcement learning framework for more accurate performance evaluation. The authors argue that employing biased evaluation methods without acknowledging the distribution shift can have detrimental consequences, particularly in high-stakes domains like healthcare.
This research significantly contributes to the field of active feature acquisition by formally defining the AFAPE problem and proposing a novel framework for its solution. The findings have important implications for the development and deployment of reliable and safe AFA systems, particularly in healthcare, where accurate performance evaluation is crucial.
The authors acknowledge that the proposed semi-offline RL estimators, while offering advantages, may still require complex approximations and strong positivity assumptions in certain scenarios. Future research could explore more efficient and robust estimation techniques within this framework and investigate its applicability in broader settings beyond healthcare.
To Another Language
from source content
arxiv.org
Key Insights Distilled From
by Henrik von K... at arxiv.org 11-06-2024
https://arxiv.org/pdf/2312.01530.pdfDeeper Inquiries