Core Concepts
A flexible object proposal network that can be tuned to balance the detection of known and unknown objects based on the needs of the application.
Abstract
The key insights and contributions of this work are:
The authors propose a Tunable Hybrid Proposal Network (THPN) that leverages both classification-based and localization-based objectness representations. This allows the model's behavior to be adjusted using a single hyperparameter (λCLS) to suit a variety of open-world settings.
THPN employs a novel self-training procedure that generates high-quality pseudo-labels on the training data to improve generalization to unknown object classes, without requiring any additional unlabeled data.
The authors devise a dynamic loss function that addresses challenges like class imbalance and imperfect pseudo-label targets during training.
To thoroughly evaluate THPN, the authors introduce several new open-world proposal challenges that simulate varying degrees of label bias by altering known class diversity and label count. These challenges go beyond the common VOC→COCO benchmark.
THPN outperforms existing state-of-the-art proposal methods across all evaluation settings. It exhibits strong performance on both known and unknown object detection, and is highly data efficient, surpassing baseline recall with a fraction of the labeled data.
Stats
"The goal of the open-set object proposal task is to train a model M parameterized by θ to detect and localize all object instances of potential interest in a test set (i.e., all instances in the set K ∪ U)."
"For a given test image X, the proposal network's function is M(X; θ) = {[x, y, w, h, s]j=1...N}, where x, y, w, and h denote the center coordinates, width, and height of the bounding box, respectively. The predicted "objectness" score s ∈[0, 1] is the confidence that box j contains an object."
Quotes
"Our goal is to provide a flexible proposal solution that can be easily tuned to suit a variety of open-world settings."
"THPN outperforms all baselines in all evaluation settings that we consider."
"THPN's flexibility enables it to be a better general solution for open-set/world detection problems."