The paper introduces DualLoop, an active learning method for interactive ontology matching. The key highlights are:
DualLoop employs an ensemble of tunable heuristic matchers to bootstrap the active learning process and provide initial voting results.
The short-term learner in the fast loop systematically selects high-confidence matching candidates identified by the labeling function ensemble, prioritizing exploitation to overcome the challenge of extreme class imbalance in ontology matching.
The slow loop creates and tunes new labeling functions based on a variety of distance metrics, allowing exploration of the space of potential matches beyond the initial set of heuristics.
Experiments on three datasets show that DualLoop consistently achieves higher F1 scores and recall compared to other active learning methods, while reducing the expected query cost needed to discover 90% of all matches by over 50%.
DualLoop has been successfully deployed in a commercial data interoperability system called TrioNet, demonstrating its practical value and efficiency in the Architecture, Engineering, and Construction (AEC) industry.
To Another Language
from source content
arxiv.org
Deeper Inquiries