The paper focuses on learning positional information of nodes in a graph, which is important for link prediction tasks. The authors propose a representation of positional information using a small number of representative nodes called landmarks. The landmarks are selected based on their high degree centrality, which is motivated by network science theory.
The authors provide a theoretical analysis on the achievable accuracy of distance estimates via landmarks for well-known random graph models, such as Erdős-Rényi (ER) and Barabási-Albert (BA) models. For ER graphs, they show that the detour via landmarks incurs a constant factor overhead compared to the shortest path distance. For BA graphs, they prove that the strategy of choosing high-degree nodes as landmarks is asymptotically optimal, i.e., the minimum detour distance is asymptotically equal to the shortest path distance.
Motivated by the theoretical insights, the authors propose the Hierarchical Position embedding with Landmarks and Clustering (HPLC) algorithm. HPLC partitions the graph into clusters, selects the node with the highest degree in each cluster as a landmark, and computes the encoding based on the distances to landmarks and the hierarchical grouping of clusters. The computation of HPLC can be mainly done during preprocessing, incurring low computational costs.
The experiments on 7 datasets with 16 baseline methods show that HPLC achieves state-of-the-art performances in link prediction, demonstrating the effectiveness of the proposed landmark-based positional embedding approach.
Sang ngôn ngữ khác
từ nội dung nguồn
arxiv.org
Thông tin chi tiết chính được chắt lọc từ
by Minsang Kim,... lúc arxiv.org 04-22-2024
https://arxiv.org/pdf/2402.08174.pdfYêu cầu sâu hơn