Core Concepts
The paper proposes a novel transfer learning approach that explicitly encourages the learning of transferable features by introducing a reconstruction loss for common information shared across correlated optimization tasks. This approach enables efficient knowledge transfer and mitigates overfitting when training on limited target task data.
Abstract
The paper presents a transfer learning framework for solving correlated optimization tasks that share the same input distribution. The key contributions are:
Establishing the concept of "common information" - the shared knowledge required for solving the correlated tasks. This can be the problem inputs themselves or a more specific representation.
Proposing a novel training approach that adds a reconstruction loss to the model, encouraging the learned features to capture the common information. This allows efficient transfer of knowledge from the source task to the target task.
Demonstrating the effectiveness of the proposed approach through three applications:
MNIST handwritten digit classification
Device-to-device wireless network power control
MISO wireless network beamforming and localization
The results show that the proposed transfer learning method significantly outperforms conventional transfer learning and regular learning approaches, especially when the target task has limited training data available. The method is able to effectively extract and transfer the common knowledge across tasks, leading to better performance and higher data efficiency.
Stats
The paper presents the following key figures and statistics:
The MNIST dataset specifications:
Data Spec A: 12,313 source task training samples, 92 target task training samples
Data Spec B: 11,664 source task training samples, 926 target task training samples
The D2D wireless network simulation settings:
10 D2D links randomly deployed in a 150m x 150m region
Each transmitter has a maximum power of 30dBm and a direct-channel antenna gain of 6dB
Noise level of -150dBm/Hz, 5MHz bandwidth with full frequency reuse
The MISO wireless network simulation settings:
M=8 base stations, each with K=4 antennas, serving a single user equipment
Rician fading channel model with a Rician factor of 5dB
Maximum transmission power of 30dBm per base station, noise power of -90dBm