The paper presents a novel reinforcement learning-based approach for channel denoising in MIMO OFDM systems. The key highlights are:
Introduction of channel curvature as a metric to quantify the reliability of channel estimates, and derivation of a curvature magnitude threshold to identify unreliable estimates.
Formulation of the channel denoising process as a Markov Decision Process (MDP), where the actions involve updating the channel estimates based on the geometry of neighboring subcarriers, and the reward function captures the noise reduction achieved.
Application of Q-learning to solve the MDP and find the optimal sequential denoising order, without requiring any prior channel statistics or labeled training data.
Incorporation of a feedback mechanism to dynamically adjust the curvature threshold and further improve the denoising performance.
The proposed method is shown to outperform conventional least squares (LS) estimation and approach the performance of the ideal linear minimum mean square error (LMMSE) estimation, while exhibiting robustness against variations in channel conditions and statistical knowledge.
To Another Language
from source content
arxiv.org
Key Insights Distilled From
by Myeung Suk O... at arxiv.org 03-29-2024
https://arxiv.org/pdf/2101.10300.pdfDeeper Inquiries