The paper introduces a novel CNN-Transformer approach for hyperspectral image (HSI) classification that leverages the strengths of CNNs in local feature extraction and transformers in long-range context modeling. The key contributions are:
The introduction of a Gate-Shift-Fuse (GSF) block designed to strengthen the extraction of local and global spatial-spectral features from HSI data. The GSF block integrates a spatial gating mechanism and a fusion block to dynamically process and combine spectral and spatial features.
The proposal of an effective attention mechanism module to enhance the extraction of local and global information from HSI cubes.
Extensive experiments conducted on four well-known HSI datasets (Indian Pines, Pavia University, WHU-WHU-Hi-LongKou, and WHU-Hi-HanChuan) demonstrate that the proposed framework achieves superior classification results compared to other state-of-the-art models.
The paper first provides an overview of traditional machine learning techniques and their limitations in handling the complex, high-dimensional, and nonlinear nature of HSI data. It then discusses the advancements of deep learning, particularly the use of Convolutional Neural Networks (CNNs) and transformers, in addressing these challenges.
The proposed CNN-Transformer approach consists of four main components:
The experimental results demonstrate the superior performance of the proposed framework compared to other state-of-the-art methods in terms of overall accuracy, average accuracy, and kappa coefficient. The ablation study further validates the contributions of the individual components, highlighting the importance of the GSF block and the TE module in enhancing the classification performance.
Sang ngôn ngữ khác
từ nội dung nguồn
arxiv.org
Thông tin chi tiết chính được chắt lọc từ
by Mohamed Fadh... lúc arxiv.org 09-27-2024
https://arxiv.org/pdf/2406.14120.pdfYêu cầu sâu hơn