المفاهيم الأساسية
This paper introduces IGroupSS-Mamba, a novel deep learning framework for hyperspectral image classification that leverages the strengths of Selective State Space Models (SSMs) in a computationally efficient manner to achieve state-of-the-art classification accuracy.
الملخص
Bibliographic Information:
He, Y., Tu, B., Jiang, P., Liu, B., Li, J., & Plaza, A. (2024). IGroupSS-Mamba: Interval Group Spatial-Spectral Mamba for Hyperspectral Image Classification. IEEE Transactions on Geoscience and Remote Sensing, X(X).
Research Objective:
This paper aims to address the limitations of existing deep learning models for hyperspectral image (HSI) classification, particularly in handling the high dimensionality and information redundancy of HSI data, by proposing a lightweight yet powerful framework called IGroupSS-Mamba.
Methodology:
The proposed IGroupSS-Mamba framework employs a hierarchical structure with multiple stages, each incorporating a downsampling operation and an Interval Group Spatial-Spectral Block (IGSSB). The IGSSB leverages an Interval Group S6 Mechanism (IGSM) to perform interval-wise feature grouping and parallel unidirectional sequence scanning along both spatial and spectral dimensions using Selective State Space Models (SSMs). This approach enables efficient global spatial-spectral feature extraction while mitigating information redundancy.
Key Findings:
- IGroupSS-Mamba significantly outperforms state-of-the-art HSI classification methods in terms of overall accuracy, average accuracy, and Kappa coefficient on three benchmark datasets: Indian Pines, Pavia University, and Houston 2013.
- The proposed interval grouping strategy effectively reduces computational costs while leveraging the complementary strengths of different scanning directions.
- The hierarchical structure with downsampling operations facilitates multi-scale spatial-spectral semantic learning, further enhancing classification accuracy.
Main Conclusions:
IGroupSS-Mamba presents a novel and effective solution for HSI classification by combining the advantages of SSMs, interval grouping, and hierarchical feature learning. The proposed framework achieves state-of-the-art performance with reduced computational complexity compared to existing methods.
Significance:
This research contributes to the advancement of HSI classification by introducing a computationally efficient and highly accurate framework that addresses the challenges posed by the high dimensionality and information redundancy of HSI data. The proposed IGroupSS-Mamba has the potential to improve the performance of various remote sensing applications.
Limitations and Future Research:
Future work could explore the integration of attention mechanisms within the IGSM to further enhance the model's ability to selectively focus on relevant spatial-spectral features. Additionally, investigating the application of IGroupSS-Mamba to other remote sensing tasks, such as object detection and change detection, could be promising research directions.
الإحصائيات
The Pavia University dataset encompasses 103 spectral bands and 610 × 340 pixels, with a spatial resolution of 1.3 m per pixel.
The Indian Pines dataset consists of 200 spectral bands and 145 × 145 pixels, with a spatial resolution of 20 m per pixel.
The Houston 2013 dataset comprises 144 spectral bands and 340 × 1905 pixels, with a spatial resolution of 2.5 m per pixel.
The experiments on the Indian Pines, Pavia University, and Houston 2013 datasets were conducted with 10%, 5%, and 10% of the labeled samples, respectively.
The PCA dimension for reduction was set to 30.
The state dimension and expansion ratio in the S6 mechanism were fixed at 16 and 1, respectively.
The optimal patch size for IGroupSS-Mamba was determined to be 13 × 13.
The embedding dimension was set to 32, and the stage depth was determined as 3.
The downsample scale [2, 1] was uniformly applied across all three datasets.
اقتباسات
"Recent Mamba [19] built upon the State Space Models (SSMs) establish long-distance dependency through state transitions, which enjoys the advantages of global contextual modeling, linear computational complexity, and selective information processing."
"The high dimensionality of HSIs inevitably imposes substantial computational burdens."
"Adjacent spectral bands in original HSIs typically exhibit high similarity."
"Traditional multi-directional scanning strategy applied to all spectral bands may result in information redundancy."
"Given the abundant spectral information and strong spatial correlation inherent in HSIs, sequence scanning along only the spectral or spatial dimension may lead to the loss of spatial or spectral information, respectively."