Research on Spectral Reconstruction Method From RGB Imaging Based on Dual Attention Mechanism
SUN Bang-yong1, YU Meng-ying1, YAO Qi2*
1. Faculty of Printing, Packaging Engineering and Digital Media Technology, Xi'an University of Technology, Xi'an 710054, China
2. Academy for Engineering and Technology, Fudan University, Shanghai 200433, China
Abstract: Spectral images store more information than RGB images and, in principle, have a wider range of applications. However, because spectral imaging equipment is costly and its data processing is complex, spectral images are mainly used in remote sensing, military, and a few other fields. In recent years, researchers have proposed reconstructing spectral images from RGB images by mathematical methods, which can greatly broaden the application range of spectral imaging. However, current spectral reconstruction models still suffer from loss of image detail and insufficient spectral accuracy. Therefore, this paper proposes a spectral reconstruction method from RGB images based on a dual attention mechanism, which improves reconstruction quality in terms of both image detail and spectral accuracy. The proposed method designs a deep sparse-signal reconstruction network that focuses on the sparse characteristics of RGB images and achieves sparse-to-complete signal reconstruction by accurately extracting multi-level image features and mining richer semantic information. Regarding the network structure, the spectral reconstruction network first uses small-parameter convolutions to extract shallow features of the RGB image. Then, an efficient multi-frequency channel attention mechanism computes the correlation between the channels of each feature layer and redistributes the feature responses by inter-channel weighting. At the same time, a layer-feature weighted-fusion attention mechanism is introduced to learn the dependence between features of different layers; its weights are optimized by weighting across layers so that effective spectral depth features are extracted. Finally, based on the extracted depth features, a convolution maps the features to the specified spectral dimension to produce the hyperspectral image.
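The pipeline described above (shallow convolution, channel attention, layer-weighted fusion, and a final convolution to the spectral dimension) can be sketched as follows. This is an illustrative reading of the abstract, not the authors' code: the multi-frequency (DCT-based) channel attention is simplified to global-average-pooling attention, and the feature width, depth, and 31 output bands are assumptions.

```python
# Illustrative sketch of the described architecture (not the authors' implementation).
import torch
import torch.nn as nn

class ChannelAttention(nn.Module):
    """Channel attention: weight each channel by its learned importance.
    The paper's multi-frequency variant is simplified here to global
    average pooling followed by a small bottleneck MLP."""
    def __init__(self, channels, reduction=4):
        super().__init__()
        self.fc = nn.Sequential(
            nn.Linear(channels, channels // reduction),
            nn.ReLU(inplace=True),
            nn.Linear(channels // reduction, channels),
            nn.Sigmoid(),
        )

    def forward(self, x):
        w = self.fc(x.mean(dim=(2, 3)))       # (B, C) per-channel weights
        return x * w[:, :, None, None]        # reweight the feature maps

class SpectralReconNet(nn.Module):
    """RGB (3 channels) -> hyperspectral (assumed 31 bands) sketch."""
    def __init__(self, feats=32, bands=31, depth=3):
        super().__init__()
        self.head = nn.Conv2d(3, feats, 3, padding=1)   # shallow features
        self.blocks = nn.ModuleList(
            nn.Sequential(nn.Conv2d(feats, feats, 3, padding=1),
                          nn.ReLU(inplace=True),
                          ChannelAttention(feats))
            for _ in range(depth))
        # learnable per-layer weights: "layer-feature weighted fusion"
        self.layer_w = nn.Parameter(torch.ones(depth))
        self.tail = nn.Conv2d(feats, bands, 3, padding=1)

    def forward(self, x):
        f = self.head(x)
        outs = []
        for blk in self.blocks:
            f = blk(f)
            outs.append(f)                    # keep every layer's features
        w = torch.softmax(self.layer_w, dim=0)
        fused = sum(wi * oi for wi, oi in zip(w, outs))
        return self.tail(fused)               # map to spectral dimension
```

The learnable softmax weights over layer outputs mirror the idea of optimizing inter-layer dependence, while the final convolution plays the role of the dimension-mapping step.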
The experiments use the Python 3.7 programming language with PyTorch 1.2 as the deep-learning framework, and the spectral reconstruction network is trained with a loss function that combines the spectral-image error and the RGB-image error. The proposed method is compared with seven mainstream spectral reconstruction methods on the NTIRE 2020 and CAVE datasets. From a subjective perspective, the spectral images recovered by the proposed method show clearer details and smaller errors. In terms of objective indicators, compared with the best-performing method in the existing literature, the spectral images reconstructed by the proposed method reduce RRMSE, RSAM, and RERGAS by 18.9%, 16.6%, and 22.2%, respectively, while RPSNR improves by 4.5%. The experimental results therefore demonstrate the effectiveness of the proposed RGB-to-spectral reconstruction method.
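The combined loss described above can be sketched as a spectral-image term plus an RGB re-projection term, where the reconstructed spectra are projected back to RGB through a camera response matrix. This is a plausible reading of the abstract, not the authors' exact formulation: the response matrix `csf`, the use of mean absolute error, and the weight `lam` are all assumptions.

```python
# Hedged sketch of a combined spectral + RGB loss (illustrative assumptions:
# MAE for both terms, a 31x3 camera response matrix, weight lam=0.1).
import torch

def combined_loss(pred_hsi, gt_hsi, rgb, csf, lam=0.1):
    # spectral-image error: compare reconstructed and ground-truth spectra
    spectral_err = torch.mean(torch.abs(pred_hsi - gt_hsi))
    # RGB-image error: project the reconstructed spectra (B, 31, H, W)
    # back to RGB (B, 3, H, W) with the response matrix csf (31, 3)
    pred_rgb = torch.einsum('bchw,cs->bshw', pred_hsi, csf)
    rgb_err = torch.mean(torch.abs(pred_rgb - rgb))
    return spectral_err + lam * rgb_err
```

The RGB term constrains the reconstruction to remain consistent with the observed image, which is one way a combined loss can improve both spectral accuracy and image detail.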
SUN Bang-yong, YU Meng-ying, YAO Qi. Research on Spectral Reconstruction Method From RGB Imaging Based on Dual Attention Mechanism. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(09): 2687-2693.