|
|
|
|
|
|
Transformer-Based Method for Segmentation of Gastric Cancer
Microscopic Hyperspectral Images |
ZHANG Ran1, 2, JIN Wei1, 2, MU Ying1, YU Bing-wen2, BAI Yi-wen2, SHAO Yi-bo1, 2, PING Jin-liang3*, SONG Peng-tao3, HE Xiang-yi3, LIU Fei3, FU Lin-lin3 |
1. College of Control Science and Engineering, Zhejiang University, Hangzhou 310058, China
2. Huzhou Institute of Zhejiang University, Huzhou 313000, China
3. Huzhou Central Hospital, Huzhou 313000, China
|
|
|
Abstract Gastric cancer is the third leading cause of cancer-related deaths globally, posing a serious threat to human life and health. Therefore, early identification of gastric cancer lesions is crucial for early diagnosis of gastric cancer. As an emerging technique, microscopic hyperspectral imaging technology can simultaneously obtain rich spectral information and spatial information of biological tissues at the microscopic level, providing a new approach for early pathological slice diagnosis. In this paper, gastric cancer microscopic hyperspectral images in the range of 400~1 000 nm were collected using a microscopic hyperspectral imaging system. The gastric cancer microscopic hyperspectral dataset containing 230 images was constructed through preprocessing, such as spectral calibration. Although spatial attention-based methods have achieved significant results in image classification, segmentation, and other fields, they still face challenges of high computational complexity and insufficient utilization of spectral information when dealing with hyperspectral images. Therefore, this paper proposes a backbone network model based on convolution and attention mechanism called Mixing Dual-Branch Transformer (MDBT). This model achieves spatial and channel feature aggregation between blocks and within blocks by alternately applying spatial and channel mixing modules. Specifically, this paper designs window attention, convolution dual branches, and spatial and channel interaction structures. This design not only reduces computational complexity but also achieves window-to-window information interaction and feature fusion through convolutional interaction, overcoming the limitation of window attention's receptive field and further improving the global modeling ability of the Transformer. In the image segmentation experiments, we adopt the UperNet model as the decode head network to reconstruct the features extracted by the backbone network to obtain the final segmentation results. Five-fold cross-validation experiments were conducted on the collected gastric cancer hyperspectral dataset, and the results show that the average priceand mIoU of this paper's model reach 85.39 and 74.66, respectively, outperforming mainstream image segmentation network models such as UNet, Swin, PVT, and VIT. Meanwhile, ablation experiments are designed to verify the optimization effects of the proposed spatial and channel dual mixing modules, convolution, window attention dual branches, and other structures on experimental results. Experimental results demonstrate that the proposed MDBT model can effectively utilize hyperspectral images' rich spatial and spectral information, improve the accuracy of gastric cancer image segmentation, and prove the research significance and application value of microscopic hyperspectral imaging technology in gastric cancer diagnosis.
|
Received: 2024-03-27
Accepted: 2024-07-05
|
|
Corresponding Authors:
PING Jin-liang
E-mail: pjl0173@163.com
|
|
[1] Bray Freddie, Laversanne Mathieu, Sung Hyuna, et al. CA: A Cancer Journal for Clinicians, 2024, 74(3): 229.
[2] Khan Muhammad Jaleed, Khan Hamid Saeed, Yousaf Adeel, et al. IEEE Access, 2018, 6: 14118.
[3] Lu Bing, Dao Phuong D, Liu Jiangui, et al. Remote Sensing, 2020, 12(16): 2659.
[4] Bengs Marcel, Gessert Nils, Laffers Wiebke, et al. Spectral-Spatial Recurrent-Convolutional Networks for in-vivo Hyperspectral Tumor Type Classification, Medical Image Computing and Computer Assisted Intervention-MICCAI 2020: 23rd International Conference, Lima, Peru, October 4-8, 2020, Proceedings, Part Ⅲ 23, 2020.
[5] Jeyaraj Pandia Rajan, Samuel Nadar Edward Rajan,Panigrahi Bijaya Ketan. ResNet Convolution Neural Network Based Hyperspectral Imagery Classification for Accurate Cancerous Region Detection, 2019 IEEE Conference on Information and Communication Technology, 2019.
[6] Gao Hongmin, Yang Mengran, Cao Xueying, et al. Machine Vision and Applications, 2023, 34(5): 72.
[7] Yun Boxiang, Lei Baiying, Chen Jieneng, et al. IEEE Transactions on Circuits and Systems for Video Technology, 2024, 34(6): 4610.
[8] Liu Ze, Lin Yutong, Cao Yue, et al. Swin Transformer: Hierarchical Vision Transformer Using Shifted Windows, Proceedings of the IEEE/CVF International Conference on Computer Vision, 2021.
[9] Chen Qiang, Wu Qiman, Wang Jian, et al. Mixformer: Mixing Features Across Windows and Dimensions, Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022.
[10] Hu Jie, Shen Li, Sun Gang. Squeeze-and-Excitation Networks, Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2018.
[11] Dosovitskiy Alexey, Beyer Lucas, Kolesnikov Alexander, et al. An Image is Worth 16×16 Words: Transforms for Image Recognition at Scale, 2021. arXiv: 2010. 11929.
[12] Wang Wenhai, Xie Enze, Li Xiang, et al. Computational Visual Media, 2022, 8(3): 415.
[13] MMSegmentation Contributors. 2020, https://github.com/open-mmlab/mmsegmentation.
[14] Xiao Tete, Liu Yingcheng, Zhou Bolei, et al. Unified Perceptual Parsing for Scene Understanding, Proceedings of the European Conference on Computer Vision (ECCV), 2018.
[15] He Kaiming, Zhang Xiangyu, Ren Shaoqing, et al. Deep Residual Learning for Image Recognition, Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2016.
[16] Ronneberger Olaf, Fischer Philipp, Brox Thomas. U-net: Convolutional Networks for Biomedical Image Segmentation, Medical Image Computing and Computer-Assisted Intervention-MICCAI 2015: 18th International Conference, Munich, Germany, October 5-9, 2015, Proceedings, Part Ⅲ 18, 2015.
[17] Chu Xiangxiang, Tian Zhi, Wang Yuqing, et al. Advances in Neural Information Processing Systems, 34(Neurl PS 2021), 9355.
[18] Xie Enze, Wang Wenhai, Yu Zhiding, et al. Advances in Neural Information Processing Systems 34 (Neurl PS 2021), 12077.
|
[1] |
TANG Bin1, HE Yu-long1, TANG Huan2*, LONG Zou-rong1, WANG Jian-xu1*, TAN Bo-wen2, QIN Dan2, LUO Xi-ling1, ZHAO Ming-fu1. Attention Mechanism Based Hyperspectral Image Dimensionality Reduction for Mold Spot Recognition in Paper Artifacts[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2025, 45(01): 246-255. |
[2] |
WANG Sa1, 2, QU Liang1, 2*, ZHANG Li-fu3, GAO Yu3, LI Guang-hua1, 2, CHANG Jing-jing1, 2. Research on the Inverse Model of Paper Viscosity Based on Hyperspectral Data[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2024, 44(12): 3524-3533. |
[3] |
HUANG Lin-feng1, JIANG Xue-song1, 2*, JIA Zhi-cheng1, ZHOU Hong-ping1, 2, ZHOU Lei1, RONG Zi-fan1. Deep Learning-Based Monitoring of Nutrient Content in Pear Trees[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2024, 44(12): 3543-3552. |
[4] |
JIA Tong-hua1, CHENG Guang-xu1*, YANG Jia-cong1, CHEN Sheng2, WANG Hai-rong3, HU Hai-jun1. Research of Chlorine Concentration Inversion Method Based on 1D-CNN Using Ultraviolet Spectral[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2024, 44(11): 3109-3119. |
[5] |
YUE Ji-bo1, LENG Meng-die1, TIAN Qing-jiu2, GUO Wei1, LIU Yang3, FENG Hai-kuan4, QIAO Hong-bo1*. Estimation of Leaf Physical and Chemical Parameters Based on Hyperspectral Remote Sensing and Deep Learning Technologies[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2024, 44(10): 2873-2883. |
[6] |
HAN Bo-chong1, 2, SONG Yi-han1, 2*, ZHAO Yong-heng1, 2. Classification of Star Spectrum Based on Multi-Scale Feature Fusion[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2024, 44(08): 2284-2288. |
[7] |
TU Liang-ping1, 2, LI Shuang-chuan2, TU Dong-xin1*, LI Jian-xi*, DING Zhi-chao2. SATS: A Stellar Spectral Classification Algorithm Based on Multiple Feature Extraction[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2024, 44(07): 2029-2036. |
[8] |
YU Shui1, HUAN Ke-wei1*, LIU Xiao-xi2, WANG Lei1. Quantitative Analysis Modeling of Near Infrared Spectroscopy With
Parallel Convolution Neural Network[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2024, 44(06): 1627-1635. |
[9] |
WANG Zi-xuan1, YANG Liang2, 3, 4*, HUANG Ling-xia2, HE Yong4, ZHAO Li-hua3, ZHAN Peng-fei3. Nondestructive Determination of TSS Content in Postharvest Mulberry Fruits Using Hyperspectral Imaging and Deep Learning[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2024, 44(06): 1724-1730. |
[10] |
MA Xiang-cai1, 2, CAO Qian2, BAI Chun-yan2, WANG Xiao-hong3, ZHANG Da-wei1*. Research on Low Illumination Image Enhancement Method Based on Spectral Reflectance[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2024, 44(03): 610-616. |
[11] |
TANG Jie1, LUO Yan-bo2, LI Xiang-yu2, CHEN Yun-can1, WANG Peng1, LU Tian3, JI Xiao-bo4, PANG Yong-qiang2*, ZHU Li-jun1*. Study on One-Dimensional Convolutional Neural Network Model Based on Near-Infrared Spectroscopy Data[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2024, 44(03): 731-736. |
[12] |
ZHANG Mei-ling, CHEN Yong-jie, WANG Min-juan, LI Min-zan, ZHENG Li-hua*. A Hyperspectral Deep Learning Model for Predicting Anthocyanin
Content in Purple Leaf Lettuce[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2024, 44(03): 865-871. |
[13] |
ZHANG Hai-liang1, ZHOU Xiao-wen1, LIU Xue-mei2*, LUO Wei2, ZHAN Bai-shao2, PAN Fan3. Freshness Identification of Turbot Based on Convolutional Neural
Network and Hyperspectral Imaging Technology[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2024, 44(02): 367-371. |
[14] |
KANG Rui1, 2, CHENG Ya-wen1, 2, ZHOU Ling-li1, 2, REN Ni1, 2*. A Novel Classification Method of Foodborne Bacterial Species Based on Hyperspectral Microscopy Imaging Technology[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2024, 44(02): 392-397. |
[15] |
WANG Yu-chen1, 2, KONG Ling-qin1, 2, 3*, ZHAO Yue-jin1, 2, 3, DONG Li-quan1, 2, 3*, LIU Ming1, 2, 3, HUI Mei1, 2. Hyperspectral Reconstruction From RGB Images for Tissue Oxygen
Saturation Assessment[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(10): 3193-3201. |
|
|
|
|