Cross-Modal Dual-Channel Camouflaged Object Detection Method for Visible-Spectrum Image
CHENG Yu-hu, WU Shi-jia, WANG Hao-yu, WANG Xue-song*
School of Information and Control Engineering, China University of Mining and Technology, Xuzhou 221116, China
Abstract The camouflaged object detection (COD) task for visible-spectrum images aims to use visible-spectrum information to detect camouflaged objects that are visually consistent with their surrounding environment. This visual consistency makes object boundaries hard to distinguish and discriminative features hard to learn, which limits the effectiveness of existing object detection methods for COD. A Cross-modal Dynamic Collaborative Dual-channel Network (CDCDN) is proposed to explore the potential of global-local multi-level visual perception and visual-language models (VLMs) in COD. First, to address the difficulty of distinguishing object boundaries, a dynamic collaborative dual-channel module is designed. The dual channels decouple the detection process into global information localization and local feature refinement, enabling object detection and optimization from a multi-level visual perspective. A dynamic information collaboration and fusion mechanism is established, in which global gating constraints and local perception correction allow global and local information to complement and correct each other. This enhances the spatial capture capability of the model in scenarios with blurred object boundaries. Second, to address the difficulty of learning discriminative features, a cross-modal scene-object matching module is designed. By incorporating a pre-trained VLM, this module establishes cross-modal interactions between the visual and language modalities, thereby enhancing the distinction between objects and backgrounds in the feature space and improving the model's semantic discrimination in scenes with limited discriminative features. CDCDN is evaluated on the MHCD2022 and COD10K datasets using the mAP@0.5, mAP@0.5:0.95, and mAP@0.75 metrics. CDCDN achieves scores of 67.6%, 42.6%, and 48.4% on the MHCD2022 dataset, and 67.9%, 40.6%, and 41.0% on the COD10K dataset, respectively.
Compared with five mainstream object detection methods (Faster R-CNN, DETR, Lite-DETR, YOLOv5, and YOLOv10), CDCDN achieves the best detection accuracy on all three metrics. Visualization of detection results in four common camouflaged scenes (barren land, grassland, woodland, and snowfield) demonstrates the adaptability of CDCDN to various scenes. In an ablation study, the key components of CDCDN are incrementally removed to systematically evaluate their contributions, and the results show that each component significantly enhances the model's detection performance. Comprehensive experimental results indicate that CDCDN can accurately detect camouflaged objects that are highly visually consistent with their surroundings, providing a novel solution for COD.
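The abstract does not spell out the evaluation protocol behind the three mAP metrics. As a minimal sketch, assuming the common COCO-style convention, a predicted box counts as a correct detection at a given threshold when its intersection over union (IoU) with a ground-truth box reaches that threshold; mAP@0.5 and mAP@0.75 use single thresholds, while mAP@0.5:0.95 averages over ten thresholds from 0.50 to 0.95 in steps of 0.05. The function names and the example boxes below are illustrative assumptions, not taken from the paper.

```python
def iou(box_a, box_b):
    """Intersection over union of two axis-aligned boxes given as (x1, y1, x2, y2)."""
    ix1 = max(box_a[0], box_b[0])
    iy1 = max(box_a[1], box_b[1])
    ix2 = min(box_a[2], box_b[2])
    iy2 = min(box_a[3], box_b[3])
    # Width and height of the overlap are clamped at zero for disjoint boxes.
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (box_a[2] - box_a[0]) * (box_a[3] - box_a[1])
    area_b = (box_b[2] - box_b[0]) * (box_b[3] - box_b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


# Assumed COCO-style threshold grid: 0.50, 0.55, ..., 0.95 (ten values).
COCO_THRESHOLDS = [round(0.5 + 0.05 * i, 2) for i in range(10)]


def matched_thresholds(pred_box, gt_box):
    """Thresholds at which the prediction would count as a true positive."""
    overlap = iou(pred_box, gt_box)
    return [t for t in COCO_THRESHOLDS if overlap >= t]


# Hypothetical boxes: the prediction covers 80% of the ground truth.
gt = (0, 0, 10, 10)
pred = (0, 0, 10, 8)
hits = matched_thresholds(pred, gt)
```

With these example boxes the prediction is a match at the 0.5 and 0.75 thresholds but not at 0.85 and above, which is why mAP@0.5:0.95 is typically the lowest of the three scores. A full mAP computation would additionally rank detections by confidence and integrate the precision-recall curve per class, which this sketch omits.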
Received: 2024-12-31
Accepted: 2025-05-20
Corresponding Author:
WANG Xue-song
E-mail: wangxuesongcumt@163.com