1. School of Optoelectronic Engineering, Guilin University of Electronic Technology, Guilin 541200, China
2. Harbin Engineering University, Harbin 150001, China
3. Key Lab of In-Fiber Integrated Optics, Ministry of Education, Harbin 150001, China
Abstract:The emergence of the novel coronavirus has caused significant losses in the global economy and public safety. The need for efficient detection and diagnosis has become urgent as the virus continues to inflict damage worldwide. Accurate and fast diagnosis of the novel coronavirus is critical for epidemic prevention and control. With the development of technology, deep learning has made many breakthroughs in detection and recognition, attracting widespread attention from researchers. However, deep learning requires a large amount of data for model training, and the collection of Raman spectra data for the novel coronavirus is limited by devices and environmental factors, making it difficult to obtain large amounts of data. The limited training data can hinder the training of deep learning models and limit their accuracy, resulting in poor performance in actual detection. To address this problem, this study introduces the CGAN adversarial network to automatically extract features from Raman spectra data and generate new spectra to expand the dataset for the novel coronavirus. These methods can effectively increase the training set's size and improve the model's accuracy. Using the enhanced Raman spectra dataset and deep neural networks, as well as traditional machine learning methods, including logistic regression, decision trees, random forests, support vector machines, and k-nearest neighbors algorithm, the diagnosis of COVID-19 is performed. The experimental results show that the deep neural network has a prediction accuracy of 98% for whether a patient is infected with the novel coronavirus, which is higher than traditional machine learning algorithms, demonstrating the superiority of deep learning models in Raman spectra detection of the novel coronavirus. We also compared the performance of the datasets before and after augmentation in different models, proving the effectiveness of data augmentation. Compared with traditional detection methods, this method is non-invasive, fast, and accurate, providing a new approach for biomedical detection of the novel coronavirus. The method proposed in this study can assist in rapidly and accurately detecting the novel coronavirus. It can be applied not only to the diagnosis of the novel coronavirus but also to the diagnosis of other diseases, having practical application value.
Key words:Novel coronavirus detection;Neural network;Raman spectra, Data augmentation;Qualitative classification
[1] WANG Wei(王 威). Youth Journalist(青年记者), 2020,(29): 60.
[2] GAO Peng, LI Meng-yao, LI Qiao-sheng, et al(高 鹏, 李梦瑶, 李乔晟, 等). Chinese Journal of Public Health(中国公共卫生), 2022, 38(5): 619.
[3] Lee K S, Landry Z, Pereira F C,et al. Nature Reviews Methods Primers, 2021, 1:80.
[4] SUN Gui-fang, WANG Ya-li, MENG Xian-zhu, et al(孙桂芳, 王雅丽, 孟现柱, 等). Chinese Optics(中国光学), 2019, 12(5): 1118.
[5] ZENG Qi, LIU Rui, WANG Nan, et al(曾 琦, 刘 瑞, 王 楠, 等). Acta Photonica Sinica(光子学报), 2021, 50(10): 1017002.
[6] WANG Xin-yi, DONG Peng-cheng, LUO Xin, et al(王新怡, 董鹏程, 罗 欣, 等). Food and Fermentation Industrie(食品与发酵工业), 2022, 48(24): 294.
[7] SHAO Shuai-bin, LIU Mei-han, SHI Yu-qing, et al(邵帅斌, 刘美含, 石宇晴, 等). Food Science(食品科学), 2022, 43(14): 296.
[8] ZUO Jia-qian, WANG Yu-kai, WANG Hong-qiu, et al(左佳倩, 王煜凯, 王红球, 等). The Journal of Light Scattering(光散射学报), 2022, 34(1): 1.
[9] TAN Ai-ling, CHU Zhen-yuan, WANG Xiao-si, et al(谈爱玲, 楚振原, 王晓斯, 等). Spectroscopy and Spectral Analysis(光谱学与光谱分析), 2022, 42(3): 769.
[10] HUANG Zhong-min, ZENG Wan-dan, WU Min, et al(黄忠民, 曾万聃, 吴 敏, 等). Laser Journal(激光杂志), 2022, 43(9): 55.
[11] Montavon G, Samek W, Müller K R. Digital Signal Processing, 2018, 73: 1.
[12] Zhang M L, Zhou Z H. Pattern Recognition, 2007, 40(7): 2038.
[13] LaValley M P. Circulation, 2008, 117(18): 2395.
[14] Yin G, Li L, Lu S, et al. Journal of Raman Spectroscopy, 2021, 52(5): 949.
[15] Loey M, Manogaran G, Khalifa N E M. Neural Computing and Applications, 2020. https://doi.org/10.1007/S00521-020-0537-X.