|
|
|
|
|
|
Study on the Diagnosis of Breast Cancer by Fluorescence Spectrometry Based on Machine Learning |
CHEN Wen-jing, XU Nuo, JIAO Zhao-hang, YOU Jia-hua, WANG He, QI Dong-li, FENG Yu* |
School of Science,Shenyang Ligong University,Shenyang 110158,China
|
|
|
Abstract Breast cancer is a very dangerous disease for women worldwide, its prevalence is increasing year by year, and it is the main cause of death among women worldwide. In the case of large samples, the clinical diagnosis of breast cancer is limited by the relative shortage of high-quality medical resources, the diagnosis cycle is long, and the detection cost is high. Therefore, efficient, accurate and cost-effective breast cancer diagnosis methods have broad application prospects and are urgently needed for clinical diagnosis. Fluorescence spectroscopy is a method that can characterize the combined physical and chemical changes in cells and can be used to characterize normal and cancerous cells. Machine learning is good at mining useful information from a large amount of data and is an effective classification and prediction method. In the past, machine learning mostly used databases containing some biochemical information to train models, which easily led to information loss. The fluorescence spectrum is the superimposed spectrum of multiple substances in cells, and the use of fluorescence spectrum characteristic peaks to diagnose breast cancer has the problem of quantitative uncertainty.Therefore, this paper proposes a diagnostic method combining machine learning with fluorescence spectra of breast cancer samples. The fluorescence spectrum data of normal and cancerous breast tissue (pathological diagnosis has been made) was collected as a data set, and K-nearest Neighbor (KNN), support vector machine (SVM), Random Forest (RF) three algorithms to classify the fluorescence spectrum of normal and cancerous breast tissue. The discriminant results show that compared with the SVM algorithm, the KNN and RF algorithms have higher accuracy, stronger ability to balance recall and precision, and better classification ability for breast cancer fluorescence spectra. The results of the F1-score function are all above 95%, which is more conducive to the diagnosis of breast cancer. Furthermore, the classification ability of the Weighted K-nearest Neighbor (WKNN) algorithm for normal and cancerous breast tissue fluorescence spectra were discussed. Compared with the KNN algorithm, WKNN has a small improvement in the classification evaluation results and has better anti-noise and adaptability, and the algorithm is simple. In conclusion, the breast cancer diagnosis method based on machine learning and fluorescence spectroscopy proposed in this paper has high accuracy, high speed and high-cost performance. It is the future development direction of breast cancer diagnosis methods and has important clinical application value.
|
Received: 2022-04-18
Accepted: 2022-08-29
|
|
Corresponding Authors:
FENG Yu
E-mail: 14025921@qq.com
|
|
[1] Li J, Qi D, Hsieh T-C, et al. Pharmacology & Therapeutics,2021,223: 107800.
[2] Rosen E L, Eubank W B, Mankoff D A. Radiographics, 2007, 27(Suppl 1): S215
[3] Shang L-W, Ma D-Y, Fu J-J, et al. Biomedical Optics Express, 2020, 11(7): 3673.
[4] Choi J, Sharad G, Inho P, et al. Proc. of SPIE, 2007, 6434: 64340P.
[5] Zhang L, Li L, Tang M, et al. Alexandria Engineering Journal,2021,60(1): 897.
[6] Pu Y, Sordillo L A, Yang Y, et al. Optics Letters, 2014, 39(24): 6787.
[7] Sharma S. Aggarwal A, Choudhury T. 1st International Conference on Computational Techniques, Electronics and Mechanical Systems(CTEMS), 2018: 114.
[8] Ehsani R, Drablos F. Cancer Informatics, 2020, https://doi.org/10.1177/1176935120965542.
[9] Xue J, Pu Y, Smith J, et al. Proc. of SPIE, 2018, 10504: 105040L.
[10] DING Hua-jian, YU Hong, ZHANG Zhi-lin, et al(丁华建,于 虹,张志麟,等). Applied Laser(应用激光), 2000, 20(1): 38.
[11] WANG Ke-ren, YE Yu-qin, ZHU Zhu, et al(王可人,叶玉琴,朱 珠,等). Chin. J. Lab Diagn.(中国实验诊断学), 2013, 17(2): 186.
[12] Hu Y, Zheng C, Tao W, et al. Journal of Nanoparticle Research, 2020, 22(7): 186.
[13] Mahfouz M A, Shoukry A, Ismail M A. Artificial Intelligence in Medicine, 2021, 111: 101985.
[14] KONG De-ming, DONG Rui, CUI Yao-yao, et al(孔德明,董 瑞,崔耀耀,等). Spectroscopy and Spectral Analysis(光谱学与光谱分析), 2021, 41(8): 2505.
[15] LAI Sheng-sheng, LIU Qian-cheng, YU Li-ling, et al(赖胜圣,刘虔诚,余丽玲,等). Chinese Journal of Medical Physics(中国医学物理学杂志), 2019, 36(7):826.
[16] Adem K. Physica A: Statistical Mechanics and Its Applications, 2020, 551: 124591.
[17] Fatima N, Liu L, Hong S, et al. IEEE Access, 2020, 8: 150360.
[18] Khuriwal N, Mishra N. 2018 IEEMA Engineer Infinite Conference (eTechNxT), 2018.
[19] Asri H, Mousannif H, Moatassime H A, et al. Procedia Computer Science, 2016, 83: 1064.
[20] Wang H, Xu P, Zhao J. Alexandria Engineering Journal, 2021, 61(5): 3571.
[21] Li Z, Tan J, Li S, et al. Energy and Buildings, 2019, 183: 222.
|
[1] |
LEI Hong-jun1, YANG Guang1, PAN Hong-wei1*, WANG Yi-fei1, YI Jun2, WANG Ke-ke2, WANG Guo-hao2, TONG Wen-bin1, SHI Li-li1. Influence of Hydrochemical Ions on Three-Dimensional Fluorescence
Spectrum of Dissolved Organic Matter in the Water Environment
and the Proposed Classification Pretreatment Method[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2024, 44(01): 134-140. |
[2] |
GU Yi-lu1, 2,PEI Jing-cheng1, 2*,ZHANG Yu-hui1, 2,YIN Xi-yan1, 2,YU Min-da1, 2, LAI Xiao-jing1, 2. Gemological and Spectral Characterization of Yellowish Green Apatite From Mexico[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2024, 44(01): 181-187. |
[3] |
HAN Xue1, 2, LIU Hai1, 2, LIU Jia-wei3, WU Ming-kai1, 2*. Rapid Identification of Inorganic Elements in Understory Soils in
Different Regions of Guizhou Province by X-Ray
Fluorescence Spectrometry[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2024, 44(01): 225-229. |
[4] |
WANG Hong-jian1, YU Hai-ye1, GAO Shan-yun1, LI Jin-quan1, LIU Guo-hong1, YU Yue1, LI Xiao-kai1, ZHANG Lei1, ZHANG Xin1, LU Ri-feng2, SUI Yuan-yuan1*. A Model for Predicting Early Spot Disease of Maize Based on Fluorescence Spectral Analysis[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(12): 3710-3718. |
[5] |
CHENG Hui-zhu1, 2, YANG Wan-qi1, 2, LI Fu-sheng1, 2*, MA Qian1, 2, ZHAO Yan-chun1, 2. Genetic Algorithm Optimized BP Neural Network for Quantitative
Analysis of Soil Heavy Metals in XRF[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(12): 3742-3746. |
[6] |
SONG Yi-ming1, 2, SHEN Jian1, 2, LIU Chuan-yang1, 2, XIONG Qiu-ran1, 2, CHENG Cheng1, 2, CHAI Yi-di2, WANG Shi-feng2,WU Jing1, 2*. Fluorescence Quantum Yield and Fluorescence Lifetime of Indole, 3-Methylindole and L-Tryptophan[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(12): 3758-3762. |
[7] |
YANG Ke-li1, 2, PENG Jiao-yu1, 2, DONG Ya-ping1, 2*, LIU Xin1, 2, LI Wu1, 3, LIU Hai-ning1, 3. Spectroscopic Characterization of Dissolved Organic Matter Isolated From Solar Pond[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(12): 3775-3780. |
[8] |
LU Wen-jing, FANG Ya-ping, LIN Tai-feng, WANG Hui-qin, ZHENG Da-wei, ZHANG Ping*. Rapid Identification of the Raman Phenotypes of Breast Cancer Cell
Derived Exosomes and the Relationship With Maternal Cells[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(12): 3840-3846. |
[9] |
HUANG You-ju1, TIAN Yi-chao2, 3*, ZHANG Qiang2, TAO Jin2, ZHANG Ya-li2, YANG Yong-wei2, LIN Jun-liang2. Estimation of Aboveground Biomass of Mangroves in Maowei Sea of Beibu Gulf Based on ZY-1-02D Satellite Hyperspectral Data[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(12): 3906-3915. |
[10] |
LUO Li, WANG Jing-yi, XU Zhao-jun, NA Bin*. Geographic Origin Discrimination of Wood Using NIR Spectroscopy
Combined With Machine Learning Techniques[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(11): 3372-3379. |
[11] |
FANG Zheng, WANG Han-bo. Measurement of Plastic Film Thickness Based on X-Ray Absorption
Spectrometry[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(11): 3461-3468. |
[12] |
FU Gen-shen1, LÜ Hai-yan1, YAN Li-peng1, HUANG Qing-feng1, CHENG Hai-feng2, WANG Xin-wen3, QIAN Wen-qi1, GAO Xiang4, TANG Xue-hai1*. A C/N Ratio Estimation Model of Camellia Oleifera Leaves Based on
Canopy Hyperspectral Characteristics[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(11): 3404-3411. |
[13] |
KANG Ming-yue1, 3, WANG Cheng1, SUN Hong-yan3, LI Zuo-lin2, LUO Bin1*. Research on Internal Quality Detection Method of Cherry Tomatoes Based on Improved WOA-LSSVM[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(11): 3541-3550. |
[14] |
SUN Lin1, BI Wei-hong1, LIU Tong1, WU Jia-qing1, ZHANG Bao-jun1, FU Guang-wei1, JIN Wa1, WANG Bing2, FU Xing-hu1*. Identification Algorithm of Green Algae Using Airborne Hyperspectral and Machine Learning Method[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(11): 3637-3643. |
[15] |
LI Xiao-li1, WANG Yi-min2*, DENG Sai-wen2, WANG Yi-ya2, LI Song2, BAI Jin-feng1. Application of X-Ray Fluorescence Spectrometry in Geological and
Mineral Analysis for 60 Years[J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2023, 43(10): 2989-2998. |
|
|
|
|