The purpose is to accurately identify women at high risk of developing cervical cancer so as to optimize cervical screening strategies and make better use of medical resources. However, the predictive models currently in use require clinical physiological and biochemical indicators, resulting in a smaller scope of application. Stacking-integrated machine learning (SIML) is an advanced machine learning technique that combined multiple learning algorithms to improve predictive performance. This study aimed to develop a stacking-integrated model that can be used to identify women at high risk of developing cervical cancer based on their demographic, behavioral, and historical clinical factors.
Methods
The data of 858 women screened for cervical cancer at a Venezuelan Hospital were used to develop the SIML algorithm. The screening data were randomly split into training data (80%) that were used to develop the algorithm and testing data (20%) that were used to validate the accuracy of the algorithms. The random forest (RF) model and univariate logistic regression were used to identify predictive features for developing cervical cancer. Twelve well-known ML algorithms were selected, and their performances in predicting cervical cancer were compared. A correlation coefficient matrix was used to cluster the models based on their performance. The SIML was then developed using the best-performing techniques. The sensitivity, specificity, and area under the curve (AUC) of all models were calculated.
Results
The RF model identified 18 features predictive of developing cervical cancer. The use of hormonal contraceptives was considered as the most important risk factor, followed by the number of pregnancies, years of smoking, and the number of sexual partners. The SIML algorithm had the best overall performance when compared with other methods and reached an AUC, sensitivity, and specificity of 0.877, 81.8%, and 81.9%, respectively.
Conclusion
This study shows that SIML can be used to accurately identify women at high risk of developing cervical cancer. This model could be used to personalize the screening program by optimizing the screening interval and care plan in highand low-risk patients based on their demographics, behavioral patterns, and clinical data.
cervical cancer screening optimization, machine learning cervical screening, HPV risk prediction model, stacking ensemble cervical cancer, demographic behavioral cancer risk, cervical cancer early detection, artificial intelligence cancer screening, population-based cancer risk assessment, cervical dysplasia risk factors, cancer screening resource optimization
PMID 35242711 35242711 DOI 10.3389/fonc.2022.821453 10.3389/fonc.2022.821453
Cite this article
Sun, L., Yang, L., Liu, X., Tang, L., Zeng, Q., Gao, Y., Chen, Q., Liu, Z., & Peng, B. (2022). Optimization of Cervical Cancer Screening: A Stacking-Integrated Machine Learning Algorithm Based on Demographic, Behavioral, and Clinical Factors. *Frontiers in oncology*, *12*, 821453. https://doi.org/10.3389/fonc.2022.821453
Sun L, Yang L, Liu X, Tang L, Zeng Q, Gao Y, et al. Optimization of Cervical Cancer Screening: A Stacking-Integrated Machine Learning Algorithm Based on Demographic, Behavioral, and Clinical Factors. Front Oncol. 2022;12:821453. doi:10.3389/fonc.2022.821453
Sun, L., et al. "Optimization of Cervical Cancer Screening: A Stacking-Integrated Machine Learning Algorithm Based on Demographic, Behavioral, and Clinical Factors." *Frontiers in oncology*, vol. 12, 2022, pp. 821453.
Alabdullatif N et al., 2022
Open Access
International Journal of Environmental Research and Public Health
Effective patient-provider communication improves mammography utilization. Using information technology (IT) promotes health outcomes. However, there are disparities in access to IT that could contrib...
Vitagliano A et al., 2021
Open Access
Diagnostics (Basel, Switzerland)
Background: Chronic endometritis (CE) and endometrial polyps (EPs) are common conditions in reproductive age women. CE is an infectious disorder of the endometrium characterized by signs of chronic in...
Thiamine or vitamin B1 is an essential, water-soluble vitamin required for mitochondrial energetics-the production of adenosine triphosphate (ATP). It is a critical and rate-limiting cofactor to multi...
Marcu I et al., 2021
Open Access
Translational Andrology and Urology
Background: Glomerulations are not specific for interstitial cystitis/bladder pain syndrome (IC/BPS). Controversy exists about whether cystoscopic findings differ between patients with and without low...