Objective: To construct a predictive model for secondary tumors in patients with acute lymphoblastic leukemia(ALL) using the Surveillance, Epidemiology, and End Results(SEER) database. Methods: We systematically screened clinical data of 19 683 ALL patients from the SEER database. Eligible subjects were randomly allocated into a training set(n=13 778) and testing set(n=5 905) at a 7∶3 ratio. Patients in the training set were stratified into secondary tumor and non-secondary tumor groups. Logistic regression identified factors influencing secondary tumor development. The analysis results were incorporated into R software to construct and validate a nomogram prediction model. Results: Among 19 683 ALL patients, 383(1.95%) developed secondary tumors. Compared with the general population, ALL patients showed significantly elevated secondary tumor risk [standardized incidence ratio(SIR)=2.354, 95%CI 2.125-2.687], with an absolute excess risk(AER) of 16.852. There were significant differences in age, immunotyping, chemotherapy, radiotherapy and WBC between the secondary group and the non-secondary group(P<0.05).Logistic regression revealed age, radiotherapy, chemotherapy, and WBC ≥100×109 L-1 as significant predictors(P<0.05). The nomogram model demonstrated good predictive performance with AUCs of 0.876(training set) and 0.865(testing set), sensitivities of 96.70% and 94.30%, specificities of 79.50% and 78.70%(P<0.001), respectively. The C-index reached 0.889(95%CI 0.812-0.994). Calibration curves showed close alignment with ideal predictions, supported by Hosmer-Lemeshow test(χ2=2.843, P=0.673). Conclusion: Age, chemotherapy, radiotherapy, and WBC ≥100×10 L-1 are significant risk factors for secondary tumors in ALL patients. The developed nomogram model shows excellent predictive capacity and clinical utility. |