Objective: To investigate the association between phospholipase C-like 2(PLCL2) gene polymorphisms and large-artery atherosclerotic(LAA) ischemic stroke using a random forest(RF) algorithm and to construct a risk prediction model.Methods: Patients with LAA ischemic stroke admitted to the Second Affiliated Hospital of Xinjiang Medical University from July 2022 to July 2023, along with healthy controls during the same period, were enrolled. Clinical data were collected, and genotypes of the PLCL2 rs10754555 locus were analyzed. A RF model was constructed, and feature contributions were interpreted using SHAP analysis. Results: A total of 92 cases(53 males, 39 females) and 90 controls(46 males, 44 females) were included, with no significant difference in gender distribution between the two groups(P>0.05). Significant differences were observed in the genotype and allele frequencies of the PLCL2 rs107545555 locus between the two groups(P<0.05). The RF prediction model achieved AUC values of 0.964(95%CI 0.931-0.997) in the training set and 0.947(95%CI 0.895-0.999) in the test set, demonstrating good predictive performance. SHAP analysis revealed that ten indicators, including CRP, IL-6, and D-D, were risk factors, while HDL was a protective factor. The AG genotype of the PLCL2 rs107545555 locus significantly increased the risk of disease, whereas the AA and GG genotypes reduced the risk. Conclusion: The polymorphisms of the PLCL2 gene are closely associated with the risk of LAA ischemic stroke. The RF prediction model integrating multidimensional features exhibits strong predictive performance, offering potential value for early screening. |