Abstract
BACKGROUND: Diabetes mellitus has been shown to increase the risk of dementia, with diabetic patients demonstrating twice the dementia incidence rate of non-diabetic populations. We aimed to develop and validate a novel machine learning-based dementia risk prediction tool specifically tailored for diabetic population.</p>
METHODS: Using a prospective from 42,881 diabetic individuals in the UK Biobank, a rigorous multi-stage selection framework was implemented to optimize feature-outcome associations from 190 variables, and 32 predictors were final retained. Subsequently, eight data analysis strategies were used to develop and validate the dementia risk prediction model. Model performance was assessed using area under the curve (AUC) metrics.</p>
RESULTS: During a median follow-up of 9.60 years, 1337 incident dementia cases were identified among diabetic population. The Adaboost classifier demonstrated robust performance across different predictor sets: full model with 32 predictors versus streamlined simplified model with 13 predictors selected through forward feature subset selection algorithm (AUC: 0.805 ± 0.005 vs. 0.801 ± 0.005; p = 0.200) in model development employing an 8:2 data split (5-fold cross-validation for training). To facilitate community generalization and clinical applicability, the simplified model, named DRP-Diabetes, was deployed to a visual interactive web application for individualized dementia risk assessment.</p>
LIMITATIONS: Some variables were based on self-reported.</p>
CONCLUSIONS: A convenient and reliable dementia risk prediction tool was developed and validated for diabetic population, which could help individuals identify their potential risk profile and provide guidance on precise and timely actions to promote dementia delay or prevention.</p>