Premature mortality from cardio-cerebrovascular diseases in Bogotá an analytical machine learning approach

dc.contributor.authorYeimmy Carolina Malagón Sintura
dc.contributor.authorWanderley Augusto Arias Ortíz
dc.coverage.spatialBolivia
dc.date.accessioned2026-03-22T20:04:21Z
dc.date.available2026-03-22T20:04:21Z
dc.date.issued2026
dc.description.abstractPremature mortality from cardio-cerebrovascular diseases represents an increasing burden on health systems, particularly in urban contexts across Latin America. This study analyzes mortality records in Bogotá from 2010 to 2022 via descriptive analysis, time series, and machine learning models. It includes deaths among individuals aged over 30, classified as premature or nonpremature based on a 75-year threshold1. Supervised models were trained using sociodemographic, insurance-related, and underlying cause-of-death variables, and their performance was evaluated via standard metrics. The random forest model showed the best overall performance, with educational level, insurance scheme, and place of death emerging as the main predictors. Additionally, separate models were developed for diagnostic groups (ischemic, cerebrovascular, hypertensive, and heart failure) and revealed differences in classification patterns. The model for ischemic heart disease achieved the highest AUC (0.69), followed by cerebrovascular (0.65), hypertensive (0.63), and heart failure (0.61). SHAP analysis highlighted the differential contribution of sociodemographic variables such as place of death, sex, educational level, and insurance scheme, with distinct patterns observed across causes of death. Trend analysis revealed a sustained increase in premature mortality, which increased during the pandemic period. These findings underscore the role of social determinants in premature cardiovascular deaths and highlight the potential of machine learning as a decision-support tool for public health.
dc.identifier.doi10.1038/s41598-026-39453-z
dc.identifier.urihttps://doi.org/10.1038/s41598-026-39453-z
dc.identifier.urihttps://andeanlibrary.org/handle/123456789/79818
dc.language.isoen
dc.publisherNature Portfolio
dc.relation.ispartofScientific Reports
dc.sourceUniversidad del Rosario
dc.subjectMachine learning
dc.subjectMedicine
dc.subjectArtificial intelligence
dc.subjectPublic health
dc.subjectHeart disease
dc.subjectRandom forest
dc.subjectDisease
dc.subjectPandemic
dc.subjectPremature birth
dc.subjectDescriptive statistics
dc.titlePremature mortality from cardio-cerebrovascular diseases in Bogotá an analytical machine learning approach
dc.typearticle

Files