Premature mortality from cardio-cerebrovascular diseases in Bogotá an analytical machine learning approach
| dc.contributor.author | Yeimmy Carolina Malagón Sintura | |
| dc.contributor.author | Wanderley Augusto Arias Ortíz | |
| dc.coverage.spatial | Bolivia | |
| dc.date.accessioned | 2026-03-22T20:04:21Z | |
| dc.date.available | 2026-03-22T20:04:21Z | |
| dc.date.issued | 2026 | |
| dc.description.abstract | Premature mortality from cardio-cerebrovascular diseases represents an increasing burden on health systems, particularly in urban contexts across Latin America. This study analyzes mortality records in Bogotá from 2010 to 2022 via descriptive analysis, time series, and machine learning models. It includes deaths among individuals aged over 30, classified as premature or nonpremature based on a 75-year threshold1. Supervised models were trained using sociodemographic, insurance-related, and underlying cause-of-death variables, and their performance was evaluated via standard metrics. The random forest model showed the best overall performance, with educational level, insurance scheme, and place of death emerging as the main predictors. Additionally, separate models were developed for diagnostic groups (ischemic, cerebrovascular, hypertensive, and heart failure) and revealed differences in classification patterns. The model for ischemic heart disease achieved the highest AUC (0.69), followed by cerebrovascular (0.65), hypertensive (0.63), and heart failure (0.61). SHAP analysis highlighted the differential contribution of sociodemographic variables such as place of death, sex, educational level, and insurance scheme, with distinct patterns observed across causes of death. Trend analysis revealed a sustained increase in premature mortality, which increased during the pandemic period. These findings underscore the role of social determinants in premature cardiovascular deaths and highlight the potential of machine learning as a decision-support tool for public health. | |
| dc.identifier.doi | 10.1038/s41598-026-39453-z | |
| dc.identifier.uri | https://doi.org/10.1038/s41598-026-39453-z | |
| dc.identifier.uri | https://andeanlibrary.org/handle/123456789/79818 | |
| dc.language.iso | en | |
| dc.publisher | Nature Portfolio | |
| dc.relation.ispartof | Scientific Reports | |
| dc.source | Universidad del Rosario | |
| dc.subject | Machine learning | |
| dc.subject | Medicine | |
| dc.subject | Artificial intelligence | |
| dc.subject | Public health | |
| dc.subject | Heart disease | |
| dc.subject | Random forest | |
| dc.subject | Disease | |
| dc.subject | Pandemic | |
| dc.subject | Premature birth | |
| dc.subject | Descriptive statistics | |
| dc.title | Premature mortality from cardio-cerebrovascular diseases in Bogotá an analytical machine learning approach | |
| dc.type | article |