Juan Sebastian MorenoJuan Carlos Bravo OcañaÁlvaro RiascosÁngela R. ZambranoDiana Marcela MendozaJohan GarciaSergio I. Prada2026-03-222026-03-22202310.25100/cm.v54i1.5300https://doi.org/10.25100/cm.v54i1.5300https://andeanlibrary.org/handle/123456789/49540Citaciones: 4A preliminary algorithm validation against human extraction was performed over a small set of reports with satisfactory results. This shows that a regular-expression approach can accurately and precisely extract multiple specimen attributes from free-text Spanish pathology reports. Additionally, we developed a website to facilitate collaborative validation at a larger scale which may be helpful for future research on the subject.enNatural language processingArtificial intelligenceMedicineComputer scienceInformation extractionInformation retrievalMedical physicsData miningPathologyAutomated extraction of information from free text of Spanish oncology pathology reportsarticle