MEDDOPROF corpus: training set

Eulàlia Farré-Maduell; Salvador Lima-López; Antonio Miranda-Escalada; Vicent Brivá-Iglesias; Martin Krallinger

doi:10.5281/zenodo.4694769

MEDDOPROF corpus: training set

dc.contributor.author	Eulàlia Farré-Maduell
dc.contributor.author	Salvador Lima-López
dc.contributor.author	Antonio Miranda-Escalada
dc.contributor.author	Vicent Brivá-Iglesias
dc.contributor.author	Martin Krallinger
dc.coverage.spatial	Bolivia
dc.date.accessioned	2026-03-22T18:46:53Z
dc.date.available	2026-03-22T18:46:53Z
dc.date.issued	2021
dc.description.abstract	The MEDDOPROF Shared Task tackles the detection of occupations and employment statuses in clinical cases in Spanish from different specialties. Systems capable of automatically processing clinical texts are of interest to the medical community, social workers, researchers, the pharmaceutical industry, computer engineers, AI developers, policy makers, citizen’s associations and patients. Additionally, other NLP tasks (such as anonymization) can also benefit from this type of data. MEDDOPROF has three different sub-tasks: <strong>1) MEDDOPROF-NER</strong>: Participants must find the beginning and end of occupation mentions and classify them as PROFESION (PROFESSION) or SITUACION_LABORAL (WORKING_STATUS) <strong>2) MEDDOPROF-CLASS: </strong>Participants must find the beginning and end of occupation mentions and classify them according to their referent (PACIENTE [patient], FAMILIAR [family member], SANITARIO [health professional] or OTRO [other]). <strong>3) MEDDOPROF-NORM</strong>: Participants must find the beginning and end of occupation mentions and normalize them according to a reference codes list. MEDDOPROF is part of the IberLEF 2021 workshop, which is co-located with the SEPLN 2021 conference. For further information, please visit https://temu.bsc.es/meddoprof/ or email us at encargo-pln-life@bsc.es MEDDOPROF is promoted by the Plan de Impulso de las Tecnologías del Lenguaje de la Agenda Digital (Plan TL). <strong>Resources:</strong> - Web - Annotation Guidelines
dc.identifier.doi	10.5281/zenodo.4694769
dc.identifier.uri	https://doi.org/10.5281/zenodo.4694769
dc.identifier.uri	https://andeanlibrary.org/handle/123456789/72152
dc.language.iso	en
dc.publisher	European Organization for Nuclear Research
dc.relation.ispartof	Zenodo (CERN European Organization for Nuclear Research)
dc.source	Barcelona Supercomputing Center
dc.subject	Training (meteorology)
dc.subject	Set (abstract data type)
dc.subject	Natural language processing
dc.subject	Computer science
dc.subject	Training set
dc.subject	Artificial intelligence
dc.title	MEDDOPROF corpus: training set
dc.type	article

Collections

Artículo Científico Publicado

MEDDOPROF corpus: training set

Files

Collections