gENder-IT: An Annotated English-Italian Parallel Challenge Set for Cross-Linguistic Natural Gender Phenomena

dc.contributor.authorEva Vanmassenhove
dc.contributor.authorJohanna Monti
dc.coverage.spatialBolivia
dc.date.accessioned2026-03-22T14:39:58Z
dc.date.available2026-03-22T14:39:58Z
dc.date.issued2021
dc.descriptionCitaciones: 11
dc.description.abstractLanguages differ in terms of the absence or presence of gender features, the number of gender classes and whether and where gender features are explicitly marked. These cross-linguistic differences can lead to ambiguities that are difficult to resolve, especially for sentence-level MT systems. The identification of ambiguity and its subsequent resolution is a challenging task for which currently there aren't any specific resources or challenge sets available. In this paper, we introduce gENder-IT, an English-Italian challenge set focusing on the resolution of natural gender phenomena by providing word-level gender tags on the English source side and multiple gender alternative translations, where needed, on the Italian target side.
dc.identifier.doi10.18653/v1/2021.gebnlp-1.1
dc.identifier.urihttps://doi.org/10.18653/v1/2021.gebnlp-1.1
dc.identifier.urihttps://andeanlibrary.org/handle/123456789/47839
dc.language.isoen
dc.sourceTilburg University
dc.subjectAmbiguity
dc.subjectSentence
dc.subjectSet (abstract data type)
dc.subjectComputer science
dc.subjectNatural language processing
dc.subjectIdentification (biology)
dc.subjectResolution (logic)
dc.subjectLinguistics
dc.subjectTask (project management)
dc.subjectWord (group theory)
dc.titlegENder-IT: An Annotated English-Italian Parallel Challenge Set for Cross-Linguistic Natural Gender Phenomena
dc.typearticle

Files