GENder-IT: An Annotated English-Italian Parallel Challenge Set for Cross-Linguistic Natural Gender Phenomena

Eva Vanmassenhove, Johanna Monti

Research output: Contribution to conferencePaperScientificpeer-review

38 Downloads (Pure)

Abstract

Languages differ in terms of the absence or presence of gender features, the number of gender classes and whether and where gender features are explicitly marked. These cross-linguistic differences can lead to ambiguities that are difficult to resolve, especially for sentence-level MT systems. The identification of ambiguity and its subsequent resolution is a challenging task for which currently there aren't any specific resources or challenge sets available. In this paper, we introduce gENder-IT, an English--Italian challenge set focusing on the resolution of natural gender phenomena by providing word-level gender tags on the English source side and multiple gender alternative translations, where needed, on the Italian target side.
Original languageEnglish
Pages1-7
Number of pages7
DOIs
Publication statusPublished - 5 Aug 2021
Event3th Workshop on Gender Bias in Natural Language Processing : co-located with ACL-IJCNLP - Bangkok, Thailand
Duration: 5 Aug 20215 Aug 2021
https://genderbiasnlp.talp.cat/gebnlp2021/

Workshop

Workshop3th Workshop on Gender Bias in Natural Language Processing
Abbreviated titleGeBNLP
Country/TerritoryThailand
CityBangkok
Period5/08/215/08/21
Internet address

Keywords

  • cs.CL
  • cs.AI
  • cs.CY

Fingerprint

Dive into the research topics of 'GENder-IT: An Annotated English-Italian Parallel Challenge Set for Cross-Linguistic Natural Gender Phenomena'. Together they form a unique fingerprint.

Cite this