GENder-IT: An Annotated English-Italian Parallel Challenge Set for Cross-Linguistic Natural Gender Phenomena

Eva Vanmassenhove, Johanna Monti

    Research output: Chapter in Book/Report/Conference proceedingConference contributionScientificpeer-review

    7 Citations (Scopus)

    Abstract

    Languages differ in terms of the absence or presence of gender features, the number of gender classes and whether and where gender features are explicitly marked. These cross-linguistic differences can lead to ambiguities that are difficult to resolve, especially for sentence-level MT systems. The identification of ambiguity and its subsequent resolution is a challenging task for which currently there aren't any specific resources or challenge sets available. In this paper, we introduce gENder- IT, an English-Italian challenge set focusing on the resolution of natural gender phenomena by providing word-level gender tags on the English source side and multiple gender alternative translations, where needed, on the Italian target side.

    Original languageEnglish
    Title of host publicationGeBNLP 2021
    Subtitle of host publication3rd Workshop on Gender Bias in Natural Language Processing
    EditorsMarta R. Costa-jussa, Hila Gonen, Christian Hardmeier, Christian Hardmeier, Kellie Webster
    Place of Publicationonline
    PublisherAssociation for Computational Linguistics (ACL)
    Pages1-7
    Number of pages7
    Volume3
    ISBN (Electronic)9781954085619
    DOIs
    Publication statusPublished - 2021
    Event3rd Workshop on Gender Bias in Natural Language Processing, GeBNLP 2021 - Virtual, Online, Thailand
    Duration: 5 Aug 2021 → …

    Publication series

    NameGeBNLP 2021 - 3rd Workshop on Gender Bias in Natural Language Processing, Proceedings

    Conference

    Conference3rd Workshop on Gender Bias in Natural Language Processing, GeBNLP 2021
    Country/TerritoryThailand
    CityVirtual, Online
    Period5/08/21 → …

    Keywords

    • Languages
    • Cross-linguistic Differences
    • Gender

    Fingerprint

    Dive into the research topics of 'GENder-IT: An Annotated English-Italian Parallel Challenge Set for Cross-Linguistic Natural Gender Phenomena'. Together they form a unique fingerprint.

    Cite this