Findings of the WMT 2022 Shared Task on Quality Estimation

Chrysoula Zerva, Frédéric Blain, Ricardo Rei, Piyawat Lertvittayakumjorn, José G.C. de Souza, Steffen Eger, Diptesh Kanojia, Duarte Alves, Constantin Orăsan, Marina Fomicheva, André F.T. Martins, Lucia Specia

    Research output: Chapter in Book/Report/Conference proceedingConference contributionScientificpeer-review

    60 Citations (Scopus)

    Abstract

    We report the results of the WMT 2022 shared task on Quality Estimation, in which the challenge is to predict the quality of the output of neural machine translation systems at the word and sentence levels, without access to reference translations. This edition introduces a few novel aspects and extensions that aim to enable more fine-grained, and explainable quality estimation approaches. We introduce an updated quality annotation scheme using Multidimensional Quality Metrics to obtain sentence- and word-level quality scores for three language pairs. We also extend the Direct Assessments and post-edit data (MLQE-PE) to new language pairs: we present a novel and large dataset on English-Marathi, as well as a zero-shot test-set on English-Yoruba. Further, we include an explainability sub-task for all language pairs and present a new format of a critical error detection task for two new language pairs. Participants from 11 different teams submitted altogether 991 systems to different task variants and language pairs.

    Original languageEnglish
    Title of host publicationWMT 2022 - 7th Conference on Machine Translation, Proceedings of the Conference
    PublisherAssociation for Computational Linguistics
    Pages69-99
    Number of pages31
    ISBN (Electronic)9781959429296
    Publication statusPublished - 2022
    Event7th Conference on Machine Translation, WMT 2022 - Abu Dhabi, United Arab Emirates
    Duration: 7 Dec 20228 Dec 2022

    Publication series

    NameConference on Machine Translation - Proceedings
    ISSN (Electronic)2768-0983

    Conference

    Conference7th Conference on Machine Translation, WMT 2022
    Country/TerritoryUnited Arab Emirates
    CityAbu Dhabi
    Period7/12/228/12/22

    Keywords

    • neural machine translation systems

    Fingerprint

    Dive into the research topics of 'Findings of the WMT 2022 Shared Task on Quality Estimation'. Together they form a unique fingerprint.

    Cite this