Findings of the WMT 2023 Shared Task on Quality Estimation.

Frédéric Blain, Chrysoula Zerva, Ricardo Ribeiro, Nuno Miguel Guerreiro, Diptesh Kanojia, José G. C. de Souza, Beatriz Silva, Tânia Vaz, Yan Jingxuan, Fatemeh Azadi, Constantin Orasan, André Martins

Research output: Chapter in Book/Report/Conference proceedingConference contributionScientificpeer-review

28 Downloads (Pure)

Abstract

We report the results of the WMT 2023 shared task on Quality Estimation, in which the challenge is to predict the quality of the output of neural machine translation systems at the word and sentence levels, without access to
reference translations. This edition introduces a few novel aspects and extensions that aim to enable more fine-grained, and explainable quality estimation approaches. We introduce an updated quality annotation scheme using Multidimensional Quality Metrics to obtain sentence- and word-level quality scores for three language pairs. We also extend the provided data to new language pairs: we specifically target low-resource languages and provide training, development and test data for English-Hindi, English-Tamil, English-Telegu and English-Gujarati as well as a zero-shot test set for English-Farsi. Further, we introduce a novel fine-grained error prediction task aspiring to motivate research towards more detailed quality predictions
Original languageEnglish
Title of host publicationProceedings of the Eighth Conference on Machine Translation
EditorsPhilipp Koehn, Barry Haddow, Tom Kocmi, Christof Monz
PublisherAssociation for Computational Linguistics
Pages629-653
Number of pages25
DOIs
Publication statusPublished - 2023
EventEight conference on machine translation - , Singapore
Duration: 6 Dec 20237 Dec 2023

Conference

ConferenceEight conference on machine translation
Abbreviated titleWMT23
Country/TerritorySingapore
Period6/12/237/12/23

Fingerprint

Dive into the research topics of 'Findings of the WMT 2023 Shared Task on Quality Estimation.'. Together they form a unique fingerprint.

Cite this