Abstract
Recent years have seen an increasing need for gender-neutral and inclusive language. Within the field of NLP, there are various mono- and bilingual use cases where gender inclusive language is appropriate, if not preferred due to ambiguity or uncertainty in terms of the gender of referents. In this work, we present a rule-based and a neural approach to gender-neutral rewriting for English along with manually curated synthetic data (WinoBias+) and natural data (OpenSubtitles and Reddit) benchmarks. A detailed manual and automatic evaluation highlights how our NeuTral Rewriter, trained on data generated by the rule-based approach, obtains word error rates (WER) below 0.18% on synthetic, in-domain and out-domain test sets.
Original language | English |
---|---|
Pages | 8940–8948 |
Number of pages | 8 |
DOIs | |
Publication status | Published - 13 Sept 2021 |
Event | The 2021 Conference on Empirical Methods in Natural Language Processing - Punta Cana, Punta Cana, Dominica Duration: 7 Nov 2022 → 11 Nov 2022 https://2021.emnlp.org/ |
Conference
Conference | The 2021 Conference on Empirical Methods in Natural Language Processing |
---|---|
Abbreviated title | EMNLP 2021 |
Country/Territory | Dominica |
City | Punta Cana |
Period | 7/11/22 → 11/11/22 |
Internet address |
Keywords
- cs.CL
- cs.AI