Abstract
Developments in language technology targeting signed languages are lagging behind in comparison to the advances related to what is available for so-called spoken languages. This is partly due to the scarcity of good quality signed language data, including good quality parallel corpora of signed and spoken languages. This paper introduces two parallel corpora which aim at reducing the gap between signed and spoken-only language technology: The XSL Hotel Review Corpus (XSL-HoReCo) and the Gold Standard Parallel Corpus of Signed and Spoken Language (GoSt-ParC-Sign). Both corpora are available through the CLARIN infrastructure.
Original language | English |
---|---|
Title of host publication | Selected papers from the CLARIN Annual Conference 2023 |
Editors | Krister Lindén, Thalassia Kontino, Jyrki Niemi |
Publisher | Linköping Electronic Conference Proceedings 210 |
Number of pages | 11 |
ISBN (Print) | 978-91-8075-740-9 |
DOIs | |
Publication status | Published - 9 Jul 2024 |