Abstract
We compare machine learning approaches for sentence length reduction for automatic generation of subtitles for deaf and hearing-impaired people with a method which relies on hand-crafted deletion rules. We describe building the necessary resources for this task: a parallel corpus of examples of news broadcasts of the Flemish VRT broadcasting corporation, and a Dutch shallow parser based on the material of the Spoken Dutch Corpus (CGN). We evaluate the sentence simplifiers and discuss their performance
Original language | English |
---|---|
Title of host publication | Proceedings of CLIN-2003 |
Place of Publication | Antwerp |
Publisher | Unknown Publisher |
Pages | 109-123 |
Number of pages | 15 |
Publication status | Published - 2004 |