Reduction of Dutch Sentences for Automatic Subtitling

E.F. Tjong Kim Sang, W. Daelemans, A. Höthker

    Research output: Chapter in Book/Report/Conference proceedingConference contributionScientificpeer-review

    Abstract

    We compare machine learning approaches for sentence length reduction for automatic generation of subtitles for deaf and hearing-impaired people with a method which relies on hand-crafted deletion rules. We describe building the necessary resources for this task: a parallel corpus of examples of news broadcasts of the Flemish VRT broadcasting corporation, and a Dutch shallow parser based on the material of the Spoken Dutch Corpus (CGN). We evaluate the sentence simplifiers and discuss their performance
    Original languageEnglish
    Title of host publicationProceedings of CLIN-2003
    Place of PublicationAntwerp
    PublisherUnknown Publisher
    Pages109-123
    Number of pages15
    Publication statusPublished - 2004

      Fingerprint

    Cite this

    Tjong Kim Sang, E. F., Daelemans, W., & Höthker, A. (2004). Reduction of Dutch Sentences for Automatic Subtitling. In Proceedings of CLIN-2003 (pp. 109-123). Antwerp: Unknown Publisher.