Reduction of Dutch Sentences for Automatic Subtitling

E.F. Tjong Kim Sang, W. Daelemans, A. Höthker

    Research output: Chapter in Book/Report/Conference proceeding > Conference contribution > Scientific > peer-review

    Abstract

    We compare machine learning approaches to sentence length reduction, for the automatic generation of subtitles for deaf and hearing-impaired people, with a method that relies on hand-crafted deletion rules. We describe building the necessary resources for this task: a parallel corpus of examples of news broadcasts of the Flemish VRT broadcasting corporation, and a Dutch shallow parser based on the material of the Spoken Dutch Corpus (CGN). We evaluate the sentence simplifiers and discuss their performance.
    Original language: English
    Title of host publication: Proceedings of CLIN-2003
    Place of publication: Antwerp
    Publisher: Unknown Publisher
    Pages: 109-123
    Number of pages: 15
    Publication status: Published - 2004

    Cite this

    Tjong Kim Sang, E. F., Daelemans, W., & Höthker, A. (2004). Reduction of Dutch Sentences for Automatic Subtitling. In Proceedings of CLIN-2003 (pp. 109-123). Antwerp: Unknown Publisher.