Using a Parallel Transcript/Subtitle Corpus for Sentence Compression

V. Vandeghinste, E.F. Tjong Kim Sang

    Research output: Chapter in Book/Report/Conference proceeding; Conference contribution; Scientific; peer-review

    10 Citations (Scopus)


    In this paper we describe the collection of a parallel corpus (in Dutch) and its use in a sentence compression tool with the intention to automatically generate subtitles for the deaf from transcripts of a television program. First, the collection of the corpus is described, together with the manipulations and transformations performed on that corpus. Second, a hybrid sentence compression tool is described together with its evaluation.
    Original language: English
    Title of host publication: Proceedings of the 4th International Language Resources and Evaluation Conference (LREC 2004)
    Place of Publication: Lisbon
    Publisher: Unknown Publisher
    Number of pages: 4
    Publication status: Published - 2004

