A perceptual study of how rapidly and accurately audiovisual cues to utterance-final boundaries can be interpreted in Chinese and English

Ran Bi, Marc Swerts

    Research output: Contribution to journalArticleScientificpeer-review

    3 Citations (Scopus)

    Abstract

    Speakers and their addressees make use of both auditory and visual features as cues to the end of a speaking turn. Prior work, mostly based on analyses of languages like Dutch and English, has shown that intonational markers such as melodic boundary tones as well as variation in eye gaze behaviour are often exploited to pre-signal the terminal edge of an utterance. However, we still lack knowledge on how such auditory and visual cues relate to each other, and whether the results for Dutch and English also generalize to other languages. This article compares possible audiovisual cues to prosodic boundaries in two typologically different languages, i.e., English and Chinese. A specific paradigm was used to elicit natural stimuli from 16 speakers, evenly distributed over both languages, which were then presented to L1 and L2 observers. They were asked to judge whether a spoken fragment had occurred in utterance-final position or not, measuring both the participants’ reaction time and accuracy. Participants were exposed to stimuli in three different formats: audio-only, vision-only or audiovisual. Our most important results are that (1) visual cues were important for boundary perception in both languages; (2) judges from either language group identified boundaries faster and more accurately in English than in Chinese; (3) there is no in-group advantage as observers were equally good in judging finality in their L1 and L2; (4) there are consistent correlations between the measures of reaction time and accuracy (shorter responses correlate with higher accuracy).
    Original languageEnglish
    Pages (from-to)68-77
    JournalSpeech Communication
    Volume95
    DOIs
    Publication statusPublished - Dec 2017

    Keywords

    • Chinese and English
    • L1 and L2 observers
    • Perceptual study
    • Utterance-final boundaries
    • Audiovisual cues

    Fingerprint

    Dive into the research topics of 'A perceptual study of how rapidly and accurately audiovisual cues to utterance-final boundaries can be interpreted in Chinese and English'. Together they form a unique fingerprint.

    Cite this