The Influence of Context on the Learning of Metrical Stress Systems Using Finite-State Machines

Cesko Voeten*, Menno van Zaanen

*Corresponding author for this work

Research output: Contribution to journalArticleScientificpeer-review

Abstract

Languages vary in the way stress is assigned to syllables within words. This article investigates the learnability of stress systems in a wide range of languages. The stress systems can be described using finite-state automata with symbols indicating levels of stress (primary, secondary, or no stress). Finite-state automata have been the focus of research in the area of grammatical inference for some time now. It has been shown that finite-state machines are learnable from examples using state-merging. One such approach, which aims to learn k-testable languages, has been applied to stress systems with some success. The family of k-testable languages has been shown to be efficiently learnable (in polynomial time). Here, we extend this approach to k, l-local languages by taking not only left context, but also right context, into account. We consider empirical results testing the performance of our learner using various amounts of context (corresponding to varying definitions of phonological locality). Our results show that our approach of learning stress patterns using state-merging is more reliant on left context than on right context. Additionally, some stress systems fail to be learned by our learner using either the left-context k-testable or the left-and-right-context k, l-local learning system. A more complex merging strategy, and hence grammar representation, is required for these stress systems.

Original languageEnglish
Pages (from-to)329-348
Number of pages20
JournalComputational Linguistics
Volume44
Issue number2
DOIs
Publication statusPublished - Jun 2018

Keywords

  • WORD STRESS

Cite this

Voeten, Cesko ; van Zaanen, Menno. / The Influence of Context on the Learning of Metrical Stress Systems Using Finite-State Machines. In: Computational Linguistics. 2018 ; Vol. 44, No. 2. pp. 329-348.
@article{1c9d28c1fbe4487a9621b118ccbcac42,
title = "The Influence of Context on the Learning of Metrical Stress Systems Using Finite-State Machines",
abstract = "Languages vary in the way stress is assigned to syllables within words. This article investigates the learnability of stress systems in a wide range of languages. The stress systems can be described using finite-state automata with symbols indicating levels of stress (primary, secondary, or no stress). Finite-state automata have been the focus of research in the area of grammatical inference for some time now. It has been shown that finite-state machines are learnable from examples using state-merging. One such approach, which aims to learn k-testable languages, has been applied to stress systems with some success. The family of k-testable languages has been shown to be efficiently learnable (in polynomial time). Here, we extend this approach to k, l-local languages by taking not only left context, but also right context, into account. We consider empirical results testing the performance of our learner using various amounts of context (corresponding to varying definitions of phonological locality). Our results show that our approach of learning stress patterns using state-merging is more reliant on left context than on right context. Additionally, some stress systems fail to be learned by our learner using either the left-context k-testable or the left-and-right-context k, l-local learning system. A more complex merging strategy, and hence grammar representation, is required for these stress systems.",
keywords = "WORD STRESS",
author = "Cesko Voeten and {van Zaanen}, Menno",
year = "2018",
month = "6",
doi = "10.1162/COLI_a_00317",
language = "English",
volume = "44",
pages = "329--348",
journal = "Computational Linguistics",
issn = "0891-2017",
publisher = "The MIT Press",
number = "2",

}

The Influence of Context on the Learning of Metrical Stress Systems Using Finite-State Machines. / Voeten, Cesko; van Zaanen, Menno.

In: Computational Linguistics, Vol. 44, No. 2, 06.2018, p. 329-348.

Research output: Contribution to journalArticleScientificpeer-review

TY - JOUR

T1 - The Influence of Context on the Learning of Metrical Stress Systems Using Finite-State Machines

AU - Voeten, Cesko

AU - van Zaanen, Menno

PY - 2018/6

Y1 - 2018/6

N2 - Languages vary in the way stress is assigned to syllables within words. This article investigates the learnability of stress systems in a wide range of languages. The stress systems can be described using finite-state automata with symbols indicating levels of stress (primary, secondary, or no stress). Finite-state automata have been the focus of research in the area of grammatical inference for some time now. It has been shown that finite-state machines are learnable from examples using state-merging. One such approach, which aims to learn k-testable languages, has been applied to stress systems with some success. The family of k-testable languages has been shown to be efficiently learnable (in polynomial time). Here, we extend this approach to k, l-local languages by taking not only left context, but also right context, into account. We consider empirical results testing the performance of our learner using various amounts of context (corresponding to varying definitions of phonological locality). Our results show that our approach of learning stress patterns using state-merging is more reliant on left context than on right context. Additionally, some stress systems fail to be learned by our learner using either the left-context k-testable or the left-and-right-context k, l-local learning system. A more complex merging strategy, and hence grammar representation, is required for these stress systems.

AB - Languages vary in the way stress is assigned to syllables within words. This article investigates the learnability of stress systems in a wide range of languages. The stress systems can be described using finite-state automata with symbols indicating levels of stress (primary, secondary, or no stress). Finite-state automata have been the focus of research in the area of grammatical inference for some time now. It has been shown that finite-state machines are learnable from examples using state-merging. One such approach, which aims to learn k-testable languages, has been applied to stress systems with some success. The family of k-testable languages has been shown to be efficiently learnable (in polynomial time). Here, we extend this approach to k, l-local languages by taking not only left context, but also right context, into account. We consider empirical results testing the performance of our learner using various amounts of context (corresponding to varying definitions of phonological locality). Our results show that our approach of learning stress patterns using state-merging is more reliant on left context than on right context. Additionally, some stress systems fail to be learned by our learner using either the left-context k-testable or the left-and-right-context k, l-local learning system. A more complex merging strategy, and hence grammar representation, is required for these stress systems.

KW - WORD STRESS

U2 - 10.1162/COLI_a_00317

DO - 10.1162/COLI_a_00317

M3 - Article

VL - 44

SP - 329

EP - 348

JO - Computational Linguistics

JF - Computational Linguistics

SN - 0891-2017

IS - 2

ER -