Words with Consistent Diachronic Usage Patterns are Learned Earlier: A Computational Analysis Using Temporally Aligned Word Embeddings

Giovanni Cassani, Federico Bianchi, Marco Marelli

    Research output: Contribution to journalArticleScientificpeer-review

    7 Citations (Scopus)

    Abstract

    In this study, we use temporally aligned word embeddings and a large diachronic corpus of English to quantify language change in a data-driven, scalable way, which is grounded in language use. We show a unique and reliable relation between measures of language change and age of acquisition (AoA) while controlling for frequency, contextual diversity, concreteness, length, dominant part of speech, orthographic neighborhood density, and diachronic frequency variation. We analyze measures of language change tackling both the change in lexical representations and the change in the relation between lexical representations and the words with the most similar usage patterns, showing that they capture different aspects of language change. Our results show a unique relation between language change and AoA, which is stronger when considering neighborhood-level measures of language change: Words with more coherent diachronic usage patterns tend to be acquired earlier. The results support theories positing a link between ontogenetic and ethnogenetic processes in language.

    Original languageEnglish
    Article numbere12963
    Pages (from-to)e12963
    Number of pages29
    JournalCognitive Science
    Volume45
    Issue number4
    DOIs
    Publication statusPublished - Apr 2021

    Keywords

    • Age of acquisition
    • Computational psycholinguistics
    • Language change
    • Temporally aligned word embeddings

    Fingerprint

    Dive into the research topics of 'Words with Consistent Diachronic Usage Patterns are Learned Earlier: A Computational Analysis Using Temporally Aligned Word Embeddings'. Together they form a unique fingerprint.

    Cite this