Abstract
This case study investigates the extent to which a language model (GPT-2) is able to capture native speakers' intuitions about implicit causality in a sentence completion task. Study 1 reproduces earlier results (showing that the model's surprisal values correlate with the implicit causality bias of the verb; Davis and van Schijndel 2021) and then examines the effects of gender and verb frequency on model performance. Study 2 examines the reasoning ability of GPT-2: Is the model able to produce more sensible motivations for why the subject VERBed the object if the verbs have stronger causality biases? For this study, we took care to prevent human raters from being biased by obscenities and disfluencies generated by the model.
Original language | English |
---|---|
Title of host publication | Proceedings of the 15th International Conference on Computational Semantics |
Editors | Maxime Amblard, Ellen Breitholtz |
Place of Publication | Nancy, France |
Publisher | Association for Computational Linguistics |
Pages | 67-77 |
Number of pages | 11 |
Publication status | Published - 1 Jun 2023 |