TY - GEN
T1 - Wave to Syntax
T2 - Probing spoken language models for syntax
AU - Shen, Gaofei
AU - Alishahi, Afra
AU - Bisazza, Arianna
AU - Chrupała, Grzegorz
N1 - Publisher Copyright:
© 2023 International Speech Communication Association. All rights reserved.
PY - 2023
Y1 - 2023
N2 - Understanding which information is encoded in deep models of spoken and written language has been the focus of much research in recent years, as it is crucial for debugging and improving these architectures. Most previous work has focused on probing for speaker characteristics, acoustic and phonological information in models of spoken language, and for syntactic information in models of written language. Here we focus on the encoding of syntax in several self-supervised and visually grounded models of spoken language. We employ two complementary probing methods, combined with baselines and reference representations to quantify the degree to which syntactic structure is encoded in the activations of the target models. We show that syntax is captured most prominently in the middle layers of the networks, and more explicitly within models with more parameters.
AB - Understanding which information is encoded in deep models of spoken and written language has been the focus of much research in recent years, as it is crucial for debugging and improving these architectures. Most previous work has focused on probing for speaker characteristics, acoustic and phonological information in models of spoken language, and for syntactic information in models of written language. Here we focus on the encoding of syntax in several self-supervised and visually grounded models of spoken language. We employ two complementary probing methods, combined with baselines and reference representations to quantify the degree to which syntactic structure is encoded in the activations of the target models. We show that syntax is captured most prominently in the middle layers of the networks, and more explicitly within models with more parameters.
KW - computational linguistics
KW - speech recognition
KW - syntax
UR - http://www.scopus.com/inward/record.url?scp=85171575327&partnerID=8YFLogxK
U2 - 10.21437/Interspeech.2023-679
DO - 10.21437/Interspeech.2023-679
M3 - Conference contribution
T3 - Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH
SP - 1259
EP - 1263
BT - Proc. INTERSPEECH 2023
ER -