Visually grounded models of spoken language - A survey of datasets, architectures and evaluation techniques.

Research output: Contribution to journalArticleScientificpeer-review

Original languageEnglish
JournalJournal of Artificial Intelligence Research
Volumeabs/2104.13225
Publication statusSubmitted - 2021

Cite this