Drawing Insights: Sequential Representation Learning in Comics

Neil Cohn, Nanne van Noord, Sam Titarsolej

Research output: Chapter in Book/Report/Conference proceedingConference contributionScientificpeer-review

9 Downloads (Pure)

Abstract

Comics present images in a sequence, where the spatially presented sequence is key to the narrative storytelling. To understand a comic, a comprehender must learn to encode this sequential nature. For this we present a novel self-supervised sequential representation learning method designed for comics. Our approach capitalises on the sequential structure of comics to incorporate contextual information. We conduct experiments on the TINTIN Corpus of 1,000+ comics from 144 countries, and show that our method outperforms baseline methods on both classification and retrieval tasks. These results affirm the effectiveness of sequential representation learning for comics, and may aid in uncovering new cultural insights within comics.
Original languageEnglish
Title of host publication35th British Machine Vision Conference 2024
Number of pages13
Publication statusPublished - 2024
EventThe 35th British Machine Vision Conference - Glasgow, United Kingdom
Duration: 25 Nov 202428 Nov 2024
Conference number: 35

Conference

ConferenceThe 35th British Machine Vision Conference
Abbreviated titleBMVC 2024
Country/TerritoryUnited Kingdom
CityGlasgow
Period25/11/2428/11/24

Keywords

  • comics
  • visual language
  • machine learning
  • large language models
  • style
  • style analysis

Fingerprint

Dive into the research topics of 'Drawing Insights: Sequential Representation Learning in Comics'. Together they form a unique fingerprint.

Cite this