Generating flexible proper name references in text: Data, models and evaluation

Thiago Castro Ferreira, Emiel Krahmer, Sander Wubben

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › Scientific › peer-review

7 Citations (Scopus)


This study introduces a statistical model able to generate variations of a proper name, taking into account the person to be mentioned, the discourse context and individual variation. The model relies on the REGnames corpus, a dataset with 53,102 proper name references to 1,000 people in different discourse contexts. We evaluate the versions of our model from the perspective of how human writers produce proper names and how human readers process them.
Original language: English
Title of host publication: Proceedings of the European Chapter of the Association for Computational Linguistics
Subtitle of host publication: EACL 2017
Publisher: Association for Computational Linguistics (ACL)
Publication status: Published - 1 Apr 2017
Event: European Chapter of the Association for Computational Linguistics 2017 - Valencia, Spain
Duration: 3 Apr 2017 – 7 Apr 2017
Conference number: 15


Conference: European Chapter of the Association for Computational Linguistics 2017
Abbreviated title: EACL 2017

