Health warning

Might contain multiple personalities-the problem of homonyms in Thomson Reuters Essential Science Indicators

Anne-Wil Harzing*

*Corresponding author for this work

Research output: Contribution to journalLetterScientificpeer-review

Abstract

Author name ambiguity is a crucial problem in any type of bibliometric analysis. It arises when several authors share the same name, but also when one author expresses their name in different ways. This article focuses on the former, also called the “namesake” problem. In particular, we assess the extent to which this compromises the Thomson Reuters Essential Science Indicators ranking of the top 1 % most cited authors worldwide. We show that three demographic characteristics that should be unrelated to research productivity—name origin, uniqueness of one’s family name, and the number of initials used in publishing—in fact have a very strong influence on it. In contrast to what could be expected from Web of Science publication data, researchers with Asian names—and in particular Chinese and Korean names—appear to be far more productive than researchers with Western names. Furthermore, for any country, academics with common names and fewer initials also appear to be more productive than their more uniquely named counterparts. However, this appearance of high productivity is caused purely by the fact that these “academic superstars” are in fact composites of many individual academics with the same name. We thus argue that it is high time that Thomson Reuters starts taking name disambiguation in general and non-Anglophone names in particular more seriously.
Original languageEnglish
Pages (from-to)2259-2270
JournalScientometrics
Volume105
Issue number3
DOIs
Publication statusPublished - Dec 2015
Externally publishedYes

Keywords

  • Author disambiguation
  • Homonyms
  • Essential Science Indicators
  • Research productivity

Cite this

@article{dc3aa4bd526740d7b2a7c5ffd7299468,
title = "Health warning: Might contain multiple personalities-the problem of homonyms in Thomson Reuters Essential Science Indicators",
abstract = "Author name ambiguity is a crucial problem in any type of bibliometric analysis. It arises when several authors share the same name, but also when one author expresses their name in different ways. This article focuses on the former, also called the “namesake” problem. In particular, we assess the extent to which this compromises the Thomson Reuters Essential Science Indicators ranking of the top 1 {\%} most cited authors worldwide. We show that three demographic characteristics that should be unrelated to research productivity—name origin, uniqueness of one’s family name, and the number of initials used in publishing—in fact have a very strong influence on it. In contrast to what could be expected from Web of Science publication data, researchers with Asian names—and in particular Chinese and Korean names—appear to be far more productive than researchers with Western names. Furthermore, for any country, academics with common names and fewer initials also appear to be more productive than their more uniquely named counterparts. However, this appearance of high productivity is caused purely by the fact that these “academic superstars” are in fact composites of many individual academics with the same name. We thus argue that it is high time that Thomson Reuters starts taking name disambiguation in general and non-Anglophone names in particular more seriously.",
keywords = "Author disambiguation, Homonyms, Essential Science Indicators, Research productivity",
author = "Anne-Wil Harzing",
year = "2015",
month = "12",
doi = "10.1007/s11192-015-1699-y",
language = "English",
volume = "105",
pages = "2259--2270",
journal = "Scientometrics",
issn = "0138-9130",
publisher = "Springer",
number = "3",

}

Health warning : Might contain multiple personalities-the problem of homonyms in Thomson Reuters Essential Science Indicators. / Harzing, Anne-Wil.

In: Scientometrics, Vol. 105, No. 3, 12.2015, p. 2259-2270.

Research output: Contribution to journalLetterScientificpeer-review

TY - JOUR

T1 - Health warning

T2 - Might contain multiple personalities-the problem of homonyms in Thomson Reuters Essential Science Indicators

AU - Harzing, Anne-Wil

PY - 2015/12

Y1 - 2015/12

N2 - Author name ambiguity is a crucial problem in any type of bibliometric analysis. It arises when several authors share the same name, but also when one author expresses their name in different ways. This article focuses on the former, also called the “namesake” problem. In particular, we assess the extent to which this compromises the Thomson Reuters Essential Science Indicators ranking of the top 1 % most cited authors worldwide. We show that three demographic characteristics that should be unrelated to research productivity—name origin, uniqueness of one’s family name, and the number of initials used in publishing—in fact have a very strong influence on it. In contrast to what could be expected from Web of Science publication data, researchers with Asian names—and in particular Chinese and Korean names—appear to be far more productive than researchers with Western names. Furthermore, for any country, academics with common names and fewer initials also appear to be more productive than their more uniquely named counterparts. However, this appearance of high productivity is caused purely by the fact that these “academic superstars” are in fact composites of many individual academics with the same name. We thus argue that it is high time that Thomson Reuters starts taking name disambiguation in general and non-Anglophone names in particular more seriously.

AB - Author name ambiguity is a crucial problem in any type of bibliometric analysis. It arises when several authors share the same name, but also when one author expresses their name in different ways. This article focuses on the former, also called the “namesake” problem. In particular, we assess the extent to which this compromises the Thomson Reuters Essential Science Indicators ranking of the top 1 % most cited authors worldwide. We show that three demographic characteristics that should be unrelated to research productivity—name origin, uniqueness of one’s family name, and the number of initials used in publishing—in fact have a very strong influence on it. In contrast to what could be expected from Web of Science publication data, researchers with Asian names—and in particular Chinese and Korean names—appear to be far more productive than researchers with Western names. Furthermore, for any country, academics with common names and fewer initials also appear to be more productive than their more uniquely named counterparts. However, this appearance of high productivity is caused purely by the fact that these “academic superstars” are in fact composites of many individual academics with the same name. We thus argue that it is high time that Thomson Reuters starts taking name disambiguation in general and non-Anglophone names in particular more seriously.

KW - Author disambiguation

KW - Homonyms

KW - Essential Science Indicators

KW - Research productivity

U2 - 10.1007/s11192-015-1699-y

DO - 10.1007/s11192-015-1699-y

M3 - Letter

VL - 105

SP - 2259

EP - 2270

JO - Scientometrics

JF - Scientometrics

SN - 0138-9130

IS - 3

ER -