Large scale disambiguation of scientific references in patent databases

Kangran Zhao, Emiel Caron, Stanislaw Guner

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › Scientific › peer-review

Abstract

The PATSTAT database stores information on patent applications and publications. One of its tables stores scientific references cited by patents. As such, it is a potentially powerful resource for investigating the relation between science, technology and innovation. We aim to provide a reliable way to conduct research on such databases. To this end, we employ automated data cleaning and extract bibliographic information. Furthermore, a scoring system is used, and clusters of duplicate scientific references are obtained by a clustering algorithm.
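
The pipeline outlined in the abstract (cleaning, extraction of bibliographic information, pair scoring, clustering of duplicates) can be illustrated with a minimal Python sketch. This is not the authors' implementation: the function names, the Jaccard-based score, the year bonus of 0.2, the threshold of 0.7, and the union-find clustering are all assumptions chosen for illustration, and the sample references are made up.

```python
import re
from itertools import combinations


def normalize(ref: str) -> str:
    """Clean a raw reference: lowercase, drop punctuation, collapse whitespace."""
    ref = re.sub(r"[^a-z0-9\s]", " ", ref.lower())
    return re.sub(r"\s+", " ", ref).strip()


def extract_fields(ref: str) -> dict:
    """Extract a publication year (if any) and the set of word tokens."""
    cleaned = normalize(ref)
    year = re.search(r"\b(19|20)\d{2}\b", cleaned)
    return {"year": year.group(0) if year else None, "tokens": set(cleaned.split())}


def score(a: dict, b: dict) -> float:
    """Hypothetical pair score: Jaccard token overlap plus a bonus for equal years."""
    union = a["tokens"] | b["tokens"]
    jaccard = len(a["tokens"] & b["tokens"]) / len(union) if union else 0.0
    bonus = 0.2 if a["year"] and a["year"] == b["year"] else 0.0
    return jaccard + bonus


def cluster(refs: list, threshold: float = 0.7) -> list:
    """Group references whose pair score reaches the threshold (union-find)."""
    fields = [extract_fields(r) for r in refs]
    parent = list(range(len(refs)))

    def find(i: int) -> int:
        # Follow parent links to the root, compressing the path on the way.
        while parent[i] != i:
            parent[i] = parent[parent[i]]
            i = parent[i]
        return i

    # Compare every pair; this is quadratic and only suitable for small inputs.
    for i, j in combinations(range(len(refs)), 2):
        if score(fields[i], fields[j]) >= threshold:
            parent[find(i)] = find(j)

    groups = {}
    for i in range(len(refs)):
        groups.setdefault(find(i), []).append(i)
    return sorted(groups.values())


# Made-up sample references: the first two are variants of the same citation.
refs = [
    "Smith J., Nature, vol. 405, pp. 123-130 (2000)",
    "SMITH J, Nature vol 405 pp 123 130, 2000",
    "Doe A., Science, vol. 290, pp. 55-60 (2001)",
]
clusters = cluster(refs)
print(clusters)
```

Each inner list holds the indices of references judged to be duplicates of one another. Comparing all pairs would not scale to a database the size of PATSTAT; a large-scale variant would presumably need a blocking step (for example, comparing only references that share a year) before scoring.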
Original language: English
Title of host publication: Proceedings of 21st International Conference on Science and Technology Indicators (STI 2016)
Subtitle of host publication: Peripheries, frontiers and beyond
Editors: Ismael Rafols, Jordi Molas-Gallart, Elena Castro-Martinez, Richard Woolley
Place of Publication: València (Spain)
Publisher: Editorial Universitat Politècnica de València
Pages: 1404-1410
Number of pages: 6
ISBN (Print): 9788490485194
Publication status: Published - 14 Sept 2016
Event: International Conference on Science and Technology Indicators - Valencia, Spain
Duration: 14 Sept 2016 to 16 Sept 2016
Conference number: 21
http://www.sti2016.org

Conference

Conference: International Conference on Science and Technology Indicators
Abbreviated title: STI2016
Country/Territory: Spain
City: Valencia
Period: 14/09/16 to 16/09/16
