Multi-source statistics: Basic situations and methods

Ton de Waal*, Arnout van Delden, Sander Scholtus

*Corresponding author for this work

Research output: Contribution to journalArticleScientificpeer-review

Abstract

Many National Statistical Institutes (NSIs), especially in Europe, are moving from single‐source statistics to multi‐source statistics. By combining data sources, NSIs can produce more detailed and more timely statistics and respond more quickly to events in society. By combining survey data with already available administrative data and Big Data, NSIs can save data collection and processing costs and reduce the burden on respondents. However, multi‐source statistics come with new problems that need to be overcome before the resulting output quality is sufficiently high and before those statistics can be produced efficiently. What complicates the production of multi‐source statistics is that they come in many different varieties as data sets can be combined in many different ways. Given the rapidly increasing importance of producing multi‐source statistics in Official Statistics, there has been considerable research activity in this area over the last few years, and some frameworks have been developed for multi‐source statistics. Useful as these frameworks are, they generally do not give guidelines to which method could be applied in a certain situation arising in practice. In this paper, we aim to fill that gap, structure the world of multi‐source statistics and its problems and provide some guidance to suitable methods for these problems.
Original languageEnglish
Number of pages26
JournalInternational Statistical Review
DOIs
Publication statusE-pub ahead of print - 2020

Fingerprint

Statistics
Official Statistics
Survey Data
Guidance
Output
Costs

Keywords

  • MULTIPLE IMPUTATION
  • NUMERICAL DATA
  • ONE-NUMBER CENSUS
  • POPULATION-SIZE ESTIMATION
  • SERIES
  • TRANSPORT
  • administrative data
  • data integration
  • multi-source statistics
  • statistical methods
  • survey data

Cite this

@article{2bc74129cb0b4bf7b99abaebbec38d17,
title = "Multi-source statistics: Basic situations and methods",
abstract = "Many National Statistical Institutes (NSIs), especially in Europe, are moving from single‐source statistics to multi‐source statistics. By combining data sources, NSIs can produce more detailed and more timely statistics and respond more quickly to events in society. By combining survey data with already available administrative data and Big Data, NSIs can save data collection and processing costs and reduce the burden on respondents. However, multi‐source statistics come with new problems that need to be overcome before the resulting output quality is sufficiently high and before those statistics can be produced efficiently. What complicates the production of multi‐source statistics is that they come in many different varieties as data sets can be combined in many different ways. Given the rapidly increasing importance of producing multi‐source statistics in Official Statistics, there has been considerable research activity in this area over the last few years, and some frameworks have been developed for multi‐source statistics. Useful as these frameworks are, they generally do not give guidelines to which method could be applied in a certain situation arising in practice. In this paper, we aim to fill that gap, structure the world of multi‐source statistics and its problems and provide some guidance to suitable methods for these problems.",
keywords = "MULTIPLE IMPUTATION, NUMERICAL DATA, ONE-NUMBER CENSUS, POPULATION-SIZE ESTIMATION, SERIES, TRANSPORT, administrative data, data integration, multi-source statistics, statistical methods, survey data",
author = "{de Waal}, Ton and {van Delden}, Arnout and Sander Scholtus",
year = "2020",
doi = "10.1111/insr.12352",
language = "English",
journal = "International Statistical Review",

}

Multi-source statistics : Basic situations and methods. / de Waal, Ton; van Delden, Arnout; Scholtus, Sander.

In: International Statistical Review, 2020.

Research output: Contribution to journalArticleScientificpeer-review

TY - JOUR

T1 - Multi-source statistics

T2 - Basic situations and methods

AU - de Waal, Ton

AU - van Delden, Arnout

AU - Scholtus, Sander

PY - 2020

Y1 - 2020

N2 - Many National Statistical Institutes (NSIs), especially in Europe, are moving from single‐source statistics to multi‐source statistics. By combining data sources, NSIs can produce more detailed and more timely statistics and respond more quickly to events in society. By combining survey data with already available administrative data and Big Data, NSIs can save data collection and processing costs and reduce the burden on respondents. However, multi‐source statistics come with new problems that need to be overcome before the resulting output quality is sufficiently high and before those statistics can be produced efficiently. What complicates the production of multi‐source statistics is that they come in many different varieties as data sets can be combined in many different ways. Given the rapidly increasing importance of producing multi‐source statistics in Official Statistics, there has been considerable research activity in this area over the last few years, and some frameworks have been developed for multi‐source statistics. Useful as these frameworks are, they generally do not give guidelines to which method could be applied in a certain situation arising in practice. In this paper, we aim to fill that gap, structure the world of multi‐source statistics and its problems and provide some guidance to suitable methods for these problems.

AB - Many National Statistical Institutes (NSIs), especially in Europe, are moving from single‐source statistics to multi‐source statistics. By combining data sources, NSIs can produce more detailed and more timely statistics and respond more quickly to events in society. By combining survey data with already available administrative data and Big Data, NSIs can save data collection and processing costs and reduce the burden on respondents. However, multi‐source statistics come with new problems that need to be overcome before the resulting output quality is sufficiently high and before those statistics can be produced efficiently. What complicates the production of multi‐source statistics is that they come in many different varieties as data sets can be combined in many different ways. Given the rapidly increasing importance of producing multi‐source statistics in Official Statistics, there has been considerable research activity in this area over the last few years, and some frameworks have been developed for multi‐source statistics. Useful as these frameworks are, they generally do not give guidelines to which method could be applied in a certain situation arising in practice. In this paper, we aim to fill that gap, structure the world of multi‐source statistics and its problems and provide some guidance to suitable methods for these problems.

KW - MULTIPLE IMPUTATION

KW - NUMERICAL DATA

KW - ONE-NUMBER CENSUS

KW - POPULATION-SIZE ESTIMATION

KW - SERIES

KW - TRANSPORT

KW - administrative data

KW - data integration

KW - multi-source statistics

KW - statistical methods

KW - survey data

U2 - 10.1111/insr.12352

DO - 10.1111/insr.12352

M3 - Article

JO - International Statistical Review

JF - International Statistical Review

ER -