Economies-of-scale in many-server queuing systems: Tutorial and partial review of the QED Halfin-Whitt heavy-traffic regime

Johan S.H. van Leeuwaarden*, Britt W. J. Mathijsen, Bert Zwart

*Corresponding author for this work

Research output: Contribution to journalReview articlepeer-review

Abstract

Multiserver queueing systems describe situations in which users require service from multiple parallel servers. Examples include check-in lines at airports, waiting rooms in hospitals, queues in contact centers, data buffers in wireless networks, and delayed service in cloud data centers. These are all situations with jobs (clients, patients, tasks) and servers (agents, beds, processors) that have large capacity levels, ranging from the order of tens (checkouts) to thousands (processors). This survey investigates how to design such systems to exploit resource pooling and economies-of-scale. In particular, we review the mathematics behind the quality-and efficiency-driven (QED) regime, which lets the system operate close to full utilization, while the number of servers grows simultaneously large and delays remain manageable. Aimed at a broad audience, we describe in detail the mathematical concepts for the basic Markovian many-server system, and we provide only sketches or references for more advanced settings related to, e.g., load balancing, overdispersion, parameter uncertainty, general service requirements, and queueing networks. While serving as a partial survey of a massive body of work, the tutorial is not meant to be exhaustive.

Original languageEnglish
Pages (from-to)403-440
Number of pages38
JournalSIAM Review
Volume61
Issue number3
DOIs
Publication statusPublished - Sept 2019

Keywords

  • stochastic-process limits
  • queueing theory
  • central limit theorem
  • heavy traffic
  • limit theorems
  • POINTWISE STATIONARY APPROXIMATION
  • STAFFING CALL CENTERS
  • TIME-VARYING DEMAND
  • DIFFUSION APPROXIMATIONS
  • SERVICE SYSTEMS
  • STABILIZING PERFORMANCE
  • CUSTOMER ABANDONMENT
  • IMPATIENT CUSTOMERS
  • DELAY ANNOUNCEMENTS
  • MULTISERVER QUEUE

Fingerprint

Dive into the research topics of 'Economies-of-scale in many-server queuing systems: Tutorial and partial review of the QED Halfin-Whitt heavy-traffic regime'. Together they form a unique fingerprint.

Cite this