Value learning through reinforcement: The basics of dopamine and reinforcement learning

N.D. Daw, P.N. Tobler

Research output: Chapter in Book/Report/Conference proceedingChapterScientificpeer-review

Abstract

This chapter provides an overview of reinforcement learning and temporal difference learning and relates these topics to the firing properties of midbrain dopamine neurons. First, we review the RescorlaWagner learning rule and basic learning phenomena, such as blocking, which the rule explains. Then we introduce the basic functional anatomy of the dopamine system and review studies that reveal a close correspondence between responses emitted by dopamine neurons and signals predicted by reinforcement learning. Finally, we introduce the generalization of the RescorlaWagner rule to sequential redictions as provided by temporal difference learning, and discuss its application to phasic activation changes of dopamine neurons. Subsequent chapters in this section deal with more advanced topics in reinforcement learning and presume that the reader is familiar with material covered in this chapter.
Original languageEnglish
Title of host publicationNeuroeconomics (2nd edition)
EditorsP.W. Glimcher, E. Fehr
Place of PublicationOxford
PublisherAcademic Press
Pages283-298
Number of pages560
ISBN (Print)9780124160088
Publication statusPublished - 2013

Fingerprint

reinforcement
learning
Values
activation

Cite this

Daw, N. D., & Tobler, P. N. (2013). Value learning through reinforcement: The basics of dopamine and reinforcement learning. In P. W. Glimcher, & E. Fehr (Eds.), Neuroeconomics (2nd edition) (pp. 283-298). Oxford: Academic Press.
Daw, N.D. ; Tobler, P.N. / Value learning through reinforcement : The basics of dopamine and reinforcement learning. Neuroeconomics (2nd edition). editor / P.W. Glimcher ; E. Fehr. Oxford : Academic Press, 2013. pp. 283-298
@inbook{f02e6dad1f18413298583262400c9554,
title = "Value learning through reinforcement: The basics of dopamine and reinforcement learning",
abstract = "This chapter provides an overview of reinforcement learning and temporal difference learning and relates these topics to the firing properties of midbrain dopamine neurons. First, we review the RescorlaWagner learning rule and basic learning phenomena, such as blocking, which the rule explains. Then we introduce the basic functional anatomy of the dopamine system and review studies that reveal a close correspondence between responses emitted by dopamine neurons and signals predicted by reinforcement learning. Finally, we introduce the generalization of the RescorlaWagner rule to sequential redictions as provided by temporal difference learning, and discuss its application to phasic activation changes of dopamine neurons. Subsequent chapters in this section deal with more advanced topics in reinforcement learning and presume that the reader is familiar with material covered in this chapter.",
author = "N.D. Daw and P.N. Tobler",
note = "Pagination: 560",
year = "2013",
language = "English",
isbn = "9780124160088",
pages = "283--298",
editor = "P.W. Glimcher and E. Fehr",
booktitle = "Neuroeconomics (2nd edition)",
publisher = "Academic Press",

}

Daw, ND & Tobler, PN 2013, Value learning through reinforcement: The basics of dopamine and reinforcement learning. in PW Glimcher & E Fehr (eds), Neuroeconomics (2nd edition). Academic Press, Oxford, pp. 283-298.

Value learning through reinforcement : The basics of dopamine and reinforcement learning. / Daw, N.D.; Tobler, P.N.

Neuroeconomics (2nd edition). ed. / P.W. Glimcher; E. Fehr. Oxford : Academic Press, 2013. p. 283-298.

Research output: Chapter in Book/Report/Conference proceedingChapterScientificpeer-review

TY - CHAP

T1 - Value learning through reinforcement

T2 - The basics of dopamine and reinforcement learning

AU - Daw, N.D.

AU - Tobler, P.N.

N1 - Pagination: 560

PY - 2013

Y1 - 2013

N2 - This chapter provides an overview of reinforcement learning and temporal difference learning and relates these topics to the firing properties of midbrain dopamine neurons. First, we review the RescorlaWagner learning rule and basic learning phenomena, such as blocking, which the rule explains. Then we introduce the basic functional anatomy of the dopamine system and review studies that reveal a close correspondence between responses emitted by dopamine neurons and signals predicted by reinforcement learning. Finally, we introduce the generalization of the RescorlaWagner rule to sequential redictions as provided by temporal difference learning, and discuss its application to phasic activation changes of dopamine neurons. Subsequent chapters in this section deal with more advanced topics in reinforcement learning and presume that the reader is familiar with material covered in this chapter.

AB - This chapter provides an overview of reinforcement learning and temporal difference learning and relates these topics to the firing properties of midbrain dopamine neurons. First, we review the RescorlaWagner learning rule and basic learning phenomena, such as blocking, which the rule explains. Then we introduce the basic functional anatomy of the dopamine system and review studies that reveal a close correspondence between responses emitted by dopamine neurons and signals predicted by reinforcement learning. Finally, we introduce the generalization of the RescorlaWagner rule to sequential redictions as provided by temporal difference learning, and discuss its application to phasic activation changes of dopamine neurons. Subsequent chapters in this section deal with more advanced topics in reinforcement learning and presume that the reader is familiar with material covered in this chapter.

M3 - Chapter

SN - 9780124160088

SP - 283

EP - 298

BT - Neuroeconomics (2nd edition)

A2 - Glimcher, P.W.

A2 - Fehr, E.

PB - Academic Press

CY - Oxford

ER -

Daw ND, Tobler PN. Value learning through reinforcement: The basics of dopamine and reinforcement learning. In Glimcher PW, Fehr E, editors, Neuroeconomics (2nd edition). Oxford: Academic Press. 2013. p. 283-298