Sage Journals: Discover world-class research

Abstract

A number of accounts of human and animal behavior posit the operation of parallel and competing valuation systems in the control of choice behavior. In these accounts, a flexible but computationally expensive model-based reinforcement-learning system has been contrasted with a less flexible but more efficient model-free reinforcement-learning system. The factors governing which system controls behavior—and under what circumstances—are still unclear. Following the hypothesis that model-based reinforcement learning requires cognitive resources, we demonstrated that having human decision makers perform a demanding secondary task engenders increased reliance on a model-free reinforcement-learning strategy. Further, we showed that, across trials, people negotiate the trade-off between the two systems dynamically as a function of concurrent executive-function demands, and people’s choice latencies reflect the computational expenses of the strategy they employ. These results demonstrate that competition between multiple learning systems can be controlled on a trial-by-trial basis by modulating the availability of cognitive resources.

Keywords

cognitive neuroscience decision making

Get full access to this article

View all access options for this article.

References

Conway

A. R. A.

Kane

M. J.

Engle

R. W.

(2003). Working memory capacity and its relation to general intelligence. Trends in Cognitive Sciences, 7, 547–552.

Daw

N. D.

Gershman

S. J.

Seymour

Dayan

Dolan

R. J.

(2011). Model-based influences on humans’ choices and striatal prediction errors. Neuron, 69, 1204–1215.

Daw

N. D.

Niv

Dayan

(2005). Uncertainty-based competition between prefrontal and dorsolateral striatal systems for behavioral control. Nature Neuroscience, 8, 1704–1711.

Dayan

(2009). Goal-directed control and its antipodes. Neural Networks, 22, 213–219.

Dickinson

(1985). Actions and habits: The development of behavioural autonomy. Philosophical Transactions of the Royal Society B: Biological Sciences, 308, 67–78.

Dickinson

Balleine

(2004). The role of learning in the operation of motivational systems. In Gallistel

(Ed.), Stevens’ handbook of experimental psychology: Vol. 3. Learning, motivation, and emotion (3rd ed.). Hoboken, NJ: Wiley.

Everitt

B. J.

Robbins

T. W.

(2005). Neural systems of reinforcement for drug addiction: From actions to habits to compulsion. Nature Neuroscience, 8, 1481–1489.

Foerde

Knowlton

B. J.

Poldrack

R. A.

(2006). Modulation of competing memory systems by distraction. Proceedings of the National Academy of Sciences, USA, 103, 11778–11783.

Gläscher

Daw

Dayan

O’Doherty

J. P.

(2010). States versus rewards: Dissociable neural prediction error signals underlying model-based and model-free reinforcement learning. Neuron, 66, 585–595.

10.

Kahneman

Frederick

(2002). Representativeness revisited: Attribute substitution in intuitive judgment. In Gilovich

Griffin

Kahneman

(Eds.), Heuristics and biases: The psychology of intuitive judgment (pp. 49–81). Cambridge, England: Cambridge University Press.

11.

Keramati

Dezfouli

Piray

(2011). Speed/accuracy trade-off between the habitual and the goal-directed processes. PLoS Computational Biology, 7(5), e1002055. Retrieved from http://www.ploscompbiol.org/article/info%3Adoi%2F10.1371%2Fjournal.pcbi.1002055

12.

Loewenstein

O’Donoghue

(2004). Animal spirits: Affective and deliberative processes in economic behavior (Working Papers No. 04–14). Ithaca, NY: Cornell University Center for Analytic Economics. Retrieved from http://ideas.repec.org/p/ecl/corcae/04-14.html

13.

McClure

S. M.

Laibson

D. I.

Loewenstein

Cohen

J. D.

(2004). Separate neural systems value immediate and delayed monetary rewards. Science, 306, 503–507.

14.

Miyake

Friedman

N. P.

Emerson

M. J.

Witzki

A. H.

Howerter

Wager

T. D.

(2000). The unity and diversity of executive functions and their contributions to complex “frontal lobe” tasks: A latent variable analysis. Cognitive Psychology, 41, 49–100.

15.

Montague

P. R.

Dayan

Sejnowski

T. J.

(1996). A framework for mesencephalic dopamine systems based on predictive Hebbian learning. The Journal of Neuroscience, 16, 1936–1947.

16.

Norman

D. A.

Shallice

(1986). Attention to action: Willed and automatic control of behavior. In Davidson

R. J.

Schwartz

G. E.

Shapiro

(Eds.), Consciousness and self-regulation: Advances in research and theory (Vol. 4, pp. 1–18). New York, NY: Plenum.

17.

O’Doherty

J. P.

Dayan

Friston

Critchley

Dolan

R. J.

(2003). Temporal difference models and reward-related learning in the human brain. Neuron, 38, 329–337.

18.

Otto

A. R.

Taylor

E. G.

Markman

A. B.

(2011). There are at least two kinds of probability matching: Evidence from a secondary task. Cognition, 118, 274–279.

19.

Payne

J. W.

Bettman

J. R.

Johnson

E. J.

(1993). The adaptive decision maker. Cambridge, England: Cambridge University Press.

20.

Pinheiro

J. C.

Bates

D. M.

(2000). Mixed-effects models in S and S-PLUS. New York, NY: Springer.

21.

Schultz

Dayan

Montague

P. R.

(1997). A neural substrate of prediction and reward. Science, 275, 1593–1599.

22.

Sutton

R. S.

(1990). Integrated architecture for learning, planning, and reacting based on approximating dynamic programming. In Morgan

M. B.

(Ed.), Proceedings of the Seventh International Conference (1990) on Machine Learning (pp. 216–224). San Francisco, CA: Morgan Kaufmann.

23.

Sutton

R. S.

Barto

A. G.

(1998). Reinforcement learning. Cambridge, MA: MIT Press.

24.

Valentin

V. V.

Dickinson

O’Doherty

J. P.

(2007). Determining the neural substrates of goal-directed learning in the human brain. Journal of Neuroscience, 27, 4019–4026.

25.

Waldron

E. M.

Ashby

F. G.

(2001). The effects of concurrent task interference on category learning: Evidence for multiple category learning systems. Psychonomic Bulletin & Review, 8, 168–176.

26.

Yin

H. H.

Knowlton

B. J.

(2006). The role of the basal ganglia in habit formation. Nature Reviews Neuroscience, 7, 464–476.

27.

Zeithamova

Maddox

W. T.

(2006). Dual-task interference in perceptual category learning. Memory & Cognition, 34, 387–398.

Supplementary Material

Please find the following supplemental material available below.

For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.

For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.

0.00 MB

0.26 MB

The Curse of Planning

Abstract

Keywords

Get full access to this article

References

Supplementary Material