Evolution of Learning among Pavlov Strategies in a Competitive Environment with Noise

Abstract

Pavlov denotes a family of stochastic learning strategies that achieves the mutually cooperative outcome in the iterated prisoner's dilemma against a wide variety of strategies, although it can be exploited to some extent by some. When restricted to an environment of only Pavlov-type strategies, slower learning mutants cannot invade an initial dominant population. More surprising, mutants who learn much faster than the current population tend to overreact and also cannot invade. In particular, the “immediate learning” version of Pavlov, sometimes called win-stay-lose-switch, often fares poorly in this environment. Only those strategies that learn marginally faster than the dominant variety will have greater fitness. Although faster learners will eventually dominate a given homogeneous Pavlov population, the process must proceed through a gradual increase in the rate of learning.

Get full access to this article

View all access options for this article.

References

Atkinston, R. , and P. Suppes . 1958. An analysis of two-person game situations in terms of statistical learning theory. Journal of Experiential Psychology 35:369-378.

Axelrod, R. 1984. The evolution of cooperation. New York: Basic Books.

Beardsley, T. 1993. Never give a sucker an even break. Scientific American, October, 22.

Beardsley, T. 1994. Erratum: Never give a sucker an even break. Scientific American, February, 10.

Boyd, R. , and J. Lorberbaum . 1987. No pure strategy is evolutionarily stable in the repeated prisoner's dilemma game. Nature 327:58-59.

Bush, R. , and W. Estes . 1959. Studies in mathematical learning theory. Stanford, CA: Stanford University Press.

Donninger, C. 1986. Is it always efficient to be nice? In Paradoxical effects of social behavior, edited by A. Dickmann and P. Mitter , 123-134. Heidelberg, Germany: Physica Verlag.

Fudenberg, D. , and E. Maskin . 1990. Evolution and cooperation in noisy repeated games. New Developments in Economic Theory 80:274-279.

Hergenhahn, B. R. 1976. An introduction to the theories of learning. Englewood Cliffs, NJ: Prentice Hall.

10.

Kemeny, J. , and J. Snell . 1976. Finite Markov chains. New York: Springer-Verlag.

11.

Kraines, D. , and V. Kraines . 1989. Pavlov and the prisoner's dilemma. Theory and Decision 26:47-79.

12.

Kraines, D. , and V. Kraines . 1993. Learning to cooperate with Pavlov: An adaptive strategy for the prisoner's dilemma with noise. Theory and Decision 35:107-150.

13.

Lave, C. , and J. G. March . 1975. An introduction to models in the social sciences. New York: Harper & Row.

14.

Luce, R. D. , and H. Raiffa . 1957. Games and decisions. New York: Wiley.

15.

Macy, M. 1989. Walking out of social traps. Rationality and Society 1:197-219.

16.

Mailath, G. 1992. Symposium on evolutionary game theory. Journal of Economic Theory 57:259-277.

17.

Matsumoto, D. , N. Haan , G. Yabrove , P. Theodorou , and C. C. Carney . 1990. Preschoolers' moral actions and emotions in prisoner's dilemma. Developmental Psychology 22:663-669.

18.

Maynard Smith, J. 1982. Evolution and the theory of games. Cambridge, UK: Cambridge University Press.

19.

Milinski, M. 1987. Tit for tat in sticklebacks and the evolution of cooperation. Nature 325:433-435.

20.

Milinski, M. 1993. Cooperation wins and stays. Nature 364:12-13.

21.

Nowak, M. , and K. Sigmund . 1993. A strategy of win-stay-lose shift that outperforms tit-for-tat in the prisoner's dilemma game. Nature 364:56-58.

22.

Poundstone, W. 1992. Prisoner's dilemma. New York: Doubleday.

23.

Rapoport, A. , and A. M. Chammah . 1965. Prisoner's dilemma. Ann Arbor: University of Michigan Press.

24.

Simon, H. 1991. A mechanism for social selection and successful altruism. Science 250:1665-1668.

25.

Tversky, A. , and D. Kahneman . 1992. Advances in prospect theory: Cumulative representation of uncertainty. Journal of Risk and Uncertainty 5:297-323.

26.

Wu, H. , and R. Axelrod . 1995. How to cope with noise in the iterated prisoner's dilemma. Journal of Conflict Resolution 39:183-189.