Professional Documents
Culture Documents
a r t i c l e in fo
abstract
Article history:
Received 3 March 2008
Received in revised form
27 August 2008
Accepted 18 September 2008
Available online 2 October 2008
The standard model for direct reciprocity is the repeated Prisoners Dilemma, where in each round
players choose between cooperation and defection. Here we extend the standard framework to include
costly punishment. Now players have a choice between cooperation, defection and costly punishment.
We study the set of all reactive strategies, where the behavior depends on what the other player has
done in the previous round. We nd all cooperative strategies that are Nash equilibria. If the cost of
cooperation is greater than the cost of punishment, then the only cooperative Nash equilibrium is
generous-tit-for-tat (GTFT), which does not use costly punishment. If the cost of cooperation is less than
the cost of punishment, then there are innitely many cooperative Nash equilibria and the response to
defection can include costly punishment. We also perform computer simulations of evolutionary
dynamics in populations of nite size. These simulations show that in the context of direct reciprocity,
(i) natural selection prefers generous tit-for-tat over strategies that use costly punishment, and (ii) that
costly punishment does not promote the evolution of cooperation. We nd quantitative agreement
between our simulation results and data from experimental observations.
Published by Elsevier Ltd.
Keywords:
Cooperation
Punishment
Evolution
Nash equilibrium analysis
Finite population size analysis
Computer simulation
Reciprocity
1. Introduction
Two key mechanisms for the evolution of any cooperative
(or pro-social or other-regarding) behavior in humans are direct
and indirect reciprocity. Direct reciprocity means there are
repeated encounters between the same two individuals, and my
behavior towards you depends on what you have done to me.
Indirect reciprocity means there are repeated encounters in a
group of individuals, and my behavior towards you also depends
on what you have done to others. Our social instincts are shaped
by situations of direct and indirect reciprocity. All of our
interactions have possible consequences for the future. Either I
might meet the same person again or others might nd out what I
have done and adjust their behavior towards me. Direct
reciprocity has been studied by many authors (Trivers, 1971;
Axelrod and Hamilton, 1981; Axelrod, 1984; Selten and Hammerstein, 1984; Nowak and Sigmund, 1989, 1992, 1993; Kraines and
Kraines, 1989; Fudenberg and Maskin, 1990; Imhof et al., 2005,
2007). For indirect reciprocity, see (Sugden, 1986; Alexander,
1987; Kandori, 1992; Nowak and Sigmund, 1998, 2005; Lotem
ARTICLE IN PRESS
46
C
a2
D !
a4
a1
a3
(1)
C
bc
db
D !
c e
de
(2)
C
bc
B
D @ db
a b
P
2. Nash-equilibrium analysis
We are interested in Nash equilibria of the repeated game
given by the payoff matrix (3). We assume b; c; a; b40 and d; eX0
throughout the paper. We refer to one game interaction as a
round. With probability w 0owo1, the game continues for
another round. With probability 1 w, the game terminates. The
number of rounds follows a geometrical distribution with mean
1=1 w. The parameter w can also be interpreted as discounting
future payoffs.
A strategy of a player is a behavioral rule that prescribes an
action in each round. We assume that each player has a
probabilistic strategy as follows. In the rst round, a player
chooses an action (either C, D, or P) with probability p0, q0 and r 0 ,
respectively. From the second round on, a player chooses an action
depending on the opponents action in the previous round. The
probability that a player chooses C, D, or P, is given by pi, qi , and r i ,
for each possible previous action (i 1; 2; 3 for C; D; P) of the
opponent. Thus a strategy is described by 12 values as
0 C
p0
B
Response to C B p1
B
s
Response to D B
@ p2
p3
Response to P
Initial move
D P 1
q0 r 0
C
q1 r 1 C
C.
q2 r 2 C
A
q3 r 3
(4)
D
c e
de
a e
P
1
c b
db C
A.
a b
(3)
Note that the idea of punishment is not new in the world of the
Prisoners Dilemma. The classical punishment for defection is
defection. Tit-for-tat punishes defection with defection. The new
proposal, however, is that there is another form of punishment
which is costly for the punisher. Two questions then present
themselves. Is it advantageous to use costly punishment, P,
instead of defection, D, in response to a co-players defection?
And furthermore, does costly punishment allow cooperation to
succeed in situations where tit-for-tat does not? We will explore
these questions in the present paper.
S43
3
Y
fpi ; qi ; r i j pi qi r i 1; pi ; qi ; r i X0g.
(5)
i0
(6)
0
ARTICLE IN PRESS
D.G. Rand et al. / Journal of Theoretical Biology 256 (2009) 4557
47
a Nash-equilibrium strategy, s, as cooperative if and only if two sstrategists always cooperate in the absence of errors (i.e. ! 0).
It is easy to show that the criterion above means that we
search for Nash equilibria of the form
0 C D P 1
1 0 0
B
C
Response to C B 1 0 0 C
B
C.
s
B
Response to D @ p2 q2 r 2 C
A
p3 q3 r 3
Response to P
Initial move
(7)
w0 X
cd
b maxfb; eg
and
apc
0 C D P 1
1 0 0
B
C
Response to C B 1 0 0 C
B
C.
s
C
Response to D B
@ p2 q2 r 2 A
p3 q3 r 3
Response to P
Initial move
bp2 eq2 br 2 b
cd
w4
bb
0
and
apc
(16)
(17)
cd
w0
(18)
ca
.
w0
(19)
and
(8)
bp3 eq3 br 3 b
Initial move
C
1
B
Response to C B 1
B
Response to D B
@ 1 r2
Response to P
1 r3
D
0
0
0
0
P 1
0
C
0C
C.
r2 C
A
r3
(9)
cd
cd
pw0 o
bb
be
cd
w0 b b
(10)
ca
.
w0 b b
(11)
cd
be
(12)
C
1
Initial move
B
Response to C B 1
B
s
Response to D B
@ 1 q2
1 q3
Response to P
D P1
0 0
C
0 0C
C.
q2 0 C
A
q3 0
(13)
ca
.
w0 b e
(21)
(14)
and
q3 4
apc
(20)
and
and
r3
(15)
cd
.
b maxfb; eg
(22)
bc
O.
1w
(23)
ARTICLE IN PRESS
48
bc
2b e b
.
1w
1w
(24)
cd
.
be
(25)
(26)
where
q2
cd
.
w0 b e
(27)
Fig. 1. The best (highest payoff) cooperative Nash equilibria can use costly punishment only if punishment is not more expensive than cooperation, apc. The strategy space
of our model, S43 is shown. From left to right, each simplex represents the initial action, reaction to cooperation, reaction to defection and reaction to punishment,
respectively. Each corner (labeled C, D, P) represents a pure reaction. A point in the simplex represents a probabilistic reaction. (A) The best cooperative Nash equilibria
when punishment is cheaper than cooperation can use punishment in response to defection and/or punishment. Any pair of points from the line in the D-simplex and the
line in the P-simplex is a payoff maximizing cooperative Nash equilibrium. (B) The best cooperative Nash equilibria when punishment is equal in cost to cooperation can use
punishment in response to defection, but always cooperates in response to punishment. (C) The best cooperative Nash equilibrium when punishment is more expensive
than cooperation is generous tit-for-tat. Only defection and cooperation are used in reaction to defection. Punishment is never used.
ARTICLE IN PRESS
D.G. Rand et al. / Journal of Theoretical Biology 256 (2009) 4557
49
Table 1
The best cooperative Nash equilibria derived in Section 2.2
Punishment is less costly
than cooperation, apc
Initial move
Response to C
Response to D
C or D or P
Any (p2 ; q2 ; r 2 ) that satises
bb
cd
r2
q2
w1 3b e
be
p2 1
Response to P
C or D
C or D or P
Any (p3 ; q3 ; r 3 ) that satises
bb
ca
r3
q3
be
w1 3b e
cd
cd
,q
, r2 0
w1 3b e 2 w1 3b e
Such strategies are optimal in that each receives the highest payoff against itself. If apc, optimal cooperative Nash equilibria exist which use costly punishment in response
to defection and/or punishment. If a4c, the unique optimal cooperative Nash equilibrium is generous tit-for-tat, which never uses costly punishment. It is interesting that
as punishment becomes more expensive, the optimal response to an opponents punishment becomes more generous. If aXc, punishment use is always responded to with
full cooperation.
Table 2
The best (highest payoff) cooperative Nash equilibria derived in Section 2.2 when ! 0 and d e 0
Punishment is less costly
than cooperation, apc
Initial move
Response to C
Response to D
C or D or P
Any (p2 ; q2 ; r 2 ) that satises
bb
c
r2
q2
bw
b
p2 1
Response to P
C or D
C or D or P
Any (p3 ; q3 ; r 3 ) that satises
bb
ca
r3
q3
b
bw
c
c
,q
, r2 0
bw 2 bw
Again, costly punishment is only used if apc, and otherwise the optimal strategy is generous tit-for-tat. You can also see that if apc, strategies which respond to D using
only C and P (q2 0) are more forgiving than strategies which respond to D using only C and D (r 2 0).
pi
N
1 X
usi ; sj .
N 1 j1
(29)
jai
1
.
1 epT pL =t
(30)
This is a monotonically increasing function of the payoffdifference, pT pL , taking the values from 0 to 1. This update
rule is called the pairwise comparison (Pacheco et al., 2006b;
Traulsen et al., 2006, 2007). The parameter t40 in (30) is called
the temperature of selection. It is a measure of the intensity of
selection. For very large t we have weak selection (Nowak et al.,
2004).
In learning, we introduce a chance of mutation (or exploration). When the learner switches his strategy, then with
probability m he adopts a completely new strategy. In this case,
the probabilities pi , qi , and r i for each i are randomly generated
using a U-shaped probability density distribution as in Nowak and
ARTICLE IN PRESS
50
Fig. 3. The relative size of a and c has little effect on move frequencies. The time
average frequency of cooperation, defection, and punishment are shown for a
0:1 and a 10, with b 3; c 1; d e 1, and b 4. Simulation parameters
m 0:1, and t 0:8 are used. Move use is time averaged over N 50 players,
playing for a total of 2 107 generations. Consistent with the Nash equilibrium
analysis, there is a high level of cooperation in both cases, and the a4c simulation
contains slightly more C, less D, and less P than the aoc simulation.
ARTICLE IN PRESS
D.G. Rand et al. / Journal of Theoretical Biology 256 (2009) 4557
51
ARTICLE IN PRESS
52
ARTICLE IN PRESS
D.G. Rand et al. / Journal of Theoretical Biology 256 (2009) 4557
53
Fig. 6. Evolution disfavors the use of costly punishment across a wide range of payoff and simulation parameter values. Probabilities of D in response to D (q2 ) and P in
response to D (r 2 ) among cooperative strategies (p1 40:75) are shown, time averaged over a total of 2 107 generations. Parameters b (A), d (B), a (C), b (D), t (E), m (F), and
w (G) are varied. Values other than that which is varied are set to b 3; c 1; d e 1; a 1, b 4, t 0:8, m 0:1, and w 1. For all parameter sets explored, q2 4r 2
holds: strategies that respond to D with D more often than P are selected by evolution. (A) As punishment becomes more costly for the player who is punished, it becomes
more effective to use P. Yet even with a 25 : 1 punishment technology, we see q2 4r 2 . (B) As defection becomes more effective, it makes even less sense to punish. (C) As
punishment gets more expensive for the punisher, it becomes less effective to punish. Yet even if a 0, defection is still used more than punishment. (D) Increasing b
decreases the total number of D moves, and therefore decreases the selection pressure acting on q2 and r 2 . This has the effect of moving both values towards 13, thus
decreasing q2 and increasing r 2 . (E) Increasing the temperature of selection t also reduces selection pressure, and all move probabilities approach 13. Yet for all values of t, we
nd q2 4r 2 . (F) As mutation rate m increases, mutation dominates selection and all move probabilities approach 13. Yet for all values mo1, we nd q2 4r2 . (G) Even in nitely
repeated games, wo1, defection is favored over punishment, q2 4r 2 .
ARTICLE IN PRESS
54
D
0
P
0
0
1
2y
C
C
C
y A
0
2
71
1
0pyp1,
0
(31)
5. Discussion
ARTICLE IN PRESS
D.G. Rand et al. / Journal of Theoretical Biology 256 (2009) 4557
Acknowledgments
Support from the John Templeton Foundation, the NSF/NIH joint
program in mathematical biology (NIH grant R01GM078986), and
J. Epstein is gratefully acknowledged.
55
(A.1)
c
|{z}
immediate payoff
wbp1 eq1 br 1
|{z}
(A.2)
future payoff
(A.3)
(A.4)
(A.5)
Appendix A
In this appendix we will analytically derive all Nash equilibria
of the repeated game.
We notice that saying strategy s is a Nash equilibrium is
equivalent to saying that s is a best response to s. In other words, s
C
Initial move 0 0
Response to C B
B0
s
B
Response to D @ 0
Response to P
0
D P
1
1 0
1 0C
C
C.
1 0A
1
(A.6)
ARTICLE IN PRESS
56
vC c w1 3e,
vC c w1 3fb br 1 bg,
vD d w1 3fb br 2 bg,
vD d w1 3e,
vP a w1 3e.
(A.7)
vP a w1 3fb br 3 bg.
(A.14)
D P
1
0 1
0 1C
C
C.
0 1A
0
D P 1
0 C
1 q0 q0 0
B
C
Response to C B 1 q1 q1 0 C
B
C.
s
C
Response to D B
@ 1 q2 q2 0 A
q
0
1
q
Response to P
3
3
Initial move
(A.16)
vP a w1 3b.
(A.9)
vC c w1 3fb eq1 bg,
(A.17)
c d w1 3b eq2 q1 ,
(A.15)
D
P 1
1 r0 r0
C
1 r1 r1 C
C.
1 r2 r2 C
A
1 r3 r3
(A.10)
vC c w1 3fe br 1 eg,
vD d w1 3fe br 2 eg,
(A.11)
0 C
p0
B
p
Response
to
C
B
B 1
s
Response to D B
@ p2
p3
Response to P
Initial move
D P 1
q0 r 0
C
q1 r 1 C
C.
q2 r 2 C
A
q3 r 3
(A.19)
c d4w1 3e br 1 r 2 ,
d a w1 3e br 3 r 2 .
A.7. When vC vD vP
Any strategy is a best response to s;
vP a w1 3fe br 3 eg.
(A.18)
(A.12)
a w1 3bp3 eq3 br 3 .
(A.20)
Appendix B
ARTICLE IN PRESS
D.G. Rand et al. / Journal of Theoretical Biology 256 (2009) 4557
xg1 1 xg1
ug1 1 ug1 du
(B.1)
(B.2)
X 02 X 2 X 1 ,
(B.3)
X 03 1 X 2 .
(B.4)
X 01 ; X 02 ,
X 03 .
and
This
Finally, pi , qi , and r i are randomly matched with
order randomization is necessary to preserve pi qi r i 1 while
still maintaining the same probability distribution for each variable.
The resulting distribution is U-shaped, with the probability density
near 0 twice as large as that near 1.
References
Alexander, R.D., 1987. The Biology of Moral Systems. Aldine de Gruyter, New York.
Axelrod, R., 1984. The Evolution of Cooperation. Basic Books, USA, New York.
Axelrod, R., 1986. An evolutionary approach to norms. Amer. Polit. Sci. Rev. 80,
10951111.
Axelrod, R., Hamilton, W.D., 1981. The evolution of cooperation. Science 211,
13901396.
Bellman, R., 1957. Dynamic Programming. Princeton University Press, Princeton, NJ.
Bochet, O., Page, T., Putterman, L., 2006. Communication and punishment in
voluntary contribution experiments. J. Econ. Behav. Org. 60, 1126.
Boerlijst, M.C., Nowak, M.A., Sigmund, K., 1997. The logic of contrition. J. Theor. Biol.
185, 281293.
Bowles, S., 2001. Individual interactions, group conicts, and the evolution of
preferences. In: Durlauf, S.N., Young, H.P. (Eds.), Social Dynamics. MIT Press,
Cambridge, MA, pp. 155190.
Boyd, R., Richerson, P., 1990. Group selection among alternative evolutionarily
stable strategies. J. Theor. Biol. 145, 331342.
Boyd, R., Richerson, P.J., 1992. Punishment allows the evolution of cooperation
(or anything else) in sizable groups. Ethol. Sociobiol. 13, 171195.
Boyd, R., Gintis, H., Bowles, S., Richerson, P.J., 2003. Evolution of altruistic
punishment. Proc. Natl. Acad. Sci. USA 100, 35313535.
Brandt, H., Sigmund, K., 2006. The good, the bad and the discriminatorerrors in
direct and indirect reciprocity. J. Theor. Biol. 239, 183194.
Brandt, H., Hauert, C., Sigmund, K., 2003. Punishment and reputation in spatial
public goods games. Proc. R. Soc. Lond. B 270, 10991104.
Burnham, T.C., Johnson, D.P., 2005. The biological and evolutionary logic of human
cooperation. Analyse and Kritik 27, 113135.
Carlsmith, K., Darley, J.M., Robinson, P.H., 2002. Why do we punish? Deterrence
and just deserts as motives for punishment. J. Pers. Soc. Psychol. 83, 284299.
Clutton-Brock, T.H., Parker, G.A., 1995. Punishment in animal societies. Nature 373,
209216.
Denant-Boemont, L., Masclet, D., Noussair, C.N., 2007. Punishment, counterpunishment and sanction enforcement in a social dilemma experiment. Econ.
Theory 33, 145167.
Dreber, A., Rand, D.G., Fudenberg, D., Nowak, M.A., 2008. Winners dont punish.
Nature 452, 348351.
Fehr, E., Gachter, S., 2000. Cooperation and punishment in public goods
experiments. Am. Econ. Rev. 90, 980994.
Fehr, E., Gachter, S., 2002. Altruistic punishment in humans. Nature 415, 137140.
Fowler, J.H., 2005. Altruistic punishment and the origin of cooperation. Proc. Natl.
Acad. Sci. 102, 70477049.
Fudenberg, D., Maskin, E., 1990. Evolution and cooperation in noisy repeated
games. Am. Econ. Rev. 80, 274279.
Fudenberg, D., Tirole, J., 1991. Game Theory. MIT Press, Cambridge, MA.
Gurerk, O., Irlenbusch, B., Rockenbach, B., 2006. The competitive advantage of
sanctioning institutions. Science 312, 108111.
Hauert, C., Traulsen, A., Brandt, H., Nowak, M.A., Sigmund, K., 2007. Via freedom to
coercion: the emergence of costly punishment. Science 316, 19051907.
Herrmann, B., Thoni, C., Gachter, S., 2008. Antisocial punishment across societies.
Science 319, 13621367.
Imhof, L.A., Nowak, M.A., 2006. Evolutionary game dynamics in a WrightFisher
process. J. Math. Biol. 52, 667681.
Imhof, L.A., Fudenberg, D., Nowak, M.A., 2005. Evolutionary cycles of cooperation
and defection. Proc. Natl. Acad. Sci. USA 102, 1079710800.
57
Imhof, L.A., Fudenberg, D., Nowak, M.A., 2007. Tit-for-tat or winstay, loseshift?
J. Theor. Biol. 247, 574580.
Kahneman, D., Schkade, D., Sunstein, C.R., 1998. Shared outrage and erratic awards:
the psychology of punitive damages. J. Risk Uncertainty 16, 4986.
Kandori, M., 1992. Social norms and community enforcement. Rev. Econ. Stud. 59,
6380.
Kraines, D., Kraines, V., 1989. Pavlov and the prisoners dilemma. Theory and
Decision 26, 4779.
Lindgren, K., 1991. Evolutionary phenomena in simple dynamics. In: Langton, C.,
et al. (Eds.), Articial Life II. Addison-Wesley, Redwood City, CA, pp. 295312.
Lindgren, K., Nordahl, M.G., 1994. Evolutionary dynamics of spatial games. Physica
D 75, 292309.
Lotem, A., Fishman, M.A., Stone, L., 1999. Evolution of cooperation between
individuals. Nature 400, 226227.
May, R.M., 1987. More evolution of cooperation. Nature 327, 1517.
Molander, P., 1985. The optimal level of generosity in a selsh, uncertain
environment. J. Conict Resolution 29, 611618.
Nakamaru, M., Iwasa, Y., 2006. The coevolution of altruism and punishment: role
of the selsh punisher. J. Theor. Biol. 240, 475488.
Nowak, M.A., 1990. Stochastic strategies in the prisoners dilemma. Theor. Popul.
Biol. 38, 93112.
Nowak, M.A., 2006. Five rules for the evolution of cooperation. Science 314,
15601563.
Nowak, M.A., Sigmund, K., 1989. Oscillations in the evolution of reciprocity.
J. Theor. Biol. 137, 2126.
Nowak, M.A., Sigmund, K., 1990. The evolution of stochastic strategies in the
Prisoners Dilemma. Acta Appl. Math. 20, 247265.
Nowak, M.A., Sigmund, K., 1992. Tit for tat in heterogeneous populations. Nature
355, 250253.
Nowak, M.A., Sigmund, K., 1993. A strategy of winstay, loseshift that outperforms tit for tat in Prisoners Dilemma. Nature 364, 5658.
Nowak, M.A., Sigmund, K., 1998. Evolution of indirect reciprocity by image scoring.
Nature 393, 573577.
Nowak, M.A., Sigmund, K., 2005. Evolution of indirect reciprocity. Nature 437,
12911298.
Nowak, M.A., Sasaki, A., Taylor, C., Fudenberg, D., 2004. Emergence of cooperation
and evolutionary stability in nite populations. Nature 428, 646650.
Ohtsuki, H., Iwasa, Y., 2004. How should we dene goodness? Reputation dynamics
in indirect reciprocity. J. Theor. Biol. 231, 107120.
Ohtsuki, H., Iwasa, Y., 2005. The leading eight: social norms that can maintain
cooperation by indirect reciprocity. J. Theor. Biol. 239, 435444.
Ostrom, E., Walker, J., Gardner, R., 1992. Covenants with and without a sword:
selfgovernance is possible. Am. Pol. Sci. Rev. 86, 404417.
Ostrom, E., Gardner, J., Walker, R., 1994. Rules, Games, and Common-Pool
Resources. University of Michigan Press, Ann Arbor.
Pacheco, J.M., Santors, F.C., Chalub, F.A.C.C., 2006a. Stern-judging: a simple,
successful norm which promotes cooperation under indirect reciprocity. PLoS
Comput. Biol. 2, 16341638.
Pacheco, J.M., Traulsen, A., Nowak, M.A., 2006b. Active linking in evolutionary
games. J. Theor. Biol. 243, 437443.
Page, T., Putterman, L., Unel, B., 2005. Voluntary association in public goods
experiments: reciprocity mimicry and efciency. Econ. J. 115, 10321053.
Panchanathan, K., Boyd, R., 2005. A tale of two defectors: the importance of
standing for evolution of indirect reciprocity. J. Theor. Biol. 224, 115126.
Rockenbach, B., Milinski, M., 2006. The efcient interaction of indirect reciprocity
and costly punishment. Nature 444, 718723.
Sanfey, A.G., Rilling, J.K., Aronson, J.A., Nystrom, L.E., Cohen, J.D., 2003. The neural basis
of economic decision-making in the Ultimatum Game. Science 300, 17551758.
Selten, R., Hammerstein, P., 1984. Gaps in Harleys argument on evolutionarily stable
learning rules and in the logic of tit for tat. Behav. Brain Sci. 7, 115116.
Sigmund, K., 2008. Punish or perish: enforcement and reciprocation in human
collaboration. Trends Ecol. Evol. 22, 593600.
Sigmund, K., Hauert, C., Nowak, M.A., 2001. Reward and punishment. Proc. Natl.
Acad. Sci. USA 98, 1075710762.
Sugden, R., 1986. The Economics of Rights, Cooperation and Welfare. Blackwell,
Oxford.
Traulsen, A., Nowak, M.A., 2006. Evolution of cooperation by multilevel selection.
Proc. Natl. Acad. Sci. USA 103, 1095210955.
Traulsen, A., Nowak, M.A., Pacheco, J.M., 2006. Stochastic dynamics of invasion and
xation. Phys. Rev. E 74, 011909.
Traulsen, A., Pacheco, J.M., Nowak, M.A., 2007. Pairwise comparison and selection
temperature in evolutionary game dynamics. J. Theor. Biol. 246, 522529.
Trivers, R.L., 1971. The evolution of reciprocal altruism. Q. Rev. Biol. 46, 3557.
Wahl, L.M., Nowak, M.A., 1999a. The continuous prisoners dilemma: I. Linear
reactive strategies. J. Theor. Biol. 200, 307321.
Wahl, L.M., Nowak, M.A., 1999b. The continuous prisoners dilemma: I. Linear
reactive strategies with noise. J. Theor. Biol. 200, 307321.
Wilson, D.S., 1975. A theory of group selection. Proc. Natl. Acad. Sci. 72, 143146.
Wilson, D.S., Sober, E., 1994. Reintroducing group selection to the human
behavioral sciences. Behav. Brain Sci. 17, 585654.
Wynne-Edwards, V.C., 1962. Animal Dispresion; in Relation to Social Behaviour.
Oliver and Boyd, London.
Yamagishi, T., 1986. The provision of a sanctioning system as a public good. J. Pers.
Soc. Psych. 51, 110116.
Yamagishi, T., 1988. Seriousness of social dilemmas and the provision of a
sanctioning system. Soc. Psychol. Q. 51, 3242.