Professional Documents
Culture Documents
1. introduction
It has long been recognized that community enforcement can make sellers behave
cooperatively even when they meet particular buyers only infrequently and have a
short-term incentive to cheat, e.g., to supply low quality or to shirk in a labor con-
tract. For instance, Klein and Leffler (1981) study the seller’s problem of credibly
committing to offer high quality in a model where a continuum of buyers is ran-
domly matched with a seller who has a short-term incentive to supply low quality at
a lower cost. In their model, community enforcement by the buyers, through a coor-
dinated boycott after observing low quality, provides an incentive for the seller to
produce high quality. The results of Klein and Leffler (1981), along with most of the
existing literature on community enforcement, depend upon the assumption that past
quality choices of the seller are public information.
When the number of sellers and buyers is large, and particular sellers and buyers
meet infrequently or only once, as is the case, for instance, for specialist doctors,
car dealers etc., the assumption of public information seems rather demanding.
Recently, this observation has led to a number of articles looking at community
399
400 AHN AND SUOMINEN
2.1. A Model without Signals. There is one seller and M buyers. Throughout
this paper, the seller will be referred to as “female” and the buyers as “male.” We
denote by MB = 1 2 M the set of buyers. Let be the set of all permutations
of MB . In each period t = 0 1 2 a permutation θt ∈ is chosen with uniform
probability, independent of previous realizations. The buyer θt 1 ∈ MB is selected
to play with the seller the following simultaneous move “trade” game:
Buyer θt 1
B NB
Seller H 1 1 0 0
L 1 + g − 0 0
where both g and are taken to be strictly positive numbers. The seller’s actions,
H and L refer to providing “high quality” and “low quality” while the buyer’s action
B refers to “buy” and NB to “not buy.” With g and strictly positive, L is the
dominant strategy for the seller and 0 0 is the only Nash equilibrium outcome of
the trade game. The seller and buyers have a common discount factor δ ∈ 0 1 and
their overall payoffs are the discounted sum of payoffs from the trade games. For
simplicity, it is assumed that the seller cannot recognize the identities of the buyers.2
In each period t = 0 1 2 there are N < M other buyers, called period t
spectators, who are neighboring θt 1 and also observe the outcome of the game. We
denote the set of spectators by Nθt = θt 2 θt 3 θt N + 1. Note that θt
determines the identities of both the matched buyer and the spectators. In addition,
there exists a public randomization device ft ∈ F ≡ 0 1
which is realized in the
beginning of the each period and drawn from a uniform probability distribution.
The stage game proceeds as follows:
1. θt and ft are realized and observed by all buyers. The seller observes ft .3
2. Seller and buyer θt 1 play the 2 × 2 simultaneous move trade game
described previously, while the spectators Nθt observe. We denote the
outcome (or the realized action profile) of this game by aSt aBt θt 1
where aSt ∈ AS = H L aBt θt 1 ∈ AB = B NB. We denote by ot j
the observation of buyer j in period t; i.e., let ot j = aSt aBt θt 1 ∈
AS × AB if j ∈ θt 1 ∪ Nθt and let ot j = otherwise.
The information that buyer j receives in period t can now be written as
θt ft ot j We denote with H t j the set of all possible histories for buyer j
up to, but not including, period t. By convention, let H 0 j = . An element
2
The assumption that the seller could recognize the identities of the buyers would not change
our results. The strategy profiles that we consider would be equilibria of such a game under slightly
different conditions.
3
The assumption that buyers observe θt is made in order to ease the notation.
WORD-OF-MOUTH COMMUNICATION 403
ht j ∈ H t j includes all past realizations of θs and fs as well as all past observa-
tions of player j os j where 0 ≤ s < t. Hence, ht j is
t−1
ht j = θτ fτ oτ jτ=0
j
bt : × F × H t j → B NB
Here
j
bt θt ft ht j specifies the choice of action for buyer j in period t given θt ft ,
and j’s private history ht j If j = θt 1 this action is either B or NB; if j = θt 1
the action space for the buyer is merely the empty set.
For the seller, we define the pure strategy as a sequence of maps s t ∞
t=0 where
Note that the assumption that a seller does not recognize the identity of a buyer is
implicit in this notation.
2.2. A Model with Signals. This model differs from the previous one in that in
each period t = 0 1 2 before the trade game, the spectators j ∈ Nθt are
allowed to send payoff-irrelevant signals to buyer θt 1
Let us introduce the following notation:
• C = γ β: the set of possible signals.
j
• mt ∈ γ β : the signal from buyer j to buyer θt 1.
• mt j ∈ γ βN : the signals received by buyer j in period t
We can interpret the signals γ and β as meaning “Good” and “Bad,” respectively.
The interpretation of mt and mt j is similar to that of
j j
bt : e.g., if j = θt 1 mt j ∈
N
γ β and if j = θt 1 mt j = . The information that buyer j receives in period t
can now be written as θt ft mt j ot j. As before, we denote by H t j the set of all
possible histories for buyer j up to, but not including, period t and let H 0 j = An
element ht j ∈ H t j now includes all past realizations of θs and fs all past messages
to player j, ms j, all past messages from player j, mjs and all past observations of
player j os j where 0 ≤ s < t. Hence, ht j is
mt
j j
A pure strategy for buyer j is then a sequence bt ∞
t=0 , where
j
t : × F × H t j → γ β
m
j
bt : × F × H t j × γ βN → B NB
j
Here, mt θt ft ht j specifies the signal that buyer j, with private history ht j
sends to the matched buyer θt 1 in period t, while
j
bt θt ft ht j mt j specifies
the choice of action for buyer j = θt 1 in a period t trade game, given ft , j’s private
history ht j and signals mt j that he received in the period t communication stage.
For the seller, we define the pure strategies as before.
404 AHN AND SUOMINEN
2.3. Equilibrium Concept. Because of the private histories that players have, the
equilibrium concept that we apply is sequential equilibrium.4 A trivial sequential
equilibrium is one where the seller plays L and the buyers NB after any history:
the repetition of the only pure strategy Nash equilibrium of the trade game. We
are, however, interested in sequential equilibria where H B is played along the
equilibrium path.
4
In Kreps and Wilson (1982), the definition of sequential equilibrium requires the specification
of a beliefs system as well as a strategy profile. Because the beliefs system, which is consistent with
our strategy profiles, is simple, we refer only to the strategy profile when describing a sequential
equilibrium.
WORD-OF-MOUTH COMMUNICATION 405
when ft > f ∈ 0 1
in which case, all players return to the equilibrium path of the
randomly forgiving strategy profile and the game is essentially restarted. The unforgiving
strategy profile is a special case of the randomly forgiving strategy profile with f = 1.
There are two reasons for focusing on these strategy profiles. First, they are simple:
In fact, because of the private information that players have, it is difficult to imagine
other strategies that could support the efficient outcome, where H B is played in
every period along the equilibrium path, as a sequential equilibrium of this game. For
instance, strategies with finite punishments are difficult to implement because buyers
typically do not know the time of the first defection and therefore cannot synchronize
the last period of a punishment phase.5 Our strategy profiles avoid these problems.
The buyers’ incentives are particularly easily shown to be satisfied.
The second reason for focusing on these strategy profiles is that the unforgiving
strategy profile results in the fastest possible dissemination of information about a
seller’s defection and allows the buyers to implement the maximum punishment. This
is important because the conditions for the efficient outcome that we derive then
give us the minimum N that is necessary for the efficient outcome in any sequential
equilibrium.
3.2. Equilibrium Conditions. In this section, we study the necessary and sufficient
conditions in terms of N for a sequential equilibrium where H B is played by all
players in every period.
Before proceeding, we introduce some additional notation. Take any two time
periods t and t, where t < t. For the first model, we denote by b the probability that
θt 1 was among the N + 1 buyers who observed the seller’s action in period t . That
is, let b denote the probability
Prθt 1 ∈ θt 1 Nθt
Correspondingly, for the second model, we denote by b the probability that either
θt 1 or some of his time t spectators, h ∈ Nθt , were among the N + 1 buyers who
observed the seller’s action in period t In this case, b is therefore the probability
Prθt 1 Nθt ∩ θt 1 Nθt =
If the outcome of the game has been H B in all periods except period t < t
under the unforgiving strategy profile, for both models, b is the probability that θt 1
assigns the seller a bad status in period t after the possible preplay communication
among buyers. This probability, which we call “the degree of informativeness,” plays
an important role in the analysis below. It is straightforward to show that the degree
of informativeness is
N+1
M
for the first model
b= M−N−1
N+1
1 − for the second model
N+1
M
5
It is because of these kinds of difficulties that the existing models on community enforcement
with private information have made such extreme assumptions regarding the observability of players’
actions and identities.
406 AHN AND SUOMINEN
For both models, note that, since θt is i.i.d., b is time independent and does not
depend on t or t Note also that for both strategies, b increases in the number of
spectators, N.
In the proofs of Propositions 1 and 2, we need the unconditional probability that
θt 1 assigns the seller a good status in period t, given that the seller has defected in
every period t ∈ tD t − 1 If the buyers follow the unforgiving strategy profile,
this probability is simply 1 − bt−tD . In the analysis below, the constant
g1 − δ
b∗ ≡
δ
also plays a crucial role. Equipped with this notation, we are now ready to state our
first proposition:
The left-hand side is the payoff from playing H in every period, whereas the right-
hand side is the payoff from playing L in every period. This equation can be rewritten
as b ≥ b∗
Now, assume that b < b∗ Then, whatever strategy the buyers adopt off the equilib-
rium path, the payoff to the seller from playing L in every period is higher than the
payoff on the right-hand side of Equation (1). Therefore, playing H in every period
cannot be optimal for the seller.
It is easy to show that the unforgiving strategy profile is a sequential equilibrium
when b = b∗ . The seller’s incentive to follow the unforgiving strategy profile is charac-
terized by two conditions: one preventing her from playing L along the equilibrium
path, and one that guarantees that a seller who has defected keeps defecting irrespec-
tive of her private history. To give the seller an incentive to play H along the equilib-
rium path, the short-term gain from cheating, g, must be outweighed by the long-term
loss resulting from the gradual loss of reputation among buyers. As we have seen,
given the buyers’ strategies, this occurs when b ≥ b∗ . On off-the-equilibrium paths,
the unforgiving strategy profile requires the seller to keep defecting rather than play H
in an attempt to slow down the deterioration of her reputation. It can be shown that
indifference to supplying high quality along the equilibrium path, which occurs when
b = b∗ , implies that this off-the-equilibrium path condition always holds The reason
is that the seller has less incentive to protect her reputation on off-the-equilibrium
paths than on the equilibrium path as, in any case, information about her defection
WORD-OF-MOUTH COMMUNICATION 407
is spreading through the actions of the informed buyers. Given the seller’s strategy,
the buyers’ incentives to follow the unforgiving strategy profile are trivially satisfied.
Somewhat surprisingly, the unforgiving strategy profile is not always a sequential
equilibrium for all b ≥ b∗ . The reason is that when b is larger than b∗ , the seller may
have an incentive to provide high quality in an attempt to slow down the deterioration
of her reputation, thus undermining the buyers’ incentives to punish her by not buy-
ing. As we show in Ahn and Suominen (1996), when δ is large, for both models there
exist bL and bH , such that the unforgiving strategy profile is a sequential equilibrium
of the game if, and only if, b∗ ≤ b ≤ bL or b ≥ bH where b∗ < bL < bH < 1.
As we have seen, b ≥ b∗ is needed for the seller’s incentive to play H along
the equilibrium path. For the off-the-equilibrium path condition, that a seller who
has defected keeps defecting, when δ is large, it is necessary that either b ≤ bL
or b ≥ bH If b is large, b ≥ bH , such a seller has an incentive to keep defecting, as
there are many informed buyers who keep playing NB against her, and her reputation
deteriorates quickly irrespective of her current and future actions. On the contrary, if
b is small, b ≤ bL , playing L is preferable simply because the information about her
defections is not spreading very quickly. It can be shown that this off-the-equilibrium
path condition is always satisfied with a strict inequality when b = b∗ , implying that
bL > b∗ .
One way to modify the strategy profile so that it supports the efficient outcome
for all b ≥ b∗ is to make use of the randomly forgiving strategy profile and the public
randomization device to coordinate and adjust the severity of the punishment for
the seller so that she becomes indifferent to supplying high or low quality along the
equilibrium path. As in the case of the unforgiving strategy profile, the indifference to
supplying high quality on the equilibrium path implies that the off-the-equilibrium
path incentives in the randomly forgiving strategy profile are satisfied. In this way, we
can prove the following:
Given that b is a probability, it can only be larger than b∗ if δ ≥ g/1 + g, so that
∗
b ≤ 1. For the remainder of this paper, let us assume that δ ≥ g/1 + g.
observed the seller’s action in period t < t. Given this, increasing N will increase
b in two different ways: First, it increases the number of players who observe the
current game and, second, it increases the probability with which each of these players
observed the game in period t . The presence of these two reinforcing effects, both
order N, suggests that N ∗ may now grow more slowly than M. Indeed, the following √
proposition shows that, in order to sustain trade, N ∗ must grow only at a rate M
in the second model of word-of-mouth communication.
N ∗ M2
0 < lim <∞
M→∞ M
Proposition 3 shows that when the buyer population is large enough, it is possible
to provide the seller with an incentive to sell high quality even when each buyer
observes the seller’s action in any given period with an arbitrarily small probability.
This follows from the fact that the proportion N ∗ /M which is the probability of
observing any given period’s trade game when N = N ∗ , goes to zero, as M goes to
infinity. Indeed, as the proportion shows, it goes to zero quickly. In this sense, when
signalling is allowed, word-of-mouth communication can be a surprisingly efficient
means of transmitting information and thereby an efficient mechanism to facilitate
cooperation.
The reader may wonder whether the fact that, in our model, the identities of
each buyer’s spectators can change from period to period is crucial for the result in
Proposition 3. It is not. The same result would hold even in a setting where each
buyer has N randomly selected spectators, whose identities stay fixed throughout the
game. To understand this, note that, as M increases, the probability that the seller
is matched with the same buyer twice during any finite length of time goes to zero,
and all learning takes place through observing different buyers’ trade games. In this
case, it does not matter whether we randomly select the matched buyer’s spectators
in every period or randomly select a fixed set of spectators for each buyer at the
beginning of the game.
5. endogenous networks
In this section, we complete our previous study of community enforcement under
word-of-mouth communication by making the networking among buyers costly. We
set up a model where the information transmission and b are endogenously deter-
mined. What we find is that the endogenously determined b is equal to b∗ This
is interesting because, as we saw in Section 3, the unforgiving strategy profile, which
apportions the seller the maximum punishment for selling low quality, is always an
equilibrium when b = b∗ . We also show, however, that when the buyer population
M is large, the efficient outcome can never be reached, and low quality is produced
with positive probability in any Nash equilibrium. For simplicity, we will focus on the
first model of word-of-mouth communication. The results for the second model are
similar.
WORD-OF-MOUTH COMMUNICATION 409
Let us now assume that in period zero, before θ0 is realized, each buyer j ∈ MB can
invest in Nj connections that allow him to observe the outcome of the trade game
whenever θt−1 j ≤ Nj + 1 i.e., if j is the matched player θt 1 or among the Nj
players closest to him in period t. Let us also assume that the cost of this investment
is proportional to Nj . That is, let us assume that in order to obtain Nj connections, j
must pay Nj c, where c > 0, in period zero. This assumption corresponds to the idea
that each buyer, at the beginning of the game, makes an effort to create connections
through which he can later gather information about the actions of the seller against
other buyers. We can now prove the following:
Proposition 4. In any Nash equilibrium of the game with costly networking, low
quality is produced with a positive probability when M > δ/g1 − δ
.
Similarly, as before, the probability that buyer j has never observed the seller until
period t is simply 1 − beNj t .
This equilibrium has several interesting characteristics: First, for c > 0, Equation
(2) requires that p∗ is strictly positive, as was shown in Proposition 4. In equilibrium,
L has to be played with positive probability in order to provide the buyers with an
incentive to network. Second, this probability increases in M In the first model of
word-of-mouth communication p∗ is approximately proportional to M 2 , and in the
second model (it can be shown) it is less than proportional to M 2 . A striking result is
also the knife-edge property of our equilibrium: If even one more buyer invested in
one more connection, there would be no low quality at all (increasing the utility of
all buyers discontinuously). However, networking to reduce production of low quality
is a public good, and, as usual, everyone wants to free-ride in its production. Because
of this, the economy is stuck in an inefficient equilibrium.
Another interesting observation is that when c > c, the trade collapses because a
buyer, who is set to trade with an unknown seller, is not willing to experiment, as he
does not take into account the future trading opportunities of the other buyers with
the seller. In the second model of word-of-mouth communication, there would still
be another externality because of the informative signalling by the spectators. When
choosing the number of other buyers’ games to observe, a buyer would not take into
account the other buyers’ learning but would be interested only in his own learning.
appendix
1 1+g
∞
δt
(3) ≥ + f t−1 1 − f
1−δ 1 − 1 − bfδ t=1 1−δ
1 1+g
⇔ ≥
1 − δf 1 − 1 − bfδ
The left-hand side of the first inequality is the seller’s payoff from following (I),
while the right-hand side is her payoff from deviating and following (II), as long as
fτ ≤ f and following (I) thereafter. For all b ≥ b∗ there exists f δ ∈ 0 1
such
that Equation (3) holds as an equality when f = f δ. From now on, let us assume
that f = f δ.
A seller who is playing on the equilibrium path is now indifferent to opting between
playing H in every period and deviating. She is also indifferent to either playing H
in the current period and then deviating or deviating right away. By playing H in
the current period and deviating only in the next period, she can keep her good
reputation among N + 1 buyers until they, in some way, learn about her defections
in the future. Now consider a seller off the equilibrium path who is following (II).
If her opponent and all the N spectators to her game happen to assign her a good
status, she also can keep her good reputation among N + 1 buyers by playing H. This
reputation, however, is worth less to her than if she were on the equilibrium path
WORD-OF-MOUTH COMMUNICATION 411
because, with some buyers already assigning her a bad status, these N + 1 buyers are
now more likely to learn about her bad status before playing against her in the future.
Since the short-term gain from deviating is the same in both cases, we conclude that
playing L is optimal off the equilibrium path.
To consider more formally the seller’s incentive to follow (II) in period t when
ft ≤ f δ, we need some additional notation: Denote by Ps K the probability that
K + s buyers assign the seller a bad status at the end of period t if she plays H in
period t and K players assigned her a bad status at the end of period t − 1. Denote
by us K the associated conditional probability that the seller’s period t + 1 match
θt+1 1 does not assign her a bad status (after the period t + 1 communication stage),
given that K + s buyers assign her a bad status at the end of period t.
Using the principle of dynamic programming, it is sufficient to show that a single-
period deviation to H is unprofitable. Given this, a seller has an incentive to follow
(II) if, and only if,
1 + gu0
∞
δt
+ f δt−1 1 − f δ
1 − 1 − bf δδ t=1 1−δ
minNM−K
δf δ1 + g Ps us
s=0
∞
δt
≥ u0 + + f δt−1 1 − f δ
1 − 1 − bf δδ t=1
1−δ
minNM−K
δf δ1 + g Ps u s
1 + gu0 s=0
⇔ ≥ u0 +
1 − 1 − bf δδ 1 − 1 − bf δδ
The first two equalities come from the fact that Equation (3) holds as an equality
with f = f δ The inequality follows, since
minNM−K
minNM−K
Ps us ≤ P s u 0 ≤ u0
s=0 s=0
In the second model, when ft ≤ f δ if a buyer j ∈ Nθt has ever observed the
seller play L or her opponent play NB against her, j is indifferent to signalling γ or
β to θt 1, since, given (II), he expects the seller to play L in all her future trade
games until fτ > f δ, including those with j himself, irrespective of the signal that
he sends. So, in this case, we may assume that he sends a truthful signal β. If j has
never observed the seller’s game, or if j has observed some games but the outcome
412 AHN AND SUOMINEN
in all those games was always H B he strictly prefers to send message γ instead
of β. Sending the message β would result in θt 1 playing NB against the seller,
giving her a bad status and making her play L in the future. This would reduce buyer
j’s future payoffs from games where he is matched with the seller. This establishes
the buyers’ incentives to signal. When ft ≤ f δ, a buyer θt 1 expects a seller with
status γ to play H, in which case, playing B is the best response. Similarly, when
ft ≤ f δ, he expects a seller with status β to play L, in which case, playing NB is
the best response. On the other hand, given that even a defected seller plays H after
ft > f δ, it is optimal for all buyers to assign the seller a good status γ if ft > f δ
and to treat her like a seller who has never defected.
To prove Proposition 3, we need the following lemma:
N ∗ M
lim =0
M→∞ M
First, note that lim supM→∞ N ∗ M/M < 1/2. For the subsequences of N ∗ M/M,
whose limits are 1/2, the numerator of the right-hand side of the strict inequality
in Equation (4) approaches 1, and the denominator goes to infinity, leading to a
contradiction since b∗ is assumed to be strictly less than one.
Using Stirling’s formula,
√ √
2πnnn e−n ≤ n! ≤ 2πnnn e−n e1/12n
Given that lim supM→∞ N ∗ /M < 1/2 it is clear that the exponential term that
appears at the right-hand side of Equation (5) goes to 1 as M approaches infinity.
The nonexponential term then has to be bounded away from zero for large M. In
what follows, we show that this implies limM→∞ N ∗ /M = 0.
Let q = N ∗ /M. The nonexponential terms in the right-hand side of Equation (5)
can now be written as
M
1 − q2−2q 1 − q
1 − 2q1−2q 1 − 2q1/2
WORD-OF-MOUTH COMMUNICATION 413
Since 0 < q < 1/2 and lim supM→∞ q < 1/2 the second term of the above expression
is bounded. Furthermore, we can show that the first term is strictly decreasing with
respect to q for 0 ≤ q ≤ 1/2 and for that range of q it is one iff q = 0 Thus, if
limM→∞ q = 0, the first term is very close to zero for large M, which is a contradiction.
Hence, it must be that limM→∞ N ∗ /M = 0.
where
2
2 − M−N ∗ /N ∗
N∗
αM = 1 − −e
M − N∗
−M/2N ∗
2N ∗
βM = 1 − −e
M
Since limM→∞ N ∗ /M = 0, both sequences αM and βM converge to zero from
above. Now, define new sequences aM bM such that eaM = e + αM ebM = e + βM .
We can then write
∗
M − N ∗ 2M−N N ∗2 2N ∗2
(8) ∗ = exp aM − b M
M M M − 2N ∗ M−2N M − N∗ M
414 AHN AND SUOMINEN
Given Equations (7) and (8), all that remains is to show that N ∗2 /M must
converge to some positive number in order for aM N ∗2 /M − N ∗ − bM 2N ∗2 /M
to converge to ln1 − b∗ . If limM→∞ N ∗2 /M = ∞, aM N ∗2 /M − N ∗ − bM 2N ∗2 /M
approaches minus infinity as M goes to infinity. This can be easily shown using the
∗
fact that aM and bM converge to 1 from above. If limM→∞ N 2 /M = 0, we
∗2 ∗ ∗2
obtain another contradiction since aM N /M − N − bM 2N /M → 0 as M → ∞.
Since N ∗2 /M is bounded, it is straightforward to show that limM→∞ N ∗2 /M =
− ln1 − b∗ .
1 r1 + g 1 − r1 + g
(9) = +
1−δ 1 − 1 − bN ∗ −1 δ 1 − 1 − beN ∗ δ
e
1−p p
(10) N ∗ N ∗ − 1 ∈ arg max − − Nj c
Nj M1 − δ M1 − δ1 − beNj
δ1 − p
(11) 1 − p + ≥ p
M1 − δ
A seller is willing to randomize in the initial period between providing high quality
goods forever and providing low quality goods forever if, and only if, Equation (9)
holds. The left-hand side of Equation (9) is the payoff from providing high quality
goods, and the right-hand side is the expected payoff from providing low quality
goods forever. To understand this, note that the probability that j = θt 1 has never
observed the seller is 1 − beNj t . Given this indifference on the equilibrium path, the
seller’s incentives to follow the modified unforgiving strategy profile off the equilibrium
path are also satisfied. The proof of this is exactly the same as that for Proposition 2.
Equation (10) requires that buyers be willing to randomize between N ∗ and N ∗ − 1
connections. We can show that the right-hand side of Equation (10) has a single peak
if Nj is treated as a positive real number (or as a continuous variable). Hence, if the
expected payoff to buyer j is the same with N ∗ and N ∗ − 1, then both N ∗ and N ∗ − 1
must solve j’s maximization problem. Equation (10) is therefore satisfied if
p p
+ Mc =
1 − δ1 − be N ∗ 1 − δ1 − be N ∗ − 1
or
1 − δ1 − be N ∗ − 11 − δ1 − be N ∗
(12) p = cM 2
δ
If a buyer has ever observed the seller play L or someone play NB against her,
he should expect the seller to play L, in which case playing NB is the best response.
If the buyer has observed the seller, and the outcomes have always been H B, he
should expect her to play H, in which case he should play B. If the buyer has never
observed the seller, he should expect her to play H with probability 1 − p. For
WORD-OF-MOUTH COMMUNICATION 415
such a buyer to play B against her rather than give her a bad status by playing NB,
Equation (11) has to be satisfied. This equation can be rewritten as
M1 − δ + δ
(13) p≤
+ 1M1 − δ + δ
The modified unforgiving strategy profile with p defined by Equation (12) is a sequen-
tial equilibrium if p satisfies Equation (13). This happens when c ≤ c.
references
Ahn, I., and M. Suominen, “Word-of-Mouth Communication and Community Enforcement,”
CARESS Working Paper 96-02, 1996.
Allen, F., “Reputation and Product Quality,” Rand Journal of Economics 15 (1984), 311–27.
Ellison G., “Cooperation in The Prisoners’ Dilemma with Anonymous Random Matching,” Review
of Economic Studies 61 (1994), 567–88.
Greif, A., “Contract Enforceability and Economic Institutions in Early Trade: The Magrabi Traders’
Coalition,” The American Economic Review 80 (1993), 525–48.
, “Cultural Beliefs and The Organization of Society: A Historical and Theoretical Reflection
on Collectivist and Individualist Societies,” Journal of Political Economy 102 (1994), 912–50.
, P. Milgrom, and B. R. Weingast, “Coordination, Commitment, and Enforcement: The
Case of The Merchant Quild,” Journal of Political Economy 102 (1994), 745–76.
Harrington, J., “Cooperation in a One-shot Prisoners’ Dilemma,” Games and Economic Behavior
8 (1995), 364–77.
Kandori, M., “Social Norms and Community Enforcement,” Review of Economic Studies 59 (1992),
63–80.
Klein, B., and K. Leffler, “The Role of Market Forces in Assuring Contractual Performance,”
Journal of Political Economy 89 (1981), 615–41.
Kreps, D., and R. Wilson, “Sequential Equilibria,” Econometrica 50 (1982), 863–94.
Milgrom, P., D. North, and B. R. Weingast, “The Role of Institutions in The Revival of Trade:
The Law Merchant, Private Judges, and The Champagne Fairs,” Economics and Politics 2 (1990),
1–23.
Okuno-Fujiwara, M., and A. Postlewaite, “Social Norms and Random Matching Games,”
Games and Economic Behavior 9 (1995), 79–109.