Dynamic Competition with Irreversible Moves · For a wide range of competitive constituent games, the payoff allocations that are equilibrial in IMGs are unique under renegotiation

Dynamic Competition with Irreversible Moves

Friedel Bolle and∗Yves Breitmoser†

European University Viadrina

Postfach 1786, 15207 Frankfurt(Oder), Germany

email: [email protected]

October 12, 2004

Discussion Paper 207

Europa–Universität Viadrina, Frankfurt (Oder)

Abstract

We define a class of duopolistic “competitive games” (including Bertrand and

Cournot competition, public–good games, and rent–seeking contests) and apply it to

define a dynamic game where deviations to increasingly competitive strategies are

irreversible. There is generally a unique profile of equilibrium payoffs that Pareto

dominates the stage game Nash payoffs, and in a wide range of circumstances, this

payoff profile is even unique overall. Moreover, we define a generalized repeated

game where deviations to increasingly competitive strategies can be made irreversible

by the respective player. Under comparably mild assumptions, we find that the sets of

equilibria are payoff–equivalent in both kinds of games. Thus, it is irrelevant whether

deviations to competitive strategies are irreversible or can be made irreversible. Fi-

nally, we define a perturbed infinitely repeated game, where opportunities to restricts

one’s strategies occur with arbitrarily small probabilities, and extend the above equiv-

alence (without strengthening the assumptions).

JEL classification:D40, L10 (Market Structure), D43, L13 (Oligopoly), C73 (Dy-

namic Games)

Keywords: dynamic oligopoly, tacit collusion, equilibrium refinement, irreversible

moves

∗We would like to thank the Deutsche Forschungsgemeinschaft (DFG) for the support.†corresponding author

1 Introduction

“To burn down bridges passed” is a rather frequently cited metaphor describing an irreversible de-

cision. As a result of this decision, another decision (the previous troops movement) ismadeirre-

versible. This is quite remarkable, since at first glance it seems preferable that decisions be reversible.

However, decision makers can be better off deliberately restricting their options; not only to win bat-

tles, but also to deter entrances of firms into markets (through committing to competitive prices,

capacity levels, or advertisement spendings, e.g. Sylos–Labini, 1962), and to cope with the own

weaknesses of will (Strotz, 1956, Elster, 1984, and many others). Moreover, players can be restricted

unvoluntarily through their emotions (e.g. Elster, 1998, and Loewenstein, 2000) or through persons

that the players have to report to (e.g. share holders). We shall analyze these aspects in a generalized

model of duopolistic competition. Precisely, we analyze which outcomes would result

1. when deviations to more competitive strategies are irreversible,

2. when the players are provided with opportunities to make such deviations irreversible, and

3. when the players are provided with such opportunities arbitrarily rarely.

Below, we define precisely when a strategy would be said to be more competitive than another one,

but basically: the more competitive a strategy the less the opponent’s payoff, the own payoff is

unimodal in the competitiveness of the own strategy, and the best reply functions in terms of com-

petitiveness (i.e. mappings of opponent’s competitiveness to optimal own competitiveness) have a

unique intersection. This intersection is unique even after linearly intrapolating the best–reply func-

tions (note that we assume the strategy sets to be finite) and the unique intersection is in a pure

strategy profile. Simultaneous move games where the strategies can be ranked in this sense of com-

petitiveness are calledcompetitive games.

The resulting set of competitive games includes numerous models that have been studied in in-

dustrial organization (Bertrand/Cournot competition, rent–seeking contests, and public–good games),

behavioral economics, and political economics (e.g. arms races, trade tariffs, and environmental reg-

ulations). Interactions that are not competitive in our sense include coordination games and zero–sum

games. Let us note that competitive games in our sense are not related to competitive games as they

are defined in Friedman (1983) and Kats and Thisse (1992). Their classes of games are to generalize

zero–sum games, instead of generalizing duopolistic models of competition (as our definition).

The implications of deviations being strictly irreversible (point 1) are analyzed inirreversible

move games(IMGs): in each stage the players can repeat the previously played stage strategy or

they can deviate to a more competitive one; the game ends when both players repeat their previous

strategies, and only the finally played strategies are payoff–relevant. These games are related to

monotone games (Gale, 2001; Lockwood and Thomas, 2002), which is discussed below, and to

spectrum auctions (Milgrom, 2000; Bolle and Breitmoser, 2001).

1

For a wide range of competitive constituent games, the payoff allocations that are equilibrial

in IMGs are unique under renegotiation proofness (including all games with strategic complements

Bulow et al., 1985). We assume renegotiation proofness in the sense ofPareto perfectness(Bernheim

et al., 1987): the strategy profiles are required to induce Pareto efficient Nash equilibria in all sub-

games. Thus, we rule out threats of inefficient responses to deviations. Moreover, there is generally

a unique equilibrium where no player is worse off than in the Nash equilibrium of the constituent

game, which might be focal out of the players’ eyes (however, we do not subject this claim to more

powerful concepts of equilibrium selection).

Basically, IMGs describe the dynamics in games where the decisions have sunk–cost character.

These include games of uncovering information, the destruction of (unique) objects, and the commit-

ment of crimes (see also Schelling, 1963). Moreover, IMGs describe the dynamics that would apply

in infinite interactions, even though IMGs themselves are finite (but importantly, the time horizon is

endogenous, i.e. indefinite). For instance, the IMG equilibria are payoff–equivalent to the equilibria

of infinite games where all rounds are payoff relevant when the players do not discount future payoffs

(provided the irreversibilities apply as described above). This is the most significant characteristic

that distinguishes our approach to IMGs from irreversible move games with exogenous and finite

horizons (see Romano and Yildirim, 2004). In the latter model, the players can backward induce that

their opponent would deviate to competitive strategies in the last round, and as a result, collusive out-

comes can not be sustained in equilibrium. Moreover, since there is a last round in finite IMGs, the

respective set of equilibrium payoffs does not generalize to infinite games (similarly to the distinction

of finitely and infinitely repeated Prisoner’s dilemmas). This point is discussed below.

Another infinite game that is closely related to IMGs (as we define them) is the scope of point 2.

We define a generalized infinitely repeated game where the players do not only choose a strategy to

be played for each round (depending on the history of play), but also decide whether the respective

strategies are implemented in short–term contracts or in one–sided long–term contracts. If the latter

option is chosen, the players can deviate from that strategy only to more competitive ones in future

rounds (note that deviations to strategies that the opponent perceives as being more competitive are

supposed to be beneficial to one’s contract partners, as customers or employees).

When we can assume that the equilibrium paths would be “simple” in all subgames (as explained

next), we can show that these generalized repeated games are payoff–equivalent to IMGs (in equilib-

rium). The equilibrium path of a given subgame is understood to be “simple” when there is a unique

stage game strategy profile that is played infinitely often along this path (this profile would be called

“focus point” of the respective subgame). Note that we do not require any kind of consistency of the

focus points over different subgames. Mainly, we want (and need) to rule out circular equilibrium

paths, as those can not be replicated in IMGs; but when they are ruled out, all IMG equilibria have

corresponding equilibria in generalized repeated games, and vice versa.

2

In some circumstances (as infinitely repeated Prisoner’s Dilemmas), the assumption of simple

paths appears rather demanding, since the set of payoff profiles that can be sustained in equilibrium

shrinks dramatically. More generally, though, the stage games would have arbitrarily large strategy

sets (e.g. in Bertrand/Cournot competition, contests, public–good contributions), and then, all payoff

profiles in non–simple paths can be approximated in simple paths. If, furthermore, short–term adap-

tations of the strategies would be associated with costs, then all non–simple paths would be Pareto

dominated by nearby simple paths. In these circumstances, the equilibria in simple paths are even

unique.

We interpret opportunities to sign long–term contract as means to(re)negotiatethe payoff allo-

cation to be sustained along the equilibrium path. Short–term moves, to contrary, allow to defend

long–term allocations against short–term deviations (through temporary retaliations of such devia-

tions). This distinction suggests thatrenegotiation proofnesswould concern only the Pareto effi-

ciency of the long–term commitments in this context. Consequently, short–term retaliations need

not be Pareto efficient. We model that through applying the limit–of–means criterion in the case of

generalized repeated games (i.e. through assuming that the players do not discount future payoffs),

thanks to which temporary retaliations are payoff irrelevant in the long term (but note that even under

the discounting criterion, temporary retaliations are only marginally inefficient).

Given that we apply the limit–of–means criterion, it is even more significant that (for a wide

range of interactions) the payoff–uniqueness of the IMG equilibria continues to hold in general-

ized repeated games (e.g. in all Bertrand competitions, public–good contributions, and sufficiently

symmetric Cournot competitions and contests). For, the combined assumptions of simple paths and

renegotiation proofness merely discretize the sets of payoff profiles that can be sustained in equilib-

rium, but these sets can still be arbitrarily large (they are bounded only by the number of individually

rational and Pareto efficient stage game strategy profiles, which itself is unbounded). Thus, the reduc-

tion of the set of equilibria observed in our model stems from allowing the players to make long–term

commitments, which appears straightforward in a number of circumstances.

Finally, we define a model of repeated games where (for each round) an opportunity to sign a

long–term commitment arises only with some probabilityπ > 0. Here,π is allowed to be arbitrarily

small, and the resulting model is calledperturbed infinitely repeated games. We find that, regardless

of how smallπ is, the allocations that can be sustained in equilibrium are equivalent to those of gener-

alized repeated games, and hence to those of irreversible move games. Thus, marginal perturbations

of infinitely repeated games suffice to obtain the uniqueness of the outcomes that we find for a wide

range of IMGs. This appears interesting, as most other concepts struggle to refine the equilibria of

repeated games.

In Section 2, we shall provide a number of introductory examples (one for each of the most

prominent competitive games) that illustrate our model, our assumptions, and the main aspects of the

equilibrium induction. In this context, we also discuss the various links to previous studies of dy-

3

namic competition and irreversible moves. The two main strings of studies on irreversibilities shall

be introduced already here, however. As mentioned, one of these strings includes studies of finite

irreversible move games with exogenous time horizon Romano and Yildirim (2004) and Yildirim

(2004). We complement these studies by providing the results for several kinds of infinite interac-

tions. Moreover, Feuerstein and Gersbach (2004) study irreversible investment in infinite interactions

and describe equilibria in grim strategies. They show that for discount factorsδ near 1, collusive al-

locations can be sustained in equilibrium. As a complement to that, we showwhichequilibria result

(under renegotiation proofness) in a wider class of games, and how this can be generalized to games

with voluntary irreversibilities. Note also that Feuerstein and Gersbach (2004) survey a number of

additional empirical and theoretical studies on irreversibilities in industrial interactions.

On the other hand, there are studies of irreversible move games (e.g. Lockwood and Thomas,

2002) where precisely the opposite direction of irreversibilities is assumed. Thus, alternative de-

signs of public–good contribution mechanisms are analyzed. The applications of these models (de–

escalation mechanisms) differ from those of our model (strategies to prevent escalation). However,

the equilibrium outcomes in our case generally Pareto dominate theirs, which suggests that it is easier

to prevent escalation than to de–escalate conflicts. This is discussed in detail below (Section 2).

In Section 3, the competitive games and based on them irreversible move games are defined, and

their equilibrium paths are analyzed. In Section 4, we show that irreversible move games are (payoff–

) equivalent to generalized repeated games and perturbed repeated games. Section 5 concludes, and

some of the proofs are relegated to the appendix.

2 Some Introductory Examples

In this section, we give a number of examples that introduce our approach to irreversibilities and

infinitely repeated games intuitively, and that allow us to differentiate our model from previously

published ones. For most of this section, we focus on irreversible move games (IMGs), whereas

the relation of IMGs to generalized or perturbed repeated games is described below. To follow the

arguments in this section, the definitions of “competitiveness” and IMGs as they were provided above

(in the introduction) are sufficiently precise, but note that formal definitions are provided in Section

3.

Further terms that we use (and that are defined formally in Section 3) arestage strategyand

endpoint. We formalize the definition of IMGs by defining a tree of stages, and each stage is defined

through the history of previous strategy adaptations. In each stage the players move simultaneously,

and the sets ofstage strategiesare the strategies of the constituent game that can be played in a

given stage. The path of play is the sequence of stage strategies that is implied under a given IMG

strategy profile, and itsendpointis the profile of stage strategies that is played finally. In Section

4

4, we introducefocus pointsas the corresponding profiles in repeated games: the focus point is the

(assumingly unique) strategy profile that is played infinitely often along the equilibrium path of a

given subgame. Under the limit–of–means criterion, the focus points are uniquely payoff–relevant

in given subgames of repeated games. Finally, let us clarify that when we refer to renegotiation

proofness, we understand it in the sense of Pareto perfectness (Bernheim et al., 1987): the strategy

profiles are to induce Pareto efficient Nash equilibria in all subgames.

2.1 Repeated Prisoner’s Dilemma

In Prisoner’s Dilemmas (PDs) there are only two choices, cooperation (c) and defection (d), and we

defined to be more competitive thanc. In the following, the payoff relations are understood in one

of the usual ways, e.g. 3,2,1,0. In IMGs, the unique Pareto perfect equilibrium implies(c,c). For,

any deviation from(c,c) would result in the endpoint(d,d), and thus, a coordinated deviation from

(c,c) is Pareto dominated, and unilateral deviations imply losses to the deviating players.

In an infinitely repeated game with arbitrarily small time preferences (or under the limit–of–

means criterion), the Folk theorem of Fudenberg and Maskin (1986) applies. Thus, for each of the

payoff allocations indicated in Figure 1 there is a subgame–perfect equilibrium (SPE) where the

respective allocation describes the mean payoffs along the equilibrium path (notably, in this figure

we allow for circular paths). There are four distinguished points in Figure 1.A would result from the

permanent play of(c,d), B from (c,c), C from (d,c), andD from (d,d).

When we restrict the attention to simple paths, only the payoffs associated withB andD can

be sustained in equilibrium. When we assume Pareto perfectness and allow for circular paths (of

PDs), we find that all outcomes on the Pareto frontier can be sustained (Farrell and Maskin, 1989;

technically, these equilibria are shown to be strongly perfect in the sense of Rubinstein, 1980, and

hence they are Pareto perfect). When we assume simple paths and Pareto perfectness, the unique

corresponding outcome implies the permanent play of(c,c).

Thus, it appears that the assumption of simple paths is already sufficient to obtain the equivalence

Figure 1: Possible Equilibrium Payoffs (in Circular Paths) in Infinitely Repeated PDs

- profit 1

6

profit 2

rD

rA

rC

rBXXXXCCCC XXXX

CCCC

��

��

��

�

5

of repeated games and IMGs. This is true in the case of PDs, but it is not generally. When there are

several Pareto efficient and individually rational stage game strategy profiles, then there are also

several renegotiation–proof equilibria in simple paths—but only one them corresponds with an IMG

equilibrium. Here, it is relevant whether the players are provided with opportunities to restrict their

strategy sets (however unlikely these opportunities are). In these circumstances, the perturbation of

repeated games provides highly significant refinements upon renegotiation proofness (and even upon

renegotiation proofness in combination with simple paths).

Secondly, we see that the assumption of simple paths is necessary. For, players who can restrict

themselves in repeated games would not restrict themselves to playd in the above (circular–path)

equilibria. Such a restriction would imply the permanent play of(d,d), which is neither individually

rational nor Pareto efficient in relation to the paths in non–simple equilibria. Generally, the assump-

tion of simple paths is not as restrictive as in PDs, however, as most payoff allocations of non–simple

paths can be approximated in simple paths.

In this context, let us relate our model of IMGs to that of Romano and Yildirim (2004). They

define an IMG with an exogenously fixed and finite number of stages (T), whereas in our case, the

number of stages played is endogenous (and indefinite). Thus, there is a last period in their model, and

in that last period, the players would play a strategy profile that is a Nash equilibrium within the sets of

remaining strategies (this feature allows the play to be induced backwardly). For instance, the unique

outcome in a Prisoner’s Dilemma would be(d,d), regardless of what the number of rounds is. In our

model, to the contrary, there is no last stage where(d,d) must necessarily be played, which allows the

players to collude—leading to the unique outcome of(c,c) in PDs. This intuition generalizes to all

competitive games: in finite IMGs, the equilibria imply either the Nash outcome of the stage game, a

Stackelberg outcome, or an intermediate outcome (partially Nash, partially Stackelberg), whereas in

typical applications of infinite IMGs the unique equilibrium implies collusion at (or, near) the cartel

strategies.

2.2 Public Good Games

Next, let us consider irreversibilities in infinitely repeated public–good games. In a public–good

game, the players can contribute amounts as they are described in their strategy setsSb = {0,1, . . . ,n},and the payoff ofb is Ub(s1,s2) = δ∗ (s1 +s2)−sb. In this context, a strategysA

b is understood to be

more competitive thansBb if it implies a smaller contribution. In turn, this implies for IMGs that the

players can only deviate from their last stage contribution by decreasing it. For instance, a fisher can

enlarge his fleet, but he can not reduce it.

Apparently, under renegotiation proofness, contributions of(n,n,) must result along the equilib-

rium path. Precisely, when the players have played(1,1) in the previous stage, the situation is equiv-

alent to a PD, and further restrictions are Pareto dominated (and even weakly dominated). Hence,

6

(1,1) is an endpoint, and there is generally a player who would deviate from points as(1, i) with

i > 1. As a result, no player would deviate from restrictions to(2,2), since deviations would trigger

the endpoints(0,0) or (1,1). In this way, the backward induction can be completed to show that

(n,n) is the unique endpoint of equilibrium paths. Thus, Pareto efficiency is generally and uniquely

secured.

To the contrary, in plain repeated games, the set of contributions sustained in renegotiation–proof

equilibria includes all profiles(i,n) with i ≤ n that are individually rational. Thus, the outcomes of

repeated games differ significantly from those of IMGs (under renegotiation proofness) even if we

restrict the attention to simple paths. Since the equilibria of IMGs are payoff–equivalent to the equi-

libria of perturbed repeated games, this shows that introducing possibilities to restrict one’s options in

repeated games can have significant strategical implications. Such refinement effects can be observed

whenever there are several pure strategy profiles in the constituent game that imply Pareto efficient

payoff allocations.

Our model of public good games complements previous studies of irreversibilities in public

good games, e.g. Admati and Perry (1991), Marx and Matthews (2000), Gale (2001), and in the

most general way in Lockwood and Thomas (2002). In these studies, precisely the opposite direc-

tion of irreversibilities had been assumed, i.e. the players had been allowed only to increase their

contributions in the course of the game. The applications of these models include (Lockwood and

Thomas, 2002) donations to funds, pollution reductions, capacity reductions in a declining industry,

disarmament, and tariff reductions. Apart from fund raising, the studied models are understood as

mechanisms to de–escalate conflicts. To the contrary, we study circumstances where the players are

best off preventing the conflict from escalating in the first place.

In relation to the outcomes that result in our models, their outcomes are generally not efficient

(neither Pareto efficient nor socially efficient). Apparently, when players can only reduce their con-

tributions (as in our model), the players are generally best off imitating their opponent’s behavior

(i.e. to reciprocate negatively). However, when the contributions are only to be increased, imitating

the opponents is not generally equilibria. In fact, where one would contribute is largely independent

of how much the opponent has contributed before; mainly, it depends on whether one’s contribution

would trigger further contributions of the opponent. Generally, the latter is not satisfied at the ends

of the game tree, i.e. in subgames with previous contributions of(n−1,n−1), and iteratively, this

can prevent contributions to be made altogether.

2.3 Symmetric Competition

The next examples concern irreversibilities in models of Bertrand and Cournot competition. First,

we shall examine a model of competition where the consumers react symmetrically to decisions of

7

the duopolists. That is, the payoff functions are assumed to satisfy for all playersb

Ub(sb,s−b) = sb∗ (1−sb +αs−b) with |α| ≤ 1. (1)

Notably, if we understand thesb to represent prices, a Bertrand model results, and if they represent

quantities, a Cournot model results. Generally, the strategical issues that arise are independent of the

paradigm that we assume: n a model withα > 0, the strategies are complementary (in the sense of

Bulow et al., 1985), and in a model withα < 0, the strategies are substitutionary. However, ifα > 0

and we understand thesb as prices, the underlying goods are substitutes of the consumers’ eyes, but

if α > 0 and we understand thesb as quantities, the underlying goods are complements. In the case

of quantity competition,α = 1 implies (as a limiting case) the Cournot model of homogenous goods.

The best–reply functions, the Nash strategiessNb ≡ s∗b, and the cartel strategiessM

b are

Rb(s−b) =1+αs−b

2, sN

b =1

2−α, sM

b =1

2−2α. (2)

We see thatsNb < sM

b ⇔ α > 0. Thus, in the case of Bertrand competition andα < 0, the customers

would benefit from tacit collusion (towards the cartel prices), as it implies decreasing prices. Like-

wise, the customers benefit under Cournot competition andα > 0 (as the quantities increase), and

generally, they benefit in cases of complementary goods. Moreover, forα → 1, the cartel strategies

converge to infinity, i.e. the players would collude at nearly infinite prices or quantities.

Again, we define IMGs as games where adaptations of the stage game strategies are possible only

towards more competitive strategies. In this case, the relation “more competitive” is defined through

the relation of Nash and cartel strategies, with the Nash strategies being competitive and the cartel

strategies being collusive. Thus, ifsNb < sM

b , then a strategysAb is more competitive thansB

b iff sAb < sB

b.

As a result, the strategies may only be decremented in the course of the interaction. Alternatively, if

sNb > sM

b , then the strategies may only be incremented. Note that in the case of complementary goods,

these directions are somewhat unintuitive (increasing prices, decreasing quantities), but generally, we

would assume that competitors produce rather substitutionary than complementary goods.

Since our strategy sets are assumed to be discrete, all strategies can be represented asskb = sN

b +kεfor some sufficiently smallε > 0. The cartel prices correspond withK = ε−1∗α

2(1−α)(2−α) and k = 0

represents the competitive (Nash) strategies. Apparently, ifα > 0 thenK > 0 and if α < 0 then

K < 0. To simplify the notation, let us define the range(0, . . . ,K) to be understood as(K, . . . ,0) in

caseK < 0.

Our backward induction of the play in the irreversible move game starts ink = 0 and ends

in k = K (which is the opposite direction of the actual irreversibilities). To simplify the backward

induction, let us assume in this section that all strategiesskb that can be defined through ak∈ (0, . . . ,K)

are indeed in playerb’s strategy set. Thus, the following identity holds for both playersb∈ B.{sb ∈ Sb : sN

b ≤ sb ≤ sMb

}={

s∈ IR : ∃k∈ (0, . . . ,K) , s= sNb +kε

}. (3)

8

Figure 2:Strategic complements(i.e. α > 0, left) andStrategic substitutes(α < 0, right) End-

points in quasi continuous strategy spaces. In the case of strategic substitutes, it is supposed that (17)

applies.

-

6

Comp. ofs1

Comp. ofs2

rr

��

��

��

��

��

��

EndpointsPPPPPPi

BB

BB

BBBM "

"

""

""

"""

""

""

"""

""

"""

sN2

sM2

sN1sM

1

R2(s1)

R1(s2)

-

6

Comp. ofs1

Comp. ofs2

rr

��

QQ

QQ

QQ

QQ

JJ

JJ

JJ

JJ

JJ

QQ

QQ

QQ

JJ

JJ

JJ

��

��

��

��

��

��

��

��

��

��

��

��

��

��

��

sN2

sM2

sN1sM

1

R2(s1)

R1(s2)

Endpoints��9

PPPPPPi

As a result of the second assumption, the above game qualifies assimply structured competitive(for

a general definition see 3.16), and otherwise it would qualify ascompetitive(in the sense of 3.1).

In simply structured games, the path(skb,s

k−b) for k = 0, . . . ,K is between the best reply curves (14),

along it both players’ payoffs are increasing monotonically (15), and the payoffs associated withK

are Pareto efficient with respect to all possible payoff profiles (16).

In the unique renegotiation–proof SPE of the respective IMG,(sKb ,sK

−b) results (see also Figure

2), and all points with(skb,s

k−b) for k= 0, . . . ,K are (potential) endpoints. Since traversing along these

points fork = 0, . . . ,K implies Pareto improvements, the players are generally worse off initiating

deviations from any of these points towards a point nearerk = 0. The respective backward induction

starts ink= 1, where deviations tok= 0 (or arbitrary more competitive points) are Pareto dominated.

As a result,(skb,s

k−b) for k = 1 is an endpoint, deviations fromk = 2 would be Pareto dominated, and

so on.

Note that ifα < 0, the competitiveness of the best replies is decreasing in the competitiveness of

the opponent’s strategy. Thus, the best response to some opponent’s strategy that is less competitive

than his Nash strategy (e.g. to somek < 0) is a strategy that is more competitive than one’s Nash

strategy (i.e. with somek > 0). Generally, points(sA,sB) such thatsA ≥ RA(sB) andsB ≥ RB(sA) are

endpoints, i.e. when they are reached in IMGs, the players would not further adapt their strategies.

If α < 0, such points can be reached through unilateral deviations from anysk. In Lemma A.1 we

see that no player is better off deviating unilateral from somek ∈ (K, . . . ,0) to such an endpoint in

the above model of competition. In simply structured games in general, we assume that no player

would be better off deviating in this way (17). In our most general framework, competitive games,

we analyze the play without any such assumption.

9

In the limiting case of homogenous Cournot competition (α =−1), the players are merely indif-

ferent with respect to deviations from thesk to such extremely competitive endpoints (i.e. to endpoints

where his strategies are more competitive than his Nash strategy). Thus, the collusive endpointssk

are merely weak, but they are uniquely renegotiation proof. In equilibrium, the cartel pricessK re-

sult uniquely and the cartel surpluses are shared equally amongst the players. Let us note again

that this uniqueness provides significant refinements upon renegotiation proofness in plain repeated

games. For, even under strong perfectness and the discounting criterion, asymmetric allocations are

not ruled out in equilibrium (precisely, the players are secured only to get a quarter of the monopoly

payoff, Farrell and Maskin, 1989).

2.4 Asymmetric Competition

Briefly, let us generalize the previous example to an asymmetric family of models, such that for all

playersb

Ub = sb∗ (1−sb +αbs−b), with |αb| ≤ 1, αb∗α−b ∈ (0,1]. (4)

Basically, this analysis follows the guidelines of the previous one. Thanks to the increased generality,

however, we can now point out the potential obstacles to tacit collusion in IMGs more precisely.

Playerb’s best reply function, his opponent’s inverted best reply, his Nash strategysNb , and his cartel

strategysMb , which happens to equalsM

−b, are

Best replies: Rb(s−b) =12

(1+αbs−b) R−1−b(s−b) =

1α−b

(2s−b−1) , (5)

Equilibria: sNb =

2+αb

4−αbα−bsMb =

12−αb−α−b

. (6)

Note that

∀b pNb =

2+αb

4−αbα−b<

2+αb

4−αbα−b−2α−b−α2b

= pMb

⇔ α−b >−α2

b

2and αb >−

α2−b

2⇔

αb∗α−b>0αb,α−b > 0 (7)

As before, we assume that both Nash and cartel strategies are inb’s strategy set and that his strategy

set is discrete. Thus,b’s collusive strategiesskb ∈[sNb ,sM

b

]correspond with somek∈ (0, . . . ,K) using

K = (sMb − sN

b ) ∗ ε−1, such thatskb = sN

b + kε. Moreover, we assume again that for allk ∈ (0, . . . ,K)

the corresponding strategyskb is indeed inb’s strategy set (thus, the game might qualify as simply

structured).

For αb,α−b > 0 (strategic complements), the case is essentially equivalent to the previous one.

We can backward induce the play, and the players collude at(pK

b , pK−b

)≡(pM

b , pM−b

).

Obstacles to tacit collusion can arise ifαb,α−b < 0. Playerb is better off deviating unilaterally

from (pkb, pk

−b) to a strategy beyond his Nash prices, i.e. toR−1−b(pk

−b), if and only if αb < 0 and

10

Figure 3:Asymmetric competition The set of parameter profiles compatible with tacit collusion

- α1

6

α2

−1

−1

�

��

��

��

��

��

��

��

��

��

��

��

αb +α2−b > 0 (Lemma A.2). Likewise, his opponent is better off iffα−b +α2

b > 0. In the symmetric

caseαb = α−b (see above), none of these inequalities had been satisfied, but here, they may be

satisfied, and as a result the set of collusive endpointssk < sN may be empty. The region in the

parameter space where collusive outcomes are supported is illustrated in Figure 3, and in all of these

cases, the cartel prices result uniquely.

Besides our relation to the study of Farrell and Maskin (1989), let us relate our outcomes also to

the study of Maskin and Tirole (1987, 1988). They analyzed the implications of asynchronous moves

in combination with Markov strategies in some repeated oligopoly games (the respective equilibria

are called Markov perfect). In general, the assumption of asynchronous moves helps the players to

coordinate towards Pareto efficient equilibria (Lagunoff and Matsui, 1997), i.e. it implies renegoti-

ation proofness. In combination with Markov strategies (i.e. with strategies that are functions only

of the previously played strategy of the opponent), however, this effect is sacrificed to the favor of

strategical simplicity. Interestingly, this simplicity yields unique equilibria in a number of circum-

stances, but it rules out temporary retaliations of aggressive strategies. The latter complicates the

sustainment of collusive outcomes in equilibrium; in repeated PDs, for instance, the unique equilib-

rium implies permanent defections. Besides this simplification, the assumption of Markov strategies

offers an interpretation that we shall present below.

Similar to the case of PDs, the Markov perfect equilibria of repeated Cournot duopolies are not

collusive; but here, they are even more competitive than (and Pareto dominated by) the constituent

game’s Nash equilibrium. As described above, in our model the cartel strategies result. Maskin and

Tirole have not analyzed Bertrand competition with heterogenous goods explicitly, but it appears

that the Cournot results extend equivalently to the Bertrand case, and thus, the outcomes are never

collusive. In relation to our results, these equilibria appear somewhat unintuitive.

11

Figure 4: The situation in a rent–seeking contest

-

6

xA

xB

rr��

��

��

��

��

��

��

��

��

��

�

��

��

��

��

��

��

��

��

��

��

��

��

�

��

��

��

��

��

�

��

��

xmin

xmin14

14

1

1

RB(xA) =√

xA−xA

RA(xB) =√

xB−xB

QQQk

��

endpoints

2.5 Contests

Two competitorsA andB can choose expendituressb,s−b ∈ {smin,smin + ε,smin + 2ε, . . . ,smax} in

order to win a price which is valued 1 by both players (a case of a contest with asymmetric prices is

analyzed below). The probability thatb wins the price is Probb = sb(sb + s−b)−1 andb’s expected

payoff is

Ub =sb

sb +s−b−sb. (8)

The best reply functions are (in a quasi–continuous case)Rb(s−b) = s0.5−b− s−b ∀b, and the unique

Nash equilibrium issNA = sN

B = 14.

Strategically, contests are neither purely complementary nor purely substitutionary (see also

Figure 4). However, it can be shown rather easily that contests (as described above) qualify as simply

structured competitive for smallε. This implies that no player is better off deviating unilaterally to a

strategy beyond his Nash strategy in the sense of Equation (17), which is shown in Lemma A.3. Thus,

we can backward induce (as above) that the set of collusive endpoints is{(sA,sB) : sA = sB ≤ 1

4

}, and

that the smallest possible contributions(sminA ,smin

B

)result uniquely along the equilibrium path.

Now, let us generalize this to a contest with asymmetric valuations. The price of the contest

is worth PA for A and PB for B. To circumvent some cases that need technically more involved

treatments, let us assume thatP2b +1≥ P−b ∀b. For instance, this is satisfied in all cases with1

2P−b≤PB ≤ 2P−b ∀b, i.e. if the players do not value the price too differently. We shall describe a simply

structure instance of such a contest (generalizations can be treated similarly). The expected profits

and best reply functions are

Ub(sb,s−b) = Pb∗sb

sb +s−b−sb and Rb(s−b) =

√Pbs−b−s−b. (9)

12

Thus, the one–round Nash equilibrium strategies aresNb = cPb with c = PAPB ∗ (PA + PB)−2. The

dynamic contest is simply structured, if (for instance) the strategies can be described asin ∗sN

b , i.e. if

the strategy sets satisfy

Sb ={

r ∈ (0,Pb) : ∃i ∈ IN, r =in∗cPb

}.

We refer to the strategies of playerb assib = i

ncPb. Moreover, Equation (17) is generally satisfied for

i ≤ n andP2b +1≤ P−b (Lemma A.4). The set of potential endpoints implying tacit collusions≤ sN

is equivalent to{(sA,sB) ∈ (0,Pb)2 : ∃i ≤ n,∀b, sb =

in

cPb

}, (10)

and thus(1

ncPA, 1ncPB

)is the unique outcome of renegotiation–proof SPEs in IMGs.

There are several previous studies of dynamic contests. Most of these studies deal with two–

stage contests (Dixit, 1987; Baik and Shogren, 1992), and as a result, the induced dynamics are rather

different from those in our model. An exception is the study of Leininger and Yang (1994), who

analyze infinite–round contests and describe an equilibrium in Markov strategies and asynchronous

moves. According to this equilibrium, the firstly moving player invests half of the eventual price,

and the secondly moving one invests nothing. The dynamics induced through the Markov strategies

imply monotonically increasing expenditures off the equilibrium path, which therefore resembles our

model. Apparently, the main difference from our model (yielding the asymmetric outcome) concerns

the asynchronous bid structure. The described equilibrium is socially inefficient and, more strikingly,

it is not Pareto–superior to the Nash equilibrium of the respective one–round contest. However,

it is not Pareto–dominated by our equilibrium (contrary to Markov perfect equilibria in the above

cases of Bertrand/Cournot competition). Finally, let us refer to Yang (1993), who finds in different

circumstances that equilibria implying minimal expenditures can occur (but not uniquely so, contrary

to our results).

3 The Analysis of Irreversible Move Games

In this section, we define competitive games as two–players games with simultaneous moves, a dy-

namic game based on this class of games, and determine its equilibria. The dynamic game that we

analyze is called irreversible move game (IMG), and it serves as an abstract representation of the

strategical constellation in a number of alternative dynamic games that are introduced in the next

section.

13

3.1 Competitive Games

There are two players,A andB, and they play a simultaneous–move game with finite strategy spaces

SA andSB. Their payoff functions areUA,UB : SA×SB → IR. Let us defineS= SA×SB and the

payoff profile isU(s) =(UA(s),UB(s)

). For each player’s set of strategies we assume an ordering>

(which is transitive and complete), and this ordering is referred to asmore competitive. Based on this

ordering, we define several restrictions of the valuation functions, and the game is calledcompetitive

if all of them are fulfilled. In the following, a representative player is calledb, his strategies are

sb ∈ Sb, and his opponent’s strategies ares−b ∈ S−b.

Definition 3.1 Competitive Gamessatisfy the following characteristics (i), (ii), (iii).

(i) CompetitivenessThe more competitive the strategy, the lower the opponent’s payoff, i.e. for

all b, sb, s−b, s′−b : s−b > s′−b ⇒ Ub(sb,s−b) < Ub(sb,s′−b).

(ii) Unimodality of Partial Payoff Functions The utility functions are strictly unimodal in the

competitiveness of the own strategies and for each opponent’s strategy there is a unique best

reply. The best reply function isRb(s−b) ∈ argmaxsb Ub(sb,s−b), and the joint best–reply

function is calledR(s) := (RA(sB),RB(sA)). Formally, the payoff’s unimodality implies for all

s−b ∈ S−b and allsb 6= s′b ∈ Sb : sb≶ s′b≶ Rb(s−b) ⇒ Ub(sb,s−b)≶Ub(s′b,s−b).

(iii) Intersections of Best RepliesThe continuous functions resulting from linearly intrapolating

the (discrete) best reply functions have a unique intersection, which is in the strategy profile

s∗ ∈ S. Necessarily,s∗ is the unique Nash equilibrium in pure strategies, and for allsbR s∗b we

havesbR Rb(R−b(Sb)

).

Note that we did not assume the best reply functions to be monotonic in any sense. However, in the

literature (Bulow et al., 1985), games with best reply functions that are increasingly competitive in the

opponent’s competitiveness are said to be strategically complementary, and games with decreasing

best reply functions are said to exhibit strategical substitutes. Next let us introduce some notation

which simplifies the following analysis.

Definition 3.2 Vector relations In the following,s≤ s′ shall be understood as being equivalent to

sA ≤ s′A andsB ≤ s′B. Similarly, The strict relations< s′ is equivalent tos≤ s′,s 6= s′. The relations

�,� refer to the logical negations of the above statements. Finally, defines� s′ as being equivalent

to sA < s′−A andsB < s′B.

In this sense, we can define the maximum of a set of strategy profilesS′ ⊆ S as the set of profiles

maxS′ such thats∈maxS′ is equivalent tos∈ S′ and there exists nos′ ∈ S′ that satisfiess′ > s,

maxS′ ={

s′ ∈ S′ : ∀s′′ ∈ S′, s′ ≮ s′′}

. (11)

14

The minimum ofS′ is defined correspondingly. Finally, let us define theinterior of the strategy set

as the set of strategy profiles that are less competitive than the joint best replies.

Definition 3.3 The interior SI ⊆ Sof the strategy space isSI = {s∈ S: ∀b,sb ≤ Rb(s−b)}.

3.2 Irreversible Move Games

IMGs are multi–stage games where adaptations of the stage game strategies are restricted to be

weakly increasingly competitive. Moreover, IMGs end when a stage game strategy profile is played

repeatedly and only these finally played strategies are payoff–relevant. Each stage is described

through a history of play, and the set of all stages is

K ={(s0,s1, . . . ,st) : t ≥ 0, s0 ∈minS, ∀t ′ ≤ t : st ′ ∈ Sandst ′ > st ′−1, st ≥ st−1

}. (12)

A stage isterminal if st = st−1, and it is adecisionstage otherwise. The payoff profile induced in

terminal stages isU(st), and the set of terminal stages is calledKT . The set of strategies in decision

stages is the set of strategies that lead to further stages, i.e.{s : s≥ st}, and the set of decision stages

is calledKD.

An IMG strategy is a function describing a feasible action for each decision stage ˆsb : KD → S,

ands= (sb) denotes an IMG strategy profile. Since we restrict our attention to pure strategies, the

terminal stage that follows a given decision stage under some ˆs is unique. The last component of the

history in a terminal stage is calledendpoint, and the endpoint functiones : KD → S describes for

each decision node the endpoint that is induced under a given profile ˆs. The set ofpotentialendpoints

Es induced in a strategy profile ˆs is the set of stage game strategy profiless for which there exists a

decision stagek∈ KD such thats= es(k) is the resulting endpoint,

Es= es(KD) ={

s∈ S: ∃k∈ KD, s= es(k)}

. (13)

Next, let us describe the solution concept that we apply. We assume that the IMG strategy pro-

files under consideration are renegotiation–proof subgame–perfect equilibria in the following sense

(Pareto perfectness). Thus, we rule out threats of inefficient strategies off the equilibrium path. First,

we define the Pareto relations.

Definition 3.4 Pareto RelationsA strategy profiles′ ∈ SPareto dominatess′′ ∈ S iff U(s′) > U(s′′).

A strategy profiles is Pareto efficient with respect to a setS′ ⊆ S iff for all s′ ∈ S′ : U(s′) ≯ U(s).

Definition 3.5 Pareto Perfect Equilibrium (PPE) The strategy profile induces Pareto–efficient

Nash equilibria in all subgames (Bernheim et al., 1987).

Additionally, let us impose an assumption that simplifies some of the notation but is irrelevant oth-

erwise. In particular, it does not affect the equilibrium payoffs, i.e. we could assume precisely the

opposite direction without affecting the equilibrium payoffs.

15

Assumption 3.6 If two endpoints induce equivalent payoffs and if one of them is less competitive

than the other one, the equilibrium path implies the more competitive one.

3.3 Characteristics of Pareto–Perfect Equilibria (PPEs) in IMGs

In order to get a basis for backward inductions in IMGs, we shall first determine the endpoints that

are not below the Nash equilibriums∗. These endpoints are described in the following Lemma (its

proof is relegated to the appendix).

Lemma 3.7 Consider an arbitrary PPE ˆs of an IMG. A points≮ s∗ is an endpoints∈ E induced in

s if and only if s≥ R(s). The endpoints∗ Pareto dominates all endpointss′ ≥ s∗.

Thanks to Lemma 3.7, we know all endpointse that satisfye� s∗. In the following, we shall

backward induce the remaining endpoints.

Proposition 3.8 In any PPE of an IMG, the induced set of potential endpointsE satisfies for alls∈S

thats∈ E if and only if s Pareto dominates all points inEs = {e∈ E : e> s}. Thus,E is unique.

PROOF. The “if” part of the proposition follows directly from the characteristics of Pareto–perfect

equilibria. To prove the “only if” part, we shall contradict the opposite hypothesis. Assume there is a

PPE such that there are endpointss∈E that do not Pareto dominate all points inEs = {e∈E : e> s}.Let Econ denote the set of alls∈ E that contradict our claim, and consider anysc ∈ maxEcon. As a

result, the above proposition must be satisfied for alls′ > sc. Since it is not satisfied forsc, however,

there is somee∈ E : e> sc that is not Pareto dominated bysc, and hence, there is a playerb such that

Ub(e) > Ub(sc).

Whenb deviates unilaterally toeb, one of the Pareto efficient endpoints inE′ = {e′ ∈ E : e′ ≥(eb,sc

−b)} would result. Generally, playerb is better off in(eb,sc−b) than ine, assc

−b < e−b. All of the

Pareto efficient points ine′ 6= e∈ E′ satisfy (by assumption) thate′ ≯ e, and thuse′−b < e−b. Thus,

in any continuation equilibrium following a stage with(eb,sc−b), b’s opponent plays a stage strategy

s−b ≤ e−b. Hence, no such continuation equilibrium can imply a payoff allocation whereb is worse

off than ine, asb could deviate unilaterally to a strategy where he does not move beyondeb. As a

result,b can secure the payoff resulting ine, and ansc must not be an endpoint. QED

Corollary 3.9 There is a unique set of endpointsE that is induced in all PPEs of a given IMG.

The uniqueness of the set of endpoints does not generally imply that the equilibrium paths in all

subgames are unique. Consider a stagek ∈ KD with st = s, define the set of endpoints that can still

be reached asEs{e∈ E : e≥ s} . Under renegotiation proofness, the continuation equilibrium must

lead to one of the Pareto efficient points inEs. Thanks to the construction ofE (and thusEs), this

implies that the subgame’s equilibrium path ends in one of the points in minEs.

16

Lemma 3.10 Consider a decision stagek ∈ KD and denote the set of potential endpoints in the

resulting subgame asEst = {e∈ E : e≥ st}. Any PPE path in that subgame must imply an endpoint

in minEst .

Thus, a subgame’s equilibrium path is unique (apart from payoff–equivalence and interchangeability)

iff minimum of Est is unique. The uniqueness is examined in the following. First, consider the

following lemma.

Lemma 3.11 For all pairs of endpointse1 6= e2 ∈E : e1,e2 < s∗, eithere1 > e2 or e2 > e1 is satisfied.

Moreover, all endpointse< s∗ are interior, i.e. they satisfye≤ R(e). (The proofs are given in the

appendix.)

As a result, the set of endpoints that are less competitive thans∗, E′ := E∩{s∈ S : s≤ s∗}, can

be completely ordered by “�”. That is, we can connect all of these endpoints through a (unique)

monotonically increasing line (see the examples in Section 2). In some sense, this line characterizes

the strategical aspects of IMGs. This leads straightforwardly to the following.

Lemma 3.12 There is generally a unique endpoint that can result along the path of a PPE where

no player is worse off than in the constituent game’s Nash equilibriums∗ (the proof is given in the

appendix).

As a result, if for all endpointse∈E : e≮ s∗ we havee≥ s∗ (as in the case of strategic complements),

the endpoint that results along the equilibrium path is unique, and no player is worse off than ins∗.

Next, let us illustrate the relative equilibrium payoffs if the Pareto efficient endpoint is not unique.

Lemma 3.13 Let EPs denote the set of potential Pareto–efficient endpoints in a stagek ∈ KD with

st = s. For alle′ 6= e′′ ∈ EPs , we have for allb thate′b > e′′b ⇔ Ub(e′) > Ub(e′′) (the proof is given in

the appendix).

This observation allows us to establish the payoff–equivalence of IMGs and generalized/perturbed

repeated games (see below). In the following, we shall use it to illustrate the assumption of alternating

moves in IMGs, instead of Pareto perfectness. Here, we shall assume that for alls′ 6= s′′ ∈ S in the

constituent game and for all playersb, the induced payoffs are differentUb(s′) 6=Ub(s′′). Generically,

this is satisfied. Thus, it is straightforward to induce (backwardly) that the sets of potential endpoint

E is equivalent to that under Pareto perfectness and that Pareto–efficient paths result in all subgames.

More significantly, as a consequence of Lemma 3.13 we have for all playersb, all s∈ S, and

all stagesk∈ KD : st = s, that if e∈ Es denotes the (necessarily unique) endpoint that maximizesb’s

payoff, thene is also the unique Pareto efficient endpoint in a subgame following the play(eb,s−b).

As a result, the first moving player in a alternating–move IMG can secure the endpoint that maximizes

his payoff, and thus, the equilibrium outcome is unique.

17

Proposition 3.14 Generically, in IMGs with alternating moves, the subgame–perfect equilibrium

outcome (payoff allocation) is unique. (Note that we need not assume Pareto perfectness.)

3.4 Simply Structured Competitive Games

If the stage game is (beyond its competitiveness) simply structured in the sense defined below, the

endpoints are particularly easy to induce. All of the examples that we gave initially are simply

structured in this sense. The simplified way to induced the endpoints is defined first.

Definition 3.15 Simplified Induction Start withs0 = s∗ and construct the pointssk+1 = max{s∈S : s� sk} until the newly constructed point fails to yield Pareto improvements. The last point

implying Pareto improvements is calledsK.

Thus, we construct a sequence of pointssK < · · · < sk < sk−1 < · · · < s1 < s∗ which is connected in

the sense that the deviation from somesk to the next pointsk−1 requires precisely one step per player

(towards a more competitive strategy). This is simplified, we can not go back simply one step per

player in general. Instead, we have to look for the “next” point that Pareto dominates all endpoints

that have already been found.

The set of simply structured competitive games is defined such that all endpointse< s∗ can be

derived in the simplified induction if and only if the IMG is based on a simply structured game. Note

that the remaining endpointse≮ s∗ are described in Lemma 3.7. Thus, when we have shown that a

game is simply structured, all endpoints can be derived easily.

Definition 3.16 Simply structured competitive gameA simply structured competitive game is a

competitive game that satisfies for the(sk)Kk=0 constructed in the stepwise induction

∀k : sk ≤ R(

sk)

(14)

∀k′ < k′′ : U(

sk′)

< U(

sk′′)

(15)

∀b, ∀k, ∀sb > s∗b :(

sb,sk−b

)≥ R

(sb,s

k−b

)⇒ Ub

(sb,s

k−b

)≤Ub

(sk)

. (16)

∀s< sK : Ub(s) ≯ Ub(sK) (17)

∀s : s≥ R(s) ands≯ sK : U(s) < U(sK) (18)

Verbally, in simply structured competitive games, the path fromsK to s∗ lies between the best reply

curves (14), symmetric deviations to more competitive strategy profiles are Pareto dominated (15),

deviations to endpointse≮ s∗ are generally not profitable (16), the players would generally deviate

from s < sK to sK (17), andsK Pareto dominates all potential endpointss≥ R(s) that can not be

reached anymore whensK is reached (18).

18

Proposition 3.17 The “simplified induction” yields the (unique) set of endpoints satisfyinge≤ s∗ if

and only if the stage game is simply structured competitive. Along any equilibrium path,sK results

(the proof is skipped).

4 Dynamic Games that Are Payoff–Equivalent to IMGs

4.1 Infinite Irreversible Move Games

In infinite irreversible move games (IIMGs), stage game strategy adaptations are irreversible as in

IMGs, but the game never stops and all rounds are payoff–relevant. The set of stages in IIMGs is

K ={(s0,s1, . . . ,st) : t ≥ 0, s0 = minS, ∀t ′ ≤ t : st ′ ∈ Sandst ′ ≤ tst ′−1

}. (19)

There are no terminal stages. An IIMG strategy ˆsb maps each stage to a feasible move, i.e. to some

s∈ {s′ ∈ S: s′ ≥ st}. The path of play followingk∈ K according to ˆs= (sb) is (recursively)

p(k) = (pt ′(k))t ′>t using pt ′(k) = s(k, ∪t ′′: t<t ′′<t ′ pt ′′(k)) , (20)

and the endpoint of a subgame is the (unique) stage game strategy profile that is played infinitely

often,

e(k) ∈{

s∈ S: ∃t ′ ∀t ′′ > t ′ s= pt ′′(k)}

. (21)

Based on that, we can define the set of potential endpointsE equivalently to (13). We assume that

the players aggregate the infinite payoff stream according to the limit–of–means criterion (LMC), i.e.

the payoff ofb in the IIMG is

ULMCb = liminf

t→∞

1t ∑

t ′≤t

Ub(pt ′(s0)

). (22)

Thanks to LMC (in combination with Pareto perfectness), the payoff–equivalence of IMGs and

IIMGs is straightforward. That is, only the endpoints are payoff relevant in IIMGs, equivalently

to IMGs, and hence, the backward inductions of the endpoints that would result in given subgames is

equivalent in both games. Likewise, the endpoints that would result in given subgames must be the

Pareto efficient ones in the respective sets of potential endpoints.

Proposition 4.1 The setsE of potential endpoints are equivalent in IMGs and IIMGs, and the end-

points that can result along the equilibrium paths of PPEs are equivalent in all subgames, too.

4.2 Generalized Infinitely Repeated Games

In plain infinitely repeated games (IRGs), the stage game strategies are unrestricted, the game never

ends, and all rounds are payoff relevant. The set of stages isK = {(s0, . . . ,st) : t ≥ 0} and an IRG

strategy ofb is a function ˆsb : K → Sb. The payoff (under LMC) is defined equivalently to (22).

19

We restrict our attention to equilibria insimple paths: for each subgame there is a unique stage

game strategy profile that is played infinitely often. These profiles are calledfocus points, but note

that we do not assume the focus point to be equivalent in all subgames. Using (21) the focus point

along the equilibrium path can be defined asf = e(s0), and likewise, we can define a function that

all maps stagesk∈ K to the resulting focus points. Finally, equivalently to (13) we can define the set

of potential focus points that result according to a strategy profile ˆs in arbitrary subgames.

Apparently, all focus points that induce payoffs which Pareto dominate the minimax payoffs can

be sustained in SPEs. For instance, Rubinstein (1979) and Fudenberg and Maskin (1986) describe

(Folk theorem) equilibria that hold equivalently under the limit–of–means criterion (LMC). More-

over, since temporary retaliations are payoff–irrelevant under LMC, the Folk theorem equilibria are

even Pareto perfect iff all focus points inF are Pareto efficient points with respect toS.

Now we generalize IRGs to generalized infinitely repeated games (GIRGs). Here, the players are

additionally allowed to impose restrictions upon their (own) stage game strategy sets for the stages

following a decision stage. The players can restrict themselves by stating: “In all of the following

rounds, I would play a strategy that is as least as competitive as my current one.” That is, the players

can lock in their current strategies as lower limits for the following stages. When the current stage

game strategy is locked in, a flag is set tof = 1, and otherwise it isf = 0. The set of stages is

K ={(s0, f0,s1, f1, . . . ,st , ft) : t ≥ 0, s0 = minS,

∀t ′ ≤ t : st ′ ∈ S, ft ′ ∈ {0,1}2, st ′ ≥ smin(s0, f0, . . . ,st ′−1, ft ′−1)}, (23)

using

sminb (k) = max{sb ∈ Sb : ∃t ′ ≤ t, st ′,b = sb and ft ′,b = 1}, (24)

as the strategysb ∈ Sb that defines the restriction that playerb has committed to in stagek. Precisely,

he is committed to playsb ≥ sminb (k). The set of stage strategy profiles is the set of profiles that

leads to further stages. A GIRG strategy ˆsb of playerb maps each stage to a feasible stage strategy.

Moreover, for a given GIRG strategy profile ˆs, let us define a path functionp that maps each stage to

the implied path of payoff–relevant moves(st+1,st+2, . . .).

As in IRGs, we restrict our attention to strategy profiles ˆs that induce simple paths. That is, for

all k ∈ K, there is a uniques∈ S that is played infinitely often alongp(k). Again, thiss is called

focus point of the subgame followingk. Let f (k) denote the focus point resulting in the subgame

following k, and equivalently (13), let us defineF as the set of potential focus points. Under LMC,

the payoff in the continuation equilibrium followingk implies the payoff allocationU( f (k)) and the

equilibrium payoff allocation isU( f (s0)).

20

4.3 Payoff–Equivalence of IMGs and GIRGs

In the following, we concentrate on Pareto–perfect equilibria (PPEs) of IMGs, and on PPEs in simple

paths in GIRGs. The set of potential endpoints in IMGs is calledE, and the endpoints that can still

be reached in a subgame following a stagek with st = s is Es = {e∈ E : e≥ s}. In PPEs of IMGs,

the continuation equilibrium of such a subgame must imply one of the Pareto–efficient endpoints in

Es. Moreover, the set of potential focus points in GIRGs isF , and the focus points that can still be

reached in a stage withsmin(k) = s is Fs = { f ∈ F : f ≥ s}.

The set of endpointsE in IMGs is unique. Consider a PPE of an GIRG ˆs for which there is a

stagek such that (usings = smin(k)) the resulting focus point is not a Pareto efficient point inEs.

Define the setS′ ⊆ S that contains thesmin(k) for all k where such violations occur. We concentrate

on a stagek implying smin(k)∈maxS′, and there it holds for all following stagesk′ that if smin(k′) > s

then the induced focus point is a Pareto efficient point inFsmin(k′) ≡ Esmin(k′). Let f denote the focus

point implied in the continuation equilibrium followingk and letsr denote the restrictions that the

players eventually restrict to along the equilibrium path followingk.

First, consider the casesr 6= s, i.e. the restrictions are adapted in the subgame followingk. By

assumption, the focus pointf is a Pareto efficient point inEsr but not a Pareto efficient point inEs.

Since all points that are not Pareto efficient inEs are Pareto dominated by at least one of the Pareto

efficient points inEs, the path implyingf must not be Pareto perfect.

Secondly, consider the casesr = s, i.e. the restrictions are not adapted, but a focus pointf results

which is not a Pareto efficient point inEs. Here, let us distinguish two subcases:s would result in

a corresponding IMG subgame,s∈ Es, or it would not,s /∈ Es. If s /∈ Es, then for any pointf ≥ s

such thatf is not Pareto efficient inEs, there is an endpointe∈ Es and a playerb who is better off

in e than in f (by construction ofE). Assume that playerb deviates unilaterally from ˆsby restricting

himself to play strategiessb ≥ eb. Thanks to Lemma 3.13, we know that all endpoints, which are

Pareto efficient inEs and more competitive than(eb,s−b) must provideb with a payoff that is at least

as high as his payoff ine. By assumption, the path leads from(eb,s−b) to one of these points, and

hence,b gains by deviating unilaterally from ˆs. Thus,s is not an SPE.

Alternatively, if s∈ Es (i.e. the players would coordinate ons in the IMG), thens is the unique

Pareto efficient endpoint inEs. By assumption,f 6= s. Construct the setE′s = {e∈ E : e> s} as

the endpoints that are strictly more competitive thans. If f is not inE′s, then there is a player who

would profitably deviate to someeb (as in the previous case), and ˆs is not an SPE. Iff is in E′s, then

the play of f is Pareto dominated bys. Moreover, no player has incentives to deviate unilaterally

from restrictions tos, as the resulting endpoint must be Pareto dominated. As a result, sticking to

restrictions ofs and to the focus pointf = s is the unique Pareto efficient equilibrium. In turn, the

abovely assumed profile ˆs is not renegotiation proof. Hence, the GIRG focus points that can result

are equivalent to the IMG endpoints that can result.

21

Proposition 4.2 Consider an arbitrary simple–path PPE ˆs′ of a GIRG and denote the set of potential

focus pointsF . Consider an arbitrary PPE ˆs′′ of an IMG and denote the set of potential endpoints

E. E andF are equivalent for all constituent games that are competitive. Moreover, for anys∈ S,

consider a GIRG stagek′ such thatsmin(k′) = s and an IMG stagek′′ such thatst = s. Both, the

focus point f (k) induced in ˆs′ and the endpointe(k) induced in ˆs′′, are Pareto efficient points in

Es≡ Fs = { f ∈ F : f ≥ s}, and for all such points, there are corresponding equilibria.

Implicitly, the set of focus pointsF induced in PPEs of GIRGs is unique, one of the points in

minF results along the equilibrium path, and there is a unique focus point that can result along an

equilibrium path which Pareto dominates (weakly) the stage game Nash equilibriums∗.

4.4 Perturbed Infinitely Repeated Games

PIRGs are obtained when we adapt generalized repeated games by introducing a move of Nature in

each stage, and this move determines whether the players are allowed to lock in their strategies in the

current stage (nt = 1 with probabilityπ > 0) or not (nt = 0 with probability 1−π > 0). The players

are informed about Nature’s move before they decide. Ifnt = 1, the players’ flags have to satisfy

ft,b ∈ {0,1}, and ifnt = 0 the flags have to satisfyft,b ∈ {0}. The set of stages is

K ={(n0,s0, f0,n1,s1, f1, . . . ,nt ,st , ft) : t ≥ 0, s0 = minS,

∀t ′ ≤ t : (nt ′,st ′, ft ′) ∈ {0,1}×S×{0,1}2, ft ′,b ≤ nt ′ ∀b,

∀t ′ ≤ t : st ′ ≥ smin(s0, f0, . . . ,st ′−1, ft ′−1)}. (25)

The restrictionssminb (k) and the pathp(k) apply as they are define above. A strategy ˆsb maps all

stages to feasible stage strategies. Again, we concentrate on PPEs insimple paths, but we shall adapt

this restriction to acknowledge that the focus points may depend on how Nature moves. Ex post, the

moves of Nature following a stagek can be described as a sequencen∞ ∈ {0,1}∞, and we require

that the focus point is unique only ex post: for each stagek∈ K and for each forthcoming sequence

n∞ of moves of Nature, there is a unique stage strategy profile being played infinitely often along

the equilibrium path (thefocus point). Since the moves of nature are unknown ex ante, the focus

points that may result need to satisfy certain consistency requirements, but those are not relevant in

the following analysis (hence, we skip formalizations of them).

Focus points that result only with probability zero are payoff–irrelevant and can therefore be ne-

glected when we show that the equilibria of PIRGs are payoff–equivalent to the equilibria of IMGs.

Let f (k) denote the set of focus points that can result with positive probability in the subgame fol-

lowing stagek∈ K, and letF denote the set of focus points that can result with positive probability

in arbitrary subgames.

We want to show thatF is equivalent to the set of potential IMG endpointsE. In the arguments

leading to Proposition 4.2, we showed that players would deviate from focus points that can not result

22

in corresponding subgames of IMGs, and we relied on at most one round where the players adapted

their commitments. In any stagek of a PIRG, the players will have an opportunity to do so almost

surely (regardless of how smallπ is). Hence, the corresponding IMG endpoints would result almost

surely, and deviations from IMG endpoints would not occur with positive probability. This argument

leads to the following proposition.

Proposition 4.3 Consider a PPE in simple paths of a PIRG with an arbitraryπ > 0 inducing the

potential focus pointsF , and an IMG PPE inducing the set of endpointsE. F andE are equivalent

for all constituent games that are competitive. Moreover, consider an arbitrary PIRG stagek inducing

restrictionss= smin(k), and denote the set of focus points that are sustained with positive probability

in the subgame following stagek as f (k). The set f (k) is a subset of the focus points that can

result in corresponding IMG subgames,f (k) ⊆ EPs with EP

s as the set of Pareto efficient points in

Es = {e∈ E : e≥ s}.

Apparently, forπ approaching zero, the PIRGs approach IRGs. Hence, marginal perturbations

of infinitely repeated games suffice to reduce the set of payoff allocations that can be sustained

in simple–path PPEs to the set of payoff allocations that can be sustained in IMGs. This is quite

remarkable, as (for constituent games with large strategy sets) the assumption of simple paths is

hardly restricting. Moreover, since marginal perturbations of IRGs suffice to generate this effect,

our approach is merely refining the equilibria of IRGs. This is interesting, as the usual refinement

concepts are rarely effective in IRGs (see, e.g. Rubinstein, 1980).

5 Discussion

Above, we have analyzed strategic irreversibilities, and mainly, we did so with respect to the unique-

ness of collusive equilibria. Judged by our results, deterrence and tacit collusion is to be observed

frequently (i.e. nearly as frequently as irreversible moves). These results are supported by some

experiments on multiple–round auctions with perfect irreversibilities (Breitmoser, 2004a). Theo-

retically, the main exception appears to be the case of strategic substitutes (Cournot competition),

where there may be empty sets of collusive endpoints when it pays to deviate unilaterally beyond

the Nash equilibrium strategy. Empirically, the evidence supporting our theoretical results appears

mixed. Some contradictions might be due to the weakness of the existent collusive endpoints (when

unilateral deviations are cheap), as in the case of Cournot competition. Other contradictions might be

due to imperfections of the irreversibilities, which is more likely to apply to Bertrand competitions

(and it might be worst when the number of competitors is high). An extreme example highlighting

the latter point is reported by Bolle and Baier (2004), who analyze the German call–by–call market.

More contradictions appear to be explainable through referring to emotions (as hate, which may

override rationality), or through referring to inconsistent and/or false beliefs. For instance, consider

23

the situation in Israel. Both sides, the Israel government and Hamas, appear to believe that rigorous

actions bring the other side to give in. In cases like those, where our predictions appear to be falsified,

the introduced model might still be useful to as a basis in illustrations of possible reasons that lead

to escalations of conflicts. In the case mentioned, the contradictions might partly be handled through

assuming bounded rationality on (at least) one of the sides. A still wider range of phenomena might

be explained through altering the assumptions in one of the following directions.

1. static interpersonal concerns (altruism, spite) or dynamical models of emotions (reciprocity),

2. auction–like competitions where overbidding is required,

3. endogenous discretization of the strategy sets, or

4. number of players greater than two.

The first two points are combined in (e.g.) Breitmoser (2004b), where the equilibria are predomi-

nantly collusive as well. In general, however, the introduction of emotions would notably affect the

resulting set of endpoints. The most drastic of effects of interpersonal concerns can be observed in

pure Cournot games (symmetric competition withα = −1, see Section 2.3), when the competitors

aremarginallyspiteful: all collusive endpoints will disappear. Obviously, interpersonal concerns (as

spite or altruism) are likely to be relevant in personal relationships and political games. However,

such concerns are not significantly less appealing in market games. “Altruism” can be thought of as

applying between firms which own shares of each other, or of firms which partly belong to the same

owner (Bolle and Güth, 1992). “Spite” may capture a heuristic resulting from a long–term strategy

aimed at weakening a threatening rival.

The third point concerns the step sizes along which the ideally continuous strategy spaces are

partitioned by the players. Generally, this partitioning affects the outcomes that can be sustained in

renegotiation–proof equilibria, and thus it might be considered to be a part of the game. However, our

results are robust with respect to arbitrary symmetric partitions of the strategy spaces, and asymmetric

partitionings may appear difficult to motivate. In particular, it appears difficult to motivate how the

players can credibly commit to such partitionings. A similar problem arises in irreversible move

games based on non–competitive games. There, it needs be discussed for each class of constituent

games, how the players can credibly restrict their options.

Finally, the above list suggests to extend the analysis to models with more than two players.

Generally, the analysis is likely to become considerably more complex, as a variety of asymmetric

equilibria can result. For example, in public good games partial free riding equilibria are likely to

result.

24

References

Admati, A. R. and Perry, M. (1991). Joint projects without commitment,Review of Economic Studies

58: 259–276.

Baik, K. H. and Shogren, J. F. (1992). Strategic behavior in contests: Comment,American Economic

Review82(1): 359–362.

Bernheim, B. D., Peleg, B. and Whinston, M. D. (1987). Coalition–proof nash equilibria: I. concepts,

Journal of Economic Theory42(1): 1–12.

Bolle, F. and Baier, A. (2004). Zyklische Preisentwicklung im offenen Call-by-Call-Markt,

Schmalenbachs Zeitschrift für betriebswirtschaftliche Forschung (forthcoming).

Bolle, F. and Breitmoser, Y. (2001). Spectrum auctions: How they should and how they should not

be shaped,FinanzArchiv58(3): 260–285.

Bolle, F. and Güth, W. (1992). Competition among mutually dependent sellers,Journal of Institu-

tional and Theoretical Economics (JITE)148(2): 209–239.

Breitmoser, Y. (2004a). Collusion and competition in laboratory simultaneous multiple–round auc-

tions, Discussion Paper 179, Europa–Universität Viadrina, Frankfurt (Oder).

Breitmoser, Y. (2004b). Equilibrium refinement based on interpersonal preferences, Discussion Paper

183, Europa–Universität Viadrina, Frankfurt (Oder).

Bulow, J. I., Geanakoplos, J. D. and Klemperer, P. D. (1985). Mulitmarket oligopoly: Strategic

substitutes and complements,Journal of Political Economy93(3): 488–511.

Dixit, A. (1987). Strategic behavior in contests,American Economic Review77(5): 891–898.

Elster, J. (1984).Ulysses and the Sirens: Studies in rationality and irrationality, rev. edn, Cambridge

Un. Press.

Elster, J. (1998). Emotions and economic theory,Journal of Economic Literature36: 47–74.

Farrell, J. and Maskin, E. (1989). Renegotiation in repeated games,Games and Economic Behavior

1(4): 327–360.

Feuerstein, S. and Gersbach, H. (2004). Is capital a collusion device,Economic Theory39: 76–83.

Friedman, J. W. (1983). On characterizing equilibrium points in two person strictly competitive

games,International Journal of Game Theory12(4): 245–247.

Fudenberg, D. and Maskin, E. (1986). The folk theorem in repeated games with discounting and

incomplete information,Econometrica54: 533–554.

25

Gale, D. (2001). Monotone games with positive spillovers,Games and Economic Behavior

37(2): 295–320.

Kats, A. and Thisse, J.-F. (1992). Unilateral competitive games,International Journal of Game

Theory21: 291–299.

Lagunoff, R. and Matsui, A. (1997). Asynchronous choice in repeated coordination games,Econo-

metrica65(6): 1467–1477.

Leininger, W. and Yang, C.-L. (1994). Dynamic rent–seeking games,Games and Economic Behavior

7: 406–427.

Lockwood, B. and Thomas, J. P. (2002). Gradualism and irreversibility,Review of Economic Studies

69: 339–356.

Loewenstein, G. (2000). Emotions in economic theory and economic behavior,AEA Papers and

Proceedings90(2): 426–432.

Marx, L. M. and Matthews, S. A. (2000). Dynamic voluntary contribution to a public project,Review

of Economic Studies67: 327–358.

Maskin, E. and Tirole, J. (1987). A theory of dynamic oligopoly iii,European Economic Review

31: 947–968.

Maskin, E. and Tirole, J. (1988). A theory of dynamic oligopoly,Econometrica56(3): 549–569,571–

599.

Milgrom, P. (2000). Putting auction theory to work: The simultaneous ascending auction,Journal of

Political Economy108(2): 245–272.

Romano, R. and Yildirim, H. (2004). On the endogeneity of Cournot–Nash and Stackelberg equilib-

ria: Games of accumulation,Journal of Economic Theory (forthcoming).

Rubinstein, A. (1979). Equilibrium in supergames with the overtaking criterion,Journal of Economic

Theory21: 1–9.

Rubinstein, A. (1980). Strong perfect equilibrium in supergames,International Journal of Game

Theory9(1): 1–12.

Schelling, T. C. (1963).The Strategy of Conflict, Harvard University Press.

Strotz, R. (1956). Myopia and inconsistency in dynamic utility maximization,Review of Economic

Studies23: 165–180.

26

Sylos–Labini, P. (1962).Oligopoly and Technical Change, Harvard University Press, Cambridge,

Mass.

Yang, C.-L. (1993). Cooperation by credible threats: On the social costs of transfer contests under

uncertainty,Journal of Institutional and Theoretical Economics149(3): 559–578.

Yildirim, H. (2004). Contests with multiple rounds,Games and Economic Behavior(forthcoming).

A Proofs

Lemma A.1 The model of symmetric competition based on (1) satisfies (17).

PROOF. The inverse best reply function is

R−1−b(pk

−b) =1α

(2pk

−b−1)

=1α

(2

2−α+2kε−1

)=

11−α

+2kεα

. (26)

The payoff from sticking to the path(pkb, pk

−b) is

Πb(pkb, pk

−b) =(

1− 12−α

−kε+α

2−α+αkε

)∗(

12−α

+kε)

(27)

=1

(2−α)2 +αkε− (kε)2∗ (1−α). (28)

The payoff from the deviation onto the opponents best reply curve is

Πb

(R−1−b(pk

−b), pk−b

)=(

1− 12−α

− 2α

kε+α

2−α+αkε

)∗(

12−α

+2α

kε)

(29)

=1

(2−α)2 +αkε− (kε)2∗(

4α2 −2

). (30)

Thus, the former is greater than the latter, iff

1−α <4

α2 −2 ⇔ α2(3−α) < 4, (31)

which is satisfied for allα ∈ (−1,0). QED

Lemma A.2 In the model of competition described in (4), playerb is better off deviating unilaterally

from (pkb, pk

−b) to R−1−b(pk

−b) in the stage game if and only ifαb < 0 andαb +α2−b > 0.

PROOF. Thepkb andpk

−b satisfy

pkb := pN

b +kK

(pM

b − pNb

)=

2+αb

4−αbα−b+

kK∗

2α−b +α2b

(4−αbα−b) (2−αb−α−b), (32)

pk−b := pN

−b +kK

(pM−b− pN

−b

)=

2+α−b

4−αbα−b+

kK∗

2αb +α2−b

(4−αbα−b) (2−αb−α−b). (33)

27

UsingR−1−b(pM

−b) = αb+α−bα−b(2−αb−α−b)

, we can calculateR−1−b(pk

−b) as

R−1−b(pk

−b) = pNb +

kK

[R−1−b(pM

−b)− pNb

]=

2+αb

4−αbα−b+

kK∗

4αb +2α2−b

α−b(4−αbα−b)(2−αb−α−b).

(34)

In cases where no playerb is better off deviating, the players collude at(pMb , pM

−b). The payoff from

not deviating is

Πb(pkb, pk

−b) = (1− pkb +αbpk

−b)∗ pkb (35)

=(

2+αb

4−αbα−b

)2

+kK∗ 2+αb

4−αbα−b∗

2α2b +αbα2

−b

(4−αbα−b) (2−αb−α−b)

+(

kK

)2

∗α4

b +α3bα2

−b +2αbα3−b−4α2

−b

(4−αbα−b)2 (2−αb−α−b)2

and the payoff from deviating is

Πb

[R−1−b(pk

−b), pk−b

]=[1−R−1

−b(pk−b)+αbpk

−b

]∗R−1

−b(pk−b)

=(

2+αb

4−αbα−b

)2

+kK∗ 2+αb

4−αbα−b∗

2α2b +αbα2

−b

(4−αbα−b) (2−αb−α−b)

+(

kK

)2

∗8α3

bα−b +8α2bα3

−b−16α2b−16αbα2

−b +2αbα5−b−4α4

−b

α2−b (4−αbα−b)2 (2−αb−α−b)2

Playerb is not better off deviating iff

Πb(pkb, pk

−b)≥Πb

[R−1−b(pk

−b), pk−b

](36)

α4bα2

−b +α3bα4

−b ≥ 8α2bα3

−b−16α2b−16αbα2

−b +8α3bα−b (37)

α3bα2

−b∗(αb +α2

−b

)≥(8α2

bα−b−16αb)∗(αb +α2

−b

)(38)

Obviously, this is fulfilled ifαb + α2−b = 0. Additionally, if αb > 0 or αb + α2

−b < 0 the relation

simplifies to

α2bα2

−b +8(1−αbα−b)︸︷︷︸>0

≥−8, (39)

which is fulfilled, too. However, if (and only if)αb + α2−b > 0, playerb is better off deviating to

R−1−b(pk

−b), as the relationα2bα2

−b + 8(1−αbα−b) ≤ −8 can not be fulfilled in our circumstances.

QED

Lemma A.3 In the contest according to (8), Equation (17) is satisfied.

PROOF. If it is less more profitable for some playerA to deviate from a symmetric collusive point

(x,x) to the endpoint(R−1−b(x),x), then (in a quasi–continuous case)

x2x−x >

yy+x

−y, with y =

(12

+

√14−x

)2

. (40)

28

which simplifies to

12−x > x ⇔ x <

14

(41)

which is fulfilled for thex in question. QED

Lemma A.4 In a contest according to (9), Equation (17) is satisfied.

PROOF. Equation (17) is equivalent to

Ub

(ln

cPb,ln

cP−b

)≥Ub

(ln

cPb +n− l

nPb,

ln

cP−b

)≥Ub

(R−1−b

(ln

cP−b

),

ln

cP−b

)Pb

Pb +P−b+

n− ln

Pb ≥(n+ lc− l)Pb

(n+ lc− l)Pb + lcP−b

P−b +1Pb

− nlPb ≤

lcP−b

(n+ lc− l)Pb⇔ n− l

n+ lc− l≤ n

l

P2b

P−b+

1P−b

which is satisfied forl ≤ n andP2b +1≤ P−b. QED

Proof of Lemma 3.7. If any playerb deviates from a points= (sb,s−b) ≮ s∗ with sb ≥ Rb(s−b) to

a more competitive strategys′b, then whatever endpoints′′ > s is reached finally,b is worse off since

Ub(s) > Ub(s′b,s−b)≥Ub(s′′). (42)

The first inequality stems fromsb ≥ Rb(s−b) and 3.1.(ii), the second one from 3.1.(i). Hence, any

s : s≥ R(s) is an endpoint. Similarly, we see thats∗ Pareto dominates any points′ > s∗: for each

playerb, he is better off in(s′b,s∗−b) than ins′ and he is better off ins∗ than in(s′b,s

∗−b).

To show thats≥ R(s) is even necessary fors ≮ s∗ being an endpoint, let us assume thatb’s

opponent has moved beyond his Nash strategy,s−b > s∗−b, and defines′b := Rb(s−b). Thanks to

3.1.(iii), s−b > s∗−b implies s−b ≥ R−b [Rb(s−b)] and thus we have(s′b,s−b) ≥ R(s′b,s−b). Hence,

(s′b,s−b) is an endpoint.

If s≥ s∗, there must be someb : sb < Rb(s−b) such thats−b > s∗b. In that case,b can deviate

profitably and unilaterally to the endpoint[Rb(s−b),s−b]. If s� s∗, then there is a playerb : sb < s∗b.

As a result of 3.1.(iii), we can rule out thatsb ≥ Rb(s−b) ands−b < R−b(sb). Hence,sb < Rb(s−b),

andb can deviate unilaterally and profitably to the endpoint[Rb(s−b),s−b]. Thus,s : s� s∗, s� R(s)

can not be an endpoint. QED

Proof of Lemma 3.11. Here, we make use of the notations1∨s2 :=(max{s1,A,s2,A},max{s1,B,s2,B}).The proof results from the following three claims.

Claim A.5 If for any pair of endpointse1 6= e2 < s∗ in an IMG bothe1 � e2 ande2 � e1 hold, then

the pointe3 = e1∨e2 must be outside the interiorSI of the strategy set (proved in the appendix).

29

PROOF OFCLAIM . Suppose to the contrary that there would be two endpointse1,e2 ∈ E that have

e1 � e2, e1 � e2, ande1∨e2 ∈ SI . This implies thate1,e2 ≤ s∗. Without loss of generality, we shall

assumee1,A > e2,A. Sincee3 ∈ SI and thuse3 ≤ s∗, we know

UA(e3) > UA(e1), UB(e3) > UB(e2). (43)

As a result of that,A’s payoff in e3 must be high enough to Pareto dominate all endpointse′ ≥ e1,

which is a superset of all endpointse′ ≥ e3. Similarly,B’s payoff ins3 must be high enough to Pareto

dominate all endpointse′ ≥ e2, which again includes all endpointse′ ≥ e3. Hence,e3 must be an

endpoint itself; and in turn,e1,e2 must not be, since eitherA or B is better off deviating toe3. a

Claim A.6 If for any pair of endpointse1 6= e2 < s∗ both e1 � e2 ande2 � e1 hold, then the point

e3 = e1∨e2 must be an endpoint, too.

PROOF OFCLAIM . Supposee3 < s∗ is not an endpoint. Let us definee′ as an arbitrary endpoint that

is more competitive thane3, i.e.e′ ∈ {s∈ E : s> e3}. If e3 is outside the interior, there is a playerb

such thate3,b > Rb(e3,−b). Since the payoff functions are unimodal and monotonically decreasing in

the opponent’s competitiveness, this implies

Ub(e3) > Ub(e′b,e3,−b) > U(e′) (44)

that b is strictly better off ine3 than in any more competitive endpoint. Thanks to characteristic

3.1.(iii), we know thate3,−b < R−b(e3,b). Without loss of generality, assume thate1,−b < e3,−b.

Hence,b’s opponent is better off ine3 than ine1, and sincee1 is an endpoint, he is better off ine1

than in all more competitive endpoints. This extends toe3, and thus, both players are better off ine3

than in any more competitive endpoint. As a result,e3 must be an endpoint ande1 must not be an

endpoint. a

Claim A.7 There are no endpointse< s∗ that are outside the interior.

PROOF OFCLAIM . Suppose to the contrary,e < s∗ is an endpoint that is outside the interior and

define playerb such thateb > Rb(e−b). Now, constructe′−b = R−1b (eb). Equivalently to argument

leading to (44),b is better off in(eb,e′−b) than in any more competitive point. Moreover, in competi-

tive games,e′−b < R−b(eb), and thusb’s opponent is better off in(eb,e′−b) than ine. Thus, similarly

to the above arguments,(eb,e′−b) is an endpoint andemust not be an endpoint. a

Combined, these claims complete the proof. QED

Proof of Lemma 3.12. Only a point in minE can result along the equilibrium path. Sinces∗ ∈ E,

there must be a pointe∈minE such thate≤ s∗. Moreover, as a result of Lemma 3.11, the endpoint

30

e∈minE : e≤ s∗ is unique. That is, there is a unique endpointe≤ s∗ that can result in equilibrium,

and there is generally such an endpoint.

Alternatively, some endpointse≮ s∗ may result in equilibrium. In all of these endpoints, how-

ever, at least one of the players is worse off than ins∗. For, in all of these endpointse there is some

playerb : eb < s∗b, and this player is better off ins∗ than ine. Precisely, he is better off(eb,s∗−b) than

in e, and better off ins∗ than in(eb,s∗−b). Hence, the unique endpoint that can result in equilibrium

where no player is worse off than ins∗ is e∈minE : e≤ s∗. QED

Proof of Lemma 3.13. SupposeEs is the set of potential endpoints in a stagek ∈ KD with st = s,

andEPs is the set of Pareto efficient points inEs, which impliesEP

s ∈ minEs. As a result, for each

pair of pointse′ 6= e′′ ∈ EPs , we have (for all playersb) thate′b > e′′b ⇔ e′−b < e′−b.

There are three groups of potential endpoints that we have to establish the claimed payoff relation

for. First, endpointsemay satisfyeb < s∗b ande−b > s∗−b; secondly, they may satisfye≤ s∗ (of which

there is only one such point); thirdly, the may satisfyeb > s∗b ande−b < s∗−b. We have to proof the

claim for all possible combinations of these groups: two points within group 1, two points within

group 3, group 1 vs. group 2, group 2 vs. group 3, and group 1 vs. group 3.

First, we relate a pointe1 of group 1 tos∗. Sincee1−b > s∗−b, we haveUb(e1) < Ub(e1

b,s∗−b).

Moreover,e1b < Rb(s∗−b) impliesUb(s∗) > Ub(e1

b,s∗−b), and hence,Ub(s∗) > Ub(e1). The above re-

lations hold equivalently, when we substitutes∗ with an arbitrary pointe′ of group 1 that satisfies

e′b > e1b, or when we substitutes∗ with an arbitrary pointe2 < s∗ of group 2. Thus, for the first two

combinations of groups, the claim is proved.

Secondly, we relate a pointe3 of group 3 tos∗. Equivalently to the above induction, we can

show that the payoff ofb’s opponent is higher ins∗ than ine3, i.e.U−b(s∗) > U−b(e3). However,

e3 is known to be Pareto efficient, which can only be satisfied ifUb(s∗) < Ub(e3). Equivalently, this

relation holds if we substitutes∗ with an arbitrary pointe′ of group 3 that satisfiese′b < e3b, or when

we substitutes∗ with an arbitrary pointe2 < s∗ of group 2.

Finally, we have to relate pointse1 of group 1 to pointse3 of group 3. In any subgame with

restrictions, where both kinds of endpoints can be reached, we haves� s∗. Thus, there is an interior

endpoint (at leasts∗) that can be reached, too, and the endpointse1 ande3 are not Pareto dominated

by that one. Thus, we can combine the above cases to prove the claimed payoff relation: playerb is

better off ins∗ than ine1, and he is better off ine3 than ins∗. Hence, he is better off ine3 than ine1.

QED

31

Dynamic Competition with Irreversible Moves · For a wide range of competitive constituent games, the payoff allocations that are equilibrial in IMGs are unique under renegotiation

Documents