H.G. de Wolff - Distributing according to Nash

Distributing according to Nash

First insertion on Heterodox Gazette Sam de Wolff: 2 september 2018

E.A. Bakkum is a blogger for the Sociaal Consultatiekantoor. He loves to reflect on the labour movement.

The present column gives a detailed explanation of the axiomatic model of John Nash for the analysis of bargaining. Both the symmetric and the general variant are discussed. The model of Nash is associated with the model of Rubinstein for a negotiation with time pressure, with utilitarianism, and with the proportional method. It is shown, that from this Binmore derives a model of social morals. Finally the models are compared with experiences from practice.

The economic theory of the twentieth century mainly analyzes situations, where the distribution of wealth is given. In such a situation the social optimum can be realized by a paired exchange of goods between individuals, which is modelled graphically by the Edgeworth box. However, the Gazette has from the beginning also paid attention to the influence of group power to the social distribution. Incidentally, power itself is already relevant in the Edgeworth box, because it determines within the core of possible distributions which point on the contract curve is selected. Here and there in the Gazette references are made to marxism, which explains the distribution of incomes by the power, which originates from the property of capital goods. A distribution of income based on power is a bargaining process.

Naturally here a brief reference must be made to the scientific work of the marxist Sam de Wolff, the namegiver of the Gazette. He notably introduces an ingenious bargaining model, which wants to explain the distribution of the national income. According to De Wolff, the workers demand as their reward a fixed minimal utility¹ . Nowadays this minimum would be called the reservation utility of the workers. Next the enterprise can determine the working hours. He selects these in such a manner, that his profit is maximal. The model of De Wolff shows, that it is possible to mathematically model the bargain about the distribution. Here the negotiation is still one-sided, because in marxism the workers can merely enforce their own reproduction. After the Second Worldwar the mathematician John Nash has derived a morel general bargaining model, and this is the theme of the present column.

The model of Nash

Various columns have applied the model of Nash, without explaining its theoretical foundations. A year ago the general Nash solution of a bilateral negotiation has been described, for the case of a network. Previously various columns have applied the model to the bargaining between trade unions and enterprises. Recently a model has been presented, where the state bargains with a firm about a subsidy. A more recent column relates the Nash solution to the social welfare function (SWF). So it is indeed time to study the Nash solution more closely. It has an interpretation, which deviates from the welfare functions. Namely, the SWF describes the preferences of a central planning agency with respect to the distribution. The Nash solution holds for the situation, where a neutral arbiter decides about the distribution problem. The conceptual difference is indeed marginal² .

Nash has developed his model for a bilateral transaction. However, it can be applied simply to larger groups. Suppose that the studied group consists of N individuals. The members of the group are together involved in a transaction or enterprise, which creates value. Let x_k (k=1, ..., K) be the quantities of benefits k resulting from the activity. The group must compromise about the distribution of the benefits x_k. Benefits are by definition useful. The preferences of the individual n are described by the utility function u_n(x₁(n), ..., x_K(n)), where x_k(n) is the quantity, which is received by n. Each distribution leads to a utility vector u = (u₁, ..., u_N). The total of utility vectors u forms a set U of utility possibilities. In other words, the deliberations about the enterprise create the set U. The figure 1a shows such a set for the case N=2 (in pink)³ .

Picture of sets of utility possibilities

Figure 1: Sets of utility possibilities:
(a) status quo ν and solution σ;
(b) symmetrical set and expansion

The transaction only proceeds, when all N individuals agree. When no agreement is reached, then the existing situation remains intact. This utility point ν is called the status-quo. See the figure 1a. Loyal readers will recognize an analogy with the Edgeworth box, which has a similar starting point ξ, albeit of quantities of benefits x_k. The set of utility possibilities obviously has a boundary. For, the quantities x_k are also bounded. This frontier is represented by the utility possibility curve. It is shown in red in the figure 1a. Nash assumes, that the set of utility possibilities is convex⁴ . Next he derives the solution σ of the distribution problem by proposing five properties, which could in reason hold for the wanted solution. This is called an axiomatic approach.

The five properties will each be discussed now. The first property is individual rationality. This is to say, the individual n will not accept solutions, which are worse than his status quo ν_n. In the figure 1a this property is represented by dashed lines for both individuals. The solution can not be below or to the left of the dashed line. An individual would indeed be foolish to cooperate in a transaction, which would only bring him disutility. Apparently ν_n is the reservation-value of n. For a singular transaction ν_n is the threat- or breakpoint. In an ongoing (continuously repeating) transaction, ν_n is the point of impasse or deadlock⁵ .

The second property is the Pareto efficiency. The individual will only accept solutions, which are on the curve of utility possibilities. For, as long as this is not the case, he can improve his utility without hurting the utility of the other individuals. Note that the Pareto property requires, that all individuals dispose of complete information about the transaction. Namely, only then can they determine their maximally realizable utility. Suppose that the agreement is a (vector) contract σ. As an illustration the figure 1a shows this contract σ, located on the curve of utility possibilities. Note that the properties of individual reationality and Pareto efficiency are also revelant for the Edgeworth box. Here they define the so-called core ⁶ .

The third property is symmetry, also called anonimity. It holds for symmetrical sets U, as is shown in the pink area of the figure 1b. Such a set remains exactly identical, when it is mirrored in the 45^o line coming from the origin (dashed in the figure 1b). The mirror-image is in fact a permutation of the individuals 1 and 2. Now the symmetry property states, that for a symmetrical set all σ_n are identical. In two dimensions σ_n is on the 45^o line, with σ₁=σ₂. The interpretation is, that identical individuals have the same utility in the transaction. Nobody is discriminated.

The fourth property is the independency of irrelevant alternatives. Suppose that during the deliberations about the contract σ an event occurs, which is irrelevant for the contract. The status quo ν does not change. The event does result in an expansion of the set U, for instance with the light-green parts in the figure 1b. Now the property requires, that such an irrelevant expansion does not change the optimal contract σ. The requirement seems trivial⁷ . The property can also be presented reversely. Suppose that originally the green curve of utility possibilies is valid. The optimal contract is σ. The possibilities in the green area are irrelevant for the contract σ. Now these possibilities are eliminated. Then in the new situation the contract will still be σ.

Figure 2: Set of utility possibilities:
(a) translation of the utility field (u₁, u₂);
(b) relative scaling with factors (β₁, β₂)

The fifth and final property is the independency of linear utility transformations. Such a transformation has the form u'_n = α_n×u_n + β_n, where α_n and β_n are constants. So the transformation varies for each individual n. The explanation becomes more clear by considering the translation α and the relative scaling β separately. The figure 2a shows the translation. The whole utility field is simply shifted with respect to the frame of axes. The new status quo is μ. Since the utility to the left of ν₁ and below ν₂ is meaningless because of the individual rationality, the situation has not really changed. The new Nash solution is simply shifted in the same manner, and is τ. The independency of the translation is convenient, because ν can be shifted to the origin. This makes the problem more transparant.

Nash assumed an independency of the relative scaling β, because he wants to avoid interpersonal utility comparisons. For, the utility u_n is expressed in the utils (utility units) of n. When the contract σ would depend on β_n, then the utility of the individuals n and m could be compared by means of the ratio β_m/β_n. Traditionally economists are reserved in this regard, because then actually a moral judgement would be made between the interests of the individuals n and m. Concretely, the independency for relative scaling β implies, that the scaling changes the contract σ in τ, with elements τ_n = β_n×σ_n. See the figure 2b. This is to say, the old contract simply scales in the same manner as the utility field. Or more formally: let B be the scaling. Then the contract for B(U) with status quo B(ν) equals B(σ).

Now, by means of these five properties the Nash solution for a negotiation can be defined. It is a recipe for calculating the vector σ(U, ν), which represents the agreement between the N group members. In other words, σ is determined by the set U of utility possibilities, the status quo ν, and the recipe. This recipe is:

(1) maximize for the vectors u in U: Π_n=1^N (u_n − ν_n)

In the formula 1, Π is the mathematical symbol for multiplication of the N terms. Nash discovered, that this solution satisfies the five mentioned properties. Moreover it can be proved, that the recipe of the formula 1 is the only (unique) solution⁸ .

The following notes concerning the formula 1 are worth mentioning. Actually the five properties have more resemblance to an arbitrage (judgement about a conflict) than to a negotiation. This has already been remarked in the introduction. It is supposed, that the N individuals cooperate. In particular it is ignored, that they could mutually form coalitions in order to obtain a better result for themselves than the Nash solution. Incidentally, this was not relevant for Nash, because he studied the case N=2. Furthermore, the Nash solution is not fair in the sense, that the status quo ν is accepted as fixed. And the status quo could be extremely unfair. The injustice in the starting position is also found in the similar model of the Edgeworth box. In particular, the model of Nash is not a recipe for maximizing the social welfare⁹ .

The reader must be aware, that the independency of linear utility transformations has radical consequences. For instance, it allows to make the status quo ν invisible by means of a translation to ν'= 0. The Nash method indeed considers merely the differences u_n − ν_n. Besides, the maximally realizable u_n for all individuals n can be made identical by means of the relative scaling, for instance to u_n^max = 1. Then the difference between the individuals n is determined merely by the form of the boundary of the set U. It is intuitively logical, that a (qua utility) wealthy individual n with a strong status quo ν will be capable of enforcing a transfer of a significant part of the yield of the transaction to himself. For, the utility often correlates with money, and money represents power. Then the boundary of U will be changed to his advantage. It is curious that this aspect is not discussed in the consulted sources¹⁰ .

The general solution of bargaining

In practical cases it is often necessary to give up the third property, the symmetry or anonimity. Then a parameter γ_n≥0 is added to the model, which characterizes the individual n. This modelling is called a general Nash negotiation. The general recipe to find the Nash solution is:¹¹

(2) maximize for the vectors u in U: Π_n=1^N (u_n − ν_n)^γ(n)

Loyal readers will observe a resemblance between the formula 2 and the individual utility function in the rational choice theory, which has the form u = Π_m=1^M c_m^γ(m). In the utility function, γ_m represents the weight, which the individual attaches to his need to own a quantity c_m of the good m. In the same way γ_n in the formula 2 expresses the weight, which the arbiter or referee attaches to the interests and needs of the individual n, but now in comparison with the other individuals j≠n of the group.

As an illustration of the general solution a special case can be considered, namely a transaction with N=2, where the set U of utility possibilities is given by u₁ + u₂ ≤ π ¹² . This is to say, thanks to the transaction a quantity utility π can be distributed. It can be shown, that in this case the most reasonable contract along the lines of Nash is given by¹³

(3) σ_n = ν_n + (π − ν₁ − ν₂) × γ_n / (γ₁ + γ₂)

Here one has n=1 or 2. Since the solution does not depend on translations, one may assume ν=0. Then the individual n apparently obtains a fraction γ_n / (γ₁ + γ₂) of the available utility π. For this reason γ_n is sometimes interpreted as the bargaining power of n. This would then be the power, which the arbiter gives to the individual n. In the case of equal power the distribution is fair: σ_n=π/N.

The model of Rubinstein

Worth mentioning is the model of Rubinstein, which derives the general Nash solution in a totally different way¹⁴ . This model is part of game theory, and is called the game of alternate offers. The transaction concerns a group of two members (N=2), and produces a total utility of π ¹⁵ . The two individuals make alternate offers for the distribution of π. Note that they are both egoistically, and therefore do not aim at a just solution, like the arbiter. Therefore this is called a transaction without cooperation (although the individuals must naturally complete the enterprise together). Each time after an individual has made an offer, the other withdraws for a period Δt in order to consider the offer. The negotiation has a certain time pressure, because the individuals prefer to benefit from their utility as soon as possible. Therefore the utility is devalued with time, with a discount factor δ_n per period Δt.

Suppose that it is the turn of individual n to make an offer. Each rejected offer reduces the value of the transaction, because hereafter a time Δt will pass. Therefore n (=1 or 2) wants to make an offer, which is acceptable for the other individual m≠n. The individual n calculates, that it is optimal for m to offer a fraction x_n of the total value to n. Then m himself obtains a fraction x_m = 1 − x_n. However, since m will make his offer later, the value of the transaction has again been reduced by δ_m with respect to the present. The individual n can make a convenient use of this devaluation. Namely, he offers to m a fraction δ_m×x_m. This offer is precisely worth as much as the offer, which m will do a period Δt later. This is to say, m is indifferent with respect of these two offers. Therefore m will agree with the present offer. And n obtains a fraction x_n = 1 − δ_m×x_m.

Suppose that n by presumption or laxness makes an offer, which is too low. Then it is m's turn to make his best offer. The presented argument is also applicable to m. This is to say, his best offer is δ_n×x_n. A rational n will agree, so that m gets a fraction x_m = 1 − δ_n×x_n. Substitute this formula in the found expression for x_n, then the result is¹⁶

(4) x_n = (1 − δ_m) / (1 − δ_n×δ_m)

Here one has of course δ_n×δ_m=δ₁×δ₂. The formula 4 is based on the assumption, that n can make the first offer. This gives him an advantage. Assume for the sake of convenience, that δ₁=δ₂=δ holds. Then the formula 4 becomes identical to x_n = 1/(1+δ). Therefore m gets a fraction x_m = 1 − x_n = δ/(1+δ), so a factor δ less than n.

The discount factor δ_n and the discount rate d_n are related by means of δ_n = 1/(1+ d_n). Let r_n be the discount rate, which represents the continuous devaluation, namely the devaluation for extremely small time intervals. Now it can be proved, that one has δ_n = exp(-r_n × Δt) ¹⁷ . Substitute this relation in the formula 4. It can be shown with the obtained formula, that in the limit Δt→0 one has x_n = r_m / (r₁ + r₂) ¹⁸ . Here one obviously has n=1 or 2, n≠m, just like before. So this distribution x_n occurs for offers with very small periods in-between. To be more precise, it can no longer be assumed, which individual makes the first offer. The distribution is always the same, irrespective of who begins¹⁹ . For the comparison of the solution of Rubinstein with the Nash solution the following form is more convenient:

(5) x_n = (1/r_n) / ((1/r₁) + (1/r₂))

A comparison with the formula 3 shows, that the distribution x_n from game theory is identical to the solution σ_n of the general Nash negotiation, under the condition that 1/r_n = γ_n holds! According as the individual n has a smaller continuous discount rate r_n, he gains power in the general Nash negotiation. This has some logic, because an individual with a small r_n is slow in devaluing the value π of the transaction. Such an individual is patient. Apparently patience leads to power²⁰ . Furthermore, note that a long-lasting deadlock would have the consequence, that the transaction loses all of its value, due to the discount factors. Here the deadlock can be interpreted as the status quo. The breaking point consists of the external alternatives of the individuals. Here they are irrelevant, because the model of Rubinstein offers a certain solution²¹ .

Models with alternative axioms

In the paragraph about the Nash solution the approach has been called axiomatic. The axioms lead to the Nash function f_N(U, ν) in the formula 1 ²² . Although the five axioms of Nash are logical, one could also invent other axioms. Alternative axioms lead to another function f(U, ν). Certainly when f_N is interpreted as the utility function of the arbiter, it is logical to search for alternative functions, which are also social welfare functions²³ . The utilitarianism and the principle of proportionality lead to well-known welfare functions, namely

(6a) f_U(U, ν) = Σ_n=1^N α_n × (u_n − ν_n)
(6b) f_P(U, ν) = minimum of α_n × (u_n − ν_n) for n=1, ..., N

In the formula 6a the term -Σ_n=1^N α_n × ν_n is commonly ignored, because it is a constant. Welfare is always relative. But it turns out that in the present argument the notation of the formula 6a is convenient. It is immediately clear, that neither the utilitarian nor the proportional approach satisfies the independency of linear utility transformations. They also do not satisfy the property of symmetry (anonimity). On the contrary, the weighing factors α_n discriminate between individuals, and in the formula 6b ν_n also discriminates. Therefore neither f_U nor f_P are sound functions for modelling the bargaining. Nevertheless they are important for the present column, because they have a special relation with the model of Nash.

Namely, the maximization of f_U(U, ν) and f_P(U, ν) commonly leads to two different optima on the curve of utility possibilities U. However, consider the special case, that both functions yield the same optimum σ for the welfare. Equate the value of the function f_U(σ, ν) in the point σ to W_σ. By definition in the proportional optimum one has, that all α_n × (σ_n − ν_n) have identical values. Thus one finds, that the common point σ must satisfy

(7) σ_n = ν_n + W_σ / (N × α_n)

Figure 3: Model of Nash:
    functions for the solutions of
    Nash, utilitarianism,
    and the proportional method

As an illustration the figure 3 shows the coincidence of the two optima for the case N=2. It strongly resembles the figure in another recent column, about welfare economics. The green line represents the function f_U, and the blue line shows the relation between u₁ and u₂ according to f_P. The slopes of the lines have the same absolute value, namely α₁/α₂, but it is negative for f_U. The intersection of the two lines naturally is the point σ, located on the boundary of U. Note that the line f_U through σ also contains the point u = [ν₁, ν₂ + W_σ/α₂]. So in this point u₂ − ν₂ is exactly two times as large as σ₂ − ν₂, this is to say, u₂ − σ₂ = σ₂ − ν₂ ²⁴ .

The case of a shared welfare optimum σ in utilitarianism and the proportional method has the typical property, that σ is also the solution of symmetric Nash bargaining (so with γ_n=1 in the formula 2)! This can be shown in the following manner²⁵ . First note that the Nash function f_N(U, ν) in the formula 1 can be rewritten as

(8) g_N(U, ν) = Σ_n=1^N ln(u_n − ν_n)

For, g_N = ln(f_N) is maximal for the same u as f_N itself. Furthermore note, that the formula 7 can be rewritten as α_n = (N / W_σ) / (u_n − ν_n), calculated in the point u=σ. This is to say, α_n = (N / W_σ) × ∂(ln(u_n − ν_n))/∂u_n, in u=σ. According to the formula 8, this is identical to α_n = (N / W_σ) × ∂g_N/∂u_n, in u=σ. Now the vector α is on the one hand the normal of the hyperplane, defined by f_U=W_σ ²⁶ . On the other hand, the vectorial gradient ∂g_N/∂u_n is the normal of the (N-1)-dimensional surface (set, if desired) g_N(u, ν) = V, where V is a constant²⁷ . Therefore the surface g_N(u, ν) = V_σ, which contains the point σ, must touch the hyperplane f_U=W_σ, where σ is the only shared point. But then g_N(u, ν) = V_σ touches the set U in σ. Apparently σ is indeed the point of U, which makes g_N and f_N maximal. Because of the formula 1, then σ is the Nash solution of the transaction. The orange curve in the figure 3 shows f_N(u, ν) = exp(V_σ) for N=2.

The identification of the Nash solution with the utilitarian and proportional solutions with identical weighing factors α has a peculiar consequence, which is not stated clearly in the consulted sources. Namely, the Nash solution assumes the property of symmetry (anonimity). However, the utilitarian and proportional solutions empathically do not satisfy this property. Apparently the restriction of a proportional distribution removes the interpersonal comparison from the utilitarian welfare, and conversely. Nevertheless, this shows that the Nash solution can indeed be interpreted as a result, which is obtained by means of moral reflections, based on interpersonal comparisons²⁸ .

Morals according to Binmore

An important incentive for analyzing the transaction model of Nash is to gain more insight in the model of society of Ken Binmore, which is completely based on the Nash model. Half a year ago it has been explained in a column, that Binmore attributes the development of morals to the constitutional deliberations in the sense of the philosopher J. Rawls. The social contract is determined in deliberations of all citizens, which are done behind a veil of ignorance. Binmore states, that the veil filters the social position out of the memory of each citizen. However, the transparency of the veil is sufficient for observing the status quo ν of society. The deliberations can be modelled with the model of Nash. The citizens defend their own interest, but they are also capable of empathy. They can imagine themselves in the situation of others. This is essential for living together.

In a column of a month ago it is explained, that Binmore in his model of society makes a convenient use of the coincidende of the solutions of Nash, utilitarianism, and the proportional method (which is a variant of the maximin principle). Ordinary negotiations and the constitutional deliberations both follow the recipe of Nash. During the deliberations there is anonimity (symmetry), so that a moral judgement of interests is in danger of being omitted. However, this deficit is corrected by relating the Nash solution σ to utilitarianism and the proportional method. For, the weighing factors α_n are a moral judgement about the manner of distributing the yield among the individuals. Thanks to empathy and the veil the deliberation ends in an agreement about the optimal values of α_n. Binmore calls the α_n the empathic preferences with regard to the various social positions n.

Figure 4: Sets of utility possibilities:
(a) proportional growth; (b) disproportional growth

Since the deliberation about the social contract is a rare event, in the short run the α_n values are fixed²⁹ . Apparently then the values of α_n are in equilibrium. They can be adapted in the medium term, during the various constitutional deliberations. In the way the social changes can be taken into account. Then the interpersonal utility comparison α_n/α_m with n≠m is adapted to the new status quo. The various utility units (utils) get a different value. So the equilibrated morals evolve in a path-dependent manner. During the deliberations the morals are based on the utilitarian welfare function (formula 6a)³⁰ . For, there is a desire for a high and increasing welfare. The utilitarian solution is an expression of collective empathy. The deliberations must finally lead to consensus about a future society σ, and a plan (agreement) to realize it.

This social contract is only useful, when the growth path from ν to σ is feasible. The completion of the growth requires a certain time. This is daily life in the short term, where the individuals defend their own interest. Therefore they demand the application of the proportional solution. But then the growth to σ must follow the line through ν and σ, with in each point the same α_n × (u_n − ν_n) for all N. See the figure 4a. In other words, the economic growth will by definition shift the status quo ν. When this shift does not follow the mentioned line, then the citizens will demand another Nash solution σ. They would like to renegotiate the contract. See the figure 4b³¹ . Therefore the proportionality is essential. Thus the feasibility imposes a restriction upon the utilitarian maximization. This guarantees that the social contract becomes self-enforcing or self-policing (internalized).

Thus it seems that the scientific merit of Binmore is mainly, that he gives a social meaning to the existing scientific models. The ideological starting point is the image of man as an empathic egoist. In the status quo the individuals together conclude a social contract (constitution), which mirrors the collective morals. The individual empathic preferences are sufficiently similar for realizing a collective compromise. The result is the Nash solution, a compromise between welfare and justice, here in the sense of reciprocity. Binmore places the interpersonal utility comparison at the centre, as the basis for subjective ethics³² . Then the proportional approach obtains the meaning of the individual self-commitment to the contract. In this manner Binmore presents an interesting abstract frame of thinking. However, your columnist does not know any practical applications of the model of Binmore³³ .

Bargaining in practice

It is obviously important to know whether the model of Nash agrees with the practical experiences in the real world. The behavioural economics has studied negotiations in laboratory experiments. In such experiments it turns out, that the individuals interpret the available information in a biased manner (self-serving bias)³⁴ . This is evidently not rational. In such situations an arbiter can correct the views of the concerned actors. The model of Rubinstein has also been simulated in experiments. It turns out that in repeated games some individuals indeed learn to apply the formula 4. However, the learning process can be slow³⁵ . Apparently, at least in such experiments the assumption of rationally acting individuals is doubtful.

The symmetrical model of Nash ignores the influence of individual power. But often power will precisely be a decisive factor for the distribution. In the rational choice theory power is equated to the total value of the resources, which are available to an individual. In a previous column about this theory it has been stated, that wealthy individual can reinforce their position during a transaction by "buying" support. They have a lot to offer to the individuals in their network. Also beyond the rational choice theory the economic theories give a prominent place to power, such as the models of rent seeking. The role of power in the model of Binmore is difficult to interpret. One wonders, why a powerful individual would want to forget his position during the deliberations behind the veil³⁶ . On the other hand, power is perhaps indirectly included in the model of Nash, by means of the form of the boundary of the set U.

Furthermore, there is an enormous amount of manuals, which give advise about the strategy in negotiations, A special mention deserves the book Excellent onderhandelen (in short EO), which originates from the Harvard Negotiation Project, and therefore has a scientific foundation³⁷ . The authors emphasize the importance of the best alternative to a negotiated agreement (in short BATNA). Since the BATNA is simply the status quo ν_n, thus the model of Nash is confirmed. For instance, ν_n can be the utility value, which the individual n would obtain, when an external arbiter or court would decide about the distribution of the surplus. In EO it is emphasized, that the participants in the negotiations about the transaction must try to improve their status quo (breaking point, security level, reservation utility), so that ν_n is actually dynamic.

Another advise of EO is to study the mutual interests, because this can probably increase the available yield of the transaction. Then the set U expands. This is also called the win-win strategy. Showing empathy is a desirable property. In this regard, EO joins the view of Binmore. It also recommends to base the negotiations on objective criteria (costs, norms, conventions, scientific ideas, and the like). This increases the chance of a rational result. And the chosen solution must not be affected by irrelevant factors. In particular, EO points out, that the good relation between the individuals must be separated from the agreement. Thus various properties of the Nash solution are addressed in EO.

Figure 5: DGB poster

But there are also differences between EO and the model of Nash. Notably, EO absolutely does not assume symmetry or anonimity for the various participants in the negotiation about the transaction. A skilled negotiator gets better results than emotional or rigid individuals. The individual power, such as having an attractive BATNA, partly determines the distribution of the surplus. This criticism on the model of Nash obviously also holds for the model of Rubinstein. On the other hand, it must be acknowledged that an asymmetric situation will probably be rare in institutional bargaining, such as between pressure groups.

Among the Dutch publications, the book Onderhandelen (in short OH) has gained some prestige³⁸ . It argues, that bargaining requires a behaviour, which mixes cooperation and competition³⁹ . The arbiter in the model of Nash, who carefully weighs all interests, precisely suggests, that cooperation is desirable. In OH the bargaining is presented as a pre-eminently dynamic process, where each individual exerts influence on the balance of power, the prodedures, the atmosphere, and the personal rank-and-file. There clearly is not a situation of individual anonimity. According to OH, it is true that the negotiator must be prepared to compromise, but yet he must defend his own interests (u_n). This requires a behaviour, varying between exploring and remaining unrelenting. The exchange of information is rarely complete, because it is used in a tactical manner. Therefore the set U of alternatives is never known completely.

The conclusion of the model of Rubinstein is generally supported by the experts. For, the importance of patience is mentioned in all manuals about bargaining⁴⁰ . This literature confirms, that the available information is never complete⁴¹ . The behaviour during the negotiations can be irrational⁴² . Sometimes the win-win strategy is propagated⁴³ . Although the importance of the BATNA (ν) is acknowledged, yet the general view is, that it must always be attempted to break the deadlock⁴⁴ . Deadlocks are simply a natural part of the negotiation.

It is curious that little attention is paid to the true execution of the agreement⁴⁵ . For, Binmore rightly points out, that a contract is fragile, as long as it insufficiently meets the interests of the concerned individuals. A good agreement must encourage the self-commitment. The concerned group as a whole must be dedicated to its observance. And finally, especially the popular literature hardly distinguishes between political-institutional and daily bargaining⁴⁶ . So it is logical that it differs somewhat from the models in this column, which probably mainly aim at institutional processes.

See p.368 and 449 of Het Economisch Getij (1929, J. Emmering) by S. de Wolff. (back)
See p.838 of Microeconomic theory (1995, Oxford University Press) by A. Mas-Colell, M.D. Whinston and J.R. Green, or p.123 in The economics of the trade union (1996, Cambridge University Press) by A.L. Booth. On p.209 in Rational-Choice-Theorie (2011, Juventa Verlag) by N. Braun and T. Gautschi it is indeed assumed, that the Nash solution maximizes the welfare of the concerned group. For your columnist this step is just a bit too far. (back)
Some textbooks, including those of the economist K. Binmore, do not discuss the model of Nash with a utility set U, but with the set X of possible yields x (pay-off). Your columnist does not (yet) have a preference for the method with U or X. He follows the convention in Microeconomic theory, which is based on U, because this books is impressive due to its conscientious arguments. The set U has a variable size, so that apparently U is not determined uniquely by the transaction. First, U can also include situations, where members hurt themselves by only taking in burdens. Even a negative utility is conceivable. Besides, U can also include the utility, which does not logically follow from the transaction. Such an irrelevant utility does not affect the optimal distribution according to Nash. See further on in the column the independency of irrelevant alternatives. (back)
According to p.44 and 946 in Microeconomic theory a set U is convex, when for two points u₁ and u₂ of U also all points in the connecting line are in U. A function f(x) can also be convex (p.931). This is the case, when for all x in the interval [x₁, x₂] one has that the line between f(x₁) and f(x₂) is above f(x). For a differentiable f this corresponds to ∂²f/∂x² > 0. In other words, f rises or falls faster and faster, for an increasing x. The concept concave is the reverse of convex. In a concave set the connecting line is outside of the set. For a concave function ∂²f/∂x² < 0 holds. Thus the confusing situation is, that the convex set of the main text has a curve (boundary) of utility possibilities, which can be represented by a concave function. U is convex, because the utility function u_n(x_k) is concave (p.930). Namely, according as the individual n gets more of x_k, the marginal utility ∂u_n/∂x_k decreases. When the individual m also gets less of x_k, then the marginal utility ∂u_m/∂x_k increases. In this situation the marginal substitution rate du_m/du_n becomes more and more negative. This makes in the figure 1a indeed likely, that the curve of utility possibilities is a concave function. Note that the presence of this boundary of U implies, that U is a closed set (p.943-944).
These definitions can also be used in spaces with N dimensions (R^N). In this case the equivalent of the curve of utility possibilities is the set with N-1 dimensions, which is the boundary of U. Usually this is not a hyperplane. Your columnist likes to call this an (N-1)-surface. (back)
See p.66 in Just playing (1998, The MIT Press) by K.G. Binmore. The remark about ν as the deadlock is also made on p.389 van Labor economics (2004, The MIT Press) by P. Cahuc and A. Zylberberg. On p.473 in Game theory and political theory (1993, Cambridge University Press) by P.C. Ordeshook, ν_n is called the security level, in the context of game theory. (back)
This remark is made on p.470 in Game theory and political theory. (back)
It is difficult to imagine, how the solution can depend on irrelevant alternatives. On p.471 in Game theory and political theory the following example is given. A guest orders soup in a restaurant. The menu gives the choices of vegetables soup and tomato soup. The guest orders vegetables soup with the waiter. Now the waiter tells, that chicken soup is also available. The guest understands that this new information increases his set of alternatives, and this is useful. However, he hates chicken soup. Therefore he decides to change his order from vegetables soup into tomato soup. (back)
The uniqueness of the Nash solution σ is proved on p.843-844 in Microeconomic theory. The argument is as follows. Do for the sake of convenience the translation to ν=0 (properties 1 and 5). Let σ be the Nash solution in the formula 1, based on the maximization of f_Nash(U) = Π_n=1^N u_n, where the set U is arbitrary. Suppose there is a second recipe for a solution, besides the formula 1, now based on a function g(U). Call this solution τ. It will be proved by means of two auxiliary sets E and V, that this recipe based on g(U) must lead to the Nash solution σ. Consider first the set E, which is defined by Σ_n=1^N u'_n ≤ N. The set E is symmetrical, so that the solution τ of g(E) must lead to an equal distribution of u'_n (property 3). This implies τ_n=1 for all n. Due to the independency for relative scaling there is the freedom to scale τ according to u_n = β_m×u'_n, with β_n = σ_n (property 5). This changes the set E into the setV, which is defined by Σ_n=1^N u_n/σ_n ≤ N. Note, that due to the relative scaling τ changes into σ. Because of Pareto efficiency, the solution σ is on the boundary of V (property 2). The boundary of V is formed by a hyperplane, just like incidentally the one of E (p.64). Apparently the sets U and V share the point σ. Now it can be shown, that U is included in V. Since σ is a solution for both sets, apparently the extra alternatives in V are irrelevant. The solution does not depend on these alternatives (property 4). But this implies, that also the solution τ of g(V) is shared by U and V. Since τ and σ coincide in V, this is also the case in U. In other words, g and f_Nash are identical, which was to be proved. Remains to show, that U is included in V. This can mathematically be proved as follows. The function in the formula 1 with ν=0 can be rewritten in the form f_Nash(U) = Σ_n=1^N ln(u_n). Apparently the normal of the body f_Nash = constant is identical to ∂f_Nash/∂u_n, so to (1/u₁, ..., 1/u_N) (p.934). By definition f_Nash is maximal in σ. There the normal is (1/σ₁, ..., 1/σ_N). This is also the normal on the hyperplane, which is the boundary of V. Apparently U and V do not merely share the point σ, but this is even their point of tangency. Since moveover V is a hyperplane, U is included in V. By the way, it deserves mentioning, that your columnist has learned this theory of geometry more than forty years ago as part of a study in physics. However, this is advanced stuff. Even for your columnist, who since then has never used the knowledge, the insight has become rusty, and in an ideal world it should be revitalized. (back)
The possibility of an unfair ν is suggested on p.577 in Public choice III (2009, Cambridge University Press) by D.C. Mueller. The theory of the formation of coalitions often uses the so-called Shapley value. On p.472-473 in Game theory and political theory situations are described, where the individuals can affect the status quo ν. This is for instance the case, when the model of Nash is applied in game theory. Then the transaction is interpreted as a certain combination of strategies s_n of the participants n in the game. And then the status quo is determined by a strategy s with a poor yield. When the individual n disposes of a s_n, which will hurt the other participant(s), then he can use this as a threat in order to increase his own yield. Then he prefers to fight. Even an independent arbiter will not ignore such an attitude. Ordeshook shows, that fighting based on asymmetrical strategies is even possible in a situation, which is symmetrical (anonymous) in the sense of the model of Nash. Each participant n tries to influence ν in such a manner, that his utility σ_n will be maximal. Then the status quo is endogenous. It can naturally be objected, that the model of Nash is not intended for such sitations. (back)
On p.290-296 and further in Playing fair (1994, The MIT Press) K. Binmore indeed applies a linear transform, which restricts the utility values to the interval [0, 1]. (back)
See p.150 in The economics of the trade union or p.207 and further in Rational-Choice-Theorie. (back)
See p.150 in The economics of the trade union, or p.209 in Rational-Choice-Theorie. (back)
The solution is found by means of the wellknown method of Lagrange. Rewrite the formula 2 as a logarithm. Then the Lagrangian is L = Σ_n=1² (γ_n × ln(u_n − ν_n) + λ × (π/2 − u_n)), where λ is the multiplier of Lagrange. The first-order conditions lead to γ_n / (u_n − ν_n) = λ and π = u₁ + u₂. These relations determine u=σ. Insertion and some rewriting yields σ_n = ν_n + (π − ν₁ − ν₂) × γ_n / (γ₁ + γ₂), which was to be proved. See p.150 in The economics of the trade union. (back)
See p. 151-152 in The economics of the trade union, or p.123-126 in Just playing. (back)
Perhaps some prefer the term payoff instead of utility, because the distribution of utility π requires an interpersonal comparison of utility. Another advantage of the payoff is, that can be studied how different risk aversions of the concerned individuals affect the distribution. See p.69-70 in Just playing.
On p.383-388 in Labor economics it is assumed, that after reaching an agreement about the distribution of π the transaction will be repeated in all subsequent periods (until eternity), with the agreed distribution. Your columnist does not see, why this assumption adds anything useful to the argument. (back)
Often it is desirable to dispose of various reference books, because sometimes the explanation is not clear. For instance, p.152 of The economics of the trade union states, that the studied problem is stationary. The formula 4 indeed does not take into account time. But it does matter who makes the first offer, and the total utility π decreases in value with time. Then the word stationary is rather confusing. And on p.125 in Just playing it is explained, that the individual must not offer less than the x_n of the formula 4, because the game must have just one (unique) subgame perfect equilibrium. Your columnist believes, that the concept of the subgame perfect equilibrium can be ignored. On p.431-432 in Labor economics it is also shown, that the height of the offer is unique. (back)
Here p.73 in Just playing is consulted. First the discrete periods Δt must be replaced by a continuous time variable t. Define t = τ×Δt. Then the discount factor for t equals δ_n^τ. Represent the corresponding discount rate by d(t). Now the rate of continuous discounting is r_n = lim_t→0 d(t)/t = lim_τ→0 (-1 + 1/δ_n^τ) / (τ×Δt). Rewrite this into -r_n×Δt = lim_τ→0 (1 − 1/δ_n^τ) / τ = lim_τ→0 (1 − exp(-τ × ln(δ_n))/τ. The rule of l'Hôpital is convenient for the competion of the proof. Your columnist has learned this rule more than forty years ago during a study in physics. See for instance p.531 in Introduction to the theory of statistics (1974, McGraw-Hill) by A.M. Mood, F.A. Graybill and D.C. Boes. The rule is: suppose there are functions f(τ) and g(τ), which satisfy lim_τ→a f(τ) = lim_τ→a g(τ) = 0. Then one has lim_τ→a f(τ) / g(τ) = lim_τ→a (∂f/∂τ) / (∂g/∂τ). So take f(τ) = 1 − exp(-τ × ln(δ_n)) and g(τ) = τ, then the rule states that -r_n×Δt = ln(δ_n). In other words, δ_n = exp(-r_n×Δt), which was to be proved. (back)
The substitution leads to x_n = (1 − exp(-r_m×Δt)) / (1 − exp(-(r₁+r₂) × Δt)). Take the limit Δt→0, and apply the rule of l'Hôpital (see previous footnote). For this choose f(Δt) = 1 − exp(-r_m×Δt) and g(Δt) = 1 − exp(-(r₁+r₂) × Δt). The result is x_n = r_m / (r₁+r₂), which was to be proved. See p.153 in The economics of the trade union or p.126 in Just playing. (back)
On p.128 in Just playing Binmore draws the model of Rubinstein in a utility field, just like the figure 1a in the column. As long as one still has Δt>0, there are two solutions on the curve of utility possibilities, for the cases that individual 1 or individual 2 makes the first offer. The two solutions merge for Δt→0. (back)
See p.129 in Just playing. (back)
See p.129-130 in Just playing. On p.171 the situation is considered, where the deadlock leads to costs c_n per unit of time τ = t/Δt. Then the deadlock becomes a burden, and the status quo shifts to negative values. (back)
Perhaps this formulation is a bit sloppy or confusing. On p.839 in Microeconomic theory a vector function σ = f(U, ν) is introduced, which gives the solution σ for U and ν. This function is also called a rule for finding the solution. In the main text your columnist refers to a real function, namely the scalar function, which must be maximized for finding the unique point σ on the boundary of U. (back)
See p.841 in Microeconomic theory. (back)
See p.842 in Microeconomic theory. (back)
The arguments are copied from p.842-843 in Microeconomic theory. On p.86 in Just playing the arguments are presented in a different order. Moreover, they are limited to the case N=2. The starting point in Just playing is the tangent to U with the Nash solution σ as its point of tangency. Binmore states on p.79-80, that the relation u₂ − σ₂ = σ₂ − ν₂ holds, where u₂ is the point on the tangent with u₁=ν₁. He does not prove this, but refers to one of his other books. On p.86 Binmore concludes, that the tangent to U in σ also defines the function f_U of the utilitarian solution. So σ is also the utilitarian solution. The slope of the tangent is -α₁/α₂. Then, due to the mentioned equality of distances, the vector σ − ν must have a slope α₁/α₂. This is to say, when the solution is determined with the proportional method, then the choice of the weighing factors α₁ and α₂ guarantees, that σ is also the proportional solution. Apparently then the solutions of Nash, utilitarianism, and the proportional method coincide, which was to be proved. The reader is warned, that on p.88 in Playing fair the coincidence of the three solutions is simply assumed, without proof. Conversely Just playing regularly refers to Playing fair. (back)
For, one has f_U = α · (u − ν). Take two arbitrary points v and w on the hyperplane f_U = W_σ, then one has α · (v − w) = 0. Apparently α is the normal of the hyperplane. (back)
Your columnist learned this statement forty years ago during his study of physics. On p.115 and 121 in Infinitesimaalrekening (1969, Uitgeverij Het Spectrum N.V.) by F. van der Blij and J. van Tiel this is shown for respectively the cases N=2 and N=3. A general argument must be approximately as follows. Let g_N(u, ν) = V_σ be the (N-1)-dimensional surface, which contains σ. Consider a point σ + Δu, which is also on the surface. Then one has g_N(σ, ν) = g_N(σ+Δu, ν). Then due to the formula 8 one has Σ_n=1^N ln(1 + Δu_n / (σ_n - ν_n)) = 0. Expand the formula in a Taylor series (p.29 in Infinitesimaalrekening). Then one has ln(1 + Δu_n / (σ_n - ν_n)) = Δu_n / (σ_n - ν_n) + O(Δu_n²), where O(Δu_n²) is a term of order Δu_n². It follows that Σ_n=1^N (Δu_n / (σ_n - ν_n) + O(Δu_n²)) = 0. Apparently one has Δu · grad(g_N) = 0 in σ, except for the term Σ_n=1^N O(Δu_n²). Assume that the vector Δu has an infinitesimally small length, so that this term can be ignored. Due to the infinitesimal length of Δu all points on this vector are in the considered surface g_N=V_σ. Then grad(g_N) is perpendicular to the surface, which was to be shown.
On p.843 in Microeconomic theory another argument is used, which is perhaps more elegant. However, here use is made of the fact, that g_N is a concave function, so that the argument becomes abstract (p.930-934). Your columnist believes, that his argument is more appealing to intuition. (back)
The social theory of Binmore, which will be explained in the following paragraph, is based on this statement. One wonders, whether Nash was already aware of this intriguing interpretation of his axioms. (back)
See p.87-88 in Playing fair or p.226 and 436 in Just playing. (back)
See p.88 in Playing fair. On p.253-258 in Just playing the advantages of utilitarianism are described again. However, utilitarianism jams, because the individuals will not be committed to such a social contract. In their real position they will try to renegotiate about the α_n, which seemed so just behind the veil. See the remainder of the text. (back)
See p.67 in Playing fair, and p.352-354 and 436-437 in Just playing. In the figure 4b the disproportional growth leads to a new status quo, which clearly requires another σ than those in the contract. In the mean time the individual 1 (or the group of individuals 1) has significantly improved his situation. When deliberations behind the veil would continue, then the individual would prefer another vector α of morals. Contracts can not be reinforced centrally. The contract is only practicable, as long as it can be reconciled with the personal interest. This guarantees the self-commitment. Another remark: on p.89 in Playing fair it is suggested, that the proportional growth path will even (provisionally) be maintained, when due to technological progress the set U expands. But the expansion will obviously change the Nash solution. And then the growth path would no longer lead to the utilitarian solution. The proportional morals would slow down the welfare. On p.258 and further in Just playing the disadvantages of utilitarianism are explained. A philosopher-king would be needed in order to dictate the utilitarian morals to the citizens (p.156). The philosopher-king is an ideal observer (representative person, p.152), who also listens to the will of the people. But in the modern democracy such an absolutistic ruler is no longer accepted. Now the proportional solution guarantees that the citizens do not desire a renegotiation. (back)
Mueller states on p.580 in Public choice III, that the social contract is highly based on an intrapersonal comparison. For, during the constitutional deliberations all individuals agree with the contract, and apparently believe that it is just. (back)
The hypothesis of Binmore, that utility can be compared interpersonally, is still rejected by many (traditional) economists. The books of Binmore give an accurate explanation of his social model. Unfortunately he has the habit of often jumping forward or backward in his argument. He also discusses various smaller models, where the consequences for his encompassing model of society are not always clear. Your columnist can only hope, that this column accurately presents the essence. The understanding must probably grow gradually by applying the model, and by combining and comparing it with similar models. Furthermore, Binmore in Just playing pays much attention to the mathematical formalism of Nash bargaining behind the veil. In their position behind the veil of ignorance the individuals must take into account all possible social positions, which they could have in reality. These mathematical considerations remain beyond the scope of the present column. (back)
See p.295 in An introduction to behavioral economics (2008, Palgrave MacMillan) by N. Wilkinson. (back)
See p.297 in An introduction to behavioral economics. (back)
Perhaps this remark does not do justice to Binmore, who devotes the whole paragraph 4.7 (p.454-470) in Just playing to the meaning of power. This deserves a further analysis. It is possible, that powerful individuals have exceptionally empathic preferences. The individual empathy remains conserved, even behind the veil. (back)
See Excellent onderhandelen (1993, Uitgeverij Contact) by R. Fisher, W. Ury and B. Patton. Already 23 years ago your columnist devoured books about negotiating, stimulated by an internal company training. It was and is a fascinating matter, but excellent bargaining can obviously not be learned from a book. So far as the books are American (and most of them are), they have been acquired at the American Book Center in Amsterdam. Your columnist must admit, that he would prefer to keep some of these bargaining experts out of his circle of friends. (back)
See Onderhandelen (1993, Uitgeverij Het Spectrum B.V.) by W.F.G. Mastenbroek. Broadly outlined Mastenbroek gives similar advices as Excellent onderhandelen. But the nuance is different. For instance, Mastenbroek on p.108 advises to devote 25% of the time for preparation to the search of alternatives (BATNA). (back)
Most textbooks about negotiating make this remark. See also p.23-25 in Effective negotiating (1995, Kogan Page Limited) by C. Robinson and p.24-26 in Smart negotiating (1993, Simon & Schuster Ltd) by J.C. Freund. On p.18 in Everything is negotiable (1993, Arrow Books Limited) by G. Kennedy one reads: "Negotiators expect to negotiate. They feel cheated if somebody does not recognize this". He even states (p.116): "Briefly, toughness pays!" H. Cohen states in You can negotiate anything (1982, Bantam Books), that a negotiation consists of the use of power, information and time, with the aim of dictating a certain behaviour. On p.19 in The art of negotiating (1984, Simon & Schuster Inc.) by G.I. Nierenberg one reads: "Negotiation isn't always neat. And it is often not nice". Further on (p.29): "Think of negotiation as a cooperative enterprise". (back)
On p.59 in Effective negotiating one reads: "The key requirement is patience". On p.45 in Smart negotiating: "Time pressure cuts across all other negotiating considerations". On p.81 in Don't be a chump! (1995, The Princeton Review Publishing Co., Ltd.) by N.R. Schaffzin: "Pressing the other side by using timing can be tremendously effective". On p.18 of Never take no for an answer (1995, Kogan Page Limited) by S. Le Poole: "Patience is an absolute requirement for a negotiator". On p.64 in The negotiating game (1994, HarperCollins Publishers, Inc.) by C.L. Karrass: "Time and patience are power". On p.98 in You can negotiate anything: "As a general rule, patience pays". (back)
On p.48 in Effective negotiating one reads: "Perhaps you have not revealed all or the other team has jumped to an unjustified conclusion of the facts available". On p.37 in Smart negotiating: "The information we glean in negotiations is often fragmentary and incomplete", and p.60: "Protect sensitive information". On p.102 in You can negotiate anything: "During the actual negotiating event it is often common strategy for one or both sides to conceal their true interest, needs and priorities". (back)
On p.72 in The negotiating game one reads: "It sometimes pays to be unreasonable and irrational in negotiation". Chapter 3 of The art of negotiating argues, that behaviour is often rational, when all circumstances are known. (back)
The win-win concept is explicitely recommended in chapter 4 of Effective negotiating, and in chapter 9 of You can negotiate anything. (back)
On p.58 in Never take no for an answer one reads: "The relative power of the negotiating parties depends on a large extent on how (un)attractive the option of not reaching agreement is to them". (back)
The importance of self-commitment is mentioned on p.79 in Succesvol onderhandelen ("[After its conclusion] the agreement must be carried into effect and rules must be made for checking the fulfilment, reporting and making corrections"). (back)
In chapter 4 of Succesvol onderhandelen the missile crisis in 1962 in Cuba is studied. On the other hand, The total negotiator (1994, Avon Books) by S.M. Pollan and M. Levine gives advices for renting a house, avoiding a traffic fine, or convincing a family member. (back)