This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

\DeclareDelimFormat

nameyeardelim,

Optimal Delegation in a Multidimensional World

Andreas Kleiner Department of Economics, Arizona State University. Email: [email protected].

We study a model of delegation in which a principal takes a multidimensional action and an agent has private information about a multidimensional state of the world. The principal can design any direct mechanism, including stochastic ones. We provide necessary and sufficient conditions for an arbitrary mechanism to maximize the principal’s expected payoff. We also discuss simple conditions which ensure that some convex delegation set is optimal. A key step of our analysis shows that a mechanism is incentive compatible if and only if its induced indirect utility is convex and lies below the agent’s first-best payoff.

1 Introduction

In many economic and political environments, a principal delegates decisions to a better-informed agent: a firm appoints a manager to choose investment levels in different projects; US Congress delegates power to federal agencies; a legislative forms a committee to draft bills; a regulator lets a monopolist choose prices. Following [18], an extensive literature models such delegation problems by assuming that both the action and the state of the world lie in a one-dimensional space. A main result of this literature characterizes when it is optimal for the principal to constrain the agent’s choice to lie in an interval, and this conclusion has been used to explain why managers face spending caps, regulators impose price ceilings, and trade agreements specify maximum tariff levels.

The assumption that the action and state space are one-dimensional is made for tractability. In many applications, the underlying states and actions are more complex and more realistically modeled as multidimensional: managers invest in several projects, Congress delegates many decisions to the EPA, and committees draft multiple bills. What mechanisms are optimal in such multidimensional settings? How robust are conclusions obtained for one-dimensional models? And can we still expect that relatively simple mechanisms are often optimal?

To study these questions, we consider a principal that takes a multidimensional action and faces an agent with private information about a multidimensional state of the world (the agent’s type). Payoffs depend on the action and the state of the world, and transfers are infeasible. The principal can design arbitrary mechanisms, including stochastic ones, to maximize her expected payoff. Our main result characterizes, for an arbitrary mechanism, when this mechanism is optimal. Often, it is optimal to delegate the decision to the agent but to constrain the agent by requiring that her action lies in some set. For convex delegation sets, we provide a simple characterization, which is a direct analog of conditions characterizing when interval delegation is optimal in one-dimensional models. Our results illustrate how a principal can benefit from optimally bundling independent decisions. Even for one-dimensional models, our approach provides new insights: Our main result characterizes for arbitrary mechanisms—not just interval delegation sets—when this mechanism is optimal. And as corollaries, we obtain a novel condition under which some interval delegation set will be optimal and a comparative statics result showing when the agent will get more discretion.

A key step to deriving our results lies in obtaining a simple characterization of the set of feasible mechanisms. Given a mechanism, the corresponding indirect utility assigns to any type the payoff this type would get by choosing his report optimally. This payoff must be less than the first-best payoff, i.e., the payoff this type would get if he could choose the action without any constraints. Moreover, the indirect utility must be a convex function since it is the maximum of a family of affine functions. Lemma 1 shows that any function satisfying these two properties is the indirect utility of an incentive-compatible mechanism.

This characterization is easy to use and already helpful for one-dimensional delegation models. Our formulation differs from the previous literature, which often considered only deterministic mechanisms. Since the convex combination of two incentive-compatible mechanisms is not necessarily incentive compatible, the set of deterministic mechanisms is not even convex.111Some earlier papers also consider stochastic mechanisms (or allow for money burning/restricted transfer) and obtain a convex set of mechanisms; see, for example, [2, 25, 5, 3, 21, 22]. Moreover, a common approach is to first treat the model as one with transfers and then impose that these transfers are zero (or negative). Compared to this approach, formulating the problem via indirect utilities is more direct and provides valuable geometric insights into which mechanisms can be optimal. For the multidimensional problem, the approach via indirect utilities provides additional benefits because it circumvents intricate characterizations of incentive compatibility [33, see].

To find the optimal mechanism, we formulate the principal’s problem in terms of indirect utilities. In this formulation, the problem becomes a linear program, and we use linear programming duality to derive necessary and sufficient conditions for a given mechanism to be optimal. Typically, optimal mechanisms pool certain types, and our main result shows that a mechanism is optimal if conditional on any pooling region, a stochastic dominance condition (using the convex order) is satisfied. Intuitively, this condition requires that restricted to the pooling region (where the indirect utility function is affine), any convex indirect utility yields a lower payoff. If the pooling regions are at most one-dimensional, the stochastic dominance condition has a simple formulation in terms of majorization. Using this observation, we provide necessary and sufficient conditions for a convex delegation set with a smooth boundary to be optimal. These conditions are easy to check and are straightforward extensions of conditions that ensure the optimality of interval delegation sets in one-dimensional models [2, see].

Related Literature

The literature on delegation has focused mainly on problems in which the principal delegates a single one-dimensional decision and therefore assumed that both the action and state spaces are one-dimensional; see, for example, [18, 19, 30, 1, 2, 24].

A few delegation papers do consider richer action and/or type spaces. [6] considers an agent with two-dimensional private information and discusses several applications. Since the principal’s action is assumed to be one-dimensional (and only interval delegation sets are considered), there is only limited scope to screen two-dimensional types in his analysis. [23] characterize the optimal mechanism in a setting where two decisions depend on a single-dimensional underlying state. [15] considers multidimensional information and actions but restricts the principal’s choice to a particular class of “budgeting mechanisms”. The closest paper to ours is [14], which studies the delegation of several independent decisions, which yield multidimensional action and state spaces. For quadratic preferences with a constant bias, he shows that if the states are independently and identically distributed according to normal distributions then it is optimal to delegate a ‘half space’. Without the normality assumption, he shows that the principal’s payoff from such a mechanism converges to the first-best as the number of independent decision problems grows. [13] also considers multidimensional delegation problems and characterizes the max-min optimal mechanism, which maximizes the principal’s payoff against the worst-case preference type of the agent.

The elicitation of information about multiple independent decisions from a biased agent has been studied in general mechanism design [20, e.g.,] and cheap talk environments [9, 26]. [20] show that by linking independent decisions, the principal’s payoff converges to the first-best as the number of decisions grows. Our results can be used to show how the principal should optimally link decisions, which can be important if there is a limited number of decisions.

On a methodological level, our work is related to the literature on multidimensional mechanism design, and in particular on multiproduct monopolists [33, 27, 28, 10, 16, see, e.g.,].

2 Model

A principal chooses an action ana\in\mathbb{R}^{n}. An agent is privately informed about the state of the world sSs\in S, where SnS\subseteq\mathbb{R}^{n} is compact and convex. The agent’s and principal’s payoffs depend on both the action and the state of the world, and are given by

uA(a,s)\displaystyle u_{A}(a,s) :=as+b(a)\displaystyle:=a\cdot s+b(a)
uP(a,s)\displaystyle u_{P}(a,s) :=ag(s)+κb(a),\displaystyle:=a\cdot g(s)+\kappa b(a),

respectively, where b:nb:\mathbb{R}^{n}\rightarrow\mathbb{R} is strictly concave, differentiable with a Lipschitz-continuous gradient mapping, and satisfies limab(a)a=\lim_{\lVert a\rVert\rightarrow\infty}\frac{b(a)}{\lVert a\rVert}=-\infty,222Since we do not artificially constrain the set of actions, some assumptions are needed to ensure that for every type sSs\in S there is an optimal action. The current assumptions ensure this and simplify some arguments, but weaker conditions on bb could be imposed. g:Sng:S\rightarrow\mathbb{R}^{n} is continuous, and κ>0\kappa>0.

We assume that the state ss is distributed according to a probability distribution FF with differentiable density ff and support SS. The principal aims to maximize her expected payoff and can design arbitrary mechanisms.

The revelation principle applies and we define a mechanism to be a function m:SΔ(n)m:S\rightarrow\Delta(\mathbb{R}^{n}) such that expected payoffs are integrable.333We denote by Δ(n)\Delta(\mathbb{R}^{n}) the Borel σ\sigma-algebra on n\mathbb{R}^{n}. To simplify notation, we extend the domain of b()b(\cdot) and ui(,s)u_{i}(\cdot,s) linearly to include probability distributions over n\mathbb{R}^{n}, so that b(m(s))=𝔼m(s)[b(a)]b(m(s))=\mathbb{E}_{m(s)}[b(a)] and analogously for ui(,s)u_{i}(\cdot,s). A mechanism is incentive compatible if for all ss and ss^{\prime} in SS,

uA(m(s),s)uA(m(s),s).\displaystyle u_{A}(m(s),s)\geq u_{A}(m(s^{\prime}),s).

3 Characterizing incentive-compatible mechanisms

We characterize the set of incentive-compatible mechanisms in terms of their indirect utilities. To any incentive-compatible mechanism mm corresponds an indirect utility U:nU:\mathbb{R}^{n}\rightarrow\mathbb{R} defined by

U(s):=supsS𝔼[m(s)]s+b(m(s)).U(s):=\sup_{s^{\prime}\in S}\mathbb{E}[m(s^{\prime})]\cdot s+b(m(s^{\prime})).

Which indirect utilities correspond to some incentive-compatible mechanism? First, any indirect utility is convex as the supremum of a family of functions that are affine in the state ss. Second, in the absence of transfers the agent’s utility cannot be higher than if he was free to choose his action. Defining the first-best payoff h:nh:\mathbb{R}^{n}\rightarrow\mathbb{R} by

h(s):=supanas+b(a),h(s):=\sup_{a\in\mathbb{R}^{n}}a\cdot s+b(a),

UhU\leq h is clearly necessary.444We denote the pointwise order by \leq, so UhU\leq h means U(s)h(s)U(s)\leq h(s) for all ss in the common domain of UU and hh. The following result shows that these two conditions characterize the set of feasible indirect utilities.

Lemma 1.

An indirect utility UU corresponds to an incentive-compatible mechanism if and only if UU is convex and lies below the first-best payoff: UhU\leq h.

Refer to caption
Figure 1: The function UU satisfies U(s)h(s)U(s)\leq h(s) for all sSs\in S but does not correspond to a feasible mechanism. To see this, note that there is no convex extension of UU to \mathbb{R} such that the extension lies below hh. Lemma 1 then implies that UU is not the indirect utility of any feasible mechanism.

Intuitively, if UU is convex then it would correspond to an incentive-compatible mechanism if transfers were available and the agent had quasi-linear preferences. If the required transfers are all negative then we can use the agent’s risk aversion (coming from the strict concavity of bb) to simulate these transfers via stochastic actions. One can show that UhU\leq h implies that the required transfers are negative. This last step relies on the domain of UU and hh being large enough and it would not suffice to require only that U(s)h(s)U(s)\leq h(s) for all sSs\in S. Figure 1 illustrates a convex function UU which lies below hh on all of SS, but which does not correspond to a mechanism because the lotteries assigned to low types would yield a strictly higher payoff than the first-best payoffs for some hypothetical types, an impossibility.

  • Proof.

    Let us first state three basic observations from convex analysis. The convex conjugate of a function UU is denoted by UU^{*} and defined by U(a):=supsnasU(s)U^{*}(a):=\sup_{s\in\mathbb{R}^{n}}a\cdot s-U(s). We will use the following facts, which follow immediately from this definition: (i) h=(b)h=(-b)^{*}, (ii) UhU\leq h implies hUh^{*}\leq U^{*}, and (iii) aU(s)a\in\partial U(s) implies U(a)=asU(s)U^{*}(a)=a\cdot s-U(s).555Here, U(s)\partial U(s) denotes the subdifferential of UU at ss. To see (iii), note that the definition of UU^{*} implies U(a)asU(s)U^{*}(a)\geq a\cdot s-U(s). Conversely, convexity of UU implies that for all ss^{\prime}, asU(s)asU(s)a\cdot s-U(s)\geq a\cdot s^{\prime}-U(s^{\prime}). Taking the supremum of the right-hand side with respect to ss^{\prime} yields asU(s)U(a)a\cdot s-U(s)\geq U^{*}(a).

    Suppose UU is convex and satisfies UhU\leq h. Let the mechanism mm assign to any type sSs\in S a lottery with expected value aU(s)a\in\partial U(s) that yields the payoff as+b(a)U(a)+h(a)a\cdot s+b(a)-U^{*}(a)+h^{*}(a). Such a lottery exists because as+b(a)a\cdot s+b(a) would be the payoff for type ss from always getting action aa, because fact (ii) implies that the agents payoff is lower, and because bb is strictly concave.666More formally, strict concavity of bb implies that for any ana\in\mathbb{R}^{n} and nonzero dnd\in\mathbb{R}^{n} there is ε>0\varepsilon>0 such that 1/2[b(a+d)+b(ad)]<b(a)ε1/2[b(a+d)+b(a-d)]<b(a)-\varepsilon. It follows that for any λ>1\lambda>1, 1/2[b(a+λd)+b(aλd)]b(a)λε.1/2[b(a+\lambda d)+b(a-\lambda d)]\leq b(a)-\lambda\varepsilon. Therefore, by choosing λ\lambda arbitrarily large, one can design lotteries with expected value aa that yield arbitrarily low payoff to the agent. Then facts (i) and (iii) imply that the payoff of a truthful type ss is U(s)U(s):

    uA(m(s),s)=sa+b(a)U(a)+h(a)=U(s).\displaystyle u_{A}(m(s),s)=s\cdot a+b(a)-U^{*}(a)+h^{*}(a)=U(s).

    It remains to show that mm is incentive compatible. For all ss and ss^{\prime},

    uA(m(s),s)=U(s)U(s)+𝔼[m(s)](ss)\displaystyle u_{A}(m(s),s)=U(s)\geq U(s^{\prime})+\mathbb{E}[m(s^{\prime})]\cdot(s-s^{\prime})
    =\displaystyle= 𝔼[m(s)]s+b(m(s))+𝔼[m(s)](ss)=uA(m(s),s),\displaystyle\mathbb{E}[m(s^{\prime})]\cdot s^{\prime}+b(m(s^{\prime}))+\mathbb{E}[m(s^{\prime})]\cdot(s-s^{\prime})=u_{A}(m(s^{\prime}),s),

    where the first inequality follows since E[m(s)]U(s)E[m(s)]\in\partial U(s^{\prime}). ∎

Figure 2 illustrates the result for one-dimensional types and quadratic payoffs. It shows four indirect utilities that, according to Lemma 1, correspond to incentive-compatible mechanisms. In 2(a), all types between s1s_{1} and s2s_{2} obtain their first-best utility and UU is affine below s1s_{1} and above s2s_{2}. This indirect utility can be obtained by letting types choose their preferred action from the interval of deterministic actions [s1,s2][s_{1},s_{2}]. In 2(b), the menu of actions from which the agent can choose contains an additional deterministic action above s2s_{2}. The indirect utility in 2(c) contains an affine piece that lies strictly below the graph of hh. This part of the indirect utility corresponds to types that obtain a (nondegenerate) stochastic action, which yields no type its first-best payoff. Finally, 2(d) illustrates an indirect utility corresponding to a mechanism in which types in two adjacent regions obtain a stochastic action.

Refer to caption
(a) Interval delegation
Refer to caption
(b) A deterministic mechanism

Refer to caption

(c) A stochastic mechanism
Refer to caption
(d) A stochastic mechanism with two adjacent stochastic actions
Figure 2: Examples of indirect utilities. The blue curves show the function hh for one-dimensional types and quadratic payoffs (i.e., assuming b(a)=a22b(a)=-\frac{a^{2}}{2}). The green curves show indirect utilities corresponding to incentive-compatible mechanisms.

4 Characterizing optimal mechanisms

We characterize the optimal mechanisms in this section. To do so, we first formulate the principal’s problem in terms of indirect utilities (Section 4.1). We then state the main characterization of optimal mechanisms in Section 4.2 and illustrate the result for particular mechanisms. Finally, we outline the proof of the main result in Section 4.3.

4.1 Formulating the principal’s problem

Consider an indirect utility UU that corresponds to some incentive-compatible mechanism. In general, there are many incentive-compatible mechanisms that induce the same indirect utility; however, all such mechanism induce the same payoff for the principal. To see this, let mm be an incentive-compatible mechanism with corresponding indirect utility UU. Using U(s)=𝔼[m(s)]\nabla U(s)=\mathbb{E}[m(s)] (by an Envelope theorem) and U(s)=U(s)s+b(m(s))U(s)=\nabla U(s)\cdot s+b(m(s)), the principal’s payoff from mechanism mm in state ss is completely determined by UU:

𝔼[m(s)]g(s)+κb(m(s))=U(s)[g(s)κs]+κU(s).\displaystyle\mathbb{E}[m(s)]\cdot g(s)+\kappa b(m(s))=\nabla U(s)\cdot[g(s)-\kappa s]+\kappa U(s).

This observation implies that the principal’s payoff is a linear function of UU. Therefore, a solution to the principal’s problem can be found at an extreme point of the feasible set. Returning to Figure 2, it is easy to see that the indirect utilities in Figures 2(a)2(c) are extremal in that they cannot be written as a nontrivial convex combination of two feasible indirect utilities. In contrast, the indirect utility in 2(d) can be written as such a convex combination. This implies that whenever this mechanism is optimal, there is another (and simpler) mechanism which is also optimal. Intuively, one can write this indirect utility as a convex combination because two adjacent regions obtain distinct stochastic actions. This insight shows how, without loss of optimality, one can restrict attention to a smaller class of mechanism.777[22] develop this point more formally in the context of one-dimensional types/actions and quadratic utilities and characterize the set of extremal mechanisms. Formulating the problem in terms of indirect utilities and using our Lemma 1, one can obtain this characterization more directly. It would be interesting to extend this characterization of extremal mechanisms to the multidimensional setting. For multidimensional settings, analogous arguments show that many complicated mechanisms are not extremal and therefore the principal need not consider these mechanisms.

As is standard in multidimensional mechanism design [[, see, for example,]]rochet1998ironing, we can use the divergence theorem to reformulate the objective function as follows:

[κU(s)+U(s)[g(s)κs]]dF(s)\displaystyle\int\big{[}\kappa U(s)+\nabla U(s)\cdot[g(s)-\kappa s]\big{]}\,\mathrm{d}F(s)
=\displaystyle= U(s)[κf(s)div[(g(s)κs)f(s)]]ds+bdSU(s)[g(s)κs]f(s)𝐧^S(s)d(s),\displaystyle\int U(s)\big{[}\kappa f(s)-\operatorname{div}[(g(s)-\kappa s)f(s)]\big{]}\,\mathrm{d}s+\int_{\operatorname{bd}S}U(s)[g(s)-\kappa s]f(s)\cdot\hat{\mathbf{n}}_{S}(s)\,\mathrm{d}\mathcal{H}(s),

where ÷\div denotes the divergence of a function, for any set AA, bdA\operatorname{bd}A denotes the boundary of AA, \mathcal{H} denotes the n1n-1-dimensional Hausdorff measure on the boundary of SS, and 𝐧^S(s)\hat{\mathbf{n}}_{S}(s) denotes the outward normal vector to the convex set SS at sbdSs\in\operatorname{bd}S.

This allows us to write the principal’s problem as888The existence of a maximizer follows from standard arguments.

maxU convexU(s)dμ(s)\displaystyle\max_{U\text{ convex}}\int U(s)\,\mathrm{d}\mu(s)
s. t. Uh,\displaystyle\text{s.\,t. }U\leq h,

where the measure μ\mu is defined by

μ(E)=Eν(s)dλ(s),\displaystyle\mu(E)=\int_{E}\nu(s)\,\mathrm{d}\lambda(s),

λ\lambda is the Lebesgue measure on SS plus the Hausdorff measure on the boundary of SS, and

ν(s):={κf(s)div[(g(s)κs)f(s)] if sintS[g(s)κs]f(s)𝐧^S(s) if sbdS.\displaystyle\nu(s):=\begin{cases}\kappa f(s)-\operatorname{div}[(g(s)-\kappa s)f(s)]&\text{ if }s\in\operatorname{int}S\\ [g(s)-\kappa s]f(s)\cdot\hat{\mathbf{n}}_{S}(s)&\text{ if }s\in\operatorname{bd}S.\end{cases}

Cearly, ν\nu plays an important role in determining which mechanisms are optimal. Heuristically, ν(s)\nu(s) measures how much the prinicipal’s payoff increases if the indirect utility of type ss is increased, but where types on the boundary get extra weight.

To illustrate ν\nu and for later use, let use compute ν\nu for a one-dimensional type space S=[s¯,s¯]S=[\underline{s},\overline{s}]:

ν(s):={κf(s)[g(s)κ]f(s)[g(s)κs]f(s) if s(s¯,s¯)[g(s¯)κs¯]f(s¯) if s=s¯[κs¯g(s¯)]f(s¯) if s=s¯.\displaystyle\nu(s):=\begin{cases}\kappa f(s)-[g^{\prime}(s)-\kappa]f(s)-[g(s)-\kappa s]f^{\prime}(s)&\text{ if }s\in(\underline{s},\overline{s})\\ [g(\overline{s})-\kappa\overline{s}]f(\overline{s})&\text{ if }s=\overline{s}\\ [\kappa\underline{s}-g(\underline{s})]f(\underline{s})&\text{ if }s=\underline{s}.\end{cases} (1)
Example 1.

Suppose S=[12,12]nS=[-\frac{1}{2},\frac{1}{2}]^{n} and FF is the uniform distribution on SS. Let us assume payoffs are quadratic and that g(s)=αsg(s)=\alpha s for some α(0,κ]\alpha\in(0,\kappa]; this implies that the principal is biased towards the ex-ante optimal action 0. In that case, ν\nu simplifies to

ν(s):={κ+(κα)n if sintS(ακ)s𝐧^S(s) if sbdS.\displaystyle\nu(s):=\begin{cases}\kappa+(\kappa-\alpha)n&\text{ if }s\in\operatorname{int}S\\ (\alpha-\kappa)s\cdot\hat{\mathbf{n}}_{S}(s)&\text{ if }s\in\operatorname{bd}S.\end{cases} (2)

4.2 Optimal mechanisms

Given an indirect utility UU, we let 𝒬\mathcal{Q} denote a coarsest partition of n\mathbb{R}^{n} such that UU is affine on each partition element. We denote by {μ|Q}Q𝒬\{\mu|_{Q}\}_{Q\in\mathcal{Q}} a conditional measure of μ\mu given QQ.

Theorem 1.

Let UU be a feasible indirect utility. Then UU is optimal if for all Q𝒬Q\in\mathcal{Q}, μ|Q(Q)0\mu|_{Q}(Q)\geq 0 and μ|QcxδQ\mu|_{Q}\leq_{cx}\delta_{Q}, where δQ\delta_{Q} is a point mass of mass μ|Q(Q)\mu|_{Q}(Q) at ss if there is sQs\in Q satisfying U(s)=h(s)U(s)=h(s) and δQ\delta_{Q} is the zero-measure otherwise.

Moreover, this condition is necessary for UU to be optimal if UU is differentiable |μ||\mu|-almost everywhere.

Two comments on the conditions in Theorem 1 are in order. First, if there is sQs\in Q such that U(s)=h(s)U(s)=h(s) then it is unique because UU is affine on QQ and hh is strictly convex. Second, for the necessity result, observe that UU is differentiable Lebesgue-almost everywhere since it is a convex function. Since |μ||\mu| is absolutely continuous with respect to the Lebesgue measure on the interior of SS, UU is differentiable |μ||\mu|-almost everywhere if, for example, the density ff is zero on the boundary of SS or if UU is differentiable \mathcal{H}-almost everywhere on the boundary of SS. In the one-dimensional case, this last condition can always be satisfied.

Why are the conditions in Theorem 1 sufficient for UU to be optimal? Consider a partition element Q𝒬Q\in\mathcal{Q} and suppose δQ\delta_{Q} is a point mass at ss^{*}. Then any feasible indirect utility VV will be convex and lie below UU at ss^{*}. Also, μ|QcxδQ\mu|_{Q}\leq_{cx}\delta_{Q} implies V(s)dμ|Q(s)V(s)dδQ(s)\int V(s)\,\mathrm{d}\mu|_{Q}(s)\leq\int V(s)\,\mathrm{d}\delta_{Q}(s). Moreover, if aa is an affine function that coincides with VV at the barycenter of μ|Q\mu|_{Q} then we get V(s)dμ|Q(s)a(s)dμ|Q(s)\int V(s)\,\mathrm{d}\mu|_{Q}(s)\leq\int a(s)\,\mathrm{d}\mu|_{Q}(s). Since UU restricted to QQ is affine, lies above VV at ss^{*}, and μ|Q(Q)0\mu|Q(Q)\geq 0, this implies that conditional on the type belonging to QQ, the principal’s expected payoff under UU is higher than under VV. And if δQ\delta_{Q} is the zero measure then VV might lie above UU but the same conclusion follows since μ|Q(Q)=0\mu|_{Q}(Q)=0 and conditional on QQ, adding a constant to the indirect utility does not change the principal’s payoff. The conclusion that these conditions are also essentially necessary shows that the problem can in some sense be decomposed: whenever the principal can improve UU conditional on QQ, she can extend this improved version to a feasible indirect utility that yields unconditionally a higher payoff.

A particularly simple mechanism is if the principal delegates the decision to the agent, potentially restricting the agents action to belong to some set AA. Note that any deterministic mechanism can be implemented as an indirect mechanism in this way. For a closed set ASA\subseteq S, we say that delegating to AA is optimal if an optimal mechanism takes the form that any type in AA gets her first-best action, and any other type gets her most preferred action among the first-best actions of types in AA. For example, if n=1n=1 and A=[s1,s2]A=[s_{1},s_{2}] then delegating to AA is optimal if there is an optimal mechanism in which any type below s1s_{1} gets the first-best action of type s1s_{1}, any type in [s1,s2][s_{1},s_{2}] gets her first best action, and any type above s2s_{2} gets the first-best action of type s2s_{2}. In the following we will specialize Theorem 1 and discuss under what conditions such a mechanism is optimal.

We can simplify the conditions in Theorem 1 by recalling that the convex order has a simple structure for one-dimensional spaces. A cdf H1H_{1} on a one-dimensional interval [x,y][x,y] dominates a cdf H2H_{2} in the convex order if and only if H2H_{2} majorizes H1H_{1}:

syH1(z)dzsyH2(z)dz\int_{s}^{y}H_{1}(z)\,\mathrm{d}z\leq\int_{s}^{y}H_{2}(z)\,\mathrm{d}z

for all s[x,y]s\in[x,y] with equality for s=xs=x [[]Theorem 3.A.1]shaked2007stochastic. This observation simplifies the characterization in Theorem 1 whenever UU is affine on at most one-dimensional sets. As we will see, this is useful even if the type space is multidimensional. To illustrate the simpler conditions, we first consider when interval delegation is optimal with one-dimensional types [[, for earlier characterizations, see]]AM:08,AB:13.

Corollary 1.

Suppose n=1n=1 and s1,s2Ss_{1},s_{2}\in S with s1<s2s_{1}<s_{2}. Delegating to the interval [s1,s2][s_{1},s_{2}] is optimal if and only if

  1. (i)

    ν(s)0\nu(s)\geq 0 for all s[s1,s2]s\in[s_{1},s_{2}],

  2. (ii)

    ss¯(xs)ν(x)dλ(x|xs2)0\int_{s}^{\overline{s}}(x-s)\nu(x)\,\mathrm{d}\lambda(x|x\geq s_{2})\leq 0 for all ss2s\geq s_{2} with equality for s=s2s=s_{2}, and

  3. (iii)

    s¯s(sx)ν(x)dλ(x|xs1)0\int_{\underline{s}}^{{s}}(s-x)\nu(x)\,\mathrm{d}\lambda(x|x\leq s_{1})\leq 0 for all ss1s\leq s_{1} with equality for s=s1s=s_{1}.

  • Proof.

    Note that the partition 𝒬\mathcal{Q} induced by UU has elements [s¯,s1][\underline{s},s_{1}], [s2,s¯][s_{2},\overline{s}], and {s}\{s\} for all s(s1,s2)s\in(s_{1},s_{2}). For all s(s1,s2)s\in(s_{1},s_{2}), ν(s)0\nu(s)\geq 0 is equivalent to μ|Q(Q)0\mu|_{Q}(Q)\geq 0 and μ|QcxδQ\mu|_{Q}\leq_{cx}\delta_{Q} for Q={s}Q=\{s\}.999For s{s1,s2}s\in\{s_{1},s_{2}\}, if sintSs\in\operatorname{int}S then ν(s)0\nu(s)\geq 0 follows because ν\nu is continuous on the interior of SS. And if sbdSs\in\operatorname{bd}S, there is Q𝒬Q\in\mathcal{Q} with QS={s}Q\cap S=\{s\} and hence μ|Q(Q)0\mu|_{Q}(Q)\geq 0 implies ν(s)0\nu(s)\geq 0. Now consider Q=[s2,s¯]Q=[s_{2},\overline{s}] and let λ(x|xs2)\lambda(x|x\geq s_{2}) denote the conditional distribution of λ\lambda conditional on xs2x\geq s_{2}. Since δQ\delta_{Q} is a point mass of mass μ|Q(Q)\mu|_{Q}(Q) at s2s_{2}, we can use majorization to rewrite μ|QcxδQ\mu|_{Q}\leq_{cx}\delta_{Q} as

    ss¯xs¯ν(z)dλ(z|zs2)dx0\int_{s}^{\overline{s}}\int_{x}^{\overline{s}}\nu(z)\,\mathrm{d}\lambda(z|z\geq s_{2})\,\mathrm{d}x\leq 0

    for all ss2s\geq s_{2} with equality for s=s2s=s_{2}. Integrating by parts, this becomes condition (ii). Moreover, since the derivative with respect to ss of the left-hand side evaluated at s2s_{2} is negative, we obtain μ|Q(Q)0\mu|_{Q}(Q)\geq 0. The argument for Q=[s¯,s1]Q=[\underline{s},s_{1}] is analogous. ∎

Refer to caption
Figure 3: Optimality of interval delegation

Figure 3 illustrates condition (ii). Suppose that starting with interval delegation (represented by the solid indirect utility), the principal changes the mechanism and assigns a lottery with expected value strictly above U(s2)\nabla U(s_{2}) to all types above ss. This tilts the indirect utility starting at ss upwards (see the dashed indirect utility) and therefore increases the indirect utility for every type xsx\geq s in proportion to xsx-s. The change in the principal’s expected payoff is therefore proportional to ss¯(xs)ν(x)dλ(x|xs2)\int_{s}^{\overline{s}}(x-s)\nu(x)\,\mathrm{d}\lambda(x|x\geq s_{2}). Consequently, condition (ii) ensures that such changes are not profitable. Equality for s=s2s=s_{2} implies, in addition, that it would not be profitable to marginally reduce the action for all types above s2s_{2} either.

Interestingly, the conditions identified in Corollary 1 are in our setting equivalent to the ones obtained in [2, Proposition 2a ]. This might initially be surprising since we characterize optimality of interval delegation in the class of stochastic mechanisms and [2] characterize optimality in the class of deterministic mechanisms (and stochastic mechanisms can do strictly better in general). Figure 3 illustrates why the conditions are the same: Suppose the principal strictly benefits from deviating to the dashed indirect utility, which represents a stochastic mechanism. Since her payoff is linear in UU, the arguments in the previous paragraph imply that she also benefits from deviating to the dotted indirect utility. Since the dotted linear utility corresponds to a deterministic mechanism, we conclude that conditions (ii) in (iii) in Corollary 1 are necessary for interval delegation to be optimal in the class of deterministic mechanisms (and necessity of condition (i) can be shown easily). Later, it will become clear that this equivalence is specific to the one-dimensional setting.

Corollary 2.

If n=1n=1 and {sS:ν(s)0}\{s\in S:\nu(s)\geq 0\} is an interval, then delegating to an interval is optimal.

The key insight for this result is that any pooling region (i.e., any QQ such that QSQ\cap S is not a singleton) must contain types ss with ν(s)0\nu(s)\geq 0 (since μ|Q(Q)0\mu|_{Q}(Q)\geq 0) and types ss with ν(s)0\nu(s)\leq 0 (since no point measure δQ\delta_{Q} can dominate a distinct positive measure in the convex order). If ν\nu is positive on an interval, it follows that there can be at most two pooling regions. A simple argument then shows that delegating to an interval is an optimal mechanism.

Corollary 2 extends Proposition 2(a) in [4], which in our notation requires ν\nu to be positive on (s¯,s¯)(\underline{s},\overline{s}). An simple implication of our result is the following, which can be useful for applications.

Corollary 3.

Suppose the type space is one-dimensional (i.e., n=1n=1), κ=1\kappa=1, and the agent has a constant bias (i.e., g(s)=s+βg(s)=s+\beta for some β\beta\in\mathbb{R}). If ff is logconcave then delegating to an interval is optimal.

As another illustration, let us return to Example 1 specializing to a one-dimensional type space.

Example 1 (continued).

For n=1n=1 and κ=1\kappa=1, the objective function simplifies to ν(s)=2κα\nu(s)=2\kappa-\alpha for s(s¯,s¯)s\in(\underline{s},\overline{s}), ν(s¯)=(κα)s¯\nu(\underline{s})=(\kappa-\alpha)\underline{s}, and ν(s¯)=(ακ)s¯\nu(\overline{s})=(\alpha-\kappa)\overline{s}. Since ν\nu is positive on an interval, Corollary 2 implies that delegating to an interval is optimal, and it only remains to find the best interval.

The optimal interval must satisfy Condition (ii) as an equality for s=s2s=s_{2}, which requires

(2α)[1812s22s2(12s2)]=0,(2-\alpha)\left[\frac{1}{8}-\frac{1}{2}s_{2}^{2}-s_{2}\left(\frac{1}{2}-s_{2}\right)\right]=0,

and simple algebra yields s2=α2αs_{2}=\frac{\alpha}{2-\alpha}. Using symmetry, it follows that it is optimal to delegate to the interval [α2α,α2α]\left[-\frac{\alpha}{2-\alpha},\frac{\alpha}{2-\alpha}\right].

For a one-dimensional type space, the approach used in Corollary 1 can be used to simplify the conditions in Theorem 1 for any mechanism, not just interval delegation. More generally, this approach is useful even with multidimensional types. To see this, let AA be a closed and convex set and, for sbdAs\in\operatorname{bd}A, let NA(s)N_{A}(s) denote the normal cone to AA at ss. With quadratic payoffs, if the principal delegates AA and sbdAs\in\operatorname{bd}A, then all types in s+NA(s)s+N_{A}(s) will choose action ss. Moreover, if the boundary of AA is differentiable then NA(s)N_{A}(s) is a (one-dimensional) ray and we can again use majorization to simplify the convex dominance conditions in Theorem 1.

Refer to caption
Figure 4: Indirect utility for delegation to a convex set.
Corollary 4.

Suppose payoffs are quadratic and ASA\subseteq S is closed, convex, has nonempty interior and a differentiable boundary. Delegating to AA is optimal if and only if

  1. (i)

    ν(s)0\nu(s)\geq 0 for all sAs\in A and

  2. (ii)

    for all sbdAs\in\operatorname{bd}A and z>0z>0,

    z(xz)ν(s+x𝐧^A(s))dλ(s+x𝐧^A(s)|s+NA(s))0\int_{z}^{\infty}(x-z)\nu(s+x\hat{\mathbf{n}}_{A}(s))\,\mathrm{d}\lambda(s+x\hat{\mathbf{n}}_{A}(s)|s+N_{A}(s))\leq 0

    with equality for z=0z=0.

The conditions in Corollary 4 closely resemble those in Corollary 1. Indeed, Condition (i) in either case requires that ν\nu is positive on the set of types that obtain their first-best payoffs, and Condition (ii) (and Conditions (ii) and (iii), respectively) impose that for each point on the boundary the analogous stochastic dominance condition holds.

The economic interpretation of Condition (ii) is analogous to how we interpreted Condition (ii) in Corollary 1. This condition ensures that the principal does not benefit from marginally tilting the indirect utility along line segments that are orthogonal to the boundary of AA, e.g., the solid line segment in Figure 4. Observe that there is a stochastic mechanism in which the indirect utility is increased only in a small neighborhood of the solid line segment (by Lemma 1). On the other hand, there is no deterministic mechanism achieving this because for any deterministic action the indirect utility would have to increase significantly along the solid line segment (in order to reach the first-best payoff for some type) and convexity then requires that all types in a neighborhood of the line segment obtain higher indirect utilities. This indicates that our characterization relies in the multidimensional setting on stochastic mechanisms being feasible.

Example 1 (continued).

Consider a two-dimensional example and recall that FF is the uniform distribution and g(s)=αsg(s)=\alpha s for some α[0,κ)\alpha\in[0,\kappa). We assume quadratic payoffs; in that case, the problem is separable across dimensions: the principal’s optimal action in dimension 1 depends only on the first component of the state and is independent of the action in dimension 2.

Suppose first that there are two agents: For i=1,2i=1,2, agent ii has private information about sis_{i} (but not sjs_{j} for jij\neq i) and cares only about the action and state in dimension ii. It follows that the principal faces two independent delegation problems, and our earlier analysis implies that it is optimal to let each agent choose any action in [α2α,α2α]\left[-\frac{\alpha}{2-\alpha},\frac{\alpha}{2-\alpha}\right]. In effect, the agents’ choice will be the action in the red square in Figure 5 that is closed to the realized state.

Now compare this to the situation where there is only one agent. This agent has private information about both dimensions of the state and cares about both dimensions of the action. How can the principal improve her expected payoff? Intuitively, she could offer the agent to take more extreme actions in one dimension if he moderates his action in the other dimension. How can the principal optimally bundle the two decision problems?

Corollary 4 provides insights into how to solve the problem: if one can find a AA satisfying the conditions stated there, delegating to this set will be an optimal mechanism. Since ν\nu is positive on the interior of SS and strictly negative on the boundary of SS, Condition (i) will be satisfied if AintSA\subseteq\operatorname{int}S and Condition (ii) will be satisfied if, for every sbdAs\in\operatorname{bd}A, equality holds in Condition (ii) for z=0z=0. This yields a second-order differential equation, whose solution describes the boundary of the optimal delegation set, see the blue curve in Figure 5 for an illustration.

Refer to caption
Figure 5: Optimal bundling

4.3 Proof Sketch

To proof Theorem 1, we use duality in linear programming. To formulate the dual program, it is more convenient to work with indirect utilities that are defined on a compact domain. But recall that it is not enough to only require that U(s)h(s)U(s)\leq h(s) for all sSs\in S. The following technical result ensures that we can restrict the indirect utilities to have a compact domain as long as this domain is chosen large enough.

Lemma 2.

There is a compact XnX\subseteq\mathbb{R}^{n} such that the principal’s problem can be written as max{Udμ|U:X,U convex,Uh}\max\{\int U\,\mathrm{d}\mu|U:X\rightarrow\mathbb{R},\ U\text{ convex},\ U\leq h\}.

Formally, we show that if XX is chosen large enough then for any solution to the above problem there is a corresponding solution to the original problem. For a convex function UU defined on SS, we consider the smallest convex function defined on n\mathbb{R}^{n} that extends UU. If this extension lies below hh on a large set XX then h(y)<U(y)h(y)<U(y) for some yy is possible only if U(s)\lVert\nabla U(s)\rVert is large for some sSs\in S, i.e., the expected action for some type is large. We show that this implies that the principal’s expected payoff is low, contradicting that UU is a solution.

Now let XX be as in the above lemma and denote by 𝒰\mathcal{U} the set of convex continuous functions that map XX to \mathbb{R} and by +\mathcal{M}_{+} the set of positive measures on XX. We can formulate the principal’s problem as follows (and call this formulation the primal problem):

maxU𝒰U(s)dμ(s)\displaystyle\max_{U\in\mathcal{U}}\int U(s)\,\mathrm{d}\mu(s) (P)
s. t. Uh\displaystyle\text{s.\,t. }U\leq h

The dual problem

We will show that the following problem is the dual problem:

infγ+h(s)dγ(s)\displaystyle\inf_{\gamma\in\mathcal{M}_{+}}\int h(s)\,\mathrm{d}\gamma(s) (D)
s. t. γcxμ,\displaystyle\text{s.\,t. }\gamma\geq_{cx}\mu,

where cx\geq_{cx} denotes the convex order on the space of measures.

Note that hh is a convex function; therefore, if μ\mu was a positive measure, this would be a trivial problem with solution γ=μ\gamma=\mu. However, since μ\mu is a signed measure and γ\gamma has to be a positive measure, μ\mu is not feasible in general.

It is easy to see that weak duality holds, that is, the value of the primal problem (P) is always below the value of the dual problem (D). Indeed, for any feasible UU and γ\gamma,

U(s)dμ(s)(i)U(s)dγ(s)(ii)h(s)dγ(s)\displaystyle\int U(s)\,\mathrm{d}\mu(s)\underbrace{\leq}_{(i)}\int U(s)\,\mathrm{d}\gamma(s)\underbrace{\leq}_{(ii)}\int h(s)\,\mathrm{d}\gamma(s) (3)

since (i) UU is convex and μcxγ\mu\leq_{cx}\gamma and (ii) γ\gamma is a positive measure and UhU\leq h. The following result shows that strong duality holds, that is the optimal values of both problems are equal and the dual problem has a solution.

Lemma 3 (Strong duality).

A feasible mechanism UU is optimal if and only if there exists a positive measure γcxμ\gamma\geq_{cx}\mu such that

U(s)\displaystyle U(s) =h(s) for γ-almost every v\displaystyle=h(s)\text{ for $\gamma$-almost every\ $v$ } (4)
U(s)dμ(s)\displaystyle\int U(s)\,\mathrm{d}\mu(s) =U(s)dγ(s).\displaystyle=\int U(s)\,\mathrm{d}\gamma(s). (5)

This result is an analogue of a result in the revenue-maximization problem of a multiproduct monopolist [[, see Theorem 2 in]]daskalakis-etal2017. Our formulation of the delegation problem allows us to easily deduce strong duality. Note that there is a convex function UU such that h(x)U(x)>0h(x)-U(x)>0 for all xXx\in X. Therefore, Slater’s condition is satisfied and standard results from linear programming imply that the dual problem has a solution and that the optimal solutions of the primal and dual problems achieve the same value. Since both inequalities in (3) have to hold as equalities, Lemma 3 follows.

Proof idea for Theorem 1.

It is easy to show that the conditions in Theorem 1 imply that UU is optimal: by aggregating the measures δQ\delta_{Q}, one obtains a positive measure γ\gamma satisfying the complementary slackness conditions (4) and (5) and γcxμ\gamma\geq_{cx}\mu. Lemma 3 then implies that UU is optimal.

For the converse direction, suppose UU is optimal. By Lemma 3, there is a positive measure γ\gamma such that the complementary slackness conditions (4) and (5) hold and γcxμ\gamma\geq_{cx}\mu. Letting μ+\mu^{+} (μ\mu^{-}) denote the positive (negative) part of μ\mu, this last condition is equivalent to γ+μcxμ+\gamma+\mu^{-}\geq_{cx}\mu^{+}. Strassen’s theorem then implies that γ+μ\gamma+\mu^{-} is a mean-preserving spread of μ+\mu^{+}: one can obtain the measure γ+μ\gamma+\mu^{-} by taking, for every ss, the mass μ+\mu^{+} puts on ss and spreading it according to a probability measure DsD_{s} with expected value ss. Since UU is convex, Jensen’s inequality implies that U(s)U(x)dDs(x)U(s)\leq\int U(x)\,\mathrm{d}D_{s}(x) and equality holds only if UU is affine on the convex hull of the support of DsD_{s}. Since equality must hold by (5), we obtain that for all Q𝒬Q\in\mathcal{Q} and sQs\in Q, the support of DsD_{s} is contained in the closure of QQ. To simplify this informal discussion, suppose that for all Q𝒬Q\in\mathcal{Q} and sQs\in Q, the support of DsD_{s} is actually contained in QQ (and not just the closure of QQ) and consider a partition element QQ of positive measure. Then the conditional measure γ|Q\gamma|_{Q} is positive (since γ\gamma is positive) and satisfies γ|Q+μ|Qcxμ+|Q\gamma|_{Q}+\mu^{-}|_{Q}\geq_{cx}\mu^{+}|_{Q} (since the left-hand side is a mean-preserving spread of the right-hand side). Moreover, by (5) we get U(s)=h(s)U(s)=h(s) for every ss in the support of γ|Q\gamma|_{Q}. Since hh is strictly convex and UU is affine on QQ, there is at most one sQs\in Q with U(s)=h(s)U(s)=h(s) and therefore γ|Q\gamma|_{Q} is a point mass at this ss or the zero measure. It follows that μ|QcxδQ\mu|_{Q}\leq_{cx}\delta_{Q}, where δQ\delta_{Q} is a point mass at ss or is the zero measure. The proof in the Appendix follows this sketch but uses the additional assumption in Theorem 1 and additional arguments to deal with the case where the support of DsD_{s} is a subset of the closure of QQ but not a subset of QQ.101010If ss lies in the closure of QQ and QQQ^{\prime}\neq Q, then UU is not differentiable at ss and, therefore, U(s)<h(s)U(s)<h(s). If follows from (4) that such points have measure zero under γ\gamma. The additional assumption ensures that such points also have measure zero under μ+\mu^{+} and μ\mu^{-}, and hence play no role.

References

  • [1] Ricardo Alonso and Niko Matouschek “Optimal Delegation” In Review of Economic Studies 75.1, 2008, pp. 259–293.
  • [2] Manuel Amador and Kyle Bagwell “The Theory of Optimal Delegation with an Application to Tariff Caps” In Econometrica 81.4, 2013, pp. 1541–1599
  • [3] Manuel Amador and Kyle Bagwell “Money burning in the theory of delegation” In Games and Economic Behavior 121, 2020, pp. 382–412 DOI: https://doi.org/10.1016/j.geb.2020.02.010
  • [4] Manuel Amador, Kyle Bagwell and Alex Frankel “A Note on Interval Delegation” In Economic Theory Bulletin 6.2, 2018, pp. 239–249 DOI: 10.1007/s40505-017-0133-4
  • [5] Attila Ambrus and Georgy Egorov “Delegation and Nonmonetary Incentives” In Journal of Economic Theory 171, 2017, pp. 101–135
  • [6] Mark Armstrong “Delegating Decision-Making to an Agent with Unknown Preferences” unpublished, 1995
  • [7] Vladimir Igorevich Bogachev “Measure theory” Springer, 2007
  • [8] Vladimir Igorevich Bogachev “Measure theory” Springer, 2007
  • [9] Archishman Chakraborty and Rick Harbaugh “Comparative cheap talk” In Journal of Economic Theory 132.1 Elsevier, 2007, pp. 70–94
  • [10] Constantinos Daskalakis, Alan Deckelbaum and Christos Tzamos “Strong Duality for a Multiple-Good Monopolist” In Econometrica 85.3 Blackwell Publishing Ltd, 2017, pp. 735–767 DOI: 10.3982/ECTA12618
  • [11] F Dragomirescu and C Ivan “The smallest convex extensions of a convex function” In Optimization 24.3-4 Taylor & Francis, 1992, pp. 193–206
  • [12] Nelson Dunford and Jacob T. Schwartz “Linear Operators, Part 1: General Theory” John WileySons, 1988
  • [13] Alex Frankel “Aligned delegation” In American Economic Review 104.1, 2014, pp. 66–83
  • [14] Alex Frankel “Delegating multiple decisions” In American Economic Journal: Microeconomics 8.4, 2016, pp. 16–53
  • [15] Simone Galperti “A theory of personal budgeting” In Theoretical Economics 14.1 Wiley Online Library, 2019, pp. 173–210
  • [16] Nima Haghpanah and Jason Hartline “When is pure bundling optimal?” In The Review of Economic Studies 88.3 Oxford University Press, 2021, pp. 1127–1156
  • [17] Jean-Baptiste Hiriart-Urruty and Claude Lemaréchal “Fundamentals of convex analysis” Springer Science & Business Media, 2004
  • [18] Bengt Holmström “On Incentives and Control in Organizations” Ph.D. Thesis, Stanford University, 1977
  • [19] Bengt Holmström “On the Theory of Delegation” In Bayesian Models in Economic Theory North Holland, 1984, pp. 115–141
  • [20] Matthew O Jackson and Hugo F Sonnenschein “Overcoming incentive constraints by linking decisions” In Econometrica 75.1 Wiley Online Library, 2007, pp. 241–257
  • [21] Navin Kartik, Andreas Kleiner and Richard Van Weelden “Delegation in veto bargaining” In American Economic Review 111.12, 2021, pp. 4046–87
  • [22] Andreas Kleiner, Benny Moldovanu and Philipp Strack “Extreme points and majorization: Economic applications” In Econometrica 89.4 Wiley Online Library, 2021, pp. 1557–1593
  • [23] Frédéric Koessler and David Martimort “Optimal delegation with multi-dimensional decisions” In Journal of Economic Theory 147.5 Elsevier, 2012, pp. 1850–1881
  • [24] Anton Kolotilin and Andriy Zapechelnyuk “Persuasion meets Delegation” unpublished, 2019
  • [25] Eugen Kováč and Tymofiy Mylovanov “Stochastic Mechanisms in Settings Without Monetary Transfers: The Regular Case” In Journal of Economic Theory 144.4, 2009, pp. 1373–1395
  • [26] Elliot Lipnowski and Doron Ravid “Cheap talk with transparent motives” In Econometrica 88.4 Wiley Online Library, 2020, pp. 1631–1660
  • [27] Alejandro M Manelli and Daniel R Vincent “Bundling as an optimal selling mechanism for a multiple-good monopolist” In Journal of Economic Theory 127.1 Elsevier, 2006, pp. 1–35
  • [28] Alejandro M Manelli and Daniel R Vincent “Multidimensional mechanism design: Revenue maximization and the multiple-good monopoly” In Journal of Economic theory 137.1 Elsevier, 2007, pp. 153–185
  • [29] Albert W Marshall, Ingram Olkin and Barry C Arnold “Inequalities: theory of majorization and its applications” Springer, 2010
  • [30] Nahum Melumad and Toshiyuki Shibano “Communication in Settings with No Transfers” In RAND Journal of Economics 22.2, 1991, pp. 173–198
  • [31] Robert R Phelps “Lectures on Choquet’s theorem” Springer, 2001
  • [32] Chris Preston “Some Notes on Standard Borel and Related Spaces” arXiv, 2008 DOI: 10.48550/ARXIV.0809.3066
  • [33] Jean-Charles Rochet “A necessary and sufficient condition for rationalizability in a quasi-linear context” In Journal of Mathematical Economics 16.2 Elsevier, 1987, pp. 191–200
  • [34] Jean-Charles Rochet and Philippe Choné “Ironing, sweeping, and multidimensional screening” In Econometrica JSTOR, 1998, pp. 783–826
  • [35] Moshe Shaked and J George Shanthikumar “Stochastic orders” Springer, 2007
  • [36] Alexander Shapiro “On duality theory of conic linear problems” In Semi-Infinite Programming: Recent Advances Springer, U.S.A., 2010

Appendix A Omitted Proofs

  • Proof of Lemma 2.

    Let BrB_{r} denote a ball of radius rr around 0 and let UU be a solution to max{Udμ|U:Br,U convex,Uh}\max\{\int U\,\mathrm{d}\mu|U:B_{r}\rightarrow\mathbb{R},\ U\text{ convex},\ U\leq h\}. We will show that UU can be extended to a solution to the principal’s original problem. Let U~\tilde{U} denote the smallest convex extension to n\mathbb{R}^{n} of the restriction of UU to SS [[, see]]dragomirescu92. If U~\tilde{U} is not feasible for the original problem then there is yBry\not\in B_{r} such that U~(y)>h(y)\tilde{U}(y)>h(y) and there is sSs\in S such that U~(y)=U(s)+U(s)(ys)\tilde{U}(y)=U(s)+\nabla U(s)\cdot(y-s) (since U~\tilde{U} is the smallest convex extension). Using strong convexity of hh (which follows since bb has Lipschitz-continuous gradients, see Theorem E.4.2.2 in [17]), one can show that U(s)<h(s)z(r)U(s)<h(s)-z(r), where z(r)z(r)\rightarrow\infty as rr\rightarrow\infty.111111 Let cc^{\prime} denote modulus of convexity of hh. Then, for all yBry\in B_{r} that lie on the line segment from ss to x, and all t(hU~)(y)t\in\partial(h-\tilde{U})(y), h(s)U~(s)[h(y)U~(y)]+t(sy)+c2ys2.h(s)-\tilde{U}(s)\geq[h(y)-\tilde{U}(y)]+t\cdot(s-y)+\frac{c^{\prime}}{2}\lVert y-s\rVert^{2}. Since U~(s)=U(s)\tilde{U}(s)=U(s) and the first two terms of the RHS are positive, the claim follows. Then either U(s)h(s)z(r)/2U(s^{\prime})\leq h(s)-z(r)/2 for all sSs^{\prime}\in S or, on a set of positive Lebesgue-measure, U(s)Br/c\nabla U(s^{\prime})\not\in B_{r/c} for some constant c>0c>0 independent of r,sr,s and ss^{\prime}. Since limab(a)=\lim_{\lVert a\rVert\rightarrow\infty}b(a)=-\infty by assumption, this implies that in either case for rr large enough, the principals payoff from UU will be less than her payoff from taking the ex-ante optimal action. This contradicts our assumption that UU was optimal. Hence, any solution can be extended to a solution of the original problem. ∎

  • Proof of Lemma 3.

    Let 𝒞(X)\mathcal{C}(X) denote the vector space of continuous functions on XX with the supremum norm and recall that its dual space is the space of (Radon) measures on XX, which we denote by (X)\mathcal{M}(X). Let 𝒱:={g𝒞(X):xV,g(x)0}\mathcal{V}:=\{g\in\mathcal{C}(X):\forall x\in V,g(x)\geq 0\}; the polar cones of 𝒰\mathcal{U} and 𝒱\mathcal{V} are defined by

    𝒰\displaystyle\mathcal{U}^{*} :={γ(X):U𝒰,Udγ0}\displaystyle:=\{\gamma\in\mathcal{M}(X):\forall U\in\mathcal{U},\int U\,\mathrm{d}\gamma\geq 0\}
    𝒱\displaystyle\mathcal{V}^{*} :={γ(X):g𝒱,gdγ0}.\displaystyle:=\{\gamma\in\mathcal{M}(X):\forall g\in\mathcal{V},\int g\,\mathrm{d}\gamma\geq 0\}.

    The principal’s problem can be written as maxU𝒰Udμ\max_{U\in\mathcal{U}}\int U\,\mathrm{d}\mu subject to hU𝒱h-U\in\mathcal{V}. This is a conical linear program and its dual is infγ𝒱hdγ\inf_{\gamma\in\mathcal{V}^{*}}\int h\,\mathrm{d}\gamma subject to μγU\mu-\gamma\in U^{*} [[, e.g.,]]shapiro10. Since 𝒱=+(X)\mathcal{V}^{*}=\mathcal{M}_{+}(X) by the Riesz representation theorem [[]p. 65]dunford-schwartz88a and μγU\mu-\gamma\in U^{*} is equivalent to μcxγ\mu\geq_{cx}\gamma, (D) is the dual problem.

    Since there is U𝒰U\in\mathcal{U} such that hUh-U is in the interior of 𝒱\mathcal{V}, Slater’s condition is satisfied and standard results imply that strong duality holds [[, e.g.,]Proposition 2.8]shapiro10. Therefore, UU is optimal if and only if there is a positive measure γcxμ\gamma\geq_{cx}\mu such that Udμ=hdγ\int U\,\mathrm{d}\mu=\int h\,\mathrm{d}\gamma, which implies the result. ∎

  • Proof of Theorem 1.

    Given sXs\in X, we denote by Q(s)Q(s) the partition element of 𝒬\mathcal{Q} that contains ss.

    \Leftarrow”: Let γ:=δQ(s)d|μ|(s)\gamma:=\int\delta_{Q(s)}\,\mathrm{d}|\mu|(s). Given the properties of μ|Q\mu|_{Q}, we conclude that γ+\gamma\in\mathcal{M}_{+} and suppγ{s:U(s)=h(s)}\operatorname{supp}\gamma\subseteq\{s:U(s)=h(s)\}. Moreover, for all c𝒰c\in\mathcal{U},

    c(x)dγ(x)=c(x)dδQ(s)(x)d|μ|(s)c(x)dμ|Q(s)d|μ|(s)=c(x)dμ(x).\displaystyle\int c(x)\,\mathrm{d}\gamma(x)=\int\int c(x)\,\mathrm{d}\delta_{Q(s)}(x)\,\mathrm{d}|\mu|(s)\geq\int\int c(x)\,\mathrm{d}\mu|_{Q(s)}\,\mathrm{d}|\mu|(s)=\int c(x)\,\mathrm{d}\mu(x).

    and equality holds for cUc\equiv U because (i) UU is affine on each Q𝒬Q\in\mathcal{Q} and (ii) δQcxμ|Q\delta_{Q}\geq_{cx}\mu|_{Q} implies a(x)dδQ=a(x)dμ|Q\int a(x)\,\mathrm{d}\delta_{Q}=\int a(x)\,\mathrm{d}\mu|_{Q} for any affine function a𝒞(X)a\in\mathcal{C}(X). Therefore, γ\gamma is feasible for the dual problem and satisfies the complementary slackness conditions (4) and (5). We conclude that UU is optimal.

    \Rightarrow”: By Lemma 3, UU is optimal if and only if there is γ+\gamma\in\mathcal{M}_{+} satisfying (4), (5), and γcxμ\gamma\geq_{cx}\mu. Letting μ+\mu^{+} and μ\mu^{-} denote the positive and negative parts of μ\mu, respectively, the last condition becomes γ+μcxμ+\gamma+\mu^{-}\geq_{cx}\mu^{+}. Since both sides of the inequality are positive measures, Strassen’s theorem [[, see, for example,]p. 93-94]phelps01 implies that there is a dilation DsD_{s} (that is, for each ss, DsD_{s} is a probability measure with barycenter ss) satisfying γ+μ=Dsdμ+(s)\gamma+\mu^{-}=\int D_{s}\,\mathrm{d}\mu^{+}(s).

    Let μ|Q\mu|_{Q} be a (regular, proper) system of conditional measures (such conditional measures exist by Example 10.4.11 in [8]), which by definition satisfies

    Xc(s)dμ(s)=XXc(y)dμ|Q(s)(y)d|μ|(s)\int_{X}c(s)\,\mathrm{d}\mu(s)=\int_{X}\int_{X}c(y)\,\mathrm{d}\mu|_{Q(s)}(y)\,\mathrm{d}|\mu|(s)

    for all c𝒞(X)c\in\mathcal{C}(X). Letting αQ:=Dsdμ|Q+(s)μ|Q\alpha_{Q}:=\int D_{s}\,\mathrm{d}\mu|_{Q}^{+}(s)-\mu|_{Q}^{-}, we claim that there is 𝒬𝒬\mathcal{Q}^{\prime}\subseteq\mathcal{Q} such that 𝒬\mathcal{Q}^{\prime} has |μ||\mu|-measure 0 and, for all Q𝒬𝒬Q\in\mathcal{Q}\setminus\mathcal{Q}^{\prime}, αQ\alpha_{Q} is a positive measure that has support on Q{s:U(s)=h(s)}Q\cap\{s:U(s)=h(s)\}.

    Before proving this claim, we show that it implies the result: From the definition of αQ\alpha_{Q} it follows that if αQ\alpha_{Q} is a positive measure then μ|Q(Q)αQ(Q)0\mu|_{Q}(Q)\geq\alpha_{Q}(Q)\geq 0. Also, αQcxμ|Q\alpha_{Q}\geq_{cx}\mu|_{Q} since DsD_{s} is a dilation; therefore, if αQ\alpha_{Q} has support in Q{s:U(s)=h(s)}Q\cap\{s:U(s)=h(s)\} then δQcxμ|Q\delta_{Q}\geq_{cx}\mu|_{Q}, where δQ\delta_{Q} is a point mass at Q{s:U(s)=h(s)}Q\cap\{s:U(s)=h(s)\} or the zero measure if U(s)<h(s)U(s)<h(s) for all sQs\in Q. For each Q𝒬Q\in\mathcal{Q}^{\prime}, we let μ|Q\mu|_{Q} be the zero measure and observe that μ|Q\mu|_{Q} is still a conditional measure for μ\mu since 𝒬\mathcal{Q}^{\prime} has measure 0. This proves the result.

    First, suppose there is 𝒬𝒬\mathcal{Q}^{\prime}\subseteq\mathcal{Q} with strictly positive |μ||\mu|-measure such that, for all Q𝒬Q^{\prime}\in\mathcal{Q}^{\prime}, the support of αQ\alpha_{Q^{\prime}} is not a subset of the closure of QQ. Fix arbitrary Q𝒬Q^{\prime}\in\mathcal{Q}^{\prime}. Since the support of αQ\alpha_{Q^{\prime}} is not contained in the closure of QQ^{\prime}, there is a set AQA\subseteq Q^{\prime} of strictly positive μ|Q+\mu|_{Q^{\prime}}^{+}-measure such that, for all xAx\in A, the support of DxD_{x} is not contained in the closure of QQ^{\prime}. Since Jensen’s inequality is strict whenever the convex function is not affine on the convex hull of the support [[]Proposition 16.C.1]marshall79, we obtain

    U(s)dμ|Q+(s)<[U(x)dDs(x)]dμ|Q+(s).\int U(s)\,\mathrm{d}\mu|_{Q^{\prime}}^{+}(s)<\int\left[\int U(x)\,\mathrm{d}D_{s}(x)\right]\,\mathrm{d}\mu|_{Q^{\prime}}^{+}(s).

    This yields

    U(s)dμ(s)\displaystyle\int U(s)\,\mathrm{d}\mu(s) =[U(x)dμ|Q(s)+(x)U(x)dμ|Q(s)(x)]d|μ|(s)\displaystyle=\int\left[\int U(x)\,\mathrm{d}\mu|_{Q(s)}^{+}(x)-\int U(x)\,\mathrm{d}\mu|_{Q(s)}^{-}(x)\right]\,\mathrm{d}|\mu|(s)
    <[(U(y)dDx(y))dμ|Q(s)+(x)U(x)dμ|Q(s)(x)]d|μ|(s)\displaystyle<\int\left[\int\left(\int U(y)\,\mathrm{d}D_{x}(y)\right)\,\mathrm{d}\mu|_{Q(s)}^{+}(x)-\int U(x)\,\mathrm{d}\mu|_{Q(s)}^{-}(x)\right]\,\mathrm{d}|\mu|(s)
    =U(y)dDs(y)dμ+(s)U(x)dμ(s)\displaystyle=\int\int U(y)\,\mathrm{d}D_{s}(y)\,\mathrm{d}\mu^{+}(s)-\int U(x)\,\mathrm{d}\mu^{-}(s)
    =U(s)dγ(s),\displaystyle=\int U(s)\,\mathrm{d}\gamma(s),

    which contradicts (5). We conclude that, except possibly on a |μ||\mu|-Null set, the support of αQ\alpha_{Q} is a subset of the closure of QQ.

    Second, we show that αQ(s)\alpha_{Q(s)} is a positive measure for |μ||\mu|-almost every ss: Let

    B:={sX:sclQclQ for QQ},B:=\{s\in X:s\in\operatorname{cl}Q\cap\operatorname{cl}Q^{\prime}\text{ for }Q\neq Q^{\prime}\},

    and note that for any sBs\in B, UU is not differentiable at ss and therefore U(s)<h(s)U(s)<h(s). Since suppγ{s:U(s)=h(s)}\operatorname{supp}\gamma\subseteq\{s:U(s)=h(s)\} by (4), γ(B)=0\gamma(B)=0. Moreover, μ(B)=0\mu^{-}(B)=0 because UU is continuously differentiable |μ||\mu|-almost everywhere by assumption. Let 𝒢\mathcal{G} denote the σ\sigma-algebra generated by 𝒬\mathcal{Q} and note that the Borel σ\sigma-algebra on XX is generated by some countable algebra {A1,A2,}\{A_{1},A_{2},...\} [[]Propositions 3.1 and 3.3]preston08. For each nn and G𝒢G\in\mathcal{G},

    GαQ(s)(An)d|μ|(s)=\displaystyle\int_{G}\alpha_{Q(s)}(A_{n})\,\mathrm{d}|\mu|(s)= GXDs(An)dμ|Q(s)+(s)μ|Q(s)(An)d|μ|(s)\displaystyle\int_{G}\int_{X}D_{s^{\prime}}(A_{n})\,\mathrm{d}\mu|_{Q(s)}^{+}(s^{\prime})-\mu|_{Q(s)}^{-}(A_{n})\,\mathrm{d}|\mu|(s)
    =\displaystyle= GDs(An)dμ+(s)μ(AnG)\displaystyle\int_{G}D_{s}(A_{n})\,\mathrm{d}\mu^{+}(s)-\mu^{-}(A_{n}\cap G)
    \displaystyle\geq [XDs(AnG)dμ+(s)μ(AnG)]XGDs(AnG)dμ+(s).\displaystyle\left[\int_{X}D_{s}(A_{n}\cap G)\,\mathrm{d}\mu^{+}(s)-\mu^{-}(A_{n}\cap G)\right]-\int_{X\setminus G}D_{s}(A_{n}\cap G)\,\mathrm{d}\mu^{+}(s).

    The bracketed term equals γ(AnG)\gamma(A_{n}\cap G) and is therefore positive. The last term is zero since

    XGDs(AnG)dμ+(s)Ds(AnGB)dμ+(s)μ(AnGB)=γ(AnGB)=0\int_{X\setminus G}D_{s}(A_{n}\cap G)\,\mathrm{d}\mu^{+}(s)\leq\int D_{s}(A_{n}\cap G\cap B)\,\mathrm{d}\mu^{+}(s)-\mu^{-}(A_{n}\cap G\cap B)=\gamma(A_{n}\cap G\cap B)=0

    (recall that γ(B)=μ(B)=0\gamma(B)=\mu^{-}(B)=0). Since αQ(s)(An)\alpha_{Q(s)}(A_{n}) is 𝒢\mathcal{G}-measurable in ss, it follows that there is a |μ||\mu|-Null set ZnZ_{n} such that αQ(s)(An)0\alpha_{Q(s)}(A_{n})\geq 0 for all sXZns\in X\setminus Z_{n}. Letting Z:=n=1ZnZ:=\bigcup_{n=1}^{\infty}Z_{n}, for all sXZs\in X\setminus Z and Borel sets AA, αQ(s)(A)0\alpha_{Q(s)}(A)\geq 0 by Caratheodory’s extension theorem [[, see]Theorem 1.5.6 and the comment afterward]bogachev07a.

Finally, if it is not true that for |μ||\mu|-almost every ss, the support of αQ(s)\alpha_{Q(s)} is a subset of {s:U(s)=h(s)}\{s:U(s)=h(s)\}, then Udγ<hdγ\int U\,\mathrm{d}\gamma<\int h\,\mathrm{d}\gamma, contradicting (4). Moreover, for any sclQQs\in\operatorname{cl}Q\setminus Q, UU is not differentiable at ss and therefore U(s)<h(s)U(s)<h(s) (because UhU\leq h and hh is differentiable). We conclude that there is a collection 𝒬𝒬\mathcal{Q}^{\prime}\subset\mathcal{Q} with |μ||\mu|-measure 0 such that, for all Q𝒬𝒬Q\in\mathcal{Q}\setminus\mathcal{Q}^{\prime}, αQ\alpha_{Q} is a positive measure that has support on Q{s:U(s)=h(s)}Q\cap\{s:U(s)=h(s)\}. ∎

  • Proof of Corollary 2.

    Let UU be an optimal indirect utility and 𝒬\mathcal{Q} a corresponding partition. Since μ|Q(Q)0\mu|_{Q}(Q)\geq 0 and μ|QcxδQ\mu|_{Q}\leq_{cx}\delta_{Q}, any pooling region121212That is, any QQ such that QSQ\cap S contains strictly more than one element. Q𝒬Q\in\mathcal{Q} must contain types with ν(s)0\nu(s)\geq 0 and types with ν(s)s\nu(s)\leq s.

    If ν(s¯)0\nu(\underline{s})\geq 0 and ν(s¯)0\nu(\overline{s})\geq 0, ν\nu is positive everywhere and the claim follows. So suppose ν(s¯)<0\nu(\underline{s})<0; then there is a pooling region Q:=[x,y]𝒬Q:=[x,y]\in\mathcal{Q} which contains s¯\underline{s} and some ss with ν(s)>0\nu(s)>0. If ν(y)<0\nu(y)<0, then [x,y]Q[x,y]\subseteq Q must hold and the claim follows. Therefore, assume ν(y)0\nu(y)\geq 0. The measure δQ\delta_{Q} from Theorem 1 must be a point mass at some zQz\in Q with ν(z)0\nu(z)\geq 0 (if δQ\delta_{Q} were the zero measure or a point mass at zz^{\prime} with ν(z)<0\nu(z^{\prime})<0, then xxdμ|Q>xxdδQ\int x-x^{*}\,\mathrm{d}\mu|_{Q}>\int x-x^{*}\,\mathrm{d}\delta_{Q} whenever x=inf{x:ν(x)0}x^{*}=\inf\{x:\nu(x)\geq 0\}, which contradicts μ|QcxδQ\mu|_{Q}\leq_{cx}\delta_{Q}). It follows that U(z)=h(z)U(z)=h(z).

    If ν(s¯)0\nu(\overline{s})\geq 0 then ν(s)0\nu(s)\geq 0 for all s[z,s¯]s\in[z,\overline{s}] and delegating to [z,s¯][z,\overline{s}] is optimal. If ν(s¯)<0\nu(\overline{s})<0, repeating our previous argument implies that there is an interval [x,y]𝒬[x^{\prime},y^{\prime}]\in\mathcal{Q} which contains s¯\overline{s} and some zz^{\prime} with ν(z)0\nu(z^{\prime})\geq 0 and U(z)=h(z)U(z^{\prime})=h(z^{\prime}). Since ν(s)0\nu(s)\geq 0 for all s[z,z]s\in[z,z^{\prime}], delegating to [z,z][z,z^{\prime}] is optimal. If ν(s¯)0\nu(\underline{s})\geq 0 and ν(s¯)<0\nu(\overline{s})<0, a symmetric argument applies. ∎

  • Proof of Corollary 3.

    It follows from (1) that ν(s)=f(s)[1βf(s)f(s)]\nu(s)=f(s)\left[1-\beta\frac{f^{\prime}(s)}{f(s)}\right] for s(s¯,s¯)s\in(\underline{s},\overline{s}). If β0\beta\geq 0 then ν\nu is singlecrossing from below on (s¯,s¯)(\underline{s},\overline{s}) (since ff is logconcave) and ν(s¯)0\nu(\underline{s})\leq 0. The claim then follows from Corollary 2. ∎

  • Proof of Corollary 4.

    The corresponding indirect utility induces the partition with the following elements: for any aa in the interior of AA, {a}\{a\}, and for any abdAa\in\operatorname{bd}A, the normal cone NA(a)N_{A}(a), which is a ray through aa and orthogonal to bdA\operatorname{bd}A. For any such normal ray QQ, condition (ii) is equivalent to μ|QcxδQ\mu|_{Q}\geq_{cx}\delta_{Q} by the same argument as in Corollary 1.

    \Leftarrow”: Condition (i) ensures that μ|{a}\mu|_{\{a\}} is positive for all aa in the interior of AA. Since it has singleton support, μ|{a}cxδ{a}\mu|_{\{a\}}\geq_{cx}\delta_{\{a\}}. For any normal ray QQ, μ|QcxδQ\mu|_{Q}\geq_{cx}\delta_{Q} by condition (ii) and μ|Q(Q)0\mu|_{Q}(Q)\geq 0 since 0ν(s+x𝐧^A(s))dλ(s+x𝐧^A(s)|ray)0\int_{0}^{\infty}\nu(s+x\hat{\mathbf{n}}_{A}(s))\,\mathrm{d}\lambda(s+x\hat{\mathbf{n}}_{A}(s)|ray)\geq 0 follows from condition (ii). It follows from Theorem 1 that UU is optimal.

    \Rightarrow”: If ν(a)<0\nu(a)<0 for some aa in the interior of AA then there is a subset of AA with positive |μ||\mu|-measure on which ν\nu is strictly negative, which implies μ|Q(Q)<0\mu|_{Q}(Q)<0 on a set of positive measure, which contradicts optimality of UU. Similarly, if ν(a)<0\nu(a)<0 for some abdAa\in\operatorname{bd}A then it can be shown that μ|Q(Q)<0\mu|_{Q}(Q)<0 on a set of positive measure, which contradicts optimality of UU by Theorem 1.

    If condition (ii) is violated, μ|QcxδQ\mu|_{Q}\not\geq_{cx}\delta_{Q} on a set of positive measure, which again contradicts optimality of UU by Theorem 1. ∎