This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

Lorenz map, inequality ordering and curves based on multidimensional rearrangements

Yanqin Fan, Marc Henry, Brendan Pass and Jorge A. Rivero University of Washington, Penn State, University of Alberta and University of Washington
Abstract.

We propose a multivariate extension of the Lorenz curve based on multivariate rearrangements of optimal transport theory. We define a vector Lorenz map as the integral of the vector quantile map associated with a multivariate resource allocation. Each component of the Lorenz map is the cumulative share of each resource, as in the traditional univariate case. The pointwise ordering of such Lorenz maps defines a new multivariate majorization order, which is equivalent to preference by any social planner with inequality averse multivariate rank dependent social evaluation functional. We define a family of multi-attribute Gini index and complete ordering based on the Lorenz map. We propose the level sets of an Inverse Lorenz Function as a practical tool to visualize and compare inequality in two dimensions, and apply it to income-wealth inequality in the United States between 1989 and 2022.

Keywords: Multidimensional inequality, Lorenz curve, Gini index, vector quantiles, optimal transport, majorization

JEL codes: D63

The first version is of March 11, 2022. This version is of . The authors are grateful to Isaiah Andrews, Marco Scarsini, Xiaoxia Shi, John Weymark and three anonymous referees for helpful comments. Corresponding author: Marc Henry: [email protected] Department of Economics, The Pennsylvania State University, University Park, PA 16802.

Introduction

The Lorenz curve, first proposed in Lorenz (1905), is a compelling visual and simple quantification tool for the analysis of dispersion in univariate distributions. It allows easy visualization of dispersion from the curvature of a convex curve and its distance from the diagonal. The diagonal itself is the Lorenz curve of a degenerate distribution– an egalitarian allocation where all individuals have the same amount of resource. It also enables quick computations, reading off the curve, as it were, of the share of a resource held by the top or bottom of the allocation distribution for that resource. These features of the Lorenz curve account for much of its enduring appeal among practitioners, policy analysts and policy makers. This appeal is further enhanced by the relation between majorization and the pointwise ordering of Lorenz curves, which provides a way to visualize inequality comparisons between populations and within a given population between time periods. Comprehensive accounts are given in Marshall et al. (2011) and Arnold and Sarabia (2018).

The appealing properties of the Lorenz curve are well captured by the formulation given in Gastwirth (1971). In that formulation, the Lorenz curve is the graph of the Lorenz map, and the latter is the cumulative share of individuals below a given rank in the distribution, i.e., the normalized integral of the quantile function. The relation to majorization and the convex order follows immediately, as shown in section C of Marshall et al. (2011). As pointed out by Arnold (2008), this makes the Lorenz ordering an uncontroversial partial inequality ordering of univariate distributions, and most open questions concern the higher dimensional case.

Dispersion in multivariate distributions is not adequately described by the Lorenz curve of each marginal, and a genuinely multidimensional approach is needed. Even for utilitarian welfare inequality, Atkinson and Bourguignon (1982) motivate the need for the multidimensional approach initiated by Fisher (1956). More generally, the literature on multidimensional inequality of outcomes and its measurement is vast, as evidenced by many recent surveys, see for instance Decancq and Lugo (2012), Aaberge and Brandolini (2014), Andreoli and Zoli (2020). We only discuss it insofar as it relates to the Lorenz curve.

Multivariate extensions have been proposed for the Lorenz curve, most notably Taguchi (1972a, 1972b), Arnold (1983), and Koshevoy and Mosler (1996, 1999)111More recently, subsequent to our work, Hallin and Mordant (2022) also adopt a multivariate rearrangement approach to the definition of multi-attribute Lorenz curves. They adopt a center-outward approach (see Hallin et al. (2021)), which is better suited to define notions of middle class.. They are reviewed in Marshall et al. (2011) and Sarabia and Jorda (2014) and discussed in more details in section 1.4.1, where we compare them to our proposal. We contribute to this literature with a vector version of the Gastwirth (1971) formulation of the Lorenz curve. We provide an implementable criterion to measure and compare inequality in multivariate distributions, which emulates the features of the Lorenz curve that most contributed to its success.

The traditional Gastwirth (1971) formulation of the Lorenz curve is an integrated quantile over the lowest ranked individuals. To simplify the argument in the univariate case, model the population as a continuum on [0,1]\displaystyle[0,1] and suppose the distribution of incomes in the population is continuous. Then the Gastwirth (1971) formulation can be thought of involving two stages. Take an income allocation Y\displaystyle Y, which is a random variable on +\displaystyle\mathbb{R}_{+} with cumulative distribution function FY\displaystyle F_{Y}. First, reorder individuals in the population so that they are ranked in increasing incomes. Then compute the cumulative share of lowest ranked individuals by integrating FY1\displaystyle F_{Y}^{-1} from 0\displaystyle 0 to r\displaystyle r and dividing by the mean. The first step involves the probability integral transform FY(Y)\displaystyle F_{Y}(Y), which should be thought of in this context as a cardinal to ordinal transformation, since FY(Y)\displaystyle F_{Y}(Y) is uniform on [0,1]\displaystyle[0,1], so cardinal information is purged, but FY\displaystyle F_{Y} is increasing, so ordinal information is preserved. Our proposal is based on a multivariate version of the cardinal to ordinal transformation involved in the first stage. The latter is the unique map that transforms a d\displaystyle d dimensional allocation into a uniform one on [0,1]d\displaystyle[0,1]^{d}, and is cyclically monotone222Existence and uniqueness are shown in McCann (1995). See section 1.1 for details and definitions. and hence preserves ordinal information. This motivates our definition of the vector Lorenz map as the cumulative integral of the multivariate quantile of Chernozhukov et al. (2017).

The vector Lorenz map we propose, therefore, is the vector of shares of each resource held by individuals below a given rank. The associated Lorenz inequality dominance criterion deems a multivariate allocation more equal if this share of resources is larger for each rank. Hence, our proposal shares the interpretation of the traditional Lorenz curve and Lorenz dominance. It also shares the desirable properties of the Lorenz curve and dominance ordering. Like the Lorenz zonoid of Koshevoy and Mosler (1996, 1999), it characterizes the distribution of an allocation (see section 1.4.1 for a definition and discussion). Unlike the Lorenz zonoid, the vector Lorenz map we propose can be efficiently computed as an unconstrained convex optimization problem and connected to recent developments in computational optimal transport theory333An account of recent advances is given in Peyré and Cuturi (2018).. Hence, the Lorenz dominance order we propose is an implementable inequality dominance criterion. Using recent advances on the asymptotic properties of multivariate quantiles, surveyed in Hallin (2022), our Lorenz dominance criterion can be the basis for inequality dominance testing that accounts for sampling uncertainty. This contrasts our proposal with the growing literature on multivariate inequality dominance criteria proposed for finite populations. See for instance Gravel and Moyes (2012), Banerjee (2016), Faure and Gravel (2021) and references within.

Other implementable inequality dominance criteria are proposed in the literature, in Koshevoy (1995), Koshevoy and Mosler (1996), Koshevoy and Mosler (2007) and Banerjee (2016) and other references surveyed in Arnold and Sarabia (2018). However, they do not provide an equivalence between the Lorenz dominance criterion and a class of compatible social evaluation functionals. An exception is Gravel and Moyes (2012) and Faure and Gravel (2021) who give a comprehensive treatment of the special case of a finite population with a single cardinal transferable attribute combined with an ordinal non transferable one. We characterize the class of social evaluation functionals that are inequality averse in the sense that they are increasing in the Lorenz dominance order. We build on the multivariate extension of the Quiggin (1992)-Yaari (1987) rank dependent decision theory in Galichon and Henry (2012) to show that, as in Weymark (1981) for the univariate case, social evaluation functionals are inequality averse if and only if they are rank dependent social evaluation functionals with attribute specific weights decreasing in ranks. We also characterize the class of transfers that increase inequality according to the Lorenz dominance criterion as rank preserving transfers of any attribute from a lower to a higher ranked individual. A special case of such transfers, which we call monotone regressive transfers weakly increase marginal inequality and dependence between attributes.

To visualize Lorenz dominance, we define an Inverse Lorenz Function at a given vector of resource shares as the fraction of the population that cumulatively holds those shares. It is characterized by the cumulative distribution function of the image of a uniform random vector by the Lorenz map. Hence, it is a cumulative distribution function by construction, like the univariate inverse Lorenz curve. In two dimensions, the α\displaystyle\alpha-level sets of this cumulative distribution function, which we call α\displaystyle\alpha-Lorenz curves, are non crossing downward sloping curves that shift to the south-west when inequality increases, as defined by the Lorenz ordering. For the cases, where allocations are not ranked in the Lorenz inequality dominance ordering, we propose a family of multivariate S-Gini coefficients based on our vector Lorenz map, with the flexibility to entertain different tastes for inequality in different dimensions. Finally, we propose an illustration to the analysis of income-wealth inequality in the United States between 1989 and 2022.

Plan of the paper

In the first section, we define the Lorenz map, explain its computation, detail its properties and how it compares with alternative proposals. In section 2, we introduce the Lorenz dominance ordering, its characterization in terms of classes of social evaluation functionals and in terms of transfers compatible with it. Section 3 illustrates the implementation of our proposed tools, and the final section concludes.

1. Vector Lorenz Map

1.1. Definition of the Lorenz map

The Lorenz curve was originally proposed in Lorenz (1905) to provide a graphical representation of inequality of distribution of a single resource. Let Y\displaystyle Y be a random variable on +\displaystyle\mathbb{R}_{+} with cumulative distribution function FY\displaystyle F_{Y}, which represents the allocation of a resource in a population. The population is modeled as the continuum [0,1]\displaystyle[0,1].

The Lorenz curve is traditionally defined as the set of points in [0,1]2\displaystyle[0,1]^{2}, parameterized by y\displaystyle y, with coordinates

(FY(y),1μY0yv𝑑FY(v)),\displaystyle\displaystyle\left(F_{Y}(y),\frac{1}{\mu_{Y}}\int_{0}^{y}vdF_{Y}(v)\right), (1.1)

where μY\displaystyle\mu_{Y} is the expectation of Y\displaystyle Y. See for instance page 149 of Arnold and Sarabia (2018). Gastwirth (1971) points out that the Lorenz curve is given by the graph of the map on [0,1]\displaystyle[0,1]

qLY(q)=1μY0qFY1(v)𝑑v,\displaystyle\displaystyle\begin{array}[]{ccccc}q&\mapsto&L_{Y}(q)&=&\frac{1}{\mu_{Y}}\int_{0}^{q}F_{Y}^{-1}(v)\,dv,\end{array} (1.3)

where F1(v):=inf{y:vFY(y)}\displaystyle F^{-1}(v):=\inf\{y:v\leq F_{Y}(y)\} is the traditional quantile function. Formulation (1.3) provides a closed form expression and simple interpretation: For each proportion q[0,1]\displaystyle q\in[0,1], the Lorenz map gives the cumulative share of the resource held by the poorest proportion q\displaystyle q of the population. This relies on the well known fact that the quantile function FY1\displaystyle F_{Y}^{-1} is the only increasing function such that for any uniformly distributed random variable V\displaystyle V on [0,1]\displaystyle[0,1], FY1(V)\displaystyle F_{Y}^{-1}(V) is distributed identically to Y\displaystyle Y. Hence, integrating FY1\displaystyle F_{Y}^{-1} from 0\displaystyle 0 to q\displaystyle q and normalizing produces the cumulative share held by the individuals ranked below q\displaystyle q.

Conversely, when FY\displaystyle F_{Y} admits a density, the probability integral transform V:=FY(Y)\displaystyle V:=F_{Y}(Y) produces a uniformly distributed random variable V\displaystyle V on [0,1]\displaystyle[0,1], which preserves the ranks of individuals in the population. This holds because FY\displaystyle F_{Y} is an increasing map. Hence, the probability integral transform removes cardinal information (by producing a uniformly distributed outcome), while preserving ordinal information (by keeping the rank order of individuals in the population). The probability integral transform V:=FY(Y)\displaystyle V:=F_{Y}(Y) is the rank associated with allocation Y\displaystyle Y.

If FY\displaystyle F_{Y} is not continuous, then FY(Y)\displaystyle F_{Y}(Y) is no longer uniformly distributed (positive masses of individuals have identical ranks). However, it is still the case that for any uniformly distributed random variable V\displaystyle V on [0,1]\displaystyle[0,1], FY1(V)=inf{y:VFY(y)}\displaystyle F_{Y}^{-1}(V)=\inf\{y:V\leq F_{Y}(y)\} is distributed identically to Y\displaystyle Y. Hence, the closed form solution for the Lorenz map (1.3) still holds with the same interpretation: Integrating FY1\displaystyle F_{Y}^{-1} from 0\displaystyle 0 to q\displaystyle q and normalizing still produces the cumulative share held by the individuals ranked below q\displaystyle q.

Consider now an allocation X:=(X1,,Xd)\displaystyle X:=(X_{1},\ldots,X_{d}) of d\displaystyle d resources in the population. To analyze inequality in allocation X\displaystyle X, we can first look at inequality in each marginal allocation X1,,Xd\displaystyle X_{1},\ldots,X_{d}, using the univariate Lorenz curves L1:=LX1,,Ld:=LXd\displaystyle L_{1}:=L_{X_{1}},\ldots,L_{d}:=L_{X_{d}}. However, this strategy disregards the effect of dependence. The latter is relevant to inequality, as can be trivially illustrated by the fact that for given wealth and income marginal allocations, the comonotonic allocation (the wealthier individuals have higher income) is more unequal than the admittedly unrealistic counter-monotonic allocation (the wealthier individuals have lower income).

To take dependence into account, we propose to emulate the Gastwirth (1971) formulation by measuring cumulative shares of each resource for all individuals, below a certain rank. Conceptually, this is achieved in two steps. First, we find a transformation that removes cardinal information while preserving individual’s ranking in the population, i.e., a cardinal to ordinal transformation. Then we integrate the shares of individuals with lowest rank. The difficulty here, of course, is the absence of a canonical order in d\displaystyle\mathbb{R}^{d} to define the rank.

As noted in Faugeras and Rüschendorf (2017), by Borel’s Isomorphism Theorem444See for instance section 13.1 page 487 of Dudley (2002)., there exist measurable bijective maps T:[0,1]d\displaystyle T:[0,1]\rightarrow\mathbb{R}^{d} such that for any uniformly distributed random variable V\displaystyle V on [0,1]\displaystyle[0,1], T(V)\displaystyle T(V) is distributed identically to X\displaystyle X. However, such maps are unsuitable cardinal to ordinal transformations for two main reasons. First, there is no known explicit construction, hence no way to compute them. Second, even if we could compute such a map, its choice would imply an implicit ad hoc aggregation of the different resources in allocation X\displaystyle X in order to arrive at a scalar ranking of individuals in the population.

In order to avoid an implicit ad hoc aggregation of the different resources in X\displaystyle X, the cardinal to ordinal transformation must be between d\displaystyle\mathbb{R}^{d} and [0,1]d\displaystyle[0,1]^{d}. Hence, we model the population as a continuum on [0,1]d\displaystyle[0,1]^{d} and individual ranks are points in [0,1]d\displaystyle[0,1]^{d}. The multivariate quantile transform, and its inverse (the cardinal to ordinal transform, or rank transform), must satisfy the same requirements as in the univariate case: It must map the uniform distribution (no cardinal information) to the distribution of the allocation, and it must be monotonic (so as to preserve ordinal information). The monotonicity of the quantile in the univariate case ensures that the cardinal to ordinal transformation does indeed preserve the rankings of individuals in the population.

To construct an analogue of the Gastwirth (1971) Lorenz curve formulation, we therefore need the cardinal to ordinal transformation to satisfy a form of multivariate monotonicity. The classical notion of monotonicity in d\displaystyle\mathbb{R}^{d}, also known as 2\displaystyle 2-monotonicity of a map T:dd\displaystyle T:\mathbb{R}^{d}\rightarrow\mathbb{R}^{d}, requires

(T(x)T(x))(xx)\displaystyle\displaystyle\left(T(x^{\prime})-T(x)\right)^{\top}\left(x^{\prime}-x\right) \displaystyle\displaystyle\geq 0\displaystyle\displaystyle 0

for any pair of vectors x,xd\displaystyle x,x^{\prime}\in\mathbb{R}^{d}. It can be interpreted as monotonicity on average. For uniqueness of the cardinal to ordinal transformation, we need the stronger version of monotonicity, called cyclical monotonicity, which characterizes the gradients of convex functions and was introduced by Rockafellar (1966). Cyclical monotonicity requires

i=1K(T(xi)T(xi+1))(xixi+1)\displaystyle\displaystyle\sum_{i=1}^{K}(T(x_{i})-T(x_{i+1}))^{\top}\left(x_{i}-x_{i+1}\right) \displaystyle\displaystyle\geq 0\displaystyle\displaystyle 0

for any K\displaystyle K, and any collection of vectors (x1,,xK)\displaystyle(x_{1},\ldots,x_{K}), setting xK+1=x1\displaystyle x_{K+1}=x_{1}. Cyclical monotonicity also characterizes maps T\displaystyle T that minimize distortion in the sense that in case the allocation X\displaystyle X has continuous distribution with finite variance, T\displaystyle T minimizes 𝔼XT(X)2\displaystyle\mathbb{E}\|X-T(X)\|^{2} among all the maps such that T(X)\displaystyle T(X) is uniformly distributed on [0,1]d\displaystyle[0,1]^{d}.

The following definition summarizes the properties needed for a cardinal to ordinal transformation as a first step in the Lorenz map construction.

Definition 1 (Vector quantile).

A vector quantile QX\displaystyle Q_{X} associated with random vector X\displaystyle X on d\displaystyle\mathbb{R}^{d} is a map QX:[0,1]dd\displaystyle Q_{X}:[0,1]^{d}\rightarrow\mathbb{R}^{d} with the following properties.

  1. (1)

    For any uniformly distributed random variable U\displaystyle U on [0,1]d\displaystyle[0,1]^{d}, QX(U)\displaystyle Q_{X}(U) is distributed identically to X\displaystyle X.

  2. (2)

    If the distribution of X\displaystyle X is absolutely continuous, QX\displaystyle Q_{X} is invertible and QX1(X)\displaystyle Q_{X}^{-1}(X) is uniformly distributed on [0,1]d\displaystyle[0,1]^{d}.

  3. (3)

    The map QX\displaystyle Q_{X} is cyclically monotone.

  4. (4)

    When d=1\displaystyle d=1, QX\displaystyle Q_{X} is the traditional quantile function (This is automatically satisfied when (1) and (3) hold).

As shown in McCann (1995), there exists a transformation that conforms with definition 1 and it is unique in the sense that two such transformations are equal almost everywhere. It is proposed as a vector quantile notion in Chernozhukov et al. (2017), and we will refer to it as the vector quantile associated with X\displaystyle X.

Once we model the population as a continuum on [0,1]d\displaystyle[0,1]^{d}, interpret each point on [0,1]d\displaystyle[0,1]^{d} as a rank, and define the vector quantile QX\displaystyle Q_{X} as a multidimensional rearrangement of the allocation X\displaystyle X in rank order, we simply integrate the quantile over the lowest ranks to define a multivariate version of the Gastwirth (1971) formulation of the Lorenz curve.

Definition 2 (Lorenz map).

Let U\displaystyle U be a uniformly distributed random vector on [0,1]d\displaystyle[0,1]^{d}, and let X:=(X1,,Xd)\displaystyle X:=(X_{1},\ldots,X_{d}) be an allocation, i.e., a random vector on +d\displaystyle\mathbb{R}_{+}^{d} with finite mean μ=(μ1,,μd)\displaystyle\mu=(\mu_{1},\ldots,\mu_{d}). Call X~\displaystyle\tilde{X} the normalized version of X\displaystyle X, i.e.,

X~:=(X1μ1,,Xdμd),\displaystyle\displaystyle\tilde{X}:=\left(\frac{X_{1}}{\mu_{1}},\ldots,\frac{X_{d}}{\mu_{d}}\right),

and let QX~\displaystyle Q_{\tilde{X}} be the vector quantile of X~\displaystyle\tilde{X}. The Lorenz map of allocation X\displaystyle X is the vector-valued function X:[0,1]d[0,1]d\displaystyle\mathcal{L}_{X}:[0,1]^{d}\to[0,1]^{d} defined for each r:=(r1,,rd)[0,1]d\displaystyle r:=(r_{1},\ldots,r_{d})\in[0,1]^{d} by

X(r1,,rd)\displaystyle\displaystyle\mathcal{L}_{X}(r_{1},\ldots,r_{d}) =\displaystyle\displaystyle= 0r10rdQX~(u1,,ud)𝑑u1𝑑ud.\displaystyle\displaystyle\int_{0}^{r_{1}}\!\!\!\cdots\!\int_{0}^{r_{d}}Q_{\tilde{X}}(u_{1},\ldots,u_{d})du_{1}\ldots du_{d}. (1.4)

The transformation of X\displaystyle X into its normalized version X~\displaystyle\tilde{X} prior to integrating the vector quantile is required to remove dependence of the Lorenz map of definition 2 on units of measurements. Different resources, such as earnings and health, may not be measured with the same units of measurement. The transformation into X~\displaystyle\tilde{X} makes the allocation unit free. Hence the Lorenz map satisfies ratio-scale invariance (i.e., invariance to rescaling of the different attributes, or change of units of measurement). Section 1.3 discusses an alternative unnormalized version of the definition in the spirit of Shorrocks (1983).

When X\displaystyle X has absolutely continuous distribution PX\displaystyle P_{X}, its quantile function is PX\displaystyle P_{X}-almost everywhere invertible (see for instance theorem 2.1 in Chernozhukov et al. (2017)). In that case, the transformation U=QX1(X)\displaystyle U=Q_{X}^{-1}(X) is the vector analogue of the probability integral transform V=FY(Y)\displaystyle V=F_{Y}(Y) discussed above. The random vector U=QX1(X)\displaystyle U=Q_{X}^{-1}(X) is uniformly distributed on [0,1]d\displaystyle[0,1]^{d}, and is the vector rank of the individual with endowment X\displaystyle X, in the terminology of Chernozhukov et al. (2017). The Lorenz map of definition 2 can then be rewritten as:

X(r)\displaystyle\displaystyle\mathcal{L}_{X}(r) =\displaystyle\displaystyle= 𝔼[X~ 1{QX~1(X~)r}].\displaystyle\displaystyle\mathbb{E}\left[\tilde{X}\;\mathds{1}\left\{Q_{\tilde{X}}^{-1}(\tilde{X})\leq r\right\}\right]. (1.5)

This clarifies the interpretation of X(r)\displaystyle\mathcal{L}_{X}(r) as the cumulative share of all individuals with vector rank below r\displaystyle r in the partial order of d\displaystyle\mathbb{R}^{d}.

In the scalar case discussed above, inverting the Lorenz curve LY\displaystyle L_{Y} defined in (1.3) yields the inverse Lorenz curve

LY1(y)=0LY1(y)𝑑v=01 1{LY(v)y}𝑑v=(LY(V)y),\displaystyle\displaystyle\begin{array}[]{ccccccc}L_{Y}^{-1}(y)&=&\int_{0}^{L_{Y}^{-1}(y)}dv&=&\int_{0}^{1}\,\mathds{1}\{L_{Y}(v)\leq y\}\,dv&=&\mathbb{P}(L_{Y}(V)\leq y),\end{array} (1.7)

where the probability is taken with respect to a uniformly distributed random variable V\displaystyle V on [0,1]\displaystyle[0,1]. The scalar inverse Lorenz curve at y\displaystyle y is therefore shown in (1.7) to be equal to the maximum proportion of the population with cumulative share of the resource equal to y\displaystyle y. In the vector case, the analogue of the right-hand side of (1.7) can still be used to define an Inverse Lorenz Function.

Definition 3 (Inverse Lorenz Function).

The Inverse Lorenz Function (ILF) of a random vector X\displaystyle X is the function lX:[0,1]d[0,1]\displaystyle l_{X}:[0,1]^{d}\to[0,1] defined for each z=(z1,,zd)[0,1]d\displaystyle z=(z_{1},\ldots,z_{d})\in[0,1]^{d} by lX(z):=(X(U)z),\displaystyle l_{X}(z):=\mathbb{P}(\mathcal{L}_{X}(U)\leq z), where z=(z1,,zd)[0,1]d\displaystyle z=(z_{1},\ldots,z_{d})\in[0,1]^{d}, inequality \displaystyle\leq is understood component-wise, and the probability is taken with respect to the uniform random vector U\displaystyle U on [0,1]d\displaystyle[0,1]^{d}.

The expression above is no longer the mathematical inverse of the Lorenz map X\displaystyle\mathcal{L}_{X}, but it can still be interpreted as the share of the population with cumulative shares of all resources equal to a predetermined proportion z=(z1,,zd)\displaystyle z=(z_{1},\ldots,z_{d}).

1.2. Computation and examples

1.2.1. Computation

We now give a step-by-step method to compute the Lorenz map of a discrete distribution, which may be the allocation in a finite population, or the empirical distribution of a (possibly weighted) sample from an underlying (possibly mixed discrete-continuous) distribution. The full algorithm and a step-by-step guide to implementation in R are given in appendix A.

Let X\displaystyle X be a random vector in +d\displaystyle\mathbb{R}_{+}^{d} with discrete distribution. The probability mass function of the distribution of X\displaystyle X is given by {(x1,w1),,(xn,wn)}\displaystyle\{(x^{1},w_{1}),\ldots,(x^{n},w_{n})\}, where x1,,xn\displaystyle x^{1},\ldots,x^{n} are vectors in +d\displaystyle\mathbb{R}_{+}^{d} and w1,,wn\displaystyle w_{1},\ldots,w_{n} are positive scalar weights summing to 1\displaystyle 1.

  1. (1)

    First, normalize the allocation vector X\displaystyle X and form X~:=(X1/μ1,,Xd/μd)\displaystyle\tilde{X}:=(X_{1}/\mu_{1},\ldots,X_{d}/\mu_{d}), where Xj\displaystyle X_{j} is the j\displaystyle j-th coordinate of X\displaystyle X and μj=Σi=1nwixji\displaystyle\mu_{j}=\Sigma_{i=1}^{n}w_{i}x_{j}^{i} is the mean of Xj\displaystyle X_{j}, for each j=1,,d\displaystyle j=1,\ldots,d. The issue of normalization is discussed in section 1.3.

  2. (2)

    Then compute the vector quantile QX~\displaystyle Q_{\tilde{X}} of X~\displaystyle\tilde{X}. According to definition 1 (requirements (2) and (4)), the vector quantile QX~\displaystyle Q_{\tilde{X}} must satisfy the following requirements:

    1. (a)

      For any uniformly distributed random variable U\displaystyle U on [0,1]d\displaystyle[0,1]^{d}, QX~(U)\displaystyle Q_{\tilde{X}}(U) is distributed identically to X~\displaystyle\tilde{X}. Hence:

      1. (i)

        For all u[0,1]d\displaystyle u\in[0,1]^{d}, QX~(u){x1,,xn}\displaystyle Q_{\tilde{X}}(u)\in\{x^{1},\ldots,x^{n}\};

      2. (ii)

        For all i=1,,n,\displaystyle i=1,\ldots,n,

        Wi\displaystyle\displaystyle W_{i} :=\displaystyle\displaystyle:= QX~1(xi)={u[0,1]d:QX~(u)=xi}\displaystyle\displaystyle Q_{\tilde{X}}^{-1}(x^{i})=\{u\in[0,1]^{d}:Q_{\tilde{X}}(u)=x^{i}\} (1.8)

        has measure wi\displaystyle w_{i}.

    2. (b)

      The map QX~\displaystyle Q_{\tilde{X}} is cyclically monotone. Hence, by Rockafellar (1966), there is a convex function ψX~:[0,1]d\displaystyle\psi_{\tilde{X}}:[0,1]^{d}\rightarrow\mathbb{R} such that QX~\displaystyle Q_{\tilde{X}} is almost everywhere equal to the gradient of ψX~\displaystyle\psi_{\tilde{X}}.

    Since QX~\displaystyle Q_{\tilde{X}} takes a finite number of values and is constant and equal to xi\displaystyle x^{i} on each Wi\displaystyle W_{i}, the computation of QX~\displaystyle Q_{\tilde{X}} is equivalent to the computation of the partition of [0,1]d\displaystyle[0,1]^{d} in regions W1,,Wn\displaystyle W_{1},\ldots,W_{n}. As QX~\displaystyle Q_{\tilde{X}} is the gradient of the convex function ψX~\displaystyle\psi_{\tilde{X}} and is constant on each of the Wi\displaystyle W_{i}, ψX~\displaystyle\psi_{\tilde{X}} is affine on each of the Wi\displaystyle W_{i}, and each Wi\displaystyle W_{i} is a convex polytope in [0,1]d\displaystyle[0,1]^{d}. Aurenhammer et al. (1998) show

    ψX~(u)\displaystyle\displaystyle\psi_{\tilde{X}}(u) =\displaystyle\displaystyle= maxi=1,,n{uxihi},\displaystyle\displaystyle\max_{i=1,\ldots,n}\{u^{\top}x^{i}-h^{i}\},

    where h:=(h1,,hn)\displaystyle h:=(h^{1},\ldots,h^{n}) solves the convex optimization program

    min{i=1nwihi+[0,1]dmaxk=1,,n{uxkhk}𝑑u}.\displaystyle\displaystyle\min\;\left\{\sum_{i=1}^{n}w_{i}h^{i}+\int_{[0,1]^{d}}\max_{k=1,\ldots,n}\{u^{\top}x^{k}-h^{k}\}du\right\}. (1.9)

    The algorithm minimizes (1.9) to find h\displaystyle h, from which the regions Wi\displaystyle W_{i} are obtained as

    Wih\displaystyle\displaystyle W_{i}^{h} =\displaystyle\displaystyle= {u[0,1]d:uxihiuxjhj,1jn}.\displaystyle\displaystyle\{u\in[0,1]^{d}:u^{\top}x^{i}-h^{i}\geq u^{\top}x^{j}-h^{j},1\leq j\leq n\}. (1.10)

    The first order conditions of (1.9) fulfill requirement (2)(a)(ii), i.e., for all i=1,,n\displaystyle i=1,\dots,n,

    λ(Wih)\displaystyle\displaystyle\lambda(W_{i}^{h}) =\displaystyle\displaystyle= wi,\displaystyle\displaystyle w_{i}, (1.11)

    where λ\displaystyle\lambda is the Lebesgue measure. Finally, the vector quantile QX~\displaystyle Q_{\tilde{X}} is the piecewise constant map that takes values xi\displaystyle x^{i} on each Wi:=Wih\displaystyle W_{i}:=W_{i}^{h},  i=1,,n\displaystyle i=1,\ldots,n.

  3. (3)

    Once we have computed the vector quantile map QX~\displaystyle Q_{\tilde{X}}, the Lorenz map at r:=(r1,,rn)[0,1]d\displaystyle r:=(r_{1},\ldots,r_{n})\in[0,1]^{d} is obtained straightforwardly as the integral of the piece-wise constant map QX~\displaystyle Q_{\tilde{X}} over [0,r]:=[0,r1]××[0,rd]\displaystyle[0,r]:=[0,r_{1}]\times\ldots\times[0,r_{d}]:

    X(r)\displaystyle\displaystyle\mathcal{L}_{X}(r) =\displaystyle\displaystyle= i=1nλ(Wi[0,r])xi,\displaystyle\displaystyle\sum_{i=1}^{n}\lambda\left(W_{i}\cap[0,r]\right)\;x^{i}, (1.12)

    where the term in λ\displaystyle\lambda is the ordinary area of the convex polytope formed by the intersection of the cell Wi\displaystyle W_{i} and the rectangle [0,r]\displaystyle[0,r].

  4. (4)

    Finally, X\displaystyle\mathcal{L}_{X} can be used to generate a pseudo sample {X(U1),,X(Um)}\displaystyle\{\mathcal{L}_{X}(U_{1}),\ldots,\mathcal{L}_{X}(U_{m})\}, where {U1,,Um}\displaystyle\{U_{1},\dots,U_{m}\} is a uniformly distributed random sample or any pseudo-random (a.k.a. minimum discrepancy) sequence that approximates the uniform distribution on [0,1]d\displaystyle[0,1]^{d}. The Inverse Lorenz Function lX\displaystyle l_{X} can then be approximated with the empirical distribution of this pseudo-sample:

    lm(z)\displaystyle\displaystyle l_{m}(z) :=\displaystyle\displaystyle:= 1mj=1m𝟙{X(Uj)z},z[0,1]d.\displaystyle\displaystyle\frac{1}{m}\sum_{j=1}^{m}\mathds{1}\{\mathcal{L}_{X}(U_{j})\leq z\},\>\>\>z\in[0,1]^{d}. (1.13)

1.2.2. Examples

To illustrate the definition and the computation of the Lorenz map, we now explore examples of specific allocations and compute the corresponding Lorenz maps. First, we illustrate the computation of the Lorenz map for a discrete allocation.

Example 1 (Discrete allocations).

Let X\displaystyle X be the allocation with probability mass function {(x1,1/n),,(xn,1/n)}\displaystyle\{(x^{1},1/n),\ldots,(x^{n},1/n)\}. We select the support points (x1,,xn)\displaystyle(x^{1},\ldots,x^{n}) as the realizations of n\displaystyle n i.i.d. draws from the bivariate standard normal distribution. The vector quantile QX~\displaystyle Q_{\tilde{X}} of the normalized allocation X~\displaystyle\tilde{X} is characterized by its value xi\displaystyle x^{i} on the convex polygon Wi\displaystyle W_{i}, such that (W1,,Wn)\displaystyle(W_{1},\ldots,W_{n}) form the partition of [0,1]2\displaystyle[0,1]^{2} shown on the left panel of figure 1. As shown on the right panel of figure 1, the Lorenz map at r=(r1,r2)[0,1]2\displaystyle r=(r_{1},r_{2})\in[0,1]^{2} is equal to the sum of the xi\displaystyle x^{i}’s times the area of Wi\displaystyle W_{i} intersected with [0,r1]×[0,r2]\displaystyle[0,r_{1}]\times[0,r_{2}].

Next, we consider the special case, where all individuals are endowed with the same quantity of resources.

Example 2 (Identical allocations).

Let X\displaystyle X be the constant allocation X=(1,1,,1)\displaystyle X=(1,1,...,1). Then QX(u)=(1,1,,1)\displaystyle Q_{X}(u)=(1,1,...,1) for all u[0,1]d\displaystyle u\in[0,1]^{d}, so that X(u)\displaystyle\mathcal{L}_{X}(u) is a d\displaystyle d-vector with identical entries u1u2ud\displaystyle u_{1}u_{2}\cdots u_{d}. The image of X\displaystyle\mathcal{L}_{X} is the diagonal in [0,1]d\displaystyle[0,1]^{d}. The Inverse Lorenz Function lX(z)\displaystyle l_{X}(z) of X\displaystyle X is 0\displaystyle 0 when z1z2zd=0\displaystyle z_{1}z_{2}\cdots z_{d}=0. For d1\displaystyle d\geq 1 and (z1,z2,zd)(0,1]d\displaystyle(z_{1},z_{2},...z_{d})\in(0,1]^{d}, and letting z¯:=min{z1,z2,,zd}\displaystyle\underline{z}:=\min\{z_{1},z_{2},...,z_{d}\}, the Inverse Lorenz Function lX(z)\displaystyle l_{X}(z) of X\displaystyle X is

lX(z)\displaystyle\displaystyle l_{X}(z) =\displaystyle\displaystyle= (U1U2Udz1,U1U2Udz2,,U1U2Udzd)\displaystyle\displaystyle\mathbb{P}(U_{1}U_{2}\cdots U_{d}\leq z_{1},U_{1}U_{2}\cdots U_{d}\leq z_{2},...,U_{1}U_{2}\cdots U_{d}\leq z_{d})
=\displaystyle\displaystyle= (U1U2Udz¯)\displaystyle\displaystyle\mathbb{P}(U_{1}U_{2}\cdots U_{d}\leq\underline{z})
=\displaystyle\displaystyle= z¯k=1d(1)k1(k1)![log(z¯)]k1.\displaystyle\displaystyle\underline{z}\sum_{k=1}^{d}\frac{(-1)^{k-1}}{(k-1)!}[\log(\underline{z})]^{k-1}.
Refer to caption
Refer to caption
Figure 1. The left panel is a visualization of the quantile map. On each element of the blue partition of [0,1]2\displaystyle[0,1]^{2}, the quantile map takes the value of the sample point. The sample point is indicated by a black dot and it is connected to the corresponding region by a dashed line. Note that sample points are drawn from the standard bivariate normal, then shifted and rescaled to fit into [0,1]2\displaystyle[0,1]^{2}. The right panel shows a visualization of the computation of the Lorenz map at r:=(0.6,0.4)\displaystyle r:=(0.6,0.4). The latter is equal to the sum of areas of the intersection between the red shaded area and the elements of the blue partition weighted by the corresponding sample realization.

We also check that our definition is compatible with scalar definitions when all resources are independently distributed.

Example 3 (Independent Resources).

Let the components X1,,Xd\displaystyle X_{1},...,X_{d} of X\displaystyle X be independent with marginal Lorenz curves L1,,Ld\displaystyle L_{1},...,L_{d}, respectively. Then, the i\displaystyle ith component of the Lorenz map is Li(ri)Πj=1,jidrj\displaystyle L_{i}(r_{i})\Pi_{j=1,j\neq i}^{d}r_{j}. This expression of the Lorenz map has the following interpretation. Consider the first component r2rdL1(r1)\displaystyle r_{2}\cdots r_{d}L_{1}(r_{1}). The share of resource 1\displaystyle 1 held by people with multivariate rank in [0,r1]×[0,1]d1\displaystyle[0,r_{1}]\times[0,1]^{d-1} is the marginal share, equal to the marginal Lorenz curve. Since the resources are independent, this share is uniformly distributed along the other dimensions, so that people with ranks in [0,r1]×[0,r2]××[0,rd]\displaystyle[0,r_{1}]\times[0,r_{2}]\times\cdots\times[0,r_{d}] command a share r2rdL1(r1)\displaystyle r_{2}\cdots r_{d}L_{1}(r_{1}). The other components are interpreted analogously. When d=2\displaystyle d=2 and  r1=1\displaystyle r_{1}=1, the Lorenz map takes values X(1,r2)=(r2,L2(r2))\displaystyle\mathcal{L}_{X}(1,r_{2})=(r_{2},L_{2}(r_{2})). That is, the image of {(1,r2):0r21}\displaystyle\{(1,r_{2}):0\leq r_{2}\leq 1\} under X\displaystyle\mathcal{L}_{X} is the marginal Lorenz curve L2\displaystyle L_{2} of the second resource X2\displaystyle X_{2} (and symmetrically when r2=1\displaystyle r_{2}=1).

The Inverse Lorenz Function lX(z)\displaystyle l_{X}(z) of allocation X\displaystyle X with independent components is

lX(z)=(U2UdL1(U1)z1,U1U3UdL2(U2)z2,.,U1Ud1Ld(Ud)zd)\displaystyle\displaystyle l_{X}(z)=\mathbb{P}(\,U_{2}\cdots U_{d}L_{1}(U_{1})\leq z_{1},U_{1}U_{3}\cdots U_{d}L_{2}(U_{2})\leq z_{2},....,U_{1}\cdots U_{d-1}L_{d}(U_{d})\leq z_{d})
=[0,1]d1min{z1L1(u1)Πk1,duk,z2L2(u2)Πk2,duk,,ld(zdΠkduk)}𝑑u1𝑑ud1,\displaystyle\displaystyle=\int_{[0,1]^{d-1}}\min\left\{\frac{z_{1}}{L_{1}(u_{1})\Pi_{k\neq 1,d}u_{k}},\frac{z_{2}}{L_{2}(u_{2})\Pi_{k\neq 2,d}u_{k}},...,l_{d}\left(\frac{z_{d}}{\Pi_{k\neq d}u_{k}}\right)\right\}du_{1}...du_{d-1},

where ld\displaystyle l_{d} is the univariate inverse Lorenz curve of Xd\displaystyle X_{d}.

Next, we derive the Lorenz map in the case of allocations X=(X1,X2)\displaystyle X=(X_{1},X_{2}) with the same components, i.e., X1=X2\displaystyle X_{1}=X_{2} almost surely. We stick to d=2\displaystyle d=2 for notational simplicity.

Example 4 (Comonotonic Resources).

Consider bivariate comonotonic allocations. Let the components X1\displaystyle X_{1} and X2\displaystyle X_{2} of the allocation X\displaystyle X be almost surely equal. Then, X1\displaystyle X_{1} and X2\displaystyle X_{2} have identical distributions. Since the distribution of X=(X1,X2)\displaystyle X=(X_{1},X_{2}) concentrates on the line x1=x2\displaystyle x_{1}=x_{2}, the vector quantile depends on u\displaystyle u only through u1+u2\displaystyle u_{1}+u_{2} (it is an index cost in the terminology of Chiappori et al. (2017)). More precisely, it is (u1,u2)(ψ(u1+u2),ψ(u1+u2))\displaystyle(u_{1},u_{2})\mapsto(\psi^{\prime}(u_{1}+u_{2}),\psi^{\prime}(u_{1}+u_{2})), where zψ(z)\displaystyle z\mapsto\psi^{\prime}(z) is the optimal transport map from σ\displaystyle\sigma to the distribution of X1\displaystyle X_{1}, where σ\displaystyle\sigma has density on [0,2]\displaystyle[0,2] given by 1|1z|\displaystyle 1-|1-z|. Each component of the Lorenz curve is then given by

1(r1,r2)=2(r1,r2)\displaystyle\displaystyle\mathcal{L}_{1}(r_{1},r_{2})=\mathcal{L}_{2}(r_{1},r_{2}) =\displaystyle\displaystyle= 0r20r1ψ(u1+u2)𝑑u1𝑑u2=0r2[ψ(u2+r1)ψ(u2)]𝑑u2.\displaystyle\displaystyle\int_{0}^{r_{2}}\int_{0}^{r_{1}}\psi^{\prime}(u_{1}+u_{2})du_{1}du_{2}=\int_{0}^{r_{2}}[\psi(u_{2}+r_{1})-\psi(u_{2})]du_{2}.

In case X1\displaystyle X_{1} and X2\displaystyle X_{2} are uniformly distributed on [0,2]\displaystyle[0,2], the optimal transport map ψ\displaystyle\psi^{\prime} for z<1\displaystyle z<1 is given by ψ(z)=z2\displaystyle\psi^{\prime}(z)=z^{2}, so that ψ(z)=z3/3\displaystyle\psi(z)=z^{3}/3. We then have,

1(r1,r2)=r13r23+r1r233+r12r222,\mathcal{L}_{1}(r_{1},r_{2})=\frac{r_{1}^{3}r_{2}}{3}+\frac{r_{1}r_{2}^{3}}{3}+\frac{r_{1}^{2}r_{2}^{2}}{2},

when r1+r21\displaystyle r_{1}+r_{2}\leq 1, and

1(r1,r2)=23(r1+r2)3112(r1+r2)4(r1+r2)2r1412r2412+23(r1+r2)16,\displaystyle\displaystyle\mathcal{L}_{1}(r_{1},r_{2})=\,\frac{2}{3}(r_{1}+r_{2})^{3}-\frac{1}{12}(r_{1}+r_{2})^{4}-(r_{1}+r_{2})^{2}-\frac{r_{1}^{4}}{12}-\frac{r_{2}^{4}}{12}+\frac{2}{3}(r_{1}+r_{2})-\frac{1}{6},

when r1+r2>1\displaystyle r_{1}+r_{2}>1. The image of this Lorenz map is once again the diagonal in [0,1]2\displaystyle[0,1]^{2}.

Letting RU[0,1]2\displaystyle R~{}U[0,1]^{2}, the Inverse Lorenz Function lX(z)\displaystyle l_{X}(z) of allocation X=(X1,X2)\displaystyle X=(X_{1},X_{2}) with X2=X1\displaystyle X_{2}=X_{1} almost surely, is

lX(z)\displaystyle\displaystyle l_{X}\left(z\right) =\displaystyle\displaystyle= (1(R)z1,2(R)z2)\displaystyle\displaystyle\mathbb{P}\left(\mathcal{L}_{1}\left(R\right)\leq z_{1},\mathcal{L}_{2}\left(R\right)\leq z_{2}\right)
=\displaystyle\displaystyle= (1(R)min{z1,z2})\displaystyle\displaystyle\mathbb{P}\left(\mathcal{L}_{1}\left(R\right)\leq\min\left\{z_{1},z_{2}\right\}\right)
=\displaystyle\displaystyle= h(min{z1,z2}),\displaystyle\displaystyle h\left(\min\left\{z_{1},z_{2}\right\}\right),

where j(R)\displaystyle\mathcal{L}_{j}(R) is the j\displaystyle j-th component, j=1,2\displaystyle j=1,2, of X(R)\displaystyle\mathcal{L}_{X}(R), and h\displaystyle h is the distribution function of 1(R)\displaystyle\mathcal{L}_{1}\left(R\right).

1.3. Relative scale and normalization

Let X\displaystyle X be the original allocation. Normalizing X\displaystyle X into X~\displaystyle\tilde{X} as in definition 2 by dividing each component by its mean, removes any sensitivity to (changes in) units of measurements. It is a standard approach to achieve ratio-scale invariance. See for instance Banerjee (2010, 2016). However, by construction, it comes with the disadvantage of removing scale effects. In the univariate case, Shorrocks (1983) proposes to eschew normalization in order to take scale effects into account in the measurement of inequality.

When units of measurement are not a concern, an alternative definition without the feature described above is defined as

0r10rdQX(u1,,ud)𝑑u1𝑑ud,\displaystyle\displaystyle\int_{0}^{r_{1}}\!\!\!\cdots\!\int_{0}^{r_{d}}Q_{X}(u_{1},\ldots,u_{d})du_{1}\ldots du_{d},

where QX\displaystyle Q_{X} is the vector quantile of the original allocation X\displaystyle X (not the normalized one). This alternative version also allows the weighting of different resources according to a priori importance to overall inequality. Call Xϵ=(λ1(ϵ)X1,,λd(ϵ)Xd)\displaystyle X^{\epsilon}=(\lambda_{1}(\epsilon)X_{1},\ldots,\lambda_{d}(\epsilon)X_{d}) the suitably rescaled version of the initial allocation X\displaystyle X. Define the sequence of weights (λ1(ϵ),,λd(ϵ))\displaystyle(\lambda_{1}(\epsilon),\ldots,\lambda_{d}(\epsilon)) in such a way that λk+1(ϵ)/λk(ϵ)0\displaystyle\lambda_{k+1}(\epsilon)/\lambda_{k}(\epsilon)\rightarrow 0 as ϵ0\displaystyle\epsilon\rightarrow 0. In this way, resources are ordered in decreasing importance to inequality, and we can entertain the extreme lexicographic case, where ϵ0\displaystyle\epsilon\rightarrow 0.

It follows from Carlier et al. (2010) that, when X\displaystyle X has an absolutely continuous distribution, as ϵ\displaystyle\epsilon tends to 0\displaystyle 0, the alternative Lorenz map tends to the map

0r10rdQXKR(u1,,ud)𝑑u1𝑑ud,\displaystyle\displaystyle\int_{0}^{r_{1}}\!\!\!\cdots\!\int_{0}^{r_{d}}Q^{KR}_{X}(u_{1},\ldots,u_{d})du_{1}\ldots du_{d},

where QXKR\displaystyle Q_{X}^{KR} is the inverse of the Knothe-Rosenblatt transform TKR:d[0,1]d\displaystyle T^{KR}:\mathbb{R}^{d}\mapsto[0,1]^{d} of the original allocation X\displaystyle X proposed by Rosenblatt (1952) and Knothe (1957), and defined as follows

TKR(x1,x2,,xd)\displaystyle\displaystyle T^{KR}(x_{1},x_{2},\ldots,x_{d}) :=\displaystyle\displaystyle:= [FX1(x1)FX2|X1(x2|x1)FXd|X1,,Xd1(xd|x1,,xd1)].\displaystyle\displaystyle\left[\begin{array}[]{c}F_{X_{1}}(x_{1})\\ F_{X_{2}|X_{1}}(x_{2}|x_{1})\\ \vdots\\ F_{X_{d}|X_{1},\ldots,X_{d-1}}(x_{d}|x_{1},\ldots,x_{d-1})\end{array}\right].

The Knothe-Rosenblatt quantile map is the only multivariate quantile map from the uniform on [0,1]d\displaystyle[0,1]^{d} to d\displaystyle\mathbb{R}^{d} proposed in the literature other than optimal transport based vector quantiles as in definition 1. The result above shows that the Knothe-Rosenblatt quantile is not a good alternative to vector quantiles of definition 1 to base an integrated quantile definition for the Lorenz map, since it relies on an a priori lexicographic ordering of the different resources in the allocation.

1.4. Properties and comparisons with other multivariate Lorenz concepts

In this section, we detail previous proposals for multivariate extensions of the Lorenz curve and list the properties that distinguish our proposal from the former.

1.4.1. Alternative multivariate Lorenz proposals

Until now, the development of multivariate extensions of the Lorenz curve was hampered by the lack of simple multivariate analogues of ranks and quantiles. Early proposals for bivariate extensions of the Lorenz curve in Taguchi (1972a,1972b) and Arnold (1983,2012) are based on a direct ad-hoc extension of the traditional formula given in (1.1). Let (x1,x2)F(x1,x2)\displaystyle(x_{1},x_{2})\mapsto F(x_{1},x_{2}) be the CDF of a bivariate allocation with density f\displaystyle f and mean (μ1,μ2)\displaystyle(\mu_{1},\mu_{2}). Taguchi (1972a,1972b) proposes the bivariate Lorenz surface L:[0,1]2[0,1]\displaystyle L:[0,1]^{2}\rightarrow[0,1] defined implicitly by (s,t,L(s,t)):=\displaystyle(s,t,L(s,t)):=

(F(x1,x2),1μ10x10x2u1f(u)𝑑u,1μ20x10x2u2f(u)𝑑u).\displaystyle\displaystyle\left(\,F(x_{1},x_{2})\,,\,\frac{1}{\mu_{1}}\int_{0}^{x_{1}}\!\!\!\int_{0}^{x_{2}}u_{1}f(u)du\,,\,\frac{1}{\mu_{2}}\int_{0}^{x_{1}}\!\!\!\int_{0}^{x_{2}}u_{2}f(u)du\,\right). (1.15)

In order to treat both dimensions of the allocation symmetrically, Arnold (1983,2012) proposes the alternative Lorenz surface parameterized by (x1,x2)\displaystyle(x_{1},x_{2}) as the set of points

(F1(x1),F2(x2),1μ120x10x2u1u2f(u)𝑑u),\displaystyle\displaystyle\left(\,F_{1}(x_{1})\,,\,F_{2}(x_{2})\,,\,\frac{1}{\mu_{12}}\int_{0}^{x_{1}}\!\!\!\int_{0}^{x_{2}}u_{1}u_{2}f(u)du\,\right), (1.16)

where F1\displaystyle F_{1} and F2\displaystyle F_{2} are the marginal CDFs associated with F\displaystyle F, and μ12\displaystyle\mu_{12} is the expectation of the product X1X2\displaystyle X_{1}X_{2}. A closed form solution, given in Sarabia and Jorda (2014), makes the Lorenz surface (1.16) amenable to parameterization and statistical analysis. However, it does not share the interpretation or any of the properties of the univariate Lorenz curve.

A more successful proposal in that respect, is the Lorenz zonoid of Koshevoy and Mosler (1996). Again, take (1.1) in the univariate case as the point of departure. It associates a fraction p\displaystyle p of the population to the share of the resource collectively held by the poorest fraction p\displaystyle p of the population. Koshevoy and Mosler (1996) eschew the need to order the population by associating with a fraction p\displaystyle p of the population the share of resources held by any group of individuals making up a fraction p\displaystyle p of the population, poor, rich, or mixed. The lower bound is the share held by the poorest individuals (the traditional Lorenz curve), and the upper bound is the share held by the richest individuals (a reverse Lorenz curve). The Lorenz zonoid is defined in Koshevoy and Mosler (1996) as the collection of all such shares for each fraction of the population. It is a convex region in [0,1]2\displaystyle[0,1]^{2} bounded below by the Lorenz curve and above by the reverse Lorenz curve. More precisely, the Lorenz zonoid is defined as the set of points

L(Y)\displaystyle\displaystyle L(Y) :=\displaystyle\displaystyle:= {(0ϕ(v)𝑑FY(v),1μY0vϕ(v)𝑑FY(v)):ϕΦ},\displaystyle\displaystyle\left\{\left(\,\int_{0}^{\infty}\phi(v)dF_{Y}(v)\,,\,\frac{1}{\mu_{Y}}\int_{0}^{\infty}v\phi(v)dF_{Y}(v)\,\right)\,:\;\phi\in\Phi\right\},

where the function ϕ\displaystyle\phi ranges over the set Φ\displaystyle\Phi of measurable functions from +\displaystyle\mathbb{R}_{+} to [0,1]\displaystyle[0,1]. The lower (resp. upper) bound is obtained with the collection of functions ϕ(v):=𝟙{vy}\displaystyle\phi(v):=\mathds{1}\{v\leq y\} (resp. 𝟙{v>y}\displaystyle\mathds{1}\{v>y\}), y+\displaystyle y\in\mathbb{R}_{+}.

Since the definition of the Lorenz zonoid does not rely on ranks or quantiles, the extension to higher dimensions is straightforward. Let Φ\displaystyle\Phi now be the set of measurable functions from +d\displaystyle\mathbb{R}_{+}^{d} to [0,1]\displaystyle[0,1] and X=(X1,,Xd)\displaystyle X=(X_{1},\ldots,X_{d}) be a multivariate allocation with CDF F\displaystyle F and mean (μ1,,μd)\displaystyle(\mu_{1},\ldots,\mu_{d}). The Lorenz zonoid of Koshevoy and Mosler (1996) is defined as the set of points L(X):=\displaystyle L(X):=

{(0ϕ(u)𝑑F(u),1μ10u1ϕ(u)𝑑F(u),,1μd0udϕ(u)𝑑F(u)):ϕΦ}.\displaystyle\displaystyle\left\{\left(\,\int_{0}^{\infty}\phi(u)dF(u)\,,\,\frac{1}{\mu_{1}}\int_{0}^{\infty}u_{1}\phi(u)dF(u)\,,\,\cdots\,,\,\frac{1}{\mu_{d}}\int_{0}^{\infty}u_{d}\phi(u)dF(u)\,\right)\,:\;\phi\in\Phi\right\}.

The Lorenz zonoid is an American football-shaped region in [0,1]d+1\displaystyle[0,1]^{d+1} with poles at points (0,,0)\displaystyle(0,\ldots,0) and (1,,1)\displaystyle(1,\ldots,1). The Lorenz surface of Taguchi (1972a,1972b) is a subset of the Lorenz zonoid, obtained when Φ\displaystyle\Phi is restricted to the set of functions ϕx():=𝟙{x}\displaystyle\phi_{x}(\cdot):=\mathds{1}\{\cdot\leq x\}, all x2\displaystyle x\in\mathbb{R}^{2}. A function ϕΦ\displaystyle\phi\in\Phi defines a point in the zonoid. The interpretation is simple in the case of indicator functions. The latter pick out specific groups of individuals in the population and the corresponding point in the zonoid has first coordinate equal to the fraction of the population involved. The other coordinates are the shares of each of the resources held by this group of individuals.

Definition 4.1 of Banerjee (2016) proposes a multivariate inequality ordering in the finite population case. The analogue multivariate Lorenz map in the general case of a (possibly mixed discrete continuous) vector allocation X\displaystyle X with normalized version X~\displaystyle\tilde{X} can be defined555We thank Xiaoxia Shi for bringing this to our attention. for all r=(r1,,rd)[0,1]d\displaystyle r=(r_{1},\ldots,r_{d})\in[0,1]^{d} by

LB(r)\displaystyle\displaystyle L^{B}(r) :=\displaystyle\displaystyle:= (0rjQB,j(u)𝑑u)j=1d,\displaystyle\displaystyle\left(\,\int_{0}^{r_{j}}Q^{B,j}(u)\,du\,\right)_{j=1}^{d},

where, for each j=1,,d\displaystyle j=1,\ldots,d, and QB,j\displaystyle Q^{B,j} is the quantile function associated with the random variable Σk=1d(X~j+X~k)/2d\displaystyle\Sigma_{k=1}^{d}(\tilde{X}_{j}+\tilde{X}_{k})/2d. If the latter were replaced by X~j\displaystyle\tilde{X}_{j}, the Lorenz map would be the vector of marginal Lorenz curves. Mixing with the average across allocation introduces sensitivity to dependence between the marginal allocations. Note however, that QB:=(QB,j)j=1d\displaystyle Q^{B}:=(Q^{B,j})_{j=1}^{d} is not a valid vector quantile for X~\displaystyle\tilde{X}, since QB(U)\displaystyle Q^{B}(U) is not distributed like X~\displaystyle\tilde{X} when U\displaystyle U is uniform on [0,1]d\displaystyle[0,1]^{d}. As a result, LB(r)\displaystyle L^{B}(r) is not a vector of resource shares, as is the case for the univariate Lorenz curve.

1.4.2. Properties of the Lorenz map and Inverse Lorenz Function

The proposed multivariate extensions of the Lorenz curve in both Taguchi (1972a,1972b) and Koshevoy and Mosler (1996) relate population proportions to a vector of resource shares. Our proposal differs substantially from these in that it directly relates a specific subset of the population, namely individuals with multivariate rank below r\displaystyle r to their share of both resources. Beyond this major conceptual difference, we now investigate properties of our multivariate extension of the Lorenz curve that make it a valuable contribution.

  1. (1)

    Interpretation. Unlike other multivariate proposals, the Lorenz map shares the interpretation of the traditional Lorenz curve as the cumulative share of resources held by the lowest ranked individuals.

  2. (2)

    Computation. As shown in section 1.2, the Lorenz map can be efficiently computed via convex programming. As an integrated vector quantile, it relies on the growing literature on computational geometry and computational optimal transport, where algorithms and implementations abound and are tested in a variety of applied fields. This is in sharp contrast with the Lorenz zonoid proposed by Koshevoy and Mosler (1996) which is notoriously difficult to compute.

  3. (3)

    Statistical inference. As an integrated quantile, the Lorenz map is amenable to statistical inference. The convergence of sample analogues of vector quantiles to their theoretical counterpart was shown in Chernozhukov et al. (2017) and Figalli (2018). Vector ranks are distribution free and can be used in rank based statistical procedures that emulate scalar rank-based inference, as shown in Deb and Sen (2023), Ghosal and Sen (2022) and Shi et al. (2022). See the survey in Hallin (2022) and references within.

  4. (4)

    Uniqueness. The Lorenz map characterizes the distribution of the allocation it is associated with. This property is shared with the Lorenz zonoid of Koshevoy and Mosler (1996), but not the other alternative proposals in the literature.

    Proposition 1.

    The Lorenz map X\displaystyle\mathcal{L}_{X} characterizes the distribution of X\displaystyle X in the sense that X\displaystyle X and X~\displaystyle\tilde{X} are identically distributed if and only if X=X~\displaystyle\mathcal{L}_{X}=\mathcal{L}_{\tilde{X}}.

  5. (5)

    Lorenz curve as a CDF. The Lorenz map is a map from [0,1]d\displaystyle[0,1]^{d} to [0,1]d\displaystyle[0,1]^{d}. Hence, unlike the traditional scalar Lorenz curve, it cannot be a CDF. However, the Inverse Lorenz Function is the cumulative distribution function of a random vector on [0,1]d\displaystyle[0,1]^{d} by construction. This property is not shared by the alternative proposals in the literature.

  6. (6)

    Decomposition under independent attributes. As shown in example 3, the Lorenz map reduces to a simple function of the marginal Lorenz curves in case marginal attribute allocations are independent. This feature is shared with the multivariate Lorenz proposal in Arnold (1983,2012) but not the alternative proposals.

  7. (7)

    Dominance of egalitarian allocations. In the univariate case, the Lorenz curve of the identical allocation Y=1\displaystyle Y=1 almost surely, is LY(q)=q\displaystyle L_{Y}(q)=q, which is sometimes called the egalitarian line. The Lorenz curve of any other allocation Y0\displaystyle Y\geq 0 is below the egalitarian line, i.e., LY(q)q\displaystyle L_{Y}(q)\leq q, for all q[0,1]\displaystyle q\in[0,1]. For d>1\displaystyle d>1, the identical allocation of example 2 is a direct extension of the univariate notion of egalitarian. We show here that the Lorenz map and Inverse Lorenz Function of the identical allocation provide similar bounds in the multi-attribute case. For this, we require allocations with components that display a form of positive association defined in assumption 1.

    Assumption 1.

    The vector quantile QX~:=(Q1,,Qd)\displaystyle Q_{\tilde{X}}:=(Q_{1},\ldots,Q_{d}) of X~\displaystyle\tilde{X} is such that, for each j\displaystyle j, 𝔼[Qj(U1,,Ud)|Uk=uk,all kj]\displaystyle\mathbb{E}\left[Q_{j}(U_{1},\ldots,U_{d})\,|\,U_{k}=u_{k},\mbox{all }k\neq j\right] is monotonically increasing in each of the uk\displaystyle u_{k}, kj\displaystyle k\neq j, where the vector (U1,,Ud)\displaystyle(U_{1},\ldots,U_{d}) is uniform on [0,1]d\displaystyle[0,1]^{d}.

    This assumption imposes a type of positive dependence between the components of X\displaystyle X through their ranks. More precisely, assumption 1 imposes a form of positive regression dependence, as in Lehmann (1966), between one resource and the others’ ranks. For allocations satisfying assumption 1, we show that Lorenz map and Inverse Lorenz Function of the identical allocation serve as upper and lower bounds, respectively.

    Proposition 2.

    The Lorenz map of any allocation X\displaystyle X satisfying assumption 1 is component-wise dominated by the Lorenz map of the identical allocation in example 2. Moreover, the Inverse Lorenz Function of allocation X\displaystyle X is bounded below by the Inverse Lorenz Function of the identical allocation.

    We argue in appendix C.2 that defining egalitarianism solely by identical allocations is too restrictive in the case of multiple resources. In case d=2\displaystyle d=2, we show that a much larger class of allocations have Lorenz maps dominated by an egalitarian allocation from definition 9, which includes the identical allocation.

2. Multi-attribute inequality comparisons

We can use the vector Lorenz map X\displaystyle\mathcal{L}_{X} of an allocation X\displaystyle X introduced in section 1.1 as a tool to compare inequality of different allocations. We base an inequality dominance criterion to compare different allocations on the dominance of Lorenz maps. We develop a visualization tool for inequality dominance, and an inequality index for the cases, where the allocations are not Lorenz ordered.

2.1. Lorenz dominance

Consider two allocations X\displaystyle X and X\displaystyle X^{\prime}, with respective Lorenz maps X\displaystyle\mathcal{L}_{X} and X\displaystyle\mathcal{L}_{X^{\prime}}. If X(r)X(r)\displaystyle\mathcal{L}_{X}(r)\geq\mathcal{L}_{X^{\prime}}(r) for some vector rank r\displaystyle r, the same proportion of the population with vector ranks below r\displaystyle r commands a larger share of all resources in allocation X\displaystyle X than in allocation X\displaystyle X^{\prime}. If this is true for any vector rank r\displaystyle r in [0,1]d\displaystyle[0,1]^{d}, then, we say that allocation X\displaystyle X^{\prime} is more unequal than allocation X\displaystyle X.

Definition 4.

An allocation X\displaystyle X^{\prime} is said to be more unequal in the Lorenz order than an allocation X\displaystyle X if X(r)X(r)\displaystyle\mathcal{L}_{X}(r)\geq\mathcal{L}_{X^{\prime}}(r) for all r[0,1]d\displaystyle r\in[0,1]^{d}. We denote this XX\displaystyle X\succcurlyeq_{\mathcal{L}}X^{\prime}.666As a partial ordering based on cumulative sums of vector quantiles, the relation XX\displaystyle X\succcurlyeq_{\mathcal{L}}X^{\prime} is a multivariate extension of the concept of majorization of Hardy et al. (1934). It is different from existing multivariate notions of majorization reviewed in  Marshall et al. (2011) and Arnold and Sarabia (2018), in that it relies on a multivariate reordering of the random vector allocation.

The Lorenz partial order of definition 4 is an implementable dominance criterion: The Lorenz maps can be computed and compared. The relation XX\displaystyle X\succcurlyeq_{\mathcal{L}}X^{\prime} is equivalent to stochastic dominance of the random vector X(U)\displaystyle\mathcal{L}_{X}(U), with UU[0,1]d\displaystyle U\sim U[0,1]^{d}, over X(U)\displaystyle\mathcal{L}_{X^{\prime}}(U) (see Section 3.8 of Müller and Stoyan (2002)). Hence, dominance tests can be derived on the basis of sample analogues of the Lorenz maps to emulate the large literature on inference techniques to compare inequality of distributions of a single attribute. See Davidson and Duclos (2000) and references within.

Following the literature on the measurement of inequality, we assess the value of this implementable dominance criterion for inequality comparisons in two ways. First, we analyze the class of social evaluation functionals that are compatible with the Lorenz order, and show that they are rank-dependent social evaluation functionals, with weights decreasing in rank. Second, we identify the class of transfers that increase inequality as defined by this Lorenz criterion.

2.1.1. Rank-dependent social evaluation functionals

The first way to gain insight into the relevance of our multivariate Lorenz dominance criterion is to characterize the set of social evaluation functionals that are compatible with it. A social evaluation functional is a map S\displaystyle S from an allocation X\displaystyle X, i.e., a random vector in +d\displaystyle\mathbb{R}_{+}^{d}, to \displaystyle\mathbb{R}, which orders allocations in their social desirability. A social evaluation functional S\displaystyle S is compatible with the dominance criterion if XXS(X)S(X)\displaystyle X\succcurlyeq_{\mathcal{L}}X^{\prime}\Rightarrow S(X)\geq S(X^{\prime}). Compatibility with Lorenz dominance is a form of inequality aversion of the social evaluation functional, since more equal allocations are deemed socially more desirable.

By construction, a social evaluation functional that is compatible with the Lorenz dominance order must satisfy anonymity and ratio-scale invariance. Anonymity, also called law-invariance or symmetry in the literature, refers to the fact that S(X)=S(X)\displaystyle S(X)=S(X^{\prime}) whenever X\displaystyle X and X\displaystyle X^{\prime} are identically distributed. The identity of individuals does not matter in the social evaluation, so that a permutation of individuals in the population leaves S\displaystyle S unchanged. Ratio-scale invariance refers to the fact that S(λX)=S(X)\displaystyle S(\lambda^{\top}X)=S(X) for any positive vector λ\displaystyle\lambda. Hence, the social evaluation is not affected by a change in units of measurement.

Next, and more substantively, all social evaluation functionals that are compatible with the Lorenz dominance criterion are rank-dependent social evaluation functionals. Individuals are weighted in the social evaluation according to their rank in the distribution. To define and formalize this statement, start with the case of a single attribute. Weymark (1981) shows that social evaluations that satisfy the comonotonic independence property defined below take the form of weighted sums of quantiles.

Property CI (Comonotonic Independence). A social evaluation functional S\displaystyle S is said to satisfy comonotonic independence if, whenever X\displaystyle X, X\displaystyle X^{\prime} and Z\displaystyle Z are comonotonic allocations, and S(X)S(X)\displaystyle S(X)\geq S(X^{\prime}), then, for all μ(0,1)\displaystyle\mu\in(0,1), S(μX+(1μ)Z)S(μX+(1μ)Z)\displaystyle S(\mu X+(1-\mu)Z)\geq S(\mu X^{\prime}+(1-\mu)Z).

In the univariate case, two allocations are called comonotonic if one is a positive increasing function of the other. In other words, individuals are ranked identically in both allocations. Comonotonic independence means that the comparison of two allocations with a common component only depends on the comparison between the two variable components, as long as rankings stay unchanged. As an illustration, when assessing the effect on household income distributions of a policy that only affects women, under perfect assortative matching, one need only look at the change in the distribution of women’s income.

The same property of comonotonic independence can be entertained in the multi-attribute case, with the same interpretation. Two allocations are comonotonic if individuals are ranked identically in both allocations. Now, in case of random vectors X\displaystyle X and X\displaystyle X^{\prime}, comonotonicity is defined in the same way by the fact that X\displaystyle X and X\displaystyle X^{\prime} have the same vector ranks. Definition 5 below follows Galichon and Henry (2012) and Ekeland et al. (2012), where it is called μ\displaystyle\mu-comonotonicity777See Puccetti and Scarsini (2010) for a discussion of this and other multivariate comonotonicity concepts. of X~1,,X~J\displaystyle\tilde{X}^{1},\ldots,\tilde{X}^{J}.

Definition 5 (Vector comonotonicity).

Random vectors X1,,XJ\displaystyle X^{1},\ldots,X^{J} on d\displaystyle\mathbb{R}^{d} are said to be comonotonic if there exists a uniform random vector U\displaystyle U on [0,1]d\displaystyle[0,1]^{d} such that X~j=QX~j(U)\displaystyle\tilde{X}^{j}=Q_{\tilde{X}^{j}}(U), j=1,,J\displaystyle j=1,\ldots,J, almost surely, where QX\displaystyle Q_{X} is the vector quantile of definition 1 associated with the distribution of X\displaystyle X, and X~\displaystyle\tilde{X} is the normalized version of X\displaystyle X as in definition 2.

With this definition of comonotonicity (which coincides with the usual definition in the single attribute case), comonotonic independence of a social evaluation functional is still defined as property CI. If individuals are ranked identically in allocations X\displaystyle X and X\displaystyle X^{\prime}, and X\displaystyle X^{\prime} is socially less desirable than X\displaystyle X, then, adding to both X\displaystyle X and X\displaystyle X^{\prime} a third common allocation Z\displaystyle Z cannot reverse the ordering, if Z\displaystyle Z ranks individuals as X\displaystyle X and X\displaystyle X^{\prime} do.

As in Weymark (1981) for the single attribute case, Galichon and Henry (2012) show that comonotonic additive social evaluation functionals are rank dependent, i.e., of the form

Sϕ(X)\displaystyle\displaystyle S_{\phi}(X) :=\displaystyle\displaystyle:= [0,1]dϕ(u)QX~(u)𝑑u,\displaystyle\displaystyle\int_{[0,1]^{d}}\phi(u)^{\top}Q_{\tilde{X}}(u)du, (2.1)

for some function ϕ:[0,1]d+d\displaystyle\phi:[0,1]^{d}\rightarrow\mathbb{R}_{+}^{d}. To each vector rank u\displaystyle u, ϕ\displaystyle\phi associates the attribute-specific weights of ranked u\displaystyle u individual in the social evaluation. We show that social evaluation functionals are only compatible with the Lorenz dominance order if they satisfy comonotonic additivity, hence if they are rank-dependent social evaluation functionals. There remains to determine which functions ϕ\displaystyle\phi make social evaluation functional Sϕ\displaystyle S_{\phi} compatible with the Lorenz dominance criterion. As we discuss below, they are characterized by inequality aversion.

Inequality aversion of a rank dependent social evaluation functional implies a weighting scheme that gives more weight to lower ranked individuals. In the scalar case analyzed in Weymark (1981), an inequality averse rank dependent social evaluation functional is characterized by decreasing weights as ranks increase. We show a similar result in the multivariate case. Social evaluation functionals that are compatible with the Lorenz order of definition 4 are rank dependent social evaluation functionals with rank-specific weights of the form

ϕm(u):=([0,1]d𝟙{ur}𝑑m1(r),,[0,1]d𝟙{ur}𝑑md(r)),\displaystyle\displaystyle\phi_{m}(u):=\left(\int_{[0,1]^{d}}\mathds{1}\{u\leq r\}\,dm_{1}(r),\ldots,\int_{[0,1]^{d}}\mathds{1}\{u\leq r\}\,dm_{d}(r)\right)^{\top}, (2.2)

where mj\displaystyle m_{j} is a non negative measure on [0,1]d\displaystyle[0,1]^{d}, all jd\displaystyle j\leq d.

Proposition 3.

A social evaluation functional is compatible with the Lorenz dominance order of definition 4 if and only if it is of the form

S(X):=[0,1]dϕm(u)QX~(u)𝑑u\displaystyle\displaystyle S(X):=\int_{[0,1]^{d}}\phi_{m}(u)^{\top}Q_{\tilde{X}}(u)\,du (2.3)

A special case of weighting scheme satisfying proposition 3 is the case mj=δr\displaystyle m_{j}=\delta_{r} all j\displaystyle j, where all individuals below rank r\displaystyle r receive weight 1\displaystyle 1 and all individuals above rank r\displaystyle r receive weight 0\displaystyle 0. More generally, individuals can be given different weights for different resource dimensions, but as non negative mixtures of the indicators 𝟙{ur}\displaystyle\mathds{1}\{u\leq r\}, the weights are always decreasing in ranks.

2.1.2. Increasing marginal inequality and increasing correlation

The second way we evaluate our Lorenz dominance criterion is by identifying transfers of resources between individuals that increase inequality according to this criterion. Since inequality is a cardinal aspect of the distribution, we consider a class of transfers that preserves the multivariate ranks. The transfers we consider are functions T:[0,1]dd\displaystyle T:[0,1]^{d}\rightarrow\mathbb{R}^{d}. If the j\displaystyle j-th component of transfer T\displaystyle T is positive (resp. negative), it is added to (subtracted from) the endowment in resource j\displaystyle j of individual with rank u\displaystyle u.

Definition 6.

A rank preserving transfer from allocation X\displaystyle X to allocation X\displaystyle X^{\prime} is a transfer such that pre-transfer and post-transfer allocations are comonotonic (individuals preserve the same rank). Equivalently, it is a function T:[0,1]dd\displaystyle T:[0,1]^{d}\rightarrow\mathbb{R}^{d} such that QX~(u)=QX~(u)+T(u)\displaystyle Q_{\tilde{X}^{\prime}}(u)=Q_{\tilde{X}}(u)+T(u) for all u[0,1]d\displaystyle u\in[0,1]^{d}, where QX\displaystyle Q_{X} is the vector quantile of definition 1 associated with the distribution of X\displaystyle X, and X~\displaystyle\tilde{X} is the component-wise demeaned version of X\displaystyle X.

First we show that the transfers that increase inequality according to the Lorenz criterion are the arbitrary combinations of rank preserving transfers of a non negative quantity of one of the resources from an individual with rank u1\displaystyle u_{1} to an individual with rank u2u1\displaystyle u_{2}\geq u_{1}.

Proposition 4.

An allocation X\displaystyle X^{\prime} is more unequal than X\displaystyle X, i.e., XX\displaystyle X^{\prime}\preccurlyeq_{\mathcal{L}}X, if and only if an allocation with the same distribution as X\displaystyle X^{\prime} can be obtained from X\displaystyle X via an arbitrary sequence of rank preserving transfers T\displaystyle T such that for all r[0,1]d\displaystyle r\in[0,1]^{d},

[0,1]dT(u)𝟙{ur}𝑑u0.\displaystyle\displaystyle\int_{[0,1]^{d}}T(u)\mathds{1}\{u\leq r\}\,du\leq 0. (2.4)

The inequality in (2.4) expresses the fact that mass is transferred from lower ranked to higher ranked individuals.

A desirable feature of the Lorenz inequality ordering of definition 4 is its ability to rank two allocations X\displaystyle X and X\displaystyle X^{\prime}, when the latter is obtained from the former through a transfer that increases inequality of the marginals or that increases the degree of positive dependence between the marginals. We formalize this feature with a specific type of multivariate transfer we call Monotone Regressive Transfers. We specialize the discussion to bivariate allocations to avoid wading into concepts of increasing multivariate dependence when d>2\displaystyle d>2.

Definition 7 (Monotone Regressive Transfer, MRT).

A transfer T:[0,1]22\displaystyle T:[0,1]^{2}\rightarrow\mathbb{R}^{2} is a monotone regressive transfer if T\displaystyle T is rank preserving and has non-negative Jacobian (i.e., the Jacobian’s entries are all non negative).

In the univariate case, a monotone regressive transfer reduces to a monotone mean preserving spread (Quiggin (1992)), also called Bickel-Lehmann increase in dispersion (Bickel and Lehmann (1976)). In the multivariate case888A related extension in the theory of multivariate risks was proposed in Charpentier et al. (2016)., monotone regressive transfers weakly increase both marginal inequality and positive dependence. The former happens because each component of the transfer has non negative own derivative, hence is increasing in each component of the rank. The latter happens because the transfer has non negative cross derivative, hence increases the degree of positive dependence between the two resources.

Proposition 5 (Monotonicity in MRT).

If an allocation X\displaystyle X^{\prime} is obtained from an allocation X\displaystyle X through a monotone regressive transfer, then XX\displaystyle X\succcurlyeq_{\mathcal{L}}X^{\prime}, i.e., X\displaystyle X^{\prime} is more unequal than X\displaystyle X as defined by the Lorenz dominance partial order of definition 4.

Proposition 5 shows that the multivariate Lorenz dominance order of definition 4 therefore ranks an allocation as more unequal if the marginal resource allocations are weakly more unequal and if the marginal resource allocations are weakly more positively dependent. This is in contrast with the Lorenz dominance order based on inclusion of Lorenz zonoids proposed in Koshevoy and Mosler (1996). Indeed, by Proposition 8 in Koshevoy and Mosler (2007), if two allocations have identical marginals, and X\displaystyle X dominates X\displaystyle X^{\prime} in the Lorenz dominance order based on Lorenz zonoid inclusion, then X\displaystyle X and X\displaystyle X^{\prime} are identically distributed.

The Lorenz dominance ordering of definition 4 does not satisfy the uniform majorization principle proposed by Kolm (1977). In case of discrete populations, the uniform majorization principle stipulates that inequality should be reduced through multiplication by a doubly stochastic matrix different from a permutation. However, as Dardanoni (1993) points out, such transformations can increase correlation and therefore increase inequality in an egregious way. See the discussion in Farina and Savaglio (2006). We show that a similar issue arises with the continuous version of the uniform majorization principle. The latter requires an inequality dominance order to be monotonic with respect to the concave order. From theorem 4 of Galichon and Henry (2012), we deduce that a social evaluation functional that satisfies uniform majorization and comonotonic independence must be equal to

SUM(X):=1[0,1]duQX~(u)𝑑u\displaystyle\displaystyle S_{UM}(X):=1-\int_{[0,1]^{d}}u^{\top}Q_{\tilde{X}}(u)du (2.5)

up to an affine transformation. We show in appendix C.3 that in the case of bivariate allocation X=(X1,X2)\displaystyle X=(X_{1},X_{2}), SUM\displaystyle S_{UM} is minimized when the two components X1\displaystyle X_{1} and X2\displaystyle X_{2} of allocation X\displaystyle X are independent, which runs against the intuition that increased dependence can increase inequality999Note that we are considering inequality over outcomes, not welfare inequality. Hence, the point made in Atkinson and Bourguignon (1982), that increased correlation may decrease utilitarian welfare inequality when resources are complements, doesn’t apply here..

2.2. Visualization of Lorenz dominance

Failures of Lorenz dominance can be visualized with the Inverse Lorenz Function. Consider two allocations X\displaystyle X and X\displaystyle X^{\prime}, with respective Inverse Lorenz Functions lX\displaystyle l_{X} and lX\displaystyle l_{X^{\prime}}. If lX(z)lX(z)\displaystyle l_{X}(z)\leq l_{X^{\prime}}(z) for some vector of shares z\displaystyle z, a larger proportion of the population commands the same share of resources in allocation X\displaystyle X^{\prime} than in allocation X\displaystyle X. Lorenz dominance of allocation X\displaystyle X over allocation X\displaystyle X^{\prime} (in the sense of definition 4) implies that the relation  lX(z)lX(z)\displaystyle l_{X}(z)\leq l_{X^{\prime}}(z) holds for each resource share vector z\displaystyle z (see proposition 9 in the appendix).

In case of bivariate allocations, the latter can be easily visualized on [0,1]2\displaystyle[0,1]^{2} through the relative positions of the level sets of the Inverse Lorenz Function, which we call α\displaystyle\alpha-Lorenz curves, denoted

𝒞Xα:={z[0,1]2:lX(z)=α}, for each α(0,1).\displaystyle\displaystyle\mathcal{C}_{X}^{\alpha}:=\{z\in[0,1]^{2}:l_{X}(z)=\alpha\},\mbox{ for each }\alpha\in(0,1).

The α\displaystyle\alpha-Lorenz curves provide a visualization of Lorenz dominance. We can compare the inequality of different allocations based on the shape and relative positions of their respective α\displaystyle\alpha-Lorenz curves. Suppose X\displaystyle X is less unequal in the Lorenz dominance order than X\displaystyle X^{\prime}. Then, by proposition 9 in appendix E, for any z𝒞Xα\displaystyle z\in\mathcal{C}^{\alpha}_{X^{\prime}}, lX(z)lX(z)=α\displaystyle l_{X}(z)\leq l_{X^{\prime}}(z)=\alpha. So z𝒞Xα~\displaystyle z\in\mathcal{C}^{\tilde{\alpha}}_{X} with α~α\displaystyle\tilde{\alpha}\leq\alpha. This can be visualized as a shift to the north-east of the α\displaystyle\alpha-Lorenz curves of the more unequal allocation X\displaystyle X^{\prime} to the α\displaystyle\alpha-Lorenz curves of the less unequal allocation X\displaystyle X.

Figure 2 and 3 display α\displaystyle\alpha-Lorenz curves of the multivariate lognormal allocation X\displaystyle X defined by

lnX𝒩([00],[σ12ρρσ22])\displaystyle\displaystyle\ln X\sim\mathcal{N}\left(\begin{bmatrix}0\\ 0\end{bmatrix},\begin{bmatrix}\sigma_{1}^{2}&\rho\\ \rho&\sigma_{2}^{2}\end{bmatrix}\right) (2.6)

where 𝒩\displaystyle\mathcal{N} is the normal distribution, σ1,σ2>0\displaystyle\sigma_{1},\sigma_{2}>0 control the dispersion of the respective marginals and ρ\displaystyle\rho controls the degree of dependence.

In figure 2, we revisit the three special cases of identical allocations, independent attributes and comonotonic attributes. We also add the counter-monotonic case, where individuals are ranked in opposite order for the two resources, as well as intermediate dependence cases. Figure 2 shows the α\displaystyle\alpha-Lorenz curve (with α=0.9\displaystyle\alpha=0.9) of the identical allocation compared to multivariate lognormal allocations with variances σ1=σ2=1\displaystyle\sigma_{1}=\sigma_{2}=1 and correlation coefficients ρ=0.99\displaystyle\rho=-0.99 (counter-monotonic), ρ=0.6,0.3\displaystyle\rho=-0.6,-0.3, ρ=0\displaystyle\rho=0 (independent), ρ=0.3,0.6\displaystyle\rho=0.3,0.6, and ρ=0.99\displaystyle\rho=0.99 (comonotonic).

Refer to caption
Figure 2. 0.9\displaystyle 0.9-Lorenz curves of multivariate lognormal random vectors that vary by correlation between marginals defined in (2.6). The scale parameters are σ1=σ2=1\displaystyle\sigma_{1}=\sigma_{2}=1. The curves correspond to different correlation coefficient ρ\displaystyle\rho (from bottom left to top right: 0.99,0.6,0.3,0,0.3,0.6,0.99\displaystyle-0.99,0.6,-0.3,0,0.3,0.6,0.99). The nested curves show that increasing dependence between fixed marginals increases inequality.

Visually, inequality can be assessed by the departure of α\displaystyle\alpha-Lorenz curves from those of the identical allocation. This visual comparison is facilitated by the fact that they are shaped like indifference curves. In addition, correlation information is preserved through the curvature of the α\displaystyle\alpha-Lorenz curves, which decreases when positive dependence increases.

Proposition 6.

(1) The α\displaystyle\alpha-Lorenz curves 𝒞Xα\displaystyle\mathcal{C}^{\alpha}_{X} are the level curves of a bivariate cdf, hence they are downward sloping, non decreasing in α\displaystyle\alpha and they do not cross. In addition, (2) The α\displaystyle\alpha-Lorenz curves 𝒞Xα\displaystyle\mathcal{C}^{\alpha}_{X} are convex if

lXz2lX2z1z2lXz1lX2z220.\displaystyle\displaystyle\frac{\partial l_{X}}{\partial z_{2}}\frac{\partial l_{X}^{2}}{\partial z_{1}\partial z_{2}}-\frac{\partial l_{X}}{\partial z_{1}}\frac{\partial l_{X}^{2}}{\partial z_{2}^{2}}\geq 0.
Refer to caption
Figure 3. 0.9\displaystyle 0.9-Lorenz curves of multivariate lognormal random vectors that vary by correlation and scale parameters defined in (2.6). Notably the dotted-dashed lines display when σ1σ2\displaystyle\sigma_{1}\neq\sigma_{2} showing a skewness towards the axis z2\displaystyle z_{2} of the marginal with larger value. The diagonal black dotted line mark where the identical allocation lies and helps to show skewness towards an axis.

Figure 3 compares α\displaystyle\alpha-Lorenz curves of 6\displaystyle 6 different allocations that are multivariate lognormally distributed as in (2.6), for α=0.9\displaystyle\alpha=0.9. The parameters σ12\displaystyle\sigma_{1}^{2} and σ22\displaystyle\sigma_{2}^{2} take values 1\displaystyle 1 or 2\displaystyle 2, whereas  ρ\displaystyle\rho takes values 0.2\displaystyle 0.2 or 0.8\displaystyle 0.8. In case of marginals with different σ\displaystyle\sigma, the asymmetry is reflected in the  α\displaystyle\alpha-Lorenz curves. Moreover, other things equal, inequality increases with σ\displaystyle\sigma, which measures inequality in the marginals, and with ρ\displaystyle\rho, which measures correlation.

Finally, figure 4 shows an example of two multivariate lognormally distributed allocations X\displaystyle X and X\displaystyle X^{\prime} such that the marginals of X\displaystyle X are more unequal than those of X\displaystyle X^{\prime}, but X\displaystyle X^{\prime} is more unequal overall due to positive dependence of its marginals. Specifically, the marginals of X\displaystyle X are independent and have the same scale parameter σ2=1.2\displaystyle\sigma^{2}=1.2, while X\displaystyle X^{\prime} has marginals with correlation parameter ρ=0.9\displaystyle\rho=0.9 and scale σ2=1\displaystyle\sigma^{2}=1 each. This shows that the Lorenz inequality dominance ordering does not imply dominance of the marginals, and how it can instead incorporate trade-offs between marginal inequality and positive dependence.

Refer to caption
Figure 4. α\displaystyle\alpha-Lorenz curves of X\displaystyle X with independent lognormal marginals with σ2=1.2\displaystyle\sigma^{2}=1.2 in black and X\displaystyle X^{\prime} with dependent lognormal marginals with σ2=1\displaystyle\sigma^{2}=1 and correlation ρ=0.9\displaystyle\rho=0.9 in gray.

2.3. Multivariate Gini inequality index

The Lorenz dominance ordering is a partial ordering of multivariate distributions. In many cases, α\displaystyle\alpha-Lorenz curves may cross. For a complete inequality ordering, we also propose an extension of the classical Gini index to compare inequality in multi-attribute allocations. Gajdos and Weymark (2005) propose a multivariate Gini coefficient based on aggregation across individuals first, then across dimensions, which removes the effect of dependence across attributes. Decancq and Lugo (2012) propose to aggregate across dimensions first, then across individuals, in order to keep track of correlation. From the volume of the Lorenz zonoid, a multidimensional Gini coefficient can be derived naturally as in Koshevoy and Mosler (1997). An alternative strategy is followed by Arnold (1983), Koshevoy and Mosler (1997), who extend the definition based on the sum of all distances between pairs of individuals.101010Other multivariate Gini indices based on multivariate Lorenz curve proposals include Banerjee (2010), Grothe et al. (2022), and Sarabia and Jorda (2020).

The univariate Gini index can be interpreted as the average deviation from the egalitarian allocation, the univariate version of our identical allocation. We emulate this interpretation and define a multivariate Gini based on an average deviation from the Lorenz map r(r1r2rd,,r1r2rd)\displaystyle r\mapsto(r_{1}r_{2}\cdots r_{d},\ldots,r_{1}r_{2}\cdots r_{d}). The deviation measure we propose is

j=1d[0,1]d[k=1drkj(r)]𝑑r,\displaystyle\displaystyle\displaystyle\sum_{j=1}^{d}\;\int_{[0,1]^{d}}\;\left[\;\prod_{k=1}^{d}r_{k}-\mathcal{L}_{j}(r)\right]dr, (2.7)

where j\displaystyle\mathcal{L}_{j} is the j\displaystyle j-th component of the Lorenz map X\displaystyle\mathcal{L}_{X}, with j=1,,d\displaystyle j=1,\ldots,d. After normalization, (2.7) becomes

G(X)\displaystyle\displaystyle G(X) =\displaystyle\displaystyle= 12dd([0,1]dj=1dj(r)dr),\displaystyle\displaystyle 1-\frac{2^{d}}{d}\left(\;\int_{[0,1]^{d}}\;\sum_{j=1}^{d}\mathcal{L}_{j}(r)\,dr\;\right), (2.8)

which yields the following definition.

Definition 8 (Gini Index).

(2.8) defines the multivariate Gini index of allocation X\displaystyle X.

The traditional Gini index of a univariate allocation can also be characterized as a weighted sum of outcomes, where the weights are increasing linearly in the rank of the individual in the population. Hence, the negative of the Gini, seen as a social evaluation functional displays inequality aversion by giving more weight to the outcomes of lower ranked individuals than to those of higher ranked ones.

The same interpretation is valid for our multivariate Gini. Specifically,

S(X)\displaystyle\displaystyle S(X) :=\displaystyle\displaystyle:= d2d(1G(X))\displaystyle\displaystyle\frac{d}{2^{d}}\,(1-G(X)) (2.9)
=\displaystyle\displaystyle= [0,1]d([0,1]d(𝟙{ur},,𝟙{ur})QX~(u)𝑑u)𝑑r\displaystyle\displaystyle\int_{[0,1]^{d}}\left(\int_{[0,1]^{d}}\,\left(\mathds{1}\{u\leq r\},\ldots,\mathds{1}\{u\leq r\}\right)^{\top}Q_{\tilde{X}}(u)\,du\right)\,dr
=\displaystyle\displaystyle= [0,1]d(j=1d(1uj),,j=1d(1uj))QX~(u)𝑑u\displaystyle\displaystyle\int_{[0,1]^{d}}\,\left(\prod_{j=1}^{d}(1-u_{j}),\ldots,\prod_{j=1}^{d}(1-u_{j})\right)^{\top}Q_{\tilde{X}}(u)\,du

is an inequality averse social evaluation functional of the form (2.3) with uniform measures on [0,1]d\displaystyle[0,1]^{d} in (2.2). So S(X)\displaystyle S(X) evaluates the social desirability of an allocation with a weighted sum of individual endowments, where the weights Πj=1d(1uj)\displaystyle\Pi_{j=1}^{d}(1-u_{j}) are component-wise decreasing in the individual’s rank u=(u1,,ud)\displaystyle u=(u_{1},\ldots,u_{d}).

The Gini index correspondingly takes the form G(X)=1(2d/d)S(X),\displaystyle G(X)=1-(2^{d}/d)S(X), with S\displaystyle S as in (2.9). For instance, in the case of bivariate allocations, (2.8) takes the form

G(X)\displaystyle\displaystyle G(X) =\displaystyle\displaystyle= 20101(u1+u2u1u2)(Q1(u)+Q2(u))𝑑u1𝑑u2 3.\displaystyle\displaystyle 2\int_{0}^{1}\!\!\!\int_{0}^{1}(u_{1}+u_{2}-u_{1}u_{2})\left(Q_{1}(u)+Q_{2}(u)\right)du_{1}du_{2}\,-\,3. (2.10)

In expression (2.10), Q1(u)\displaystyle Q_{1}(u) and Q2(u)\displaystyle Q_{2}(u) are the components of QX~\displaystyle Q_{\tilde{X}}, so that Q1(u)+Q2(u)\displaystyle Q_{1}(u)+Q_{2}(u) is the sum of the two normalized resource allocations of the individual in the population with vector rank (u1,u2)\displaystyle(u_{1},u_{2}). Hence, the Gini index is indeed a weighted sum of outcomes, with weights (u1+u2u1u2)\displaystyle(u_{1}+u_{2}-u_{1}u_{2}) increasing with the vector ranks (u1,u2)\displaystyle(u_{1},u_{2}). It is a genuinely multivariate extension in that the weighting scheme, hence the social evaluation of inequality, depends on multivariate ranks of individuals.

  • Examples 1 continued

    The Gini coefficient for discrete distributions can be computed by plugging in the Lorenz map from section 1.2 into (2.8).

  • Examples 3 and 4 continued

    We compare Gini indices in the independent case with the perfect comonotonicity case, where X1\displaystyle X_{1} and X2\displaystyle X_{2} have the same marginal distributions (uniform on [0,2]\displaystyle[0,2]). We verify (analytically for r1+r2<1\displaystyle r_{1}+r_{2}<1 and numerically using Wolfram for r1+r21\displaystyle r_{1}+r_{2}\geq 1) that Q1(u)+Q2(u)\displaystyle Q_{1}(u)+Q_{2}(u) is smaller in the comonotonic case, than in the independent case. Hence the Gini index (and the measure of inequality) is larger in the comonotonic case.

Example 5 (Countermonotone Resources).

If we have X1+X2=2\displaystyle X_{1}+X_{2}=2 a.s., then Q1(u)+Q2(u)=2\displaystyle Q_{1}(u)+Q_{2}(u)=2 for almost all u\displaystyle u, and we obtain 1(r1,r2)+2(r1,r2)=2r1r2r2L1(r1)+r1L2(r2)\displaystyle\mathcal{L}_{1}(r_{1},r_{2})+\mathcal{L}_{2}(r_{1},r_{2})=2r_{1}r_{2}\geq r_{2}L_{1}(r_{1})+r_{1}L_{2}(r_{2}), so that, in particular, the Gini index in the countermonotone case is the same as in the case of the identical allocation, i.e., equal to 0\displaystyle 0, and both are smaller than the Gini of the allocation with independent resources. This is consistent with the fact that these allocations X\displaystyle X are considered egalitarian according to definition 9 in appendix C.2.

The Gini index of definition 8 is in [0,1]\displaystyle[0,1] under assumption 1. It equals 0\displaystyle 0 for the identical allocation. It tends to 1\displaystyle 1, when the Lorenz map tends to 0\displaystyle 0 (extreme inequality). The Gini index of an allocation with independent components reduces to the average of classical scalar Ginis of both components. Like the classical scalar Gini index, it preserves the Lorenz inequality ordering, in the sense that higher inequality according to \displaystyle\preccurlyeq_{\mathcal{L}} implies a larger value of the Gini index. In other words, XX\displaystyle X\succcurlyeq_{\mathcal{L}}X^{\prime} implies G(X)G(X)\displaystyle G(X)\leq G(X^{\prime}), so that the negative of the Gini is a compatible social evaluation functional. Hence it inherits the properties of anonymity, scale invariance and comonotonic independence.

Multivariate S-Gini

The multivariate Gini in expression (2.8) is the suitably normalized negative of an inequality averse social evaluation functional of the form (2.3) with uniform measures on [0,1]d\displaystyle[0,1]^{d} in (2.2). It can be extended to reflect varying concern for inequality in different attributes. To achieve this, a multivariate Gini coefficient can be defined as 1cS(X)\displaystyle 1-cS(X), where c\displaystyle c is a normalizing constant and S\displaystyle S is a social evaluation functional that reflects different degrees of inequality aversion in different attributes.

In the univariate case, to reflect varying degrees of inequality aversion, Donaldson and Weymark (1980) propose a single parameter family of Gini coefficients, called S-Gini, defined by

Gδ(X)\displaystyle\displaystyle G_{\delta}(X) :=\displaystyle\displaystyle:= 1δ(δ1)[0,1](1r)δ2(r)𝑑r,\displaystyle\displaystyle 1-\delta(\delta-1)\int_{[0,1]}(1-r)^{\delta-2}\mathcal{L}(r)\,dr,

where \displaystyle\mathcal{L} is the traditional Lorenz curve, and δ\displaystyle\delta ranges from 1\displaystyle 1, corresponding to indifference to inequality, to the Rawlesian extreme at the limit δ\displaystyle\delta\rightarrow\infty, where only the poorest individual matters111111We were unable to locate a precise statement of this in the literature, so we include it in proposition 10 in the appendix with a proof for completeness..

The S-Gini family of Donaldson and Weymark (1980) can be extended to the assessment of multivariate inequality within our framework. Let δ=(δ1,,δd)\displaystyle\delta=(\delta_{1},\ldots,\delta_{d}) be a d\displaystyle d-dimensional parameter, where δj[1,)\displaystyle\delta_{j}\in[1,\infty), j=1,,d,\displaystyle j=1,\ldots,d, reflects the concern for inequality in attribute j\displaystyle j. We define the family of multivariate S-Gini coefficients of inequality of an allocation X\displaystyle X with Lorenz map X=(1,,d)\displaystyle\mathcal{L}_{X}=(\mathcal{L}_{1},\ldots,\mathcal{L}_{d}) as

Gδ(X)\displaystyle\displaystyle G_{\delta}(X) :=\displaystyle\displaystyle:= 1cδSδ(X),\displaystyle\displaystyle 1-c_{\delta}S_{\delta}(X),

where cδ:=2d1/j=1dδj1\displaystyle c_{\delta}:=2^{d-1}/\sum_{j=1}^{d}\delta_{j}^{-1} is a normalizing constant, and Sδ\displaystyle S_{\delta} is the social evaluation functional

Sδ(X)\displaystyle\displaystyle S_{\delta}(X) :=\displaystyle\displaystyle:= j=1d(δj1)[0,1]d(1rj)δj2j(r)𝑑r.\displaystyle\displaystyle\sum_{j=1}^{d}(\delta_{j}-1)\int_{[0,1]^{d}}(1-r_{j})^{\delta_{j}-2}\mathcal{L}_{j}(r)\,dr.

The normalizing constant cδ\displaystyle c_{\delta} is chosen such that the multivariate S-Gini Gδ\displaystyle G_{\delta} lies in [0,1]\displaystyle[0,1] and is zero in case of the identical allocation. There remains to verify that the social evaluation functional Sδ\displaystyle S_{\delta} is indeed of the form (2.3), and hence compatible with the Lorenz order. Indeed, we have

Sδ(X)=j=1d[0,1]dj(r)𝑑mj(δ)(r),S_{\delta}(X)=\sum_{j=1}^{d}\int_{[0,1]^{d}}\mathcal{L}_{j}(r)\,dm^{(\delta)}_{j}(r),

with mj(δ)(r)=(l=1,ljdrl)[1(1rj)δj1]\displaystyle m^{(\delta)}_{j}(r)=\left(\prod_{l=1,l\neq j}^{d}r_{l}\right)[1-(1-r_{j})^{\delta_{j}-1}], for each j=1,,d\displaystyle j=1,\ldots,d. The multivariate S-Gini thereby incorporates varying degrees of inequality aversion for different attributes. We recover the S-Gini of Donaldson and Weymark (1980) when d=1\displaystyle d=1, and the multivariate Gini of section 2.3 when δj=2\displaystyle\delta_{j}=2, for all j=1,,d\displaystyle j=1,\ldots,d, as desired. We also recover Rawelsian limits as δj\displaystyle\delta_{j} tends to zero as formalized in proposition 11 in the appendix.

3. Empirical Illustration

In this section, we apply our methodology to the analysis of income-wealth inequality in the United States between 1989 and 2022, based on the public version of the triennial Survey of Consumer Finances (SCF). Wealth refers to all assets, financial and otherwise. Details of the sampling technique and a discussion of specific features and issues with the data set are given in appendix B. A guide for practical implementation of the computational procedure outlined in section 1.2 is given in appendix A. We refer to inequality displayed by our measure as overall inequality, while specific marginal inequality is described as wealth or income inequality.

3.1. Income-wealth 𝜶\displaystyle\bm{\alpha}-Lorenz curves

Figure 5 shows the α\displaystyle\alpha-Lorenz curves for α=0.6,0.8,0.95\displaystyle\alpha=0.6,0.8,0.95 for the years 1989, 2007, 2010, and 2022. There is a general worsening of overall inequality over 3 decades since the curves shift away from the north-east corner. The tight curvature also reflects the positive correlation of income and wealth as in figure 2. Using figure 3 as reference, the skew towards the wealth axis indicates inequality from the wealth marginal is dominant at these α\displaystyle\alpha-levels, as expected.

aaaaRefer to caption

Figure 5. α\displaystyle\alpha-Lorenz curves (α=0.6,0.8,0.95\displaystyle\alpha=0.6,0.8,0.95) for US Income-Wealth for years 1989, 2007, 2010, and 2022.

3.2. Resource shares

Figure 6 shows resource shares of the 25%\displaystyle 25\% fraction of the the population below rank r=(0.5,0.5)\displaystyle r=(0.5,0.5) as well as the 90%\displaystyle 90\% fraction121212The fraction of the population is exactly 0.952=0.9025\displaystyle 0.95^{2}=0.9025. of the population below rank r=(0.95,0.95)\displaystyle r=(0.95,0.95). The shares in both resources of the bottom 90%\displaystyle 90\% have been steadily declining and the shares of wealth are lower than the shares of income. Between 2007 and 2010, we see income shares increasing relatively more than the decrease in wealth shares. Wealth and income shares both fell from 2010 to 2022, explaining the shift in the curves from figure  5. As for the bottom 25%\displaystyle 25\%, changes over time are minor compared to those of the bottom 90%\displaystyle 90\%.

Refer to caption
Figure 6. The income and wealth shares of the 90%\displaystyle 90\% (resp. 25%\displaystyle 25\%) fraction of the population ranked below r=(0.95,0.95)\displaystyle r=(0.95,0.95) (resp. (0.5,0.5)\displaystyle(0.5,0.5)).

3.3. Gini indices

Figure 7 displays the marginal Gini indices for income and for wealth, the multivariate Gini index based on (2.8), as well as Kendall’s τ\displaystyle\tau for the dependence between income and wealth over time. The multivariate Gini shows a steady increase in overall inequality. If the resources were independent, the multivariate Gini would be the average of the marginal Ginis. In the present case, the multivariate Gini reveals a positive association between resources since it is higher than the average of the marginal Gini indices. The multivariate Gini shows reduced overall inequality between 2007\displaystyle 2007 and 2010\displaystyle 2010. The decreased correlation and income inequality may have been sufficient to offset the rise in wealth inequality.

Refer to caption
Figure 7. Top: Gini indices for income and for wealth, multivariate Gini index, and Kendall’s τ\displaystyle\tau (dashed) for US Income-Wealth across 1989-2022.

Inequality analysis across groups can reveal further insights. Figure 8 shows Gini indices among White and Black respondents as well as among the working age (64 years and below) and retiring age populations (65 years and above). While overall inequality has worsened among White respondents, the inequality among Black respondents has remained steady. When comparing inequality across age groups, they both exhibit a steady increase in overall inequality, however the multivariate Gini among the retiring age group inherits the variability in the income marginal.

Refer to caption
Refer to caption
Figure 8. Left: Multivariate Gini indices among White and Black respondents. A significant overall inequality gap has formed over time between the two segments. Right: Multivariate Gini indices among working age (64\displaystyle 64 years and below) and retiring age (65\displaystyle 65 years and above).

Concluding remarks

In this paper, we propose a new multivariate extension of the Lorenz curve. We propose to emulate the Gastwirth (1971) formulation of the Lorenz curve and define a Lorenz map by integrating vector quantiles of Chernozhukov et al. (2017). The value of the Lorenz map is a vector of shares of each resource held by the poorer section of the population, as in the scalar case. Dominance of Lorenz maps defines a multi-attribute inequality dominance partial ordering. This Lorenz ordering is, like its scalar counterpart, an implementable criterion to compare inequality in allocations. It is, also like its scalar counterpart, equivalent to preference by any inequality averse rank dependent social evaluation functional. We propose an Inverse Lorenz Function and its level sets as a multivariate inequality visual comparison tool, and apply it to income and wealth in the United States between 1989\displaystyle 1989 and 2022\displaystyle 2022.

Multi-attribute inequality can vary substantially across population groups, as shown in Maasoumi and Racine (2016) within the information theoretic framework of Maasoumi (1986). There is a tension between heterogeneity across covariates and the anonymity axiom, according to which inequality measurement should not depend on individual’s identities, but only on the distribution of resource allocations. As Kolm (1977) pointed out, this tension is alleviated in part by including more variables in the allocation. This reinforces the motivation for a multidimensional approach to inequality measurement. As for the other potential sources of individual heterogeneity that matters to the social planner, anonymity can be restored by measuring inequality in each subgroup. We illustrate this in figure 8. Beyond this, the conditional approach of Maasoumi and Racine (2016) could also be extended to our framework with the use of conditional vector quantiles in Carlier et al. (2016).

Finally, we argue that a formal test of multi-attribute inequality dominance can be based on our Lorenz map, in analogy to dominance testing based on the traditional Lorenz curve in Davidson and Duclos (2000) and references within. The statistical theory for such a test relies on multivariate stochastic dominance testing and the regularity of optimal transport maps, and is left for future research.

Appendix A User’s implementation guide

In this section, we will point to specific computational routines the reader may use to accomplish each step in section 1.2. All of the figures in this paper were generated via implementations in the R language, however these implementations are standard and can be found in other languages and packages. Algorithm 1 is therefore intended to guide the reader across the key steps of the implementation. We specialize to the case of d=2\displaystyle d=2 and leave d>2\displaystyle d>2 for the online supplement.

For the vector quantile, the transport package for R provides various implementations to solve (1.9) that are found in the function semidiscrete. Both the standard descent approach of Aurenhammer et al. (1998) and, our preferred, multiscale initialization and L-BGFS approach of Mérigot (2011) are supported methods. Alternatively and for all of our calculations, we used the Rgeogram package that is a wrapper of the C++ Geogram library implementation of Mérigot (2011). Both packages provide the optimal weight vector h\displaystyle h required for the next steps.

The Lorenz map requires solving for the convex cells defined in (1.10). The optimal h\displaystyle h from the vector quantile calculation can be used as an input for the power_diagram function in the transport package. It will provide as output the vertices of the convex cells. With a desired rank r=(r1,r2)\displaystyle r=(r_{1},r_{2}), the next step is to find the area of the intersection of the convex cells with the rectangle [0,r]\displaystyle[0,r]. The sf package provides these tools: first, the function st_polygon transforms the vertices of the cells and rectangle into a “polygon” object that can be then read into st_intersection and st_area, which calculate the intersection and the area, respectively. These two functions can be used to calculate λ(Wih[0,r])\displaystyle\lambda(W_{i}^{h}\cap[0,r]) in (1.12) for all i=1,,n\displaystyle i=1,\dots,n.

The Inverse Lorenz Function is calculated as an empirical distribution function. To facilitate drawing α\displaystyle\alpha-Lorenz curves, it is recommended to form a uniform grid of values and calculate the ecdf at each value, e.g., {0.01,,0.99,1}2\displaystyle\{0.01,\dots,0.99,1\}^{2}. Each pair should correspond to a row and column in a matrix of ecdf values. Then, pass this matrix as input into any function that plots contours of three-dimensional surfaces such as the base R function contour or geom_contour as part of the ggplot2 package. Finally, with a sample of Lorenz map values, one can compute the Gini index (2.8) by plug-in.

Algorithm 1 Vector Quantiles, Lorenz Maps, and Inverse Lorenz Functions
1:Input
2:    (X,w)\displaystyle(X,w)      Weighted sample with normalized points Xi\displaystyle X_{i} and weights wi\displaystyle w_{i}
3:procedure Vector-Quantile(X,w\displaystyle{X},{w})
4:    Set convergence tolerance δ\displaystyle\delta, step size η\displaystyle\eta.
5:    Set s0\displaystyle s\leftarrow 0, initialize weight vector h0\displaystyle h^{0}. \displaystyle\triangleright e.g. multiscale approach
6:    repeat\displaystyle\triangleright Begin gradient descent
7:         ss+1\displaystyle s\leftarrow s+1
8:         h(s)h(s1)η[wiλ(Wih(s1))]\displaystyle h^{(s)}\leftarrow h^{(s-1)}-\eta[w_{i}-\lambda(W_{i}^{h^{(s-1)}})] \displaystyle\triangleright Modifiable, e.g., L-BFGS
9:    until h(s)h(s1)<δ\displaystyle\left\lVert h^{(s)}-h^{(s-1)}\right\rVert<\delta
10:    h:=h(s)\displaystyle h:=h^{(s)} solution to (1.9) with (Xi,wi)\displaystyle(X_{i},w_{i})
11:    {Wih}i=1n\displaystyle\{W_{i}^{h}\}_{i=1}^{n}\leftarrow (1.11) \displaystyle\triangleright e.g., computing power diagram using h\displaystyle h
12:    return {Wih}\displaystyle\{W_{i}^{h}\} defined by their vertices
13:procedure Lorenz-Map(r,X,{Wih}i=1n\displaystyle r,{X},\{W_{i}^{h}\}_{i=1}^{n})
14:    Input
15:         r\displaystyle rVector of ranks of interest from [0,1]2\displaystyle[0,1]^{2}
16:         Wih\displaystyle W_{i}^{h}Vertices of cells that define the vector quantile of (X,w)\displaystyle(X,w)     
17:    Output
18:         X\displaystyle\mathcal{L}_{X}Lorenz map evaluated at r\displaystyle r, a vector     
19:    X(r)0\displaystyle\mathcal{L}_{X}(r)\leftarrow 0
20:    for i=1\displaystyle i=1 to n\displaystyle n do
21:         Find vertices of Ai:=Wih[0,r1]×[0,r2]\displaystyle A_{i}:=W_{i}^{h}\cap[0,r_{1}]\times[0,r_{2}] \displaystyle\triangleright Ai\displaystyle A_{i} is convex
22:         λi:=λ(Ai)\displaystyle\lambda_{i}:=\lambda(A_{i}): ordinary area of Ai\displaystyle A_{i} \displaystyle\triangleright Many equate to 0 or wi\displaystyle w_{i}
23:         X(r)X(r)+Xiλi\displaystyle\mathcal{L}_{X}(r)\leftarrow\mathcal{L}_{X}(r)+X_{i}\lambda_{i}     
24:    return X(r)\displaystyle\mathcal{L}_{X}(r)
25:procedure Inverse-Lorenz-Function(X,{Wih}i=1n,m\displaystyle{X},\{W_{i}^{h}\}_{i=1}^{n},m)
26:    Input
27:         Wih\displaystyle W_{i}^{h}Vertices of cells that define the vector quantile of (X,w)\displaystyle(X,w)
28:         m\displaystyle mSize of pseudo sample from U([0,1]2)\displaystyle U([0,1]^{2})     
29:    Output
30:         lX\displaystyle l_{X}Matrix of cumulative probabilities: rows and columns are coordinates     
31:    Generate Z:=\displaystyle Z:= evenly-spaced lattice in [0,1]2\displaystyle[0,1]^{2} \displaystyle\triangleright e.g., Z={0.01,,0.99,1}2\displaystyle Z=\{0.01,\dots,0.99,1\}^{2}
32:    for j=1\displaystyle j=1 to m\displaystyle m do
33:         Draw a single RjU[0,1]2\displaystyle R_{j}\sim U[0,1]^{2}
34:         j:=\displaystyle\mathcal{L}_{j}:= Lorenz-Map(Rj,X,{Wih}i=1n\displaystyle R_{j},X,\{W_{i}^{h}\}_{i=1}^{n})     
35:    for zijZ\displaystyle z_{ij}\in Z do
36:         (lX)ij:=m1j=1m𝟙{jzij}\displaystyle{\textstyle(l_{X})_{ij}:=m^{-1}\sum_{j=1}^{m}\mathds{1}\{\mathcal{L}_{j}\leq z_{ij}\}}\displaystyle\triangleright empirical cdf of the j\displaystyle\mathcal{L}_{j}     
37:    return lX\displaystyle l_{X} \displaystyle\triangleright Matrix of function values is usual input for contour plots

Appendix B Specific features and issues with the data source

We review some known issues with the data set that impact our analysis. See Hanna et al. (2018) for a more in-depth account.

Sampling strategy

The over sampling of high income and wealthy households is achieved by applying two distinct sampling techniques. The first sample is the core representative sample selected by a standard multi-stage area-probability design. The second is the high income supplement from statistical records derived from tax data by the Statistics of Income (SOI) division of the U.S. Internal Revenue Service. The stages sample disproportionately– usually one-third of the final sample is from the high income supplement. Sampling in this way retains characteristic information of the population while also addressing the known selection biases of the wealthy not responding to surveys. In order to represent the population with this sample, weights must be constructed for each unit of observation. For more details on the construction of weights and their implications on the distribution of wealth, see Kennickell and Woodburn (1999).

Unit of observation and timing of interviews

The observations in this data set are not households, but rather a subset called the primary economic unit (PEU) that may be individuals or couples and their financial dependents. For example in the 2016 data set 13% of PEUs were in a household that contained one or more members not in their PEU. Additionally, the respondent is not necessarily the head of the household, so special care must be taken if analyzing attitudes in relation to some demographic characteristics such as age. The interviews start in May of the survey year, after most income taxes are filed and usually finish by the end of the calendar year, see Kennickell (2017b) for challenges at the end of the interview period. Questions also may change over time so it is important to review the codebook each year when making comparisons across time.

Multiple Imputation

During interviews, respondents may omit answers or provide a range of values for which their response belongs. This missing data impacts analysis and so the SCF contains 5 imputed values for each PEU, creating a sample 5 times larger than the actual number of respondents and forms 5 data sets called implicates. Imputation is done by the Federal Reserve Imputation Technique Zeta model (FRITZ), details can be found in Kennickell (2017a) based upon the ideas of Little and Rubin (2019). Multiple imputation for missing data provide multiple probable values. Each of these form a data set from which sample statistics can be found. The technique of Repeated Imputation Inference (RII) is applied in our analysis. For each implicate =1,2,3,4,5\displaystyle\ell=1,2,3,4,5, the empirical Inverse Lorenz Function l^\displaystyle\widehat{l}_{\ell*} is calculated using the appropriate quantile map estimator taking into account sample weights. Then the repeated-imputation estimate of l\displaystyle l is

l^(z)=15=15l^(z).\displaystyle\widehat{l}(z)=\frac{1}{5}\sum_{\ell=1}^{5}\widehat{l}_{\ell*}(z).

Calculation of the Gini index follows a similar procedure. Accounting for the multiple imputation in the calculation of standard errors is an important issue, but is not revelant to our visualization technique. For more information on multiple imputation and inference with imputed values, see Rubin (1996).

Definition of Wealth

In the literature, there is no consensus on what factors should be included in wealth measurement. Wolff (2021) defines wealth as marketable weath, which is the sum of marketable or fungible assets less the current value of all debts. Bricker et al. (2017) define wealth as net worth including those assets which are not readily transformed into consumption: properties, vehicles, etc. In our analysis we consider all assets, including financial, as our wealth variable.

Appendix C Additional details and results

C.1. Vector ranks and quantiles

Proposition 7 below, a seminal result in the theory of measure transportation (see Villani (2003, 2009)), states essential uniqueness of the gradient of a convex function (hence cyclically monotone map) that pushes the uniform distribution on [0,1]d\displaystyle[0,1]^{d} into the distribution of an allocation X\displaystyle X.

Following Villani (2003), we let g#ν\displaystyle g_{\#}\nu denote the image measure (or push-forward) of a measure ν\displaystyle\nu by a measurable map g:dd\displaystyle g:\mathbb{R}^{d}\rightarrow\mathbb{R}^{d}. Explicitly, for any Borel set A\displaystyle A, g#ν(A):=ν(g1(A))\displaystyle g_{\#}\nu(A):=\nu(g^{-1}(A)). The symbol \displaystyle\nabla denotes the gradient, and D\displaystyle D the Jacobian. The convex conjugate of a convex lower semicontinuous function ψ\displaystyle\psi is denoted ψ\displaystyle\psi^{\ast}.

Proposition 7 (McCann 1995).

Let P\displaystyle P and λ\displaystyle\lambda be two distributions on d\displaystyle\mathbb{R}^{d}. (i\displaystyle i) If λ\displaystyle\lambda is absolutely continuous with respect to the Lebesgue measure on d\displaystyle\mathbb{R}^{d}, with support contained in a convex set 𝒰\displaystyle\mathcal{U}, the following statements hold: there exists a convex function ψ:𝒰{+}\displaystyle\psi:\mathcal{U}\rightarrow\mathbb{R}\cup\{+\infty\} such that ψ#λ=P\displaystyle\nabla\psi_{\#}\lambda=P. The function ψ\displaystyle\nabla\psi exists and is unique, λ\displaystyle\lambda-almost everywhere. (ii\displaystyle ii) If, in addition, P\displaystyle P is absolutely continuous on d\displaystyle\mathbb{R}^{d} with support contained in a convex set 𝒳\displaystyle\mathcal{X}, the following holds: there exists a convex function ψ:𝒳{+}\displaystyle\psi^{\ast}:\mathcal{X}\rightarrow\mathbb{R}\cup\{+\infty\} such that ψ#P=λ\displaystyle\nabla\psi^{\ast}_{\#}P=\lambda. The function ψ\displaystyle\nabla\psi^{\ast} exists, is unique and equal to (ψ)1\displaystyle\left(\nabla\psi\right)^{-1}, P\displaystyle P-almost everywhere.

Proposition 7 is an extension of Brenier (1991) (see also Rachev and Rüschendorf (1990)). It removes the finite variance requirement, which is undesirable in our context. Proposition 7 is the basis for the definition of vector quantiles and ranks in Chernozhukov et al. (2017). In our context, it is applied with uniform reference measure.131313This vector quantile notion was introduced in Galichon and Henry (2012) and Ekeland et al. (2012) and called μ\displaystyle\mu-quantile.

In case d=1\displaystyle d=1, gradients of convex functions are nondecreasing functions, hence vector quantiles and ranks reduce to classical quantile and cumulative distribution functions. As the notation indicates, the function ψ\displaystyle\psi^{\ast} of proposition 7 is the convex conjugate of ψ\displaystyle\psi. In case of absolutely continuous distributions P\displaystyle P on d\displaystyle\mathbb{R}^{d} with finite variance, the vector rank function solves a quadratic optimal transport problem, i.e., vector rank R\displaystyle R minimizes, among all functions T\displaystyle T such that T(X)\displaystyle T(X) is uniform on [0,1]d\displaystyle[0,1]^{d}, the quantity 𝔼XT(X)2\displaystyle\mathbb{E}\|X-T(X)\|^{2}, where XP\displaystyle X\sim P.

Proposition 7 is the basis for definition 1. In the proofs, we shall use the notation QX=ψX\displaystyle Q_{X}=\nabla\psi_{X} for the vector quantile of a random vector X\displaystyle X and call convex function ψX\displaystyle\psi_{X} the transport potential associated with the distribution of X\displaystyle X.

C.2. Egalitarian multi-attribute allocations

C.2.1. Identical allocations: additional details

In this section, we consider bivariate allocations only. A sufficient condition for assumption 1 is supermodularity of the potential function ψX\displaystyle\psi_{X} of allocation X\displaystyle X, as shown in lemma 1 below. We also show in lemma 1, that supermodularity of the potential function ψX\displaystyle\psi_{X} also implies positive quadrant dependence of the two components X1\displaystyle X_{1} and X2\displaystyle X_{2} of X\displaystyle X, i.e., (X1x1,X2x2)(X1x1)(X2x2)\displaystyle\mathbb{P}(X_{1}\leq x_{1},X_{2}\leq x_{2})\geq\mathbb{P}(X_{1}\leq x_{1})\mathbb{P}(X_{2}\leq x_{2}), for all x1,x2𝒳\displaystyle x_{1},x_{2}\in\mathcal{X}, see Lehmann (1966).

Lemma 1 (Supermodular potential).

Suppose X\displaystyle X has a supermodular potential function, i.e.,

(2ψX(U)/u1u20)=1, with UU[0,1]2.\displaystyle\displaystyle\mathbb{P}(\partial^{2}\psi_{X}(U)/\partial u_{1}\partial u_{2}\geq 0)=1,\mbox{ with }U\sim U[0,1]^{2}.

Then, assumption 1 holds, and X1\displaystyle X_{1} and X2\displaystyle X_{2} are positive quadrant dependent.

For allocations satisfying assumption 1, we show that Lorenz map and Inverse Lorenz Function of the identical allocation serve as upper and lower bounds, respectively. Without assumption 1, some allocations may have a Lorenz map that is component-wise larger than the Lorenz map of the identical allocation for some ranks. To illustrate the point, consider the potential ψX(u)=(u1u2)2/2+u1+u2\displaystyle\psi_{X}(u)=(u_{1}-u_{2})^{2}/2+u_{1}+u_{2}. It corresponds to an allocation X\displaystyle X, whose distribution is supported on the line X1+X2=2\displaystyle X_{1}+X_{2}=2. Calculating the Lorenz map, we obtain

(r)\displaystyle\displaystyle\mathcal{L}(r) =\displaystyle\displaystyle= [r1r2(r1r2)/2+r1r2r1r2(r2r1)/2+r1r2].\displaystyle\displaystyle\left[\begin{array}[]{c}r_{1}r_{2}(r_{1}-r_{2})/2+r_{1}r_{2}\\ \\ r_{1}r_{2}(r_{2}-r_{1})/2+r_{1}r_{2}\end{array}\right].

Notice, in particular, that 1(r)>r1r2\displaystyle\mathcal{L}_{1}(r)>r_{1}r_{2} in the region where r1>r2\displaystyle r_{1}>r_{2}. If the implicit relative price of resource 2\displaystyle 2 is 1\displaystyle 1, allocation X\displaystyle X is an egalitarian allocation, since all individuals have equal budgets. However, this allocation does not satisfy assumption 1 and its Lorenz map is not dominated by (r1r2,r1r2)\displaystyle(r_{1}r_{2},r_{1}r_{2}) as we have shown. This apparent departure from properties of the scalar Lorenz curve is due to the fact that an allocation with X1+X2=2\displaystyle X_{1}+X_{2}=2 a.s. can also be considered egalitarian, as we discuss in the following section.

C.2.2. Egalitarian allocations

The identical allocation with Lorenz map (r1r2,r1r2)\displaystyle(r_{1}r_{2},r_{1}r_{2}) is a very special instance of egalitarian allocation. We extend this narrow notion of egalitarian allocation to include income egalitarianism, in the terminology of Kolm (1977). In the special case where the two resources are transferable with relative price p\displaystyle p of the second resource, an allocation is deemed egalitarian if all agents have the same budget endowment, i.e., if X1+pX2=1+p\displaystyle X_{1}+pX_{2}=1+p (where the constant value 1+p\displaystyle 1+p is derived from the normalization 𝔼X1=𝔼X2=1\displaystyle\mathbb{E}X_{1}=\mathbb{E}X_{2}=1). In the general case of non (or imperfectly) transferable resources, we call egalitarian the allocations with equalized shadow budgets.

Definition 9 (Egalitarian allocation).

An allocation X\displaystyle X such that X1+pX2=1+p\displaystyle X_{1}+pX_{2}=1+p a.s., for some p>0\displaystyle p>0, is called egalitarian.

Another way to interpret egalitarianism of such an allocation, beyond shadow budget equality, is through the perfect compensation of inequality in the marginal resource allocations by perfect negative correlation between resource allocations. The vector quantile and Lorenz map of egalitarian allocations can be characterized in the following way.

Proposition 8.

Let (U1,U2)\displaystyle(U_{1},U_{2}) be a random vector with distribution U[0,1]2\displaystyle U[0,1]^{2}. (i)\displaystyle(i) An egalitarian allocation X\displaystyle X such that X1+pX2=1+p\displaystyle X_{1}+pX_{2}=1+p, admits potential ψX(u1,u2)=u1+u2+v(pu1u2)\displaystyle\psi_{X}(u_{1},u_{2})=u_{1}+u_{2}+v(pu_{1}-u_{2}) for some convex function v\displaystyle v such that 01v(pz)𝑑z=01v(z)𝑑z\displaystyle\int_{0}^{1}v(p-z)dz=\int_{0}^{1}v(z)dz and allocation X\displaystyle X is equal in distribution to (1+pv(pU1U2),1v(pU1U2))\displaystyle(1+pv^{\prime}(pU_{1}-U_{2}),1-v^{\prime}(pU_{1}-U_{2})); (ii)\displaystyle(ii) The Lorenz map is given by

X(r)\displaystyle\displaystyle\mathcal{L}_{X}(r) =\displaystyle\displaystyle= [r1r2+0r2[v(pr1u2)v(u2)]𝑑u2r1r21p0r2[v(pr1u2)v(u2)]𝑑u2];\displaystyle\displaystyle\begin{bmatrix}r_{1}r_{2}+\int_{0}^{r_{2}}[v(pr_{1}-u_{2})-v(-u_{2})]\;du_{2}\\ r_{1}r_{2}-\frac{1}{p}\int_{0}^{r_{2}}[v(pr_{1}-u_{2})-v(-u_{2})]\;du_{2}\end{bmatrix};

(iii)\displaystyle(iii) If, in addition, F11\displaystyle F_{1}^{-1} denotes the quantile function of X1\displaystyle X_{1}, then

v(z)\displaystyle\displaystyle v(z) =\displaystyle\displaystyle= 1p0z(F11(Hp(y))1)𝑑y,\displaystyle\displaystyle\frac{1}{p}\int_{0}^{z}\left(F_{1}^{-1}(H_{p}(y))-1\right)dy,

where Hp\displaystyle H_{p} is the cdf of the random variable pU1U2\displaystyle pU_{1}-U_{2}; see lemma 2 below for an explicit expression for Hp(z)\displaystyle H_{p}(z).

Lemma 2 (Explicit formula for Hp(z)\displaystyle H_{p}(z)).

The cumulative distribution function of Z=pU1U2\displaystyle Z=pU_{1}-U_{2} with (U1,U2)U[0,1]2\displaystyle(U_{1},U_{2})\sim U[0,1]^{2} is given by the following.

Hp(z)\displaystyle\displaystyle H_{p}(z) =\displaystyle\displaystyle= {1ifp<z,1p2+zz22pifmax{p1,0}<zp,1+2z2pif0<zmax{p1,0},1p2+zifmin{p1,0}<z0,1p(12+z+z22)if1<zmin{p1,0},0ifz1.\displaystyle\displaystyle\left\{\begin{array}[]{llc}1&\mbox{if}&p<z,\\ \\ 1-\frac{p}{2}+z-\frac{z^{2}}{2p}&\mbox{if}&\max\{p-1,0\}<z\leq p,\\ \\ \frac{1+2z}{2p}&\mbox{if}&0<z\leq\max\{p-1,0\},\\ \\ 1-\frac{p}{2}+z&\mbox{if}&\min\{p-1,0\}<z\leq 0,\\ \\ \frac{1}{p}\left(\frac{1}{2}+z+\frac{z^{2}}{2}\right)&\mbox{if}&-1<z\leq\min\{p-1,0\},\\ \\ 0&\mbox{if}&z\leq-1.\end{array}\right.

We see in proposition 8 that the distribution of the egalitarian allocation X\displaystyle X is entirely determined by the convex function v\displaystyle v, which is itself determined by the distribution of one of the marginals of X\displaystyle X. This follows from the deterministic linear relationship between the two resource allocations. The perfect negative correlation compensates any inequality in the marginal allocations.

With this definition of egalitarian allocations, we show that a large class of allocations are dominated in the Lorenz order by egalitarian allocations, and that egalitarian allocations are maximal in the Lorenz order of definition 4.

Assumption 2.

For some p>0\displaystyle p>0, the potential ψX\displaystyle\psi_{X} of allocation X\displaystyle X satisfies for all z[1,p]\displaystyle z\in[-1,p]:

suppu1u2=z{1p2ψX(u1,u2)u1u2}infpu1u2=zmin{1p22ψX(u1,u2)u12,2ψX(u1,u2)u22}.\displaystyle\displaystyle\begin{array}[]{lcl}\sup_{pu_{1}-u_{2}=z}\left\{-\frac{1}{p}\frac{\partial^{2}\psi_{X}(u_{1},u_{2})}{\partial u_{1}\partial u_{2}}\right\}&\leq&\inf_{pu_{1}-u_{2}=z}\min\left\{\frac{1}{p^{2}}\frac{\partial^{2}\psi_{X}(u_{1},u_{2})}{\partial u_{1}^{2}},\frac{\partial^{2}\psi_{X}(u_{1},u_{2})}{\partial u_{2}^{2}}\right\}.\end{array}

Before stating the main result of this section, which is an extension of property (7) in section 1.4.2, we discuss sufficient conditions for assumption 2 and examples of classes of allocations that satisfy assumption 2. The following lemma provides sets of sufficient conditions based on a suitable choice of p\displaystyle p.

Lemma 3 (Sufficient condition for assumption 2).

An allocation with potential ψX\displaystyle\psi_{X} satisfies assumption 2 if any of the following conditions hold.

  1. (i)\displaystyle(i)

    The potential ψX\displaystyle\psi_{X} is supermodular.

  2. (ii)\displaystyle(ii)

    The potential ψX\displaystyle\psi_{X} satisfies:

    infu1,u22ψX(u1,u2)u12×infu1,u22ψX(u1,u2)u22+infu1,u22ψX(u1,u2)u1u20.\displaystyle\displaystyle\sqrt{\inf_{u_{1},u_{2}}\frac{\partial^{2}\psi_{X}(u_{1},u_{2})}{\partial u_{1}^{2}}\times\inf_{u_{1},u_{2}}\frac{\partial^{2}\psi_{X}(u_{1},u_{2})}{\partial u_{2}^{2}}}+\inf_{u_{1},u_{2}}\frac{\partial^{2}\psi_{X}(u_{1},u_{2})}{\partial u_{1}\partial u_{2}}\geq 0. (C.4)
  3. (iii)\displaystyle(iii)

    The function

    p(u1,u2)\displaystyle\displaystyle p(u_{1},u_{2}) :=\displaystyle\displaystyle:= 2ψX(u1,u2)u12/2ψX(u1,u2)u22\displaystyle\displaystyle\sqrt{\frac{\partial^{2}\psi_{X}(u_{1},u_{2})}{\partial u_{1}^{2}}\bigg{/}\frac{\partial^{2}\psi_{X}(u_{1},u_{2})}{\partial u_{2}^{2}}}

    is positive and constant equal to p\displaystyle p over [0,1]2\displaystyle[0,1]^{2} and, for all z[1,p]\displaystyle z\in[-1,p], the Hessian of ψX\displaystyle\psi_{X} is constant over pu1u2=z\displaystyle pu_{1}-u_{2}=z.

The first sufficient condition in lemma 3, i.e., supermodularity of the potential ψX\displaystyle\psi_{X}, imposes a form of positive dependence between the two resources, which implies assumption 2 (and 1). However, assumption 2 also accommodates allocations that do not exhibit positive dependence. For instance, the mixture of an egalitarian allocation with a positively dependent one satisfies assumption 2.

Example 6.

An allocation with potential ψ(u1,u2)=u1+u2+v(pu1u2)+ψ~(u1,u2)\displaystyle\psi(u_{1},u_{2})=u_{1}+u_{2}+v(pu_{1}-u_{2})+\tilde{\psi}(u_{1},u_{2}), with v\displaystyle v convex, p>0\displaystyle p>0 and ψ~\displaystyle\tilde{\psi} ultramodular, satisfies assumption 2. It mixes a perfectly negatively correlated allocation with a positively dependent one.

Aspecial case of condition (2) in lemma 3 is the case, where ψX\displaystyle\psi_{X} is a quadratic function, hence has a constant Hessian. Indeed, in that case, convexity of ψX\displaystyle\psi_{X} immediately yields (C.4).

Example 7.

All allocations with quadratic potential ψX(u1,u2)=a1u1+a2u2+a11u12+a12u1u2+a22u22\displaystyle\psi_{X}(u_{1},u_{2})=a_{1}u_{1}+a_{2}u_{2}+a_{11}u_{1}^{2}+a_{12}u_{1}u_{2}+a_{22}u_{2}^{2} with a1,a11,a2,a22,a12\displaystyle a_{1},a_{11},a_{2},a_{22},a_{12}\in\mathbb{R}, i.e., allocations of the form X=(a1+2a11U1+a12U2,a2+2a22U2+a12U1)\displaystyle X=(a_{1}+2a_{11}U_{1}+a_{12}U_{2},a_{2}+2a_{22}U_{2}+a_{12}U_{1}), with (U1,U2)U[0,1]2\displaystyle(U_{1},U_{2})\sim U[0,1]^{2}, satisfy assumption 2.

Sufficient condition (2) in lemma 3 can also be used to show that allocations where the two marginal resource allocations are independent also satisfy assumption 2. More generally, a large class of allocations defined as deviations from independence satisfy assumption 2 as formalized in the following example.

Example 8.

An allocation X\displaystyle X with potential ψX(u1,u2)=ϕ1(u1)+ϕ2(u2)+ρϕ(u1,u2)\displaystyle\psi_{X}(u_{1},u_{2})=\phi_{1}(u_{1})+\phi_{2}(u_{2})+\rho\phi(u_{1},u_{2}) satisfies assumption 2 if ϕ1′′B1,ϕ2′′B2,2ϕu12B11,2ϕu22B22\displaystyle\phi_{1}^{\prime\prime}\geq B_{1},\phi_{2}^{\prime\prime}\geq B_{2},\frac{\partial^{2}\phi}{\partial u_{1}^{2}}\geq B_{11},\frac{\partial^{2}\phi}{\partial u_{2}^{2}}\geq B_{22}, 2ϕu1u2B12\displaystyle\frac{\partial^{2}\phi}{\partial u_{1}u_{2}}\geq B_{12}, and ρB12(B1+ρB11)(B2+ρB22)\displaystyle-\rho B_{12}\leq\sqrt{(B_{1}+\rho B_{11})(B_{2}+\rho B_{22})} with B1,B11,B2,B22,B12\displaystyle B_{1},B_{11},B_{2},B_{22},B_{12}\in\mathbb{R}. The case ρ=0\displaystyle\rho=0 is the case of independent marginal allocations.

Assumption 2 is not satisfied, however, in case X1\displaystyle X_{1} and X2\displaystyle X_{2} are perfectly negatively dependent, i.e., X2=ϕ(X1)\displaystyle X_{2}=-\phi(X_{1}) with increasing ϕ\displaystyle\phi, when ϕ\displaystyle\phi is nonlinear.

Under assumption 2, we can complement property (7) in section 1.4.2 and emulate the traditional property of Lorenz curves, which are maximal at perfect equality. Here we show that egalitarian allocations dominate all allocations that satisfy assumption 2, and are themselves undominated thereby forming a class of distributions that are maximal under the Lorenz order.

  • Property (7) continued

    [Lorenz map maximal at egalitarian allocations] For any allocation X\displaystyle X satisfying assumption 2, there is an egalitarian allocation X~\displaystyle\tilde{X} such that XX~\displaystyle X\preccurlyeq_{\mathcal{L}}\tilde{X}, i.e., X(r)X~(r)\displaystyle\mathcal{L}_{X}(r)\leq\mathcal{L}_{\tilde{X}}(r) for all r[0,1]2\displaystyle r\in[0,1]^{2}. In addition, if two egalitarian allocations are ranked in the Lorenz order, then they are equal.

C.2.3. Proofs for section C.2

Proof of lemma 1.

Let UU[0,1]2\displaystyle U\sim U[0,1]^{2}. Let X~j:=ψuj(U1,U2)\displaystyle\tilde{X}_{j}:=\frac{\partial\psi}{\partial u_{j}}(U_{1},U_{2}), j=1,2\displaystyle j=1,2. Then (X~1,X~2)\displaystyle(\tilde{X}_{1},\tilde{X}_{2}) is distributed identically to (X1,X2)\displaystyle(X_{1},X_{2}). Since X~2=ψu2(U1,U2)\displaystyle\tilde{X}_{2}=\frac{\partial\psi}{\partial u_{2}}(U_{1},U_{2}) is monotonically increasing in U2\displaystyle U_{2}, we have U2=(ψu2)1(U1,X~2)\displaystyle U_{2}=\left(\frac{\partial\psi}{\partial u_{2}}\right)^{-1}(U_{1},\tilde{X}_{2}). Hence

X~1\displaystyle\displaystyle\tilde{X}_{1} =\displaystyle\displaystyle= ψu1(U1,(ψu2)1(U1,X~2)).\displaystyle\displaystyle\frac{\partial\psi}{\partial u_{1}}\left(U_{1},\left(\frac{\partial\psi}{\partial u_{2}}\right)^{-1}(U_{1},\tilde{X}_{2})\right).

Under the stated assumption, X~1\displaystyle\tilde{X}_{1} is increasing in U1\displaystyle U_{1} and X~2\displaystyle\tilde{X}_{2}. Since (X~1,X~2)=𝑑(X1,X2)\displaystyle\left(\tilde{X}_{1},\tilde{X}_{2}\right)\overset{d}{=}\left(X_{1},X_{2}\right), we have

FX(x1,x2)\displaystyle\displaystyle F_{X}\left(x_{1},x_{2}\right) =\displaystyle\displaystyle= (X~1x1,X~2x2)\displaystyle\displaystyle\mathbb{P}\left(\tilde{X}_{1}\leq x_{1},\tilde{X}_{2}\leq x_{2}\right)
=\displaystyle\displaystyle= 𝔼[(ψu1(U1,(ψu2)1(U1,X~2))x1,X~2x2)|U1]\displaystyle\displaystyle\mathbb{E}\bigg{[}\mathbb{P}\left(\frac{\partial\psi}{\partial u_{1}}\left(U_{1},\left(\frac{\partial\psi}{\partial u_{2}}\right)^{-1}(U_{1},\tilde{X}_{2})\right)\leq x_{1},\tilde{X}_{2}\leq x_{2}\right)\bigg{|}\;U_{1}\bigg{]}
=\displaystyle\displaystyle= 𝔼[min{F1(x1|U1),F2(x2|U1)}]\displaystyle\displaystyle\mathbb{E}\left[\min\left\{F_{1}\left(x_{1}|U_{1}\right),F_{2}\left(x_{2}|U_{1}\right)\right\}\right]
\displaystyle\displaystyle\geq 𝔼[F1(x1|U1)F2(x2|U1)],\displaystyle\displaystyle\mathbb{E}\left[F_{1}\left(x_{1}|U_{1}\right)F_{2}\left(x_{2}|U_{1}\right)\right],

where Fi(|Uj)\displaystyle F_{i}(\cdot|U_{j}) denotes the cumulative distribution function of Xi\displaystyle X_{i} conditional on Uj\displaystyle U_{j}. Now F1(x1|U1)\displaystyle F_{1}\left(x_{1}|U_{1}\right) is increasing in U1\displaystyle U_{1}, since

F1(x1|U1)\displaystyle\displaystyle F_{1}\left(x_{1}|U_{1}\right) =\displaystyle\displaystyle= (ψu1(U1,U2)x1|U1)\displaystyle\displaystyle\mathbb{P}\left(\frac{\partial\psi}{\partial u_{1}}(U_{1},U_{2})\leq x_{1}\bigg{|}\;U_{1}\right)
=\displaystyle\displaystyle= (U2(ψu1)1(U1,x1)|U1)\displaystyle\displaystyle\mathbb{P}\left(U_{2}\leq\left(\frac{\partial\psi}{\partial u_{1}}\right)^{-1}(U_{1},x_{1})\bigg{|}\;U_{1}\right)
=\displaystyle\displaystyle= (ψu1)1(U1,x1).\displaystyle\displaystyle\left(\frac{\partial\psi}{\partial u_{1}}\right)^{-1}(U_{1},x_{1}).

Similarly F2(x2|U1)\displaystyle F_{2}\left(x_{2}|U_{1}\right) is increasing in U1\displaystyle U_{1}. We conclude that FX(x1,x2)F1(x1)F2(x2)\displaystyle F_{X}\left(x_{1},x_{2}\right)\geq F_{1}\left(x_{1}\right)F_{2}\left(x_{2}\right), see e.g. Joe (1997). ∎

Proof of proposition 8.

The potential ψ\displaystyle\psi of an egalitarian allocation satisfies ψ/u1+pψ/u2=1+p\displaystyle\partial\psi/\partial u_{1}+p\partial\psi/\partial u_{2}=1+p. Solutions are of the form

ψ(v,p)(u1,u2)=u1+u2+v(pu1u2).\displaystyle\displaystyle\psi_{(v,p)}(u_{1},u_{2})=u_{1}+u_{2}+v(pu_{1}-u_{2}).

Convexity of ψ\displaystyle\psi implies convexity of v\displaystyle v. The normalization

0101ψ(v,p)(u1,u2)𝑑u1𝑑u2=(1,1)\displaystyle\displaystyle\int_{0}^{1}\!\!\!\int_{0}^{1}\nabla\psi_{(v,p)}(u_{1},u_{2})du_{1}du_{2}=(1,1)

implies

0101v(pu1u2)𝑑u1𝑑u2=0.\displaystyle\displaystyle\int_{0}^{1}\!\!\!\int_{0}^{1}v^{\prime}(pu_{1}-u_{2})du_{1}du_{2}=0.

The latter, in turn, implies

01v(px)𝑑x=01v(x)𝑑x.\displaystyle\displaystyle\int_{0}^{1}v(p-x)dx=\int_{0}^{1}v(x)dx.

Call Hp\displaystyle H_{p} the cdf of Z=pU1U2\displaystyle Z=pU_{1}-U_{2}, where (U1,U2)U[0,1]2\displaystyle(U_{1},U_{2})\sim U[0,1]^{2}. Call F1\displaystyle F_{1} the cdf of 1ψ(p,v):=1+pv(Z)\displaystyle\nabla_{1}\psi_{(p,v)}:=1+pv^{\prime}(Z), which is the first marginal of allocation X\displaystyle X. Then

F1(x)\displaystyle\displaystyle F_{1}(x) =\displaystyle\displaystyle= (v(Z)x1p)\displaystyle\displaystyle\mathbb{P}\left(v^{\prime}(Z)\leq\frac{x-1}{p}\right)
=\displaystyle\displaystyle= (Z(v)1(x1p))\displaystyle\displaystyle\mathbb{P}\left(Z\leq(v^{\prime})^{-1}\left(\frac{x-1}{p}\right)\right)
=\displaystyle\displaystyle= Hp((v)1(x1p)).\displaystyle\displaystyle H_{p}\left((v^{\prime})^{-1}\left(\frac{x-1}{p}\right)\right).

Now

F1(x)=Hp((v)1(x1p))\displaystyle\displaystyle F_{1}(x)=H_{p}\left((v^{\prime})^{-1}\left(\frac{x-1}{p}\right)\right) \displaystyle\displaystyle\Rightarrow (v)1(x1p)=Hp1(F1(x))\displaystyle\displaystyle(v^{\prime})^{-1}\left(\frac{x-1}{p}\right)=H_{p}^{-1}\left(F_{1}(x)\right)
\displaystyle\displaystyle\Rightarrow x1p=v(Hp1(F1(x)))\displaystyle\displaystyle\frac{x-1}{p}=v^{\prime}\left(H_{p}^{-1}\left(F_{1}(x)\right)\right)
\displaystyle\displaystyle\Rightarrow v(z)=F11(Hp(z))1p.\displaystyle\displaystyle v^{\prime}(z)=\frac{F_{1}^{-1}(H_{p}(z))-1}{p}.

Hence

v(z)\displaystyle\displaystyle v(z) =\displaystyle\displaystyle= 0xF11(Hp(y))1p𝑑y,\displaystyle\displaystyle\int_{0}^{x}\frac{F_{1}^{-1}(H_{p}(y))-1}{p}dy,

as desired. ∎

Proof of lemma 3.

A sufficient condition for assumption 2 is

infu1,u21p2ψX(u1,u2)u1u2min{infu1,u21p22ψX(u1,u2)u12,infu1,u22ψX(u1,u2)u22}.\displaystyle\displaystyle-\inf_{u_{1},u_{2}}\frac{1}{p}\frac{\partial^{2}\psi_{X}(u_{1},u_{2})}{\partial u_{1}\partial u_{2}}\leq\min\left\{\inf_{u_{1},u_{2}}\frac{1}{p^{2}}\frac{\partial^{2}\psi_{X}(u_{1},u_{2})}{\partial u_{1}^{2}},\inf_{u_{1},u_{2}}\frac{\partial^{2}\psi_{X}(u_{1},u_{2})}{\partial u_{2}^{2}}\right\}.

If we choose the optimal value of p\displaystyle p, i.e.,

p2=infu1,u22ψX(u1,u2)u12infu1,u22ψX(u1,u2)u22,\displaystyle\displaystyle p^{2}=\frac{\inf_{u_{1},u_{2}}\frac{\partial^{2}\psi_{X}(u_{1},u_{2})}{\partial u_{1}^{2}}}{\inf_{u_{1},u_{2}}\frac{\partial^{2}\psi_{X}(u_{1},u_{2})}{\partial u_{2}^{2}}},

we get the sufficient inequality

infu1,u22ψX(u1,u2)u1u2infu1,u22ψX(u1,u2)u12×infu1,u22ψX(u1,u2)u22\displaystyle\displaystyle-\inf_{u_{1},u_{2}}\frac{\partial^{2}\psi_{X}(u_{1},u_{2})}{\partial u_{1}\partial u_{2}}\leq\sqrt{\inf_{u_{1},u_{2}}\frac{\partial^{2}\psi_{X}(u_{1},u_{2})}{\partial u_{1}^{2}}\times\inf_{u_{1},u_{2}}\frac{\partial^{2}\psi_{X}(u_{1},u_{2})}{\partial u_{2}^{2}}}

as desired. ∎

Proof of example 2.

Let ψ(u1,u2)=u1+u2+v(pu1u2)+ψ~(u1,u2)\displaystyle\psi(u_{1},u_{2})=u_{1}+u_{2}+v(pu_{1}-u_{2})+\tilde{\psi}(u_{1},u_{2}), with v\displaystyle v convex and twice continuously differentiable, and ψ\displaystyle\psi ultramodular. We have, for j=1,2,\displaystyle j=1,2,

2ψ(u1,u2)u12=p2v′′(pu1u2)+2ψ~(u1,u2)u22,2ψ(u1,u2)u12=v′′(pu1u2)+2ψ~(u1,u2)u22.\displaystyle\displaystyle\begin{array}[]{lll}\frac{\partial^{2}\psi(u_{1},u_{2})}{\partial u_{1}^{2}}&=&p^{2}v^{\prime\prime}(pu_{1}-u_{2})+\frac{\partial^{2}\tilde{\psi}(u_{1},u_{2})}{\partial u_{2}^{2}},\\ \\ \frac{\partial^{2}\psi(u_{1},u_{2})}{\partial u_{1}^{2}}&=&v^{\prime\prime}(pu_{1}-u_{2})+\frac{\partial^{2}\tilde{\psi}(u_{1},u_{2})}{\partial u_{2}^{2}}.\end{array}

Also,

2ψ(u1,u2)u1u2=pv′′(pu1u2)+2ψ~(u1,u2)u1u2.\displaystyle\displaystyle\begin{array}[]{lll}\frac{\partial^{2}\psi(u_{1},u_{2})}{\partial u_{1}\partial u_{2}}&=&-pv^{\prime\prime}(pu_{1}-u_{2})+\frac{\partial^{2}\tilde{\psi}(u_{1},u_{2})}{\partial u_{1}\partial u_{2}}.\end{array}

Therefore

suppu1u2=z{1p2ψ(u1,u2)u1u2}=suppu1u2=z{v′′(pu1u2)1p2ψ~(u1,u2)u1u2}=v′′(z)infpu1u2=z{1p2ψ~(u1,u2)u1u2}v′′(z)+min{infpu1u2=z1p22ψ~(u1,u2)u12,infpu1u2=z2ψ~(u1,u2)u22}=infpu1u2=zmin{1p22ψ(u1,u2)u12,2ψ(u1,u2)u22}.\displaystyle\displaystyle\begin{array}[]{rll}\sup_{pu_{1}-u_{2}=z}\left\{-\frac{1}{p}\frac{\partial^{2}\psi(u_{1},u_{2})}{\partial u_{1}\partial u_{2}}\right\}&=&\sup_{pu_{1}-u_{2}=z}\left\{v^{\prime\prime}(pu_{1}-u_{2})-\frac{1}{p}\frac{\partial^{2}\tilde{\psi}(u_{1},u_{2})}{\partial u_{1}\partial u_{2}}\right\}\\ \\ &=&v^{\prime\prime}(z)-\inf_{pu_{1}-u_{2}=z}\left\{\frac{1}{p}\frac{\partial^{2}\tilde{\psi}(u_{1},u_{2})}{\partial u_{1}\partial u_{2}}\right\}\\ \\ \leq\;\;v^{\prime\prime}(z)&+&\min\left\{\inf_{pu_{1}-u_{2}=z}\frac{1}{p^{2}}\frac{\partial^{2}\tilde{\psi}(u_{1},u_{2})}{\partial u_{1}^{2}},\inf_{pu_{1}-u_{2}=z}\frac{\partial^{2}\tilde{\psi}(u_{1},u_{2})}{\partial u_{2}^{2}}\right\}\\ \\ &=&\inf_{pu_{1}-u_{2}=z}\min\left\{\frac{1}{p^{2}}\frac{\partial^{2}\psi(u_{1},u_{2})}{\partial u_{1}^{2}},\frac{\partial^{2}\psi(u_{1},u_{2})}{\partial u_{2}^{2}}\right\}.\end{array}

Proof of “property (7) continued” in C.2.2.

Define

v′′(z)\displaystyle\displaystyle v^{\prime\prime}(z) :=\displaystyle\displaystyle:= infpu1u2=zmin{1p22ψX(u1,u2)u12,2ψX(u1,u2)u22}.\displaystyle\displaystyle\inf_{pu_{1}-u_{2}=z}\min\left\{\frac{1}{p^{2}}\frac{\partial^{2}\psi_{X}(u_{1},u_{2})}{\partial u_{1}^{2}},\frac{\partial^{2}\psi_{X}(u_{1},u_{2})}{\partial u_{2}^{2}}\right\}.

Under assumption 2,

v′′(z)\displaystyle\displaystyle v^{\prime\prime}(z) \displaystyle\displaystyle\geq suppu1u2=z{1p2ψX(u1,u2)u1u2}.\displaystyle\displaystyle\sup_{pu_{1}-u_{2}=z}\left\{-\frac{1}{p}\frac{\partial^{2}\psi_{X}(u_{1},u_{2})}{\partial u_{1}\partial u_{2}}\right\}.

Hence ψ:=ψX(u1,u2)v(pu1u2)\displaystyle\psi:=\psi_{X}(u_{1},u_{2})-v(pu_{1}-u_{2}) is an ultramodular function. Applying the proof of property (7) in section 1.4.2, we find that for all (r1,r2)[0,1]2\displaystyle(r_{1},r_{2})\in[0,1]^{2},

0r10r2ψ(u1,u2)𝑑u1𝑑u2(r1r2,r1r2).\displaystyle\displaystyle\int_{0}^{r_{1}}\!\!\!\int_{0}^{r_{2}}\nabla\psi(u_{1},u_{2})du_{1}du_{2}\leq(r_{1}r_{2},r_{1}r_{2}).

Hence

0r10r2ψX(u1,u2)𝑑u1𝑑u20r10r2ψ(v,p)(u1,u2)𝑑u1𝑑u2,\displaystyle\displaystyle\int_{0}^{r_{1}}\!\!\!\int_{0}^{r_{2}}\nabla\psi_{X}(u_{1},u_{2})du_{1}du_{2}\leq\int_{0}^{r_{1}}\!\!\!\int_{0}^{r_{2}}\nabla\psi_{(v,p)}(u_{1},u_{2})du_{1}du_{2},

where ψ(v,p)(u1,u2):=v(pu1u2)+u1+u2\displaystyle\psi_{(v,p)}(u_{1},u_{2}):=v(pu_{1}-u_{2})+u_{1}+u_{2} as desired.

We now show that egalitarian allocations do not dominate each other. Suppose an egalitarian allocation X(v,p)\displaystyle X_{(v,p)} with potential v(pu1u2)+u1+u2\displaystyle v(pu_{1}-u_{2})+u_{1}+u_{2} dominates an allocation X(v~,p~)\displaystyle X_{(\tilde{v},\tilde{p})} with potential v~(p~u1u2)+u1+u2\displaystyle\tilde{v}(\tilde{p}u_{1}-u_{2})+u_{1}+u_{2}. Then

[r1r2+0r10r2[1+pv~(p~r1u2)]𝑑u1𝑑u2r1r2+0r10r2[1v~(p~r1u2)]𝑑u1𝑑u2]\displaystyle\displaystyle\left[\begin{array}[]{cc}r_{1}r_{2}+\int_{0}^{r_{1}}\!\!\!\int_{0}^{r_{2}}[1+p\tilde{v}^{\prime}(\tilde{p}r_{1}-u_{2})]\;du_{1}du_{2}\\ r_{1}r_{2}+\int_{0}^{r_{1}}\!\!\!\int_{0}^{r_{2}}[1-\tilde{v}^{\prime}(\tilde{p}r_{1}-u_{2})]\;du_{1}du_{2}\end{array}\right]
[r1r2+0r10r2[1+pv(pr1u2)]𝑑u1𝑑u2r1r2+0r10r2[1v(pr1u2)]𝑑u1𝑑u2].\displaystyle\displaystyle\hskip 100.0pt\leq\left[\begin{array}[]{cc}r_{1}r_{2}+\int_{0}^{r_{1}}\!\!\!\int_{0}^{r_{2}}[1+pv^{\prime}(pr_{1}-u_{2})]\;du_{1}du_{2}\\ r_{1}r_{2}+\int_{0}^{r_{1}}\!\!\!\int_{0}^{r_{2}}[1-v^{\prime}(pr_{1}-u_{2})]\;du_{1}du_{2}\end{array}\right].

Hence, for all (r1,r2)[0,1]2\displaystyle(r_{1},r_{2})\in[0,1]^{2},

0r10r2v~(p~u1u2)\displaystyle\displaystyle\int_{0}^{r_{1}}\!\!\!\int_{0}^{r_{2}}\tilde{v}^{\prime}(\tilde{p}u_{1}-u_{2}) =\displaystyle\displaystyle= 0r10r2v(pu1u2),\displaystyle\displaystyle\int_{0}^{r_{1}}\!\!\!\int_{0}^{r_{2}}v^{\prime}(pu_{1}-u_{2}),

so that both allocations have the same Lorenz map, hence are equally distributed. ∎

C.3. Uniform Majorization

In this section, we show the undesirable feature of uniform majorization detailed in section 2.1.2. Galichon and Henry (2012) show that the only rank dependent social evaluation functional (up to an affine transformation) that satisfies the uniform majorization principle of Kolm (1977) is given in (2.5). We now show that it is unsuitable as a tool to measure multivariate inequality with the following two observations about bivariate allocations X\displaystyle X. First, the following expression shows that SUM\displaystyle S_{UM} only depends on  1(p,1)+2(1,p)\displaystyle\mathcal{L}_{1}(p,1)+\mathcal{L}_{2}(1,p), hence, on very specific features of the dependence between the two components X1\displaystyle X_{1} and X2\displaystyle X_{2} of allocation X\displaystyle X.

SUM(X)\displaystyle\displaystyle S_{UM}(X) =\displaystyle\displaystyle= 01([1(p,1)p]+[2(1,p)p])𝑑p.\displaystyle\displaystyle\int_{0}^{1}\Bigl{(}\left[\mathcal{L}_{1}\left(p,1\right)-p\right]+\left[\mathcal{L}_{2}\left(1,p\right)-p\right]\Bigr{)}\,dp. (C.10)

Second, and more troubling still, for any given fixed marginals for X1\displaystyle X_{1} and X2\displaystyle X_{2}SUM\displaystyle S_{UM} is minimized when X1\displaystyle X_{1} and X2\displaystyle X_{2} are independent. Indeed, we show below that SUM\displaystyle S_{UM} is always larger than minus the average of univariate Gini coefficients, which is its value when the marginals are independent.

SUM(X)\displaystyle\displaystyle S_{UM}(X) \displaystyle\displaystyle\geq 12[G(X1)+G(X2)],\displaystyle\displaystyle-\frac{1}{2}\left[\,G\left(X_{1}\right)+G\left(X_{2}\right)\,\right], (C.11)

where G(X1)\displaystyle G(X_{1}) and G(X2)\displaystyle G(X_{2}) denote the classical scalar Gini index of components X1\displaystyle X_{1} and X2\displaystyle X_{2} respectively.

Proof of (C.10).

Let U\displaystyle U be uniformly distributed on [0,1]2\displaystyle[0,1]^{2}. Note that

SUM(X)\displaystyle\displaystyle S_{UM}(X) =\displaystyle\displaystyle= 1𝔼[U1ψu1(U1,U2)]𝔼[U1ψu1(U1,U2)].\displaystyle\displaystyle 1-\mathbb{E}\left[U_{1}\frac{\partial\psi}{\partial u_{1}}(U_{1},U_{2})\right]-\mathbb{E}\left[U_{1}\frac{\partial\psi}{\partial u_{1}}(U_{1},U_{2})\right].

Now,

𝔼[U1ψu1(U1,U2)]\displaystyle\displaystyle\mathbb{E}\left[U_{1}\frac{\partial\psi}{\partial u_{1}}(U_{1},U_{2})\right] =\displaystyle\displaystyle= 01[01u1ψu1(u1,u2)𝑑u1]𝑑u2\displaystyle\displaystyle\int_{0}^{1}\left[\int_{0}^{1}u_{1}\frac{\partial\psi}{\partial u_{1}}(u_{1},u_{2})\,du_{1}\right]du_{2}
=\displaystyle\displaystyle= 01[01u1d(1u2(u1,u2))]𝑑u2,\displaystyle\displaystyle\int_{0}^{1}\left[\int_{0}^{1}u_{1}\,d\left(\frac{\partial\mathcal{L}_{1}}{\partial u_{2}}(u_{1},u_{2})\right)\,\right]du_{2},

where the last equality follows from interchangeability of the order of integration and 1\displaystyle\mathcal{L}_{1} is the first component of the Lorenz map. Note that

1r2(r1,r2)\displaystyle\displaystyle\frac{\partial\mathcal{L}_{1}}{\partial r_{2}}(r_{1},r_{2}) =\displaystyle\displaystyle= 0r1ψu1(u1,u2)𝑑u1\displaystyle\displaystyle\int_{0}^{r_{1}}\frac{\partial\psi}{\partial u_{1}}(u_{1},u_{2})\,du_{1}

and

r1(1(r1,r2)r2)\displaystyle\displaystyle\frac{\partial}{\partial r_{1}}\left(\frac{\partial\mathcal{L}_{1}(r_{1},r_{2})}{\partial r_{2}}\right) =\displaystyle\displaystyle= ψu1(r1,r2).\displaystyle\displaystyle\frac{\partial\psi}{\partial u_{1}}(r_{1},r_{2}).

Therefore

𝔼[U1ψu1(U1,U2)]\displaystyle\displaystyle\mathbb{E}\left[U_{1}\frac{\partial\psi}{\partial u_{1}}(U_{1},U_{2})\right] =\displaystyle\displaystyle= 01(u11u2(u1,u2)|01011u2(u1,u2)𝑑u1)𝑑u2\displaystyle\displaystyle\int_{0}^{1}\left(u_{1}\frac{\partial\mathcal{L}_{1}}{\partial u_{2}}(u_{1},u_{2})\bigg{|}_{0}^{1}-\int_{0}^{1}\frac{\partial\mathcal{L}_{1}}{\partial u_{2}}(u_{1},u_{2})\,du_{1}\right)\,du_{2}
=\displaystyle\displaystyle= 01(1u2(1,u2)011u2(u1,u2)𝑑u1)𝑑u2\displaystyle\displaystyle\int_{0}^{1}\left(\frac{\partial\mathcal{L}_{1}}{\partial u_{2}}(1,u_{2})-\int_{0}^{1}\frac{\partial\mathcal{L}_{1}}{\partial u_{2}}(u_{1},u_{2})\,du_{1}\right)\,du_{2}
=\displaystyle\displaystyle= 011u2(1,u2)𝑑u201011u2(u1,u2)𝑑u2𝑑u1\displaystyle\displaystyle\int_{0}^{1}\frac{\partial\mathcal{L}_{1}}{\partial u_{2}}(1,u_{2})\,du_{2}-\int_{0}^{1}\int_{0}^{1}\frac{\partial\mathcal{L}_{1}}{\partial u_{2}}(u_{1},u_{2})\,du_{2}\,du_{1}
=\displaystyle\displaystyle= (1,1)(1,0)011(u1,1)1(u1,0)du1\displaystyle\displaystyle\mathcal{L}(1,1)-\mathcal{L}(1,0)-\int_{0}^{1}\mathcal{L}_{1}(u_{1},1)-\mathcal{L}_{1}(u_{1},0)\,du_{1}
=\displaystyle\displaystyle= 1011(u1,1)𝑑u1.\displaystyle\displaystyle 1-\int_{0}^{1}\mathcal{L}_{1}(u_{1},1)\,du_{1}.

Similarly, we have 𝔼[U2ψu2(U1,U2)]=1012(1,u2)𝑑u1\displaystyle\mathbb{E}\left[U_{2}\frac{\partial\psi}{\partial u_{2}}(U_{1},U_{2})\right]=1-\int_{0}^{1}\mathcal{L}_{2}(1,u_{2})\,du_{1}, as desired. ∎

Proof of (C.11).

The inequality follows from 1(p,1)L1(p)\displaystyle\mathcal{L}_{1}\left(p,1\right)\geq L_{1}\left(p\right) and 2(1,p)L2(p)\displaystyle\mathcal{L}_{2}\left(1,p\right)\geq L_{2}\left(p\right). We now prove the latter. Letting ψ\displaystyle\nabla\psi be the vector quantile function of (X1,X2)\displaystyle(X_{1},X_{2}), note that since ψu1\displaystyle\frac{\partial\psi}{\partial u_{1}} pushes uniform measure on [0,1]2\displaystyle[0,1]^{2} forward to law(X1)\displaystyle(X_{1}), we can write

L1(r1)={u:ψu1(u)z1(r1)}ψu1(u)𝑑u\displaystyle L_{1}(r_{1})=\int_{\{u:\frac{\partial\psi}{\partial u_{1}}(u)\leq z_{1}(r_{1})\}}\frac{\partial\psi}{\partial u_{1}}(u)du

where z1(r1)\displaystyle z_{1}(r_{1}) is the quantile of the random variable X1\displaystyle X_{1}. Note that the area of the domain {u:ψu1(u)z1(r1)}\displaystyle\{u:\frac{\partial\psi}{\partial u_{1}}(u)\leq z_{1}(r_{1})\} of integration must be r1\displaystyle r_{1}. On the other hand,

1(r1,1)=0r101ψu1(u)𝑑u1𝑑u2\displaystyle\mathcal{L}_{1}(r_{1},1)=\int_{0}^{r_{1}}\int_{0}^{1}\frac{\partial\psi}{\partial u_{1}}(u)du_{1}du_{2}

is an integral of the same function over a region with the same area. Writing A:={u:ψu1(u)z1(r1)}\displaystyle A:=\left\{u:\frac{\partial\psi}{\partial u_{1}}(u)\leq z_{1}(r_{1})\right\}, we have A=BC\displaystyle A=B\cup C, where B=A([0,r1]×[0,1])\displaystyle B=A\cap([0,r_{1}]\times[0,1]) and C=A((r1,1]×[0,1])\displaystyle C=A\cap((r_{1},1]\times[0,1]) and the union is disjoint. Similarly, [0,r1]×[0,1]=BD\displaystyle[0,r_{1}]\times[0,1]=B\cup D where D=([0,r1]×[0,1])Ac\displaystyle D=([0,r_{1}]\times[0,1])\cap A^{c}. Note that the areas of C\displaystyle C and D\displaystyle D must be the same, |C|=|D|\displaystyle|C|=|D|, and ψu1(u)z1(r1)\displaystyle\frac{\partial\psi}{\partial u_{1}}(u)\leq z_{1}(r_{1}) throughout C\displaystyle C while ψu1(u)>z1(r1)\displaystyle\frac{\partial\psi}{\partial u_{1}}(u)>z_{1}(r_{1}) throughout D\displaystyle D. We have

L1(r1)\displaystyle\displaystyle L_{1}(r_{1}) =\displaystyle\displaystyle= Bψu1(u)𝑑u+Cψu1(u)𝑑u\displaystyle\displaystyle\int_{B}\frac{\partial\psi}{\partial u_{1}}(u)du+\int_{C}\frac{\partial\psi}{\partial u_{1}}(u)du
\displaystyle\displaystyle\leq Bψu1(u)𝑑u+z1(r1)|C|\displaystyle\displaystyle\int_{B}\frac{\partial\psi}{\partial u_{1}}(u)du+z_{1}(r_{1})|C|
=\displaystyle\displaystyle= Bψu1(u)𝑑u+z1(r1)|D|\displaystyle\displaystyle\int_{B}\frac{\partial\psi}{\partial u_{1}}(u)du+z_{1}(r_{1})|D|
\displaystyle\displaystyle\leq Bψu1(u)𝑑u+Dψu1(u)𝑑u\displaystyle\displaystyle\int_{B}\frac{\partial\psi}{\partial u_{1}}(u)du+\int_{D}\frac{\partial\psi}{\partial u_{1}}(u)du
=\displaystyle\displaystyle= 1(r1,1).\displaystyle\displaystyle\mathcal{L}_{1}(r_{1},1).

Note that this inequality holds for any dependence structure between X1\displaystyle X_{1} and X2\displaystyle X_{2}. ∎

Appendix D Inequality Dominance based on the Inverse Lorenz Function

We can also define an increasing inequality order based on the Inverse Lorenz Functions. Consider two allocations X\displaystyle X and X\displaystyle X^{\prime}, with respective Inverse Lorenz Functions lX\displaystyle l_{X} and lX\displaystyle l_{X^{\prime}}. If lX(z)lX(z)\displaystyle l_{X}(z)\leq l_{X^{\prime}}(z) for some vector of shares z\displaystyle z, a larger proportion of the population commands the same share of resources in allocation X\displaystyle X^{\prime} than in allocation X\displaystyle X. If this is true for any vector z\displaystyle z of resource shares in [0,1]d\displaystyle[0,1]^{d}, then, we say that allocation X\displaystyle X^{\prime} is more unequal than allocation X\displaystyle X.

Definition 10.

An allocation X\displaystyle X^{\prime} is said to be more unequal in the weak Lorenz order than an allocation X\displaystyle X if lX(z)lX(z)\displaystyle l_{X^{\prime}}(z)\geq l_{X}(z) for all z[0,1]d\displaystyle z\in[0,1]^{d}. We denote this XlX\displaystyle X\succcurlyeq_{l}X^{\prime}.

The relation XlX\displaystyle X\succcurlyeq_{l}X^{\prime} is equivalent to lower orthant dominance of the random vector X(U)\displaystyle\mathcal{L}_{X}(U), with UU[0,1]d\displaystyle U\sim U[0,1]^{d}, over X(U)\displaystyle\mathcal{L}_{X^{\prime}}(U) (see Section 3.8 of Müller and Stoyan (2002)). In the scalar case, the orderings of definitions 4 and 10 both coincide with the traditional Lorenz ordering. In higher dimensions, however, the equivalence may not hold141414We have not been able to either prove the equivalence or find a counterexample.. Nonetheless, as the name indicates, the weak Lorenz inequality order of definition 10 is weaker than the Lorenz order of definition 4, as we show in proposition 9.

Proposition 9.

An allocation X\displaystyle X^{\prime} is more unequal in the weak Lorenz order than an allocation X\displaystyle X, i.e., XlX\displaystyle X\succcurlyeq_{l}X^{\prime} (definition 10) if X\displaystyle X^{\prime} is more unequal in the Lorenz order, i.e., XX\displaystyle X\preccurlyeq_{\mathcal{L}}X^{\prime} (definition 4).

Proof of proposition 9.

X~X\displaystyle\tilde{X}\succcurlyeq_{\mathcal{L}}X is equivalent to first order stochastic dominance of X(U)\displaystyle\mathcal{L}_{X}(U) over X~(U)\displaystyle\mathcal{L}_{\tilde{X}}(U), where UU[0,1]d\displaystyle U\sim U[0,1]^{d} (see Section 6.B page 266 of Shaked and Shanthikumar (2007)). Hence, X~X\displaystyle\tilde{X}\succcurlyeq_{\mathcal{L}}X implies (X(U)S)(X~(U)S)\displaystyle\mathbb{P}(\mathcal{L}_{X}(U)\in S)\leq\mathbb{P}(\mathcal{L}_{\tilde{X}}(U)\in S) for any lower set S\displaystyle S, so that X~X\displaystyle\tilde{X}\succcurlyeq_{\mathcal{L}}X implies X~lX\displaystyle\tilde{X}\succcurlyeq_{l}X, given that the sets [0,z]\displaystyle[0,z] are lower sets. ∎

Appendix E Proofs of the main results

Recall that in the proofs, we shall use the notation QX=ψX\displaystyle Q_{X}=\nabla\psi_{X} for the vector quantile of a random vector X\displaystyle X and call convex function ψX\displaystyle\psi_{X} the transport potential associated with the distribution of X\displaystyle X. See section C.1 for details. In this section, we omit the X\displaystyle X subscript of ψX\displaystyle\psi_{X} for notational compactness.

Proof of proposition 1.

In case d=2\displaystyle d=2, the off diagonal elements of the Jacobian of (r)\displaystyle\mathcal{L}(r) are ψ(r1,r2)ψ(r1,0)\displaystyle\psi(r_{1},r_{2})-\psi(r_{1},0) and ψ(r1,r2)ψ(0,r2)\displaystyle\psi(r_{1},r_{2})-\psi(0,r_{2}). From the latter, by differentiation, we can recover ψ(r1,r2)\displaystyle\nabla\psi(r_{1},r_{2}). The result then follows from the fact that ψ\displaystyle\nabla\psi characterizes PX\displaystyle P_{X}, see for instance Chernozhukov et al. (2017). The result extends straightforwardly to d>2\displaystyle d>2. ∎

Proof of proposition 2.

We only need to show the result for one component of the Lorenz map and the others follow with similar reasoning. We have for the first component

1(r)\displaystyle\displaystyle\mathcal{L}_{1}(r) =\displaystyle\displaystyle= 0rd0r20r1ψu1(u1,u2,,ud)𝑑u1𝑑u2𝑑ud\displaystyle\displaystyle\int_{0}^{r_{d}}\cdots\int_{0}^{r_{2}}\int_{0}^{r_{1}}\frac{\partial\psi}{\partial u_{1}}(u_{1},u_{2},...,u_{d})du_{1}du_{2}...du_{d}
=\displaystyle\displaystyle= 0rd0r2[ψ(r1,u2,,ud)ψ(0,u2,,ud)]𝑑u2𝑑ud\displaystyle\displaystyle\int_{0}^{r_{d}}\cdots\int_{0}^{r_{2}}[\psi(r_{1},u_{2},...,u_{d})-\psi(0,u_{2},...,u_{d})]du_{2}...du_{d}
\displaystyle\displaystyle\leq 0rd0r2r1[ψ(1,u2,,ud)ψ(0,u2,,ud)]𝑑u2𝑑ud, by convexity\displaystyle\displaystyle\int_{0}^{r_{d}}\cdots\int_{0}^{r_{2}}r_{1}[\psi(1,u_{2},...,u_{d})-\psi(0,u_{2},...,u_{d})]du_{2}...du_{d}\text{, by convexity }
=\displaystyle\displaystyle= r10rd0r201ψu1(u1,u2,,ud)𝑑u1𝑑u2𝑑ud\displaystyle\displaystyle r_{1}\int_{0}^{r_{d}}\cdots\int_{0}^{r_{2}}\int_{0}^{1}\frac{\partial\psi}{\partial u_{1}}(u_{1},u_{2},...,u_{d})du_{1}du_{2}...du_{d}

Now define H(r2,,rd)=0rd0r201ψu1(u1,u2,,ud)𝑑u1𝑑u2𝑑ud\displaystyle H(r_{2},...,r_{d})=\int_{0}^{r_{d}}\cdots\int_{0}^{r_{2}}\int_{0}^{1}\frac{\partial\psi}{\partial u_{1}}(u_{1},u_{2},...,u_{d})du_{1}du_{2}...du_{d}. Then

Hr2(r2,,rd)=0rd0r301ψu1(u1,r2,u3,,ud)𝑑u1𝑑u3𝑑ud\displaystyle\frac{\partial H}{\partial r_{2}}(r_{2},...,r_{d})=\int_{0}^{r_{d}}\cdots\int_{0}^{r_{3}}\int_{0}^{1}\frac{\partial\psi}{\partial u_{1}}(u_{1},r_{2},u_{3},...,u_{d})du_{1}du_{3}...du_{d}

is monotone increasing in r2\displaystyle r_{2}, since it is the integral of the functions r201ψu1(u1,r2,u3,,ud)𝑑u1\displaystyle r_{2}\mapsto\int_{0}^{1}\frac{\partial\psi}{\partial u_{1}}(u_{1},r_{2},u_{3},...,u_{d})du_{1}, which are monotonically increasing by assumption. Therefore, r2H(r2,rd)\displaystyle r_{2}\mapsto H(r_{2},...r_{d}) is convex and so H(r2,r3,,rd)H(0,r3,,rd)+r2(H(1,r3,,rd)H(0,r3,,rd))\displaystyle H(r_{2},r_{3},...,r_{d})\leq H(0,r_{3},...,r_{d})+r_{2}(H(1,r_{3},...,r_{d})-H(0,r_{3},...,r_{d})). Note that H(0,r3,,rd)=0\displaystyle H(0,r_{3},...,r_{d})=0 as an integral over a degenerate interval, so H(r2,r3,,rd)r2H(1,r3,,rd)\displaystyle H(r_{2},r_{3},...,r_{d})\leq r_{2}H(1,r_{3},...,r_{d}). A very similar argument yields H(1,r3,,rd)r3H(1,1,r4,,rd)\displaystyle H(1,r_{3},...,r_{d})\leq r_{3}H(1,1,r_{4},...,r_{d}), so that H(r2,r3,,rd)r2r3H(1,1,r4,,rd)\displaystyle H(r_{2},r_{3},...,r_{d})\leq r_{2}r_{3}H(1,1,r_{4},...,r_{d}), and, iterating in this way, we eventually obtain,

H(r2,r3,,rd)r2r3rdH(1,1,,1).\displaystyle H(r_{2},r_{3},...,r_{d})\leq r_{2}r_{3}\cdots r_{d}H(1,1,...,1).

We then conclude

1(r)r1r2r3rd0101ψu1(u1,u2,,ud)𝑑u1𝑑u2𝑑u3𝑑ud.\displaystyle\mathcal{L}_{1}(r)\leq r_{1}r_{2}r_{3}\cdots r_{d}\int_{0}^{1}\cdots\int_{0}^{1}\frac{\partial\psi}{\partial u_{1}}(u_{1},u_{2},...,u_{d})du_{1}du_{2}du_{3}...du_{d}.

The integral 0101ψu1(u1,u2,,ud)𝑑u1𝑑u2𝑑u3𝑑ud\displaystyle\int_{0}^{1}\cdots\int_{0}^{1}\frac{\partial\psi}{\partial u_{1}}(u_{1},u_{2},...,u_{d})du_{1}du_{2}du_{3}...du_{d} is 1\displaystyle 1, as the expected value of the normalized X1\displaystyle X_{1}, and so we obtain the desired result. ∎

Proof of proposition 3.

We need to show that a social evaluation functional S\displaystyle S of the form (2.1) satisfies

XXS(X)S(X)\displaystyle\displaystyle X\succcurlyeq_{\mathcal{L}}X^{\prime}\Rightarrow S(X)\geq S(X^{\prime})

if and only if ϕ\displaystyle\phi is of the form (2.2). To show this, we note that

S(X)\displaystyle\displaystyle S(X) =\displaystyle\displaystyle= [0,1]dϕm(u)ψX~(u)𝑑u\displaystyle\displaystyle\int_{[0,1]^{d}}\phi_{m}(u)^{\top}\nabla\psi_{\tilde{X}}(u)\,du
=\displaystyle\displaystyle= j=1d[0,1]dϕm,j(u)ψX~,j(u)𝑑u\displaystyle\displaystyle\sum_{j=1}^{d}\int_{[0,1]^{d}}\phi_{m,j}(u)\nabla\psi_{\tilde{X},j}(u)\,du
=\displaystyle\displaystyle= j=1d[0,1]d[[0,1]d𝟙{ur}𝑑mj(r)]ψX~,j(u)𝑑u\displaystyle\displaystyle\sum_{j=1}^{d}\int_{[0,1]^{d}}\left[\int_{[0,1]^{d}}\mathds{1}\{u\leq r\}\,dm_{j}(r)\right]\nabla\psi_{\tilde{X},j}(u)\,du
=\displaystyle\displaystyle= j=1d[0,1]d[[0,1]d𝟙{ur}ψX~,j(u)𝑑u]𝑑mj(r)\displaystyle\displaystyle\sum_{j=1}^{d}\int_{[0,1]^{d}}\left[\int_{[0,1]^{d}}\mathds{1}\{u\leq r\}\nabla\psi_{\tilde{X},j}(u)\,du\right]\,dm_{j}(r)
=\displaystyle\displaystyle= j=1d[0,1]dj(r)𝑑mj(r).\displaystyle\displaystyle\sum_{j=1}^{d}\int_{[0,1]^{d}}\mathcal{L}_{j}(r)\,dm_{j}(r).

So sufficiency holds for all non-negative measures. Necessity follows by taking one of the measures as a degenerate measure and the other measures as trivial measures. ∎

Proof of proposition 4.

It suffices to see that X(r)X(r)\displaystyle\mathcal{L}_{X}(r)\geq\mathcal{L}_{X^{\prime}}(r) if and only if

[0,1]d 1{ur}(ψX~(u)ψX~(u))𝑑u0.\displaystyle\displaystyle\int_{[0,1]^{d}}\;\mathds{1}\{u\leq r\}\left(\nabla\psi_{\tilde{X}}(u)-\nabla\psi_{\tilde{X}^{\prime}}(u)\right)\;du\geq 0.

Proof of proposition 5.

Suppose X\displaystyle X^{\prime} is obtained from X\displaystyle X through an MRT, so that there is a supermodular and component-wise convex function ψ\displaystyle\psi, such that ψX~(u)=ψX~(u)+ψ(u)\displaystyle\nabla\psi_{\tilde{X}^{\prime}}(u)=\nabla\psi_{\tilde{X}}(u)+\nabla\psi(u) holds for all u[0,1]d\displaystyle u\in[0,1]^{d}. We want to show  XX\displaystyle\mathcal{L}_{X^{\prime}}\leq\mathcal{L}_{X}, i.e., rψ(u)𝑑u0\displaystyle\int\!\!\!\int^{r}\nabla\psi(u)du\leq 0. Consider the first component:

0r10r20rdψ(u)u1𝑑u\displaystyle\displaystyle\int_{0}^{r_{1}}\int_{0}^{r_{2}}...\int_{0}^{r_{d}}\frac{\partial\psi(u)}{\partial u_{1}}du =\displaystyle\displaystyle= 0r20rd[ψ(r1,u2,,ud)ψ(0,u2,,ud)]𝑑u2𝑑ud\displaystyle\displaystyle\int_{0}^{r_{2}}...\int_{0}^{r_{d}}[\psi(r_{1},u_{2},...,u_{d})-\psi(0,u_{2},...,u_{d})]du_{2}...du_{d}
\displaystyle\displaystyle\leq r10r20rd[ψ(1,u2,,ud)ψ(0,u2,,ud)]𝑑u2𝑑ud\displaystyle\displaystyle r_{1}\int_{0}^{r_{2}}...\int_{0}^{r_{d}}[\psi(1,u_{2},...,u_{d})-\psi(0,u_{2},...,u_{d})]du_{2}...du_{d}
=\displaystyle\displaystyle= r1010r20rdψ(u)u1𝑑u1u2.dud\displaystyle\displaystyle r_{1}\int_{0}^{1}\int_{0}^{r_{2}}...\int_{0}^{r_{d}}\frac{\partial\psi(u)}{\partial u_{1}}du_{1}u_{2}....du_{d}

by convexity.

Now, note that supermodularity implies the function 0r20rdψ(u)u1𝑑u2.dud\displaystyle\int_{0}^{r_{2}}...\int_{0}^{r_{d}}\frac{\partial\psi(u)}{\partial u_{1}}du_{2}....du_{d} is convex with respect to r2\displaystyle r_{2}, and it is clearly 0\displaystyle 0 when r2=0\displaystyle r_{2}=0. Therefore,

0r20rdψ(u)u1𝑑u2.dudr2010r30rdψ(u)u1𝑑u2.dud.\displaystyle\int_{0}^{r_{2}}...\int_{0}^{r_{d}}\frac{\partial\psi(u)}{\partial u_{1}}du_{2}....du_{d}\leq r_{2}\int_{0}^{1}\int_{0}^{r_{3}}...\int_{0}^{r_{d}}\frac{\partial\psi(u)}{\partial u_{1}}du_{2}....du_{d}.

Similarly,

0r30rdψ(u)u1𝑑u3.dudr3010r40rdψ(u)u1𝑑u3.dud.\displaystyle\int_{0}^{r_{3}}...\int_{0}^{r_{d}}\frac{\partial\psi(u)}{\partial u_{1}}du_{3}....du_{d}\leq r_{3}\int_{0}^{1}\int_{0}^{r_{4}}...\int_{0}^{r_{d}}\frac{\partial\psi(u)}{\partial u_{1}}du_{3}....du_{d}.

Continuing iteratively in this way, we eventually obtain:

0r10r20rdψ(u)u1𝑑u\displaystyle\displaystyle\int_{0}^{r_{1}}\int_{0}^{r_{2}}...\int_{0}^{r_{d}}\frac{\partial\psi(u)}{\partial u_{1}}du \displaystyle\displaystyle\leq r1r2rd010101ψ(u)u1𝑑u\displaystyle\displaystyle r_{1}r_{2}...r_{d}\int_{0}^{1}\int_{0}^{1}...\int_{0}^{1}\frac{\partial\psi(u)}{\partial u_{1}}du
=\displaystyle\displaystyle= r1r2rd[(LX)1(1,1,1)(LX)1(1,1,1)]=0\displaystyle\displaystyle r_{1}r_{2}...r_{d}[\mathcal{(}L_{X^{\prime}})_{1}(1,1,...1)-\mathcal{(}L_{X})_{1}(1,1,...1)]=0

Similar reasoning applies to the other components, and the result follows. ∎

Proof of proposition 6.

(1) Let lX(z1,z2)=α\displaystyle l_{X}(z_{1},z_{2})=\alpha. Since lX\displaystyle l_{X} is a cdf, hence non decreasing in both arguments, then z2:=lX1(z1;α)=inf{ζ;αlX(z1,ζ)}\displaystyle z_{2}:=l_{X}^{-1}(z_{1};\alpha)=\inf\{\zeta;\alpha\leq l_{X}(z_{1},\zeta)\} is non increasing in z1\displaystyle z_{1} and non decreasing in α\displaystyle\alpha. (2) See Claim 1 in Brock and Thomson (1966). ∎

Proposition 10.

(Rawlesian limits of social welfare functions) For a random variable X\displaystyle X in \displaystyle\mathbb{R}, define, as in Section 2.3, the social welfare function:

Sδ(X):=δ(δ1)[0,1](1r)δ2X(r)𝑑r.\displaystyle S_{\delta}(X):=\delta(\delta-1)\int_{[0,1]}(1-r)^{\delta-2}\mathcal{L}_{X}(r)\,dr.

Then limδSδ(X)=QX(0+):=limu0+QX(u)\displaystyle\lim_{\delta\rightarrow\infty}S_{\delta}(X)=Q_{X}(0+):=\lim_{u\rightarrow 0^{+}}Q_{X}(u).

Proof of proposition 10.

Integrating by parts implies

Sδ(X)=δ01(1r)δ1QX(r)𝑑r=δi=1N(i1)/Ni/N(1r)δ1QX(r)𝑑r\displaystyle S_{\delta}(X)=\delta\int_{0}^{1}(1-r)^{\delta-1}Q_{X}(r)dr=\delta\sum_{i=1}^{N}\int_{(i-1)/N}^{i/N}(1-r)^{\delta-1}Q_{X}(r)dr

for any positive integer N\displaystyle N. As the quantile QX()\displaystyle Q_{X}(\cdot) is increasing, we then have

Sδ(X)\displaystyle\displaystyle S_{\delta}(X) \displaystyle\displaystyle\leq δi=1NQX(i/N)(i1)/Ni/N(1r)δ1𝑑r\displaystyle\displaystyle\delta\sum_{i=1}^{N}Q_{X}(i/N)\int_{(i-1)/N}^{i/N}(1-r)^{\delta-1}dr
=\displaystyle\displaystyle= i=1NQX(i/N)[(1r)δ]|(i1)/Ni/N\displaystyle\displaystyle\sum_{i=1}^{N}Q_{X}(i/N)[-(1-r)^{\delta}]|^{i/N}_{(i-1)/N}
=\displaystyle\displaystyle= QX(1/N)(1(11/N)δ)+i=2NQX(i/N)[(1i1N)δ(1iN)δ]\displaystyle\displaystyle Q_{X}(1/N)(1-(1-1/N)^{\delta})+\sum_{i=2}^{N}Q_{X}(i/N)[(1-\frac{i-1}{N})^{\delta}-(1-\frac{i}{N})^{\delta}]
=\displaystyle\displaystyle= QX(1/N)+M(δ),\displaystyle\displaystyle Q_{X}(1/N)+M(\delta),

where the function M(δ):=QX(1/N)(11/N)δ+i=2NQX(i/N)[(1i1N)δ(1iN)δ]\displaystyle M(\delta):=-Q_{X}(1/N)(1-1/N)^{\delta}+\sum_{i=2}^{N}Q_{X}(i/N)[(1-\frac{i-1}{N})^{\delta}-(1-\frac{i}{N})^{\delta}] tends to 0\displaystyle 0 as δ\displaystyle\delta\rightarrow\infty.

Similarly, we have

Sδ(X)\displaystyle\displaystyle S_{\delta}(X) \displaystyle\displaystyle\geq δi=1NQX((i1)/N)(i1)/Ni/N(1r)δ1𝑑r\displaystyle\displaystyle\delta\sum_{i=1}^{N}Q_{X}((i-1)/N)\int_{(i-1)/N}^{i/N}(1-r)^{\delta-1}dr
=\displaystyle\displaystyle= QX(0+)+m(δ)\displaystyle\displaystyle Q_{X}(0+)+m(\delta)

where m(δ):=QX(0+)(11/N)δ+i=2NQX((i1)/N)[(1i1N)δ(1iN)δ]\displaystyle m(\delta):=-Q_{X}(0+)(1-1/N)^{\delta}+\sum_{i=2}^{N}Q_{X}((i-1)/N)[(1-\frac{i-1}{N})^{\delta}-(1-\frac{i}{N})^{\delta}] tends to 0\displaystyle 0 as δ\displaystyle\delta\rightarrow\infty. We therefore have

QX(0+)lim infδSδ(X)lim supδSδ(X)QX(1/N).\displaystyle Q_{X}(0+)\leq\liminf_{\delta\rightarrow\infty}S_{\delta}(X)\leq\limsup_{\delta\rightarrow\infty}S_{\delta}(X)\leq Q_{X}(1/N).

As this holds for every integer N\displaystyle N, and limNQX(1/N)=QX(0+)\displaystyle\lim_{N\rightarrow\infty}Q_{X}(1/N)=Q_{X}(0+), the result follows. ∎

The following is a multivariate extension:

Proposition 11.

Let X\displaystyle X be a d\displaystyle d dimensional random variable and QX\displaystyle Q_{X} its multivariate quantile. Consider the j\displaystyle jth term in the sum defining the multivariate S-Gini in Section 2.3:

Sδj(X):=δj(δj1)[0,1]d(1rj)δj2j(r)𝑑r.\displaystyle S^{j}_{\delta}(X):=\delta_{j}(\delta_{j}-1)\int_{[0,1]^{d}}(1-r_{j})^{\delta_{j}-2}\mathcal{L}_{j}(r)\,dr.

Then

limδjSδj(X)=[0,1]d1Πi=1,ijd(1ui)Qj(u1,u2,uj1,0+,uj+1,ud)𝑑u1𝑑u2𝑑uj1𝑑uj+1𝑑ud,\displaystyle\lim_{\delta_{j}\rightarrow\infty}S_{\delta}^{j}(X)=\int_{[0,1]^{d-1}}\Pi_{i=1,i\neq j}^{d}(1-u_{i})Q_{j}(u_{1},u_{2},...u_{j-1},0+,u_{j+1},...u_{d})du_{1}du_{2}...du_{j-1}du_{j+1}...du_{d},

where Qj(u1,u2,uj1,0+,uj+1,ud)=limuj0+Qj(u)\displaystyle Q_{j}(u_{1},u_{2},...u_{j-1},0+,u_{j+1},...u_{d})=\lim_{u_{j}\rightarrow 0^{+}}Q_{j}(u).

Note that the expression is a measure of inequality among agents with lowest rank in the j\displaystyle jth component, with weighting functions in the other rank variables as in the standard Gini index.

Proof.

The proof is similar to the argument above. Without loss of generality, we assume j=1\displaystyle j=1. Integrating by parts, we have, for any positive integer N\displaystyle N:

Sδ1(X)\displaystyle\displaystyle S^{1}_{\delta}(X) =\displaystyle\displaystyle= δ1[0,1]d(1r1)δ11[0r20rdQ1(r1,u2,,ud)𝑑u2𝑑ud]𝑑r\displaystyle\displaystyle\delta_{1}\int_{[0,1]^{d}}(1-r_{1})^{\delta_{1}-1}\Big{[}\int_{0}^{r_{2}}\cdots\int_{0}^{r_{d}}Q_{1}(r_{1},u_{2},...,u_{d})du_{2}...du_{d}\Big{]}dr
=\displaystyle\displaystyle= δ1[0,1]d1(i=1N(i1)/Ni/N(1r1)δ11[0r20rdQ1(r1,u2,,ud)𝑑u2𝑑ud]𝑑r1)𝑑r2𝑑rd\displaystyle\displaystyle\delta_{1}\int_{[0,1]^{d-1}}\Big{(}\sum_{i=1}^{N}\int_{(i-1)/N}^{i/N}(1-r_{1})^{\delta_{1}-1}\Big{[}\int_{0}^{r_{2}}\cdots\int_{0}^{r_{d}}Q_{1}(r_{1},u_{2},...,u_{d})du_{2}...du_{d}\Big{]}dr_{1}\Big{)}dr_{2}...dr_{d}

Using monotonicity of Q\displaystyle Q in the first argument, we then have:

δ1i=1N(i1)/Ni/N(1r1)δ11[0r20rdQ1(r1,u2,,ud)𝑑u2𝑑ud]𝑑r1\displaystyle\displaystyle\delta_{1}\sum_{i=1}^{N}\int_{(i-1)/N}^{i/N}(1-r_{1})^{\delta_{1}-1}\Big{[}\int_{0}^{r_{2}}\cdots\int_{0}^{r_{d}}Q_{1}(r_{1},u_{2},...,u_{d})du_{2}...du_{d}\Big{]}dr_{1}
δ1i=1N[0r20rdQ1(i/N,u2,,ud)𝑑u2𝑑ud](i1)/Ni/N(1r1)δ11𝑑r1\displaystyle\displaystyle\leq\delta_{1}\sum_{i=1}^{N}\Big{[}\int_{0}^{r_{2}}\cdots\int_{0}^{r_{d}}Q_{1}(i/N,u_{2},...,u_{d})du_{2}...du_{d}\Big{]}\int_{(i-1)/N}^{i/N}(1-r_{1})^{\delta_{1}-1}dr_{1}
=i=1N[0r20rdQ1(i/N,u2,,ud)𝑑u2𝑑ud][(1i1N)δ1(1iN)δ1]\displaystyle\displaystyle=\sum_{i=1}^{N}\Big{[}\int_{0}^{r_{2}}\cdots\int_{0}^{r_{d}}Q_{1}(i/N,u_{2},...,u_{d})du_{2}...du_{d}\Big{]}\Big{[}(1-\frac{i-1}{N})^{\delta_{1}}-(1-\frac{i}{N})^{\delta_{1}}\Big{]}
=0r20rdQ1(1/N,u2,,ud)𝑑u2𝑑ud+M(δ1,r2,,rd)\displaystyle\displaystyle=\int_{0}^{r_{2}}\cdots\int_{0}^{r_{d}}Q_{1}(1/N,u_{2},...,u_{d})du_{2}...du_{d}+M(\delta_{1},r_{2},...,r_{d}) (E.1)

where M(δ1,r2,,rd)\displaystyle M(\delta_{1},r_{2},...,r_{d}) converges monotonically to 0\displaystyle 0 as δ1\displaystyle\delta_{1}\rightarrow\infty.

Therefore, using the monotone convergence theorem, we have

lim supδ1Sδ1(X)\displaystyle\displaystyle\limsup_{\delta_{1}\rightarrow\infty}S_{\delta}^{1}(X) \displaystyle\displaystyle\leq [0,1]d1(0r20rdQ1(1/N,u2,,ud)𝑑u2𝑑ud)𝑑r2𝑑rd\displaystyle\displaystyle\int_{[0,1]^{d-1}}\Big{(}\int_{0}^{r_{2}}\cdots\int_{0}^{r_{d}}Q_{1}(1/N,u_{2},...,u_{d})du_{2}...du_{d}\Big{)}dr_{2}...dr_{d}
=\displaystyle\displaystyle= [0,1]d1Πi=2d(1uj)Q1(1/N,u2,,ud)𝑑u2𝑑ud\displaystyle\displaystyle\int_{[0,1]^{d-1}}\Pi_{i=2}^{d}(1-u_{j})Q_{1}(1/N,u_{2},...,u_{d})du_{2}...du_{d}

As this holds for all N\displaystyle N, and Q1\displaystyle Q_{1} is monotone in the first argument, we can take the limit as N\displaystyle N\rightarrow\infty and apply the monotone convergence theorem again to obtain

lim supδ1Sδ1(X)[0,1]d1Πi=2d(1uj)Q1(0+,u2,,ud)𝑑u2𝑑ud\displaystyle\limsup_{\delta_{1}\rightarrow\infty}S_{\delta}^{1}(X)\leq\int_{[0,1]^{d-1}}\Pi_{i=2}^{d}(1-u_{j})Q_{1}(0+,u_{2},...,u_{d})du_{2}...du_{d}

A very similar argument to the one used to derive (E) then implies:

δ1i=1N(i1)/Ni/N(1r1)δ11[0r20rdQ1(r1,u2,,ud)𝑑u2𝑑ud]𝑑r1\displaystyle\displaystyle\delta_{1}\sum_{i=1}^{N}\int_{(i-1)/N}^{i/N}(1-r_{1})^{\delta_{1}-1}\Big{[}\int_{0}^{r_{2}}\cdots\int_{0}^{r_{d}}Q_{1}(r_{1},u_{2},...,u_{d})du_{2}...du_{d}\Big{]}dr_{1}
0r20rdQ1(0+,u2,,ud)𝑑u2𝑑ud+m(δ1,r2,,rd)\displaystyle\displaystyle\geq\int_{0}^{r_{2}}\cdots\int_{0}^{r_{d}}Q_{1}(0+,u_{2},...,u_{d})du_{2}...du_{d}+m(\delta_{1},r_{2},...,r_{d})

where m(δ)\displaystyle m(\delta) converges monotonically to 0\displaystyle 0 as δ1\displaystyle\delta_{1}\rightarrow\infty. Proceeding as above gives

lim infδ1Sδ1(X)[0,1]d1Πi=2d(1uj)Q1(0+,u2,,ud)𝑑u2𝑑ud\displaystyle\liminf_{\delta_{1}\rightarrow\infty}S_{\delta}^{1}(X)\geq\int_{[0,1]^{d-1}}\Pi_{i=2}^{d}(1-u_{j})Q_{1}(0+,u_{2},...,u_{d})du_{2}...du_{d}

which implies the desired conclusion. ∎

References

  • Aaberge and Brandolini [2014] R. Aaberge and A. Brandolini. Multidimensional poverty and inequality. Discussion Papers, No. 792, Statistics Norway, Research Department, Oslo, 2014.
  • Andreoli and Zoli [2020] F. Andreoli and C. Zoli. From unidimensional to multidimensional inequality: a review. Metron, 78:5–42, 2020.
  • Arnold [2008] B. Arnold. The Lorenz curve: Evergreen after 110 years. In Advances on Income Inequality and Concentration Measures. Routledge, 2008.
  • Arnold [1983] B. C. Arnold. Pareto Distributions. International Co-operative Publishing House, 1983.
  • Arnold [2012] B. C. Arnold. Majorization and the Lorenz order: A brief introduction. Springer Science & Business Media, 2012.
  • Arnold and Sarabia [2018] B. C. Arnold and J. M. Sarabia. Majorization and the Lorenz order with Applications in Applied Mathematics and Economics. Springer, 2018.
  • Atkinson and Bourguignon [1982] A. Atkinson and F. Bourguignon. The comparison of multi-dimensioned distributions of economic status. Review of Economic Studies, 49:183–201, 1982.
  • Aurenhammer et al. [1998] F. Aurenhammer, F. Hoffmann, and B. Aronov. Minkowski-type theorems and least-squares clustering. Algorithmica, 20:61–76, 1998.
  • Banerjee [2010] A. Banerjee. Multidimensional Gini index. Mathematical Social Sciences, 60:87–93, 2010.
  • Banerjee [2016] A. Banerjee. Multidimensional Lorenz dominance: A definition and an example. Keio Economic Studies, 52:65–80, 2016.
  • Bickel and Lehmann [1976] P. Bickel and E. Lehmann. Descriptive statistics for nonparametric models, iii. dispersion. Annals of Statistics, 4:1139–1158, 1976.
  • Brenier [1991] Y. Brenier. Polar factorization and monotone rearrangement of vector‐valued functions. Communications on Pure and Applied Mathematics, 4:375–417, 1991.
  • Bricker et al. [2017] J. Bricker, L. J. Dettling, A. Henriques, J. W. Hsu, L. Jacobs, K. B. Moore, S. Pack, J. Sabelhaus, J. Thompson, and R. A. Windle. Changes in US family finances from 2013 to 2016: Evidence from the survey of consumer finances. Fed. Res. Bull., 103:1, 2017.
  • Brock and Thomson [1966] W. Brock and R. Thomson. Convex solutions of implicit relations. Mathematics Magazine, 39:208–111, 1966.
  • Carlier et al. [2010] G. Carlier, A. Galichon, and F. Santambrogio. From Knothe’s transport to Brenier’s map and a continuation method for optimal transport. SIAM Journal on Mathematical Analysis, 41:2554–2576, 2010.
  • Carlier et al. [2016] G. Carlier, V. Chernozhukov, and A. Galichon. Vector quantile regression: An optimal transport approach. Annals of Statistics, 44:1165–1192, 2016.
  • Charpentier et al. [2016] A. Charpentier, A. Galichon, and M. Henry. Local utility and multivariate risk aversion. Mathematics of Operations Research, 41:266–276, 2016.
  • Chernozhukov et al. [2017] V. Chernozhukov, A. Galichon, M. Hallin, and M. Henry. Monge-Kantorovich depth, quantiles, ranks and signs. Annals of Statistics, 45:223–256, 2017.
  • Chiappori et al. [2017] P.-A. Chiappori, R. McCann, and B. Pass. Multi to one-dimensional transport. Communications on Pure and Applied Mathematics, 70:2405–2444, 2017.
  • Dardanoni [1993] V. Dardanoni. Measuring social mobility. Journal of Economic Theory, 61:372–394, 1993.
  • Davidson and Duclos [2000] R. Davidson and J.-Y. Duclos. Statistical inference for stochastic dominance and for the measurement of poverty and inequality. Econometrica, 68:1435–1464, 2000.
  • Deb and Sen [2023] N. Deb and B. Sen. Multivariate rank-based distribution-free nonparametric testing using measure transportation. Journal of the American Statistical Association, 118:192–207, 2023.
  • Decancq and Lugo [2012] K. Decancq and M. Lugo. Inequality of wellbeing: A multidimensional approach. Economica, 79:721–746, 2012.
  • Donaldson and Weymark [1980] D. Donaldson and J. Weymark. Single parameter generalizations of Gini indices of inequality. Journal of Economic Theory, 22:67–86, 1980.
  • Dudley [2002] R. Dudley. Real Analysis and Probability. Cambridge University Press, 2002.
  • Ekeland et al. [2012] I. Ekeland, A. Galichon, and M. Henry. Comonotone measures of multivariate risks. Mathematical Finance, 22:109–132, 2012.
  • Farina and Savaglio [2006] F. Farina and E. Savaglio, editors. Three approaches to the analysis of multidimensional inequality, chapter 10. Routledge, 2006.
  • Faugeras and Rüschendorf [2017] O. Faugeras and Rüschendorf. Markov morphisms: A combined copula and mass transportation approach to multivariate quantiles. Mathematica Applicanda, 45:3–45, 2017.
  • Faure and Gravel [2021] M. Faure and N. Gravel. Reducing inequalities among unequals. International Economic Review, 62:357–404, 2021.
  • Figalli [2018] A. Figalli. On the continuity of center-outward distribution and quantile functions. Nonlinear Analysis, 177:413–429, 2018.
  • Fisher [1956] F. Fisher. Income distribution, value judgments and welfare. Quarterly Journal of Economics, 70:380–424, 1956.
  • Gajdos and Weymark [2005] T. Gajdos and J. Weymark. Multidimensional generalized Gini indices. Economic Theory, 26:471–496, 2005.
  • Galichon and Henry [2012] A. Galichon and M. Henry. Dual theory of choice under multivariate risks. Journal of Economic Theory, 147:1501–1516, 2012.
  • Gastwirth [1971] J. L. Gastwirth. A general definition of the Lorenz curve. Econometrica, 39:1037–1039, 1971.
  • Ghosal and Sen [2022] P. Ghosal and B. Sen. Multivariate ranks and quantiles using optimal transport: Consistency, rates, and nonparametric testing. Annals of Statistics, 2022.
  • Gravel and Moyes [2012] N. Gravel and P. Moyes. Ethically robust comparisons of bi-dimensionnal distributions with an ordinal attribute. Journal of Economic Theory, 147:1384–1426, 2012.
  • Grothe et al. [2022] O. Grothe, F. Kächele, and F. Schmid. A multivariate extension of the Lorenz curve based on copulas and a related multivariate gini coefficient. Journal of Economic Inequality, 20:727–748, 2022.
  • Hallin [2022] M. Hallin. Measure transportation and statistical decision theory. Annual Review of Statistics and Its Application, 9:410–424, 2022.
  • Hallin and Mordant [2022] M. Hallin and G. Mordant. Center-outward multiple-output Lorenz curves and Gini indices a measure transportation approach. Unpublished manuscript, 2022.
  • Hallin et al. [2021] M. Hallin, E. del Barrio, J. Cuesta-Albertos, and C. Matrán. Distribution and quantile functions, ranks and signs in dimension d: A measure transportation approach. Annals of Statistics, 49:1139–1165, 2021.
  • Hanna et al. [2018] S. D. Hanna, K. T. Kim, and S. Lindamood. Behind the numbers: Understanding the survey of consumer finances. Journal of Financial Counseling and Planning, 29:410–418, 2018.
  • Hardy et al. [1934] G. Hardy, J. Littlewood, and G. Pólya. Inequalities. Cambridge University Press, 1934.
  • Joe [1997] H. Joe. Multivariate Models and Multivariate Dependence Concepts. Chapman and Hall, 1997.
  • Kennickell [2017a] A. Kennickell. Multiple imputation in the survey of consumer finances. Statistical Journal of the IAOS, 33:143–151, 2017a.
  • Kennickell [2017b] A. B. Kennickell. The bitter end? The close of the 2007 SCF field period. Statistical Journal of the IAOS, 33:93–99, 2017b.
  • Kennickell and Woodburn [1999] A. B. Kennickell and R. L. Woodburn. Consistent weight design for the 1989, 1992 and 1995 SCFs, and the distribution of wealth. Review of Income and Wealth, 45:193–215, 1999.
  • Knothe [1957] H. Knothe. Contributions to the theory of convex bodies. Michigan Mathematical Journal, 4:39–52, 1957.
  • Kolm [1977] S.-C. Kolm. Multidimensional egalitarianisms. The Quarterly Journal of Economics, 91:1–13, 1977.
  • Koshevoy [1995] G. Koshevoy. Multivariate Lorenz majorization. Social Choice and Welfare, 12:93–102, 1995.
  • Koshevoy and Mosler [1996] G. Koshevoy and K. Mosler. The Lorenz zonoid of a multivariate distribution. Journal of the American Statistical Association, 91:873–882, 1996.
  • Koshevoy and Mosler [1997] G. Koshevoy and K. Mosler. Multivariate Gini indices. Journal of Multivariate Analysis, 60:252–276, 1997.
  • Koshevoy and Mosler [1999] G. Koshevoy and K. Mosler. Price majorization and the inverse Lorenz function. Discussion Papers in Statistics and Econometrics 3/99, University of Cologne, 1999.
  • Koshevoy and Mosler [2007] G. Koshevoy and K. Mosler. Multivariate Lorenz dominance based on zonoids. AStA, 91:57–76, 2007.
  • Lehmann [1966] E. L. Lehmann. Some concepts of dependence. The Annals of Mathematical Statistics, 37:1137–1153, 1966.
  • Little and Rubin [2019] R. J. Little and D. B. Rubin. Statistical analysis with missing data, volume 793. John Wiley & Sons, 2019.
  • Lorenz [1905] M. Lorenz. Methods of measuring the concentration of wealth. Publication of the American Statistical Association, 9:209–219, 1905.
  • Maasoumi [1986] E. Maasoumi. The measurement and decomposition of multi-dimensional inequality. Econometrica, 54:991–997, 1986.
  • Maasoumi and Racine [2016] E. Maasoumi and J. Racine. A solution to aggregation and an application to multidimensional ‘well-being’ frontiers. Journal of Econometrics, 191:374–383, 2016.
  • Marshall et al. [2011] A. W. Marshall, I. Olkin, and B. C. Arnold. Inequalities: Theory of Majorization and Its Applications. Springer, 2nd edition, 2011.
  • McCann [1995] R. J. McCann. Existence and uniqueness of monotone measure-preserving maps. Duke Mathematical Journal, 80:309–323, 1995.
  • Mérigot [2011] Q. Mérigot. A multiscale approach to optimal transport. Computer Graphics Forum, 30:1583–1592, 2011.
  • Müller and Stoyan [2002] A. Müller and D. Stoyan. Comparison Methods for Stochastic Models and Risks. Wiley, 2002.
  • Peyré and Cuturi [2018] G. Peyré and M. Cuturi. Computational Optimal Transport. ArXiv:1803.00567, 2018.
  • Puccetti and Scarsini [2010] G. Puccetti and M. Scarsini. Multivariate comonotonicity. Journal of Multivariate Analysis, 101:291–304, 2010.
  • Quiggin [1992] J. Quiggin. Increasing risk: Another definition. In A. Chikan, editor, Progress in Decision, Utility and Risk Theory, pages 239–248. Kluwer: Dordrecht, 1992.
  • Rachev and Rüschendorf [1990] S. Rachev and L. Rüschendorf. A characterization of random variables with minimal L2 distance. Journal of Multivariate Analysis, 32:48–54, 1990.
  • Rockafellar [1966] R. T. Rockafellar. Characterization of the subdifferentials of convex functions. Pacific Journal of Mathematics, 17:497–510, 1966.
  • Rosenblatt [1952] M. Rosenblatt. Remarks on a multivariate transformation. Annals of Mathematical Statistics, 23:470–472, 1952.
  • Rubin [1996] D. B. Rubin. Multiple imputation after 18+ years. Journal of the American Statistical Association, 91:473–489, 1996.
  • Sarabia and Jorda [2014] J.-M. Sarabia and V. Jorda. Bivariate Lorenz curves: a review of recent proposals. In XXII Jornadas de ASEPUMA, 2014.
  • Sarabia and Jorda [2020] J.-M. Sarabia and V. Jorda. Lorenz surfaces based on the Sarmanov–Lee distribution with applications to multidimensional inequality in well-being. Mathematics, 8:1–17, 2020.
  • Shaked and Shanthikumar [2007] M. Shaked and J. G. Shanthikumar. Stochastic Orders. Springer, 2007.
  • Shi et al. [2022] H. Shi, M. Drton, and F. Han. Distribution-free consistent independence tests via center-outward ranks and signs. Journal of the American Statistical Association, 117:395–410, 2022.
  • Shorrocks [1983] A. F. Shorrocks. Ranking income distributions. Economica, 197:3–17, 1983.
  • Taguchi [1972a] T. Taguchi. On the two-dimensional concentration surface and extensions of concentration coefficient and Pareto distribution to the two-dimensional case-i. Annals of the Institute of Statistical Mathematics, 24:355–382, 1972a.
  • Taguchi [1972b] T. Taguchi. On the two-dimensional concentration surface and extensions of concentration coefficient and Pareto distribution to the two-dimensional case-ii. Annals of the Institute of Statistical Mathematics, 24:599–619, 1972b.
  • Villani [2003] C. Villani. Topics in Optimal Tranportation. American Mathematical Society, 2003.
  • Villani [2009] C. Villani. Optimal Transport: Old and New. Springer, 2009.
  • Weymark [1981] J. Weymark. Generalized Gini inequality indices. Mathematical Social Sciences, 1:409–430, 1981.
  • Wolff [2021] E. N. Wolff. Household wealth trends in the United States, 1962 to 2019: Median wealth rebounds… but not enough. Working paper, National Bureau of Economic Research, 2021.
  • Yaari [1987] M. E. Yaari. The dual theory of choice under risk. Econometrica, 55:95–115, 1987.