This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

On indication, strict monotonicity, and efficiency of projections
in a general class of path-based data envelopment analysis models

Margaréta Halická [email protected] Mária Trnovská [email protected] Aleš Černý [email protected] Faculty of Mathematics, Physics and Informatics, Comenius University in Bratislava, Mlynská dolina, 842 48 Bratislava, Slovakia Bayes Business School, City St George’s, University of London, 106 Bunhill Row, London EC1Y 8TZ, UK
Abstract

Data envelopment analysis (DEA) theory formulates a number of desirable properties that DEA models should satisfy. Among these, indication, strict monotonicity, and strong efficiency of projections tend to be grouped together in the sense that, in individual models, typically, either all three are satisfied or all three fail at the same time. Specifically, in slacks-based graph models, the three properties are always met; in path-based models, such as radial models, directional distance function models, and the hyperbolic function model, the three properties, with some minor exceptions, typically all fail.

Motivated by this observation, the article examines relationships among indication, strict monotonicity, and strong efficiency of projections in the class of path-based models over variable returns-to-scale technology sets. Under mild assumptions, it is shown that the property of strict monotonicity and strong efficiency of projections are equivalent, and that both properties imply indication. This paper also characterises a narrow class of technology sets and path directions for which the three properties hold in path-based models.

keywords:
Data envelopment analysis , Directional distance function , Hyperbolic distance function , Indication , Strict monotonicity
journal: European Journal of Operational Research

1 Introduction

Data envelopment analysis (DEA) is a non-parametric method measuring relative efficiency within a group of homogeneous units that use multiple inputs to produce multiple outputs. This paper focuses on DEA models that integrate an analytical description of the technology set and an efficiency measure into a single mathematical optimisation programme. The output of the programme for the unit under evaluation is its efficiency score and a certain benchmark/projection (a point on the frontier of the technology set) from which the score is derived. The concepts of efficiency score and projection enter the formulations of the so-called desirable properties of the models.

The most comprehensive lists of desirable properties are provided in Cooper, Park and Pastor (1999) and Sueyoshi and Sekitani (2009). These properties include indication of strong efficiency; homogeneity; strict/weak monotonicity; boundedness; unit invariance; and translation invariance. The selected DEA models are then classified on the basis of these criteria. In the works Halická and Trnovská (2021) and Halická, Trnovská and Černý (2024), the property of strongly efficient projection is added to the desirable properties and the fulfilment of all properties is studied within two wide classes of DEA models. It is observed that the three properties: indication, strong efficiency of projection, and strict monotonicity usually appear as a trio, either all three are fulfilled or not fulfilled in the given model.

This article examines the connection between the above mentioned three properties, the meaning and importance of which are as follows.
Indication: The efficiency score is equal to one if and only if the evaluated unit is strongly efficient. This property allows revealing the strong efficiency of a unit just from the mere value of its efficiency score.
Strict monotonicity: An increase in any input or decrease of any output relative to the evaluated unit, holding other inputs as well as outputs constant, reduces the efficiency score. This property states that the measure of efficiency is fair – if a unit dominates another unit, then the former one has greater efficiency score. In the work Pastor et al. (1999), this property is interpreted as the sensitivity of the measure to changes in inputs or outputs.
Strong efficiency of projections: Projections generated by the model are strongly efficient. This property is sometimes called the property of efficient comparison. It states that the score is based on the comparison of the evaluated unit with a strongly efficient one, and therefore the value of the efficiency score is not overestimated. It could be viewed as an extension of the indication property (dealing with units with efficiency score equal to one) to any unit, and it could be alternatively formulated as: the efficiency score accounts for all sources of inefficiency if and only if the projection point is strongly efficient.

Note that the desirable properties of the measures over general technology sets were already formulated and analysed prior to the emergence of DEA – in the framework of economic production theory.111In the economic production theory, the measures of technical efficiency or inefficiency are called indices and the desirable properties of the indices are called axioms. The first work that formulated certain properties that an input (or output)-based efficiency measure should satisfy was the work of Färe and Lovell (1978). In this work, four properties were formulated, which, in addition to the three mentioned above, also included the property of homogeneity. Next, in Russell (1985) it was shown that, for a special type of measure, the efficient comparison property is redundant with respect to the other three properties. Certain objections were also raised regarding the unclear formulation of this property, worded as: “…comparison to efficient input vectors (the measure compares each feasible input vector to an efficient input vector).”

Later, in both areas, the list of desirable properties was expanded and the properties were examined not only in connection with input or output models, but also for graph measures of efficiency such as hyperbolic or additive measures (e.g. Pastor et al., 1999 and Russell and Schworm, 2011). In these and other works, the efficient comparison property was no longer mentioned, objections to the vagueness of the definition of this property were not reconsidered, and the validity of the conclusions of Russell (1985) about its redundancy for other types of measures was not verified.

In actual fact, DEA does provide mathematical tools for defining the projection and also supplies additional reasons to not neglecting the property of efficient comparison. Thanks to DEA, the models can be formulated as mathematical programming problems, and the relevant benchmarks/projections can be easily identified from their optimal solutions. This makes the efficient comparison property well-defined. Furthermore, the efficient comparison property (as well as the indication property) can be easily verified in the DEA models for each of the assessed units using the standard second-phase method. Nevertheless, the efficient comparison property is only seldom included in the list of desirable properties,222To the best of our knowledge, the efficient comparison property is explicitly listed among desirable properties of DEA models only in Halická and Trnovská (2021), Halická et al. (2024) and Pastor et al. (2022, p. 40), where in the last cited work this property appears as an extension of the standard ‘indication property’ under (E1b). and instead many authors settle for the weaker property of indication. However, the efficient comparison property and its verification in specific models are particularly important in DEA. Namely, DEA approximates the most widely used variable returns-to-scale technology using observed data coupled with the postulates of convexity and free disposability. This leads to polyhedral technologies whose large parts of the frontiers are not strongly efficient. Then, individual models projecting units on the frontier may derive scores from projections that are weakly but not strongly efficient.

There are two basic ways of searching for the benchmarks in DEA, leading to the classification of models into path-based models and slacks-based models introduced in the work Russell and Schworm (2018). As explained there, the slacks-based measures are expressed in terms of additive or multiplicative slacks for all inputs and outputs, and particular measures are generated by specifying the form of aggregation over the coordinate-wise slacks. On the other hand, the path-based measures are expressed in terms of a common contraction/expansion factor, and particular measures are generated by specifying the parametric path leading from the assessed unit towards the frontier of the technology set. In this class, the projection is uniquely determined as the point at which the path leaves the technology set. However, the slacks-based models may provide multiple benchmarks and hence it may not be apparent from which benchmark the efficient score is derived.

The desirable properties of slacks-based models over variable returns-to-scale (VRS) technology sets were analysed in Halická and Trnovská (2021) through a general scheme. The scheme encompassed all commonly used models, such as the Slacks-Based Measure (SBM) model of Tone (2001), the Russell Graph Measure model of Färe, Grosskopf and Lovell (1985), the Additive Model (AM) of Charnes et al. (1985), and the Weighted Additive Models (WAM) including the Range Adjusted Measure (RAM) model (Cooper, Park and Pastor, 1999) and the Bounded Adjusted Measure (BAM) model (Cooper et al., 2011). It was shown in Halická and Trnovská (2021) that the scores of these models are derived from benchmarks that may not be unique but are all strongly efficient, and the resulting score does not depend on the choice of the benchmark. As a consequence, all models in this class satisfy the indication and efficient comparison properties and, therefore, they all account for all sources of inefficiencies. The strict monotonicity property had to be individually assessed for models belonging to this scheme and was proven for all standard models with the exception of the BAM model, which only met the weak monotonicity.

Quite different results were obtained in the class of path-based models under VRS technology sets. These models include the radial BCC input or output-oriented models (Banker, Charnes and Cooper, 1984), the directional distance function (DDF) model (Chambers, Chung and Färe, 1996, 1998), and the hyperbolic distance function (HDF) model (Färe, Grosskopf and Lovell, 1985). Halická et al. (2024) analysed this class through a general scheme that depends on the parameters choices (convex functions and directional vectors) and covers all standard path-based models. The scheme allows for negative data.

The results showed that most models fail simultaneously all three properties of indication, strict monotonicity, and efficient comparison. However, the authors also presented very specific examples of path-based models that, in the case of technology sets with one input and one output, met all three properties. The simultaneous failure or success of the three properties invites the following questions,

  1. (i)

    to what extent the three properties are related;

  2. (ii)

    can one find sufficient conditions under which path-based models satisfy all three properties.

The aim of this article is to provide answers to these two questions.

With regard to the first question, it is fairly immediate that the property of efficient comparison implies the indication property; such an implication is also valid outside of the scheme of path-based models. However, the reverse implication is not apparent, and we will show that it is not even generally valid. Moreover, the connection between the efficient comparison property and the property of strict monotonicity is also unclear. In the framework of the general scheme of path-based models, we will show the equivalence of these two properties under mild assumptions. To the best of our knowledge, this equivalence will be identified and proved here for the first time.

Obtaining an answer to the second question can be important for practitioners — is it possible to attain the three desirable properties by a suitable choice of parameters of path-based models? An analysis outlined in Halická et al. (2024) indicates that appropriately modified range directions of Portela et al. (2004) tend to improve the properties of the model. These directions work well in single-input and single-output examples but not in all three-dimensional examples. More extensive numerical experiments with two inputs and two outputs carried out in Halická et al. (2024) show that these directions can significantly reduce the number of units not projected onto the strongly efficient frontier but cannot eliminate the presence of such units completely in general. Therefore, it seems that the problem partly stems from the characteristics of the VRS technology set generated by the data. In the second part of this article, we confirm this conjecture. We provide a characterisation of type of data, or equivalently, the type of technological set that, together with the aforementioned choice of directions, will ensure the three properties are met. We term this type of technology the ideal technology. We also offer a practical recipe to determine whether given data generates an ideal technology set.

The paper is organised as follows. The preliminaries required to accurately address the topics discussed in this article are outlined in Section 2. Section 3 examines the connections among the three desirable properties. In Section 4, particular directions and technology sets are outlined to ensure that all three properties are satisfied. Section 5 contains several numerical illustrations on real data. The concluding Section 6 summarises our theoretical findings and draws lessons for practitioners. A analyses the facial structure of a general VRS technology set, B gives equivalent characterizations of the ideal technology, and C contains the proof of Lemma 4.8.

2 Preliminaries

Let d\mathbb{R}^{d} be the dd-dimensional Euclidean space and +d\mathbb{R}^{d}_{+} its non-negative orthant. Bold lowercase letters denote column vectors, and bold uppercase letters denote matrices. The superscript denotes the transpose of a column vector or a matrix. For a vector 𝒛od\bm{z}_{o}\in\mathbb{R}^{d}, zkoz_{ko} denotes its kk-th component, and hence 𝒛o=[z1o,,zdo]\bm{z}_{o}=[z_{1o},\dots,z_{do}]^{\top}. The symbol 𝒆\bm{e} denotes a vector of ones.

We will consider a production process with mm inputs and ss outputs. For any two input-output vectors (𝒙o,𝒚o),(𝒙p,𝒚p)m×s(\bm{x}_{o},\bm{y}_{o}),(\bm{x}_{p},\bm{y}_{p})\in\mathbb{R}^{m}\times\mathbb{R}^{s} we will use the notation

  • (𝒙o,𝒚o)(𝒙p,𝒚p)(\bm{x}_{o},\bm{y}_{o})\succsim(\bm{x}_{p},\bm{y}_{p}) if (𝒙o,𝒚o)(\bm{x}_{o},\bm{y}_{o}) weakly dominates (𝒙p,𝒚p)(\bm{x}_{p},\bm{y}_{p}), i.e., 𝒙o𝒙p\bm{x}_{o}\leq\bm{x}_{p} and 𝒚o𝒚p\bm{y}_{o}\geq\bm{y}_{p};

  • (𝒙o,𝒚o)(𝒙p,𝒚p)(\bm{x}_{o},\bm{y}_{o})\succnsim(\bm{x}_{p},\bm{y}_{p}) if (𝒙o,𝒚o)(\bm{x}_{o},\bm{y}_{o}) dominates (𝒙p,𝒚p)(\bm{x}_{p},\bm{y}_{p}), i.e, 𝒙o𝒙p\bm{x}_{o}\leq\bm{x}_{p}, 𝒚o𝒚p\bm{y}_{o}\geq\bm{y}_{p}, and (𝒙o,𝒚o)(𝒙p,𝒚p)(\bm{x}_{o},\bm{y}_{o})\neq(\bm{x}_{p},\bm{y}_{p});

  • (𝒙o,𝒚o)(𝒙p,𝒚p)(\bm{x}_{o},\bm{y}_{o})\succ(\bm{x}_{p},\bm{y}_{p}) if (𝒙o,𝒚o)(\bm{x}_{o},\bm{y}_{o}) strictly dominates (𝒙p,𝒚p)(\bm{x}_{p},\bm{y}_{p}), i.e., 𝒙o<𝒙p\bm{x}_{o}<\bm{x}_{p} and 𝒚o>𝒚p\bm{y}_{o}>\bm{y}_{p}.

2.1 Technology set

Consider a set of nn decision-making units DMUj{\mathrm{DMU}}_{j} (j=1,,nj=1,\dots,n) with observed input–output vectors (𝒙j,𝒚j)m×s(\bm{x}_{j},\bm{y}_{j})\in\mathbb{R}^{m}\times\mathbb{R}^{s}. The input–output data of (DMUj)j=1n(\mathrm{DMU}_{j})_{j=1}^{n} are arranged into the m×nm\times n input and s×ns\times n output matrices 𝑿=[𝒙1,,𝒙n]\bm{X}=[\bm{x}_{1},\dots,\bm{x}_{n}] and 𝒀=[𝒚1,,𝒚n]\bm{Y}=[\bm{y}_{1},\dots,\bm{y}_{n}], respectively. No assumption about the non-negativity of the data is made at this point. The non-negativity requirement may follow later from other assumptions placed on the models.

Based on the given data 𝑿,𝒀\bm{X},\bm{Y} we consider the technology set

𝒯={(𝒙,𝒚)m×s|𝑿𝝀𝒙,𝒀𝝀𝒚,𝝀𝟎,𝒆𝝀=1},\mathcal{T}=\left\{(\bm{x},\bm{y})\in\mathbb{R}^{m}\times\mathbb{R}^{s}\ |\ \bm{X}\bm{\lambda}\leq\bm{x},\ \bm{Y}\bm{\lambda}\geq\bm{y},\ \bm{\lambda}\geq\bm{0},\ \bm{e}^{\top}\bm{\lambda}=1\right\}, (1)

corresponding to variable returns to scale (VRS). Note that the common non-negativity of (𝒙,𝒚)(\bm{x},\bm{y}) is not imposed here. It follows from (1) that the set 𝒯\mathcal{T} is closed, has a non-empty interior (denoted int𝒯\operatorname{int}\mathcal{T}) and its boundary 𝒯\partial\mathcal{T} satisfies 𝒯=𝒯int𝒯\partial\mathcal{T}=\mathcal{T}\setminus\text{int}\mathcal{T}. Elements of 𝒯{\mathcal{T}} will be called units. By (𝒙o,𝒚o)(\bm{x}_{o},\bm{y}_{o}) we denote a unit from 𝒯{\mathcal{T}} to be evaluated.

The point (𝒙min,𝒚max)m×s(\bm{x}^{\mathrm{min}},\bm{y}^{\mathrm{max}})\in\mathbb{R}^{m}\times\mathbb{R}^{s}, with elements ximin=minjxijx^{\mathrm{min}}_{i}=\min_{j}x_{ij}, for i=1,mi=1,\dots m and yrmax=maxjxrjy^{\mathrm{max}}_{r}=\max_{j}x_{rj} for r=1,sr=1,\dots s, respectively, is called the ideal point of 𝒯\mathcal{T} in DEA (see, e.g., Portela et al., 2004). The ideal point typically does not belong to 𝒯\mathcal{T}, in which case it dominates every unit in 𝒯\mathcal{T}. The technology set, for which (𝒙min,𝒚max)𝒯(\bm{x}^{\mathrm{min}},\bm{y}^{\mathrm{max}})\in\mathcal{T}, will be called trivial. Clearly, a trivial technology is an affine transformation of a non-negative orthant.

A unit (𝒙o,𝒚o)𝒯(\bm{x}_{o},\bm{y}_{o})\in\mathcal{T} is called strongly efficient333This is the well known Pareto–Koopmans efficiency. Some authors call such units Pareto efficient, or fully efficient - see the discussion in Cooper et al. (2007, p. 45). if no other unit in 𝒯\mathcal{T} dominates (𝒙o,𝒚o)(\bm{x}_{o},\bm{y}_{o}), i.e., if the property that (𝒙,𝒚)𝒯(\bm{x},\bm{y})\in\mathcal{T} dominates (𝒙o,𝒚o)(\bm{x}_{o},\bm{y}_{o}) yields (𝒙,𝒚)=(𝒙o,𝒚o)(\bm{x},\bm{y})=(\bm{x}_{o},\bm{y}_{o}). A unit (𝒙o,𝒚o)𝒯(\bm{x}_{o},\bm{y}_{o})\in\mathcal{T} is called weakly efficient if there is no unit in 𝒯\mathcal{T} that strictly dominates (𝒙o,𝒚o)(\bm{x}_{o},\bm{y}_{o}).

Evidently, any strongly efficient unit is weakly efficient, and the weakly efficient units lie on the boundary of the technology set. The converse is also true: every unit on the boundary 𝒯\partial\mathcal{T} is weakly efficient because the definition of 𝒯\mathcal{T} in (1) does not impose the non-negativity assumption on the units therein. The boundary 𝒯\partial\mathcal{T} is thus uniquely partitioned into the strongly efficient frontier S𝒯\partial^{S}\mathcal{T} containing all strongly efficient units and the remaining part W𝒯:=𝒯S𝒯\partial^{W}\mathcal{T}:=\partial\mathcal{T}\setminus\partial^{S}\mathcal{T}, which consists of the weakly but not strongly efficient units. In this paper we will refer to the remaining part of the boundary as the weakly efficient frontier. Thus, we have 𝒯=S𝒯W𝒯\partial\mathcal{T}=\partial^{S}\mathcal{T}\cup\partial^{W}\mathcal{T} and S𝒯W𝒯=\partial^{S}\mathcal{T}\cap\partial^{W}\mathcal{T}=\emptyset.

2.2 A general scheme for path-based models

We now recall the general scheme (GS) for path-based models from Halická et al. (2024). The scheme depends on both the choice of a prescription 𝒈\bm{g} that defines the directional vector 𝒈o=(𝒈ox,𝒈oy)0\bm{g}_{o}=(\bm{g}_{o}^{x},\bm{g}_{o}^{y})\gneqq 0 for each (𝒙o,𝒚o)(\bm{x}_{o},\bm{y}_{o}), and the choice of real functions ψx\psi^{x} and ψy\psi^{y} that together with their domains (dom\operatorname{dom}) and images (im\operatorname{im}) satisfy the following assumptions:

  1. (A1)

    dom(ψx)=(ax,)\operatorname{dom}(\psi^{x})=(a^{x},\infty) with ax{,0}a^{x}\in\{-\infty,0\} and dom(ψy)=(ay,)\operatorname{dom}(\psi^{y})=(a^{y},\infty) with ay{,0}a^{y}\in\{-\infty,0\};

  2. (A2)

    ψx\psi^{x} is smooth, concave, increasing and ψy\psi^{y} is smooth, convex, decreasing;

  3. (A3)

    ψx(1)=ψy(1)=1\psi^{x}(1)=\psi^{y}(1)=1;

  4. (A4)

    im(ψx)=(bx,)\text{im}(\psi^{x})=(b^{x},\infty) with bx=b^{x}=-\infty if 𝒈y=𝟎\bm{g}^{y}=\bm{0} and bx{,0}b^{x}\in\{-\infty,0\} otherwise; im(ψy)=(by,)\operatorname{im}(\psi^{y})=(b^{y},\infty) with by{,0}b^{y}\in\{-\infty,0\}.

The scheme is built on the technology set 𝒯\mathcal{T} specified in (1) and is consequently also dependent on the matrices 𝑿,𝒀\bm{X},\bm{Y}. Any choices of (𝑿,𝒀,𝒈,ψx,ψy)(\bm{X},\bm{Y},\bm{g},\psi^{x},\psi^{y}) that satisfy the requirements mentioned above will be referred to as admissible parameters for a model in the GS scheme.

For a fixed choice of admissible parameters (𝑿,𝒀,𝒈,ψx,ψy)(\bm{X},\bm{Y},\bm{g},\psi^{x},\psi^{y}), the (path-based) GS model for assessment of (𝒙o,𝒚o)𝒯(\bm{x}_{o},\bm{y}_{o})\in\mathcal{T} is defined by

(GS)omin\displaystyle(\text{GS})_{o}\qquad\min\ θ\displaystyle{}\theta (2a)
𝑿𝝀𝒙o+(ψx(θ)1)𝒈ox,\displaystyle{}\bm{X}\bm{\lambda}\leq\bm{x}_{o}+(\psi^{x}(\theta)-1)\bm{g}_{o}^{x}, (2b)
𝒀𝝀𝒚o+(ψy(θ)1)𝒈oy,\displaystyle{}\bm{Y}\bm{\lambda}\geq\bm{y}_{o}+(\psi^{y}(\theta)-1)\bm{g}_{o}^{y}, (2c)
𝒆𝝀=1,𝝀𝟎.\displaystyle{}\bm{e}^{\top}\bm{\lambda}=1,\quad\bm{\lambda}\geq\bm{0}. (2d)

The right-hand sides of (2b) and (2c), denoted by

ϕox(θ):=𝒙o+(ψx(θ)1)𝒈oxandϕoy(θ):=𝒚o+(ψy(θ)1)𝒈oy,\bm{\phi}_{o}^{x}(\theta):=\bm{x}_{o}+(\psi^{x}(\theta)-1)\bm{g}_{o}^{x}\quad\text{and}\quad\bm{\phi}_{o}^{y}(\theta):=\bm{y}_{o}+(\psi^{y}(\theta)-1)\bm{g}_{o}^{y}, (3)

define a smooth path ϕo(θ)=(ϕox(θ),ϕoy(θ))\bm{\phi}_{o}(\theta)=(\bm{\phi}_{o}^{x}(\theta),\bm{\phi}_{o}^{y}(\theta)) in the input-output space m×s\mathbb{R}^{m}\times\mathbb{R}^{s} parametrized by θ𝒟=dom(ψx)dom(ψy)\theta\in{\cal{D}}=\operatorname{dom}(\psi^{x})\cap\operatorname{dom}(\psi^{y}).

For any choice of admissible parameters and each (𝒙o,𝒚o)𝒯(\bm{x}_{o},\bm{y}_{o})\in\mathcal{T}, the well-definedness of the programme (GS)o(GS)_{o} and other useful properties of the path ϕo\bm{\phi}_{o} are established in Theorem 3.1 of Halická et al. (2024). According to this theorem, the path ϕo\bm{\phi}_{o} passes through the point (𝒙o,𝒚o)𝒯(\bm{x}_{o},\bm{y}_{o})\in\mathcal{T} at θ=1\theta=1 (i.e., ϕo(1)=(𝒙o,𝒚o)\bm{\phi}_{o}(1)=(\bm{x}_{o},\bm{y}_{o})), and for decreasing values of θ\theta it moves towards the boundary of TT gradually passing through points that dominate one another. This property of the path can be formally expressed as ϕo(θ1)ϕo(θ2)\bm{\phi}_{o}(\theta_{1})\succnsim\bm{\phi}_{o}(\theta_{2}) for θ1<θ2\theta_{1}<\theta_{2}, and we will refer to it as the monotonicity of the path with respect to θ\theta. Finally, the path leaves 𝒯\mathcal{T} at some ϕo(θo)𝒯\bm{\phi}_{o}(\theta^{*}_{o})\in\partial\mathcal{T}, where θo1\theta^{*}_{o}\leq 1, and θo\theta^{*}_{o} is the optimal value in (GS)o(GS)_{o}. The optimal value θo\theta^{*}_{o} is called the efficiency score, or alternatively, the value of the efficiency measure for (𝒙o,𝒚o)(\bm{x}_{o},\bm{y}_{o}). The point (ϕox(θo),ϕoy(θo))(\bm{\phi}_{o}^{x}(\theta_{o}^{*}),\bm{\phi}_{o}^{y}(\theta_{o}^{*})) on the path ϕo\bm{\phi}_{o} is called the projection of (𝒙o,𝒚o)(\bm{x}_{o},\bm{y}_{o}) in the GS model.

2.3 Standard path-based models

It is easy to see that the well-known BCC input and output models (Banker et al., 1984), the hyperbolic distance function model (HDF) (Färe et al., 1985) as well as the generic directional distance function model (DDF) (Chambers et al., 1996, 1998) can be equivalently rewritten in the form of the general scheme (2). The scheme also includes the so-called generalised distance function model (GDF) introduced by Chavas and Cox (1999). These models, taken in conjunction with the usual assumptions on the positiveness of the data, will be called the standard path-based models. The corresponding parameterizations are shown in Table 1.

Model ϕox(θ)\bm{\phi}_{o}^{x}(\theta) ϕoy(θ)\bm{\phi}_{o}^{y}(\theta)
BCC-I Banker et al. (1984) 𝒙o+(θ1)𝒙o\bm{x}_{o}+(\theta-1)\bm{x}_{o} 𝒚o\bm{y}_{o}
BCC-O Banker et al. (1984) 𝒙o\bm{x}_{o} 𝒚o+(1θ1)𝒚o\bm{y}_{o}+(\frac{1}{\theta}-1)\bm{y}_{o}
DDF-g Chambers et al. (1996, 1998) 𝒙o+(θ1)𝒈oy\bm{x}_{o}+(\theta-1)\bm{g}_{o}^{y} 𝒚o+(2θ1)𝒈oy\bm{y}_{o}+(2-\theta-1)\bm{g}_{o}^{y}
HDF Färe et al. (1985) 𝒙o+(θ1)𝒙o\bm{x}_{o}+(\theta-1)\bm{x}_{o} 𝒚o+(1θ1)𝒚o\bm{y}_{o}+(\frac{1}{\theta}-1)\bm{y}_{o}
GDF Chavas and Cox (1999) p[0,1]p\in[0,1] 𝒙o+(θ1p1)𝒙o\bm{x}_{o}+(\theta^{1-p}-1)\bm{x}_{o} 𝒚o+(θp1)𝒚o\bm{y}_{o}+(\theta^{-p}-1)\bm{y}_{o}
Table 1: Parameterization of the standard path-based models.

The choice gox=0g^{x}_{o}=0 or goy=0g^{y}_{o}=0 for all (𝒙o,𝒚o)𝒯(\bm{x}_{o},\bm{y}_{o})\in\mathcal{T} leads to output or input oriented models, respectively. Among the standard path-based models, only DDF is formulated with general directional vectors. The BCC as well as the HDF use gox=𝒙og^{x}_{o}=\bm{x}_{o} and / or goy=𝒚og^{y}_{o}=\bm{y}_{o}. A generalisation of HDF towards general directions is introduced in Halická et al. (2024).

2.4 Properties of the general model

In Halická et al. (2024), the GS models are analysed in light of ten desired properties. Three of these merit further investigation. First, we recall the precise definitions of Halická et al. (2024) abridged to suit our needs here.

  • (ID)

    Identification of strong efficiency. If for a given (𝒙o,𝒚o)𝒯(\bm{x}_{o},\bm{y}_{o})\in\mathcal{T} one has θo=1\theta_{o}^{*}=1, then (𝒙o,𝒚o)S𝒯(\bm{x}_{o},\bm{y}_{o})\in\partial^{S}\mathcal{T}.

  • (PR)

    Strong efficiency of projections. One has ϕo(θo)S𝒯\phi_{o}(\theta_{o}^{*})\in\partial^{S}\mathcal{T} for each (𝒙o,𝒚o)𝒯(\bm{x}_{o},\bm{y}_{o})\in\mathcal{T}.

  • (MO)

    Strict monotonicity. If (𝒙o,𝒚o)(𝒙p,𝒚p)(\bm{x}_{o},\bm{y}_{o})\succnsim(\bm{x}_{p},\bm{y}_{p}) for some (𝒙o,𝒚o),(𝒙p,𝒚p)𝒯(\bm{x}_{o},\bm{y}_{o}),(\bm{x}_{p},\bm{y}_{p})\in\mathcal{T}, then θo<θp\theta_{o}^{*}<\theta^{*}_{p}.

Remark 2.1.

Note that (ID) is the ‘only if ’part of the property known in DEA as Indication: θo=1\theta_{o}^{*}=1, if and only if (𝒙o,𝒚o)S𝒯(\bm{x}_{o},\bm{y}_{o})\in\partial^{S}\mathcal{T}. Unlike the ‘only if ’part of Indication, the ‘if ’part is universally satisfied by each model of the GS scheme (see Theorem 2 in Halická et al., 2024).444The ‘if ’and the ‘only if ’parts of Indication property are denoted as (P2a) and (P2b), respectively, in Halická et al. (2024).

Remark 2.2.

A weaker version of (MO) where (𝒙o,𝒚o)(𝒙p,𝒚p)(\bm{x}_{o},\bm{y}_{o})\succsim(\bm{x}_{p},\bm{y}_{p}) implies θoθp\theta_{o}^{*}\leq\theta^{*}_{p}, is known in DEA as (weak) monotonicity. This property is met in GS models under mild assumptions that are satisfied by all common path-based models (Halická et al., 2024, Remark 8 and Theorem 9).

Let us briefly revisit the reasons for studying this triplet in more detail. The findings of Halická et al. (2024) show that none of the three properties are satisfied in the entire class of GS models. Moreover, there are examples of admissible model parameters, where all three properties are met, but it also appears that in most models the three properties fail simultaneously.

The relationships among the three properties for a particular GS model with a fixed set of admissible model parameters have been partially explored in Halická et al. (2024). Since (ID) is a restriction of (PR) to the weakly efficient frontier, it is easy to see that (PR) \Rightarrow (ID), or equivalently,

NOT (ID) \Rightarrow NOT (PR).555By NOT (XY) we understand that the model corresponding to the given configuration of admissible parameters violates the property (XY) at least at one (𝒙o,𝒚o)𝒯(\bm{x}_{o},\bm{y}_{o})\in\mathcal{T}.

NOT (ID) occurs if and only if there is a unit on the weakly efficient frontier whose score is 1. Hence Halická et al. (2024, Theorem 4.20) further yields

NOT (ID) \Rightarrow NOT (MO).

Moreover, NOT (ID) arises whenever a unit on the weakly efficient frontier is associated with a positive direction (Halická et al., 2024, Theorem 3). The hierarchy of these properties, formulated by contra-position, is summarised in Figure 1.

PRID no point
in
W𝒯\partial^{W}\mathcal{T} has
score of 1
not all
directions
are positive
MO

Figure 1: Hierarchy of properties for the GS model corresponding to a fixed choice of admissible model parameters.

It remains to establish the link between (PR) and (MO) and find examples, if any, where (ID) holds but (PR) and/or (MO) do not. This will be the subject of Section 3 with the main findings summarised in Figure 3.

A natural next question is what selection of admissible parameters leads to the success or failure of these characteristics. To this end, illustrative examples in Halická et al. (2024) indicate that the success could be connected with a special configuration of the data 𝑿,𝒀\bm{X},\bm{Y} defining 𝒯\mathcal{T} together with a special choice of unit-dependent directions based on Portela et al.’s range directions

gox=𝒙o𝒙min,goy=𝒚max𝒚o,g^{x}_{o}=\bm{x}_{o}-\bm{x}^{\mathrm{min}},\qquad g^{y}_{o}=\bm{y}^{\mathrm{max}}-\bm{y}_{o}, (4)

suitably modified to fit the GS scheme. This topic is addressed in Section 4.

3 Connections among (ID), (PR), and (MO)

Unless the model parameters are explicitly specified, the results of this section apply to a particular GS model with a fixed but arbitrarily chosen set of admissible model parameters. We are looking to investigate the consequences of (ID) on (PR) and (MO), as well as the connection between (PR) and (MO). In preparation for these tasks, we recall the necessary and sufficient conditions for (PR).

Theorem 3.1 (Halická et al., 2024, Theorem 4).

The projection of (𝐱o,𝐲o)(\bm{x}_{o},\bm{y}_{o}) is strongly efficient if and only if for each optimal solution (𝛌o,θo)(\bm{\lambda}^{*}_{o},\theta^{*}_{o}) of (2), the inequality constraints (2b) and (2c) are satisfied with equality.

3.1 (ID) does not imply (PR)

The next example shows that (ID) may hold while (PR) fails for a certain selection of admissible model parameters.

Refer to caption
Figure 2: DDF-g model with specific directions over one input and one output technology set. All units belonging to W𝒯\partial^{W}\mathcal{T} are projected on S𝒯\partial^{S}\mathcal{T}, but some units belonging to int𝒯\mathcal{T} are projected on W𝒯\partial^{W}\mathcal{T}. Therefore, (ID) is satisfied, but (PR) is not.
Example 3.2.

Consider a one-input and one-output example with 7 DMUs: units A=(1,1)A=(1,1), B=(1,2)B=(1,2), C=(2,3)C=(2,3), and D=(3,3)D=(3,3) on the boundary of 𝒯\mathcal{T} and units E=(2,1)E=(2,1), F=(3,1)F=(3,1), and G=(4,1)G=(4,1) in its interior. Now, let us apply the DDF-g model with directions gox=𝒙o𝒙ming^{x}_{o}=\bm{x}_{o}-\bm{x}^{\mathrm{min}}, goy=3(𝒚max𝒚o)g^{y}_{o}=3(\bm{y}^{\mathrm{max}}-\bm{y}_{o}). Figure 2 shows that all units belonging to W𝒯\partial^{W}\mathcal{T} (including generic A,DA,D) are projected onto S𝒯\partial^{S}\mathcal{T}, and the only units that are projected onto themselves, and therefore have a score of 1, are the strongly efficient units from the line segment BC. Therefore, the property (ID) is satisfied. On the other hand, some units belonging to int𝒯\operatorname{int}\mathcal{T} (e.g., F,GF,G) are projected onto W𝒯\partial^{W}\mathcal{T} and therefore the property (PR) fails.666This example can be adapted to a more general case of mm inputs and one output, or one input and ss outputs over the so-called ‘ideal technology sets’ introduced later in Section 4. Note that a similar situation occurs for the HDF-g model with the Portela et al.’s range directions (4).

3.2 (MO) implies (PR)

Until now, we have considered the right-hand side of (2b) and (2c) as a (vector) function of θ\theta for a fixed (𝒙o,𝒚o)(\bm{x}_{o},\bm{y}_{o}), which yields a path passing through the assessed unit (𝒙o,𝒚o)𝒯(\bm{x}_{o},\bm{y}_{o})\in\mathcal{T}; see (3). To obtain a link between (MO) and (PR), it is helpful to consider the right-hand sides of (2b) and (2c) as functions of the assessed unit (𝒙o,𝒚o)(\bm{x}_{o},\bm{y}_{o}) for a fixed θ\theta. Observe that the right-hand sides in (3) depend explicitly on (𝒙o,𝒚o)(\bm{x}_{o},\bm{y}_{o}) but the dependence may also be implicit via the directions 𝒈o=(𝒈ox,𝒈oy)\bm{g}_{o}=(\bm{g}_{o}^{x},\bm{g}_{o}^{y}).

Definition 3.3.

For each fixed θ¯𝒟\bar{\theta}\in\cal D we define the path-flow mapping ϕ(𝐱o,𝐲o;θ¯)\bm{\phi}(\bm{x}_{o},\bm{y}_{o};\bar{\theta}) as the function that maps each unit (𝐱o,𝐲o)𝒯(\bm{x}_{o},\bm{y}_{o})\in\mathcal{T} to the point (ϕox(θ¯),(ϕoy(θ¯))m+s(\bm{\phi}_{o}^{x}(\bar{\theta}),(\bm{\phi}_{o}^{y}(\bar{\theta}))\in\mathbb{R}^{m+s}.

Theorem 3.4.

If there exists (𝐱o,𝐲o)𝒯(\bm{x}_{o},\bm{y}_{o})\in\mathcal{T} such that (ϕox(θo),ϕoy(θo))W𝒯(\bm{\phi}_{o}^{x}(\theta_{o}^{*}),\bm{\phi}_{o}^{y}(\theta_{o}^{*}))\in\partial^{W}\mathcal{T} and the path-flow mapping ϕ(𝐱o,𝐲o;θo)\bm{\phi}(\bm{x}_{o},\bm{y}_{o};\theta_{o}^{*}) is continuous at (𝐱o,𝐲o)(\bm{x}_{o},\bm{y}_{o}), then the GS model does not meet the property of strict monotonicity (MO).

Proof.

Since (ϕox(θo),ϕoy(θo))W𝒯(\bm{\phi}_{o}^{x}(\theta_{o}^{*}),\bm{\phi}_{o}^{y}(\theta_{o}^{*}))\in\partial^{W}\mathcal{T}, by Theorem 3.1 there exists an optimal solution (𝝀o,θo)(\bm{\lambda}^{*}_{o},\theta^{*}_{o}) of (2) and (𝒔x,𝒔y)𝟎(\bm{s}^{x*},\bm{s}^{y*})\geq\bm{0} such that (𝒔x,𝒔y)𝟎(\bm{s}^{x*},\bm{s}^{y*})\neq\bm{0} and

𝑿𝝀+𝒔x=ϕox(θo),𝒀𝝀𝒔y=ϕoy(θo).\bm{X}\bm{\lambda}^{*}+\bm{s}^{x*}=\bm{\phi}_{o}^{x}(\theta^{*}_{o}),\quad\bm{Y}\bm{\lambda}^{*}-\bm{s}^{y*}=\bm{\phi}_{o}^{y}(\theta^{*}_{o}).\\ (5)

Define

𝒮={(𝒙o𝒔x,𝒚o+𝒔y)| 0(𝒔x,𝒔y)(𝒔x,𝒔y)}.{\cal S}=\{(\bm{x}_{o}-\bm{s}^{x},\bm{y}_{o}+\bm{s}^{y})\ |\ 0\lneqq(\bm{s}^{x},\bm{s}^{y})\leq(\bm{s}^{x*},\bm{s}^{y*})\}.

Each unit in 𝒮\cal S is distinct from and dominates (𝒙o,𝒚o)(\bm{x}_{o},\bm{y}_{o}). Since (𝒙o,𝒚o)=ϕo(1)(\bm{x}_{o},\bm{y}_{o})=\bm{\phi}_{o}(1) and θo1\theta^{*}_{o}\leq 1, the monotonicity of the path in θ\theta implies that ϕox(θo)𝒙o\bm{\phi}_{o}^{x}(\theta^{*}_{o})\leq\bm{x}_{o} and ϕoy(θo)𝒚o\bm{\phi}_{o}^{y}(\theta^{*}_{o})\geq\bm{y}_{o}. Therefore, the equations in (5) imply that (𝒙osx,𝒚o+sy)𝒯(\bm{x}_{o}-s^{x*},\bm{y}_{o}+s^{y*})\in\mathcal{T} and therefore also 𝒮𝒯{\cal S}\subset\mathcal{T}. Note that 𝒮\cal S contains points that are arbitrarily close to (𝒙o,𝒚o)(\bm{x}_{o},\bm{y}_{o}). Now, by the continuity assumption, for ϵ:=mini,r{six>0,sry>0}\epsilon:=\min_{i,r}\{s^{x*}_{i}>0,s^{y*}_{r}>0\} there exists a unit (𝒙p,𝒚p)𝒮(\bm{x}_{p},\bm{y}_{p})\in\cal S such that ϕo(θo)ϕq(θo)<ϵ||\bm{\phi}_{o}(\theta^{*}_{o})-\bm{\phi}_{q}(\theta^{*}_{o})||<\epsilon. This implies that the vector inequalities |ϕox(θo)ϕpx(θo)|𝒔x|\bm{\phi}^{x}_{o}(\theta^{*}_{o})-\bm{\phi}^{x}_{p}(\theta^{*}_{o})|\leq\bm{s}^{x*} and |ϕoy(θo)ϕpy(θo)|𝒔y|\bm{\phi}^{y}_{o}(\theta^{*}_{o})-\bm{\phi}^{y}_{p}(\theta^{*}_{o})|\leq\bm{s}^{y*} hold. Therefore, one also has

ϕox(θo)𝒔xϕpx(θo),ϕo(θo)+𝒔yϕpy(θo).\bm{\phi}_{o}^{x}(\theta^{*}_{o})-\bm{s}^{x*}\leq\bm{\phi}_{p}^{x}(\theta^{*}_{o}),\quad\bm{\phi}_{o}(\theta^{*}_{o})+\bm{s}^{y*}\geq\bm{\phi}_{p}^{y}(\theta^{*}_{o}). (6)

By substituting (6) into (5) we get

𝑿𝝀ϕpx(θo),𝒀𝝀ϕpy(θo).\bm{X}\bm{\lambda}^{*}\leq\bm{\phi}_{p}^{x}(\theta^{*}_{o}),\quad\bm{Y}\bm{\lambda}^{*}\geq\bm{\phi}^{y}_{p}(\theta^{*}_{o}). (7)

This yields that (𝝀,θo)(\bm{\lambda}^{*},\theta_{o}^{*}) is a feasible solution for (GS)p, and hence one has θpθo\theta_{p}^{*}\leq\theta_{o}^{*}. The strict monotonicity is violated because (𝒙p,𝒚p)(\bm{x}_{p},\bm{y}_{p}) dominates and is different from (𝒙o,𝒚o)(\bm{x}_{o},\bm{y}_{o}) but θpθo\theta^{*}_{p}\leq\theta_{o}^{*}. ∎

Observe that Theorem 3.4 formulates a local property: if a point in 𝒯\mathcal{T} is not projected onto S𝒯\partial^{S}\mathcal{T}, then one can find another point such that the pair fails to maintain strict monotonicity. Thus, the “local” formulation offers more flexibility than the following “global” corollary.

Corollary 3.5.

Assume that the path-flow mapping ϕ(𝐱o,𝐲o;θ)\bm{\phi}(\bm{x}_{o},\bm{y}_{o};\theta) is continuous for each fixed θ𝒟\theta\in\mathcal{D} on 𝒯\mathcal{T}. Then (MO) \Rightarrow (PR) holds.

3.3 (PR) implies (MO)

The monotonicity of GS models is related to the monotonicity of the path-flow mapping ϕ(𝒙o,𝒚o;θ)\bm{\phi}(\bm{x}_{o},\bm{y}_{o};\theta), which we formalise next.

Definition 3.6.

We say that the path-flow mapping ϕ(𝐱o,𝐲o;θ¯)\bm{\phi}(\bm{x}_{o},\bm{y}_{o};\bar{\theta}): (𝐱o,𝐲o)ϕo(θ¯)(\bm{x}_{o},\bm{y}_{o})\to\bm{\phi}_{o}(\bar{\theta}) is monotone on 𝒯\mathcal{T} at θ¯𝒟\bar{\theta}\in\mathcal{D} if for any two units (𝐱o,𝐲o)(\bm{x}_{o},\bm{y}_{o}), (𝐱p,𝐲p)(\bm{x}_{p},\bm{y}_{p}) in 𝒯\mathcal{T}, one has

(𝒙o,𝒚o)(𝒙p,𝒚p)ϕo(θ¯)ϕp(θ¯).(\bm{x}_{o},\bm{y}_{o})\succsim(\bm{x}_{p},\bm{y}_{p})\ \Rightarrow\ \bm{\phi}_{o}(\bar{\theta})\succsim\bm{\phi}_{p}(\bar{\theta}). (8)

If, in addition,

(𝒙o,𝒚o)(𝒙p,𝒚p)ϕo(θ¯)ϕp(θ¯),(\bm{x}_{o},\bm{y}_{o})\succnsim(\bm{x}_{p},\bm{y}_{p})\ \Rightarrow\ \bm{\phi}_{o}(\bar{\theta})\succnsim\bm{\phi}_{p}(\bar{\theta}), (9)

we say that the path-flow mapping is strictly monotone on 𝒯\mathcal{T} at θ¯𝒟\bar{\theta}\in\mathcal{D}.

Remark 3.7.

If for each i{1,,m}i\in\{1,\dots,m\} the ii-th component [ϕx(𝒙o,𝒚o;θ¯)]i[\bm{\phi}^{x}(\bm{x}_{o},\bm{y}_{o};\bar{\theta})]_{i} of ϕx(𝒙o,𝒚o;θ¯)\bm{\phi}^{x}(\bm{x}_{o},\bm{y}_{o};\bar{\theta}) depends only on xiox_{io}, and for each r{1,,s}r\in\{1,\dots,s\} component [ϕy(𝒙o,𝒚o;θ¯)]r[\bm{\phi}^{y}(\bm{x}_{o},\bm{y}_{o};\bar{\theta})]_{r} depends only on yroy_{ro}, then the monotonicity property in Definition 3.6 simply means that [ϕx(𝒙o,𝒚o;θ¯)]i[\bm{\phi}^{x}(\bar{\bm{x}_{o},\bm{y}_{o};\theta})]_{i} and [ϕy(𝒙o,𝒚o;θ¯)]r[\bm{\phi}^{y}(\bm{x}_{o},\bm{y}_{o};\bar{\theta})]_{r} are nondecreasing in xiox_{io} and yroy_{ro}, respectively. On the other hand, the strict monotonicity property in Definition 3.6 means that [ϕx(𝒙o,𝒚o;θ¯)]i[\bm{\phi}^{x}(\bm{x}_{o},\bm{y}_{o};\bar{\theta})]_{i} and [ϕy(𝒙o,𝒚o;θ¯)]r[\bm{\phi}^{y}(\bm{x}_{o},\bm{y}_{o};\bar{\theta})]_{r} are increasing in 𝒙io\bm{x}_{io} and 𝒚ro\bm{y}_{ro}, respectively.

Remark 3.8.

Note that if 𝒈o\bm{g}_{o} does not depend on (𝒙o,𝒚o)(\bm{x}_{o},\bm{y}_{o}), then the path-flow mapping ϕ(𝒙o,𝒚o;θ)\bm{\phi}(\bm{x}_{o},\bm{y}_{o};\theta) is monotone on 𝒯\mathcal{T} for any choice of ψx\psi^{x} and ψy\psi^{y} at any θ𝒟\theta\in\mathcal{D}.

The next lemma will be useful for further analysis of strict monotonicity.

Lemma 3.9 (Halická et al., 2024, Lemma 3).

Let (𝐱o,𝐲o)(\bm{x}_{o},\bm{y}_{o}) and (𝐱p,𝐲p)(\bm{x}_{p},\bm{y}_{p}) be two units in 𝒯\mathcal{T} with the corresponding optimal values θo\theta^{*}_{o} and θp\theta^{*}_{p}. If ϕo(θo)ϕp(θo)\bm{\phi}_{o}(\theta^{*}_{o})\succsim\bm{\phi}_{p}(\theta^{*}_{o}), then each optimal solution (θo,𝛌o)(\theta_{o}^{*},\bm{\lambda}_{o}^{*}) of (GS)o is a feasible solution of (GS)p and hence θpθo\theta^{*}_{p}\leq\theta^{*}_{o}.

Let us note that the assumption of monotonicity of the path-flow mapping ensures the property of the so-called weak monotonicity of each model in the GS scheme as indicated by Lemma 3.9. To establish the strict monotonicity (MO) we will need the strict monotonicity of the path-flow mapping and property (PR). First, we formulate a local version of the assertion.

Theorem 3.10.

Suppose that units (𝐱o,𝐲o)(\bm{x}_{o},\bm{y}_{o}) and (𝐱p,𝐲p)(\bm{x}_{p},\bm{y}_{p}) in 𝒯\mathcal{T} with the efficiency scores θo\theta^{*}_{o} and θp\theta^{*}_{p}, respectively, are projected onto the strongly efficient frontier. Suppose also that (𝐱o,𝐲o)(𝐱p,𝐲p)(\bm{x}_{o},\bm{y}_{o})\succnsim(\bm{x}_{p},\bm{y}_{p}) and that the path-flow mapping is strictly monotone at θo\theta_{o}^{*}. Then θo>θp\theta_{o}^{*}>\theta^{*}_{p}.

Proof.

From the assumptions of the theorem it follows that ϕo(θo)ϕp(θo)\bm{\phi}_{o}(\theta_{o}^{*})\succnsim\bm{\phi}_{p}(\theta_{o}^{*}). This, by Lemma 3.9 implies that θoθp\theta^{*}_{o}\geq\theta^{*}_{p}. Assume, by contradiction, that θp=θo\theta^{*}_{p}=\theta^{*}_{o}. Denote by (𝝀o,θo)(\bm{\lambda}_{o}^{*},\theta_{o}^{*}) an optimal solution for (GS)o. Obviously, the same pair also represents an optimal solution for (GS)p. Since the model projects (𝒙o,𝒚o)(\bm{x}_{o},\bm{y}_{o}) and (𝒙p,𝒚p)(\bm{x}_{p},\bm{y}_{p}) onto the strongly efficient frontier, by Theorem 3.1 the inequalities (2b) and (2c) in (GS)o and (GS)p are binding, that is,

𝑿𝝀o\displaystyle\bm{X}\bm{\lambda}_{o}^{*} =ϕox(θo)\displaystyle{}=\bm{\phi}_{o}^{x}(\theta_{o}^{*})  and 𝒀𝝀o\displaystyle\bm{Y}\bm{\lambda}_{o}^{*} =ϕoy(θo).\displaystyle{}=\bm{\phi}^{y}_{o}(\theta^{*}_{o}). (10)
𝑿𝝀o=ϕpx(θo) and 𝒀𝝀o=ϕpy(θo).\bm{X}\bm{\lambda}_{o}^{*}=\bm{\phi}_{p}^{x}(\theta_{o}^{*})\text{\quad and \quad}\bm{Y}\bm{\lambda}_{o}^{*}=\bm{\phi}_{p}^{y}(\theta^{*}_{o}).\ (11)

By comparing the right-hand sides of (10) and (11), we get

ϕox(θo)=ϕpx(θo),ϕoy(θo)=ϕpy(θo).\bm{\phi}_{o}^{x}(\theta_{o}^{*})=\bm{\phi}^{x}_{p}(\theta_{o}^{*}),\quad\bm{\phi}^{y}_{o}(\theta^{*}_{o})=\bm{\phi}_{p}^{y}(\theta^{*}_{o}). (12)

This is in contradiction with the assumption of strict monotonicity of the path-flow mapping at θo\theta_{o}^{*}, according to which ϕo(θo)ϕp(θo)\bm{\phi}_{o}(\theta_{o}^{*})\succnsim\bm{\phi}_{p}(\theta_{o}^{*}) holds. ∎

Theorem 3.10, too, is formulated locally, i.e., if two ordered units are both projected onto S𝒯\partial^{S}\mathcal{T}, then their efficiency scores are strictly ordered. The local formulation once again offers more flexibility than the following global corollary.

Corollary 3.11.

Denote by 𝒟{\cal D}^{*} the set of efficiency scores θo\theta_{o}^{*} achievable by units (𝐱o,𝐲o)𝒯(\bm{x}_{o},\bm{y}_{o})\in\mathcal{T}. Assume that the path-flow mapping ϕ(𝐱o,𝐲o;θ)\bm{\phi}(\bm{x}_{o},\bm{y}_{o};\theta^{*}) is strictly monotone on 𝒯\mathcal{T} for all θ𝒟\theta^{*}\in{\cal D}^{*}. Then (PR) \Rightarrow (MO) holds.

The results of this section are summarised in Figure 3.

MOPRIDpath-flow continuitypath-flow monotonicity

Figure 3: Hierarchy of three properties in the models of the GS scheme.

4 Specific technology sets and direction vectors ensuring (PR)

We recall that the GS models fail all three properties (ID), (PR), and (MO) when the directions are positive. Nonetheless, there exist simple examples, where the GS models with specific directions over specific technology sets project every unit from 𝒯\mathcal{T} onto the strongly efficient frontier.

Refer to caption
Refer to caption
Figure 4: Illustration of Example 4.1. GS models in the case of a technology set 𝒯\mathcal{T} with one input and one output that project all units on S𝒯\partial^{S}\mathcal{T}. Left: DDF-g paths with the range directions (4). Right: HDF-g paths with the modified range directions. In both cases, the paths pass through the ideal point (𝒙min,𝒚max)(\bm{x}^{\mathrm{min}},\bm{y}^{\mathrm{max}}).
Example 4.1.

Consider a one-input, one-output technology set generated by three units A,B,CA,B,C as depicted in Figure 4. DDF-g paths (where ψx(θ)=θ\psi^{x}(\theta)=\theta and ψy(θ)=2θ\psi^{y}(\theta)=2-\theta) with the range directions 𝒈ox=𝒙o𝒙min\bm{g}^{x}_{o}=\bm{x}_{o}-\bm{x}^{\mathrm{min}} and 𝒈oy=𝒚maxyo\bm{g}^{y}_{o}=\bm{y}^{\mathrm{max}}-y_{o} connect the units from 𝒯\mathcal{T} with the ideal point (1,5)(1,5) for this technology as seen in the left diagram of Figure 4. In the case of HDF-g paths (where ψx(θ)=θ\psi^{x}(\theta)=\theta and ψy(θ)=1θ\psi^{y}(\theta)=\frac{1}{\theta}) the range directions were modified to 𝒈ox=2(𝒙o𝒙min)\bm{g}^{x}_{o}=2(\bm{x}_{o}-\bm{x}^{\mathrm{min}}) and 𝒈oy=𝒚maxyo\bm{g}^{y}_{o}=\bm{y}^{\mathrm{max}}-y_{o} to pass through the ideal point (1,5)(1,5) as shown in the right panel of Figure 4. In both cases, the models project units onto the strongly efficient boundary consisting of the line segments ABAB and BCBC.

In this section, we will generalise the two GS models presented in Example 4.1 with the aim of fully characterising those technology sets and directions under which the GS model yields the strong efficiency of projections. In doing so, the ideal point (𝒙min,𝒚max)(\bm{x}^{\mathrm{min}},\bm{y}^{\mathrm{max}}) of the technology set 𝒯\mathcal{T} will play an important role.

4.1 GS range directions

As shown in Figure 4, the paths in Example 4.1 connect units from 𝒯\mathcal{T} to the ideal point (𝒙min,𝒚max)(\bm{x}^{\mathrm{min}},\bm{y}^{\mathrm{max}}), which ensures that the units are projected onto the strongly efficient boundary. Inspired by this example, we now characterise those directions gog_{o} for which the corresponding path ϕo\bm{\phi}_{o} for (𝒙o,𝒚o)(\bm{x}_{o},\bm{y}_{o}) passes through the ideal point (𝒙min,𝒚max)(\bm{x}^{\mathrm{min}},\bm{y}^{\mathrm{max}}) at an arbitrarily chosen θmin[0,1)\theta_{\mathrm{min}}\in[0,1). The choice of common θmin\theta_{\mathrm{min}} for all units in 𝒯\mathcal{T} then allows us to obtain comparable scores whose common lower bound is θmin\theta_{\mathrm{min}}. The proof follows by a simple calculation and is therefore omitted.

Recall that (𝒙min,𝒚max)(\bm{x}^{\mathrm{min}},\bm{y}^{\mathrm{max}}) denotes the ideal point of 𝒯\mathcal{T} and that for fixed ψx\psi^{x} and ψy\psi^{y}, the path ϕo\phi_{o} is determined by the evaluated unit (xo,yo)(x_{o},y_{o}) and the direction 𝒈o=(𝒈ox,𝒈oy)\bm{g}_{o}=(\bm{g}_{o}^{x},\bm{g}_{o}^{y}) as detailed in Subsection 2.2.

Lemma 4.2.

Fix ψx\psi^{x}, ψy\psi^{y} satisfying assumptions (A1)(A4) and 𝒯\mathcal{T} of the form (1). For θmin[0,1)𝒟\theta_{\mathrm{min}}\in[0,1)\cap\mathcal{D} and (𝐱o,𝐲o)𝒯{(𝐱min,𝐲max)}(\bm{x}_{o},\bm{y}_{o})\in\mathcal{T}\setminus\{(\bm{x}^{\mathrm{min}},\bm{y}^{\mathrm{max}})\}, the following are equivalent.

  1. (i)

    The path ϕo\bm{\phi}_{o} runs through the ideal point (𝒙min,𝒚max)(\bm{x}^{\mathrm{min}},\bm{y}^{\mathrm{max}}) at θ=θmin\theta=\theta_{\mathrm{min}}, i.e., ϕo(θmin)=(𝒙min,𝒚max)\bm{\phi}_{o}({\theta}_{\mathrm{min}})=(\bm{x}^{\mathrm{min}},\bm{y}^{\mathrm{max}}).

  2. (ii)

    The direction vector 𝒈o=(𝒈ox,𝒈oy)\bm{g}_{o}=(\bm{g}_{o}^{x},\bm{g}_{o}^{y}) of ϕo\bm{\phi}_{o} satisfies

    𝒈ox=𝒙o𝒙min1ψx(θmin),𝒈oy=𝒚max𝒚oψy(θmin)1.\bm{g}_{o}^{x}=\frac{\bm{x}_{o}-\bm{x}^{\mathrm{min}}}{1-\psi^{x}({\theta}_{\mathrm{min}})},\quad\bm{g}_{o}^{y}=\frac{\bm{y}^{\mathrm{max}}-\bm{y}_{o}}{\psi^{y}({\theta}_{\mathrm{min}})-1}. (13)

Moreover, if either of these conditions holds, then θminθo\theta_{\mathrm{min}}\leq\theta^{*}_{o}.

We will call the direction vector defined by (13) the GS range direction.

In the case of DDF-g models, we have ψx(θ)=θ\psi^{x}(\theta)=\theta, ψy(θ)=2θ\psi^{y}(\theta)=2-\theta, and 𝒟=\mathcal{D}=\mathbb{R}. On selecting θmin=0\theta_{\mathrm{min}}=0, the GS range directions coincide with the range directions (4) of Portela et al. (2004). In the case of HDF-g models, we have ψx(θ)=θ\psi^{x}(\theta)=\theta, ψy(θ)=1θ\psi^{y}(\theta)=\frac{1}{\theta}, and 𝒟=(0,)\mathcal{D}=(0,\infty), hence θmin\theta_{\mathrm{min}} must be chosen positive. The value θmin=1/2\theta_{\mathrm{min}}=1/2 yields 𝒈ox=2(𝒙o𝒙min)\bm{g}^{x}_{o}=2(\bm{x}_{o}-\bm{x}_{\mathrm{min}}) and 𝒈oy=𝒚max𝒚o\bm{g}^{y}_{o}=\bm{y}^{\mathrm{max}}-\bm{y}_{o}.

Remark 4.3.

For (𝒙o,𝒚o)𝒯{(𝒙min,𝒚max)}(\bm{x}_{o},\bm{y}_{o})\in\mathcal{T}\setminus\{(\bm{x}^{\mathrm{min}},\bm{y}^{\mathrm{max}})\}, the GS range direction for (𝒙o,𝒚o)(\bm{x}_{o},\bm{y}_{o}) satisfies 𝒈o=(𝒈ox,𝒈oy)0\bm{g}_{o}=(\bm{g}_{o}^{x},\bm{g}_{o}^{y})\gneqq 0 and therefore the programme (GS)o(GS)_{o} is well defined. On the other hand, if (𝒙min,𝒚max)𝒯(\bm{x}^{\mathrm{min}},\bm{y}^{\mathrm{max}})\in\mathcal{T}, then the GS range direction for (𝒙min,𝒚max)=(𝒙o,𝒚o)(\bm{x}^{\mathrm{min}},\bm{y}^{\mathrm{max}})=(\bm{x}_{o},\bm{y}_{o}) vanishes and the corresponding (GS)o(GS)_{o} programme is not well defined. However, since (𝒙min,𝒚max)=(𝒙o,𝒚o)(\bm{x}^{\mathrm{min}},\bm{y}^{\mathrm{max}})=(\bm{x}_{o},\bm{y}_{o}) is the (only) strongly efficient unit in 𝒯\mathcal{T}, we can set θo=1\theta_{o}^{*}=1 by definition for this point.

The next lemma shows that the choice of the GS range direction ensures that both ϕox\bm{\phi}_{o}^{x} and ϕoy\bm{\phi}_{o}^{y}, the xx and yy components of the path ϕo\bm{\phi}_{o}, passes through the relative interior of the line segments connecting 𝒙o\bm{x}_{o} with 𝒙min\bm{x}^{\mathrm{min}} and 𝒚o\bm{y}_{o} with 𝒚max\bm{y}^{\mathrm{max}}, respectively.

Lemma 4.4.

Let θmin[0,1)𝒟\theta_{\mathrm{min}}\in[0,1)\cap\mathcal{D}, (𝐱min,𝐲max)𝒯(\bm{x}^{\mathrm{min}},\bm{y}^{\mathrm{max}})\notin\mathcal{T} and θo<1\theta_{o}^{*}<1 be the optimal value of (𝐱o,𝐲o)𝒯(\bm{x}_{o},\bm{y}_{o})\in\mathcal{T} in the (GS)o(GS)_{o} model with the GS range direction. Then, for each θ^[θo,1)\hat{\theta}\in[\theta_{o}^{*},1) there exist αx(θ^)(0,1)\alpha^{x}(\hat{\theta})\in(0,1) and αy(θ^)(0,1)\alpha^{y}(\hat{\theta})\in(0,1) such that

ϕox(θ^)=(1αx(θ^))𝒙o+αx(θ^)𝒙m,ϕoy(θ^)=(1αy(θ^))𝒚o+αy(θ^)𝒚M.\bm{\phi}_{o}^{x}(\hat{\theta})=(1-\alpha^{x}(\hat{\theta}))\bm{x}_{o}+\alpha^{x}(\hat{\theta})\bm{x}^{m},\quad\bm{\phi}_{o}^{y}(\hat{\theta})=(1-\alpha^{y}(\hat{\theta}))\bm{y}_{o}+\alpha^{y}(\hat{\theta})\bm{y}^{M}. (14)
Proof.

For each θ^[θo,1)\hat{\theta}\in[\theta_{o}^{*},1) we have (𝒙min,𝒚max)=ϕo(θmin)ϕo(θ)ϕo(θ^)ϕo(1)=(𝒙o,𝒚o)(\bm{x}^{\mathrm{min}},\bm{y}^{\mathrm{max}})=\bm{\phi}_{o}(\theta_{\mathrm{min}})\succnsim\bm{\phi}_{o}(\theta^{*})\succeq\bm{\phi}_{o}(\hat{\theta})\succnsim\bm{\phi}_{o}(1)=(\bm{x}_{o},\bm{y}_{o}) and hence ψx(θmin)<ψx(θo)ψx(θ^)<1\psi^{x}(\theta_{\mathrm{min}})<\psi^{x}(\theta^{*}_{o})\leq\psi^{x}(\hat{\theta})<1 and ψy(θmin)>ψy(θo)ψy(θ^)>1\psi^{y}(\theta_{\mathrm{min}})>\psi^{y}(\theta^{*}_{o})\geq\psi^{y}(\hat{\theta})>1. Therefore,

αx(θ^):=1ψx(θ^)1ψx(θmin)(0,1),αy(θ^):=ψy(θ^)1ψy(θmin)1(0,1).\alpha^{x}(\hat{\theta}):=\frac{1-\psi^{x}(\hat{\theta})}{1-\psi^{x}({\theta}_{\mathrm{min}})}\in(0,1),\quad\alpha^{y}(\hat{\theta}):=\frac{\psi^{y}(\hat{\theta})-1}{\psi^{y}(\theta_{\mathrm{min}})-1}\in(0,1). (15)

Finally, a simple computation yields

ϕox(θo)=𝒙oαx(θ^)𝒈ox=(1αx(θ^))𝒙o+αx(θ^)𝒙min,\displaystyle\bm{\phi}_{o}^{x}(\theta^{*}_{o})=\bm{x}_{o}-\alpha^{x}(\hat{\theta})\bm{g}_{o}^{x}=(1-\alpha^{x}(\hat{\theta}))\bm{x}_{o}+\alpha^{x}(\hat{\theta})\bm{x}^{\mathrm{min}},
ϕoy(θo)=𝒚o+αy(θ^)𝒈oy=(1αy(θ^))𝒚o+αy(θ^)𝒚max,\displaystyle\bm{\phi}_{o}^{y}(\theta^{*}_{o})=\bm{y}_{o}+\alpha^{y}(\hat{\theta})\bm{g}_{o}^{y}=(1-\alpha^{y}(\hat{\theta}))\bm{y}_{o}+\alpha^{y}(\hat{\theta})\bm{y}^{\mathrm{max}},

from which the claim of the lemma follows. ∎

Figure 4 illustrates that the paths containing the ideal point of a given technology set project all units onto the strongly efficient frontier. This property, whilst true in two-dimensional technology sets, no longer holds for all higher-dimensional technologies. Indeed, the two-input, one-output example in Halická et al. (2024, Example 2 and Figure 3) shows that the DDF-g model with the GS range directions projects a unit from W𝒯\partial^{W}\mathcal{T} onto itself. Halická et al. (2024) conducted further numerical experiments on real data consisting of 30 units, 2 inputs, and 2 outputs, with some negative output values. Their analysis contrasts the so-called proportional directions 𝒈ox=|𝒙o|\bm{g}_{o}^{x}=|\bm{x}_{o}|, 𝒈oy=|𝒚o|\bm{g}_{o}^{y}=|\bm{y}_{o}| of Kerstens and Van de Woestyne (2011) with the GS range directions for four different choices of the path function ψ\psi. In their analysis, the GS range directions yield a significantly smaller proportion of units not projected onto the strongly efficient frontier but do not eliminate such units completely. Therefore, even though the GS range directions ensure the passage of all paths through the ideal point, this alone is not enough to guarantee that units are projected onto the strongly efficient frontier in the case of higher-dimensional technology sets.

4.2 Ideal technology sets

It is obvious that to ensure the fulfilment of (PR) using GS range directions, it will be necessary to limit ourselves to a certain subclass of technology sets. To specify the properties of the subclass, it turned out that the facial structure of the technology as a polyhedral set will play an important role. Some characteristics of the facial structure of (10), as well as all necessary terms, are included in Appendix A. The results of the current section will finally show that the GS range directions will guarantee projecting onto strongly efficient frontier if and only if we restrict ourselves to technology sets specified in the following definition.

Definition 4.5.

A technology set 𝒯\mathcal{T} of the form (1) is called an ideal technology if each unit (𝐱,𝐲)(\bm{x},\bm{y}) from the weakly efficient boundary of 𝒯\mathcal{T} has at least one component in common with the ideal point (𝐱min,𝐲max)(\bm{x}^{\mathrm{min}},\bm{y}^{\mathrm{max}}) of 𝒯\mathcal{T}, i.e.,

(𝒙,𝒚)W𝒯:(i{1,,m}:xi=ximin)or(r{1,,s}:yr=yrmax).\forall\ (\bm{x},\bm{y})\in\partial^{W}\mathcal{T}:\ (\exists\ i\in\{1,...,m\}:\ x_{i}=x_{i}^{\mathrm{min}})\ \hbox{or}\ (\exists\ r\in\{1,...,s\}:\ y_{r}=y_{r}^{\mathrm{max}}). (16)

It is easy to see that the technology sets (1) with only one input and one output, as in Example 4.1, are ideal. A trivial example of an ideal technology in any dimension is the trivial technology, i.e., the technology containing its ideal point (𝒙min,𝒚max)(\bm{x}^{\mathrm{min}},\bm{y}^{\mathrm{max}}). The trivial technology is a polyhedral cone with the vertex at the ideal point. Its facets — faces of full dimension — are parallel to the orthant hyperplanes. The vertex is the only strongly efficient unit of the trivial technology set.

We now provide examples of ideal and non-ideal technology sets.

Example 4.6.

Consider two inputs and one output example with three DMU‘s: A=(3,2,4)A=(3,2,4), B=(2,3,4)B=(2,3,4), C=(2,2,2)C=(2,2,2), see Figure 5. All three units A,BA,B and CC are strongly efficient and the triangle ABCABC forms a strongly efficient frontier for the technology generated by these three DMU’s. Apparently I=(𝒙min,𝒚max)=(2,2,4)I=(\bm{x}^{\mathrm{min}},\bm{y}^{\mathrm{max}})=(2,2,4). It is seen that the closure of W𝒯\partial^{W}\mathcal{T} consists of three facets, each determined by one of the hyperplanes x1=2x_{1}=2, x2=2x_{2}=2 or y=4y=4, and thus this technology set satisfies the property (16).

Refer to caption
Figure 5: The ideal technology set from Example 4.6. It contains just m+s=3m+s=3 unbounded (blue) edges.
Example 4.7.

Consider again two inputs and one output example with another three DMU‘s: A=(3,2,2)A=(3,2,2), B=(2,3,2)B=(2,3,2), C=(3,3,4)C=(3,3,4), see Figure 6. Also in this example all three points are strongly efficient and triangle ABCABC forms a strongly efficient frontier for the technology generated these three DMU’s. Apparently I=(𝒙min,𝒚max)=(2,2,4)I=(\bm{x}^{\mathrm{min}},\bm{y}^{\mathrm{max}})=(2,2,4). In this case, the closure of W𝒯\partial^{W}\mathcal{T} consists of six facets, where only three of them correspond to hyperplanes x1=2x_{1}=2, x2=2x_{2}=2 or y=4y=4. Points (x1,x2,y)(x_{1},x_{2},y) from (the relative) interiors of the other three facets satisfy x1>x1min=2x_{1}>x_{1}^{\mathrm{min}}=2, x2>x2min=2x_{2}>x_{2}^{\mathrm{min}}=2 and y<ymax=4y<y^{\mathrm{max}}=4 and hence property (16) is not satisfied.

Refer to caption
Figure 6: The non ideal technology set from Example 4.7. It contains six unbounded (blue) edges.

Theorem B.1 provides equivalent characterisations of ideal technologies. It shows that every ideal technology is obtained by ‘tapering off’ some part of the trivial technology near the ideal point by means of a finite number of hyperplanes in such a way that all m+sm+s edges of the trivial technology are preserved sufficiently far from its vertex (Figure 5). A technology is not ideal precisely when at least one of the m+sm+s edges of the trivial technology is completely missing. Figure 5 gives an example of a non-ideal technology, where all three edges of the induced trivial technology are absent. Item iv of Theorem B.1 also provides a tool for the practical recognition of whether a given set of DMUs generates an ideal technology or not.

The following lemma is an important instrument for developing the main results in Subsection 4.3. It characterises the relative boundary of unbounded faces of an ideal technology 𝒯\mathcal{T}. Its proof is found in C. The necessary material describing the facial structure of 𝒯\mathcal{T} as a polyhedral set is collected in A and B.

Lemma 4.8.

Let 𝒯\mathcal{T} be an ideal technology set. For (𝐱^,𝐲^)𝒯(\hat{\bm{x}},\hat{\bm{y}})\in\mathcal{T} we introduce index sets I={i:x^i=ximin}I=\{i:\hat{x}_{i}=x_{i}^{\mathrm{min}}\} and R={r:y^r=yrmin}R=\{r:\hat{y}_{r}=y_{r}^{\mathrm{min}}\} and denote their cardinality by |I||I| and |R||R|, respectively. Then (𝐱^,𝐲^)(\hat{\bm{x}},\hat{\bm{y}}) belongs either to S𝒯\partial^{S}\mathcal{T}, or to the relative interior of the (m+s|I||R|)(m+s-|I|-|R|) dimensional face of 𝒯\mathcal{T} given by

IR:={(x,y)𝒯:xi=ximin,iI;yr=yrmin,rR}.\mathcal{\mathcal{F}}_{IR}:=\{(x,y)\in\mathcal{T}:x_{i}=x_{i}^{\mathrm{min}},i\in I;\ y_{r}=y_{r}^{\mathrm{min}},r\in R\}. (17)

4.3 GS models with GS range directions over ideal technology sets

The next theorem shows that the GS models with GS range directions meet the property (PR) for any feasible ψ\psi if and only if 𝒯\mathcal{T} is ideal.

Theorem 4.9.

In the GS model with the GS range directions (13), the following are equivalent.

  1. (i)

    All units in 𝒯\mathcal{T} are projected onto the strongly efficient frontier.

  2. (ii)

    𝒯\mathcal{T} is an ideal technology set.

Proof.

i \Rightarrow ii Arguing by contradiction, assume that 𝒯\mathcal{T} is not an ideal technology. This means that 𝒯\mathcal{T} does not have the property (16) and therefore there exists (𝒙o,𝒚o)W𝒯(\bm{x}_{o},\bm{y}_{o})\in\partial^{W}\mathcal{T} such that 𝒙o>𝒙min\bm{x}_{o}>\bm{x}^{\mathrm{min}} and 𝒚o<𝒚max\bm{y}_{o}<\bm{y}^{\mathrm{max}}. Then the formulas in (13) imply go>0g_{o}>0. Now, by Theorem 4.2 of Halická et al. (2024), the positivity of the direction at (𝒙o,𝒚o)W𝒯(\bm{x}_{o},\bm{y}_{o})\in\partial^{W}\mathcal{T} implies θo=1\theta_{o}^{*}=1. Hence (𝒙o,𝒚o)W𝒯(\bm{x}_{o},\bm{y}_{o})\in\partial^{W}\mathcal{T} is identical to its projection and (PR) is not satisfied.
ii \Rightarrow i If (𝒙min,𝒚max)𝒯(\bm{x}^{\mathrm{min}},\bm{y}^{\mathrm{max}})\in\mathcal{T} (that is, if 𝒯\mathcal{T} is the trivial technology), then the choice of the GS range direction guarantees that each (𝒙o,𝒚o)𝒯{(𝒙min,𝒚max)}(\bm{x}_{o},\bm{y}_{o})\in\mathcal{T}\setminus\{(\bm{x}^{\mathrm{min}},\bm{y}^{\mathrm{max}})\} is projected on (𝒙min,𝒚max)𝒯(\bm{x}^{\mathrm{min}},\bm{y}^{\mathrm{max}})\in\mathcal{T}, which is the only strongly efficient unit in 𝒯\mathcal{T}, and hence the theorem holds. Obviously θo=θmin\theta^{*}_{o}=\theta_{min}.

Now consider the case (𝒙min,𝒚max)𝒯(\bm{x}^{\mathrm{min}},\bm{y}^{\mathrm{max}})\notin\mathcal{T}. Let II and RR be the index sets of (𝒙o,𝒚o)𝒯({\bm{x}_{o}},{\bm{y}_{o}})\in\mathcal{T} defined in Lemma 4.8, which can also be empty. According to Lemma 4.8, (𝒙o,𝒚o)(\bm{x}_{o},\bm{y}_{o}) either belongs to the relative interior of IR\mathcal{F}_{IR} or is strongly efficient. In the latter case, there is nothing to prove. Therefore, assume that (𝒙o,𝒚o)rel intIR(\bm{x}_{o},\bm{y}_{o})\in\text{rel int}\mathcal{F}_{IR} and θo\theta_{o}^{*} is its score. From the prescription (13) of the GS range direction it follows that 𝒈iox=0\bm{g}_{io}^{x}=0 and 𝒈roy=0\bm{g}_{ro}^{y}=0 if and only if iIi\in I and rRr\in R respectively. This implies that the path (ϕox(θ),ϕoy(θ))(\bm{\phi}_{o}^{x}(\theta),\bm{\phi}_{o}^{y}(\theta)) for decreasing values of θ1\theta\leq 1 stays in IR\mathcal{F}_{IR} until it reaches the relative boundary of IR\mathcal{F}_{IR} at some θ^<1\hat{\theta}<1. This allows us to apply Lemma 4.4 according to which there exist αx(θ^)(0,1)\alpha^{x}(\hat{\theta})\in(0,1) and αy(θ^)(0,1)\alpha^{y}(\hat{\theta})\in(0,1) such that ϕox(θ^)\bm{\phi}_{o}^{x}(\hat{\theta}) and ϕoy(θ^)\bm{\phi}_{o}^{y}(\hat{\theta}) satisfy (14). From this follows that ϕiox(θ^)=ximin\phi_{io}^{x}(\hat{\theta})=x_{i}^{\mathrm{min}} and ϕioy(θ^)=yimax\phi_{io}^{y}(\hat{\theta})=y_{i}^{\mathrm{max}} if an only if iIi\in I and rRr\in R, resp., hence II and RR is the index set also for (ϕox(θ^),ϕoy(θ^))(\bm{\phi}_{o}^{x}(\hat{\theta}),\bm{\phi}_{o}^{y}(\hat{\theta})). We apply Lemma 4.8 again, this time to a point (ϕox(θ^),ϕoy(θ^))(\bm{\phi}_{o}^{x}(\hat{\theta}),\bm{\phi}_{o}^{y}(\hat{\theta})) that we know is from the relative boundary of IR\mathcal{F}_{IR} and hence (ϕox(θ^),ϕoy(θ^))(\bm{\phi}_{o}^{x}(\hat{\theta}),\bm{\phi}_{o}^{y}(\hat{\theta})) is strongly efficient and θo=θ^\theta_{o}^{*}=\hat{\theta}. ∎

Theorem 4.10.

Let 𝒯\mathcal{T} be an ideal technology set with (𝐱min,𝐲max)𝒯(\bm{x}^{\mathrm{min}},\bm{y}^{\mathrm{max}})\notin\mathcal{T}. Then the GS model with the GS range direction meets the property of strict monotonicity (MO).

Proof.

According to Theorem 4.9, the GS model with the GS range direction satisfies (PR), and hence by Corollary 3.11 it suffices to prove that the path-flow mapping ϕ(θ)\bm{\phi}(\theta^{*}) is strictly monotone on 𝒯\mathcal{T} at any θ𝒟\theta^{*}\in{\cal D}^{*}. In our case, the vector function ϕ(θ)\bm{\phi}({\theta^{*}}) has a property that [ϕx(θ)]i[\bm{\phi}^{x}({\theta^{*}})]_{i} depends only on xiox_{io} and [ϕy(θ)]r[\bm{\phi}^{y}({\theta^{*}})]_{r} depends only on yroy_{ro}, and hence by Remark 3.7 it suffices to prove that [ϕx(θ)]i[\bm{\phi}^{x}({\theta^{*}})]_{i} and [ϕy(θ)]r[\bm{\phi}^{y}({\theta^{*}})]_{r} are increasing in xiox_{io} and yroy_{ro}, respectively. Obviously, the path-flow mapping ϕ(θ)\bm{\phi}(\theta^{*}) is strictly monotone on 𝒯\mathcal{T} at θ=1\theta^{*}=1. For the case θo<1\theta^{*}_{o}<1 we use Lemma 4.4 according which there exist αx(0,1)\alpha^{x}\in(0,1) and αy(0,1)\alpha^{y}\in(0,1) such that

[ϕox(θo)]i=(1αx)xio+αxxim,[\bm{\phi}_{o}^{x}(\theta^{*}_{o})]_{i}=(1-\alpha^{x})x_{io}+\alpha^{x}x^{m}_{i}, (18)

and hence the derivative of [ϕx(θ)]i[\bm{\phi}^{x}({\theta^{*}})]_{i} with respect to xiox_{io} is positive. The proof for [ϕy(θ)]r[\bm{\phi}^{y}({\theta^{*}})]_{r} is analogous. ∎

Remark 4.11.

It is easy to see that the GS model with the GS range direction does not meet the property (MO) over the trivial technology 𝒯\mathcal{T}. By Lemma 4.2 the path generated by directions (13) passes through (𝒙min,𝒚max)(\bm{x}^{\mathrm{min}},\bm{y}^{\mathrm{max}}) at θmin\theta_{\mathrm{min}} and since this point belongs to 𝒯\mathcal{T}, it holds θmin=θo\theta_{\mathrm{min}}=\theta_{o}^{*}. Hence, the efficiency score θo\theta_{o}^{*} of each unit (𝒙o,𝒚o)𝒯{(𝒙min,𝒚max)}(\bm{x}_{o},\bm{y}_{o})\in\mathcal{T}\setminus\{(\bm{x}^{\mathrm{min}},\bm{y}^{\mathrm{max}})\} is equal to the same value θmin\theta_{\mathrm{min}}. Note that this is consistent with the results of Section 3.3: since for any (𝒙o,𝒚o)𝒯{(𝒙min,𝒚max)}(\bm{x}_{o},\bm{y}_{o})\in\mathcal{T}\setminus\{(\bm{x}^{\mathrm{min}},\bm{y}^{\mathrm{max}})\} the corresponding projection ϕ(θo)\bm{\phi}(\theta^{*}_{o}) is equal to the same value ϕ(θo)=ϕ(θmin)\bm{\phi}(\theta^{*}_{o})=\bm{\phi}(\theta_{\mathrm{min}}), the path-flow mapping ϕ(θo)\bm{\phi}(\theta^{*}_{o}) is not strictly monotone on 𝒯\mathcal{T} at θo\theta_{o}^{*}.

5 Numerical examples

This section documents the ability of individual path-based DEA models to project units onto the strongly efficient frontier in different real datasets. We consider two basic settings for the function ψ\psi: linear, which leads to the DDF-g models, and hyperbolic, which leads to the HDF-g models. As direction vectors, we have selected vectors from groups analysed by Halická et al. (2024); their description can be found in Table 2.

Notation 𝒈ox\bm{g}_{o}^{x} 𝒈oy\bm{g}_{o}^{y} Reference
(G1) |𝒙o||\bm{x}_{o}| |𝒚o||\bm{y}_{o}| Kerstens and Van de Woestyne (2011)
(G2) δx(𝒙o𝒙min)\delta^{x}(\bm{x}_{o}-\bm{x}^{\mathrm{min}}) δy(𝒚max𝒚o)\delta^{y}(\bm{y}^{\mathrm{max}}-\bm{y}_{o}) Halická et al. (2024)
(G3) 𝒙max𝒙min\bm{x}^{\mathrm{max}}-\bm{x}^{\mathrm{min}} 𝒚max𝒚min\bm{y}^{\mathrm{max}}-\bm{y}^{\mathrm{min}} Portela et al. (2004)
(G4) |𝒙ev||\bm{x}^{ev}| |𝒚ev||\bm{y}^{ev}| Aparicio et al. (2013)
Table 2: Choices of directions gog_{o} used in numerical experiments. Here xiev=1njxijx^{ev}_{i}=\frac{1}{n}\sum_{j}x_{ij} and yrev=1njyrjy^{ev}_{r}=\frac{1}{n}\sum_{j}y_{rj}. The absolute values in directions (G1) and (G4) accommodate the possibility of negative data. The directions (G2) correspond to the GS range directions defined in (13), i.e., δx=(1ψx(θmin))1\delta^{x}=(1-\psi^{x}(\theta_{min}))^{-1}, δy=(ψy(θmin)1)1\delta^{y}=(\psi^{y}(\theta_{min})-1)^{-1}.

The numerical experiments look at 10 real datasets (see Table 3) that have previously been used in various case studies. The descriptive statistics for each dataset can be found in D. It should be noted that Dataset 4 contains negative data, and that the two undesirable outputs in Dataset 10 were treated as inputs. Furthermore, the three datasets (i.e., 2, 8, 9) that did not initially contain input or output values were supplemented by input or output values equal to unity for all units, respectively. The datasets vary in the density of efficient units, with the ratio of strongly efficient units to the total number of units ranging from 22% for Dataset 10 to 57% for Dataset 6.

Dataset \sharp total/eff DMUs density \sharp inputs \sharp outputs source
1 20/9 45.0% 3 2 Sueyoshi and Sekitani (2007)
2 139/32 23.0% 4 0 Toloo and Kresta (2014)
3 31/11 35.5% 3 2 Juo et al. (2015)
4 30/9 30.0% 2 2 Tone et al. (2020)
5 13/5 38.5% 1 3 Talluri and Yoon (2000)
6 28/16 57.1% 4 2 Ray (2008)
7 31/15 48.4% 2 3 Xiong et al. (2019)
8 46/11 23.9% 0 4 Foroughi (2011)
9 15/4 26.7% 0 6 Liu et al. (2011)
10 92/20 21.7% 3+2 1 Färe et al. (2007)
Table 3: Basic description of the datasets used in the experiments.

We have applied the DDF-g ( ψx(θ)=θ\psi^{x}(\theta)=\theta, ψy(θ)=2θ\psi^{y}(\theta)=2-\theta, θmin=0\theta_{\mathrm{min}}=0) and HDF-g models ( ψx(θ)=θ\psi^{x}(\theta)=\theta, ψy(θ)=1θ\psi^{y}(\theta)=\frac{1}{\theta}, θmin=0.1\theta_{min}=0.1), combined with directions (G1), (G2), (G3), and (G4) to all 10 datasets listed in Table 3. For numerical computations we have utilised a Matlab implementation of the CVX modelling system (see Grant and Boyd, 2014, Grant and Boyd, 2008) to solve convex programmes. After the efficiency evaluation using (2), we have applied the standard second phase method (described for path-based models by programme (18) in Halická et al., 2024) to determine the number of units projected onto the strongly efficient frontier. The results are shown in Table 4.

DDM-g HDM-g
Dataset (G1) (G2) (G3) (G4) (G1) (G2) (G3) (G4)
1 50.0 50.0 50.0 50.0 50.0 50.0 50.0 50.0
2 22.3 53.2 27.3 22.3 23.0 53.2 27.3 23.7
3 35.5 35.5 35.5 35.5 35.5 35.5 35.5 35.5
4 40.0 70.0 66.7 43.3 43.3 76.7 66.7 53.3
5 53.9 61.5 53.9 61.5 46.2 61.5 53.9 61.5
6 57.1 57.1 57.1 57.1 57.1 57.1 57.1 57.1
7 61.3 67.7 58.1 58.1 61.3 64.7 58.1 58.1
8 23.9 80.4 52.2 23.9 23.9 80.4 52.2 23.9
9 26.7 26.7 26.7 26.7 26.7 26.7 26.7 26.7
10 22.8 22.8 21.7 21.7 22.8 23.9 21.7 22.8
Table 4: Percentage of units projected onto the strongly efficient frontier S𝒯\partial^{S}\mathcal{T} for each model, with maximal value in bold. It can be seen that the GS range direction (G2) always provides the maximal number of strongly efficient projections.

We can see that four of the datasets (i.e., 1, 3, 6, 9) are resistant to direction selection — the same number of units is projected onto the strongly efficient frontier regardless of our choice of directions. A possible explanation for this phenomenon is that the units projected onto W𝒯\partial^{W}\mathcal{T} are so close to this part of the boundary that even the choice of the direction leading to the ideal point cannot prevent their projections onto W𝒯\partial^{W}\mathcal{T}. To a lesser extent, this is also true for datasets 5 and 10. In all other datasets, the GS range direction (G2) outperforms, often significantly, the remaining directions.

In our numerical experiment, the GS range direction achieves the highest number of strongly efficient projections in each dataset. However, as one can see on the example of Datasets 3, 9, and 10, even the the best direction may project majority of the inefficient units onto W𝒯\partial^{W}\mathcal{T}. Their scores do not capture all the inefficiencies. Therefore, it is important to bear this in mind when interpreting the results of path-based models.

6 Conclusions

The paper analyses connections among three desirable properties of DEA models: indication, strict monotonicity, and strong efficiency of projections. For a correct interpretation of the results of a model, it is important to know whether the model meets these characteristics. A good understanding of the properties allows one to decide whether the units with a score equal to one were correctly identified as strongly efficient, whether the score of any unit from 𝒯\mathcal{T} captures all sources of inefficiency, and whether the obtained scores of the units allow a fair comparison of inefficient units with each other. If it is known that one of the properties is not met in the model, it may be necessary to perform further analyses or exercise care when interpreting the results of the model.

This article focuses on path-based models which are characterised by the fact that individual models, with some exceptions, do not fulfil even one of the three properties. Our findings are summarised in Figure 7.

ideal technology
++
GS range direction
MOPR no point in W𝒯\partial^{W}\mathcal{T}
has score of 1
ID not all directions
are positive
path-flow continuitypath-flow monotonicity

Figure 7: Hierarchy of properties for the GS model corresponding to a fixed choice of admissible model parameters.

The article shows that the common practice of replacing the property (PR) with the property (ID) is not justified, in general. This is evidenced by example 3.2, where the model satisfies (ID) but not (PR). On the other hand, it was shown (surprisingly to us) that the property (PR) is equivalent to the property (MO) under mild assumptions. Nonetheless, the verification of these two properties has a different flavour: while the property (MO) is usually verified analytically for a specific model (i.e., it is proved theoretically as a global property, even outside the class of path-based models), the validation of the property (PR) for a given unit in 𝒯\mathcal{T} can be performed numerically by applying the standard second phase.

In this paper, we also characterise the cases that ensure the three properties are met. Such models are characterised by the GS range directions (13) and the data 𝑿,𝒀\bm{X},\bm{Y} that generate an ideal technology (Definition 4.5). The latter requirement can be easily verified in practice using Theorem B.1(iv). Taken as a whole, our analysis yields the following recommendations for DEA practitioners.

  1. 1.

    When using path-based models, it must be assumed that the model does not meet the indication, strict monotonicity, and efficiency of the projections. This means that some units may not be projected onto the strongly efficient boundary. Therefore, if one wishes to know whether the obtained score captures all sources of inefficiency, it is necessary to apply the second-phase to identify the non-included slacks.

  2. 2.

    To increase the number of units projected onto the strongly efficient frontier, it is advisable to use the GS range directions. These directions ensure that all units are projected onto the strongly efficient frontier in the case of single input, single output data, and more generally for all data configurations that correspond to an ideal technology. Although higher-dimensional data typically do not result in an ideal technology, empirical evidence suggests that the GS range directions perform no worse and in some datasets significantly better than their alternatives.

Acknowledgements

The authors thank three anonymous reviewers for their valuable comments. The research of the first two authors was supported by the APVV-20-0311 project of the Slovak Research and Development Agency and the VEGA 1/0611/21 grant administered jointly by the Scientific Grant Agency of the Ministry of Education, Science, Research and Sport of the Slovak Republic and the Slovak Academy of Sciences.

Appendix A Polyhedral characteristics of the VRS technology set 𝒯\mathcal{T}

As is well known, every polyhedral set can be expressed as the intersection of a limited number of half-spaces (see, e.g., Klee, 1959, or Rockafellar, 1970). In the case of the variable returns-to-scale technology set 𝒯\mathcal{T} defined in (1), the half-spaces must be of the form

(𝒖,𝒗,σ)={(𝒙,𝒚):𝒖𝒚𝒗𝒙σ},where 𝒖+m and 𝒗+s.\mathcal{H}(\bm{u},\bm{v},\sigma)=\{(\bm{x},\bm{y}):\bm{u}^{\top}\bm{y}-\bm{v}^{\top}\bm{x}\leq\sigma\},\quad\text{where $\bm{u}\in\mathbb{R}^{m}_{+}$ and $\bm{v}\in\mathbb{R}^{s}_{+}$.} (19)

In order to exclude redundant half-spaces, we consider only the facet-defining half-spaces for which

(𝒖,𝒗,σ)𝒯{(𝒙,𝒚)𝒯:𝒖𝒚𝒗𝒙=σ},\partial\mathcal{H}(\bm{u},\bm{v},\sigma)\cap\mathcal{T}\equiv\{(\bm{x},\bm{y})\in\mathcal{T}:\bm{u}^{\top}\bm{y}-\bm{v}^{\top}\bm{x}=\sigma\}, (20)

is a facet of the polyhedron 𝒯\mathcal{T}.777 The relative boundary (𝒖,𝒗,σ)\partial\mathcal{H}(\bm{u},\bm{v},\sigma) of the facet defining half-space (𝒖,𝒗,σ)\mathcal{H}(\bm{u},\bm{v},\sigma) is a supporting hyperplane to 𝒯\mathcal{T} at any point of the corresponding facet. We also introduce notation ix\mathcal{H}_{i}^{x} and iy\mathcal{H}_{i}^{y} for the special half-spaces

ix:={(𝒙,𝒚):xiximin},ry:={(𝒙,𝒚):yryrmax},\begin{split}\mathcal{H}_{i}^{x}:=\{(\bm{x},\bm{y}):x_{i}\geq x_{i}^{\mathrm{min}}\},\qquad\mathcal{H}_{r}^{y}:=\{(\bm{x},\bm{y}):y_{r}\leq y_{r}^{\mathrm{max}}\},\end{split} (21)

and denote the corresponding facets by

ix={(𝒙,𝒚)𝒯:xi=ximin},ry={(𝒙,𝒚)𝒯:yr=yrmax},\mathcal{F}_{i}^{x}=\{(\bm{x},\bm{y})\in\mathcal{T}:x_{i}=x_{i}^{\mathrm{min}}\},\qquad\mathcal{F}_{r}^{y}=\{(\bm{x},\bm{y})\in\mathcal{T}:y_{r}=y_{r}^{\mathrm{max}}\}, (22)

for i=1,,mi=1,\dots,m and r=1,,sr=1,\dots,s. The following lemma lists some known results about polyhedral sets applied to the facial structure of general VRS technologies defined by (1). These results are helpful in the proof of Theorem B.1.

Lemma A.1.

Any technology set 𝒯\mathcal{T} defined by (1) is the intersection of a finite number of facet-defining half-spaces (𝐮,𝐯,σ)\mathcal{H}(\bm{u},\bm{v},\sigma) with 𝐮0\bm{u}\geq 0 and 𝐯0\bm{v}\geq 0 and its boundary is the union of the corresponding facets. Moreover, the following statements hold.

  1. (a)

    Facets with 𝒖>𝟎\bm{u}>\bm{0} and 𝒗>𝟎\bm{v}>\bm{0} are bounded and their union forms S𝒯\partial^{S}\mathcal{T}.

  2. (b)

    Facets whose vector (𝒖,𝒗)(\bm{u},\bm{v}) contains at least one zero component are unbounded and their union is the closure of W𝒯\partial^{W}\mathcal{T}.

  3. (c)

    Each of the special half-spaces ix,ry\mathcal{H}_{i}^{x},\mathcal{H}_{r}^{y} in (21) is facet-defining. Furthermore, the corresponding facets ix\mathcal{F}^{x}_{i}, ry\mathcal{F}^{y}_{r} are unbounded.

  4. (d)

    Every point (𝒙,𝒚)𝒯(i=1mix)(r=1sry)(\bm{x},\bm{y})\in\mathcal{T}\setminus(\cup_{i=1}^{m}\mathcal{F}_{i}^{x})\cup(\cup_{r=1}^{s}\mathcal{F}_{r}^{y}) satisfies 𝒙>𝒙min\bm{x}>\bm{x}^{\mathrm{min}} and 𝒚<𝒚max\bm{y}<\bm{y}^{\mathrm{max}}.

In line with Davtalab-Olyaie et al. (2015), we refer to the facets in b, including those in c, as weak facets.

Appendix B Characterisation of ideal technology sets

The next theorem formulates six equivalent characterisations of ideal technology sets, of which the property vi was previously used as the definition (see (16) in Subsection 4.2). By comparing the properties of the general technology set 𝒯\mathcal{T} presented in Lemma A.1 with the characterisation in Theorem B.1 i, we see that ideal technologies are those sets whose weak facets are precisely the special facets ix\mathcal{F}_{i}^{x} and ry\mathcal{F}_{r}^{y}.

Theorem B.1.

For a technology set 𝒯\mathcal{T} defined by (1), the following are equivalent.

  1. (i)

    𝒯\mathcal{T} is the intersection of the m+sm+s half-spaces ix\mathcal{H}_{i}^{x} and ry\mathcal{H}_{r}^{y} and a finite number of facet defining half-spaces (𝒖,𝒗,σ)\mathcal{H}(\bm{u},\bm{v},\sigma) with positive 𝒖\bm{u} and 𝒗\bm{v}.

  2. (ii)

    The closure of the weakly efficient boundary can be represented as clW𝒯=(i=1mix)(r=1sry)\operatorname{cl}\partial^{W}\mathcal{T}=(\cup_{i=1}^{m}\mathcal{F}_{i}^{x})\cup(\cup_{r=1}^{s}\mathcal{F}_{r}^{y}).

  3. (iii)

    The weakly efficient boundary can be represented as W𝒯=(i=1mix)(r=1sry)S𝒯\partial^{W}\mathcal{T}=(\cup_{i=1}^{m}\mathcal{F}_{i}^{x})\cup(\cup_{r=1}^{s}\mathcal{F}_{r}^{y})\setminus\partial^{S}\mathcal{T}.

  4. (iv)

    For each of the m+sm+s input–output coordinates, there is a generating unit DMUj, j{1,,n}j\in\{1,\ldots,n\}, which coincides with the ideal point except perhaps in this one coordinate. More formally, for each i{1,,m}i^{\prime}\in\{1,\ldots,m\} there exists a j{1,,n}j\in\{1,\dots,n\} such that xij=ximinx_{ij}=x^{\mathrm{min}}_{i} \forall iii\neq i^{\prime} and 𝒚j=𝒚max\bm{y}_{j}=\bm{y}^{\mathrm{max}}; and for each s{1,,r}s^{\prime}\in\{1,...,r\} there exists a j{1,,n}j\in\{1,\dots,n\} such that 𝒙j=𝒙min\bm{x}_{j}=\bm{x}^{\mathrm{min}} and ysj=ysmaxy_{sj}=y^{\mathrm{max}}_{s}, \forall sss\neq s^{\prime}.

  5. (v)

    For each of the m+sm+s input–output coordinates, there is a point in the technology set that coincides with the ideal point except perhaps in this one coordinate. More formally, for each i{1,,m}i\in\{1,\ldots,m\} there exists a δx0,\delta^{x}\geq 0, such that (𝒙min,𝒚max)+δx(𝒆ix,0s)𝒯(\bm{x}^{\mathrm{min}},\bm{y}^{\mathrm{max}})+\delta^{x}(\bm{e}_{i}^{x},0_{s})\in\mathcal{T} and for each r{1,,s}r\in\{1,\ldots,s\} there exists a δy0,\delta^{y}\geq 0, such that (𝒙min,𝒚max)δ(0m,𝒆ry)𝒯(\bm{x}^{\mathrm{min}},\bm{y}^{\mathrm{max}})-\delta(0_{m},\bm{e}_{r}^{y})\in\mathcal{T}.

  6. (vi)

    Each point on the weakly efficient frontier coincides with the ideal point in at least one component. More formally, for each (𝒙,𝒚)W𝒯(\bm{x},\bm{y})\in\partial^{W}\mathcal{T} there exists a i{1,,m}:xi=ximinorr{1,,s}:yr=yrmax.i\in\{1,...,m\}:\ x_{i}=x_{i}^{\mathrm{min}}\ \hbox{or}\ r\in\{1,...,s\}:\ y_{r}=y_{r}^{\mathrm{max}}.

Proof.

The proof scheme is shown in Figure 8.

iiiviiiiviv
Figure 8: Scheme of proof for Theorem B.1.
  • i \Rightarrow ii

    Assume that i holds. Then ii follows from Lemma A.1.

  • ii \Rightarrow i

    Assume that ii holds and i does not. Lemma A.1 then yields a facet generating half-space ^\hat{\mathcal{H}} different from those in (21) whose vector (a,b)(a,b) is non-positive. The corresponding facet therefore lies in the closure of W𝒯\partial^{W}\mathcal{T} in contradiction to ii.

  • ii \Rightarrow iii

    Since clW𝒯𝒯\operatorname{cl}\partial^{W}\mathcal{T}\subset\partial\mathcal{T}, the claim follows.

  • iii \Rightarrow iv

    The claim iii implies that 𝒯\mathcal{T} contains m+sm+s facets i,r\mathcal{F}_{i},\mathcal{F}_{r}. These facets are mutually orthogonal and, hence, the intersections of any m+s1m+s-1 of these facets form edges (one-dimensional faces) of 𝒯\mathcal{T}. Each of these edges contains a vertex of the form described in iv. Since the vertices of 𝒯\mathcal{T} belong to the set of units generating 𝒯\mathcal{T}, the assertion iv holds.

  • iv \Rightarrow v

    This implication is trivial.

  • v \Rightarrow vi

    Assume by contradiction that v holds and that there exists (𝒙o,𝒚o)W𝒯(\bm{x}_{o},\bm{y}_{o})\in\partial^{W}\mathcal{T} such that (𝒙min,𝒚max)(𝒙o,𝒚o)(\bm{x}^{\mathrm{min}},\bm{y}^{\mathrm{max}})\succ(\bm{x}_{o},\bm{y}_{o}). Since (𝒙o,𝒚o)(\bm{x}_{o},\bm{y}_{o}) is weakly but not strongly efficient, due to free disposability, there exists a unit in 𝒯\mathcal{T} dominating (𝒙o,𝒚o)(\bm{x}_{o},\bm{y}_{o}) in exactly one component. Without loss of generality, assume that it is the ii-th input component, i.e., one has (𝒙o,𝒚o)γ(𝒆ix,0m)𝒯(\bm{x}_{o},\bm{y}_{o})-\gamma(\bm{e}_{i}^{x},0_{m})\in\mathcal{T} for some γ>0\gamma>0. Note that v implies (𝒙min,𝒚max)+δx(𝒆ix,0s)𝒯(\bm{x}^{\mathrm{min}},\bm{y}^{\mathrm{max}})+\delta^{x}(\bm{e}_{i}^{x},0_{s})\in\mathcal{T} for some δx0\delta^{x}\geq 0. For λ(0,1)\lambda\in(0,1) define

    (𝒙(λ),𝒚(λ)):=λ[(𝒙min,𝒚max)+δ(𝒆ix,0s)]+(1λ)[(𝒙o,𝒚o)γ(𝒆ix,0m)].(\bm{x}(\lambda),\bm{y}(\lambda)):=\lambda[(\bm{x}^{\mathrm{min}},\bm{y}^{\mathrm{max}})+\delta(\bm{e}_{i}^{x},0_{s})]+(1-\lambda)[(\bm{x}_{o},\bm{y}_{o})-\gamma(\bm{e}_{i}^{x},0_{m})].

    Clearly (𝒙(λ),𝒚(λ))𝒯(\bm{x}(\lambda),\bm{y}(\lambda))\in\mathcal{T} for all λ(0,1)\lambda\in(0,1). For λ\lambda sufficiently small, we have

    (𝒙(λ),𝒚(λ))(𝒙o,𝒚o),(\bm{x}(\lambda),\bm{y}(\lambda))\succ(\bm{x}_{o},\bm{y}_{o}),

    which contradicts (𝒙o,𝒚o)W𝒯(\bm{x}_{o},\bm{y}_{o})\in\partial^{W}\mathcal{T}.

  • vi \Rightarrow ii

    Assume that vi holds and ii does not. Then there exists a point (𝒙^,𝒚^)(\hat{\bm{x}},\hat{\bm{y}}) in W𝒯\partial^{W}\mathcal{T} that does not belong to (i=1mix)(r=1sry)(\cup_{i=1}^{m}\mathcal{F}_{i}^{x})\cup(\cup_{r=1}^{s}\mathcal{F}_{r}^{y}). Lemma A.1d yields that 𝒙^>𝒙min\hat{\bm{x}}>\bm{x}^{\mathrm{min}} and 𝒚^<𝒚max\hat{\bm{y}}<\bm{y}^{\mathrm{max}} in contradiction to vi. ∎

Appendix C Proof of Lemma 4.8

Trivially, IR=(iIix)(rRry)\mathcal{\mathcal{F}}_{IR}=(\cap_{i\in I}\mathcal{F}_{i}^{x})\cap(\cap_{r\in R}\mathcal{F}_{r}^{y}) (see definitions in (22)). Since the normals of the facets i\mathcal{F}_{i} and r\mathcal{F}_{r} are linearly independent, IR\mathcal{F}_{IR} is the (m+s|I||R|)(m+s-|I|-|R|) - dimensional face of 𝒯\mathcal{T}. The (relative) boundary of each face consists of faces of lower dimensions. Theorem B.1i implies that the relative boundary of IR\mathcal{\mathcal{F}}_{IR} consists of two types of (m+s|I||R|1)(m+s-|I|-|R|-1) - dimensional faces: the first one is expressed as the intersections of IR\mathcal{\mathcal{F}}_{IR} with one of the facets FiF_{i}, iIi\notin I or FrF_{r}, rRr\notin R; the second one is expressed as the intersections of IR\mathcal{\mathcal{F}}_{IR} with one of the faces generated by (𝒂,𝒃,σ)\mathcal{H}(\bm{a},\bm{b},\sigma) where 𝒂>0\bm{a}>0, 𝒃>0\bm{b}>0.

From the definition of II and RR it follows that (𝒙,𝒚)(\bm{x},\bm{y}) does not belongs to any of the faces of the first type. Therefore, (𝒙,𝒚)(\bm{x},\bm{y}) is either an interior point of IR\mathcal{\mathcal{F}}_{IR} or it belongs to a face of type (𝒂,𝒃,σ)\mathcal{F}(\bm{a},\bm{b},\sigma), which due to the positiveness of the vectors 𝒂,𝒃\bm{a},\bm{b} belongs to the strongly efficient frontier of 𝒯\mathcal{T}. The lemma is proved.

Appendix D Descriptive statistics of datasets used in Section 5

x1x_{1} x2x_{2} x3x_{3} y1y_{1} y2y_{2}
min 27.00 115.00 2584.00 5765.00 3286.00
max 1043.00 436.00 17412.00 223340.00 39653.00
mean 403.10 248.80 8170.90 81723.00 14413.40
std. dev. 381.20 96.37 5130.93 82604.64 11747.93
Table 1: Dataset 1 (Sueyoshi and Sekitani, 2007) consists of data of 20 Japanese banks, with three inputs (total amount of capital, number of offices, and number of employees) and two outputs (total profit and total amount of deposits).
x1x_{1} x2x_{2} x3x_{3} x4x_{4}
min 0.00 6521.00 0.000 1.192
max 508203.00 36062.00 1213.000 1.804
mean 225635.85 15382.37 976.281 1.413
std. dev. 152560.30 7652.33 480.586 0.151
Table 2: Dataset 2 (Toloo and Kresta, 2014) consists of data of 139 different alternatives for long-term asset financing at Czech banks and leasing companies, with four inputs (downpayment, annuities, other fees, and bank loan coefficient) and no outputs.
x1x_{1} x2x_{2} x3x_{3} y1y_{1} y2y_{2}
min 24930.00 204.00 489.00 2203.00 60552.00
max 2781454.00 8804.00 78284.00 431255.00 1912741.00
mean 710482.84 3811.26 14003.23 134856.35 560873.87
std. dev. 684169.90 2608.07 15703.33 135245.53 537324.60
Table 3: Dataset 3 (Juo et al., 2015) consists of data of 31 banks operating in Taiwan, with three inputs (financial funds, labor, and physical capital) and two outputs (financial investments and loans).
x1x_{1} x2x_{2} y1y_{1} y2y_{2}
min 418242.00 1319.0 -561965.00 -0.25830
max 16655569.00 634436.0 3092358.00 3.53714
mean 3705789.57 102759.1 4.26691.37 0.32565
std. dev. 3807894.37 135690.5 653124.35 0.79420
Table 4: Dataset 4 (Tone et al., 2020) consists of data of 30 Taiwanese electrical machinery firms with two inputs (cost of sales and R&D expenses) and two outputs (net income and return).
x1x_{1} y1y_{1} y2y_{2} y3y_{3}
min 4.0 0.025 6.00 0.50
max 12.5 1.250 115.00 2.90
mean 7.6 0.420 44.92 1.22
std. dev. 2.6 0.450 35.16 0.73
Table 5: Dataset 5 (Talluri and Yoon, 2000) consists of data of 13 industrial robots with single input (cost) and three outputs (repeatability, load capacity, and velocity).
x1x_{1} x2x_{2} x3x_{3} x4x_{4} y1y_{1} y2y_{2}
min 4067.00 62.00 241.00 587.00 2943.00 65.00
max 80627.00 2381.00 5678.00 18624.00 133796.00 5346.00
mean 28690.43 838.18 2140.25 7075.04 4.3555.86 1298.61
std. dev. 20279.43 653.57 1442.54 5352.85 3.4877.86 1183.87
Table 6: Dataset 6 (Ray, 2008) consists of data of 28 international airlines (year 1990) with four inputs (number of employees, fuel, operating and maintenance expenses, capital) and two outputs (passenger-kilometers flown, freight tonne-kilometers flown).
x1x_{1} x2x_{2} y1y_{1} y2y_{2} y3y_{3}
min 1.0 367.70 31.26 39.78 124.63
max 65.9 3738.00 3209.72 1887.45 21801.65
mean 22.7 1609.20 1132.71 0.738.08 5383.30
std. dev. 18.7 896.72 812.36 501.28 5357.41
Table 7: Dataset 7 (Liu et al., 2011) consists of data of 31 Chinese transportation sectors, with two inputs (labour,capital) and three outputs (GDP by transportation sector, passenger turnover volume, freight turnover volume).
y1y_{1} y2y_{2} y3y_{3} y4y_{4}
min 1.18 10.82 163.0 11.29
max 3.87 55.65 718.0 139.02
mean 1.81 25.05 474.7 56.52
std. dev. 0.72 11.21 138.99 31.12
Table 8: Dataset 8 (Foroughi, 2011) consists of data of 46 association rules, with no inputs and four outputs (support, confidence, itemset value, cross-selling profit).
y1y_{1} y2y_{2} y3y_{3} y4y_{4} y5y_{5} y6y_{6}
min 6.88 12.06 1.15 2.68 11.22 15.97
max 100.00 100.00 100.00 100.00 100.00 100.00
mean 36.56 47.80 16.10 19.08 43.67 48.54
std. dev. 32.09 30.15 26.25 25.51 22.38 26.82
Table 9: Dataset 9 (Liu et al., 2011) consists of data of 15 research institutions, with no inputs and six outputs (SCI pub. per research staff, SCI pub. per research expenditures, high quality papers per research staff, high quality papers per research expenditures, external research funding, graduation students enrolled).
x1x_{1} x2x_{2} x3x_{3} b1b_{1} b2b_{2} y1y_{1}
min 39349184.0 39.0 1892407000000 1293.20 423.05 166616000
max 750024804.0 535.0 175688406305600.0 252344.60 72524.10 18212069000
mean 240000014.7 185.2 47104363123213.0 40745.19 17494.02 4686524843
std. dev. 146352514.9 110.9 39982238625707.7 48244.78 16190
Table 10: Dataset 10 (Färe et al., 2007) consists of data of 92 power plants, with three inputs (capital, labour, heat), two undesirable outputs (SO2SO_{2} and NOxNO_{x} emissions), which are handled as inputs, and single output (energy).

References

  • Aparicio et al. (2013) Aparicio, J., Pastor, J.T., Ray, S.C., 2013. An overall measure of technical inefficiency at the firm and at the industry level: The ‘lost profit on outlay’. European Journal of Operational Research 226, 154–162.
  • Banker et al. (1984) Banker, R.D., Charnes, A., Cooper, W.W., 1984. Some models for estimating technical and scale inefficiencies in data envelopment analysis. Management Science 30, 1078–1092. doi:10.1287/mnsc.30.9.1078.
  • Chambers et al. (1996) Chambers, R.G., Chung, Y., Färe, R., 1996. Benefit and distance functions. Journal of Economic Theory 70, 407–419.
  • Chambers et al. (1998) Chambers, R.G., Chung, Y., Färe, R., 1998. Profit, directional distance functions, and Nerlovian efficiency. Journal of Optimization Theory and Applications 98, 351–364. doi:10.1023/A:1022637501082.
  • Charnes et al. (1985) Charnes, A., Cooper, W.W., Golany, B., Seiford, L., Stutz, J., 1985. Foundations of data envelopment analysis for Pareto-Koopmans efficient empirical production functions. Journal of Econometrics 30, 91–107. doi:10.1016/0304-4076(85)90133-2.
  • Chavas and Cox (1999) Chavas, J.P., Cox, T.L., 1999. A generalized distance function and the analysis of production efficiency. Southern Economic Journal 66, 294–318.
  • Cooper et al. (1999) Cooper, W.W., Park, K.S., Pastor, J.T., 1999. RAM: A range adjusted measure of inefficiency for use with additive models, and relations to other models and measures in DEA. Journal of Productivity Analysis 11, 5–42. doi:10.1023/a:1007701304281.
  • Cooper et al. (2011) Cooper, W.W., Pastor, J.T., Borras, F., Aparicio, J., Pastor, D., 2011. BAM: A bounded adjusted measure of efficiency for use with bounded additive models. Journal of Productivity Analysis 35, 85–94.
  • Cooper et al. (2007) Cooper, W.W., Seiford, L.M., Tone, K., 2007. Data Envelopment Analysis: A Comprehensive Text with Models, Applications, References and DEA-Solver Software. Springer US. doi:10.1007/978-0-387-45283-8.
  • Davtalab-Olyaie et al. (2015) Davtalab-Olyaie, M., Roshdi, I., Partovi Nia, V., Asgharian, M., 2015. On characterizing full dimensional weak facets in DEA with variable returns to scale technology. Optimization 64, 2455–2476. doi:10.1080/02331934.2014.917305.
  • Färe et al. (1985) Färe, R., Grosskopf, S., Lovell, C.A.K., 1985. The Measurement of Efficiency of Production. Springer Netherlands. doi:10.1007/978-94-015-7721-2.
  • Färe et al. (2007) Färe, R., Grosskopf, S., Pasurka Jr, C.A., 2007. Environmental production functions and environmental directional distance functions. Energy 32, 1055–1066.
  • Färe and Lovell (1978) Färe, R., Lovell, C.A.K., 1978. Measuring the technical efficiency of production. Journal of Economic Theory 19, 150–162. doi:10.1016/0022-0531(78)90060-1.
  • Foroughi (2011) Foroughi, A., 2011. A note on “a new method for ranking discovered rules from data mining by dea”, and a full ranking approach. Expert Systems with Applications 38, 12913–12916.
  • Grant and Boyd (2008) Grant, M., Boyd, S., 2008. Graph implementations for nonsmooth convex programs, in: Blondel, V., Boyd, S., Kimura, H. (Eds.), Recent Advances in Learning and Control. Springer-Verlag Limited. Lecture Notes in Control and Information Sciences, pp. 95–110. http://stanford.edu/~boyd/graph_dcp.html.
  • Grant and Boyd (2014) Grant, M., Boyd, S., 2014. CVX: Matlab software for disciplined convex programming, version 2.1. http://cvxr.com/cvx.
  • Halická and Trnovská (2021) Halická, M., Trnovská, M., 2021. A unified approach to non-radial graph models in data envelopment analysis: Common features, geometry, and duality. European Journal of Operational Research 289, 611–627. doi:10.1016/j.ejor.2020.07.019.
  • Halická et al. (2024) Halická, M., Trnovská, M., Černý, A., 2024. A unified approach to radial, hyperbolic, and directional efficiency measurement in data envelopment analysis. European Journal of Operational Research 312, 298–314.
  • Juo et al. (2015) Juo, J.C., Fu, T.T., Yu, M.M., Lin, Y.H., 2015. Profit-oriented productivity change. Omega 57, 176–187.
  • Kerstens and Van de Woestyne (2011) Kerstens, K., Van de Woestyne, I., 2011. Negative data in dea: a simple proportional distance function approach. Journal of the Operational Research Society 62, 1413–1419.
  • Klee (1959) Klee, V., 1959. Some characterizations of convex polyhedra. Acta Mathematica 102, 79–107. doi:10.1007/BF02559569.
  • Liu et al. (2011) Liu, W., Zhang, D., Meng, W., Li, X., Xu, F., 2011. A study of dea models without explicit inputs. Omega 39, 472–480.
  • Pastor et al. (2022) Pastor, J.T., Aparicio, J., Zofío, J.L., 2022. Benchmarking Economic Efficiency: Technical and Allocative Fundamentals. International Series in Operations Research and Management Science, Springer Cham.
  • Pastor et al. (1999) Pastor, J.T., Ruiz, J.L., Sirvent, I., 1999. An enhanced DEA Russell graph efficiency measure. European Journal of Operational Research 115, 596–607.
  • Portela et al. (2004) Portela, M.C.A.S., Thanassoulis, E., Simpson, G., 2004. Negative data in DEA: A directional distance approach applied to bank branches. Journal of the Operational Research Society 55, 1111–1121. doi:10.1057/palgrave.jors.2601768.
  • Ray (2008) Ray, S.C., 2008. The directional distance function and measurement of super-efficiency: an application to airlines data. Journal of the Operational Research Society 59, 788–797.
  • Rockafellar (1970) Rockafellar, R.T., 1970. Convex analysis. Princeton Mathematical Series, No. 28, Princeton University Press, Princeton, N.J.
  • Russell (1985) Russell, R.R., 1985. Measures of technical efficiency. Journal of Economic theory 35, 109–126.
  • Russell and Schworm (2011) Russell, R.R., Schworm, W., 2011. Properties of inefficiency indexes on \langleinput, output\rangle space. Journal of Productivity Analysis 36, 143–156. doi:10.1007/s11123-011-0209-3.
  • Russell and Schworm (2018) Russell, R.R., Schworm, W., 2018. Technological inefficiency indexes: A binary taxonomy and a generic theorem. Journal of Productivity Analysis 49, 17–23. doi:10.1007/s11123-017-0518-2.
  • Sueyoshi and Sekitani (2007) Sueyoshi, T., Sekitani, K., 2007. Computational strategy for russell measure in dea: Second-order cone programming. European Journal of Operational Research 180, 459–471.
  • Sueyoshi and Sekitani (2009) Sueyoshi, T., Sekitani, K., 2009. An occurrence of multiple projections in DEA-based measurement of technical efficiency: Theoretical comparison among DEA models from desirable properties. European Journal of Operational Research 196, 764–794. doi:10.1016/j.ejor.2008.01.045.
  • Talluri and Yoon (2000) Talluri, S., Yoon, K.P., 2000. A cone-ratio dea approach for amt justification. International Journal of Production Economics 66, 119–129.
  • Toloo and Kresta (2014) Toloo, M., Kresta, A., 2014. Finding the best asset financing alternative: A dea–weo approach. Measurement 55, 288–294.
  • Tone (2001) Tone, K., 2001. A slacks-based measure of efficiency in data envelopment analysis. European Journal of Operational Research 130, 498–509.
  • Tone et al. (2020) Tone, K., Chang, T.S., Wu, C.H., 2020. Handling negative data in slacks-based measure data envelopment analysis models. European Journal of Operational Research 282, 926–935.
  • Xiong et al. (2019) Xiong, B., Chen, H., An, Q., Wu, J., 2019. A multi-objective distance friction minimization model for performance assessment through data envelopment analysis. European Journal of Operational Research 279, 132–142.