This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

Inclusion and Intersection Relations Between Fundamental Classes of Discrete Convex Functions

Satoko Moriguchi  and Kazuo Murota Faculty of Economics and Business Administration, Tokyo Metropolitan University, [email protected] The Institute of Statistical Mathematics, and Faculty of Economics and Business Administration, Tokyo Metropolitan University, [email protected]
(November 2021 / July 2022 / February 2023)
Abstract

In discrete convex analysis, various convexity concepts are considered for discrete functions such as separable convexity, L-convexity, M-convexity, integral convexity, and multimodularity. These concepts of discrete convex functions are not mutually independent. For example, M-convexity is a special case of integral convexity, and the combination of L-convexity and M-convexity coincides with separable convexity. This paper aims at a fairly comprehensive analysis of the inclusion and intersection relations for various classes of discrete convex functions. Emphasis is put on the analysis of multimodularity in relation to L-convexity and M-convexity.

Keywords: Discrete convex analysis, L-convex function, M-convex function, Multimodular function, Separable convex function

1 Introduction

A function defined on the integer lattice n{\mathbb{Z}}^{n} is called a discrete function. Various types of convexity have been defined for discrete functions in discrete convex analysis [6, 15, 16, 18, 19], such as L-convex functions, L-convex functions, M-convex functions, M-convex functions, integrally convex functions, and multimodular functions. L-convex functions have applications in several fields including operations research (inventory theory, scheduling, etc.) [3, 4, 33], economics and auction theory [20, 31], and computer vision [31]. M-convex functions have applications in operations research [4] and economics [16, 19, 20, 25, 32]. Multimodular functions are used for queueing, Markov decision processes, and discrete-event systems [1, 2, 8].

The classes of discrete convex functions are not mutually independent. In some cases, two classes of discrete convex functions have an inclusion relation as an (almost) immediate consequence of the definitions. This is the case, for example, with the inclusion of M-convex functions in the set of M-convex functions. In other cases, inclusion relations are established as theorems. For example, it is a theorem [16, Theorem 6.42] that M-convex functions are integrally convex functions. This paper aims at a fairly comprehensive analysis of the inclusion relations for various classes of discrete convex functions.

In this paper we are also interested in what is implied by the combination of two convexity properties. For example, it is known ([23, Theorem 3.17], [16, Theorem 8.49]) that a function is both L-convex and M-convex if and only if it is separable convex, where a function f:n{+}f:{\mathbb{Z}}^{n}\to{\mathbb{R}}\cup\{+\infty\} is called separable convex if it is represented as

f(x)=i=1nφi(xi)(xn)f(x)=\sum_{i=1}^{n}\varphi_{i}(x_{i})\qquad(x\in{\mathbb{Z}}^{n}) (1.1)

with univariate functions φi\varphi_{i} (i=1,2,,n)(i=1,2,\ldots,n) that satisfy φi(t1)+φi(t+1)2φi(t)\varphi_{i}(t-1)+\varphi_{i}(t+1)\geq 2\varphi_{i}(t) for all tt\in{\mathbb{Z}} and are finite-valued on intervals of integers. It is easy to see that a separable convex function is both L-convex and M-convex, and hence the content of the above statement lies in the claim that the combination of L-convexity and M-convexity implies separable convexity. In this paper we are interested in such statements for other classes of discrete convex functions. In particular, we show in Section 5.3 that a function is both L-convex and multimodular if and only if it is separable convex.

It is noted, however, that such relationship may not be so simple for other pairs of function classes. As an example, consider M-convex functions and multimodular functions. It is known [9] that M-convex functions and multimodular functions are the same for functions in two variables. For functions in more variables, this is not the case, and there is no inclusion relation between the classes of M-convex functions (resp., sets) and that of multimodular functions (resp., sets). Figure 1 shows examples of integral polymatroids (M-convex sets) with and without multimodularity (see Examples 5.3 and 5.4 for details). The indicator function of the set in (a) is both M-convex and multimodular, but not separable convex. A systematic study in Section 5.4 will show that there are infinitely many functions (resp., sets) that are both M-convex and multimodular, but not separable convex.

Refer to caption
Refer to caption

(a) M-convex and multimodular   (b) M-convex and not multimodular

Figure 1: M-convex sets (polymatroids) with and without multimodularity

This paper is organized as follows. Section 2 introduces notations and summarizes the known pairwise inclusion relations between classes of discrete convex functions and shows how the analysis for functions can be reduced to that for sets. The main results are given in Sections 3 and 4 with illustrative examples. Section 5 presents technical contributions, containing theorems and proofs about multimodularity. Definitions of various concepts of discrete convex functions are given in Appendix for the convenience of reference.

2 Preliminaries

2.1 Notation

Let nn be a positive integer and N={1,2,,n}N=\{1,2,\ldots,n\}. For a subset AA of NN, we denote by eAe^{A} the characteristic vector of AA; the iith component of eAe^{A} is equal to 1 or 0 according to whether iAi\in A or not. We use a short-hand notation eie^{i} for e{i}e^{\{i\}}, which is the iith unit vector. The vector with all components equal to 1 is denoted by 𝟏{\bf 1}, that is, 𝟏=(1,1,,1)=eN{\bf 1}=(1,1,\ldots,1)=e^{N}. We sometimes use e0e^{0} to mean the zero vector 𝟎{\bf 0}.

For a vector x=(x1,x2,,xn)x=(x_{1},x_{2},\ldots,x_{n}) and a subset AA of NN, we use notation x(A)=iAxix(A)=\sum_{i\in A}x_{i}. For two vectors x,ynx,y\in{\mathbb{R}}^{n}, the vectors of componentwise maximum and minimum of xx and yy are denoted, respectively, by xyx\vee y and xyx\wedge y, i.e.,

(xy)i=max(xi,yi),(xy)i=min(xi,yi)(i=1,2,,n).(x\vee y)_{i}=\max(x_{i},y_{i}),\quad(x\wedge y)_{i}=\min(x_{i},y_{i})\qquad(i=1,2,\ldots,n). (2.1)

For a real number tt\in{\mathbb{R}}, t\left\lceil t\right\rceil denotes the smallest integer not smaller than tt (rounding-up to the nearest integer) and t\left\lfloor t\right\rfloor the largest integer not larger than tt (rounding-down to the nearest integer), and this operation is extended to a vector by componentwise applications. That is,

x=(x1,x2,,xn),x=(x1,x2,,xn).\left\lceil x\right\rceil=(\lceil x_{1}\rceil,\lceil x_{2}\rceil,\ldots,\lceil x_{n}\rceil),\qquad\left\lfloor x\right\rfloor=(\lfloor x_{1}\rfloor,\lfloor x_{2}\rfloor,\ldots,\lfloor x_{n}\rfloor). (2.2)

For integer vectors a({})na\in({\mathbb{Z}}\cup\{-\infty\})^{n} and b({+})nb\in({\mathbb{Z}}\cup\{+\infty\})^{n} with aba\leq b, the box of reals and the box of integers (discrete rectangle, integer interval) between aa and bb are denoted by [a,b][a,b]_{{\mathbb{R}}} and [a,b][a,b]_{{\mathbb{Z}}}, respectively, i.e.,

[a,b]\displaystyle[a,b]_{{\mathbb{R}}} ={xnaixibi(i=1,2,,n)},\displaystyle=\{x\in{\mathbb{R}}^{n}\mid a_{i}\leq x_{i}\leq b_{i}\ (i=1,2,\ldots,n)\}, (2.3)
[a,b]\displaystyle[a,b]_{{\mathbb{Z}}} ={xnaixibi(i=1,2,,n)}.\displaystyle=\{x\in{\mathbb{Z}}^{n}\mid a_{i}\leq x_{i}\leq b_{i}\ (i=1,2,\ldots,n)\}. (2.4)

We consider functions defined on integer lattice points, f:n{+}f:{\mathbb{Z}}^{n}\to{\mathbb{R}}\cup\{+\infty\}, where the function may possibly take ++\infty. The effective domain of ff means the set of xx with f(x)<+f(x)<+\infty and is denoted as

domf={xnf(x)<+}.{\rm dom\,}f=\{x\in{\mathbb{Z}}^{n}\mid f(x)<+\infty\}. (2.5)

We always assume that domf{\rm dom\,}f is nonempty. The set of the minimizers of ff is denoted by argminf\arg\min f. For a function ff and a vector pp, f[p]f[-p] denotes the function defined by

f[p](x)=f(x)i=1npixi.f[-p](x)=f(x)-\sum_{i=1}^{n}p_{i}x_{i}. (2.6)

The convex hull of a set SS (n)(\subseteq{\mathbb{Z}}^{n}) is denoted by S¯\overline{S}. The indicator function of a set SnS\subseteq{\mathbb{Z}}^{n} is denoted by δS\delta_{S}, which is the function δS:n{0,+}\delta_{S}:{\mathbb{Z}}^{n}\to\{0,+\infty\} defined by

δS(x)={0(xS),+(xS).\delta_{S}(x)=\left\{\begin{array}[]{ll}0&(x\in S),\\ +\infty&(x\not\in S).\\ \end{array}\right. (2.7)

2.2 Classes of discrete convex functions

In this paper we investigate pairwise relations for the following 13 classes of discrete convex functions: separable convex functions, integrally convex functions, L-convex functions, L-convex functions, L2-convex functions, L2{}^{\natural}_{2}-convex functions, M-convex functions, M-convex functions, M2-convex functions, M2{}^{\natural}_{2}-convex functions, multimodular functions, globally discrete midpoint convex functions, and directed discrete midpoint convex functions. The separable convex functions are defined by (1.1) in Introduction. The definitions of other functions are given in Appendix A, except that the definition of multimodular functions is given in Section 5.1.

The inclusion relations between these function classes are summarized in the following proposition, with Remark 2.1 giving the references. The inclusion relations (2.8)–(2.15) are depicted in Fig. 2.

Proposition 2.1.

The following inclusion relations hold for functions on n{\mathbb{Z}}^{n}:

{separable conv.}{L-conv.}{integrally conv.},\displaystyle\{\mbox{\rm separable conv.}\}\subsetneqq\ \{\mbox{\rm{L${}^{\natural}$}-conv.}\}\subsetneqq\ \{\mbox{\rm integrally conv.}\}, (2.8)
{separable conv.}{M-conv.}{integrally conv.},\displaystyle\{\mbox{\rm separable conv.}\}\subsetneqq\ \{\mbox{\rm{M${}^{\natural}$}-conv.}\}\subsetneqq\ \{\mbox{\rm integrally conv.}\}, (2.9)
{L-conv.}{{L-conv.}{L2-conv.}}{L2-conv.}{integrally conv.},\displaystyle\{\mbox{\rm L-conv.}\}\subsetneqq\ \left\{\begin{array}[]{l}\{\mbox{\rm{L${}^{\natural}$}-conv.}\}\\ \{\mbox{\rm{L${}_{2}$}-conv.}\}\end{array}\right\}\subsetneqq\ \{\mbox{\rm{L${}^{\natural}_{2}$}-conv.}\}\subsetneqq\ \{\mbox{\rm integrally conv.}\}, (2.12)
{M-conv.}{{M-conv.}{M2-conv.}}{M2-conv.}{integrally conv.},\displaystyle\{\mbox{\rm M-conv.}\}\subsetneqq\ \left\{\begin{array}[]{l}\{\mbox{\rm{M${}^{\natural}$}-conv.}\}\\ \{\mbox{\rm{M${}_{2}$}-conv.}\}\end{array}\right\}\subsetneqq\ \{\mbox{\rm{M${}^{\natural}_{2}$}-conv.}\}\subsetneqq\ \{\mbox{\rm integrally conv.}\}, (2.15)
{separable conv.}{multimodular}{integrally conv.},\displaystyle\{\mbox{\rm separable conv.}\}\subsetneqq\ \{\mbox{\rm multimodular}\}\subsetneqq\ \{\mbox{\rm integrally conv.}\}, (2.16)
{L-conv.}{globally d.m.c.}{integrally conv.},\displaystyle\{\mbox{\rm{L${}^{\natural}$}-conv.}\}\subsetneqq\ \{\mbox{\rm globally d.m.c.}\}\subsetneqq\ \{\mbox{\rm integrally conv.}\}, (2.17)
{L-conv.}{directed d.m.c.}{integrally conv.}.\displaystyle\{\mbox{\rm{L${}^{\natural}$}-conv.}\}\subsetneqq\ \{\mbox{\rm directed d.m.c.}\}\subsetneqq\ \{\mbox{\rm integrally conv.}\}. (2.18)

 

Refer to caption
Figure 2: Inclusion relations between classes of discrete convex functions
Remark 2.1.

Here is a supplement to Proposition 2.1. Integral convexity is established for L-convex functions in [16, Theorem 7.20], for M-convex functions in [23, Theorem 3.9] (see also [16, Theorem 6.42]), and for globally discrete midpoint convex functions in [13, Theorem 6]. The inclusion relations (2.12) and (2.15) concerning L2-convex and M2-convex functions are given in [16, Section 8.3]. The integral convexity of multimodular functions in (2.16) was pointed out first in [18, Section 14.6], while this is implicit in the construction of the convex extension given earlier in [8, Theorem 4.3]. The first inclusion in (2.16) for multimodular functions is given in [9, Proposition 2]. The inclusion relations in (2.17) are given in [13, Theorem 6], and those in (2.18) are in [35].  

2.3 Our approach to identify the intersection of two classes

When two function classes have no inclusion relation between themselves, we are interested in identifying the intersection of these classes. A typical (known) statement of this kind is that a function is both L-convex and M-convex if and only if it is separable convex. In this paper (Section 4) we are concerned with similar statements:

A function is both A-convex and B-convex if and only if it is C-convex,\mbox{A function is both A-convex and B-convex if and only if it is C-convex}, (2.19)

where A-, B-, and C-convex denote certain specified discrete convexity from among those treated in this paper.

In discrete convex analysis, it is generally true that a set SS  (n\subseteq{\mathbb{Z}}^{n}) has a certain discrete convexity if and only if its indicator function δS\delta_{S} has the same discrete convexity. Therefore, the statement (2.19) implies the corresponding statement for a set:

A set is both A-convex and B-convex if and only if it is C-convex.\mbox{A set is both A-convex and B-convex if and only if it is C-convex}. (2.20)

While the implication “(2.19) \Rightarrow (2.20)” is obvious, the converse is also true under mild assumptions. In such cases, we can reduce the proof of (2.19) for functions to that of (2.20) for sets.

As a technical assumption for the reverse implication “(2.20) \Rightarrow (2.19)” we introduce the condition on a function f:n{+}f:{\mathbb{Z}}^{n}\to{\mathbb{R}}\cup\{+\infty\} that

(i) domf is bounded or (ii) domf¯ is a polyhedron and f is convex-extensible,\mbox{(i) ${\rm dom\,}f$ is bounded or (ii) $\overline{{\rm dom\,}f}$ is a polyhedron and $f$ is convex-extensible}, (2.21)

where, in this paper, we call a function ff convex-extensible if it is extensible to a locally polyhedral convex function in the sense of [6, Section 15]. For the set version, we consider the condition on a set SnS\subseteq{\mathbb{Z}}^{n} that

(i) S is bounded or (ii) S¯ is a polyhedron and S=S¯n.\mbox{(i) $S$ is bounded or (ii) $\overline{S}$ is a polyhedron and $S=\overline{S}\cap{\mathbb{Z}}^{n}$}. (2.22)

For the sake of exposition, let us choose separable convexity as “C-convexity” in the above generic statement. Note that a box of integers is the set version of a separable convex function. We use the following fact, where argminf[p]\arg\min f[-p] means the set of minimizers of the function f[p]f[-p] defined in (2.6).

Proposition 2.2.

Under the assumption (2.21), a function f:n{+}f:{\mathbb{Z}}^{n}\to{\mathbb{R}}\cup\{+\infty\} is separable convex if and only if, for any vector pnp\in{\mathbb{R}}^{n}, argminf[p]\arg\min f[-p] is a box of integers or an empty set.

Proof.

See Remark 2.2 at the end of Section 2.3. ∎

For “A-convexity” and “B-convexity” in the generic statement, we assume the following natural properties:

If a function ff is A-convex (resp., B-convex), then ff satisfies (2.21)
and, for any pnargminf[p] is A-convex (resp., B-convex).\displaystyle\mbox{and, for any $p\in{\mathbb{R}}^{n}$, $\arg\min f[-p]$ is A-convex (resp., B-convex)}. (2.23)

By Proposition 2.2 (only-if part), separable convexity has this property. Moreover, it is known that all kinds of discrete convexity treated in this paper have this property.

Proposition 2.3.

Under the assumptions (2.21)–(2.23), the following two statements are equivalent:
(1) A set which is both A-convex and B-convex is a box of integers.
(2) A function which is both A-convex and B-convex is a separable convex function.

Proof.

The implication (2) \Longrightarrow (1) is obvious, as (1) for a set SS follows from (2) for its indicator function δS\delta_{S}. The converse, (1) \Longrightarrow (2), can be shown as follows. Let ff be both A-convex and B-convex. By the assumption (2.23), the set S=argminf[p]S=\arg\min f[-p] is both A-convex and B-convex for each pnp\in{\mathbb{R}}^{n}. Then, by (1), argminf[p]\arg\min f[-p] is a box for each pnp\in{\mathbb{R}}^{n}. It follows from this and Proposition 2.2 that ff is separable convex. ∎

Similarly, we can obtain the following propositions, where “C-convexity” in the generic statement is replaced by L-convexity and M-convexity, respectively.

Proposition 2.4.

Under the assumptions (2.21)–(2.23), the following two statements are equivalent:
(1) A set which is both A-convex and B-convex is an L-convex set.
(2) A function which is both A-convex and B-convex is an L-convex function.

Proof.

The proof is the same as that of Proposition 2.3, except that Proposition 2.2 is replaced by Theorem A.7. ∎

Proposition 2.5.

Under the assumptions (2.21)–(2.23), the following two statements are equivalent:
(1) A set which is both A-convex and B-convex is an M-convex set.
(2) A function which is both A-convex and B-convex is an M-convex function.

Proof.

The proof is the same as that of Proposition 2.3, except that Proposition 2.2 is replaced by Theorem A.8. ∎

Proposition 2.3 will be used in the proof of Theorem 5.7, and Propositions 2.4 and 2.5 will be used in the proofs of Theorems 4.1 and 4.2, respectively.

Remark 2.2.

A proof of Proposition 2.2 is outlined here for completeness. Suppose that ff is a separable convex function represented as f(x)=i=1nφi(xi)f(x)=\sum_{i=1}^{n}\varphi_{i}(x_{i}) in (1.1). Then f[p](x)=i=1nφi[pi](xi)f[-p](x)=\sum_{i=1}^{n}\varphi_{i}[-p_{i}](x_{i}), from which xx is a minimizer of f[p]f[-p] if and only if, for each ii, xix_{i} is a minimizer of φi[pi]\varphi_{i}[-p_{i}]. Therefore argminf[p]\arg\min f[-p] is a box [a,b][a,b]_{{\mathbb{Z}}}, where aa and bb are the vectors whose components are defined by [ai,bi]=argminφi[pi][a_{i},b_{i}]_{{\mathbb{Z}}}=\arg\min\varphi_{i}[-p_{i}] (i=1,2,,n)(i=1,2,\ldots,n). To show the converse under (2.21), suppose that argminf[p]\arg\min f[-p] is a box of integers for any pnp\in{\mathbb{R}}^{n}.

First we consider the case (ii) where domf¯\overline{{\rm dom\,}f} is a polyhedron and ff is convex-extensible. Let f¯\overline{f} denote the convex extension of ff. We use notation Sp:=argminf[p]S_{p}:=\arg\min f[-p]. By the assumption, SpS_{p} is a box of integers and Sp¯=argminf[p]¯=argminf¯[p]\overline{S_{p}}=\overline{\arg\min f[-p]}=\arg\min\overline{f}[-p] is a box of reals. We have domf¯=pSp¯\overline{{\rm dom\,}f}=\bigcup_{p}\overline{S_{p}}, which implies that domf¯\overline{{\rm dom\,}f} is a box of reals and hence domf{\rm dom\,}f is a box of integers. Fix an arbitrary zdomfz\in{\rm dom\,}f. For i=1,2,,ni=1,2,\ldots,n, define φi:{+}\varphi_{i}:{\mathbb{Z}}\to{\mathbb{R}}\cup\{+\infty\} by φi(t)=f(z1,,zi1,t,zi+1,,zn)\varphi_{i}(t)=f(z_{1},\ldots,z_{i-1},t,z_{i+1},\ldots,z_{n}) for tt\in{\mathbb{Z}}. By the assumed convex-extensibility of ff, we have that domφi{\rm dom\,}\varphi_{i} is an interval of integers and φi(t1)+φi(t+1)2φi(t)\varphi_{i}(t-1)+\varphi_{i}(t+1)\geq 2\varphi_{i}(t) for all tt\in{\mathbb{Z}}. Using these univariate functions we obtain the desired expression f(x)=i=1nφi(xi)f(x)=\sum_{i=1}^{n}\varphi_{i}(x_{i}).

Next, we turn to the other case (i) where domf{\rm dom\,}f is assumed to be bounded. Let f¯\overline{f} denote the convex envelope of ff, by which we mean that f¯(x)\overline{f}(x) is equal to the maximum of g(x)g(x) taken over all closed convex functions gg satisfying g(y)f(y)g(y)\leq f(y) for all ydomfy\in{\rm dom\,}f. Since domf{\rm dom\,}f is bounded, f¯\overline{f} is a polyhedral convex function and domf¯=domf¯{\rm dom\,}\overline{f}=\overline{{\rm dom\,}f} is a polyhedron. To show f¯=f\overline{f}=f (convex-extensibility of ff) by contradiction, suppose that f¯(y)<f(y)\overline{f}(y)<f(y) for some ydomfy\in{\rm dom\,}f. Let pp be a subgradient of f¯\overline{f} at yy, which is equivalent to saying that yargminf¯[p]y\in\arg\min\overline{f}[-p]. Since argminf¯[p]=argminf[p]¯\arg\min\overline{f}[-p]=\overline{\arg\min f[-p]}, this implies yargminf[p]¯y\in\overline{\arg\min f[-p]}, whereas yargminf[p]y\notin\arg\min f[-p] by f¯(y)<f(y)\overline{f}(y)<f(y). This is a contradiction to the assumption that argminf[p]\arg\min f[-p] is a box of integers. Hence ff is convex-extensible, and the proof in the case of (i) is reduced to that in the case of (ii).  

3 Inclusion Relations Between Convexity Classes

In Section 2.2 we have presented basic inclusion relations for fundamental classes of discrete convex functions (Proposition 2.1 and Fig. 2). The objective of this section is to offer supplementary facts. In particular, we present examples to demonstrate proper inclusion (\subsetneqq), and cover (globally, directed) discrete midpoint convex functions. The relation of multimodularity to L- and M-convexity will be investigated later in Sections 5.3 and 5.4.

Table 1: Relations between classes of discrete convex functions
sep int-c L L2 L L2{}^{\natural}_{2} M M2 M M2{}^{\natural}_{2} m.m. g-dmc d-dmc
sep = \subset \triangledown \triangledown \subset \subset \triangledown \triangledown \subset \subset \subset \subset \subset
lin lin point point
int-c = \supset \supset \supset \supset \supset \supset \supset \supset \supset \supset \supset
L = \subset \subset \subset \triangledown \triangledown \triangledown \triangledown \triangledown \subset \subset
none none lin lin lin
L2 = \triangledown \subset \triangledown \triangledown \triangledown \triangledown \triangledown \triangledown \triangledown
L none none lin lin lin \supseteq L \supseteq L
L = \subset \triangledown \triangledown \triangle \triangle \triangle \subset \subset
point point sep sep sep
L2{}^{\natural}_{2} = \triangledown \triangledown \triangle \triangle \triangle \triangle \triangle
point point sep sep sep \supseteq L \supseteq L
M = \subset \subset \subset \triangledown \triangledown \triangledown
M2 = \triangledown \subset \triangledown \triangledown \triangledown
M
M = \subset \triangle \triangle \triangle
\supsetneqq sep \supsetneqq sep \supsetneqq sep
M2{}^{\natural}_{2} = \triangle \triangle \triangle
\supsetneqq sep \supsetneqq sep \supsetneqq sep
m.m. = \triangle \triangle
\supsetneqq sep \supsetneqq sep
g-dmc = \triangle
\supsetneqq L
d-dmc =

sep = separable convex,  int-c = integrally convex,  m.m. = multimodular.  
g-dmc=globally discrete midpoint convex,  d-dmc=directed discrete midpoint convex.
\subset: The class of the row is properly contained (\subset) by the class of the column.
\supset: The class of the row properly contains (\supset) the class of the column.
\triangle: There is no inclusion relation between the classes of the row and the column, and their intersection includes all separable convex functions.
\triangledown: There is no inclusion relation between the classes of the row and the column, and their intersection does not include the set of separable convex functions.
lin: linear function on n{\mathbb{Z}}^{n},  point: function defined on a single point.
: results of this paper, : results in previous studies, Unmarked: easy observations.

Table 1 is a summary of the inclusion and intersection relations for various classes of discrete convex functions. In each row, corresponding to a class of discrete convex functions, the first (upper) line shows its inclusion relation by \subset, \supset, \triangle, and \triangledown. The symbol \subset (resp., \supset) means that the class of the row is properly contained in (resp., properly contains) the class of the column. The symbols \triangle and \triangledown indicate that there is no inclusion relation between the classes of the row and the column; \triangle or \triangledown is used depending on whether their intersection includes all separable convex functions or not. The second (lower) line of each row shows what their intersection is, to be treated in Section 4.

A set SS is always assumed to be a subset of n{\mathbb{Z}}^{n} and a function ff is defined on n{\mathbb{Z}}^{n}, that is, f:n{+}f:{\mathbb{Z}}^{n}\to{\mathbb{R}}\cup\{+\infty\}. We can demonstrate that a certain class of discrete convex functions, say, A-convex functions is not contained in another class of discrete convex functions, say, B-convex functions by exhibiting a set SS that is A-convex and not B-convex. Note that, in this case, the indicator function δS\delta_{S} is an A-convex function which is not B-convex.

3.1 L-convexity, M-convexity, and multimodularity

The following examples are concerned with L-convexity, M-convexity, and multimodularity (m.m.). For function classes A,B,CA,B,C with {A,B,C}={L,M,m.m.}\{A,B,C\}=\{\mbox{L},\mbox{M},\mbox{m.m.}\}, we have A(BC)A\setminus(B\cup C)\neq\emptyset.

Example 3.1.

Let S={x3x=t(1,1,1),t}S=\{x\in{\mathbb{Z}}^{3}\mid x=t(1,1,1),t\in{\mathbb{Z}}\}. This set is L-convex (hence L-convex), but it is neither M-convex nor multimodular. Multimodularity of SS is denied in Example 5.1 in Section 5.1.  

Example 3.2.

Let S={x3x=t(1,1,1),t}S=\{x\in{\mathbb{Z}}^{3}\mid x=t(1,-1,1),t\in{\mathbb{Z}}\}. This set is multimodular but it is neither L-convex nor M-convex. Multimodularity of SS will be demonstrated in Example 5.2 in Section 5.2.  

Example 3.3.

Let

S={x30xi3(i=1,2,3),x(X)5(|X|=2),x1+x2+x36},S=\{x\in{\mathbb{Z}}^{3}\mid 0\leq x_{i}\leq 3~{}(i=1,2,3),\ x(X)\leq 5\ (|X|=2),\ x_{1}+x_{2}+x_{3}\leq 6\}, (3.1)

which is depicted in Fig. 1(b). This set is M-convex, but it is neither L-convex nor multimodular. For x=(1,2,3)x=(1,2,3) and y=(2,2,2)y=(2,2,2), we have (x+y)/2=(3/2,2,5/2)(x+y)/2=(3/2,2,5/2) and (x+y)/2=(2,2,3)S\lceil(x+y)/2\rceil=(2,2,3)\notin S. Hence SS is not L-convex. Multimodularity of SS is denied in Example 5.4 in Section 5.4.1.  

Example 3.4.

From Example 3.3, we can make an example of an M-convex set that is neither L-convex nor multimodular. Using SS in (3.1), define

S^={x4(x1,x2,x3)S,x4=6(x1+x2+x3)},\hat{S}=\{x\in{\mathbb{Z}}^{4}\mid(x_{1},x_{2},x_{3})\in S,\ x_{4}=6-(x_{1}+x_{2}+x_{3})\},

which is indeed M-convex by (A.37). This set is not L-convex since discrete midpoint convexity fails for x=(1,2,3,0)x=(1,2,3,0) and y=(2,2,2,0)y=(2,2,2,0). Multimodularity of S^\hat{S} is denied in Example 5.5 in Section 5.4.1.  

Remark 3.1.

When n=2n=2, every M-convex function is multimodular, and the converse is also true [9, Remark 2.2]. When n=3n=3, every M-convex function is multimodular (Proposition 5.10 in Section 5.4.1), but the converse is not true.  

3.2 Global and directed discrete midpoint convexity

The following examples show that there is no inclusion relation between global d.m.c. and directed d.m.c., that is, ABA\setminus B\neq\emptyset when {A,B}={g-dmc,d-dmc}\{A,B\}=\{\mbox{g-dmc},\mbox{d-dmc}\}.

Example 3.5 ([35, Example 3]).

Let

S={(0,0,0),(1,1,0),(1,0,1),(2,1,1)}.S=\{(0,0,0),(1,1,0),(1,0,-1),(2,1,-1)\}.

This set is globally d.m.c. but not directed d.m.c. Indeed, x=(0,0,0)x=(0,0,0) and y=(2,1,1)y=(2,1,-1) are the only pair with xy2\|x-y\|_{\infty}\geq 2. We have (x+y)/2=(1,1/2,1/2)(x+y)/2=(1,1/2,-1/2), (x+y)/2=(1,1,0)S\lceil(x+y)/2\rceil=(1,1,0)\in S and (x+y)/2=(1,0,1)S\lfloor(x+y)/2\rfloor=(1,0,-1)\in S. On the other hand, using notation μ~\tilde{\mu} defined in (A.26) and (A.27), we have μ~(x,y)=(1,0,0)S\tilde{\mu}(x,y)=(1,0,0)\notin S and μ~(y,x)=(1,1,1)S\tilde{\mu}(y,x)=(1,1,-1)\notin S. Hence SS is not directed d.m.c.  

Example 3.6 ([35, Example 3]).

Let

S={(0,0,0),(1,0,0),(1,1,1),(2,1,1),(1,1,1),(2,1,1),(1,1,0),(2,1,0)}.S=\{(0,0,0),(1,0,0),(1,1,1),(2,1,1),(1,1,-1),(2,1,-1),(1,1,0),(2,1,0)\}.

This set is directed d.m.c. but not globally d.m.c. For x=(0,0,0)x=(0,0,0) and y=(2,1,1)y=(2,1,-1) with xy2\|x-y\|_{\infty}\geq 2, we have (x+y)/2=(1,1/2,1/2)(x+y)/2=(1,1/2,-1/2) and (x+y)/2=(1,0,1)S\lfloor(x+y)/2\rfloor=(1,0,-1)\notin S. Hence SS is not globally d.m.c.  

Example 3.7.

Let

f(x1,x2,x3)=(x1x2)2+(x1+x3)2+(x2+x3)2.f(x_{1},x_{2},x_{3})=(x_{1}-x_{2})^{2}+(x_{1}+x_{3})^{2}+(x_{2}+x_{3})^{2}.

This function is not globally d.m.c. Indeed, for x=(0,0,0)x=(0,0,0) and y=(2,1,1)y=(2,1,-1) with xy2\|x-y\|_{\infty}\geq 2, we have (x+y)/2=(1,1/2,1/2)(x+y)/2=(1,1/2,-1/2), u:=(x+y)/2=(1,1,0)u:=\lceil(x+y)/2\rceil=(1,1,0), and v:=(x+y)/2=(1,0,1)v:=\lfloor(x+y)/2\rfloor=(1,0,-1). Then

f(x)+f(y)=(0+0+0)+(1+1+0)<f(u)+f(v)=(0+1+1)+(1+0+1),f(x)+f(y)=(0+0+0)+(1+1+0)<f(u)+f(v)=(0+1+1)+(1+0+1),

which is a violation of discrete midpoint convexity. This function ff is a quadratic function represented as f(x)=xQxf(x)=x^{\top}Qx with a diagonally dominant matrix Q=[211121112]Q=\left[\begin{array}[]{rrr}2&-1&1\\ -1&2&1\\ 1&1&2\\ \end{array}\right]. It is known [35, Theorem 9] that a quadratic function defined by a diagonally dominant matrix with nonnegative diagonal elements is directed d.m.c. More generally, 2-separable convex functions are directed d.m.c. [35, Theorem 4].  

Example 3.8.

Let

f(x1,x2,x3)=(x1+x2)2=[x1x2x3][110110000][x1x2x3].f(x_{1},x_{2},x_{3})=(x_{1}+x_{2})^{2}=[\begin{array}[]{ccc}x_{1}&x_{2}&x_{3}\end{array}]\left[\begin{array}[]{rrr}1&1&0\\ 1&1&0\\ 0&0&0\\ \end{array}\right]\left[\begin{array}[]{c}x_{1}\\ x_{2}\\ x_{3}\end{array}\right].

This function is directed d.m.c. (by diagonal dominance) but not globally d.m.c. Indeed, for x=(1,0,0)x=(1,0,0) and y=(0,1,2)y=(0,1,2), we have u:=(x+y)/2=(1,1,1)u:=\lceil(x+y)/2\rceil=(1,1,1), v:=(x+y)/2=(0,0,1)v:=\lfloor(x+y)/2\rfloor=(0,0,1), and

f(x)+f(y)=1+1<f(u)+f(v)=4+0,f(x)+f(y)=1+1<f(u)+f(v)=4+0,

which is a violation of discrete midpoint convexity.  

3.3 L2-convexity and discrete midpoint convexity

The following examples show that there is no inclusion relation between L2-convexity and (global, directed) discrete midpoint convexity, that is, ABA\setminus B\neq\emptyset when {A,B}={L2,(g-, d-)dmc}\{A,B\}=\{\mbox{{L${}_{2}$}},\mbox{(g-, d-)dmc}\}.

Example 3.9 ([13, Remark 2], [35, Example 2]).

Let

S={(0,0,0),(0,1,1),(1,1,0),(1,2,1)}.S=\{(0,0,0),(0,1,1),(1,1,0),(1,2,1)\}.

This set is L2{}^{\natural}_{2}-convex, since S=S1+S2S=S_{1}+S_{2} for two L-convex sets S1={(0,0,0),(0,1,1)}S_{1}=\{(0,0,0),(0,1,1)\} and S2={(0,0,0),(1,1,0)}S_{2}=\{(0,0,0),(1,1,0)\}. The set SS is not (globally, directed) d.m.c., since, for x=(0,0,0)x=(0,0,0) and y=(1,2,1)y=(1,2,1) with xyx\leq y and xy2\|x-y\|_{\infty}\geq 2, we have (x+y)/2=(1/2,1,1/2)(x+y)/2=(1/2,1,1/2), (x+y)/2=(1,1,1)S\lceil(x+y)/2\rceil=(1,1,1)\notin S and (x+y)/2=(0,1,0)S\lfloor(x+y)/2\rfloor=(0,1,0)\notin S. From this L2{}^{\natural}_{2}-convex set SS, we define a set T4T\subseteq{\mathbb{Z}}^{4} by

T={(x,0)+μ(𝟏,1)xS,μ},T=\{(x,0)+\mu({\bf 1},1)\mid x\in S,\ \mu\in{\mathbb{Z}}\},

which is L2-convex and not (globally, directed) d.m.c.  

Example 3.10.

Let

S\displaystyle S ={x4x1+x2x3x4=0},\displaystyle=\{x\in{\mathbb{Z}}^{4}\mid x_{1}+x_{2}-x_{3}-x_{4}=0\},

which is L2-convex, since S=S1+S2S=S_{1}+S_{2} for two L-convex sets

S1\displaystyle S_{1} ={x4x1=x3,x2=x4},\displaystyle=\{x\in{\mathbb{Z}}^{4}\mid x_{1}=x_{3},\ x_{2}=x_{4}\},
S2\displaystyle S_{2} ={x4x1=x4,x2=x3}.\displaystyle=\{x\in{\mathbb{Z}}^{4}\mid x_{1}=x_{4},\ x_{2}=x_{3}\}.

The set SS is not (globally, directed) d.m.c., since, for x=(0,0,0,0)x=(0,0,0,0) and y=(2,2,1,3)y=(2,2,1,3) with xyx\leq y and xy2\|x-y\|_{\infty}\geq 2, we have (x+y)/2=(1,1,1/2,3/2)(x+y)/2=(1,1,1/2,3/2), (x+y)/2=(1,1,1,2)S\lceil(x+y)/2\rceil=(1,1,1,2)\notin S and (x+y)/2=(1,1,0,1)S\lfloor(x+y)/2\rfloor=(1,1,0,1)\notin S.  

Example 3.11.

The set S={(1,0),(0,1)}S=\{(1,0),(0,1)\} is (globally, directed) d.m.c. but not L2{}^{\natural}_{2}-convex (hence not L2-convex).  

3.4 M2-convexity and discrete midpoint convexity

The following examples show that there is no inclusion relation between M2-convexity and (global, directed) discrete midpoint convexity, that is, ABA\setminus B\neq\emptyset when {A,B}={M2,(g-, d-)dmc}\{A,B\}=\{\mbox{{M${}_{2}$}},\mbox{(g-, d-)dmc}\}.

Example 3.12.

Let SS be the set of integer points on the maximal face in Fig. 1(a), that is,

S\displaystyle S ={x3xi3(i=1,2,3),x1+x25,x2+x35,x1+x2+x3=6}\displaystyle=\{x\in{\mathbb{Z}}^{3}\mid x_{i}\leq 3~{}(i=1,2,3),\ x_{1}+x_{2}\leq 5,\ x_{2}+x_{3}\leq 5,\ x_{1}+x_{2}+x_{3}=6\}
={(3,0,3),(1,2,3),(3,2,1),(1,3,2),(2,3,1),(2,1,3),(2,2,2),(3,1,2)}.\displaystyle=\{(3,0,3),(1,2,3),(3,2,1),(1,3,2),(2,3,1),(2,1,3),(2,2,2),(3,1,2)\}. (3.2)

This set SS is M-convex and hence M2-convex. However, it is not (globally, directed) d.m.c. For x=(3,0,3)x=(3,0,3) and y=(2,2,2)y=(2,2,2) with xy2\|x-y\|_{\infty}\geq 2, we have (x+y)/2=(5/2,1,5/2)(x+y)/2=(5/2,1,5/2), (x+y)/2=μ~(x,y)=(3,1,3)S\lceil(x+y)/2\rceil=\tilde{\mu}(x,y)=(3,1,3)\notin S and (x+y)/2=μ~(y,x)=(2,1,2)S\lfloor(x+y)/2\rfloor=\tilde{\mu}(y,x)=(2,1,2)\notin S using notation μ~\tilde{\mu} defined in (A.26) and (A.27). Hence SS is neither globally d.m.c. nor directed d.m.c.  

Example 3.13.

The set S={(0,0),(1,1)}S=\{(0,0),(1,1)\} is (globally, directed) d.m.c. but not M2{}^{\natural}_{2}-convex (hence not M-convex).  

4 Intersection of Two Convexity Classes

The following are the major findings of this paper concerning the intersection of two convexity classes of functions f:n{+}f:{\mathbb{Z}}^{n}\to{\mathbb{R}}\cup\{+\infty\}.

  • A function which is both L-convex and multimodular is separable convex (Theorem 5.8).

  • Furthermore, a function which is both L2{}^{\natural}_{2}-convex and multimodular is separable convex (Theorem 5.7).

  • A function which is both M-convex and multimodular is not necessarily separable convex (Section 5.4).

For the ease of reference we summarize remarkable relations below:

{L2-convex}{L-convex}={L-convex},\displaystyle\{\mbox{\rm{L${}_{2}$}-convex}\}\cap\{\mbox{\rm{L${}^{\natural}$}-convex}\}=\{\mbox{\rm L-convex}\}, (4.1)
{M2-convex}{M-convex}={M-convex},\displaystyle\{\mbox{\rm{M${}_{2}$}-convex}\}\cap\{\mbox{\rm{M${}^{\natural}$}-convex}\}=\{\mbox{\rm M-convex}\}, (4.2)
{L-convex}{M-convex}=,\displaystyle\{\mbox{\rm L-convex}\}\cap\{\mbox{\rm M-convex}\}=\emptyset, (4.3)
{L2-convex}{M2-convex}=,\displaystyle\{\mbox{\rm{L${}_{2}$}-convex}\}\cap\{\mbox{\rm{M${}_{2}$}-convex}\}=\emptyset, (4.4)
{L-convex}{M-convex}={separable convex},\displaystyle\{\mbox{\rm{L${}^{\natural}$}-convex}\}\cap\{\mbox{\rm{M${}^{\natural}$}-convex}\}=\{\mbox{\rm separable convex}\}, (4.5)
{L2-convex}{M2-convex}={separable convex},\displaystyle\{\mbox{\rm{L${}^{\natural}_{2}$}-convex}\}\cap\{\mbox{\rm{M${}^{\natural}_{2}$}-convex}\}=\{\mbox{\rm separable convex}\}, (4.6)
{L-convex}{M-convex}={linear on n},\displaystyle\{\mbox{\rm L-convex}\}\cap\{\mbox{\rm{M${}^{\natural}$}-convex}\}=\{\mbox{\rm linear on ${\mathbb{Z}}^{n}$}\}, (4.7)
{L-convex}{M-convex}={singleton effective domain},\displaystyle\{\mbox{\rm{L${}^{\natural}$}-convex}\}\cap\{\mbox{\rm M-convex}\}=\{\mbox{\rm singleton effective domain}\}, (4.8)
{multimodular}{L-convex}={separable convex},\displaystyle\{\mbox{\rm multimodular}\}\cap\{\mbox{\rm{L${}^{\natural}$}-convex}\}=\{\mbox{\rm separable convex}\}, (4.9)
{multimodular}{L2-convex}={separable convex},\displaystyle\{\mbox{\rm multimodular}\}\cap\{\mbox{\rm{L${}^{\natural}_{2}$}-convex}\}=\{\mbox{\rm separable convex}\}, (4.10)
{multimodular}{M-convex}{separable convex},\displaystyle\{\mbox{\rm multimodular}\}\cap\{\mbox{\rm{M${}^{\natural}$}-convex}\}\supsetneqq\{\mbox{\rm separable convex}\}, (4.11)
{globally d.m.c.}{directed d.m.c.}{L-convex}.\displaystyle\{\mbox{\rm globally d.m.c.}\}\cap\{\mbox{\rm directed d.m.c.}\}\supsetneqq\{\mbox{\rm{L${}^{\natural}$}-convex}\}. (4.12)

The relations (4.1) and (4.2) will be established in Theorems 4.1 and 4.2, respectively. The relations (4.3) and (4.4) are obvious. The relations (4.5) and (4.6) are well known, originating in [23, Theorem 3.17] and stated in [16, Theorem 8.49]. The relations (4.7) and (4.8) will be discussed in Section 4.3. The relations (4.9) and (4.10) will be established in Theorems 5.8 and 5.7, respectively, and (4.11) is discussed in Section 5.4. The relation (4.12) is treated in Section 4.5.

In the following we make observations, major and minor, for each case of intersection.

4.1 L-convexity

In this section we deal with intersection of classes related to L-convexity (and its relatives like L-, L2-, and L2{}^{\natural}_{2}-convexity).

  • L-convexity & separable convexity: An L-convex set SS is a box if and only if S=nS={\mathbb{Z}}^{n}. Hence, a function ff is both L-convex and separable convex if and only if ff is a linear function with domf=n{\rm dom\,}f={\mathbb{Z}}^{n}.

    (\because An L-convex set SS has translation invariance (xS,μx+μ𝟏Sx\in S,\ \mu\in{\mathbb{Z}}\Rightarrow x+\mu{\bf 1}\in S) in (A.23). If a box has this property, it must be n{\mathbb{Z}}^{n}. The statement for a function follows from the statement for a set.)

  • L2-convexity & separable convexity: An L2-convex set SS is a box if and only if S=nS={\mathbb{Z}}^{n}. Hence, a function ff is both L2-convex and separable convex if and only if ff is a linear function with domf=n{\rm dom\,}f={\mathbb{Z}}^{n}.

    (\because An L2-convex set SS also has translation invariance.)

  • L2-convexity & L-convexity: A set SS is both L2-convex and L-convex if and only if SS is L-convex. Hence, a function ff is both L2-convex and L-convex if and only if ff is L-convex. See Theorem 4.1 below.

The following theorem shows that the combination of L2-convexity and L-convexity is equivalent to L-convexity.

Theorem 4.1.

(1) A set SS (n)(\subseteq{\mathbb{Z}}^{n}) is L-convex if and only if it is both L2-convex and L-convex.

(2) A function f:n{+}f:{\mathbb{Z}}^{n}\to{\mathbb{R}}\cup\{+\infty\} is L-convex if and only if it is both L2-convex and L-convex.

Proof.

(1) First note that the only-if-part of (1) is obvious. To prove the if-part, suppose that a set SS is both L2-convex and L-convex. Recall that an L-convex set SS is L-convex if and only if it is translation invariant in the sense that x+μ𝟏Sx+\mu{\bf 1}\in S for every xSx\in S and every integer μ\mu (see Theorem A.6). Since SS is L2-convex, it can be represented as S=S1+S2S=S_{1}+S_{2} with two L-convex sets S1S_{1} and S2S_{2}. Since S1S_{1} and S2S_{2} are translation invariant, SS is also translation invariant. Therefore, SS must be L-convex.

(2) The statement (2) for functions follows from (1) for sets by the general principle given in Proposition 2.4. Note that the assumption (2.23) holds for L2-convexity and L-convexity (see [16, Proposition 8.40] for this statement about L2-convexity). ∎

4.2 M-convexity

In this section we deal with intersection of classes related to M-convexity (and its relatives like M-, M2-, and M2{}^{\natural}_{2}-convexity).

  • M-convexity & separable convexity: An M-convex set SS is a box if and only if SS is a singleton (a set consisting of a single point). Hence, a function ff is both M-convex and separable convex if and only if ff is a function whose effective domain is a singleton.

    (\because An M-convex set SS consists of vectors with a constant component sum, i.e., x(N)=y(N)x(N)=y(N) for all x,ySx,y\in S. If a box has this property, it must be a singleton. The statement for a function follows from the statement for a set.)

  • M2-convexity & separable convexity: An M2-convex set SS is a box if and only if SS is a singleton. Hence, a function ff is both M2-convex and separable convex if and only if ff is a function whose effective domain is a singleton.

    (\because An M2-convex set SS also consists of vectors with a constant component sum.)

  • M2-convexity & M-convexity: A set SS is both M2-convex and M-convex if and only if SS is M-convex. Hence, a function ff is both M2-convex and M-convex if and only if ff is M-convex. See Theorem 4.2 below.

The following theorem shows that the combination of M2-convexity and M-convexity is equivalent to M-convexity.

Theorem 4.2.

(1) A set SS (n)(\subseteq{\mathbb{Z}}^{n}) is M-convex if and only if it is both M2-convex and M-convex.

(2) A function f:n{+}f:{\mathbb{Z}}^{n}\to{\mathbb{R}}\cup\{+\infty\} is M-convex if and only if it is both M2-convex and M-convex.

Proof.

(1) First note that the only-if-part of (1) is obvious. To prove the if-part, suppose that a set SS is both M2-convex and M-convex. Recall that an M-convex set SS is M-convex if and only if x(N)=y(N)x(N)=y(N) for all x,ySx,y\in S (see Section A.4.2). On the other hand, an M2-convex set has this property. Therefore, SS must be M-convex.

(2) The statement (2) for functions follows from (1) for sets by the general principle given in Proposition 2.5. Note that the assumption (2.23) holds for M2-convexity and M-convexity (see [16, Theorems 8.17] for this statement about M2-convexity). ∎

4.3 L-convexity and M-convexity

In this section we deal with intersection of classes of L-convexity and M-convexity (including their variants).

  • L-convexity & M-convexity: There exists no set that is both L-convex and M-convex. Hence, there exists no function that is both L-convex and M-convex.

    (\because An L-convex set SS has translation invariance (xS,μx+μ𝟏Sx\in S,\ \mu\in{\mathbb{Z}}\Rightarrow x+\mu{\bf 1}\in S) in (A.23). An M-convex set SS consists of vectors with a constant component sum, i.e., x(N)=y(N)x(N)=y(N) for all x,ySx,y\in S. These two properties are not compatible.)

  • L-convexity & M-convexity: A set SS is both L-convex and M-convex if and only if SS is a box. Hence, a function ff is both L-convex and M-convex if and only if ff is separable convex. See [23, Theorem 3.17] and [16, Theorem 8.49].

  • L-convexity & M-convexity: A set SS is both L-convex and M-convex if and only if S=nS={\mathbb{Z}}^{n}. Hence, a function ff is both L-convex and M-convex if and only if ff is a linear function with domf=n{\rm dom\,}f={\mathbb{Z}}^{n}.

    (\because An M-convex set SS has the property that if x,ySx,y\in S and xyx\leq y, then [x,y]S[x,y]_{{\mathbb{Z}}}\subseteq S, whereas an L-convex set has translation invariance. Hence S=nS={\mathbb{Z}}^{n}.)

  • L-convexity & M-convexity: A set SS is both L-convex and M-convex if and only if SS is a singleton. Hence, a function ff is both L-convex and M-convex if and only if ff is a function whose effective domain is a singleton.

    (\because An L-convex set SS has the property that x,ySx,y\in S implies xy,xySx\vee y,x\wedge y\in S, whereas an M-convex set consists of vectors with a constant component sum. Hence SS must be a singleton.)

The four statements above can be extended to L2-convexity and M2-convexity without substantial changes.

  • L2-convexity & M2-convexity: There exists no set that is both L2-convex and M2-convex. Hence, there exists no function that is both L2-convex and M2-convex. This implies that there exists no function that is both L2-convex and M-convex (or L-convex and M2-convex).

    (\because An L2-convex set has translation invariance and an M2-convex set consists of vectors with a constant component sum.)

  • L2{}^{\natural}_{2}-convexity & M2{}^{\natural}_{2}-convexity: A set SS is both L2{}^{\natural}_{2}-convex and M2{}^{\natural}_{2}-convex if and only if SS is a box. Hence, a function ff is both L2{}^{\natural}_{2}-convex and M2{}^{\natural}_{2}-convex if and only if ff is separable convex. See [23, Theorem 3.17], [16, Theorem 8.49], and [11]. This implies that a function ff is both L2{}^{\natural}_{2}-convex and M-convex (or M2{}^{\natural}_{2}-convex and L-convex) if and only if ff is separable convex.

  • L2-convexity & M2{}^{\natural}_{2}-convexity: A set SS is both L2-convex and M2{}^{\natural}_{2}-convex if and only if S=nS={\mathbb{Z}}^{n} (proved below). Hence, a function ff is both L2-convex and M2{}^{\natural}_{2}-convex if and only if ff is a linear function with domf=n{\rm dom\,}f={\mathbb{Z}}^{n}. This implies that a function ff is both L2-convex and M-convex (or L-convex and M2{}^{\natural}_{2}-convex) if and only if ff is a linear function with domf=n{\rm dom\,}f={\mathbb{Z}}^{n}.

    (\because An M2{}^{\natural}_{2}-convex set SS has the property that if x,ySx,y\in S and xyx\leq y, then [x,y]S[x,y]_{{\mathbb{Z}}}\subseteq S, whereas an L2-convex set has translation invariance. Hence S=nS={\mathbb{Z}}^{n}.)

  • L2{}^{\natural}_{2}-convexity & M2-convexity: A set SS is both L2{}^{\natural}_{2}-convex and M2-convex if and only if SS is a singleton (proved below). Hence, a function ff is both L2{}^{\natural}_{2}-convex and M2-convex if and only if domf{\rm dom\,}f is a singleton. This implies that a function ff is both L2{}^{\natural}_{2}-convex and M-convex (or L-convex and M2-convex) if and only if domf{\rm dom\,}f is a singleton.

    (\because Let S=S1+S2S=S_{1}+S_{2} with L-convex sets SkS_{k} for k=1,2k=1,2. Take x,ySx,y\in S and represent them as x=x1+x2x=x_{1}+x_{2} and y=y1+y2y=y_{1}+y_{2} with xk,ykSkx_{k},y_{k}\in S_{k} for k=1,2k=1,2. Define zk=xkykz_{k}=x_{k}\vee y_{k} for k=1,2k=1,2, and z=z1+z2z=z_{1}+z_{2}. Since zkSkz_{k}\in S_{k} for k=1,2k=1,2, we have zSz\in S. By M2-convexity of SS, we have z(N)=x(N)z(N)=x(N), while

    z(N)=z1(N)+z2(N)=(x1y1)(N)+(x2y2)(N)x1(N)+x2(N)=x(N).z(N)=z_{1}(N)+z_{2}(N)=(x_{1}\vee y_{1})(N)+(x_{2}\vee y_{2})(N)\geq x_{1}(N)+x_{2}(N)=x(N).

    It then follows that xkyk=xkx_{k}\vee y_{k}=x_{k} for k=1,2k=1,2, that is, ykxky_{k}\leq x_{k} for k=1,2k=1,2. By symmetry we also have ykxky_{k}\geq x_{k} for k=1,2k=1,2, and hence x=yx=y.)

4.4 Multimodularity

In this section we deal with intersection of classes related to multimodularity.

  • Multimodularity & L-convexity: A set SS is both multimodular and L-convex if and only if S=nS={\mathbb{Z}}^{n}. Hence, a function ff is both multimodular and L-convex if and only if ff is a linear function with domf=n{\rm dom\,}f={\mathbb{Z}}^{n}.

    (\because A multimodular set SS has the property that if x,ySx,y\in S and xyx\leq y, then [x,y]S[x,y]_{{\mathbb{Z}}}\subseteq S (Proposition 5.6), whereas an L-convex set has translation invariance. Hence S=nS={\mathbb{Z}}^{n}.)

  • Multimodularity & L2-convexity: A set SS is both multimodular and L2-convex if and only if S=nS={\mathbb{Z}}^{n}. Hence, a function ff is both multimodular and L2-convex if and only if ff is a linear function with domf=n{\rm dom\,}f={\mathbb{Z}}^{n}.

    (\because A multimodular set SS has the property that if x,ySx,y\in S and xyx\leq y, then [x,y]S[x,y]_{{\mathbb{Z}}}\subseteq S (Proposition 5.6), whereas an L2-convex set has translation invariance. Hence S=nS={\mathbb{Z}}^{n}.)

  • Multimodularity & L-convexity: A set SS is both multimodular and L-convex if and only if SS is a box. Hence, a function ff is both multimodular and L-convex if and only if ff is separable convex. See Theorem 5.8 in Section 5.3.

  • Multimodularity & L2{}^{\natural}_{2}-convexity: A set SS is both multimodular and L2{}^{\natural}_{2}-convex if and only if SS is a box. Hence, a function ff is both multimodular and L2{}^{\natural}_{2}-convex if and only if ff is separable convex. See Theorem 5.7 in Section 5.3.

  • Multimodularity & M-convexity: A function in two variables is multimodular if and only if it is M-convex [9, Remark 2.2]. For functions in more than two variables, these classes are distinct, without inclusion relation. The intersection of these classes can be analyzed and understood fairly well (Section 5.4.1). Figure 1(a) demonstrates their nonempty intersection, giving a concrete example of a set that is both multimodular and M-convex (see Example 5.3). This is also an instance of a set that is both multimodular and M2{}^{\natural}_{2}-convex.

  • Multimodularity & M-convexity: The intersection of these two classes can be analyzed and understood fairly well (Section 5.4.1). Their intersection is nonempty, which is demonstrated by the set SS in (3.2). That is, the set of integer points on the maximal face (x1+x2+x3=6x_{1}+x_{2}+x_{3}=6) in Fig. 1(a) is both multimodular and M-convex (see Example 5.3). This is also an instance of a set that is both multimodular and M2-convex.

4.5 Discrete midpoint convexity

In this section we deal with intersection of classes related to discrete midpoint convexity. We first note that the indicator function of any nonempty subset of {0,1}n\{0,1\}^{n} is (globally, directed) discrete midpoint convex. We also note that the indicator function of S={(2,0),(1,1),(0,2)}S=\{(2,0),(1,1),(0,2)\} is (globally, directed) discrete midpoint convex.

  • d.m.c. & L2-convexity: An L-convex function is both L2-convex and (globally, directed) discrete midpoint convex, but it is not known whether the converse is true or not.

  • d.m.c. & L2{}^{\natural}_{2}-convexity: An L-convex function is both L2{}^{\natural}_{2}-convex and (globally, directed) discrete midpoint convex, but it is not known whether the converse is true or not.

  • d.m.c. & M-convexity: For every matroid, the indicator function of the set of its bases is both M-convex and (globally, directed) discrete midpoint convex. The indicator function of S={(2,0),(1,1),(0,2)}S=\{(2,0),(1,1),(0,2)\} is both M-convex and (globally, directed) discrete midpoint convex. Note that general separable convex functions do not belong to this class because a separable convex function ff is M-convex only if domf{\rm dom\,}f is a singleton.

  • d.m.c. & M2-convexity: For every pair of matroids, the indicator function of the set of their common bases is both M2-convex and (globally, directed) discrete midpoint convex. For example, the indicator function of S={(1,1,0,0),(0,0,1,1)}S=\{(1,1,0,0),(0,0,1,1)\} is both M2-convex and (globally, directed) discrete midpoint convex.

  • d.m.c. & M-convexity: A separable convex function is both M-convex and (globally, directed) discrete midpoint convex. The converse is not true. Indeed, the set S={(1,0),(0,1)}S=\{(1,0),(0,1)\} is M-convex and (globally, directed) discrete midpoint convex, but it is not a box. Another example is given by S={(2,0),(1,1),(0,2)}S=\{(2,0),(1,1),(0,2)\}.

  • d.m.c. & M2{}^{\natural}_{2}-convexity: A separable convex function is both M2{}^{\natural}_{2}-convex and (globally, directed) discrete midpoint convex. The converse is not true, as shown by the indicator function of S={(1,0),(0,1)}S=\{(1,0),(0,1)\} or S={(2,0),(1,1),(0,2)}S=\{(2,0),(1,1),(0,2)\}.

  • d.m.c. & multimodularity: A separable convex function is both multimodular and (globally, directed) discrete midpoint convex. The converse is not true, as shown by the indicator function of S={(1,0),(0,1)}S=\{(1,0),(0,1)\} or S={(2,0),(1,1),(0,2)}S=\{(2,0),(1,1),(0,2)\}.

  • globally d.m.c. & directed d.m.c.: An L-convex function is both globally and directed discrete midpoint convex. The converse is not true, as shown by the indicator function of S={(1,0),(0,1)}S=\{(1,0),(0,1)\} or S={(2,0),(1,1),(0,2)}S=\{(2,0),(1,1),(0,2)\}.

5 Results Concerning Multimodularity

In this section we present results concerning multimodularity, which constitute the major technical contributions of this paper.

5.1 Definition of multimodularity

Recall that eie^{i} denotes the iith unit vector for i=1,2,,ni=1,2,\ldots,n, and let n\mathcal{F}\subseteq{\mathbb{Z}}^{n} be a set of vectors defined by

={e1,e1e2,e2e3,,en1en,en}.\mathcal{F}=\{-e^{1},e^{1}-e^{2},e^{2}-e^{3},\ldots,e^{n-1}-e^{n},e^{n}\}. (5.1)

A finite-valued function f:nf:{\mathbb{Z}}^{n}\to{\mathbb{R}} is said to be multimodular [8] if it satisfies

f(z+d)+f(z+d)f(z)+f(z+d+d)f(z+d)+f(z+d^{\prime})\geq f(z)+f(z+d+d^{\prime}) (5.2)

for all znz\in{\mathbb{Z}}^{n} and all distinct d,dd,d^{\prime}\in\mathcal{F}. It is known [8, Proposition 2.2] that f:nf:{\mathbb{Z}}^{n}\to{\mathbb{R}} is multimodular if and only if the function f~:n+1\tilde{f}:{\mathbb{Z}}^{n+1}\to{\mathbb{R}} defined by

f~(x0,x)=f(x1x0,x2x1,,xnxn1)(x0,xn)\tilde{f}(x_{0},x)=f(x_{1}-x_{0},x_{2}-x_{1},\ldots,x_{n}-x_{n-1})\qquad(x_{0}\in{\mathbb{Z}},x\in{\mathbb{Z}}^{n}) (5.3)

is submodular in n+1n+1 variables. This characterization enables us to define multimodularity for a function that may take the infinite value ++\infty. That is, we say [9, 17] that a function f:n{+}f:{\mathbb{Z}}^{n}\to{\mathbb{R}}\cup\{+\infty\} with domf{\rm dom\,}f\not=\emptyset is multimodular if the function f~:n+1{+}\tilde{f}:{\mathbb{Z}}^{n+1}\to{\mathbb{R}}\cup\{+\infty\} associated with ff by (5.3) is submodular.

Multimodularity and L-convexity have the following close relationship.

Theorem 5.1 ([17]).

A function f:n{+}f:{\mathbb{Z}}^{n}\to{\mathbb{R}}\cup\{+\infty\} is multimodular if and only if the function g:n{+}g:{\mathbb{Z}}^{n}\to{\mathbb{R}}\cup\{+\infty\} defined by

g(y)=f(y1,y2y1,y3y2,,ynyn1)(yn)g(y)=f(y_{1},\ y_{2}-y_{1},\ y_{3}-y_{2},\ldots,y_{n}-y_{n-1})\qquad(y\in{\mathbb{Z}}^{n}) (5.4)

is L-convex.  

The relation (5.4) between ff and gg can be rewritten as

f(x)=g(x1,x1+x2,x1+x2+x3,,x1++xn)(xn).f(x)=g(x_{1},\ x_{1}+x_{2},\ x_{1}+x_{2}+x_{3},\ldots,x_{1}+\cdots+x_{n})\qquad(x\in{\mathbb{Z}}^{n}). (5.5)

Using a bidiagonal matrix D=(dij1i,jn)D=(d_{ij}\mid 1\leq i,j\leq n) defined by

dii=1(i=1,2,,n),di+1,i=1(i=1,2,,n1),d_{ii}=1\quad(i=1,2,\ldots,n),\qquad d_{i+1,i}=-1\quad(i=1,2,\ldots,n-1), (5.6)

we can express (5.4) and (5.5) more compactly as g(y)=f(Dy)g(y)=f(Dy) and f(x)=g(D1x)f(x)=g(D^{-1}x), respectively. The matrix DD is unimodular, and its inverse D1D^{-1} is an integer lower-triangular matrix with (D1)ij=1(D^{-1})_{ij}=1 for iji\geq j and (D1)ij=0(D^{-1})_{ij}=0 for i<ji<j. For n=4n=4, for example, we have

D=[1000110001100011],D1=[1000110011101111].D={\small\left[\begin{array}[]{rrrr}1&0&0&0\\ -1&1&0&0\\ 0&-1&1&0\\ 0&0&-1&1\\ \end{array}\right]},\qquad D^{-1}={\small\left[\begin{array}[]{rrrr}1&0&0&0\\ 1&1&0&0\\ 1&1&1&0\\ 1&1&1&1\\ \end{array}\right]}.

A nonempty set SS is called multimodular if its indicator function δS\delta_{S} is multimodular. The effective domain of a multimodular function is a multimodular set. The following proposition, a special case of Theorem 5.1, connects multimodular sets with L-convex sets.

Proposition 5.2 ([9, 17]).

A set SnS\subseteq{\mathbb{Z}}^{n} is multimodular if and only if it can be represented as S={DyyT}S=\{Dy\mid y\in T\} for some L-convex set TT, where TT is uniquely determined from SS as T={D1xxS}T=\{D^{-1}x\mid x\in S\}.  

Theorem 5.3.

Under the assumption (2.21), a function f:n{+}f:{\mathbb{Z}}^{n}\to{\mathbb{R}}\cup\{+\infty\} is multimodular if and only if, for any vector pnp\in{\mathbb{R}}^{n}, argminf[p]\arg\min f[-p] is a multimodular set or an empty set.

Proof.

By Theorem 5.1 and Proposition 5.2, this follows from the corresponding statement (Theorem A.7) for L-convex functions. ∎

Example 5.1.

Proposition 5.2 is applied to the set S={x3x=t(1,1,1),t}S=\{x\in{\mathbb{Z}}^{3}\mid x=t(1,1,1),t\in{\mathbb{Z}}\} treated in Example 3.1. We have

T={D1xxS}={x3x=t(1,2,3),t},T=\{D^{-1}x\mid x\in S\}=\{x\in{\mathbb{Z}}^{3}\mid x=t(1,2,3),t\in{\mathbb{Z}}\},

which is not L-convex. Hence SS is not multimodular.  

The reader is referred to [1, 2, 8, 9, 17, 18] for more about multimodularity.

5.2 Polyhedral description of multimodular sets

As a technical basis for our argument in Sections 5.3 and 5.4, we show a description of multimodular sets by inequalities. A subset II of the index set N={1,2,,n}N=\{1,2,\ldots,n\} is said to be consecutive if it consists of consecutive numbers, that is, it is a set of the form I={k,k+1,,l1,l}I=\{k,k+1,\ldots,l-1,l\} for some klk\leq l.

Theorem 5.4.

(1) If SnS\subseteq\mathbb{Z}^{n} is a multimodular set, then S=S¯nS=\overline{S}\cap{\mathbb{Z}}^{n} and its convex hull S¯\overline{S} is represented as

S¯={xnaIx(I)bI(I: consecutive subset of N)},\overline{S}=\{x\in{\mathbb{R}}^{n}\mid a_{I}\leq x(I)\leq b_{I}\ \ (\mbox{\rm$I$: consecutive subset of $N$})\}, (5.7)

where aI=inf{x(I)xS}a_{I}=\inf\{x(I)\mid x\in S\} and bI=sup{x(I)xS}b_{I}=\sup\{x(I)\mid x\in S\} for consecutive subsets II of NN.

(2) For any aI{}a_{I}\in{\mathbb{Z}}\cup\{-\infty\} and bI{+}b_{I}\in{\mathbb{Z}}\cup\{+\infty\} indexed by the family of consecutive subsets II of NN, the polyhedron PP defined by

P={xnaIx(I)bI(I: consecutive subset of N)}P=\{x\in{\mathbb{R}}^{n}\mid a_{I}\leq x(I)\leq b_{I}\ \ (\mbox{\rm$I$: consecutive subset of $N$})\} (5.8)

is an integer polyhedron, and S:=PnS:=P\cap{\mathbb{Z}}^{n} is a multimodular set, provided PP\neq\emptyset.

(3) A nonempty set SnS\subseteq\mathbb{Z}^{n} is multimodular if and only if it can be represented as

S={xnaIx(I)bI(I: consecutive subset of N)}S=\{x\in{\mathbb{Z}}^{n}\mid a_{I}\leq x(I)\leq b_{I}\ \ (\mbox{\rm$I$: consecutive subset of $N$})\} (5.9)

for some aI{}a_{I}\in{\mathbb{Z}}\cup\{-\infty\} and bI{+}b_{I}\in{\mathbb{Z}}\cup\{+\infty\} indexed by the family of consecutive subsets II of NN.

Proof.

First we mention some basic facts about the transformation x=Dyx=Dy connecting multimodularity and L-convexity. Define T={yy=D1x,xS}T=\{y\mid y=D^{-1}x,\ x\in S\} for any nonempty set SnS\subseteq{\mathbb{Z}}^{n}. Then the convex hulls of TT and SS correspond to each other by the same transformation, that is, T¯={yy=D1x,xS¯}\overline{T}=\{y\mid y=D^{-1}x,\ x\in\overline{S}\}. Since DD is unimodular, T=T¯nT=\overline{T}\cap{\mathbb{Z}}^{n} holds if and only if S=S¯nS=\overline{S}\cap{\mathbb{Z}}^{n}. By Theorem A.4, TT is L-convex if and only if T=T¯nT=\overline{T}\cap{\mathbb{Z}}^{n} and its convex hull T¯\overline{T} can be described as

αiyiβi(i=1,2,,n),\displaystyle\alpha_{i}\leq y_{i}\leq\beta_{i}\quad(i=1,2,\ldots,n),
αijyjyiβij(1i<jn).\displaystyle\alpha_{ij}\leq y_{j}-y_{i}\leq\beta_{ij}\quad(1\leq i<j\leq n).

By substituting yi=x1+x2++xiy_{i}=x_{1}+x_{2}+\cdots+x_{i} (i=1,2,,n)(i=1,2,\ldots,n), i.e., y=D1xy=D^{-1}x, into these inequalities we obtain the following inequalities

αix1+x2++xiβi(i=1,2,,n),\displaystyle\alpha_{i}\leq x_{1}+x_{2}+\cdots+x_{i}\leq\beta_{i}\quad(i=1,2,\ldots,n), (5.10)
αijxi+1+xi+2++xjβij(1i<jn)\displaystyle\alpha_{ij}\leq x_{i+1}+x_{i+2}+\cdots+x_{j}\leq\beta_{ij}\quad(1\leq i<j\leq n) (5.11)

to describe S¯\overline{S}.

(1) If SS is multimodular, then TT is L-convex by Proposition 5.2. By the above argument, S¯\overline{S} is described by (5.10) and (5.11). These inequalities are of the form of aIx(I)bIa_{I}\leq x(I)\leq b_{I} for consecutive subsets II; (5.10) corresponding to I={1,2,,i}I=\{1,2,\ldots,i\} and (5.11) to I={i+1,i+2,,j}I=\{i+1,i+2,\ldots,j\}. Thus we obtain (5.7). Then the validity of the expressions aI=inf{x(I)xS}a_{I}=\inf\{x(I)\mid x\in S\} and bI=sup{x(I)xS}b_{I}=\sup\{x(I)\mid x\in S\} is obvious.

(2) Let 1\mathcal{I}_{1} (resp., 2\mathcal{I}_{2}) denote the family of consecutive subsets II for which bIb_{I} (resp., aIa_{I}) is finite. Let A1A_{1} (resp., A2A_{2}) denote the matrix whose rows are indexed by 1\mathcal{I}_{1} (resp., 2\mathcal{I}_{2}) and whose row vector corresponding to II is the characteristic vector eIe^{I}. Let bb (resp., aa) be the vector (bII1)(b_{I}\mid I\in\mathcal{I}_{1}) (resp., (aII2)(a_{I}\mid I\in\mathcal{I}_{2})). Then the system of inequalities in (5.8) can be expressed as

[A1A2]x[ba].\left[\begin{array}[]{r}A_{1}\\ -A_{2}\\ \end{array}\right]x\leq\left[\begin{array}[]{r}b\\ -a\\ \end{array}\right]. (5.12)

Each of the matrices A1A_{1} and A2A_{2} is a so-called interval matrix (row-oriented), or a matrix with consecutive ones property. This implies that the combined matrix [A1A2]\left[\begin{array}[]{r}A_{1}\\ A_{2}\\ \end{array}\right] is also an interval matrix, which is known to be totally unimodular [29, p.279, Example 7]. Therefore, the coefficient matrix [A1A2]\left[\begin{array}[]{r}A_{1}\\ -A_{2}\\ \end{array}\right] is also totally unimodular, and hence (5.12) determines an integer polyhedron, which shows the integrality of PP in (5.8). The integrality of PP implies P=Pn¯P=\overline{P\cap{\mathbb{Z}}^{n}}, which is equivalent to saying that the convex hull of S=PnS=P\cap{\mathbb{Z}}^{n} is described by (5.10) and (5.11). Then the basic facts presented at the beginning of the proof shows that T={yy=D1x,xS}T=\{y\mid y=D^{-1}x,\ x\in S\} is L-convex, which implies, by Proposition 5.2, that SS is multimodular.

(3) If SS is multimodular, the statement (1) shows that S=S¯nS=\overline{S}\cap{\mathbb{Z}}^{n} and S¯\overline{S} is described as (5.7). Hence follows (5.9). Conversely, suppose that (5.9) holds. Then P:=S¯P:=\overline{S} is described as (5.8) by the integrality of a polyhedron of the form of (5.8) shown in the statement (2). The expression (5.9) implies S=S¯n=PnS=\overline{S}\cap{\mathbb{Z}}^{n}=P\cap{\mathbb{Z}}^{n}, while PnP\cap{\mathbb{Z}}^{n} is multimodular by the statement in (2). Thus SS is multimodular if it is represented as (5.9). ∎

It is worth while mentioning box-total dual integrality of the inequality system given in Theorem 5.4. A linear inequality system AxbAx\leq b (in general) is said to be totally dual integral (TDI) if the entries of AA and bb are rational numbers and the minimum in the linear programming duality equation

max{wxAxb}=min{ybyA=w,y0}\max\{w^{\top}x\mid Ax\leq b\}\ =\ \min\{y^{\top}b\mid y^{\top}A=w^{\top},\ y\geq 0\}

has an integral optimal solution yy for every integral vector ww such that the minimum is finite ([29, 30]). A linear inequality system AxbAx\leq b is said to be box-totally dual integral (box-TDI) if the system [Axb,dxc][Ax\leq b,d\leq x\leq c] is TDI for each choice of rational (finite-valued) vectors cc and dd. It is known [30, Theorem 5.35] that the system AxbAx\leq b is box-TDI if the matrix AA is totally unimodular. A polyhedron is called a box-TDI polyhedron if it can be described by a box-TDI system.

Theorem 5.5.

The inequality system (5.8) describing the convex hull of a multimodular set is box-TDI, and the convex hull of a multimodular set is a box-TDI integer polyhedron.

Proof.

In the proof of Theorem 5.4, the matrix [A1A2]\left[\begin{array}[]{r}A_{1}\\ -A_{2}\\ \end{array}\right] in (5.12) is totally unimodular. This implies that the inequality system (5.12) is box-TDI. ∎

Theorem 5.4 implies immediately that a box of integers is a multimodular set, which was pointed out first in [9, Proposition 2]. Another immediate consequence of Theorem 5.4 is the following property of multimodular sets. This fact will be used in the proof of Theorem 5.7 in Section 5.3.

Proposition 5.6.

Let SS be a multimodular set. If x,ySx,y\in S and xyx\leq y, then [x,y]S[x,y]_{{\mathbb{Z}}}\subseteq S.

Proof.

Recall the description of SS in (5.9). For any z[x,y]z\in[x,y]_{{\mathbb{Z}}} we have aIx(I)z(I)y(I)bIa_{I}\leq x(I)\leq z(I)\leq y(I)\leq b_{I} for every consecutive index set II, which shows zSz\in S. ∎

Example 5.2.

We show the multimodularity of S={x3x=t(1,1,1),t}S=\{x\in{\mathbb{Z}}^{3}\mid x=t(1,-1,1),t\in{\mathbb{Z}}\}, which was stated in Example 3.2 without proof. This set admits a representation

S={x3x1+x2=0,x2+x3=0}S=\{x\in{\mathbb{Z}}^{3}\mid x_{1}+x_{2}=0,\ x_{2}+x_{3}=0\}

of the form of (5.9), and therefore it is multimodular by Theorem 5.4. Multimodularity of SS can also be verified by Proposition 5.2. Indeed we have

T={D1xxS}={x3x=t(1,0,1),t},T=\{D^{-1}x\mid x\in S\}=\{x\in{\mathbb{Z}}^{3}\mid x=t(1,0,1),t\in{\mathbb{Z}}\},

which is L-convex.  

5.3 Multimodularity and L2{}^{\natural}_{2}-convexity

The following theorem shows the combination of multimodularity and L2{}^{\natural}_{2}-convexity is equivalent to separable convexity.

Theorem 5.7.

(1) A set SS (n)(\subseteq{\mathbb{Z}}^{n}) is a box of integers if and only if it is both multimodular and L2{}^{\natural}_{2}-convex.

(2) A function f:n{+}f:{\mathbb{Z}}^{n}\to{\mathbb{R}}\cup\{+\infty\} is separable convex if and only if it is both multimodular and L2{}^{\natural}_{2}-convex.

Proof.

(1) First note that the only-if-part of (1) is obvious. The if-part can be shown as follows. Let S1,S2nS_{1},S_{2}\subseteq{\mathbb{Z}}^{n} be L-convex sets such that S=S1+S2S=S_{1}+S_{2}.

In the case of bounded SS, the proof is quite easy as follows. Each SkS_{k} has the unique minimum element akSka^{k}\in S_{k} and the unique maximum element bkSkb^{k}\in S_{k}. Then a=a1+a2a=a^{1}+a^{2} is the unique minimum of SS and b=b1+b2b=b^{1}+b^{2} is the unique maximum of SS, for which we have S[a,b]S\subseteq[a,b]_{{\mathbb{Z}}}. By Proposition 5.6, on the other hand, it follows from a,bSa,b\in S and aba\leq b that [a,b]S[a,b]_{{\mathbb{Z}}}\subseteq S. Thus we have proved S=[a,b]S=[a,b]_{{\mathbb{Z}}}.

The general case with (possibly) unbounded SS can be treated as follows. For each iNi\in N, put ai=infySyia_{i}=\inf_{y\in S}y_{i} and bi=supySyib_{i}=\sup_{y\in S}y_{i}, where we have the possibility of ai=a_{i}=-\infty and/or bi=+b_{i}=+\infty. Obviously, S[a,b]S\subseteq[a,b]_{{\mathbb{Z}}} holds. To prove [a,b]S[a,b]_{{\mathbb{Z}}}\subseteq S, take any x[a,b]x\in[a,b]_{{\mathbb{Z}}}. For each iNi\in N, there exist vectors pi,qiSp^{i},q^{i}\in S such that piixiqiip^{i}_{i}\leq x_{i}\leq q^{i}_{i}, where piip^{i}_{i}, xix_{i}, and qiiq^{i}_{i} denote the iith component of vectors pip^{i}, xx, and qiq^{i}, respectively. Since pi,qiS=S1+S2p^{i},q^{i}\in S=S_{1}+S_{2}, we can express them as pi=pi1+pi2p^{i}=p^{i1}+p^{i2}, qi=qi1+qi2q^{i}=q^{i1}+q^{i2} with some pik,qikSkp^{ik},q^{ik}\in S_{k} (k=1,2)(k=1,2). Consider

p^k=iNpikSk(k=1,2),p^=p^1+p^2S,q^k=iNqikSk(k=1,2),q^=q^1+q^2S.\begin{array}[]{ll}\displaystyle\hat{p}^{k}=\bigwedge_{i\in N}p^{ik}\in S_{k}\quad(k=1,2),\qquad\hat{p}=\hat{p}^{1}+\hat{p}^{2}\in S,\\ \displaystyle\hat{q}^{k}=\bigvee_{i\in N}q^{ik}\in S_{k}\quad(k=1,2),\qquad\hat{q}=\hat{q}^{1}+\hat{q}^{2}\in S.\end{array}

Then, for each component iNi\in N,we have

p^i\displaystyle\hat{p}_{i} =p^i1+p^i2pii1+pii2=piixi,\displaystyle=\hat{p}^{1}_{i}+\hat{p}^{2}_{i}\leq p^{i1}_{i}+p^{i2}_{i}=p^{i}_{i}\leq x_{i},
q^i\displaystyle\hat{q}_{i} =q^i1+q^i2qii1+qii2=qiixi,\displaystyle=\hat{q}^{1}_{i}+\hat{q}^{2}_{i}\geq q^{i1}_{i}+q^{i2}_{i}=q^{i}_{i}\geq x_{i},

which shows x[p^,q^]x\in[\hat{p},\hat{q}]_{{\mathbb{Z}}}. By Proposition 5.6, it follows from p^,q^S\hat{p},\hat{q}\in S and p^q^\hat{p}\leq\hat{q} that [p^,q^]S[\hat{p},\hat{q}]_{{\mathbb{Z}}}\subseteq S. Therefore, x[p^,q^]Sx\in[\hat{p},\hat{q}]_{{\mathbb{Z}}}\subseteq S, where xx is an arbitrarily chosen element of [a,b][a,b]_{{\mathbb{Z}}}. Hence [a,b]S[a,b]_{{\mathbb{Z}}}\subseteq S. Thus we complete the proof of S=[a,b]S=[a,b]_{{\mathbb{Z}}}.

(2) The statement (2) for functions follows from (1) for sets by the general principle given in Proposition 2.3. Note that the assumption (2.23) holds for multimodularity and L2{}^{\natural}_{2}-convexity. ∎

The following statement, with L-convexity in place of L2{}^{\natural}_{2}-convexity, is an immediate corollary of Theorem 5.7, since separable convex functions are L-convex.

Theorem 5.8.

(1) A set SS (n)(\subseteq{\mathbb{Z}}^{n}) is a box of integers if and only if it is both multimodular and L-convex.

(2) A function f:n{+}f:{\mathbb{Z}}^{n}\to{\mathbb{R}}\cup\{+\infty\} is separable convex if and only if it is both multimodular and L-convex.  

5.4 Multimodularity and M-convexity

5.4.1 Theorem

For a,bN={1,2,,n}a,b\in N=\{1,2,\ldots,n\}, we denote the set of integers between aa and bb by I(a,b)I(a,b), that is,

I(a,b)={iaib}.I(a,b)=\{i\in{\mathbb{Z}}\mid a\leq i\leq b\}.

In particular, I(a,b)=I(a,b)=\emptyset if a>ba>b. We consider an integer-valued function r(a,b)r(a,b), defined for all (a,b)(a,b) with 1abn1\leq a\leq b\leq n, satisfying the following conditions:

r(a,a)0(aN),\displaystyle r(a,a)\geq 0\qquad(a\in N), (5.13)
max{r(a,b1),r(a+1,b)}r(a,b)(a<b),\displaystyle\max\{r(a,b-1),r(a+1,b)\}\leq r(a,b)\qquad(a<b), (5.14)
r(a,b)r(a,b1)+r(a+1,b)r(a+1,b1)(a<b).\displaystyle r(a,b)\leq r(a,b-1)+r(a+1,b)-r(a+1,b-1)\qquad(a<b). (5.15)

Recall the relation of M-convexity and polymatroids that the set of integer points of an integral polymatroid is precisely a bounded M-convex set containing the origin 𝟎{\bf 0} and contained in the nonnegative orthant. The following theorem characterizes an integral polymatroid that is also multimodular. The proof is given in Section 5.4.2.

Theorem 5.9.

A bounded set SS containing the origin 𝟎{\bf 0} and contained in the nonnegative orthant (𝟎S+n)({\bf 0}\in S\subseteq{\mathbb{Z}}_{+}^{n}) is both M-convex and multimodular if and only if it is described as

S={x+nx(I(a,b))r(a,b)(1abn)}S=\{x\in{\mathbb{Z}}_{+}^{n}\mid x(I(a,b))\leq r(a,b)\ (1\leq a\leq b\leq n)\} (5.16)

with a function rr satisfying (5.13)–(5.15).  

Remark 5.1.

The function rr can be interpreted as a set function ρ\rho defined for consecutive subsets of NN by ρ(I(a,b))=r(a,b)\rho(I(a,b))=r(a,b). The conditions (5.13), (5.14), and (5.15) correspond, respectively, to nonnegativity, monotonicity, and submodularity of ρ\rho (see Section 5.4.2 for the precise meaning). Every function rr satisfying (5.13)–(5.15) can be constructed as follows. First assign arbitrary nonnegative integers to r(a,a)r(a,a) for aNa\in N. Next, for each aN{n}a\in N-\{n\}, assign to r(a,a+1)r(a,a+1) an arbitrary integer between max{r(a,a),r(a+1,a+1)}\max\{r(a,a),r(a+1,a+1)\} and r(a,a)+r(a+1,a+1)r(a,a)+r(a+1,a+1). Then, for k=2,3,,n1k=2,3,\ldots,n-1, define r(a,b)r(a,b) with ba=kb-a=k that satisfy (5.14) and (5.15), which is possible because the right-hand side of (5.15) is not smaller than the left-hand side of (5.14).  

It follows from Theorem 5.9 with Theorem 5.4 that, if a bounded set SS is M-convex and multimodular, then the set of its maximal elements SmaxS_{\max} is M-convex and multimodular (Example 5.3). However, the converse is not true (Example 5.4).

Example 5.3.

Let SS be the set of integer points of the polymatroid in Fig. 1(a). This set can be described in the form of (5.16) as

S={x+3xi3(i=1,2,3),x1+x25,x2+x35,x1+x2+x36},S=\{x\in{\mathbb{Z}}_{+}^{3}\mid x_{i}\leq 3~{}(i=1,2,3),\ x_{1}+x_{2}\leq 5,\ x_{2}+x_{3}\leq 5,\ x_{1}+x_{2}+x_{3}\leq 6\},

which does not involve an inequality for x1+x3x_{1}+x_{3} corresponding to a non-consecutive subset {1,3}\{1,3\}. By Theorem 5.9, SS is both M-convex and multimodular. The set of the maximal elements

Smax={(3,0,3),(1,2,3),(3,2,1),(1,3,2),(2,3,1),(2,1,3),(2,2,2),(3,1,2)}S_{\max}=\{(3,0,3),(1,2,3),(3,2,1),(1,3,2),(2,3,1),(2,1,3),(2,2,2),(3,1,2)\}

is both M-convex and multimodular.  

Example 5.4.

Let SS be the set of integer points of the polymatroid in Fig. 1(b). This set can be described as

S=\displaystyle S= {x+3xi3(i=1,2,3),x1+x25,x1+x35,\displaystyle\{x\in{\mathbb{Z}}_{+}^{3}\mid x_{i}\leq 3~{}(i=1,2,3),\ x_{1}+x_{2}\leq 5,\ x_{1}+x_{3}\leq 5,
x2+x35,x1+x2+x36}.\displaystyle\phantom{AAAAAA}x_{2}+x_{3}\leq 5,\ x_{1}+x_{2}+x_{3}\leq 6\}. (5.17)

Since this expression involves a (non-redundant) inequality for x1+x3x_{1}+x_{3} corresponding to a non-consecutive subset {1,3}\{1,3\}, Theorem 5.9 (or Theorem 5.4) shows that SS is not multimodular. While SS is not multimodular, the set of its maximal elements

Smax\displaystyle S_{\max} ={x3xi3(i=1,2,3),x1+x25,x1+x35,\displaystyle=\{x\in{\mathbb{Z}}^{3}\mid x_{i}\leq 3~{}(i=1,2,3),\ x_{1}+x_{2}\leq 5,\ x_{1}+x_{3}\leq 5,
x2+x35,x1+x2+x3=6}\displaystyle\phantom{=AAAAAA}x_{2}+x_{3}\leq 5,\ x_{1}+x_{2}+x_{3}=6\}
={x31xi3(i=1,2,3),x1+x2+x3=6}\displaystyle=\{x\in{\mathbb{Z}}^{3}\mid 1\leq x_{i}\leq 3~{}(i=1,2,3),\ x_{1}+x_{2}+x_{3}=6\}

is multimodular by Theorem 5.4. Thus the set SmaxS_{\max} is both M-convex and multimodular in spite of the fact that SS itself is M-convex and not multimodular.  

Example 5.5.

Here is an example of an M-convex set that is not multimodular. Using the set SS in (5.17), consider

S^={x4(x1,x2,x3)S,x4=6(x1+x2+x3)},\hat{S}=\{x\in{\mathbb{Z}}^{4}\mid(x_{1},x_{2},x_{3})\in S,\ x_{4}=6-(x_{1}+x_{2}+x_{3})\},

which is the M-convex set treated in Example 3.4. Here we show that S^\hat{S} is not multimodular. To use Proposition 5.2, consider the set T^={D1xxS^}\hat{T}=\{D^{-1}x\mid x\in\hat{S}\}. Discrete midpoint convexity fails for y=(2,2,5,6)T^y=(2,2,5,6)\in\hat{T} (corresponding to (2,0,3,1)S^(2,0,3,1)\in\hat{S}) and z=(3,4,6,6)T^z=(3,4,6,6)\in\hat{T} (corresponding to (3,1,2,0)S^(3,1,2,0)\in\hat{S}) with (y+z)/2=(5/2,3,11/2,6)(y+z)/2=(5/2,3,11/2,6) and (y+z)/2=(3,3,6,6)\lceil(y+z)/2\rceil=(3,3,6,6), where w=(3,3,6,6)w=(3,3,6,6) is not contained in T^\hat{T} since the corresponding vector Dw=(3,0,3,0)Dw=(3,0,3,0) is not in S^\hat{S}.  

Finally, we make an observation for the case of three variables.

Proposition 5.10.

(1) When n=3n=3, every M-convex set is multimodular.

(2) When n=3n=3, every M-convex function is multimodular.

Proof.

(1) Let SS be an M-convex set in 3{\mathbb{Z}}^{3}. Then SS is described (cf., (A.36)) as

S={x3x(X)ρ(X)(XN),x1+x2+x3=ρ(N)},S=\{x\in{\mathbb{Z}}^{3}\mid x(X)\leq\rho(X)\ (X\subset N),\ x_{1}+x_{2}+x_{3}=\rho(N)\},

where N={1,2,3}N=\{1,2,3\}. Among the subsets of NN, X={1,3}X=\{1,3\} is the only non-consecutive subset. With the use of the equality constraint x1+x2+x3=ρ(N)x_{1}+x_{2}+x_{3}=\rho(N), the corresponding inequality x1+x3ρ({1,3})x_{1}+x_{3}\leq\rho(\{1,3\}) can be rewritten as x2ρ(N)ρ({1,3})x_{2}\geq\rho(N)-\rho(\{1,3\}), which is an inequality constraint for a consecutive subset {2}\{2\}. Then SS is multimodular by Theorem 5.4.

(2) By Theorem 5.3, the statement (2) for functions follows from (1) for sets. ∎

5.4.2 Proof

This section is devoted to the proof of Theorem 5.9.

For the proof of the if-part, suppose that SS is described as (5.16) with rr satisfying (5.13)–(5.15), where SS is nonempty since 𝟎S{\bf 0}\in S. Then SS is multimodular by Theorem 5.4. To discuss M-convexity, we define a set function ρ\rho from the given rr. We represent a subset XX of NN as a union of disjoint consecutive subsets:

X=I(a1,b1)I(a2,b2)I(am,bm)X=I(a_{1},b_{1})\cup I(a_{2},b_{2})\cup\cdots\cup I(a_{m},b_{m})

with a1b1<a2b2<<ambma_{1}\leq b_{1}<a_{2}\leq b_{2}<\cdots<a_{m}\leq b_{m}, and define ρ(X)\rho(X) by

ρ(X)=r(a1,b1)+r(a2,b2)++r(am,bm),\rho(X)=r(a_{1},b_{1})+r(a_{2},b_{2})+\cdots+r(a_{m},b_{m}), (5.18)

where ρ()=0\rho(\emptyset)=0. Then we have

ρ(X)=ρ(I(a1,b1))+ρ(I(a2,b2))++ρ(I(am,bm)).\rho(X)=\rho(I(a_{1},b_{1}))+\rho(I(a_{2},b_{2}))+\cdots+\rho(I(a_{m},b_{m})). (5.19)

For any xSx\in S and XNX\subseteq N, we have

x(X)=j=1mx(I(aj,bj))j=1mr(aj,bj)=ρ(X),x(X)=\sum_{j=1}^{m}x(I(a_{j},b_{j}))\leq\sum_{j=1}^{m}r(a_{j},b_{j})=\rho(X),

which shows that inequalities x(X)ρ(X)x(X)\leq\rho(X) for non-consecutive subsets XX may be added to the expression in (5.16). Therefore we have

S={x+nx(X)ρ(X)(XN)}.S=\{x\in{\mathbb{Z}}_{+}^{n}\mid x(X)\leq\rho(X)\ (X\subseteq N)\}.

By the construction, ρ\rho is monotone (nondecreasing) with ρ()=0\rho(\emptyset)=0. Moreover, ρ\rho is submodular, which is shown in Lemma 5.11 at the end of this section. Therefore, by (A.32), SS is an M-convex set. This completes the proof of the if-part of Theorem 5.9.

For the proof of the only-if part, let SS be a bounded M-convex and multimodular set satisfying 𝟎S+n{\bf 0}\in S\subseteq{\mathbb{Z}}_{+}^{n}. By (A.32) for an M-convex set, we have

S={x+nx(X)ρ(X)(XN)}S=\{x\in{\mathbb{Z}}_{+}^{n}\mid x(X)\leq\rho(X)\ (X\subseteq N)\}

with a nondecreasing integer-valued submodular function ρ:2N\rho:2^{N}\to{\mathbb{Z}} with ρ()=0\rho(\emptyset)=0, where we may assume that ρ\rho is tight in the sense of ρ(X)=max{x(X)xS}\rho(X)=\max\{x(X)\mid x\in S\} (XN)(X\subseteq N). Since SS is multimodular, every inequality x(X)ρ(X)x(X)\leq\rho(X) for a non-consecutive subset XX must be redundant by Theorem 5.4. Hence we have

S={x+nx(I)ρ(I)(I: consecutive subset of N)}.S=\{x\in{\mathbb{Z}}_{+}^{n}\mid x(I)\leq\rho(I)\ \ (\mbox{\rm$I$: consecutive subset of $N$})\}.

This coincides with the expression (5.16) for r(a,b)r(a,b) defined by r(a,b)=ρ(I(a,b))r(a,b)=\rho(I(a,b)). Note that r(a,b)r(a,b) satisfies (5.13)–(5.15) by the properties (A.33)–(A.35) of ρ\rho. This completes the proof of the only-if-part of Theorem 5.9.

Finally we show the submodularity of ρ\rho in (5.18).

Lemma 5.11.

ρ\rho in (5.18) is submodular.

Proof.

For any subset ZZ of NN, we consider submodularity of ρ\rho restricted to subsets of ZZ:

ρ(X)+ρ(Y)ρ(XY)+ρ(XY)(X,YZ).\rho(X)+\rho(Y)\geq\rho(X\cup Y)+\rho(X\cap Y)\qquad(X,Y\subseteq Z). (5.20)

We prove this by induction on α=|Z|\alpha=|Z|.

Obviously, (5.20) is true when |Z|=1|Z|=1. When |Z|=2|Z|=2, we have two cases, depending on whether Z={a,b}Z=\{a,b\} is consecutive or not. If a+1=ba+1=b, we may assume X={a}X=\{a\} and Y={a+1}Y=\{a+1\}, for which (5.20) follows from (5.15) since ρ(X)=r(a,a)\rho(X)=r(a,a), ρ(Y)=r(a+1,a+1)\rho(Y)=r(a+1,a+1), ρ(XY)=r(a,a+1)\rho(X\cup Y)=r(a,a+1), and ρ(XY)=0\rho(X\cap Y)=0. If a+1<ba+1<b, we may assume X={a}X=\{a\} and Y={b}Y=\{b\}, for which (5.20) holds with equality by (5.19) and ρ(XY)=0\rho(X\cap Y)=0.

Now assume α=|Z|3\alpha=|Z|\geq 3. To prove (5.20) under the induction hypothesis, it suffices to show

ρ(Z)+ρ(Z{p,q})ρ(Zp)ρ(Zq)0(p<q;p,qZ),\rho(Z)+\rho(Z-\{p,q\})-\rho(Z-p)-\rho(Z-q)\leq 0\qquad(p<q;p,q\in Z), (5.21)

where ρ(Zp)\rho(Z-p) means ρ(Z{p})\rho(Z\setminus\{p\}), etc. We divide the subsequent argument into three cases. Case (a): ZZ is a consecutive set; Case (b): Z=I1I2ImZ=I_{1}\cup I_{2}\cup\cdots\cup I_{m} with consecutive sets I1,I2,,ImI_{1},I_{2},\ldots,I_{m} (m2m\geq 2), and p,qIkp,q\in I_{k} for some kk; Case (c): Z=I1I2ImZ=I_{1}\cup I_{2}\cup\cdots\cup I_{m} as in (b), but pIkp\in I_{k} and qIlq\in I_{l} for some klk\neq l. It turns out that Case (a) is the essential case.

Case (a): Let Z=I(a,b)Z=I(a,b). We have ap<qba\leq p<q\leq b. Define

I={iZi<p},J={iZp<i<q},K={iZi>q}.I=\{i\in Z\mid i<p\},\quad J=\{i\in Z\mid p<i<q\},\quad K=\{i\in Z\mid i>q\}.

We have

ρ(Z)ρ(Za)+ρ(Zb)ρ(Z{a,b})\rho(Z)\leq\rho(Z-a)+\rho(Z-b)-\rho(Z-\{a,b\})

by (5.15), whereas (5.19) gives

ρ(Z{p,q})\displaystyle\rho(Z-\{p,q\}) =ρ(I)+ρ(J)+ρ(K),\displaystyle=\rho(I)+\rho(J)+\rho(K),
ρ(Zp)\displaystyle\rho(Z-p) =ρ(I)+ρ(JqK),\displaystyle=\rho(I)+\rho(JqK),
ρ(Zq)\displaystyle\rho(Z-q) =ρ(IpJ)+ρ(K),\displaystyle=\rho(IpJ)+\rho(K),

where ρ(JqK)\rho(JqK) means ρ(J{q}K)\rho(J\cup\{q\}\cup K), etc. Substituting these into the left-hand side of (5.21), we obtain

LHS of (5.21)
ρ(Za)+ρ(Zb)ρ(Z{a,b})+ρ(J)ρ(JqK)ρ(IpJ)\displaystyle\leq\rho(Z-a)+\rho(Z-b)-\rho(Z-\{a,b\})+\rho(J)-\rho(JqK)-\rho(IpJ)
=[ρ(Za)+ρ(JqKb)ρ(Z{a,b})ρ(JqK)]\displaystyle=[\rho(Z-a)+\rho(JqK-b)-\rho(Z-\{a,b\})-\rho(JqK)]
+[ρ(Zb)+ρ(J)ρ(IpJ)ρ(JqKb)]\displaystyle\phantom{=}\ {}+[\rho(Z-b)+\rho(J)-\rho(IpJ)-\rho(JqK-b)]
=[ρ(XY)+ρ(XY)ρ(X)ρ(Y)]\displaystyle=[\rho(X^{\prime}\cup Y^{\prime})+\rho(X^{\prime}\cap Y^{\prime})-\rho(X^{\prime})-\rho(Y^{\prime})]
+[ρ(X′′Y′′)+ρ(X′′Y′′)ρ(X′′)ρ(Y′′)],\displaystyle\phantom{=}\ {}{}+[\rho(X^{\prime\prime}\cup Y^{\prime\prime})+\rho(X^{\prime\prime}\cap Y^{\prime\prime})-\rho(X^{\prime\prime})-\rho(Y^{\prime\prime})], (5.22)

where X:=Z{a,b}X^{\prime}:=Z-\{a,b\} and Y:=JqKY^{\prime}:=JqK, for which Za=XYZ-a=X^{\prime}\cup Y^{\prime} and JqKb=XYJqK-b=X^{\prime}\cap Y^{\prime}, and X′′:=IpJX^{\prime\prime}:=IpJ and Y′′:=JqKbY^{\prime\prime}:=JqK-b (=XY)(=X^{\prime}\cap Y^{\prime}), for which Zb=X′′Y′′Z-b=X^{\prime\prime}\cup Y^{\prime\prime} and J=X′′Y′′J=X^{\prime\prime}\cap Y^{\prime\prime}. Since |Za|=|Zb|=α1|Z-a|=|Z-b|=\alpha-1, (5.22) is non-positive (0\leq 0) by the induction hypothesis (5.20) for α1\alpha-1.

Case (b): By (5.19) we have

ρ(Z)=ρ(Ik)+jkρ(Ij),ρ(Z{p,q})=ρ(Ik{p,q})+jkρ(Ij),\displaystyle\rho(Z)=\rho(I_{k})+\sum_{j\neq k}\rho(I_{j}),\quad\rho(Z-\{p,q\})=\rho(I_{k}-\{p,q\})+\sum_{j\neq k}\rho(I_{j}),
ρ(Zp)=ρ(Ikp)+jkρ(Ij),ρ(Zq)=ρ(Ikq)+jkρ(Ij).\displaystyle\rho(Z-p)=\rho(I_{k}-p)+\sum_{j\neq k}\rho(I_{j}),\quad\rho(Z-q)=\rho(I_{k}-q)+\sum_{j\neq k}\rho(I_{j}).

Therefore,

LHS of (5.21)=ρ(Ik)+ρ(Ik{p,q})ρ(Ikp)ρ(Ikq),\mbox{LHS of \eqref{sbmZloc}}=\rho(I_{k})+\rho(I_{k}-\{p,q\})-\rho(I_{k}-p)-\rho(I_{k}-q),

which is non-positive (0\leq 0) by the induction hypothesis (5.20) since |Ik|α1|I_{k}|\leq\alpha-1.

Case (c): By (5.19) we have

ρ(Z)=ρ(Ik)+ρ(Il)+jk,lρ(Ij),ρ(Z{p,q})=ρ(Ikp)+ρ(Ilq)+jk,lρ(Ij),\displaystyle\rho(Z)=\rho(I_{k})+\rho(I_{l})+\sum_{j\neq k,l}\rho(I_{j}),\quad\rho(Z-\{p,q\})=\rho(I_{k}-p)+\rho(I_{l}-q)+\sum_{j\neq k,l}\rho(I_{j}),
ρ(Zp)=ρ(Ikp)+ρ(Il)+jk,lρ(Ij),ρ(Zq)=ρ(Ik)+ρ(Ilq)+jk,lρ(Ij).\displaystyle\rho(Z-p)=\rho(I_{k}-p)+\rho(I_{l})+\sum_{j\neq k,l}\rho(I_{j}),\quad\rho(Z-q)=\rho(I_{k})+\rho(I_{l}-q)+\sum_{j\neq k,l}\rho(I_{j}).

Therefore, LHS of (5.21)=0\mbox{LHS of \eqref{sbmZloc}}=0.

Thus we have shown (5.21) in all cases (a), (b), and (c). Therefore, (5.20) holds when |Z|=α|Z|=\alpha. ∎

Appendix A Definitions of Discrete Convex Functions

This section offers definitions of various concepts of discrete convex functions such as integrally convex, L-convex, and M-convex functions. The definition of multimodular functions is given in Section 5.1. We consider functions defined on integer lattice points, f:n{+}f:{\mathbb{Z}}^{n}\to{\mathbb{R}}\cup\{+\infty\}, where the function may possibly take ++\infty but it is assumed that the effective domain, domf={xf(x)<+}{\rm dom\,}f=\{x\mid f(x)<+\infty\}, is nonempty.

A.1 Separable convexity

For integer vectors a({})na\in({\mathbb{Z}}\cup\{-\infty\})^{n} and b({+})nb\in({\mathbb{Z}}\cup\{+\infty\})^{n} with aba\leq b, [a,b][a,b]_{{\mathbb{Z}}} denotes the box of integers (discrete rectangle, integer interval) between aa and bb, i.e.,

[a,b]={xnaixibi(i=1,2,,n)}.[a,b]_{{\mathbb{Z}}}=\{x\in{\mathbb{Z}}^{n}\mid a_{i}\leq x_{i}\leq b_{i}\ (i=1,2,\ldots,n)\}. (A.1)

A function f:n{+}f:{\mathbb{Z}}^{n}\to{\mathbb{R}}\cup\{+\infty\} in x=(x1,x2,,xn)nx=(x_{1},x_{2},\ldots,x_{n})\in{\mathbb{Z}}^{n} is called separable convex if it can be represented as

f(x)=φ1(x1)+φ2(x2)++φn(xn)f(x)=\varphi_{1}(x_{1})+\varphi_{2}(x_{2})+\cdots+\varphi_{n}(x_{n}) (A.2)

with univariate functions φi:{+}\varphi_{i}:{\mathbb{Z}}\to{\mathbb{R}}\cup\{+\infty\} satisfying

φi(t1)+φi(t+1)2φi(t)(t),\varphi_{i}(t-1)+\varphi_{i}(t+1)\geq 2\varphi_{i}(t)\qquad(t\in{\mathbb{Z}}), (A.3)

where domφi{\rm dom\,}\varphi_{i} is an interval of integers.

A.2 Integral convexity

For xnx\in{\mathbb{R}}^{n} the integral neighborhood of xx is defined as

N(x)={zn|xizi|<1(i=1,2,,n)}.N(x)=\{z\in{\mathbb{Z}}^{n}\mid|x_{i}-z_{i}|<1\ (i=1,2,\ldots,n)\}. (A.4)

It is noted that strict inequality “ << ” is used in this definition and hence N(x)N(x) admits an alternative expression

N(x)={znxizixi(i=1,2,,n)}.N(x)=\{z\in{\mathbb{Z}}^{n}\mid\lfloor x_{i}\rfloor\leq z_{i}\leq\lceil x_{i}\rceil\ \ (i=1,2,\ldots,n)\}. (A.5)

For a set SnS\subseteq{\mathbb{Z}}^{n} and xnx\in{\mathbb{R}}^{n} we call the convex hull of SN(x)S\cap N(x) the local convex hull of SS at xx. A nonempty set SnS\subseteq{\mathbb{Z}}^{n} is said to be integrally convex if the union of the local convex hulls SN(x)¯\overline{S\cap N(x)} over xnx\in{\mathbb{R}}^{n} is convex [16]. This is equivalent to saying that, for any xnx\in{\mathbb{R}}^{n}, xS¯x\in\overline{S} implies xSN(x)¯x\in\overline{S\cap N(x)}.

For a function f:n{+}f:{\mathbb{Z}}^{n}\to{\mathbb{R}}\cup\{+\infty\} the local convex extension f~:n{+}\tilde{f}:{\mathbb{R}}^{n}\to{\mathbb{R}}\cup\{+\infty\} of ff is defined as the union of all convex envelopes of ff on N(x)N(x). That is,

f~(x)=min{yN(x)λyf(y)yN(x)λyy=x,(λy)Λ(x)}(xn),\tilde{f}(x)=\min\{\sum_{y\in N(x)}\lambda_{y}f(y)\mid\sum_{y\in N(x)}\lambda_{y}y=x,\ (\lambda_{y})\in\Lambda(x)\}\quad(x\in{\mathbb{R}}^{n}), (A.6)

where Λ(x)\Lambda(x) denotes the set of coefficients for convex combinations indexed by N(x)N(x):

Λ(x)={(λyyN(x))yN(x)λy=1,λy0for all yN(x)}.\Lambda(x)=\{(\lambda_{y}\mid y\in N(x))\mid\sum_{y\in N(x)}\lambda_{y}=1,\lambda_{y}\geq 0\ \ \mbox{for all }\ y\in N(x)\}.

If f~\tilde{f} is convex on n{\mathbb{R}}^{n}, then ff is said to be integrally convex [5]. The effective domain of an integrally convex function is an integrally convex set. A set SnS\subseteq{\mathbb{Z}}^{n} is integrally convex if and only if its indicator function δS:n{0,+}\delta_{S}:{\mathbb{Z}}^{n}\to\{0,+\infty\} is an integrally convex function.

Integral convexity of a function can be characterized by a local condition under the assumption that the effective domain is an integrally convex set.

Theorem A.1 ([5, 12]).

Let f:n{+}f:{\mathbb{Z}}^{n}\to{\mathbb{R}}\cup\{+\infty\} be a function with an integrally convex effective domain. Then the following properties are equivalent:

(a) ff is integrally convex.

(b) For every x,ynx,y\in{\mathbb{Z}}^{n} with xy=2\|x-y\|_{\infty}=2 we have

f~(x+y2)12(f(x)+f(y)).\tilde{f}\,\bigg{(}\frac{x+y}{2}\bigg{)}\leq\frac{1}{2}(f(x)+f(y)). (A.7)

 

The reader is referred to [5, 10, 12, 26, 27, 28], [16, Section 3.4], and [20, Section 13] for more about integral convexity,

A.3 L-convexity

L- and L-convex functions form major classes of discrete convex functions [16, Chapter 7]. The concept of L-convex functions was introduced in [7] as an equivalent variant of L-convex functions introduced earlier in [15].

A.3.1 L-convex functions

A nonempty set SnS\subseteq{\mathbb{Z}}^{n} is called L-convex if

x,ySx+y2,x+y2S,x,y\in S\ \Longrightarrow\ \left\lceil\frac{x+y}{2}\right\rceil,\left\lfloor\frac{x+y}{2}\right\rfloor\in S, (A.8)

where, for tt\in{\mathbb{R}} in general, t\left\lceil t\right\rceil denotes the smallest integer not smaller than tt (rounding-up to the nearest integer) and t\left\lfloor t\right\rfloor the largest integer not larger than tt (rounding-down to the nearest integer), and this operation is extended to a vector by componentwise applications. The property (A.8) is called discrete midpoint convexity (in the original sense of the word).

A function f:n{+}f:{\mathbb{Z}}^{n}\to{\mathbb{R}}\cup\{+\infty\} with domf{\rm dom\,}f\not=\emptyset is said to be L-convex if it satisfies a quantitative version of discrete midpoint convexity, i.e., if

f(x)+f(y)f(x+y2)+f(x+y2)f(x)+f(y)\geq f\left(\left\lceil\frac{x+y}{2}\right\rceil\right)+f\left(\left\lfloor\frac{x+y}{2}\right\rfloor\right) (A.9)

holds for all x,ynx,y\in{\mathbb{Z}}^{n}. The effective domain of an L-convex function is an L-convex set. A set SS is L-convex if and only if its indicator function δS\delta_{S} is an L-convex function.

It is known [16, Section 7.1] that L{\rm L}^{\natural}-convex functions can be characterized by several different conditions, stated in Theorem A.2 below. The condition (b) in Theorem A.2 imposes discrete midpoint convexity (A.9) for all points x,yx,y at \ell_{\infty}-distance 1 or 2. The condition (c) refers to submodularity, which means that

f(x)+f(y)f(xy)+f(xy)f(x)+f(y)\geq f(x\vee y)+f(x\wedge y) (A.10)

holds for all x,ynx,y\in{\mathbb{Z}}^{n}, where xyx\vee y and xyx\wedge y denote, respectively, the vectors of componentwise maximum and minimum of xx and yy; see (2.1). The condition (d) refers to a generalization of submodularity called translation-submodularity, which means that

f(x)+f(y)f((xμ𝟏)y)+f(x(y+μ𝟏))f(x)+f(y)\geq f((x-\mu{\bf 1})\vee y)+f(x\wedge(y+\mu{\bf 1})) (A.11)

holds for all x,ynx,y\in{\mathbb{Z}}^{n} and nonnegative integers μ\mu, where 𝟏=(1,1,,1)\bm{1}=(1,1,\ldots,1). The condition (e) refers to the condition111This condition (A.12) is labeled as (L-APR[{\mathbb{Z}}]) in [16, Section 7.2]. that, for any x,ynx,y\in{\mathbb{Z}}^{n} with supp+(xy){\rm supp}^{+}(x-y)\not=\emptyset, the inequality

f(x)+f(y)f(xeA)+f(y+eA)f(x)+f(y)\geq f(x-e^{A})+f(y+e^{A}) (A.12)

holds with A=argmaxi{xiyi}\displaystyle A=\arg\max_{i}\{x_{i}-y_{i}\}, where supp+(xy)={ixi>yi}{\rm supp}^{+}(x-y)=\{i\mid x_{i}>y_{i}\} and eAe^{A} denotes the characteristic vector of AA. The condition (f) refers to submodularity of the function

f~(x0,x)=f(xx0𝟏)(x0,xn)\tilde{f}(x_{0},x)=f(x-x_{0}{\bf 1})\qquad(x_{0}\in{\mathbb{Z}},x\in{\mathbb{Z}}^{n}) (A.13)

in n+1n+1 variables associated with the given function ff.

Theorem A.2 ([21, Theorem 2.2]).

For f:n{+}f:{\mathbb{Z}}^{n}\to{\mathbb{R}}\cup\{+\infty\}, the following conditions, (a) to (f), are equivalent:

(a) ff is an L-convex function, that is, it satisfies discrete midpoint inequality (A.9) for all x,ynx,y\in{\mathbb{Z}}^{n}.

(b) domf{\rm dom\,}f is an L-convex set, and ff satisfies discrete midpoint inequality (A.9) for all x,ynx,y\in{\mathbb{Z}}^{n} with xy2\|x-y\|_{\infty}\leq 2.

(c) ff is integrally convex and submodular (A.10).

(d) ff satisfies translation-submodularity (A.11) for all nonnegative μ\mu\in{\mathbb{Z}}.

(e) ff satisfies the condition (A.12).

(f) f~\tilde{f} in (A.13) is submodular (A.10).  

For a set SnS\subseteq{\mathbb{Z}}^{n} we consider conditions

x,yS\displaystyle x,y\in S xy,xyS,\displaystyle\ \Longrightarrow\ x\vee y,\ x\wedge y\in S, (A.14)
x,yS\displaystyle x,y\in S (xμ𝟏)y,x(y+μ𝟏)S,\displaystyle\ \Longrightarrow\ (x-\mu{\bf 1})\vee y,\ x\wedge(y+\mu{\bf 1})\in S, (A.15)
x,yS,supp+(xy)\displaystyle x,y\in S,\ {\rm supp}^{+}(x-y)\not=\emptyset xeA,y+eAS for A=argmaxi{xiyi}.\displaystyle\ \Longrightarrow\ x-e^{A},\ y+e^{A}\ \in S\mbox{ for }A=\arg\max_{i}\{x_{i}-y_{i}\}. (A.16)

The first condition (A.14), meaning that SS is a sublattice of n{\mathbb{Z}}^{n}, corresponds to submodularity (A.10), whereas (A.15) and (A.16) correspond to (A.11) and (A.12), respectively.

Theorem A.3 ([21, Proposition 2.3]).

For a nonempty set SnS\subseteq{\mathbb{Z}}^{n}, the following conditions, (a) to (d), are equivalent:

(a) SS is an L-convex set, that is, it satisfies (A.8).

(b) SS is an integrally convex set that satisfies (A.14).

(c) SS satisfies (A.15) for all nonnegative μ\mu\in{\mathbb{Z}}.

(d) SS satisfies (A.16).  

The following polyhedral description of an L-convex set is known [16, Section 5.5].

Theorem A.4.

A set SnS\subseteq{\mathbb{Z}}^{n} is L-convex if and only if S=S¯nS=\overline{S}\cap{\mathbb{Z}}^{n} and its convex hull S¯\overline{S} can be represented as

S¯={xnαixiβi(iN),xjxiγij(i,jN)}\overline{S}=\{x\in{\mathbb{R}}^{n}\mid\alpha_{i}\leq x_{i}\leq\beta_{i}\ \ (i\in N),\ x_{j}-x_{i}\leq\gamma_{ij}\ \ (i,j\in N)\} (A.17)

for some αi{}\alpha_{i}\in{\mathbb{Z}}\cup\{-\infty\}, βi{+}\beta_{i}\in{\mathbb{Z}}\cup\{+\infty\}, and γij{+}\gamma_{ij}\in{\mathbb{Z}}\cup\{+\infty\} (i,jN)(i,j\in N) with γii=0\gamma_{ii}=0 (iN)(i\in N) such that γ~ij\tilde{\gamma}_{ij} defined for i,jN{0}i,j\in N\cup\{0\} by

γ~00=0,γ~ij=γij,γ~i0=αi,γ~0j=βj(i,jN)\tilde{\gamma}_{00}=0,\qquad\tilde{\gamma}_{ij}=\gamma_{ij},\quad\tilde{\gamma}_{i0}=-\alpha_{i},\quad\tilde{\gamma}_{0j}=\beta_{j}\qquad(i,j\in N) (A.18)

satisfies the triangle inequality:

γ~ij+γ~jkγ~ik(i,j,kN{0}).\tilde{\gamma}_{ij}+\tilde{\gamma}_{jk}\geq\tilde{\gamma}_{ik}\qquad(i,j,k\in N\cup\{0\}). (A.19)

Such αi\alpha_{i}, βi\beta_{i}, γij\gamma_{ij} are determined from SS by

αi=min{xixS},βi=max{xixS}(iN),\displaystyle\alpha_{i}=\min\{x_{i}\mid x\in S\},\quad\beta_{i}=\max\{x_{i}\mid x\in S\}\qquad(i\in N), (A.20)
γij=max{xjxixS}(i,jN).\displaystyle\gamma_{ij}=\max\{x_{j}-x_{i}\mid x\in S\}\qquad(i,j\in N). (A.21)

 

Remark A.1.

Here are additional remarks about the polyhedral descriptions in Theorem A.4.

  • The correspondence between SS and integer-valued (α,β,γ)(\alpha,\beta,\gamma) with (A.19) is bijective (one-to-one and onto) through (A.17), (A.20), and (A.21).

  • For any integer-valued (α,β,γ)(\alpha,\beta,\gamma) (independent of the triangle inequality), SS in (A.17) is an L-convex set if SS\neq\emptyset. We have SS\neq\emptyset if and only if there exists no negative cycle with respect to γ~\tilde{\gamma}, where a negative cycle means a set of indices i1,i2,,imi_{1},i_{2},\ldots,i_{m} such that γ~i1i2+γ~i2i3++γ~im1im+γ~imi1<0\tilde{\gamma}_{i_{1}i_{2}}+\tilde{\gamma}_{i_{2}i_{3}}+\cdots+\tilde{\gamma}_{i_{m-1}i_{m}}+\tilde{\gamma}_{i_{m}i_{1}}<0.  

A.3.2 L-convex functions

A function f(x1,x2,,xn)f(x_{1},x_{2},\ldots,x_{n}) with domf{\rm dom\,}f\not=\emptyset is called L-convex if it is submodular (A.10) and there exists rr\in{\mathbb{R}} such that

f(x+μ𝟏)=f(x)+μrf(x+\mu{\bf 1})=f(x)+\mu r (A.22)

for all xnx\in{\mathbb{Z}}^{n} and μ\mu\in{\mathbb{Z}}. If ff is L-convex, the function g(x2,,xn):=f(0,x2,,xn)g(x_{2},\ldots,x_{n}):=f(0,x_{2},\ldots,x_{n}) is an L-convex function, and any L-convex function arises in this way. The function f~\tilde{f} in (A.13) derived from an L-convex function ff is an L-convex function with f~(x0+μ,x+μ𝟏)=f~(x0,x)\tilde{f}(x_{0}+\mu,x+\mu{\bf 1})=\tilde{f}(x_{0},x), and we have f(x)=f~(0,x)f(x)=\tilde{f}(0,x).

Theorem A.5 ([21, Theorem 2.4]).

For f:n{+}f:{\mathbb{Z}}^{n}\to{\mathbb{R}}\cup\{+\infty\}, the following conditions, (a) to (c), are equivalent:

(a) ff is an L-convex function, that is, it satisfies (A.10) and (A.22) for some rr\in{\mathbb{R}}.

(b) ff is an L-convex function that satisfies (A.22) for some rr\in{\mathbb{R}}.

(c) ff satisfies translation-submodularity (A.11) for all μ\mu\in{\mathbb{Z}} (including μ<0\mu<0).  

A nonempty set SS is called L-convex if its indicator function δS\delta_{S} is an L-convex function. This means that SS is L-convex if and only if it satisfies (A.14) and

xS,μx+μ𝟏S.x\in S,\ \mu\in{\mathbb{Z}}\ \Longrightarrow\ \ x+\mu{\bf 1}\in S. (A.23)

This property is sometimes called the translation invariance in the direction of 𝟏{\bf 1}. The effective domain of an L-convex function is an L-convex set.

The following theorem gives equivalent conditions for a set to be L-convex.

Theorem A.6 ([21, Proposition 2.5]).

For a nonempty set SnS\subseteq{\mathbb{Z}}^{n}, the following conditions, (a) to (c), are equivalent:

(a) SS is an L-convex set, that is, it satisfies (A.14) and (A.23).

(b) SS is an L-convex set that satisfies (A.23).

(c) SS satisfies (A.15) for all μ\mu\in{\mathbb{Z}} (including μ<0\mu<0).  

The following theorem reduces the concept of L-convex functions to that of L-convex sets.

Theorem A.7 ([6, Section 16.2], [16, Theorem 7.17]).

Under the assumption (2.21), a function f:n{+}f:{\mathbb{Z}}^{n}\to{\mathbb{R}}\cup\{+\infty\} is L-convex (resp., L-convex) if and only if, for any vector pnp\in{\mathbb{R}}^{n}, argminf[p]\arg\min f[-p] is an L-convex (resp., L-convex) set or an empty set.  

A.3.3 Discrete midpoint convex functions

A nonempty set SnS\subseteq{\mathbb{Z}}^{n} is said to be discrete midpoint convex if

x,yS,xy2x+y2,x+y2S.x,y\in S,\ \|x-y\|_{\infty}\geq 2\ \Longrightarrow\ \left\lceil\frac{x+y}{2}\right\rceil,\left\lfloor\frac{x+y}{2}\right\rfloor\in S. (A.24)

This condition is weaker than the defining condition (A.8) for an L-convex set, and hence every L-convex set is a discrete midpoint convex set.

A function f:n{+}f:{\mathbb{Z}}^{n}\to{\mathbb{R}}\cup\{+\infty\} is called globally discrete midpoint convex [13] if the discrete midpoint convexity

f(x)+f(y)f(x+y2)+f(x+y2)f(x)+f(y)\geq f\left(\left\lceil\frac{x+y}{2}\right\rceil\right)+f\left(\left\lfloor\frac{x+y}{2}\right\rfloor\right) (A.25)

is satisfied by every pair (x,y)n×n(x,y)\in{\mathbb{Z}}^{n}\times{\mathbb{Z}}^{n} with xy2\|x-y\|_{\infty}\geq 2. The effective domain of a globally discrete midpoint convex function is necessarily a discrete midpoint convex set. Obviously, every L-convex function is globally discrete midpoint convex. We sometimes abbreviate “discrete midpoint convex(ity)” to “d.m.c.”

We define notation μ~(a,b)\tilde{\mu}(a,b) for a pair of integers (a,b)(a,b) by

μ~(a,b)={(a+b)/2(ab),(a+b)/2(ab),\tilde{\mu}(a,b)=\begin{cases}\left\lceil(a+b)/{2}\right\rceil&\ (a\geq b),\\ \left\lfloor(a+b)/{2}\right\rfloor&\ (a\leq b),\end{cases} (A.26)

and extend this notation to a pair of integer vectors (x,y)(x,y) as

μ~(x,y)=(μ~(x1,y1),μ~(x2,y2),,μ~(xn,yn)).\tilde{\mu}(x,y)=\left(\tilde{\mu}(x_{1},y_{1}),\tilde{\mu}(x_{2},y_{2}),\ldots,\tilde{\mu}(x_{n},y_{n})\right). (A.27)

A function f:n{+}f:{\mathbb{Z}}^{n}\to{\mathbb{R}}\cup\{+\infty\} is called directed discrete midpoint convex [35] if the inequality

f(x)+f(y)f(μ~(x,y))+f(μ~(y,x))f(x)+f(y)\geq f(\tilde{\mu}(x,y))+f(\tilde{\mu}(y,x)) (A.28)

is satisfied by every pair (x,y)n×n(x,y)\in{\mathbb{Z}}^{n}\times{\mathbb{Z}}^{n}. A nonempty set SS is called directed discrete midpoint convex if its indicator function δS\delta_{S} is a directed discrete midpoint convex function. L-convex functions are directed discrete midpoint convex functions, and L-convex sets are directed discrete midpoint convex sets (see (2.18) in Proposition 2.1).

A.3.4 L2-convex functions

A nonempty set SS n\subseteq{\mathbb{Z}}^{n} is called L2-convex (resp., L2{}^{\natural}_{2}-convex) if it can be represented as the Minkowski sum (vector addition) of two L-convex (resp., L-convex) sets [16, Section 5.5]. That is,

S={x+yxS1,yS2},S=\{x+y\mid x\in S_{1},y\in S_{2}\},

where S1S_{1} and S2S_{2} are L-convex (resp., L-convex) sets. An L2{}^{\natural}_{2}-convex set is the intersection of an L2-convex set with a coordinate hyperplane. That is, for L-convex sets, the operations of the Minkowski addition and the restriction to a coordinate hyperplane commute with each other. This fact is stated in [16, p.129] without a proof, and a proof can be found in [11, Section 2.3]. The polyhedral description of L2-convex and L2{}^{\natural}_{2}-convex sets is given in [11]. An L2-convex set has the property of translation invariance in (A.23).

A function f:n{+}f:{\mathbb{Z}}^{n}\to{\mathbb{R}}\cup\{+\infty\} is said to be L2-convex if it can be represented as the (integral) infimal convolution f1f2f_{1}\Box f_{2} of two L-convex functions f1f_{1} and f2f_{2}, that is, if

f(x)=(f1f2)(x)=inf{f1(y)+f2(z)x=y+z;y,zn}(xn).f(x)=(f_{1}\Box f_{2})(x)=\inf\{f_{1}(y)+f_{2}(z)\mid x=y+z;\ y,z\in{\mathbb{Z}}^{n}\}\qquad(x\in{\mathbb{Z}}^{n}).

It is known [34] (see also [16, Note 8.37]) that the infimum is always attained as long as it is finite. The (integer) infimal convolution of two L-convex functions is called an L2{}^{\natural}_{2}-convex function. A nonempty set SS is L2-convex (resp., L2{}^{\natural}_{2}-convex) if and only if its indicator function δS\delta_{S} is L2-convex (resp., L2{}^{\natural}_{2}-convex).

A.4 M-convexity

M- and M-convex functions form major classes of discrete convex functions [16, Chapter 6]. The concept of M-convex functions was introduced in [22] as an equivalent variant of M-convex functions introduced earlier in [14].

A.4.1 M-convex functions

For two vectors x,ynx,y\in{\mathbb{Z}}^{n} we use notations

supp+(xy)={ixi>yi},supp(xy)={ixi<yi}.{\rm supp}^{+}(x-y)=\{i\mid x_{i}>y_{i}\},\qquad{\rm supp}^{-}(x-y)=\{i\mid x_{i}<y_{i}\}.

We say that a function f:n{+}f:{\mathbb{Z}}^{n}\to{\mathbb{R}}\cup\{+\infty\} with domf{\rm dom\,}f\not=\emptyset is M-convex, if, for any x,ynx,y\in{\mathbb{Z}}^{n} and isupp+(xy)i\in{\rm supp}^{+}(x-y), we have (i)

f(x)+f(y)f(xei)+f(y+ei)f(x)+f(y)\geq f(x-e^{i})+f(y+e^{i}) (A.29)

or (ii) there exists some jsupp(xy)j\in{\rm supp}^{-}(x-y) such that

f(x)+f(y)f(xei+ej)+f(y+eiej).f(x)+f(y)\geq f(x-e^{i}+e^{j})+f(y+e^{i}-e^{j}). (A.30)

This property is referred to as the exchange property. A more compact expression of the exchange property is as follows:

(M-EXC)

For any x,ynx,y\in{\mathbb{Z}}^{n} and isupp+(xy)i\in{\rm supp}^{+}(x-y), we have

f(x)+f(y)minjsupp(xy){0}{f(xei+ej)+f(y+eiej)},f(x)+f(y)\geq\min_{j\in{\rm supp}^{-}(x-y)\cup\{0\}}\{f(x-e^{i}+e^{j})+f(y+e^{i}-e^{j})\}, (A.31)

where e0=𝟎e^{0}={\bf 0} (zero vector). In the above statement we may change “For any x,ynx,y\in{\mathbb{Z}}^{n} ” to “For any x,ydomfx,y\in{\rm dom\,}f ” since if xdomfx\not\in{\rm dom\,}f or ydomfy\not\in{\rm dom\,}f, the inequality (A.31) trivially holds with f(x)+f(y)=+f(x)+f(y)=+\infty.

M-convex functions can be characterized by a number of different exchange properties including a local exchange property under the assumption that function ff is (effectively) defined on an M-convex set. See [32, Theorem 6.8], [16, Chapters 4 and 6], [20, Section 4], and [24] for detailed discussion about the exchange properties.

It follows from (M-EXC) that the effective domain S=domfS={\rm dom\,}f of an M-convex function ff has the following exchange property:

(B-EXC)

For any x,ySx,y\in S and isupp+(xy)i\in{\rm supp}^{+}(x-y), we have (i) xeiSx-e^{i}\in S and y+eiSy+e^{i}\in S  or  
(ii) there exists some jsupp(xy)j\in{\rm supp}^{-}(x-y) such that xei+ejSx-e^{i}+e^{j}\in S and y+eiejSy+e^{i}-e^{j}\in S.

A nonempty set SnS\subseteq{\mathbb{Z}}^{n} having this property is called an M-convex set.

An M-convex set is nothing but the set of integer points in an integral generalized polymatroid [16, Section 4.7]. In particular, an integral polymatroid can be identified with a bounded M-convex set containing 𝟎{\bf 0} and consisting of nonnegative vectors. As is well known [6, Section 2.2], the set of integer points of an integral polymatroid is described as

S={xnx𝟎,x(X)ρ(X)(XN)}S=\{x\in{\mathbb{Z}}^{n}\mid x\geq{\bf 0},\ x(X)\leq\rho(X)\ (X\subseteq N)\} (A.32)

with a nondecreasing integer-valued submodular function ρ\rho. To be more specific, the set function ρ:2N\rho:2^{N}\to{\mathbb{Z}} should satisfy

ρ()=0,\displaystyle\rho(\emptyset)=0, (A.33)
XYρ(X)ρ(Y),\displaystyle X\subseteq Y\ \Longrightarrow\ \rho(X)\leq\rho(Y), (A.34)
ρ(X)+ρ(Y)ρ(XY)+ρ(XY)(X,YN).\displaystyle\rho(X)+\rho(Y)\geq\rho(X\cup Y)+\rho(X\cap Y)\quad(X,Y\subseteq N). (A.35)

Moreover, the convex hull S¯\overline{S} of SS is described similarly as S¯={xnx𝟎,x(X)ρ(X)(XN)}\overline{S}=\{x\in{\mathbb{R}}^{n}\mid x\geq{\bf 0},\ x(X)\leq\rho(X)\ (X\subseteq N)\} and we can determine ρ\rho from SS by ρ(X)=max{x(X)xS}\rho(X)=\max\{x(X)\mid x\in S\} (XN)(X\subseteq N).

A.4.2 M-convex functions

If a set SnS\subseteq{\mathbb{Z}}^{n} lies on a hyperplane with a constant component sum (i.e., x(N)=y(N)x(N)=y(N) for all x,ySx,y\in S), the exchange property (B-EXC) takes a simpler form (without the possibility of the first case (i)):

(B-EXC)

For any x,ySx,y\in S and isupp+(xy)i\in{\rm supp}^{+}(x-y), there exists some jsupp(xy)j\in{\rm supp}^{-}(x-y) such that xei+ejSx-e^{i}+e^{j}\in S and y+eiejSy+e^{i}-e^{j}\in S.

A nonempty set SnS\subseteq{\mathbb{Z}}^{n} having this exchange property is called an M-convex set, which is an alias for the set of integer points in an integral base polyhedron. An M-convex set SS contained in the nonnegative orthant +n{\mathbb{Z}}_{+}^{n} can be described as

S={xnx(X)ρ(X)(XN),x(N)=ρ(N)}S=\{x\in{\mathbb{Z}}^{n}\mid x(X)\leq\rho(X)\ (X\subset N),\ x(N)=\rho(N)\} (A.36)

with a set function ρ:2N\rho:2^{N}\to{\mathbb{Z}} satisfying (A.33), (A.34), and (A.35).

An M-convex function whose effective domain is an M-convex set is called an M-convex function [14, 15, 16]. In other words, a function f:n{+}f:{\mathbb{Z}}^{n}\to{\mathbb{R}}\cup\{+\infty\} is M-convex if and only if it satisfies the exchange property:

(M-EXC)

For any x,ydomfx,y\in{\rm dom\,}f and isupp+(xy)i\in{\rm supp}^{+}(x-y), there exists jsupp(xy)j\in{\rm supp}^{-}(x-y) such that (A.30) holds.

M-convex functions can be characterized by a local exchange property under the assumption that function ff is (effectively) defined on an M-convex set. See [16, Section 6.2].

M-convex functions and M-convex functions are equivalent concepts, in that M-convex functions in nn variables can be obtained as projections of M-convex functions in n+1n+1 variables. More formally, let “0” denote a new element not in NN and N~={0}N={0,1,,n}\tilde{N}=\{0\}\cup N=\{0,1,\ldots,n\}. A function f:n{+}f:{\mathbb{Z}}^{n}\to{\mathbb{R}}\cup\{+\infty\} is M-convex if and only if the function f~:n+1{+}\tilde{f}:{\mathbb{Z}}^{n+1}\to{\mathbb{R}}\cup\{+\infty\} defined by

f~(x0,x)={f(x) if x0=x(N)+ otherwise(x0,xn)\tilde{f}(x_{0},x)=\left\{\begin{array}[]{ll}f(x)&\mbox{ if $x_{0}={-}x(N)$}\\ +\infty&\mbox{ otherwise}\end{array}\right.\qquad(x_{0}\in{\mathbb{Z}},x\in{\mathbb{Z}}^{n}) (A.37)

is an M-convex function.

The following theorem reduces the concept of M-convex functions to that of M-convex sets.

Theorem A.8 ([6, Section 17], [16, Theorem 6.30]).

Under the assumption (2.21), a function f:n{+}f:{\mathbb{Z}}^{n}\to{\mathbb{R}}\cup\{+\infty\} is M-convex (resp., M-convex) if and only if, for any vector pnp\in{\mathbb{R}}^{n}, argminf[p]\arg\min f[-p] is an M-convex (resp., M-convex) set or an empty set.  

A.4.3 M2-convex functions

A nonempty set SS n\subseteq{\mathbb{Z}}^{n} is called M2-convex if it can be represented as the intersection of two M-convex sets [16, Section 4.6]. Similarly, a nonempty set SS n\subseteq{\mathbb{Z}}^{n} is called M2{}^{\natural}_{2}-convex if it can be represented as the intersection of two M-convex sets [16, Section 4.7]. An M2-convex set SS lies on a hyperplane with a constant component sum (i.e., x(N)=y(N)x(N)=y(N) for all x,ySx,y\in S).

A function f:n{+}f:{\mathbb{Z}}^{n}\to{\mathbb{R}}\cup\{+\infty\} is said to be M2-convex if it can be represented as the sum of two M-convex functions f1f_{1} and f2f_{2}, that is, if

f(x)=f1(x)+f2(x)(xn).f(x)=f_{1}(x)+f_{2}(x)\qquad(x\in{\mathbb{Z}}^{n}).

The sum of two M-convex functions is called an M2{}^{\natural}_{2}-convex function. A nonempty set SS is M2-convex (resp., M2{}^{\natural}_{2}-convex) if and only if its indicator function δS\delta_{S} is M2-convex (resp., M2{}^{\natural}_{2}-convex).

Acknowledgement. This work was supported by JSPS KAKENHI Grant Numbers JP17K00037, JP20K11697, and JP21K04533. The authors thank Akihisa Tamura for careful reading of the manuscript.

References

  • [1] Altman, E., Gaujal, B., Hordijk, A.: Multimodularity, convexity, and optimization properties. Mathematics of Operations Research 25, 324–347 (2000)
  • [2] Altman, E., Gaujal, B., Hordijk, A.: Discrete-Event Control of Stochastic Networks: Multimodularity and Regularity. Lecture Notes in Mathematics 1829, Springer, Heidelberg (2003)
  • [3] Chen, X.: L-convexity and its applications in operations. Frontiers of Engineering Management 4, 283–294 (2017)
  • [4] Chen, X., Li, M.: Discrete convex analysis and its applications in operations: A survey. Management Science 30, 1904–1926 (2021)
  • [5] Favati, P., Tardella, F.: Convexity in nonlinear integer programming. Ricerca Operativa 53, 3–44 (1990)
  • [6] Fujishige, S.: Submodular Functions and Optimization, 2nd edn. Annals of Discrete Mathematics 58, Elsevier, Amsterdam (2005)
  • [7] Fujishige, S., Murota, K.: Notes on L-/M-convex functions and the separation theorems. Mathematical Programming 88, 129–146 (2000)
  • [8] Hajek, B.: Extremal splittings of point processes. Mathematics of Operations Research 10, 543–556 (1985)
  • [9] Moriguchi, S., Murota, K.: On fundamental operations for multimodular functions. Journal of the Operations Research Society of Japan 62, 53–63 (2019)
  • [10] Moriguchi, S., Murota, K.: Projection and convolution operations for integrally convex functions. Discrete Applied Mathematics 255, 283–298 (2019)
  • [11] Moriguchi, S., Murota, K.: Note on the polyhedral description of the Minkowski sum of two L-convex sets. Japan Journal of Industrial and Applied Mathematics 40, 223–263 (2023)
  • [12] Moriguchi, S., Murota, K., Tamura, A., Tardella, F.: Scaling, proximity, and optimization of integrally convex functions. Mathematical Programming 175, 119–154 (2019)
  • [13] Moriguchi, S., Murota, K., Tamura, A., Tardella, F.: Discrete midpoint convexity. Mathematics of Operations Research 45, 99–128 (2020)
  • [14] Murota, K.: Convexity and Steinitz’s exchange property. Advances in Mathematics 124, 272–311 (1996)
  • [15] Murota, K.: Discrete convex analysis. Mathematical Programming 83, 313–371 (1998)
  • [16] Murota, K.: Discrete Convex Analysis. Society for Industrial and Applied Mathematics, Philadelphia (2003)
  • [17] Murota, K.: Note on multimodularity and L-convexity. Mathematics of Operations Research 30, 658–661 (2005)
  • [18] Murota, K.: Primer of Discrete Convex Analysis—Discrete versus Continuous Optimization (in Japanese). Kyoritsu Publishing Co., Tokyo (2007)
  • [19] Murota, K.: Recent developments in discrete convex analysis. In: Cook, W., Lovász, L., Vygen, J. (eds.) Research Trends in Combinatorial Optimization, Chapter 11, pp. 219–260. Springer, Berlin (2009)
  • [20] Murota, K.: Discrete convex analysis: A tool for economics and game theory. Journal of Mechanism and Institution Design 1, 151–273 (2016)
  • [21] Murota, K.: A survey of fundamental operations on discrete convex functions of various kinds. Optimization Methods and Software 36, 472–518 (2021)
  • [22] Murota, K., Shioura, A.: M-convex function on generalized polymatroid. Mathematics of Operations Research 24, 95–105 (1999)
  • [23] Murota, K., Shioura, A.: Relationship of M-/L-convex functions with discrete convex functions by Miller and by Favati–Tardella. Discrete Applied Mathematics 115, 151–176 (2001)
  • [24] Murota, K., Shioura, A.: Simpler exchange axioms for M-concave functions on generalized polymatroids. Japan Journal of Industrial and Applied Mathematics 35, 235–259 (2018)
  • [25] Murota, K., Tamura, A.: Application of M-convex submodular flow problem to mathematical economics. Japan Journal of Industrial and Applied Mathematics 20, 257–277 (2003)
  • [26] Murota, K., Tamura, A.: Integrality of subgradients and biconjugates of integrally convex functions. Optimization Letters 14, 195–208 (2020)
  • [27] Murota, K., Tamura, A.: Discrete Fenchel duality for a pair of integrally convex and separable convex functions. Japan Journal of Industrial and Applied Mathematics 39, 599–630 (2022)
  • [28] Murota, K., Tamura, A.: Recent progress on integrally convex functions. arXiv: http://arxiv.org/abs/2211.10912 (2022)
  • [29] Schrijver, A.: Theory of Linear and Integer Programming. Wiley, New York (1986)
  • [30] Schrijver, A.: Combinatorial Optimization—Polyhedra and Efficiency. Springer, Heidelberg (2003)
  • [31] Shioura, A.: Algorithms for L-convex function minimization: Connection between discrete convex analysis and other research areas. Journal of the Operations Research Society of Japan 60, 216–243 (2017)
  • [32] Shioura, A., Tamura, A.: Gross substitutes condition and discrete concavity for multi-unit valuations: a survey. Journal of the Operations Research Society of Japan 58, 61–103 (2015)
  • [33] Simchi-Levi, D., Chen, X., Bramel, J.: The Logic of Logistics: Theory, Algorithms, and Applications for Logistics Management, 3rd ed. Springer, New York (2014)
  • [34] Tamura, A.: On convolution of L-convex functions. Optimization Methods and Software 18, 231–245 (2003)
  • [35] Tamura, A., Tsurumi, K.: Directed discrete midpoint convexity. Japan Journal of Industrial and Applied Mathematics 38, 1–37 (2021)