This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

Compression of M-convex Functions — Flag Matroids and Valuated Permutohedra

Satoru FUJISHIGE111 Research Institute for Mathematical Sciences, Kyoto University, Kyoto 606-8502, Japan. E-mail: [email protected]  and  Hiroshi HIRAI222 Department of Mathematical Informatics, Graduate School of Information Science and Technology, The University of Tokyo, Tokyo, 113-8656, Japan. E-mail: [email protected]
(May 26, 2020)
Abstract

Murota (1998) and Murota and Shioura (1999) introduced concepts of M-convex function and M-convex function as discrete convex functions, which are generalizations of valuated matroids due to Dress and Wenzel (1992). In the present paper we consider a new operation defined by a convolution of sections of an M-convex function that transforms the given M-convex function to an M-convex function, which we call a compression of an M-convex function. For the class of valuated generalized matroids, which are special M-convex functions, the compression induces a valuated permutohedron together with a decomposition of the valuated generalized matroid into flag-matroid strips, each corresponding to a maximal linearity domain of the induced valuated permutohedron. We examine the details of the structure of flag-matroid strips and the induced valuated permutohedron by means of discrete convex analysis of Murota.

Keywords: Discrete convex functions, compression, flag matroids, permutohedra
MSC: 90C27 \cdot 52B40

1 Introduction

Murota (1998) and Murota and Shioura (1999) introduced the concepts of M-convex function [12] and M-convex function [16], as discrete convex functions. Their original ideas can be traced back to Dress and Wenzel’s valuated matroids [3] introduced in 1992. See [13, 14, 15] for details about the theory of discrete convex analysis and its applications developed by Murota and others (also see [9, Chapter VII]).

In the present paper we consider a new operation defined by a convolution of sections of an M-convex function that transforms the given M-convex function to an M-convex function, which we call a compression of an M-convex function. For the class of valuated generalized matroids, which is a special class of M-convex functions, the compression induces a valuated permutohedron together with a decomposition of the valuated generalized matroid into flag-matroid strips, each corresponding to a maximal linearity domain of the valuated permutohedron. We examine the details of the structure of flag-matroid strips and the induced valuated permutohedron. We investigate the structures of the strip decomposition of valuated generalized-matroids, special M-convex functions by means of discrete convex analysis of Murota ([13]). The strip decomposition of a valuated generalized-matroid uniquely determines a valuated permutohedron, identified with a special M-convex function.

The present paper is organized as follows. In Section 2 we give some definitions and preliminaries about (i) submodular/supermodular functions and related polyhedra such as base polyhedra, submodular/supermodular polyhedra, and generalized polymatroids, and (ii) M-/M-convex functions and L-/L-convex functions. In Section 3 we introduce a new operation called a compression of M-convex functions, which leads us to the concepts of flag-matroid strips and a strip decomposition of M-convex functions in Section 4. In Section 5 we consider valuated generalized matroids, which are special M-convex functions, and examine implications of our results in valuated generalized matroids. The strip decomposition of a valuated generalized matroid gives a collection of strips of flag matroids [2], each inducing a sub-permutohedra. The compression of a valuated generalized matroid induces a valuated permutohedron whose maximal linearity domains corresponds to flag-matroid strips of the valuated generalized matroid. Section 6 gives some concluding remarks.

2 Definitions and Preliminaries

Let E=[n](={1,,n})E=[n](=\{1,\cdots,n\}) for a positive integer n>1n>1. For any xEx\in\mathbb{R}^{E} and XEX\subseteq E define x(X)=eXx(e)x(X)=\sum_{e\in X}x(e), where x()=0x(\emptyset)=0. For any subset XEX\subseteq E its characteristic vector χX\chi_{X} in E\mathbb{R}^{E} is defined by χX(e)=1\chi_{X}(e)=1 if eXe\in X and χX(e)=0\chi_{X}(e)=0 if eEXe\in E\setminus X. We also write χe\chi_{e} instead of χ{e}\chi_{\{e\}} for eEe\in E.

2.1 Basics of submodular/supermodular functions

A function f:2Ef:2^{E}\to\mathbb{R} is called a submodular function if it satisfies

f(X)+f(Y)f(XY)+f(XY)(X,YE).f(X)+f(Y)\geq f(X\cup Y)+f(X\cap Y)\qquad(\forall X,Y\subseteq E). (2.1)

We assume that f()=0f(\emptyset)=0 for any set function f:2Ef:2^{E}\to\mathbb{R} in the sequel. A negative of a submodular function is called a supermodular function. (Also see [4, 9].)

For a submodular function f:2Ef:2^{E}\to\mathbb{R} define

P(f)={xEXE:x(X)f(X)},{\rm P}(f)=\{x\in\mathbb{R}^{E}\mid\forall X\subseteq E:x(X)\leq f(X)\}, (2.2)

which is called the submodular polyhedron associated with submodular function ff. Also define

B(f)={xP(f)x(E)=f(E)},{\rm B}(f)=\{x\in{\rm P}(f)\mid x(E)=f(E)\}, (2.3)

which is called the base polyhedron associated with submodular function ff. As is well known (see [9]), the base polyhedron B(f){\rm B}(f) is always nonempty (and is a face of P(f){\rm P}(f)).

For a supermodular function g:2Eg:2^{E}\to\mathbb{R} we define in a dual manner the supermodular polyhedron

P(g)={xEXE:x(X)g(X)}{\rm P}(g)=\{x\in\mathbb{R}^{E}\mid\forall X\subseteq E:x(X)\geq g(X)\} (2.4)

and the base polyhedron

B(g)={xP(g)x(E)=g(E)}.{\rm B}(g)=\{x\in{\rm P}(g)\mid x(E)=g(E)\}. (2.5)

For a submodular function f:2Ef:2^{E}\to\mathbb{R} define a supermodular function f#:2Ef^{\#}:2^{E}\to\mathbb{R} by

f#(X)=f(E)f(EX)(XE),f^{\#}(X)=f(E)-f(E\setminus X)\qquad(\forall X\subseteq E), (2.6)

which is called the dual supermodular function of ff. Then we have B(f)=B(f#){\rm B}(f)={\rm B}(f^{\#}). For more details about submodular/supermodular functions and associated polyhedra see [9, Chapter II], where submodular/supermodular functions defined on distributive lattices 𝒟2E{\mathcal{D}}\subseteq 2^{E} are also investigated and their base polyhedra are unbounded unless 𝒟=2E\mathcal{D}=2^{E}.

Define Q=QEQ_{\mathbb{Z}}=Q\cap\mathbb{Z}^{E} for any set QEQ\subseteq\mathbb{R}^{E}. Also denote by Conv(Q){\rm Conv}(Q) the convex hull of QQ in E\mathbb{R}^{E}. When Conv(Q)=Q{\rm Conv}(Q_{\mathbb{Z}})=Q, we identify QQ with QQ_{\mathbb{Z}}.

When a submodular function f:2Ef:2^{E}\to\mathbb{R} is integer-valued, its submodular polyhedron and base polyhedron are integral (i.e., every vertex of the polyhedra is an integral vector). Moreover, we have

Conv(P(f))=P(f),Conv(B(f))=B(f).{\rm Conv}({\rm P}(f)_{\mathbb{Z}})={\rm P}(f),\qquad{\rm Conv}({\rm B}(f)_{\mathbb{Z}})={\rm B}(f). (2.7)

Most of the following arguments are valid even if we regard \mathbb{Z} in place of \mathbb{R} as the underlying totally ordered additive group. When ff is integer-valued, we call P(f){\rm P}(f)_{\mathbb{Z}} and B(f){\rm B}(f)_{\mathbb{Z}} the submodular polyhedron and the base polyhedron, respectively, associated with ff as well. (This is the approach taken in [9], indeed.)

For a submodular function f:2Ef:2^{E}\to\mathbb{R} and a nonempty proper subset AA of EE define functions fA:2Af^{A}:2^{A}\to\mathbb{R} and fA:2EAf_{A}:2^{E\setminus A}\to\mathbb{R} by

fA(X)=f(X)(XA),fA(X)=f(XA)f(A)(XEA).f^{A}(X)=f(X)\quad(\forall X\subseteq A),\quad f_{A}(X)=f(X\cup A)-f(A)\quad(\forall X\subseteq E\setminus A). (2.8)

We also define fE=f=ff^{E}=f_{\emptyset}=f. We call fAf^{A} the restriction of ff on AA and fAf_{A} the contraction of ff by AA. Similarly we define the restriction and contraction for supermodular functions.

2.2 Permutohedra and sub-permutohedra

For a permutation π:[n][n]\pi:[n]\to[n] we identify π\pi with the permutation vector (π(1),,π(n))(\pi(1),\cdots,\pi(n)) in n\mathbb{Z}^{n}, which we denote by vπv_{\pi} (or simply by π\pi when there is no possibility of confusion). Suppose that an integer-valued submodular function f:2Ef:2^{E}\to\mathbb{Z} is given by f(X)=i=1|X|(ni+1)f(X)=\sum_{i=1}^{|X|}(n-i+1) for all XE=[n]X\subseteq E=[n]. Then the base polyhedron B(f){\rm B}(f) has the n!n! extreme points, each being a permutation vector identified with a permutation of [n][n], which is called the permutohedron (or permutahedron) in E\mathbb{R}^{E}.

For any permutation π\pi of [n][n] we have a unique complete flag

:F1Fn=[n]\mathcal{F}:F_{1}\subset\cdots\subset F_{n}=[n] (2.9)

such that for each i[n]i\in[n] FiF_{i} is the set of the first ii elements of (π(1),,π(n))(\pi(1),\cdots,\pi(n)), i.e.,

  1. 1.

    |Fi|=i|F_{i}|=i for each i=1,,ni=1,\cdots,n and

  2. 2.

    i=1nχFi=vπ\displaystyle{\sum_{i=1}^{n}\chi_{F_{i}}=v_{\pi}}, where χFi\chi_{F_{i}} is the characteristic vector of a set Fi[n]F_{i}\subseteq[n].

Denote the flag \mathcal{F} in (2.9) by π:F1πFnπ\mathcal{F}^{\pi}:F^{\pi}_{1}\subset\cdots\subset F^{\pi}_{n}.

Let us consider a polyhedron PP satisfying the following two:

  • (P1)

    PP is the convex hull of a set of some permutation vectors.

  • (P2)

    PP is a base polyhedron.

We call such a polyhedron PP a sub-permutohedron.333 We may call the sub-permutohedron a permutohedron, and an ordinary permutohedron a complete permutohedron, but we resist the temptation. A sub-permutohedron is precisely the Coxeter matroid polytope associated with a flag matroid of complete flag (see [2] and the discussions to be made in Section 5). A recent interesting appearance of a sub-permutohedron is from the theory of Bruhat order ([1]), due to Tsukerman and Williams [19], that every Bruhat interval polytope is a sub-permutohedron.

2.3 Base polyhedra, generalized polymatroids, and strong maps

Suppose that a submodular function f:2Ef:2^{E}\to\mathbb{R} and a supermodular function g:2Eg:2^{E}\to\mathbb{R} with f()=g()=0f(\emptyset)=g(\emptyset)=0 satisfy the following inequalities

f(X)g(Y)f(XY)g(YX)(X,YE).f(X)-g(Y)\geq f(X\setminus Y)-g(Y\setminus X)\quad(\forall X,Y\subseteq E). (2.10)

Then the polyhedron

P(f,g)={xEXE:g(X)x(X)f(X)}{\rm P}(f,g)=\{x\in\mathbb{R}^{E}\mid\forall X\subseteq E:g(X)\leq x(X)\leq f(X)\} (2.11)

is called a generalized polymatroid ([6, 10]). There exists a one-to-one correspondence between base polyhedra and generalized polymatroids up to translation along a coordinate axis as follows (see [9, Figure 3.7]).

Theorem 2.1 ([8, 9])

For the base polyhedron B(f){\rm B}(f) associated with a submodular function f:2Ef:2^{E}\to\mathbb{R} the projection of B(f){\rm B}(f) along an axis eEe\in E on the coordinate subspace given by the hyperplane x(e)=0x(e)=0 is a generalized polymatroid P(f,g){\rm P}(f^{\prime},g^{\prime}) in E\mathbb{R}^{E^{\prime}} with E=E{e}E^{\prime}=E-\{e\}, where ff^{\prime} is the restriction of ff on E{E^{\prime}} and gg^{\prime} is the restriction of f#f^{\#} on E{E^{\prime}}.

Conversely, every generalized polymatroid in E\mathbb{R}^{E^{\prime}} is obtained in this way.

When a generalized polymatroid is a convex hull of {0,1}\{0,1\}-valued points (vertices), then it is called a generalized-matroid polytope, which can be identified with the family 𝒢{\mathcal{G}} of subsets XEX\subseteq E such that χX\chi_{X} are vertices of the generalized-matroid polytope. The family 𝒢\mathcal{G} is called a generalized matroid (see [7]).

An ordered pair (f1,f2)(f_{1},f_{2}) of submodular functions fi:2Ef_{i}:2^{E}\to\mathbb{R} (i=1,2)(i=1,2) is called a weak map if we have

P(f2)P(f1).{\rm P}(f_{2})\subseteq{\rm P}(f_{1}). (2.12)

Moreover, the ordered pair (f1,f2)(f_{1},f_{2}) is called a strong map if we have

P((f2)X)P((f1)X)(XE),{\rm P}((f_{2})_{X})\subseteq{\rm P}((f_{1})_{X})\qquad(\forall X\subset E), (2.13)

i.e., every ordered pair ((f1)X,(f2)X)((f_{1})_{X},(f_{2})_{X}) of contractions of f1f_{1} and f2f_{2} by XEX\subset E is a weak map. The concept of strong map was originally considered for matroids (see [17, 18, 20]), and we adopt it to any submodular functions (or submodular systems).

We also have the following theorem.

Theorem 2.2

P(f,g){\rm P}(f,g) is a generalized polymatroid if and only if (f,g#)(f,g^{\#}) is a strong map.
(Proof) Relation (2.13) is equivalent to the following inequalities

f2(ZX)f2(X)f1(ZX)f1(X)(XE,ZEX).f_{2}(Z\cup X)-f_{2}(X)\leq f_{1}(Z\cup X)-f_{1}(X)\qquad(X\subset E,\ Z\subseteq E\setminus X). (2.14)

Putting W=E(ZX)W=E\setminus(Z\cup X), (2.14) is rewritten in terms of the dual supermodular function f2#f_{2}^{\#} of f2f_{2} as

f2#(ZW)f2#(W)f1(ZX)f1(X)(XE,ZEX)f_{2}^{\#}(Z\cup W)-f_{2}^{\#}(W)\leq f_{1}(Z\cup X)-f_{1}(X)\qquad(X\subset E,\ Z\subseteq E\setminus X) (2.15)

with W=E(ZX)W=E\setminus(Z\cup X). Because of the supermodularity of f2#f_{2}^{\#} (2.15) is equivalent to

f2#(ZW)f2#(W)f1(ZX)f1(X)f_{2}^{\#}(Z\cup W)-f_{2}^{\#}(W)\leq f_{1}(Z\cup X)-f_{1}(X) (2.16)

for all X,W,ZEX,W,Z\subset E with XW=XZ=ZW=X\cap W=X\cap Z=Z\cap W=\emptyset, which is equivalent to (2.10). \Box

A sequence of submodular functions f1,,fp:2Ef_{1},\cdots,f_{p}:2^{E}\to\mathbb{R} is called a strong map sequence if for each i=1,,i=1,\cdots, pp-11 the pair (fi+1,fi)(f_{i+1},f_{i}) is a strong map. It follows from Theorem 2.2 that

  • (F1)

    Given a strong map sequence f1,,fp:2Ef_{1},\cdots,f_{p}:2^{E}\to\mathbb{R}, we have a sequence of generalized polymatroids P(fi+1,(fi)#){\rm P}(f_{i+1},(f_{i})^{\#}) for i=1,,p1i=1,\cdots,p-1.

We also have the following.

  • (F2)

    For a generalized polymatroid P(f,g){\rm P}(f,g) and α\alpha\in\mathbb{R} such that g(E)αf(E)g(E)\leq\alpha\leq f(E) the intersection of P(f,g){\rm P}(f,g) and the hyperplane x(E)=αx(E)=\alpha is a base polyhedron (we call such a base polyhedron a section of P(f,g){\rm P}(f,g) and denote it by P(f,g)(α){\rm P}(f,g)_{(\alpha)}). When P(f,g){\rm P}(f,g) is integral and α\alpha is an integer, the section P(f,g)(α){\rm P}(f,g)_{(\alpha)} is integral. Moreover, letting B(f){\rm B}(f^{\prime}) be a section of P(f,g){\rm P}(f,g) with a submodular function ff^{\prime}, we have a strong map sequence g#,f,fg^{\#},f^{\prime},f, i.e., P(f,g){\rm P}(f^{\prime},g) and P(f,(f)#){\rm P}(f,(f^{\prime})^{\#}) are generalized polymatroids.

A strong map sequence f1,,fp:2Ef_{1},\cdots,f_{p}:2^{E}\to\mathbb{Z} with fif_{i} (i=1,,p)(i=1,\cdots,p) being matroid rank functions is called a flag matroid (see [2]).

2.4 M-convex functions and L-convex functions

Let f:E{+}f:\mathbb{R}^{E}\to\mathbb{R}\cup\{+\infty\} be a polyhedral convex function such that

  1. 1.

    its effective domain, domf{xEf(x)<+}{\rm dom}f\equiv\{x\in\mathbb{R}^{E}\mid f(x)<+\infty\}, is a generalized polymatroid (hence nonempty) and

  2. 2.

    every linearity domain of ff is a generalized polymatroid,

where a linearity domain (or affinity domain) of ff is Argmin(fh){\rm Arg}\min(f-h) for a linear function h(x)=z,x(eEz(e)x(e))h(x)=\langle z,x\rangle(\equiv\sum_{e\in E}z(e)x(e)) with some z(E)z\in(\mathbb{R}^{E})^{*}. Then ff is called an M-convex function444Here we employ another equivalent definition of M-convexity, instead of the original definition by means of the exchange axiom introduced by Murota and Shioura [16, 13]., which is due to Murota and Shioura [16, 13] (also see [9, Section 17]). The negative of an M-convex function is called an M-concave function (see Figure 1).

Refer to caption
Figure 1: An M-concave function gg ([9, Fig. 17.4]).

When domf{\rm dom}f of an M-convex function ff is a base polyhedron, ff is called an M-convex function (see [13]).555 M-convex functions were introduced earlier than M-convex functions by Murota (see [12, 13]). There is a one-to-one correspondence between M-convex functions and M-convex convex functions because of Theorem 2.1. Any concept related to M-/M-concave functions is defined in a natural way from that defined for M-/M-convex functions.

Now let f:E{+}f:\mathbb{R}^{E}\to\mathbb{R}\cup\{+\infty\} be an M-convex function satisfying the following (a) and (b):

  • (a)

    The effective domain of ff is an integral generalized polymatroid.

  • (b)

    Every linearity domain of ff is also an integral generalized polymatroid.

Such an M-convex function ff can be identified with ff being restricted on the integer lattice E\mathbb{Z}^{E}. So we can consider an M-convex function f:E{+}f:\mathbb{Z}^{E}\to\mathbb{R}\cup\{+\infty\}. An integer-valued M-concave function g:E{+}g:\mathbb{Z}^{E}\to\mathbb{Z}\cup\{+\infty\} with its effective domain being a matroid base polytope coincides with a valuated matroid due to Dress and Wenzel [3]. Also, if an M-convex function f:E{+}f:\mathbb{Z}^{E}\to\mathbb{R}\cup\{+\infty\} has an effective domain dom(f){\rm dom}(f) whose convex hull Conv(dom(f)){\rm Conv}({\rm dom}(f)) is a permutohedron, we call ff a valuated permutohedron.

For any M-convex function f:E{+}f:\mathbb{R}^{E}\to\mathbb{R}\cup\{+\infty\} define the Legendre-Fenchel transform (or convex conjugate) of ff by

f(y)=sup{y,xf(x)xE}(y(E)),f^{\bullet}(y)=\sup\{\langle y,x\rangle-f(x)\mid x\in\mathbb{R}^{E}\}\qquad(y\in(\mathbb{R}^{E})^{*}), (2.17)

where y,x=eEy(e)x(e)\langle y,x\rangle=\sum_{e\in E}y(e)x(e). The function ff^{\bullet} is called an L-convex function ([13]), which is equivalent to submodular integrally convex function due to Favati and Tardella [5]. The original ff is recovered from ff^{\bullet} by taking another Legendre-Fenchel transform as follows.

f(x)=sup{y,xf(y)y(E)}(xE).f(x)=\sup\{\langle y,x\rangle-f^{\bullet}(y)\mid y\in(\mathbb{R}^{E})^{*}\}\qquad(x\in\mathbb{R}^{E}). (2.18)

(See [13, 14].) Hence there exists a one-to-one correspondence between M-convex functions and L-convex functions. Furthermore, Murota [13] showed the integrality property that (2.17) and (2.18) with \mathbb{R} being replaced by \mathbb{Z} hold for any M-convex function f:E{+}f:\mathbb{Z}^{E}\to\mathbb{Z}\cup\{+\infty\}. When ff is an M-convex function, its Legendre-Fenchel transform is what is called an L-convex function ([13]).

For any xEx\in\mathbb{R}^{E} the subdifferential of ff at xx, denoted by f(x)\partial f(x), is defined by

f(x)={w(E)zE:f(z)f(x)+w,zx}.\partial f(x)=\{w\in(\mathbb{R}^{E})^{*}\mid\forall z\in\mathbb{R}^{E}:f(z)\geq f(x)+\langle w,z-x\rangle\}. (2.19)

The subdifferential of ff^{\bullet} is defined similarly as

f(w)={xEy(E):f(y)f(w)+yw,x}.\partial f^{\bullet}(w)=\{x\in\mathbb{R}^{E}\mid\forall y\in(\mathbb{R}^{E})^{*}:f^{\bullet}(y)\geq f^{\bullet}(w)+\langle y-w,x\rangle\}. (2.20)

Then we have the following.

Lemma 2.3

We have wf(x)w\in\partial f(x) if and only if xf(w)x\in\partial f^{\bullet}(w).
(Proof) Note that both statements, wf(x)w\in\partial f(x) and xf(w)x\in\partial f^{\bullet}(w), are equivalent to that f(x)+f(w)w,xf(x)+f^{\bullet}(w)\leq\langle w,x\rangle. \Box

Remark: When f:E{+}f:\mathbb{R}^{E}\to\mathbb{R}\cup\{+\infty\} is an M-convex function, subdifferentials f(w)\partial f^{\bullet}(w) for all wdom(f)w\in{\rm dom}(f^{\bullet}) are generalized polymatroids (or M-convex sets). Furthermore, if ff is defined on integer lattice E\mathbb{Z}^{E}, then f(w)\partial f^{\bullet}(w) for all wdom(f)w\in{\rm dom}(f^{\bullet}) are integral generalized polymatroids (restricted on E\mathbb{Z}^{E}). \Box

For more details about M-/M-convex functions and L-/L-convex functions see [13, 14] and [9, Chapter VII].

3 Compression of M-convex Functions

In this section we introduce a new transformation, called compression, of M-convex functions defined on the integer lattice E\mathbb{Z}^{E}. The compression of an M-convex function f:E{+}f:\mathbb{Z}^{E}\to\mathbb{R}\cup\{+\infty\} is a transformation of the M-convex function ff to an M-convex function f^:E{+}\hat{f}:\mathbb{Z}^{E}\to\mathbb{R}\cup\{+\infty\}.

Consider any M-convex function f:E{+}f:\mathbb{Z}^{E}\to\mathbb{R}\cup\{+\infty\}. We suppose the following:

  • The effective domain dom(f){\rm dom}(f) is bounded and full-dimensional. That is, dom(f){\rm dom}(f) is a full-dimensional generalized polymatroid P(f,g){\rm P}(f^{*},g^{*}) (with finite f(E)>g(E)f^{*}(E)>g^{*}(E)).

For each integer α\alpha such that f(E)αg(E)f^{*}(E)\geq\alpha\geq g^{*}(E) let f(α):E{+}f_{(\alpha)}:\mathbb{Z}^{E}\to\mathbb{R}\cup\{+\infty\} be the M-convex function defined by

f(α)(x)={f(x)ifx(E)=α+otherwise(xE)f_{(\alpha)}(x)=\left\{\begin{array}[]{ll}f(x)&{\rm if\ }\ x(E)=\alpha\\ +\infty&{\rm otherwise}\end{array}\right.\quad(x\in\mathbb{Z}^{E}) (3.1)

(cf. [13, 14, 15]). We call f(α)f_{(\alpha)} the α\alpha-section of ff. Then put

If={αf(E)αg(E)}I_{f}=\{\alpha\in\mathbb{Z}\mid f^{*}(E)\geq\alpha\geq g^{*}(E)\} (3.2)

and consider the convolution of all the sections f(α)f_{(\alpha)} (αIf)(\alpha\in I_{f}), which is given by

f^(x)=min{αIff(α)(yα)|x=αIfyα,αIf:yαE}(xE).\hat{f}(x)=\min\left\{\sum_{\alpha\in I_{f}}f_{(\alpha)}(y_{\alpha})\ \Bigl{|}\ x=\sum_{\alpha\in I_{f}}y_{\alpha},\ \forall\alpha\in I_{f}:y_{\alpha}\in\mathbb{Z}^{E}\right\}\quad(x\in\mathbb{Z}^{E}). (3.3)

where note that f(α)(yα)<+f_{(\alpha)}(y_{\alpha})<+\infty only if yα(E)=αy_{\alpha}(E)=\alpha. The convolution of M-convex functions f(α)f_{(\alpha)} (αIf)(\alpha\in I_{f}) is an M-convex function ([13, 14, 15]). We call f^\hat{f} the compression of ff.

Theorem 3.1

For the compression f^\hat{f} of an M-convex function f:E{+}f:\mathbb{Z}^{E}\to\mathbb{R}\cup\{+\infty\} the Legendre-Fenchel transform of f^\hat{f} is given by

f^=αIff(α).\hat{f}^{\bullet}=\sum_{\alpha\in I_{f}}f_{(\alpha)}{}^{\bullet}. (3.4)

We also have

dom(f^)=αIfdom(f(α)),{\rm dom}(\hat{f})=\sum_{\alpha\in I_{f}}{\rm dom}(f_{(\alpha)}), (3.5)

where the right-hand side is the Minkowski sum of the effective domains dom(f(α)){\rm dom}(f_{(\alpha)}) for all αIf\alpha\in I_{f}.
(Proof) For any w(E)w\in(\mathbb{R}^{E})^{*} we have from (3.3)

f^(w)\displaystyle\hat{f}^{\bullet}(w) =\displaystyle= sup{w,xf^(x)xE}\displaystyle\sup\{\langle w,x\rangle-\hat{f}(x)\mid x\in\mathbb{Z}^{E}\} (3.6)
=\displaystyle= sup{αIf(w,yαf(α)(yα))|αIf:yαE}\displaystyle\sup\left\{\sum_{\alpha\in I_{f}}(\langle w,y_{\alpha}\rangle-f_{(\alpha)}(y_{\alpha}))\ \Bigl{|}\ \ \forall\alpha\in I_{f}:y_{\alpha}\in\mathbb{Z}^{E}\right\}
=\displaystyle= αIfsup{w,xf(α)(x)xE}\displaystyle\sum_{\alpha\in I_{f}}\sup\{\langle w,x\rangle-f_{(\alpha)}(x)\mid x\in\mathbb{Z}^{E}\}
=\displaystyle= αIff(α)(w).\displaystyle\sum_{\alpha\in I_{f}}f_{(\alpha)}{}^{\bullet}(w).

This implies (3.4) and (3.5) because of the definition of the Legendre-Fenchel transform. \Box

4 M-convex Functions and Strip Decomposition

Now, we introduce a concept of the strip decomposition of an M-convex function defined on the integer lattice E\mathbb{Z}^{E}. Let f:E{+}f:\mathbb{Z}^{E}\to\mathbb{R}\cup\{+\infty\} be an M-convex function with a full-dimensional and bounded effective domain dom(f){\rm dom}(f).

4.1 Strip decomposition of M-convex functions

For the compression f^\hat{f} of ff and for any w(E)w\in(\mathbb{R}^{E})^{*} let us define D(f^,w)ED(\hat{f},w)\subseteq\mathbb{Z}^{E} and D(f(α),w)ED(f_{(\alpha)},w)\subseteq\mathbb{Z}^{E} (αIf)(\alpha\in I_{f}) by

D(f^,w)=Argmin{f^(x)w,xxE}D(\hat{f},w)={\rm Arg}\min\{\hat{f}(x)-\langle w,x\rangle\mid x\in\mathbb{Z}^{E}\} (4.1)
D(f(α),w)=Argmin{f(α)(x)w,xxE}(αIf),D(f_{(\alpha)},w)={\rm Arg}\min\{f_{(\alpha)}(x)-\langle w,x\rangle\mid x\in\mathbb{Z}^{E}\}\quad(\alpha\in I_{f}), (4.2)

where recall (3.2) for the definition of IfI_{f}. We call D(f^,w)D(\hat{f},w) and D(f(α),w)D(f_{(\alpha)},w) (αIf)(\alpha\in I_{f}) linearity domains of f^\hat{f} and f(α)f_{(\alpha)} (αIf)(\alpha\in I_{f}), respectively, associated with ww. We see from Lemma 2.3 that for every xEx\in\mathbb{Z}^{E} we have

xD(f^,w)wf^(x)xf^(w).x\in D(\hat{f},w)\Longleftrightarrow w\in\partial\hat{f}(x)\Longleftrightarrow x\in\partial\hat{f}^{\bullet}(w). (4.3)

This means that the linearity domains of f^\hat{f} are exactly the subdifferentials of f^\hat{f}^{\bullet}.

Let 𝕊\mathbb{S} be the collection of all maximal linearity domains of f^\hat{f} (or maximal subdifferentials of f^\hat{f}^{\bullet}). Then 𝕊\mathbb{S} gives a polyhedral division of dom(f^){\rm dom}(\hat{f}) into generalized polymatroids. Since dom(f){\rm dom}(f) is full-dimensional, the dimension of dom(f^){\rm dom}(\hat{f}) is equal to |E|1(=n1)|E|-1(=n-1).

For each maximal linearity domain S𝕊S\in\mathbb{S} of f^\hat{f} let ww be a vector in (E)(\mathbb{R}^{E})^{*} such that S=f^(w)S=\partial\hat{f}^{\bullet}(w) and put D(α,S)=D(f(α),w){D}(\alpha,S)=D({f}_{(\alpha)},w) (αIf)(\alpha\in I_{f}), which are independent of the choice of ww satisfying S=f^(w)S=\partial\hat{f}^{\bullet}(w). Define D(S)={D(α,S)αIf}{D}(S)=\bigcup\{{D}(\alpha,S)\mid\alpha\in I_{f}\} and let fD(S)f^{{D}(S)} be the restriction of ff on D(S){D}(S). Then we call fD(S)=(f(α)D(α,S)αIf)f^{{D}(S)}=(f_{(\alpha)}^{{D}(\alpha,S)}\mid\alpha\in I_{f}) a strip of ff associated with S𝕊S\in\mathbb{S}, where f(α)D(α,S)f_{(\alpha)}^{{D}(\alpha,S)} is the restriction of f(α)f_{(\alpha)} on D(α,S){D}(\alpha,S). (See Figure 2 for an example of a strip of an M-concave function.) The collection of the strips fD(S)f^{{D}(S)} for all S𝕊S\in\mathbb{S} is called the strip decomposition of ff.

Refer to caption
Figure 2: A strip of an M-concave function gg indicated by shade.

4.2 Strips viewed from parametric optimization

For any strip fD(S)=(f(α)D(α,S)αIf)f^{{D}(S)}=(f_{(\alpha)}^{{D}(\alpha,S)}\mid\alpha\in I_{f}) of an M-convex function ff associated with S𝕊S\in\mathbb{S} let ww be a vector in (E)(\mathbb{R}^{E})^{*} such that S=f^(w)S=\partial\hat{f}^{\bullet}(w). Then consider a parametric optimization problem 𝐏(λ){\bf P}(\lambda) with a parameter λ\lambda\in\mathbb{R} described as follows.

𝐏(λ):Minimizef(x)w+λ𝟏,xsubjecttoxdom(f),{\bf P}(\lambda):\ \ {\rm Minimize}\ f(x)-\langle w+\lambda{\bf 1},x\rangle\ {\rm\ subject\ to\ }\ x\in{\rm dom}(f), (4.4)

where 𝟏=χE{\bf 1}=\chi_{E} is the nn-dimensional vector of all ones. We then have the following theorem.

Theorem 4.1

For w(E)w\in(\mathbb{R}^{E})^{*} chosen as above there exist a finite sequence of values λ0=<λ1<<λp<λp+1=+\lambda_{0}=-\infty<\lambda_{1}<\cdots<\lambda_{p}<\lambda_{p+1}=+\infty and that of integers k0=g(E)<k1<<kp=f(E)k_{0}=g^{*}(E)<k_{1}<\cdots<k_{p}=f^{*}(E) such that the set X(λ)X^{*}(\lambda) of optimal solutions of 𝐏(λ){\bf P}(\lambda) for each λ\lambda\in\mathbb{R} is given by

X(λ)={α=k1kD(α,S)ifλ=λ(=1,,p)D(k,S)ifλ(λ,λ+1)(=0,,p).X^{*}(\lambda)=\left\{\begin{array}[]{ll}\bigcup_{\alpha=k_{\ell-1}}^{\ k_{\ell}}{D}(\alpha,S)&{\rm if\ }\lambda=\lambda_{\ell}\quad(\ell=1,\cdots,p)\\ {D}(k_{\ell},S)&{\rm if\ }\ \lambda\in(\lambda_{\ell},\lambda_{\ell+1})\quad(\ell=0,\cdots,p).\end{array}\right. (4.5)

(Proof) Because of the discrete structure of the M-convex function ff and the assumption that dom(f){\rm dom}(f) is bounded, there exists a finite sequence of values λ0=<λ1<<λp<λp+1=+\lambda_{0}=-\infty<\lambda_{1}<\cdots<\lambda_{p}<\lambda_{p+1}=+\infty such that

  1. 1.

    for each i=0,1,,pi=0,1,\cdots,p Problems 𝐏(λ){\bf P}(\lambda) for all λ(λi,λi+1)\lambda\in(\lambda_{i},\lambda_{i+1}) have one and the same optimal solution set, and

  2. 2.

    the set X(λi)X^{*}(\lambda_{i}) of optimal solutions of 𝐏(λi){\bf P}(\lambda_{i}) for each i=1,,pi=1,\cdots,p consists of more than one optimal solution and we have X(λi)X(λi+1)=X(λ)X^{*}(\lambda_{i})\cap X^{*}(\lambda_{i+1})=X^{*}(\lambda) for each i=1,,p1i=1,\cdots,p-1 and λ(λi,λi+1)\lambda\in(\lambda_{i},\lambda_{i+1}).

Hence the optimal solution sets X(λ)X^{*}(\lambda) are expressed as (4.5) for a sequence of some integers k0=g(E)<k1<<kp=f(E)k_{0}=g^{*}(E)<k_{1}<\cdots<k_{p}=f^{*}(E) that gives a division of the interval IfI_{f}. \Box

Here it should be noted that values λi\lambda_{i} (i=1,,p)(i=1,\cdots,p) depend on the choice of wSw\in S, while the vectors w+λi𝟏w+\lambda_{i}{\bf 1} (i=1,,p)(i=1,\cdots,p) are uniquely determined by ff and S𝕊S\in\mathbb{S}, because of the assumptions that dom(f){\rm dom}(f) is full-dimensional and S𝕊S\in\mathbb{S} is a maximal linearity domain of f^\hat{f}.

5 Valuated Generalized Matroids

In this section we further investigate the structures of strips and their compressions for a class of valuated generalized matroids, which are special M-convex functions defined on the unit hypercube {0,1}E\{0,1\}^{E}, in more details.

Let f:E{+}f:\mathbb{Z}^{E}\to\mathbb{Z}\cup\{+\infty\} be an M-convex function such that dom(f)={0,1}E{\rm dom}(f)=\{0,1\}^{E}, which is called a valuated generalized matroid. For any XEX\subseteq E we often write f(X)f(X) as f(χX)f(\chi_{X}) and regard ff as a function on 2E2^{E} in the sequel.

5.1 Compression of valuated generalized matroids and valuated permutohedra

For a valuated generalized matroid f:2Ef:2^{E}\to\mathbb{Z} the compression f^\hat{f} of ff given by (3.3) becomes

f^(x)=min{α[n]f(Yα)|x=α[n]χYα,α[n]:Yα(Eα)}(xE),\hat{f}(x)=\min\left\{\sum_{\alpha\in[n]}f(Y_{\alpha})\ \Bigl{|}\ x=\sum_{\alpha\in[n]}\chi_{Y_{\alpha}},\ \forall\alpha\in[n]:Y_{\alpha}\in{E\choose\alpha}\right\}\quad(x\in\mathbb{Z}^{E}), (5.1)

where (Eα)={XE|X|=α}{E\choose\alpha}=\{X\subseteq E\mid|X|=\alpha\} for α[n]\alpha\in[n], and we define f^(x)=+\hat{f}(x)=+\infty if the minimum on the right-hand side does not exist for xEx\in\mathbb{Z}^{E}. Then the effective domain of the compression f^\hat{f} given by (3.6) is expressed by the following Minkowski sum:

dom(f^)=α[n]{χY|Y(Eα)}.{\rm dom}(\hat{f})=\sum_{\alpha\in[n]}\left\{\chi_{Y}\ \Bigl{|}\ Y\in{E\choose\alpha}\right\}. (5.2)

Recall that E=[n]E=[n].

Theorem 5.1

The effective domain dom(f^){\rm dom}(\hat{f}) of the compression f^\hat{f} is a permutohedron. Hence the compression f^\hat{f} is a valuated permutohedron, whose linearity domains are sub-permutohedra.
(Proof) The right-hand side of (5.2) is the Minkowski sum of the sets of the characteristic vectors of bases of uniform matroids Uα,nU_{\alpha,n} of rank α\alpha for α[n]\alpha\in[n]. Hence it is a base polyhedron whose every extreme point (a greedy solution in the sense of Edmonds [4]) is a permutation (π(1),,π(n))n(\pi(1),\cdots,\pi(n))\in\mathbb{Z}^{n} of [n][n] and vice versa. It follows that (the convex hull of) dom(f^){\rm dom}(\hat{f}) is a permutohedron and f^\hat{f} is a valuated permutohedron. Moreover, for any generic w(E)w\in(\mathbb{R}^{E})^{*} and every α[n]\alpha\in[n], D(f(α),w)D(f_{(\alpha)},w) in (4.2) is a singlton, χF(α)\chi_{F(\alpha)} say. Then, for each α=1,,n\alpha=1,\cdots,n we have F(α1)F(α)F(\alpha-1)\subset F(\alpha) and χF(α)χF(α1)=χi\chi_{F(\alpha)}-\chi_{F(\alpha-1)}=\chi_{i} for some i[n](=E)i\in[n]\ (=E) with F(0)=F(0)=\emptyset. Hence sets F(α)F(\alpha) for α=1,,n\alpha=1,\cdots,n  form a complete flag

=F(0)F(1)F(2)F(n)=[n]\emptyset=F(0)\subset F(1)\subset F(2)\subset\cdots\subset F(n)=[n] (5.3)

and it determines a permutation π\pi of [n][n] with the permutation vector vπ=α=1nχF(α)v_{\pi}=\sum_{\alpha=1}^{n}\chi_{F(\alpha)}. It follows that every linearity domain of the compression f^\hat{f} is a sub-permutohedron. \Box

5.2 Strips of valuated generalized matroids and flag matroids

For every S𝕊S\in\mathbb{S} we have the strip fD(S)=(f(α)D(α,S)α=0,1,,n)f^{{D}(S)}=(f_{(\alpha)}^{{D}(\alpha,S)}\mid\alpha=0,1,\cdots,n) of ff associated with S𝕊S\in\mathbb{S}, which is characterized as follows. We identify χX\chi_{X} with XX for any X[n]X\subseteq[n].

Theorem 5.2

For each α=0,1,,n\alpha=0,1,\cdots,n we have a base family D(α,S){D}(\alpha,S) of a matroid ([n],ραS)([n],\rho_{\alpha}^{S}) with a rank function ραS\rho_{\alpha}^{S} satisfying ραS([n])=α\rho_{\alpha}^{S}([n])=\alpha. Moreover, the sequence of (ραSα=0,1,,n)(\rho_{\alpha}^{S}\mid\alpha=0,1,\cdots,n) is that of strong maps, i.e., a flag matroid.
(Proof) The present theorem follows from the definition of the strip fD(S)=(f(α)D(α,S)α=0,1,,n)f^{{D}(S)}=(f_{(\alpha)}^{{D}(\alpha,S)}\mid\alpha=0,1,\cdots,n) and the assumption that ff is a valuated generalized matroid. \Box

We call the flag matroid (ραSα=0,1,,n)(\rho_{\alpha}^{S}\mid\alpha=0,1,\cdots,n) the flag matroid associated with a strip S𝕊S\in\mathbb{S} (or a flag-matroid strip) of the valuated generalized matroid ff. We see from Theorems 5.1 and 5.2 the following.

Theorem 5.3

Every valuated generalized matroid ff induces a valuated permutohedron f^\hat{f} by its compression and each flag-matroid strip of ff corresponds to a maximal linearity domain, a sub-permutohedron, of the induced valuated permutohedron.

Every valuated generalized matroid is regarded as a valuated permutohedron endowed with valuated flag-matroid strips, one for each maximal linearity domain of it.

6 Concluding Remarks

We have introduced the concepts of strip decomposition and compression of M-convex functions. We have examined the structures of valuated generalized-matroids by considering the strip decomposition of a valuated generalized-matroid into flag-matroid strips. The compression of a valuated generalized matroid induces a valuated permutohedron, a special M-convex function of Murota [13]. We thus have a new transformation, which we call the compression, of a valuated generalized-matroid (an M-convex function) to a valuated permutohedron (a special M-convex function). Every Bruhat interval polytope is known to be a sub-permutohedron, due to Tsukerman and Williams [19]. It is interesting to investigate Bruhat interval polytopes from a point of view of the strip decomposition of valuated generalized-matroids and also from a point of view of valuated permutohedra.

Acknowledgments

Satoru Fujishige’s work is supported by JSPS KAKENHI Grant Number 19K11839 and Hiroshi Hirai’s work is supported by JSPS KAKENHI Grant Number JP17K00029 and JST PRESTO Grant Number JPMJPR192A, Japan.

References

  • [1] A. Björner and F. Brenti: Combinatorics of Coxeter Groups (Graduate Texts in Mathematics 231), Springer, 2005.
  • [2] A. V. Borovik, I. M. Gelfand and N. White: Coxeter Matroids (Progress in Mathematics, Vol. 216) (Birkhäuser, Boston, 2003).
  • [3] A. W. M. Dress and W. Wenzel: Valuated matroids. Advances in Mathematics 93 (1992) 214–250.
  • [4] J. Edmonds: Submodular functions, matroids, and certain polyhedra. Proceedings of the Calgary International Conference on Combinatorial Structures and Their Applications (R. Guy, H. Hanani, N. Sauer and J. Schönheim, eds., Gordon and Breach, New York, 1970), pp. 69–87; also in: Combinatorial Optimization—Eureka, You Shrink! (M. Jünger, G. Reinelt and G. Rinaldi, eds., Lecture Notes in Computer Science 2570, Springer, Berlin, 2003), pp. 11–26.
  • [5] P. Favati and F. Tardella: Convexity in nonlinear integer programming. Ricerca Operativa 53 (1990) 3–44.
  • [6] A. Frank: Generalized polymatroids. Proceedings of the Sixth Hungarian Combinatorial Colloquium (Eger, 1981); also, Finite and Infinite Sets, I (A. Hajnal, L. Lovász and V. T. Sós, eds., Colloquia Mathematica Societatis János Bolyai 37, North-Holland, Amsterdam, 1984), pp. 285–294.
  • [7] A. Frank and É. Tardos: Generalized polymatroids and submodular flows. Mathematical Programming 42 (1988) 489–563.
  • [8] S. Fujishige: A note on Frank’s generalized polymatroids. Discrete Applied Mathematics 7 (1984) 105–109.
  • [9] S. Fujishige: Submodular Functions and Optimization (North-Holland, 1991), Second Edition (Elsevier, 2005).
  • [10] R. Hassin: Minimum cost flow with set-constraints. Networks 12 (1982) 1–21.
  • [11] K. Murota: Matroid valuation on independent sets. Journal of Combinatorial Theory 69B (1997) 59–78.
  • [12] K. Murota: Discrete convex analysis. Mathematical Programming, Ser. A 83 (1998) 313–371.
  • [13] K. Murota: Discrete Convex Analysis (SIAM Monographs on Discrete Mathematics and Applications 10, SIAM, 2003).
  • [14] K. Murota: Discrete convex analysis: A tool for economics and game theory, Journal of Mechanism and Institution Design, 1 (2016) 151–273.
  • [15] K. Murota: A survey of fundamental operations on discrete convex functions of various kinds. Optimization Methods and Software (to appear)
    DOI: 10.1080/10556788.2019.1692345
  • [16] K. Murota and A. Shioura: M-convex function on generalized polymatroid. Mathematics of Operations Research 24 (1999) 95–105.
  • [17] J. Oxley: Matroid Theory (Oxford University Press, Oxford, 1992).
  • [18] A. Schrijver: Combinatorial Optimization—Polyhedra and Efficiency (Springer, Heidelberg, 2003).
  • [19] E. Tsukerman and L. Williams: Bruhat interval polytopes. Advances in Mathematics 285 (2015) 766–810.
  • [20] D. J. A. Welsh: Matroid Theory (Academic Press, London, 1976).