1. A preliminary version of this paper appeared as [12]

Anuj Dawar and Gregory Wilsenach
Department of Computer Science and Technology
University of Cambridge.
[email protected], [email protected]

Abstract

Dawar and Wilsenach (ICALP 2020) introduce the model of symmetric arithmetic circuits and show an exponential separation between the sizes of symmetric circuits for computing the determinant and the permanent. The symmetry restriction is that the circuits which take a matrix input are unchanged by a permutation applied simultaneously to the rows and columns of the matrix. Under such restrictions we have polynomial-size circuits for computing the determinant but no subexponential size circuits for the permanent. Here, we consider a more stringent symmetry requirement, namely that the circuits are unchanged by arbitrary even permutations applied separately to rows and columns, and prove an exponential lower bound even for circuits computing the determinant. The result requires substantial new machinery. We develop a general framework for proving lower bounds for symmetric circuits with restricted symmetries, based on a new support theorem and new two-player restricted bijection games. These are applied to the determinant problem with a novel construction of matrices that are bi-adjacency matrices of graphs based on the CFI construction. Our general framework opens the way to exploring a variety of symmetry restrictions and studying trade-offs between symmetry and other resources used by arithmetic circuits.

1 Introduction

The central open question in the field of arithmetic circuit complexity is the separation of the complexity classes $\mathrm{VP}$ and $\mathrm{VNP}$ . Sometimes known as Valiant’s conjecture, this is also described as the algebraic analogue of the $\mathrm{P}$ vs. $\mathrm{NP}$ question. The conjecture is equivalent to the statement that the permanent of a matrix cannot be expressed by a family of polynomial-size arithmetic circuits. Lower bounds on the size of circuits computing the permanent have been established by imposing certain restrictions on the circuit model. For instance, it is known that there is no subexponential family of monotone circuits for the permanent [20] and an exponential lower bound for the permanent is also known for depth-3 arithmetic circuits [14]. In both these cases, the lower bound obtained for the permanent also applies to the determinant, which is known to be in $\mathrm{VP}$ ¹¹1Note that the determinant is not itself monotone in the usual sense, but there is a suitably adapted notion of monotonicity called syntactic monotonicity in [21] with respect to which the determinant does have circuits, but not subexponential size ones.. In that sense, the lower bounds tell us more about the weakness of the model resulting from the restriction than the difficulty of computing the permanent.

In this paper we focus on another restriction on arithmetic circuits introduced relatively recently: that of symmetry [11]. This has been shown to give an exponential separation in the size of circuits computing the permanent and the determinant. We first introduce this restriction informally (a more formal definition is given in Section 3). Given a field $\mathbb{F}$ and a set of variables $X$ , let $C$ be a circuit computing a polynomial $p$ in $\mathbb{F}[X]$ . For a group $G$ acting on the set $X$ , we say that $C$ is $G$ -symmetric if the action of any element $g\in G$ on the inputs of $C$ can be extended to an automorphism of $C$ . Of course, this makes sense only when the polynomial $p$ itself is invariant under the action of $G$ .

For example, both the permanent and the determinant are polynomials in a matrix of variables $X=\{x_{ij}\mid 1\leq i,j\leq n\}$ . Let $G$ be the group $\text{{Sym}}_{n}$ acting on $X$ by the action whereby $\pi\in G$ takes $x_{ij}$ to $x_{\pi(i)\pi(j)}$ . We call this the square symmetric action. It corresponds to arbitrary permutations applied simultaneously to the rows and columns of the matrix. We show in [11] that there are polynomial-size $G$ -symmetric circuits computing the determinant, but any family of $G$ -symmetric circuits for computing the permanent has exponential size. Both results are established for any field of characteristic zero.

The choice of the group action $G$ in these results is natural, but certainly not the only possibility. The lower bound immediately applies to any larger group of symmetries of the polynomial as well. Consider, for instance the action of the group $\text{{Sym}}_{n}\times\text{{Sym}}_{n}$ on $X$ whereby $(\pi,\sigma)$ takes $x_{ij}$ to $x_{\pi(i)\sigma(j)}$ . We call this the matrix symmetric action. It corresponds to independent permutations applied to the rows and columns. The permanent is invariant under this action and the exponential lower bound for square-symmetric circuits for the permanent applies a fortiori to matrix-symmetric circuits as well. As it happens, in this case we can strengthen the lower bound, if not in terms of size at least in terms of the range of application. That is, the exponential lower bound has been shown not only for circuits computing the permanent in fields of characteristic zero, but over all fields of characteristic other than two. This exponential lower bound matches known upper bounds as the smallest circuits for computing permanents, those based on Ryser’s formula, are in fact $\text{{Sym}}_{n}\times\text{{Sym}}_{n}$ -symmetric.

In the case of the determinant, it was left open whether the polynomial upper bound obtained for square symmetric circuits could be improved by requiring larger groups of symmetries on the circuits. The most efficient algorithms for computing the determinant (based on Gaussian elimination) are not square symmetric and the polynomial upper bound is obtained by an application of Le Verrier’s method [11]. It is a natural question to ask how much more stringent a symmetry requirement we can impose and still find efficient algorithms. The determinant is not matrix-symmetric like the permanent is. Let us write $\mathbf{D}_{n}$ for the group of permutations of $X=\{x_{ij}\mid 1\leq i,j\leq n\}$ which fix the determinant. This can be seen to be $D_{n}\ltimes T$ where $D_{n}$ is the subgroup of $\text{{Sym}}_{n}\times\text{{Sym}}_{n}$ of index $2$ consisting of pairs $(\sigma,\pi)$ of permutations with $\text{sgn}(\sigma)=\text{sgn}(\pi)$ and $T\cong\mathbb{Z}_{2}$ represents the transposition of rows and columns. We prove in the present paper that any family of $\mathbf{D}_{n}$ -symmetric circuits computing the determinant must have exponential size. Indeed, our lower bound is proved even for the subgroup of $D_{n}$ given by $\text{{Alt}}_{n}\times\text{{Alt}}_{n}$ .

Proving this lower bound requires substantially different methods than those of other results, and developing these methods is a central thrust of this paper. The exponential lower bound for square-symmetric circuits for the permanent is established in [11] by proving a lower bound on the orbit size of Boolean circuits computing the permanent of a $\{0,1\}$ -matrix. Here, the orbit size of a circuit $C$ is the maximal size of an orbit of a gate of $C$ under the action of the automorphism group of $C$ . This lower bound is proved using a connection between the orbit size of circuits computing a graph parameter and the counting width of the parameter as established in [1]. To be precise, it is shown that if a graph parameter has linear counting width, i.e. it distinguishes graphs on $n$ vertices which are not distinguished by $\Omega(n)$ -dimensional Weisfeiler-Leman equivalences, then it cannot be computed by symmetric circuits of subexponential orbit size. The Weisfeiler-Leman equivalences are well-studied approximations of the graph isomorphism relation, graded by dimension (see [15, Section 3.5] for an introduction). The equivalences have many equivalent characterizations arising in combinatorics, algebra, logic and linear optimization. The term counting width comes from the connection with counting logic (see [8]). The main technical ingredient in the lower bound proof in [11] is then a proof of a linear lower bound on the counting width of the number of perfect matchings in a bipartite graph.

We were able to rely on lower bounds on the counting width of graph parameters because a $\text{{Sym}}_{n}$ -invariant parameter of a $\{0,1\}$ -matrix can be seen as a graph parameter. That is, a graph parameter that does not distinguish between isomorphic graphs is necessarily $\text{{Sym}}_{n}$ -invariant on the adjacency matrices of graphs. Similarly, the $\text{{Sym}}_{n}\times\text{{Sym}}_{n}$ action on a matrix can be understood as the natural invariance condition of the biadjacency matrix of a bipartite graph. On the other hand, there seems to be no natural graph structure giving rise to an $\text{{Alt}}_{n}\times\text{{Alt}}_{n}$ -invariance requirement on the matrices. For this reason, we develop here both a general framework for presenting and studying symmetric circuits and new methods for proving lower bounds under some of these symmetry assumptions.

The generality of this framework allows us to consider a variety of different symmetry conditions, providing both a broad vocabulary for working with these circuits and game-based characterizations of the expressive power of these various symmetric models. This opens up the possibility of studying symmetry as a resource. Our results suggest a spectrum of symmetry restrictions and it would be interesting to establish exactly where on this spectrum the boundary of efficient algorithms for the determinant lies. Similar questions can be asked about other polynomials which admit efficient evaluation. For the permanent, the natural question is how much can we relax the symmetry conditions and still prove lower bounds. This is all to say that, quite apart from the main result, we regard the framework developed here as a contribution in its own right, which lays out a landscape to explore symmetry as a resource in this area.

Table 1 summarizes what is currently known about the power of various symmetric circuit models computing the permanent and determinant. The first column is for unrestricted circuits, i.e. those symmetric under the trivial group action. The upper bound for the determinant is by an adapted Gaussian elimination algorithm [5] and the upper bound for the permanent, which in fact holds for every group in the table, is by Ryser’s formula [24]. The lower bound in the first column is the trivial one. The second and fourth column state results established in [11]. There is no result for the determinant in the fourth column as the determinant is not invariant under the $\text{{Sym}}_{n}\times\text{{Sym}}_{n}$ action. The results in the column for $\text{{Alt}}_{n}\times\text{{Alt}}_{n}$ are new to this paper. In the last two columns $\mathbf{D}_{n}$ and $\mathbf{P}_{n}$ represent the full invariance groups (as subgroups of $\text{{Sym}}_{n\times n}$ ) of the determinant and permanent respectively. The lower bounds stated in those columns follow from the ones obtained for their subgroups $\text{{Alt}}_{n}\times\text{{Alt}}_{n}$ and $\text{{Sym}}_{n}\times\text{{Sym}}_{n}$ respectively.

A more detailed discussion of some of the main innovations follows.

G

\{\text{id}\}

\text{{Sym}}_{n}

\text{{Alt}}_{n}\times\text{{Alt}}_{n}

\text{{Sym}}_{n}\times\text{{Sym}}_{n}

\mathbf{D}_{n}

\mathbf{P}_{n}

Det

O(n^{3})

O(n^{4})

(char 0)

2^{\Omega(n)}

(char 0)

N/A

2^{\Omega(n)}

(char 0)

N/A

Perm

O(n^{2}2^{n})

\Omega(n^{2})

2^{\Omega(n)}

(char 0)

2^{\Omega(n)}

(char $\neq 2$ )

2^{\Omega(n)}

(char $\neq 2$ )

2^{\Omega(n)}

(char $\neq 2$ )

2^{\Omega(n)}

(char $\neq 2$ )

Table 1: Table of upper and lower bounds for

G

-symmetric circuits for the determinant and permanent, for various group actions

G

. Here

\{\text{id}\}

denotes the trivial group, and thus the column gives upper and lower bound for general circuits.

1.

The lower bounds on counting width of graph parameters are proved using the method of bijection games applied to graphs based on a construction due to Cai, Fürer and Immerman [6] (we refer to this class of graph constructions as the CFI construction). The games are parameterized by a natural number $k$ and are used to establish the indistinguishability of graphs by means of $k$ -dimensional Weisfeiler-Leman equivalences. We adapt the bijection games and show that they can be directly used to obtain lower bounds for symmetric circuits without reference to graphs or Weisfeiler-Leman equivalences. The parameter $k$ is related to a parameter of the circuits we call support size. This is based on an idea outlined in [7] which we elaborate further.
2.

The support theorem established in [1] and stengthened in [11] is a key tool, which we extend further. It shows that small symmetric circuits have small support size. This was proved for symmetry under the action of the symmetric group originally and we extend the range of group actions for which it can be shown to hold.
3.

The original $k$ -pebble bijection games of Hella [17] are two-player games played on a pair of graphs (or more generally relational structures) between two players called Spoiler and Duplicator. In each move of the game, Duplicator is required to provide a bijection between the two graphs. A winning strategy for Duplicator shows that the two graphs are not distinguishable in the $k$ -dimensional Weisfeiler-Leman equivalences. We refine the games by restricting Duplicator to play bijections from a restricted set of bijections. This set is obtained by composing an initial bijection with permutations from some group $G$ . In this game, even an isormophism does not guarantee a Duplicator winning strategy. But, we are able to relate Duplicator winning strategies to indistinguishability by $G$ -symmetric circuits taking the adjacency matrices as input. At the same time, we generalize the game so it can be played on a much more general notion of structured input rather than a relational structure.
4.

A fourth key ingredient is the construction of matrices with distinct determinants on which we can show Duplicator winning strategies in the $\text{{Alt}}_{n}\times\text{{Alt}}_{n}$ -restricted bijection game. The two matrices are obtained as biadjacency matrices of a single bipartite graph given by the CFI construction. The two matrices differ only through the interchange of two columns. The challenge here is to show that they have non-zero determinant.
5.

An important element of the construction of the matrices in (4) are bipartite $3$ -regular graphs with large tree-width and an odd number of perfect matchings. We show the existence of an infinite family of such graphs in Section 5.2. This may be of independent interest. A strengthening of the conditions on the number of perfect matchings could be used to extend our lower bound to fields of positive characteristic other than two. We expand on this in Section 5.4.

Our work may be compared to that of Landsberg and Ressayre [22] who establish an exponential lower bound on the complexity of the permanent (specifically over the complex field $\mathbb{C}$ ) under an assumption of symmetry. Their lower bound is for equivariant determinantal representations of the permanent, that is those that preserve all the symmetries of the permanent function. This approach doesn’t yield any lower bounds for symmetric circuits in the sense we consider here. On the other hand, lower bounds for equivariant determinantal complexity can be derived from the results in [11], albeit not ones as strong as in [22]. For a more detailed comparison of the two approaches see the full version of [11].

This paper is organized as follows. We begin with introducing the notation we need in Section 2. Symmetric circuits working on arbitrary structured inputs are defined in Section 3. The support theorem we need is proved in Section 4.2. The bijection games are defined in Section 4.1 where we also prove that Duplicator winning strategies imply indistinguishability by small circuits. The main construction establishing the lower bound for the determinant is presented in Section 5. The corresponding lower bound for the permanent is given in Section 6. The proof for the last result is only sketched as we only need to note that the proof from [11] can be easily adapted to the new games we define to tighten the symmetry result from $\text{{Sym}}_{n}\times\text{{Sym}}_{n}$ to $\text{{Alt}}_{n}\times\text{{Alt}}_{n}$ .

2 Background

In this section we discuss relevant background and introduce notation.

We write $\mathbb{N}$ for the positive integers and $\mathbb{N}_{0}$ for the non-negative integers. For $m\in\mathbb{N}_{0}$ , $[m]$ denotes the set $\{1,\ldots,m\}$ . For a set $X$ we write $\mathcal{P}(X)$ to denote the powerset of $X$ . We write $\operatorname{id}$ to denote the identity function on some specified set. For $f:X\rightarrow Y$ and $S\subseteq X$ we write $f(S)$ to denote the image of $S$ . Let $\mathrm{Bij}(A,B)$ denote the set of bijections from $A$ to $B$ .

2.1 Groups

Let $\text{{Sym}}_{A}$ and $\text{{Alt}}_{A}$ denote the symmetric group and alternating group on the set $A$ . We write $\text{{Sym}}_{n}$ and $\text{{Alt}}_{n}$ to abbreviate $\text{{Sym}}_{[n]}$ and $\text{{Alt}}_{[n]}$ , respectively. Let $\{\operatorname{id}\}$ denote the trivial group. For groups $G$ and $H$ we write $H\leq G$ to denote that $H$ is a subgroup of $G$ . The sign of a permutation $\sigma\in\text{{Sym}}_{A}$ is defined so that if $\sigma$ is even $\text{sgn}(\sigma)=1$ and otherwise $\text{sgn}(\sigma)=-1$ .

Let $G$ be a group acting on a set $X$ . We denote this as a left action, i.e. $\sigma x$ for $\sigma\in G$ , $x\in X$ . The action extends in a natural way to powers of $X$ . So, for $(x,y)\in X\times X$ , $\sigma(x,y)=(\sigma x,\sigma y)$ . It also extends to the powerset of $X$ and functions on $X$ as follows. The action of $G$ on $\mathcal{P}(X)$ is defined for $\sigma\in G$ and $S\in\mathcal{P}(X)$ by $\sigma S=\{\sigma x\mid x\in S\}$ . For $Y$ any set, the action of $G$ on $Y^{X}$ is defined for $\sigma\in G$ and $f\in Y^{X}$ by $(\sigma f)(x)=f(\sigma x)$ for all $x\in X$ . We refer to all of these as the natural action of $G$ on the relevant set.

A set $X$ with the action of group $G$ on it is called a $G$ -set. We do not distinguish notationally between a $G$ -set $X$ and the underlying set of elements. Thus, we can say that if $X$ is a $G$ -set, the collection of functions $Y^{X}$ is also a $G$ -set with the natural action.

Let $X=\prod_{i\in I}X_{i}$ and for each $i\in I$ let $G_{i}$ be a group acting on $X_{i}$ . The action of the direct product $G:=\prod_{i\in I}G_{i}$ on $X$ is defined for $x=(x_{i})_{i\in I}\in X$ and $\sigma=(\sigma_{i})_{i\in I}\in G$ by $\sigma x=(\sigma_{i}x_{i})_{i\in I}$ . If instead $X=\biguplus_{i\in I}X_{i}$ then the action of $G$ on $X$ is defined for $x\in X$ and $\sigma=(\sigma_{i})_{i\in I}\in G$ such that if $x\in X_{i}$ then $\sigma x=\sigma_{i}x$ . Again, we refer to either of these as the natural action of $G$ on $X$ .

Let $X$ be a $G$ -set. Let $S\subseteq X$ . Let $\text{{Stab}}_{G}(S):=\{\sigma\in G\mid\forall x\in S\,\,\sigma x=x\}$ denote the pointwise stabilizer of $S$ . Let $\text{{SetStab}}_{G}(S):=\{\sigma\in G\mid\sigma(S)=S\}$ denote the setwise stabilizer of $S$ . If $S=\{x\}$ is a singleton we omit set braces and write $\text{{Stab}}_{G}(x)$ . Note that $\text{{SetStab}}_{G}(S)$ is the pointwise stabilizer of $\{S\}$ in the $G$ -set $\mathcal{P}(X)$ . For $x\in X$ let $\text{{Orb}}_{G}(x):=\{\sigma(x)\mid\sigma\in G\}$ denote the orbit of $x$ . In all cases, we omit the subscript $G$ where it is clear from context.

2.2 Matrices

Let $A$ and $B$ be finite non-empty sets. We identify matrices with rows indexed by $A$ and columns by $B$ and entries from some set $X$ with functions of the form $M:A\times B\rightarrow X$ . So for $a\in A$ , $b\in B$ , $M_{ab}=M(a,b)$ . We also denote $M$ by $(M_{ab})_{a\in A,b\in B}$ , or just $(M_{ab})$ when the index sets are clear from context.

Let $R$ be a commutative ring and $M:A\times B\rightarrow R$ be a matrix with $|A|=|B|$ . The permanent of $M$ over $R$ is $\text{perm}_{R}(M)=\sum_{\sigma\in\mathrm{Bij}(A,B)}\prod_{a\in A}M_{a\sigma(a)}$ . Suppose $A=B$ . The determinant of $M$ over $R$ is $\det_{R}(M)=\sum_{\sigma\in\text{{Sym}}_{A}}\text{sgn}(\sigma)\prod_{a\in A}M_{a\sigma(a)}$ . The trace of $M$ over $R$ is $\operatorname{Tr}_{R}(M)=\sum_{a\in A}M_{aa}$ . We omit reference to the ring when it is obvious from context. When $R$ is a field, we write $\text{{rk}}(M)$ to denote the rank of the matrix $M$ .

We always use $\mathbb{F}$ to denote a field and $\text{char}(\mathbb{F})$ to denote the characteristic of $\mathbb{F}$ . For any prime power $q$ we write $\mathbb{F}_{q}$ for the finite field of order $q$ . We are often interested in polynomials and circuits defined over a set of variables $X$ with a natural matrix structure, i.e. $X=\{x_{ab}:a\in A,b\in B\}$ . We identify $X$ with this matrix. We also identify any function of the form $f:X\rightarrow Y$ with the $A\times B$ matrix with entries in $Y$ defined by replacing each $x_{ab}$ with $f(x_{ab})$ .

2.3 Graphs

Given a graph $\Gamma=(V,E)$ , the adjacency matrix $A_{\Gamma}$ of $\Gamma$ is the $V\times V$ $\{0,1\}$ -matrix with $A_{\Gamma}(u,v)=1$ if, and only if, $\{u,v\}\in E$ . If $\Gamma$ is bipartite, with bipartition $V=U\cup W$ , then the biadjacency matrix $B_{\Gamma}$ of $\Gamma$ is the $U\times W$ $\{0,1\}$ -matrix with $B_{\Gamma}(u,v)=1$ if, and only if, $\{u,v\}\in E$ . For a set $U\subseteq V$ we write $\Gamma[U]$ to denote the subgraph induced by $U$ .

A $k$ -factor of a graph $\Gamma$ is a spanning $k$ -regular subgraph. A perfect matching is a $1$ -factor. It is well known that for a bipartite graph $\Gamma$ , $\text{perm}(B_{\Gamma})$ over any field of characteristic zero counts the number of perfect matchings in $\Gamma$ [16] and for prime $p$ , $\text{perm}_{\mathbb{F}}(B_{\Gamma})$ for a field $\mathbb{F}$ of characteristic $p$ counts the number of perfect matchings in $\Gamma$ modulo $p$ . For bipartite $\Gamma$ , $A_{\Gamma}$ is a block anti-diagonal matrix with two blocks corresponding to $B_{\Gamma}$ and $B_{\Gamma}^{T}$ , and so $\text{perm}(A_{\Gamma})=\text{perm}(B_{\Gamma})^{2}$ .

Let $\Gamma=(V,E)$ be a graph. For $S\subseteq V$ let $N^{+}(S):=\{v\in V\setminus S\mid\exists s\in S,\,\,(v,s)\in E\}$ . We say $\Gamma$ is an $\alpha$ -expander for $\alpha\in[0,1]$ if for every $S\subseteq V$ of size at most $|V|/2$ we have $|N^{+}(S)|\geq\alpha|S|$ . For an introduction to expander graphs see [19]. A set $S$ of vertices in a graph $\Gamma$ is a balanced separator if no connected component of $\Gamma\setminus S$ contains more than half the vertices of $\Gamma$ . It is easy to see that for any $\alpha$ there is a constant $\tau$ such that if $\Gamma$ is an $\alpha$ -expander then $\Gamma$ has no balanced separator of size less than $\tau|V|$ .

The tree width of a graph is a well-known graph parameter which can be characterized in terms of the cops and robbers game [25]. This game is played on a graph $\Gamma=(V,E)$ , with one player controlling the single robber and another a team of $k$ cops. The cops and robbers both sit on nodes in the graph. At the start of a round the cops are sitting on nodes $X\subset V$ and the round begins with the cop player announcing he intends to move cops from $X^{\prime}\subseteq X$ to occupy the nodes $Y$ . The robber player must respond with a path from the robbers current location that does not intersect $X\setminus X^{\prime}$ . At the end of the round the cops and robber are moved to their new locations, and if any cop is on the same node as the robber the cops win. The characterization is as follows.

Theorem 2.1 ([25]).

There is a winning strategy for the cop player with $k$ cops if, and only if, the tree-width of $\Gamma$ is at most $k-1$ .

2.4 Circuits

We give a general definition of a circuit that incorporates both Boolean and arithmetic circuits.

Definition 2.2 (Circuit).

A circuit over the basis $\mathbb{B}$ with variables $X$ and values $K$ is a directed acyclic graph with a labelling where each vertex of in-degree $0$ is labelled by an element of $X\cup K$ and each vertex of in-degree greater than $0$ is labelled by an element of $\mathbb{B}$ .

Let $C$ be circuit that is the labelling of a directed graph $(D,W)$ , where $W\subset D\times D$ with values $K$ . We call the elements of $D$ gates, and the elements of $W$ wires. We call the gates with in-degree $0$ input gates and gates with out-degree $0$ output gates. We call those input gates labelled by elements of $K$ constant gates. We call those gates that are not input gates internal gates. For $g,h\in D$ we say that $h$ is a child of $g$ if $(h,g)\in W$ . We write $\text{child}(g)$ to denote the set of children of $g$ . We write $C_{g}$ to denote the sub-circuit of $C$ rooted at $g$ , i.e the sub-circuit induced by those gates with a directed path to $g$ . Unless otherwise stated we always assume a circuit has exactly one output gate.

If $K$ is a field $\mathbb{F}$ , and $\mathbb{B}$ is the set $\{+,\times\}$ , we have an arithmetic circuit over $\mathbb{F}$ . If $K=\{0,1\}$ , and $\mathbb{B}$ is a collection of Boolean functions, we have a Boolean circuit over the basis $\mathbb{B}$ . We define two bases here. The first is the standard basis $\mathbb{B}_{\text{std}}$ containing the functions $\land$ , $\lor$ , and $\neg$ . The second is the threshold basis $\mathbb{B}_{t}$ which is the union of $\mathbb{B}_{\text{std}}$ and $\{t_{\geq k}:k\in\mathbb{N}\}$ , where for each $k\in\mathbb{N}$ , $t_{\geq k}$ is defined for a string $\vec{x}\in\{0,1\}^{*}$ so that $t_{\geq k}(\vec{x})=1$ if, and only if, the number of $1$ s in $\vec{x}$ at least $k$ . We call a circuit defined over this basis a threshold circuit. Another useful Boolean function is $t_{=k}$ , which is defined by $t_{=k}(x)=t_{\leq k}(x)\land\neg t_{\leq k+1}(x)$ . We do not explicitly include it in the basis as it is easily defined in $\mathbb{B}_{t}$ .

In general, we require that a basis contain only functions that are invariant under all permutations of their inputs. That is, if $f\in\mathbb{B}$ is such that $f:K^{n}\rightarrow K$ for some $n\in\mathbb{N}$ then for all $\sigma\in\text{{Sym}}_{n}$ and $M\in K^{n}$ we have $f(M\circ\sigma)=f(M)$ . We call such a function $f$ fully symmetric. The arithmetic functions $+$ and $\times$ and all of the Boolean functions in $\mathbb{B}_{t}$ and $\mathbb{B}_{\text{std}}$ are fully symmetric. Let $C$ be a circuit defined over such a basis with variables $X$ and values $K$ . We evaluate $C$ for an assignment $M\in K^{X}$ by evaluating each gate labelled by some $x\in X$ to $M(x)$ and each gate labelled by some $k\in K$ to $k$ , and then recursively evaluating each gate $g$ according to its corresponding basis element. That is $g$ gets the value obtained by applying the function labelling $g$ to the set of values of $\text{child}(g)$ . We write $C[M](g)$ to denote the value of the gate $g$ and $C[M]$ to denote the value of the output gate. We say that $C$ computes the function $M\mapsto C[M]$ .

It is conventional to consider an arithmetic circuit $C$ over $\mathbb{F}$ with variables $X$ to be computing a polynomial in $\mathbb{F}[X]$ , rather than a function $\mathbb{F}^{X}\rightarrow\mathbb{F}$ . This polynomial is defined via a similar recursive evaluation, except that now each gate labelled by a variable evaluates to the corresponding formal variable, and we treat addition and multiplication as ring operations in $\mathbb{F}[X]$ . Each gate then evaluates to some polynomial in $\mathbb{F}[X]$ . The polynomial computed by $C$ is the value of the output gate.

For more details on arithmetic circuits see [26] and for Boolean circuits see [27].

3 Symmetric Circuits

Complexity theory is often concerned with computation models that take as input binary strings. In practice these strings are almost always taken to encode some structured input (e.g. graphs, matrices, numbers, etc.). In order to study the symmetries that arise from these structures we forgo this encoding. More precisely, we consider computation models such as circuits whose inputs are themselves functions of type $K^{X}$ where we think of $X$ as a set of variables and $K$ a domain of values that the variables can take. The set $X$ may have some further structure to reflect the intended structured input, but no more. In particular, we do not assume that $X$ is linearly ordered. For example, if $X=V^{2}$ and $K=\{0,1\}$ then the elements of $\{0,1\}^{X}$ may be naturally interpreted as directed graphs over the vertex set $V$ . Or, with the same $X$ , if we let $K=\mathbb{F}$ , we can think of the elements of $K^{X}$ as matrices over $\mathbb{F}$ with rows and columns indexed by $V$ .

The symmetries of interest for a given class of structures correspond to group actions on $X$ , which lift to actions on $K^{X}$ . In this section we introduce the definitions of $G$ -symmetric functions, i.e. functions which are invariant under the action of $G$ on its input, and $G$ -symmetric circuits, which are circuits computing $G$ -symmetric functions but where the structure of the circuit itself and not just the output is invariant under the action of $G$ . The definitions are variations and generalizations of those from [11]. We illustrate them with examples.

3.1 Group Actions and Symmetric Functions

Definition 3.1.

For a group $G$ acting on the domain of a function $F$ we say $F$ is $G$ -symmetric if for every $\sigma\in G$ , $\sigma F=F$ . We omit mention of the group when it is obvious from context.

We are interested in functions of type $K^{X}\rightarrow K$ with some group $G$ acting on $X$ , which then induces an action on $K^{X}$ . We think of elements of $K^{X}$ as “generalized structures”, and define notions of homomorphism and isomorphism below. We first consider some examples. Note that whenever $H$ is a subgroup of $G$ , then any $G$ -symmetric function is also $H$ -symmetric. In particular every function is $\{\operatorname{id}\}$ -symmetric.

Example 3.2.

1.

For a vertex set $V$ , a function $F:\{0,1\}^{V^{2}}\rightarrow\{0,1\}$ defines a property of (directed) graphs if it is $\text{{Sym}}_{V}$ -symmetric, with the action of $\text{{Sym}}_{V}$ being defined simultaneously on both coordinates. We get properties of simple undirected graphs by considering $\text{{Sym}}_{V}$ -symmetric functions $F:\{0,1\}^{X}\rightarrow\{0,1\}$ where $X={V\choose 2}$ is the collection of $2$ -element subsets of $V$ . Examples of graph properties include connectedness, Hamiltonicity and the existence of a perfect matching. A graph parameter is a function $F:\{0,1\}^{V^{2}}\rightarrow\mathbb{R}$ which is $\text{{Sym}}_{V}$ -symmetric. Examples include the number of connected components, the number of Hamiltonian cycles and the number of perfect matchings.
2.

The elementary symmetric polynomial of degree $k$ in the set of variables $X$ is the polynomial:

$e_{k}(X)=\sum_{S\in{X\choose k}}\prod_{x\in S}x.$

For any field $\mathbb{F}$ , $e_{k}(X)$ defines a function $e_{k}^{\mathbb{F}}:\mathbb{F}^{X}\rightarrow\mathbb{F}$ which is $\text{{Sym}}_{X}$ -symmetric.
3.

If $X=\{x_{ij}\mid 1\leq i,j\leq n\}$ is a matrix of variables, then the trace $\text{tr}(X)$ , determinant $\det(X)$ and permanent $\text{perm}(X)$ are polynomials that define, for any field $\mathbb{F}$ , functions of type $\mathbb{F}^{X}$ which are $\text{{Sym}}_{n}$ -symmetric where the group action is defined simultaneously on both coordinates.
4.

The permanent is invariant under separate row/column permutations. In other words, for $(\pi,\sigma)\in\text{{Sym}}_{n}\times\text{{Sym}}_{n}$ , and matrix $M\in\mathbb{F}^{n\times n}$ , we have $\text{perm}((M_{xy}))=\text{perm}((M_{\pi(x)\sigma(y)}))$ . So the permanent for $n\times n$ -matrices is a $\text{{Sym}}_{n}\times\text{{Sym}}_{n}$ -symmetric function.
5.

The trace and determinant are not $\text{{Sym}}_{n}\times\text{{Sym}}_{n}$ -symmetric under the above action, but the symmetry group of the determinant is richer than just $\text{{Sym}}_{n}$ . Define $D_{n}\leq\text{{Sym}}_{n}\times\text{{Sym}}_{n}$ to be the group $\{(\sigma,\pi)\mid\text{sgn}(\sigma)=\text{sgn}(\pi)\}$ . This is a subgroup of $\text{{Sym}}_{n}\times\text{{Sym}}_{n}$ of index $2$ . The determinant is $D_{n}$ -symmetric and so, in particular $\text{{Alt}}_{n}\times\text{{Alt}}_{n}$ -symmetric, since $\text{{Alt}}_{n}\times\text{{Alt}}_{n}\leq D_{n}$ .

Let $G$ and $H$ be groups. A homomorphism from the $G$ -set $X$ to $H$ -set $Y$ is a pair of functions $(f,\phi)$ where $f:X\rightarrow Y$ is a function and $\phi:G\rightarrow H$ is a group homomorphism such that for each $x\in X$ and $\pi\in G$ , $f(\pi x)=\phi(\pi)f(x)$ . We say $(f,\phi)$ is an isomorphism if both $f$ and $\phi$ are bijective. We abuse terminology and refer to a bijection $f:X\rightarrow Y$ as an isomorphism if there exists $\phi:G\rightarrow H$ such that $(f,\phi)$ is an isomorphism.

Let $M:X\rightarrow K$ with $X$ a $G$ -set and let $N:Y\rightarrow K$ with $Y$ an $H$ -set. A homomorphism from $(M,G)$ to $(N,H)$ is a homomorphism $(f,\phi):(X,G)\rightarrow(Y,H)$ such that $N\circ f=M$ . It is an isomorphism from $(M,G)$ to $(N,H)$ if both $f$ and $\phi$ are bijective. We omit mention of the groups and refer just to a homomorphisms or isomorphisms from $M$ to $N$ when the groups are clear from context (see Section 4.1.

Let $G$ be a group acting on $X$ and $F:K^{X}\rightarrow L$ . We are usually interested in such functions up to isomorphism. Let $M:Y\rightarrow K$ and $H$ be a group acting on $Y$ such that $(X,G)$ and $(Y,H)$ are isomorphic. Then we abuse notation and sometimes write $F(M)$ to denote $F(M\circ f)$ for some isomorphism $f:X\rightarrow Y$ .

3.2 Symmetric Circuits

We next define the notion of a symmetric circuit as it appears in [11]. These circuits take as input functions of the form $M:X\rightarrow K$ , where $X$ is a $G$ -set, and are symmetric in the sense that the computation itself, not just the value of the output, remains unchanged under the action of $G$ . We first need to formalize what it means for a permutation in $G$ to act on the gates of a circuit, and for this we define the notion of a circuit automorphism extending a permutation.

Definition 3.3 (Circuit Automorphism).

Let $C=(D,W)$ be a circuit over the basis $\mathbb{B}$ with variables $X$ and values $K$ . For $\sigma\in\text{{Sym}}_{X}$ , we say that a bijection $\pi:D\rightarrow D$ is an automorphism extending $\sigma$ if for every gate $g$ in $D$ we have that

•

if $g$ is a constant gate then $\pi(g)=g$ ,
•

if $g$ is a non-constant input gate then $\pi(g)=\sigma(g)$ ,
•

if $(h,g)\in W$ is a wire, then so is $(\pi h,\pi g)$
•

if $g$ is labelled by $b\in\mathbb{B}$ , then so is $\pi g$ .

We say that a circuit $C$ with variables $X$ is rigid if for every permutation $\sigma\in\text{{Sym}}(X)$ there is at most one automorphism of $C$ extending $\sigma$ . The argument used to prove [10, Lemma 5.5] suffices to show that any symmetric circuit may be transformed into an equivalent rigid one in time polynomial in the size of the circuit. As such, when proving lower bounds we often assume the circuit is rigid without a loss of generality.

We are now ready to define the key notion of a symmetric circuit.

Definition 3.4 (Symmetric Circuit).

Let $G$ be a group acting on a set $X$ and $C$ be a circuit with variables $X$ . We say $C$ is a $G$ -symmetric circuit if for every $\sigma\in G$ the action of $\sigma$ on $X$ extends to an automorphism of $C$ .

The following can be shown via a straightforward induction.

Proposition 3.5.

Let $C$ be a $G$ -symmetric circuit. Then $C$ defines a $G$ -symmetric function.

4 Games and Supports

Hella’s bijection game [17] is a two-player game played on relational structures, such as graphs. It was defined to demonstrate indistinguishability of structures in logics with counting and extensions thereof. The indistinguishability relations it defines on graphs are closely tied to Weisfeiler-Leman equivalences [6]. The games are played on a pair of structures $A$ and $B$ by two players called Spoiler and Duplicator. Spoiler aims to show that the two structures are different while Duplicator pretends they are the same. We associate with each structure a sequence of $k$ pebbles which are placed in the course of the game on elements of the two structures. The game proceeds in a sequence of rounds. The number of rounds can be greater than $k$ and so pebbles can be moved from one element to another during the course of the game. At each round, Spoiler chooses a pebble $p_{i}$ ( $i\in[k]$ ) from $A$ and the matching pebble $q_{i}$ from $B$ . Duplicator provides a bijection $h$ between the two structures. Spoiler then chooses an element $a$ of $A$ on which to place $p_{i}$ and $q_{i}$ is placed on $h(a)$ . In each round, Duplicator must ensure that the partial map between the two structures defined by the placement of the pebbles is a partial isomorphism, otherwise Spoiler wins. For more details on this game see [15].

The connection between bijection games and symmetric circuits is first made in [1] which showed a connection between the expressive power of counting logics and symmetric Boolean circuits in the threshold basis $\mathbb{B}_{t}$ . This leads to the suggestion (made in [7]) of using bijection games as a tool directly to prove circuit lower bounds. The main contribution of this section is the generalization of these bijection games in two different directions, as well as results establishing how this tool can be used to prove more general circuit lower bounds.

We noted earlier that we may think of functions on $G$ -sets as some sort of generalised structure. The development of the theory of bijection games (and supports) requires us to further restrict our attention to the case where $G$ is a subgroup of some symmetric group. The domain of the symmetric group corresponds in some sense to the universe of the structure in question.

We also develop in Section 4.2 a theory of supports. The support of a gate $g$ in a $G$ -symmetric circuit for $G\leq\text{{Sym}}_{A}$ is a subset of $A$ which determines both the evaluation of $g$ and its orbit. We establish in Section 4.4 the support theorem, which connects the minimum size of these supports with the minimum size of the orbits of a gate (and so the size of the circuit). We show in Section 4.3 that if Duplicator has a winning strategy in the game with $2k$ pebbles on a pair of indexed functions then these functions cannot be distinguished by any pair of $G$ -symmetric circuits with supports of size at most $k$ . In Section 4.4 we combine this result with the support theorem to show that to prove exponential lower bounds it suffices to establish a linear lower bound on the number of pebbles needed by Spoiler.

4.1 Bijection Games on Indexed function

We now introduce our generalization of Hella’s bijection game. The generalization is in two directions. The first is that we allow the game to be played on arbitrary indexed functions, rather than just graphs or relational structures. The second is that Duplicator is restricted on which bijections it is allowed to play. Specifically, the bijection must be obtained from a fixed initial bijection by composition with a permutation from a fixed group. We recover the usual requirement in the bijection game by just taking this group to be the full symmetric group.

We begin by introducing the key notions of indexed set and function.

Definition 4.1.

Let $A$ be a set, $G\leq\text{{Sym}}_{A}$ , and $X$ be a $G$ -set. We call the triple $(X,A,G)$ an indexed set. We call a triple $(M,A,G)$ , where $M:X\rightarrow K$ is a function on $X$ , an indexed function.

All of the structures discussed so far may be identified with indexed functions. For example, we can identify a (directed) graph $(V,E^{2})$ with the indexed function $(E:V^{2}\rightarrow\{0,1\},V,\text{{Sym}}_{V})$ . Similarly, a bipartitioned graph $(V,U,E)$ can be identified with the indexed function $(E:U\times V\rightarrow\{0,1\},U\uplus V,\text{{Sym}}_{U}\times\text{{Sym}}_{V})$ .

We now want to generalize the notion of isomorphism between structures to an equivalence notion on indexed functions, which is suitably restricted by the permutation group $G$ . Suppose $A$ and $B$ are index sets and $f:A\rightarrow B$ is a bijection. For a group $G\leq\text{{Sym}}_{A}$ , let $H\leq\text{{Sym}}_{B}$ be the group of permutations $\{f\pi f^{-1}\mid\pi\in G\}$ . Let $\mathcal{M}=(M:X\rightarrow K,A,G)$ and $\mathcal{N}=(M:Y\rightarrow K,B,H)$ be a pair of indexed functions and $\hat{f}:X\rightarrow Y$ be a bijection which lifts the bijection $f$ in the sense that $\hat{f}(\pi x)=(f\pi f^{-1})\hat{f}(x)$ for all $\pi\in G$ . We say that the indexed functions $\mathcal{M}$ and $\mathcal{N}$ are $(f,\hat{f})$ -isomorophic, if there is a permutation $\pi\in H$ such that for all $x\in X$ , $M(x)=N(\pi\hat{f}(x))$ .

In particular, for a pair of directed graphs $(V_{1},E_{1})$ and $(V_{2},E_{2})$ , if $f$ is any bijection between $V_{1}$ and $V_{2}$ , and $\hat{f}$ is its natural lift to a bijection between $V_{1}^{2}$ and $V_{2}^{2}$ (i.e. $\hat{f}(u,v)=(f(u),f(v))$ , then the pair of indexed functions $(E_{1}:V_{1}^{2}\rightarrow\{0,1\},V_{1},\text{{Sym}}_{V_{1}})$ and $(E_{2}:V_{2}^{2}\rightarrow\{0,1\},V_{2},\text{{Sym}}_{V_{2}})$ are $(f,\hat{f})$ -isomorphic if, and only if, $(V_{1},E_{1})$ and $(V_{2},E_{2})$ are isomorphic in the usual sense. Similar statements hold for bipartitioned graphs and other relational structures. As a further example, we can identify $n\times n$ matrices over a field $\mathbb{F}$ with structured functions of the form $(M:[n]\times[n]\rightarrow\mathbb{F},[n]\uplus[n],G)$ , for some group $G$ . Unlike the case of structures considered above, there is not a canonical choice of $G$ , and it depends on what invariants of the matrix we wish to take into account. For example, if $G=\text{{Sym}}_{n}$ then isomorphism corresponds to equivalence under simultaneous row and column permutations and if $G=\text{{Sym}}_{n}\times\text{{Sym}}_{n}$ then isomorphism corresponds to equivalence under separate row and column permutations.

Our notion of isomorphism is parameterized by a fixed bijection between the index sets $A$ and $B$ and its lift to the indexed sets $X$ and $Y$ . To simplify notation, we often simply identify the sets $A$ and $B$ , and the sets $X$ and $Y$ , so that the base bijection can be taken to be the identity. This is akin to fixing the universe of any $n$ vertex graph to be the set $[n]$ , so that isomorphisms can be identified with permutations of $[n]$ .

In order to define the bijection game, we need a notion of partial isomorphism on the set of pebbled positions in an indexed function. In order to introduce this, we first define the notion of a lift.

Definition 4.2.

Let $(X,A,G)$ be an indexed set. For $S\subseteq A$ we define the lift of $S$ to be the set $X_{S}=\{x\in X\mid\text{{Stab}}(S)\leq\text{{Stab}}(x)\}$ .

Thus, $X_{S}$ is those elements of $X$ that are fixed by any permutation that fixes $S$ pointwise. The following is now an easy observation.

Lemma 4.3.

Let $(X,A,G)$ be an indexed set. If $X_{S}$ is the lift of $S\subseteq A$ then $\text{{SetStab}}(S)\leq\text{{SetStab}}(X_{S})$ .

Proof.

Let $\sigma\in\text{{SetStab}}(S)$ , $\pi\in\text{{Stab}}(S)$ , and $s\in S$ . Then $\sigma^{-1}\pi\sigma(s)=s$ , so $\sigma^{-1}\pi\sigma\in\text{{Stab}}(S)$ . Hence, by definition of $X_{S}$ , we have $\sigma^{-1}\pi\sigma(x)=x$ for any $x\in X_{S}$ . Hence $\pi\sigma(x)=\sigma(x)$ and therefore $\pi\in\text{{Stab}}(\sigma(x))$ . Since $\pi\in\text{{Stab}}(S)$ was arbitrary, $\sigma(x)\in X_{S}$ and therefore $\sigma\in\text{{SetStab}}(X_{S})$ . ∎

We define the notion of partial isomorphism specifically for two indexed functions over the same index set. It can easily be generalized to the case with distinct index sets with a fixed bijection between them.

Definition 4.4.

Let $(M:X\rightarrow K,A,G)$ and $(N:X\rightarrow K,A,G)$ be two indexed functions, and let $S\subseteq A$ and $\pi\in G$ . We say that $\pi$ induces a partial isomorphism on $S$ if, for each $x\in X_{S}$ , we have $M(x)=N(\pi x)$ .

It is easily seen that this gives the usual notion of partial isomorphism when the indexed functions are graphs or other relational structures. With this, we are ready to define the bijection game on indexed functions. The game is played by two players Spoiler and Duplicator on a pair of indexed functions, using a set of pebbles. During the course of the game, the pebbles are placed on elements of the indexing sets. Where it does not cause confusion, we do not distinguish notationally between the pebbles and the elements on which they are placed.

Definition 4.5.

Let $\mathcal{N}=(M:X\rightarrow K,A,G)$ and $\mathcal{N}=(N:X\rightarrow K,A,G)$ be indexed functions. The $k$ -pebble $G$ -bijection game on $(M,N)$ is defined as follows. The game is played between two players called the Spoiler and Duplicator using two sequences of pebbles $a_{1},\ldots,a_{k}$ and $b_{1},\ldots,b_{k}$ . In each round of the game:

1.

Spoiler picks a pair of pebbles $a_{i}$ and $b_{i}$ ,
2.

Duplicator picks a permutation $\pi\in G$ such that for each $j\neq i$ , $\pi(a_{j})=b_{j}$ , and
3.

Spoiler chooses some $a\in A$ and places $a_{i}$ on $a$ and $b_{i}$ on $\pi(a)$ .

Spoiler has won the game at the end of the round if the permutation $\pi$ does not induce a partial isomorphism on the set $\{a_{1},\ldots,a_{k}\}$ . We say that Duplicator has a winning strategy for the $k$ -pebble $G$ -bijection game if it has a strategy to play the game forever without Spoiler winning.

In the sequel, we abbreviate “ $k$ -pebble $G$ -bijection game” to $(G,k)$ -bijection game.

We recover the ordinary bijection game on graphs by taking $A=[n]$ , $X=[n]^{2}$ and $G=\text{{Sym}}_{n}$ . It is clear in this case that if $\mathcal{M}$ and $\mathcal{N}$ are isomorphic graphs, then Duplicator has a winning strategy by choosing a fixed isomorphism $\pi$ between them and playing it at every move. However, if $G\leq\text{{Sym}}_{A}$ is a more restricted group that does not contain an isomorphism between $\mathcal{M}$ and $\mathcal{N}$ , a Duplicator winning strategy is not guaranteed even when $\mathcal{M}$ and $\mathcal{N}$ are isomorphic as graphs. This is just the case in the application in Section 5.

4.2 Supports

Lower bounds for symmetric circuits rely on the notion of a support. We define the notion formally below but it is worthwhile developing an intuition. If we have a $G$ -symmetric circuit $C$ then the function computed by $C$ is invariant under permutations in $G$ on the input. This is not the case for individual gates $g$ in $C$ other than the output gate: applying a permutation $\pi\in G$ to the inputs of the circuit might yield a different function at $g$ . But, by the symmetry condition, there is then another gate $\pi g$ which computes this other function. If the circuit is small, the orbit of $g$ is small and thus the stabilizer group of $g$ is large. That is, for many $\pi$ we have $g=\pi g$ . What the support theorem tells us is that in this case, there is a small subset $S$ (which we call a support of $g$ ) of the permutation domain of $G$ such that the function computed at $g$ only depends on $S$ . This is in the sense that permutations that only move elements of the permutation domain outside of $S$ do not change the function computed at $g$ . This support theorem can be proved as long as the group $G$ is, in a sense, large enough. We now define the notion of a support formally and prove the relationship between support size and orbit size in Section 4.3.

Definition 4.6.

Let $X$ be a $G$ -set and $H\leq G$ a subgroup of $G$ . We say that $S\subseteq X$ is a support of $H$ if $\text{{Stab}}_{G}(S)\leq H$ .

We extend this notion to indexed sets, where the support is now a subset of the indices. A key example is the action of $\text{{Sym}}_{n}$ on the set $X=[n]\times[n]$ or on the collection of matrices $M:X\rightarrow\{0,1\}$ .

Definition 4.7.

Let $(X,A,G)$ be an indexed set. We say that a set $S\subseteq A$ is a support of $x\in X$ if it is a support of $\text{{Stab}}(x)$ .

Note that $S$ is a support of $x$ just in case $x$ is in the lift of $S$ , in the sense of Definition 4.2.

Let $(X,A,G)$ be an indexed set. We write $\text{{max-orbit}}(X)$ to denote the maximum size of the orbit of $x$ over all $x\in X$ . For $x\in X$ we write $\text{{min-supp}}(x)$ to denote the minimum size of a support of $x$ and $\text{{max-supp}}(X)$ to denote the maximum of $\text{{min-supp}}(x)$ over all $x\in X$ .

We now specialise our discussion to circuits. Let $C$ be a rigid $G$ -symmetric circuit with gate set $D$ , where $G\leq\text{{Sym}}_{n}$ for some natural number $n$ . Then $(D,[n],G)$ is an indexed set. In this way we can speak of the supports and orbits of a gate. We abuse notation slightly and write $\text{{max-orbit}}(C)$ and $\text{{max-supp}}(C)$ to denote $\text{{max-orbit}}(D)$ and $\text{{max-supp}}(D)$ , respectively.

4.3 Playing Games on Circuits

We are now ready to prove the first major theorem of this section. This links the number of pebbles in a bijection game with the size of the supports. Structures in which Duplicator has a $2k$ -pebble winning strategy cannot be distinguished by symmetric circuits with supports limited to size $k$ . The statement of the theorem and argument used generalize that in [7].

Theorem 4.8.

Let $(X,A,G)$ be an indexed set and let $C$ be a $G$ -symmetric circuit with values from $K$ and variables $X$ , such that $\text{{max-supp}}(C)\leq k$ . If Duplicator has a winning strategy for the $(G,2k)$ -bijection game played on $(M,A,G)$ and $(N,A,G)$ for functions $M:X\rightarrow K$ and $N:X\rightarrow K$ then $C[M]=C[N]$ .

Proof.

We prove this result by contraposition, i.e. we assume $C[M]\neq C[N]$ and show that Spoiler has a winning strategy for the $(G,2k)$ -game on $(M,N)$ .

The strategy we define for Spoiler can be understood informally as follows. We start at the output gate $g$ and note that for any bijection $\sigma\in G$ , $C[M](g)\neq C[\sigma N](g)$ . The goal of Spoiler now over the next (at most) $k$ rounds is to pebble a support of some child $h$ of $g$ such that for Duplicator’s most recent chosen bijection $\sigma$ we have that $C[M](h)\neq C[\sigma N](h)$ . Since the bijection chosen by Duplicator can be interpreted as a permutation $\sigma\in G$ respecting the pebble configuration, and the fact that an entire support of $h$ is pebbled and so any such permutation fixes $h$ , it follows that $C[M](h)\neq C[\sigma N](h)$ for any choice of $\sigma$ . We now iterate this process, aiming next to pebble a support of an appropriate child of $h$ over the next (at most) $k$ rounds. We terminate when we reach an input gate, which witnesses that Spoiler has won the game.

So, fix for each gate $g$ of $C$ a support of size at most $k$ . We write $\text{sp}(g)$ to denote this support.

Claim 4.9.

Suppose the pebbled position is $\vec{a}$ and $\vec{b}$ . Let $g$ be a gate such that $\text{sp}(g)\subseteq\vec{a}$ and for some $\sigma$ that maps $\vec{a}$ to $\vec{b}$ we have that $C[M](g)\neq C[\sigma N](g)$ . There exists a strategy for Spoiler such that after at most $k$ rounds where the pebbled positions are now $\vec{a}^{\prime}$ and $\vec{b}^{\prime}$ there is $h\in\text{child}(g)$ such that $\text{sp}(h)\subseteq\vec{a}^{\prime}$ and for all $\sigma^{\prime}\in G$ that maps $\vec{a}^{\prime}$ to $\vec{b}^{\prime}$ we have that $C[M](h)\neq C[\sigma^{\prime}N](h)$ .

Proof.

We describe an invariant that Spoiler is able to maintain inductively. Suppose that after $i\geq 0$ rounds $c_{1},\ldots,c_{i}$ and $d_{1},\ldots,d_{i}$ have been pebbled. Let $S_{i}$ be the pointwise stabiliser of $c_{1},\ldots,c_{i}$ in $\text{{Stab}}(\vec{a})$ . Then $S_{0}=\text{{Stab}}(\vec{a})$ . Spoiler ensures there is a gate $h_{i}\in\text{child}(g)$ with $c_{1},\ldots,c_{i}\in\text{sp}(h_{i})$ and a $\alpha\in K$ such that

\displaystyle|\{h\in\text{{Orb}}_{S_{i}}(h_{i})\mid C[M](h)=\alpha\}|\neq|\{h\in\text{{Orb}}_{S_{i}}(h_{i})\mid C[\sigma N](h)=\alpha\}|

for all $\sigma\in G$ that maps $\vec{a},c_{1},\ldots,c_{i}$ to $\vec{b},d_{1},\ldots,d_{i}$ . This suffices to prove the claim since $|\text{sp}(h_{i})|\leq k$ and therefore for some $i$ , we will have $i\geq|\text{sp}(h_{i})|$ and in this case, $\text{{Orb}}_{S_{i}}(h_{i})$ is a singleton.

We first consider the case $i=0$ . We have for any $\sigma\in G$ that maps $\vec{a}$ to $\vec{b}$ that $C[M](g)\neq C[\sigma N](g)$ . Since the basis element labelling $g$ is a fully symmetric function, there must be some $\alpha\in K$ such that $|\{h\in\text{child}(g)\mid C[M]h=\alpha\}|$ is different from $|\{h\in\text{child}(g)\mid C[\sigma N]h=\alpha\}|$ . Moreover, since $S_{0}\leq\text{{Stab}}(g)\leq\text{{SetStab}}(\text{child}(g))$ , $S_{0}$ partitions $\text{child}(g)$ into orbits. It follows that for some $h\in\text{child}(g)$

\displaystyle|\{h^{\prime}\in\text{{Orb}}_{S_{0}}(h)\mid C[M](h^{\prime})=\alpha\}|\neq|\{h^{\prime}\in\text{{Orb}}_{S_{0}}(h)\mid C[\sigma N](h^{\prime})=\alpha\}|.

Now, take any $\sigma^{\prime}\in G$ that maps $\vec{a}$ to $\vec{b}$ . Let $\rho:=\sigma^{-1}\sigma^{\prime}$ . Then $\rho\in S_{0}$ , and so $\text{{Orb}}_{S_{0}}(h)=\text{{Orb}}_{S_{0}}(\rho h)$ . Moreover, for any $h^{\prime}\in\text{{Orb}}_{S_{0}}(h)$ , $C[\sigma N](h^{\prime})=C[\sigma\rho N](\rho h^{\prime})=C[\sigma^{\prime}N](\rho h^{\prime})$ , and so

\displaystyle|\{h^{\prime}\in\text{{Orb}}_{S_{0}}(h)\mid C[M](h^{\prime})=\alpha\}|\neq|\{h^{\prime}\in\text{{Orb}}_{S_{0}}(h)\mid C[\sigma^{\prime}N](h^{\prime})=\alpha\}|.

Thus, taking $h_{0}:=h$ suffices.

Inductively, suppose Spoiler has maintained the invariant after $i>0$ rounds. At the start of round $i+1$ Spoiler picks up a pair of pebbles $(a_{j},b_{j})$ where $a_{j}$ is not placed on $c_{1},\ldots,c_{i}$ or an element of $\text{sp}(g)$ . Suppose Duplicator plays a bijection $\sigma_{i+1}$ . Then, by the induction hypothesis there is an $h_{i}$ and an $\alpha_{i}$ such that

\displaystyle|\{h^{\prime}\in\text{{Orb}}_{S_{i}}(h_{i})\mid C[M](h^{\prime})=\alpha_{i}\}|\neq|\{h^{\prime}\in\text{{Orb}}_{S_{i}}(h_{i})\mid C[\sigma_{i}N](h^{\prime})=\alpha_{i}\}|.

and $c_{1},\ldots,c_{i}\in\text{sp}(h_{i})$ . Let $c_{1},\ldots,c_{i},s_{i+1},\ldots,s_{u}$ enumerate $\text{sp}(h_{i})$ . Then $\text{{Orb}}_{S_{i}}(h_{i})$ is partitioned into sets $(O_{c})_{c\in A\setminus\vec{a}\cup\{c_{1},\ldots,c_{i}\}}$ , where $O_{c}=\{\pi h_{i}\mid\pi(s_{i+1})=c\}$ . Thus there is a value $c_{i+1}$ and $\alpha_{i+1}$ for which

\displaystyle|\{h^{\prime}\in O_{c_{i+1}}\mid C[M](h^{\prime})=\alpha_{i+1}\}|\neq|\{h^{\prime}\in O_{c_{i+1}}\mid C[\sigma N](h^{\prime})=\alpha_{i+1}\}|.

Spoiler then places $a_{j}$ on $c_{i+1}$ and $b_{j}$ on $\sigma_{i+1}(c_{i+1})$ . Let $h_{i+1}$ be any element of $O_{c+1}$ . The required invariant is satisfied by construction. ∎

It follows from Claim 4.9 that in at most $k\cdot 0pt(C)$ rounds Spoiler can force the pebbles to a position such that for some input gate $g$ labelled by some variable $x\in X$ we have: that for any $\sigma\in G$ that Duplicator may play we have that $C[M](g)\neq C[\sigma N](g)$ . Thus $M(x)\neq\sigma N(x)$ , and so Spoiler wins.

∎

4.4 Bounds on Supports

The main result of this subsection establishes that, for suitable choice of groups $G$ , if a family of $G$ -symmetric circuits has subexponential size orbits than it has sublinear size supports. We prove this specifically for the case when $G$ is an alternating group. The argument extends easily to cases where $G$ contains a large alternating group, and we derive one such instance in Corollary 4.12. The proof relies on a standard fact about permutation groups, attributed to Jordan and Liebeck, that translates a bound on orbit size to a bound on support size. To understand this, suppose $g$ is a gate with orbit size (relative to $\text{{Alt}}_{n}$ ) bounded by ${n\choose k}$ . Then by the orbit-stabilizer theorem, this is equivalent to $[\text{{Alt}}_{n}:\text{{Stab}}(g)]\leq{n\choose k}$ . One way for $\text{{Stab}}(g)$ to have small index in this way is for it to always contain a large copy of the alternating group. That is $\text{{Stab}}(g)$ contains the alternating group restricted to $n-k$ elements, or equivalently $g$ has a support of size at most $k$ . Theorem 4.10 asserts that indeed this is the only way.

Theorem 4.10 ([13], Theorem 5.2B).

Let $Y$ be a set such that $n:=|Y|>8$ , and let $k$ be an integer with $1\leq k\leq\frac{n}{4}$ . Suppose that $G\leq\text{{Alt}}_{Y}$ has index $[\text{{Alt}}_{Y}:G]<{n\choose k}$ then there exists $X\subset Y$ with $|X|<k$ such that $\text{{Stab}}_{\text{{Alt}}_{Y}}(X)\leq G$ .

We derive from Theorem 4.10 the following asymptotic relationship between orbit and support size. An analogous version of this with respect to the symmetric group is stated in the full version of [11] and the proof is largely the same. We include a proof for completeness.

Theorem 4.11.

Let $(C_{n})$ be a family of rigid $\text{{Alt}}_{n}$ -symmetric circuits. If $\text{{max-orbit}}(C_{n})=2^{o(n)}$ then $\text{{max-supp}}(C_{n})=o(n)$ .

Proof.

Let $k$ be the least value such that $\text{{max-orbit}}{C_{n}}\leq{n\choose k}$ . By the assumption that $\text{{max-orbit}}(C_{n})=2^{o(n)}$ , we have that $k$ is $o(n)$ . Indeed, otherwise there is a constant $c$ with $0<c<\frac{1}{2}$ , such that $k-1\geq cn$ for infinitely many $n$ . And since ${n\choose l}\geq(\frac{n}{l})^{l}$ for all $l$ it follows that ${n\choose{k-1}}\geq(\frac{n}{cn})^{cn}>2^{cn}$ . Since $k$ is the least value such that $\text{{max-orbit}}(C_{n})\leq{n\choose k}$ , it follows that $\text{{max-orbit}}(C_{n})\geq 2^{cn}$ for infinitely many $n$ , contradicting the assumption that $\text{{max-orbit}}(C_{n})=2^{o(n)}$ .

From $k=o(n)$ it follows that for all large enough $n$ , $k\leq\frac{n}{4}$ . Then, for any gate $g$ of $C_{n}$ , by the orbit-stabilizer theorem, we have $[\text{{Alt}}_{n}:\text{{Stab}}(g)]\leq{n\choose k}$ and so by Theorem 4.10, $g$ has a support of size less than $k$ . ∎

The following corollary establishes the analogue of Theorem 4.11 needed to prove our lower bounds for $\text{{Alt}}_{n}\times\text{{Alt}}_{n}$ -symmetric circuits.

Corollary 4.12.

Let $(C_{n})_{n\in\mathbb{N}}$ be such that for each $n$ , $C_{n}$ is defined over the matrix of variables $X_{n}=\{x_{i,j}\mid i,j\in[n]\}$ and $C_{n}$ is a rigid $\text{{Alt}}_{n}\times\text{{Alt}}_{n}$ -symmetric circuit. If $\text{{max-orbit}}(C_{n})=2^{o(n)}$ then $\text{{max-supp}}(C_{n})=o(n)$ .

Proof.

Fix $G=\text{{Alt}}_{n}\times\text{{Alt}}_{n}$ and let $G_{r}=\text{{Alt}}_{n}\times\{\operatorname{id}\}\leq G$ and $G_{c}=\{\operatorname{id}\}\times\text{{Alt}}_{n}\leq G$ be the restriction of the action of $G$ to the two coordinates. We can think of them as the actions of $G$ on the rows and columns respectively. For any gate $g$ in $C_{n}$ , let $S_{r}$ and $S_{c}$ be supports of $g$ under the action of $G_{r}$ and $G_{c}$ respectively. Then it is easily seen that $S_{r}\cup S_{c}$ is a support of $g$ .

Suppose for all gates $g$ in $C_{n}$ we have $\text{{max-orbit}}(C_{n})=2^{o(n)}$ . Since the orbit of $g$ under the action of $G$ includes its orbits under the action of its subgroups $G_{r}$ and $G_{c}$ , we have that each of $\text{{Orb}}_{G_{r}}(g)$ and $\text{{Orb}}_{G_{c}}(g)$ has size $2^{o(n)}$ and therefore by Theorem 4.11 $S_{r}$ and $S_{c}$ can be chosen of size $o(n)$ and the result follows. ∎

We now combine these results along with Theorem 4.8 to establish the crucial connection between bijection games and exponential lower bounds for symmetric circuits. We prove the result for the particular case of interest to us in this paper, namely for $\text{{Alt}}_{n}\times\text{{Alt}}_{n}$ -symmetric circuits taking as input $n\times n$ -matrices, but note that it holds more generally.

Theorem 4.13.

Fix a set $K$ , and for each $n\in\mathbb{N}$ , let $G_{n}$ denote the group $\text{{Alt}}_{n}\times\text{{Alt}}_{n}$ . Fix a $G_{n}$ -set $X_{n}$ and let $P=(P_{n})_{n\in\mathbb{N}}$ be a family of functions $P_{n}:K^{X_{n}}\rightarrow K$ , where $P_{n}$ is $G_{n}$ -symmetric. Suppose there are infinitely many $n$ for which there are pairs $M_{n},N_{n}:X_{n}\rightarrow K$ such that $P_{n}(M_{n})\neq P_{n}(N_{n})$ and Duplicator has a strategy to win the $(G_{n},k)$ -bijection game played on $(M_{n},[n]\uplus[n],\text{{Alt}}_{n}\times\text{{Alt}}_{n})$ and $(N_{n},[n]\uplus[n],\text{{Alt}}_{n}\times\text{{Alt}}_{n})$ for $k=\Omega(n)$ . Then there is no family of $G_{n}$ -symmetric circuits that computes $P$ and has size $2^{o(n)}$ .

Proof.

From Theorem 4.8 any $G_{n}$ -symmetric circuit $C_{n}$ that has $\text{{max-supp}}(C_{n})\leq k/2$ must have that $C[M_{n}]=C[N_{n}]$ , and so cannot compute $P_{n}$ . It follows then that any family of $G_{n}$ -symmetric circuits that computes $P_{n}$ must have $\text{{max-supp}}(C_{n})=\Omega(n)$ and so from Corollary 4.12 cannot have orbits of size $2^{o(n)}$ , and hence cannot have size $2^{o(n)}$ . ∎

Notice that both the definition of the bijection games and Theorem 4.8 place almost no restriction on the group actions considered. The link between the number of pebbles in the game and the size of the support is robust. However, the application in Theorem 4.13 is for a severely limited group action. This is because the link between support size and orbit size proved in Theorem 4.11 requires the presence of a large alternating group.

5 Lower Bound for the Determinant

We now deploy the machinery we have developed to prove the main lower bound result of this paper.

Theorem 5.1 (Main Theorem).

Let $\mathbb{F}$ be a field of characteristic $0$ . There is no family of $\text{{Alt}}_{n}\times\text{{Alt}}_{n}$ -symmetric circuits $(C_{n})_{n\in\mathbb{N}}$ over $\mathbb{F}$ of size $2^{o(n)}$ computing the determinant over $\mathbb{F}$ .

To prove Theorem 5.1 we need to construct for each $k$ , a pair of $n\times n$ $\{0,1\}$ -matrices $M_{k}$ and $N_{k}$ with $n=O(k)$ , $\det(M_{k})\neq\det(N_{k})$ and such that Duplicator has a winning strategy in the $(\text{{Alt}}_{n}\times\text{{Alt}}_{n},k)$ -bijection game played on $M_{k}$ and $N_{k}$ .

We construct the matrix $M_{k}$ as the biadjacency matrix of a bipartite graph $\Gamma$ . The matrix $N_{k}$ is obtained from $M_{k}$ by interchanging a pair of columns of $M_{k}$ . Hence, $N_{k}$ is also a biadjacency matrix of $\Gamma$ and $\det(M_{k})=-\det(N_{k})$ by construction. Thus, as long as $\det(M_{k})\neq 0$ the two determinants are different.

In Section 5.1 we describe the construction of the graph which gives rise to the biadjacency matrix $M_{k}$ . This graph is obtained by a CFI construction from a base graph $\Gamma$ satisfying a number of conditions. We show the existence of graphs $\Gamma$ satisfying these conditions in Section 5.2. Then, in Section 5.3 we argue that Duplicator has a winning strategy in the $(\text{{Alt}}_{n}\times\text{{Alt}}_{n},k)$ -bijection game played on $M_{k}$ and $N_{k}$ . We bring it all together in Section 5.4 for a proof of Theorem 5.1.

5.1 Constructing the Graph

In proving the lower bound for symmetric circuits computing the permanent in [11], we adapted a construction due to Cai, Fürer and Immerman [6] of pairs of graphs not distinguished by the $k$ -dimensional Weisfeiler-Leman algorithm. We showed that we could obtain such a pair of bipartite graphs with different numbers of perfect matchings. Note that saying a pair of graphs are not distinguished by the $k$ -dimensional Weisfeiler-Leman algorithm is the same as saying that Duplicator has a winning strategy in the $(k+1)$ -pebble bijection game, using the full symmetric group. In the present construction, we consider a game played on a pair of isomorphic bipartite graphs but where the set of permissible bijections does not include any isomorphisms between them. Equivalently, we play the game on two distinct biadjacency matrices for the same graph. The graphs we consider are, indeed, exactly the graphs used in [6] except that we have to ensure they are bipartite.

CFI graphs and Determinants

Let $\Gamma=(U\cup V,E)$ be a $3$ -regular bipartite graph with bipartition $U,V$ . We obtain the graph $\hat{\Gamma}$ by replacing each vertex $v$ , with neigbours $x,y,z$ with the ten-vertex gadget depicted in Figure 1. This gadget is described as follows.

•

There is a set denoted $I_{v}$ of four inner vertices: a vertex $v_{S}$ for each set $S\subseteq\{x,y,z\}$ of even size.
•

There is a set denoted $O_{v}$ of six outer vertices: two $u_{0},u_{1}$ for each $u\in\{x,y,z\}$ .
•

There is an edge between $v_{S}$ and $u_{1}$ if $u\in S$ and an edge between $v_{S}$ and $u_{0}$ if $u\not\in S$ .

Corresponding to each edge $e=\{u,v\}\in E$ there is a pair of edges that we denote $e_{0}$ and $e_{1}$ in $\hat{\Gamma}$ : $e_{0}$ connects the vertex $u_{0}\in O_{v}$ with $v_{0}\in O_{u}$ and $e_{1}$ connects the vertex $u_{1}\in O_{v}$ with $v_{1}\in O_{u}$

Refer to caption — Figure 1: A gadget in $\hat{\Gamma}$ corresponding to vertex $v$ with neighbours $x,y,z$

Note that the graph $\hat{\Gamma}$ is bipartite. Indeed, if we let $X:=\bigcup_{v\in U}I_{v}\cup\bigcup_{v\in V}O_{v}$ and we let $Y:=\bigcup_{v\in V}I_{v}\cup\bigcup_{v\in U}O_{v}$ then it is easily seen that all edges in $\hat{\Gamma}$ are between $X$ and $Y$ . Since $\Gamma$ is a $3$ -regular bipartite graph, it follows that $|U|=|V|$ and therefore $|X|=|Y|$ . Writing $m$ for $|U|$ and $n$ for $|X|=10m$ , we obtain a biadjacency $n\times n$ matrix representing the graph $\hat{\Gamma}$ by fixing a pair of bijections $\eta:X\rightarrow[n]$ and $\eta^{\prime}:Y\rightarrow[n]$ . The action of the group $D_{n}$ divides the collection of all such matrices into two orbits. Letting $M$ and $N$ be representatives of the two orbits, we have $\det(M)=-\det(N)$ . We next aim to show that $\det(M)\neq 0$ , provided that $\Gamma$ has an odd number of perfect matchings.

Suppose we are given bijections $\eta:X\rightarrow[n]$ and $\eta^{\prime}:Y\rightarrow[n]$ which together determine a biadjacency matrix $M$ representation of $\hat{\Gamma}$ . We can also identify each perfect matching in $\hat{\Gamma}$ with a bijection $\mu:X\rightarrow Y$ such that $\mu(x)$ is a neighbour of $x$ for all $x\in X$ . We write $\text{match}(\hat{\Gamma})$ for the collection of all perfect matchings of $\hat{\Gamma}$ . Then, the determinant of $M$ is given by:

	$\displaystyle\det(M)$	$\displaystyle=$	$\displaystyle\|\{\mu\mid\mu\in\text{match}(\hat{\Gamma})\text{ with }\text{sgn}(\eta^{\prime}\mu\eta^{-1})=1\}\|$
			$\displaystyle-\|\{\mu\mid\mu\in\text{match}(\hat{\Gamma})\text{ with }\text{sgn}(\eta^{\prime}\mu\eta^{-1})=-1\}\|.$

From now on, we take $\eta$ and $\eta^{\prime}$ to be fixed and write $\text{sgn}(\mu)$ and talk of the sign of a matching $\mu$ as short hand for $\text{sgn}(\eta^{\prime}\mu\eta^{-1})$ .

To show that $\det(M)$ is non-zero, we analyze the structure of the set $\text{match}(\hat{\Gamma})$ . Note that since $\eta$ and $\eta^{\prime}$ are fixed, this imposes a linear order on the sets $X$ and $Y$ . It also induces a linear order on the sets $U$ and $V$ , for instance by saying that $u<v$ , for $u,v\in U$ if the least element of $\eta(I_{u}\cup O_{u})$ is less than the least element of $\eta(I_{v}\cup O_{v})$ . We make use of this order without further elaboration.

Perfect Matchings

In any perfect matching $\mu$ of $\hat{\Gamma}$ , all four vertices in $I_{v}$ for any $v\in U\cup V$ must be matched to vertices in $O_{v}$ as they have no neighbours outside this set. Thus, exactly two of the vertices of $O_{v}$ are matched to vertices in other gadgets. These two could be two vertices in a pair, e.g. $\{x_{0},x_{1}\}$ in Figure 1 or they could be from different pairs, e.g. $\{x_{0},y_{0}\}$ . It is easily checked, by inspection of the gadget, that in either case, removing the two vertices of $O_{v}$ from the gadget results in an 8-vertex graph that admits a perfect matching. Indeed, if we remove two vertices in a pair, such as $\{x_{0},x_{1}\}$ the resulting graph is a two-regular bipartite graph on eight vertices. The graph resulting from removing $x_{0}$ and $y_{0}$ from the gadget is depicted in Figure 2 and all other cases of removing two vertices of $O_{v}$ from different pairs result in a graph isomorphic to this one. We now classify perfect matchings in $\text{match}(\hat{\Gamma})$ according to which edges between gadgets are included in the matching.

Say that a matching $\mu\in\text{match}(\hat{\Gamma})$ is uniform if for each edge $e\in E$ , at most one of the two edges $e_{0}$ and $e_{1}$ is included in $\mu$ . Thus, $\mu$ is non-uniform if for some $e\in E$ both $e_{0}$ and $e_{1}$ are included in $\mu$ . Our first aim is to show that the non-uniform matchings contribute a net zero to the determinant of $M$ . Write $\text{uni-match}(\hat{\Gamma})$ to denote the set of uniform matchings of $\hat{\Gamma}$ .

Lemma 5.2.

	$\displaystyle\det(M)$	$\displaystyle=$	$\displaystyle\|\{\mu\mid\mu\in\text{uni-match}(\hat{\Gamma})\text{ with }\text{sgn}(\mu)=1\}\|$
			$\displaystyle-\|\{\mu\mid\mu\in\text{uni-match}(\hat{\Gamma})\text{ with }\text{sgn}(\mu)=-1\}\|.$

Proof.

For a non-uniform matching $\mu$ , let $e\in E$ be an edge for which both $e_{0}$ and $e_{1}$ are included in $\mu$ and and $v\in U\cup V$ a vertex such that $e$ is incident on $v$ . Then, the four vertices in $I_{v}$ are matched with the remaining four vertices in $O_{v}$ and there are exactly two ways this can be done. To see this, consider the gadget in Figure 1 and suppose that $e=\{v,x\}$ so $x_{0}$ and $x_{1}$ are matched outside the gadget. Then the only two possible matchings of the remaining eight vertices are $v_{\emptyset}-y_{0};v_{\{x,y\}}-z_{0};v_{\{y,z\}}-y_{1};v_{\{x,z\}}-z_{1}$ and $v_{\emptyset}-z_{0};v_{\{x,y\}}-y_{1};v_{\{y,z\}}-z_{1};v_{\{x,z\}}-y_{0}$ . Note that the second one is obtained from the first by composing with an odd permutation: namely the $4$ -cycle $(y_{0}z_{0}y_{1}z_{1})$ . We can then define an involution on the set of non-uniform matchings as follows: map $\mu$ to the unique non-uniform matching $\mu^{\prime}$ which differs from $\mu$ only in the set $O_{v}$ corresponding to the vertex $v\in U$ which is minimal (in the order on $U$ ) among all vertices of $U$ incident on some edge $e$ for which $e_{0},e_{1}$ are in $\mu$ . This is easily seen to be an involution by our previous observation. Moroever, it takes any matching $\mu$ to one of opposite sign.

We conclude that all non-uniform matchings $\mu$ come in pairs of opposite sign and therefore cancel out in the expression for the determinant of $M$ . ∎

Uniform Perfect Matchings.

Our next aim is to count uniform perfect matchings in $\hat{\Gamma}$ and classify them by sign. Suppose then that $\mu$ is a perfect matching that includes for each $e\in E$ at most one of the two edges $e_{0}$ and $e_{1}$ . Let $F_{\mu}\subseteq E$ be the set of those $e\in E$ such that exactly one of $e_{0}$ and $e_{1}$ is in $\mu$ . Furthermore, let $f_{\mu}:F_{\mu}\rightarrow\{0,1\}$ be the function given by $f_{\mu}(e)=i$ if, and only if, $e_{i}$ is in $\mu$ .

Since for each $v\in U\cup V$ , exactly two of the vertices of $O_{v}$ are matched to vertices in other gadgets we can see that $F_{\mu}$ includes exactly two edges incident on every vertex $v$ . In other words, $F_{\mu}$ is a $2$ -factor of $\Gamma$ and therefore has exactly $2m$ edges.

For a $2$ -factor $F$ of $\Gamma$ and a function $f:F\rightarrow\{0,1\}$ , write $\mu(F,f)$ for the collection of all matchings $\mu$ of $\hat{\Gamma}$ with $F_{\mu}=F$ and $f_{\mu}=f$ .

Lemma 5.3.

There are exactly $2^{2m}$ perfect matchings in $\mu(F,f)$ , for any $2$ -factor $F$ of $\Gamma$ and any function $f:F\rightarrow\{0,1\}$ .

Proof.

Let $v$ be a vertex of $\Gamma$ with neighbours $x,y,z$ and consider the gadget in Figure 1. Exactly two of the edges incident on $v$ are included in $F$ and suppose that it is the edges $\{v,x\}$ and $\{v,y\}$ . Further, suppose $f(\{v,x\})=f(\{v,y\})=0$ . This means that the matching must pair the vertices $x_{0}$ and $y_{0}$ with vertices outside the gadget and we verify that the gadget with these two vertices removed admits two distinct perfect matchings. The gadget with vertices $x_{0}$ and $y_{0}$ removed is pictured in Figure 2, where for clarity, we have removed the set brackets in the subscripts of the vertex labels. It is clear that $v_{\emptyset}$ must be matched with $z_{0}$ . This leaves a six-cycle $v_{\{x,y\}}-x_{1}-v_{\{x,z\}}-z_{1}-v_{\{y,z\}}-y_{1}$ which admits two matchings. Entirely analogously, if $f(\{v,x\})=f(\{v,y\})=1$ we can consider the gadget with the vertices $x_{1}$ and $y_{1}$ removed and it is easily checked that the resulting graph is isomorphic to the one depicted in Figure 2, as is also the case when $f(\{v,x\})\neq f(\{v,y\})$ .

Since the choice of the matching at each gadget is independent, and there are $2m$ gadgets, one for each vertex in $U\cup V$ , we see that there are $2^{2m}$ distinct matchings $\mu$ for the fixed choice of $2$ -factor $F$ and function $f:F\rightarrow\{0,1\}$ . ∎

The proof of Lemma 5.3 shows that the two matchings $\mu$ and $\mu^{\prime}$ in $\mu(F,f)$ obtained by varying the choice at just one gadget are related to each other by an even permutation. For instance, the two matchings in Figure 2 are related by the $3$ -cycle $x_{1}y_{1}z_{1}$ . We can immediately conclude that all $2^{2m}$ matchings in $\mu(F,f)$ have the same sign.

Lemma 5.4.

For any $2$ -factor $F$ of $\Gamma$ , any function $f:F\rightarrow\{0,1\}$ and any matchings $\mu_{1},\mu_{2}\in\mu(F,f)$ , $\text{sgn}(\mu_{1})=\text{sgn}(\mu_{2})$ .

We next show that we can go further and show that the sign of any matching in $\mu(F,f)$ does not depend on the choice of $f$ .

Lemma 5.5.

For any $2$ -factor $F$ of $\Gamma$ , any functions $f,g:F\rightarrow\{0,1\}$ and any matchings $\mu_{1}\in\mu(F,f)$ and $\mu_{1}\in\mu(F,g)$ , $\text{sgn}(\mu_{1})=\text{sgn}(\mu_{2})$ .

Proof.

It suffices to show the lemma for functions $f$ and $g$ which differ at exactly one edge $e\in F$ as the result then follows by transitivity. Moreover, by Lemma 5.4, it suffices to choose any $\mu_{1}\in\mu(F,f)$ and $\mu_{2}\in\mu(F,g)$ and show that they have the same sign

So, assume $f(e)=0$ and $g(e)=1$ and $f$ and $g$ agree on all other edges. Let $e=\{v,x\}$ where $v$ has neighbours $x,y,z$ and $x$ has neighbours $v,u,w$ . Without loss of generality, assume $\{v,y\}$ and $\{x,w\}$ are in $F$ with $f(\{v,y\})=f(\{x,w\})=g(\{v,y\})=g(\{x,w\})=0$ . By assumption, $\mu_{1}$ includes the edge $x_{0}-v_{0}$ and we can assume further that it includes the edges $v_{\emptyset}-z_{0}$ , $v_{\{x,y\}}-x_{1}$ in the gadget corresponding to $v$ and the edges $x_{\emptyset}-u_{0}$ , $v_{\{v,w\}}-v_{1}$ in the gadget corresponding to $x$ . In other words, it contains alternating edges along the $10$ -cycle depicted in Figure 3.

Now, choose $\mu_{2}$ to be the symmetric difference between $\mu_{1}$ and the cycle in Figure 3. This is also a perfect matching and it is in $\mu(F,g)$ as the only difference with $\mu_{1}$ as far as edges between gadgets is concerned is that $\mu_{2}$ contains $x_{1}-v_{1}$ rather than $x_{0}-v_{0}$ . Moreover, it is easily seen that $\mu_{2}$ is obtained from $\mu_{1}$ by composition with an even permutation: namely the $5$ -cycle: $(x_{0}z_{0}x_{1}x_{\{v,w\}}x_{\emptyset})$ . ∎

With this, we are now ready to establish the main result of this section.

Lemma 5.6.

If $\Gamma$ has an odd number of perfect matchings, then $\det(M)\neq 0$ .

Proof.

By Lemma 5.2, we have

	$\displaystyle\det(M)$	$\displaystyle=$	$\displaystyle\|\{\mu\mid\mu\in\text{uni-match}(\hat{\Gamma})\text{ with }\text{sgn}(\mu)=1\}\|$
			$\displaystyle-\|\{\mu\mid\mu\in\text{uni-match}(\hat{\Gamma})\text{ with }\text{sgn}(\mu)=-1\}\|.$

For any $2$ -factor $F$ of $\Gamma$ write $\mu(F)$ for $\bigcup_{f:F\rightarrow\{0,1\}}\mu(F,f)$ and note that by Lemma 5.3 $|\mu(F)|=2^{4m}$ for all $F$ . Moreover, by Lemma 5.5 all matchings in $\mu(F)$ have the same sign. Thus, we define $\text{sgn}(F)$ to be $\text{sgn}(\mu)$ for any $\mu\in\mu(F)$ . Hence, we have that

\det(M)=2^{4m}\sum_{F}\text{sgn}(F),

where the sum is over all $2$ -factors of $\Gamma$ .

Since $\Gamma$ is $3$ -regular, the number of $2$ -factors of $\Gamma$ is exactly the number of perfect matchings. Indeed, the complement of any $2$ -factor is a perfect matching and this gives a bijection between the collection of perfect matchings and $2$ -factors. Thus, since the number of perfect matchings of $\Gamma$ is odd, so is the number of $2$ -factors and we conclude that the sum $\sum_{F}\text{sgn}(F)$ cannot be zero, proving the result. ∎

5.2 Graphs with Odd Number of Perfect Matchings

We have seen that if $\Gamma$ has an odd number of perfect matchings, then the matrices $M$ and $N$ have different determinant. In order to play the bijection game on $M$ and $N$ we also need $\Gamma$ to be well connected. We now show that we can find suitable graphs that satisfy both of these conditions simultaneously.

For a positive integer $k$ , say that $\Gamma$ is $k$ -well-connected if any balanced separator of $\Gamma$ has size greater than $k$ . For our construction, we need $3$ -regular bipartite graphs on $2n$ vertices which are $k$ -well-connected for $k=\Omega(n)$ and which have an odd number of perfect matchings. The main purpose of this section is to prove the existence of such a family of graphs.

Theorem 5.7.

For all positive integers $n$ there is a bipartite graph $\Gamma_{n}=(U,V,E)$ satisfying the following conditions:

1.

$|U|=|V|=n$ ;
2.

$\Gamma_{n}$ is $3$ -regular;
3.

$\Gamma_{n}$ is $k$ -well-connected for $k=\Omega(n)$ ; and
4.

$\Gamma_{n}$ has an odd number of perfect matchings.

We prove the existence of these graphs by a randomized construction. To be precise, a random $3$ -regular bipartite graph on $2n$ vertices satisfies the third condition with high probability. We show that it can be modified to satisfy the fourth condition while keeping the connectivity high. To do this, we need some facts about the distribution of random $3$ -regular bipartite graphs.

Fix $U$ and $V$ to be two disjoint sets of $n$ vertices, and we are interested in the uniform distribution on $3$ -regular bipartite graphs on the vertices $U$ and $V$ . This distribution is not easy to sample from but it is known to be well-approximated by a number of other random models, including the union of disjoint random matchings, which we now describe. We say that a pair of bijections $\pi,\sigma:U\rightarrow V$ is disjoint if there is no $u\in U$ with $\pi(u)=\sigma(u)$ . Now consider a random graph $\mathcal{G}$ obtained by the following process:

1.

choose, uniformly at random, three bijections $\pi_{1},\pi_{2},\pi_{3}:U\rightarrow V$ ;
2.

if for some $j\in\{1,2,3\}$ with $i\neq j$ , $\pi_{i}$ and $\pi_{j}$ are not disjoint discard this choice of bijections; otherwise
3.

let $\mathcal{G}$ be the bipartite graph with parts $U$ and $V$ edges $\{\{u,\pi_{i}(u)\}\mid i\in\{1,2,3\}\}$ .

The random graph model obtained in this way is known to be contiguous to the uniform distribution on $3$ -regular bipartite graphs [23]. This means that any property that holds asymptotically almost surely in one also holds so in the other. The property we are interested in is that of being an expander. It is known [4] that a random $3$ -regular bipartite graph is an expander with probability tending to $1$ . This result is, in fact, proved in the configuration model of Bollobas [3] but this is also known to be contiguous to the uniform distribution. We can therefore conclude that the same is true for the random graph $\mathcal{G}$ .

Lemma 5.8.

There is a constant $\alpha>0$ such that with probability tending to $1$ , $\mathcal{G}$ is an $\alpha$ -expander.

An immediate consequence of this is that with high probability, $\mathcal{G}$ is $\epsilon n$ -well-connected for some constant $\epsilon$ . We now describe how we can obtain from $\mathcal{G}$ a graph which also has an odd number of perfect matchings.

Let $B_{\Gamma}$ denote the biadjacency matrix of $\Gamma=(U,V,E)$ with rows indexed by $U$ and columns by $V$ . Then the permanent of $B_{\Gamma}$ over a field of characteristic $p$ is exactly the number of perfect matchings in $\Gamma$ modulo $p$ . In particular, when $p=2$ , since the permanent is the same as the determinant, we have that the number of perfect matchings in $\Gamma$ is odd if, and only if, $\det(B_{\Gamma})\neq 0$ , where the determinant is over $\mathbb{F}_{2}$ . We do not expect that $\det(B_{\mathcal{G}})\neq 0$ with high probability. To prove Theorem 5.7 it would suffice to show that this is the case with positive probability and this does seem likely to be true. However, we adopt an indirect approach. We show that with probability bounded away from zero $\text{{rk}}(B_{\mathcal{G}})$ is at least $n-o(n)$ . And, we then show that we can transform any graph $\Gamma$ with $\text{{rk}}(B_{\Gamma})<n$ to a graph $\Gamma^{\prime}$ so that $\text{{rk}}(B_{\Gamma^{\prime}})>\text{{rk}}(B_{\Gamma})+1$ and $\Gamma^{\prime}$ is still well-connected if $\Gamma$ is. Together these give us the graphs we want. We next introduce some notation and terminology that is helpful in establishing these two facts.

In what follows, we treat the biadjacency matrix $B_{\Gamma}$ of a graph $\Gamma$ as being a matrix over $\mathbb{F}_{2}$ and so all arithmetic operations on elements of the matrix should be taken as being over this field.

Lemma 5.9.

There is a constant $\epsilon$ such that for all sufficiently large $n$ , $\Pr[\text{{rk}}(B_{\mathcal{G}})\geq n-\epsilon\log n]\geq 1/2$ .

Proof.

For any matrix $A$ , if $\text{{rk}}(A)=n-c$ , then the dimension of the null space of $A$ is $c$ , so there are $2^{c}-1$ non-zero vectors $\mathbf{x}$ such that $A^{T}\mathbf{x}=0$ . We show that for the random graph $\mathcal{G}$ , the expected number of vectors $\mathbf{x}\in\mathbb{F}_{2}$ such that $B_{\mathcal{G}}^{T}\mathbf{x}=0$ is at most linear in $n$ ²²2This is a very loose upper bound that suffices for our purposes. By more careful analysis, we can easily show that this expectation is bounded by a constant. Numerical simulations suggests that it in fact tends to $1$ as $n$ goes to infinity.. Let $\mathbf{X}$ denote the random variable that is the number of such vectors.

For an element $u\in U$ , write $r_{u}$ for the row of $B_{\mathcal{G}}$ indexed by $u$ . For a vector $\mathbf{x}\in\mathbb{F}_{2}^{U}$ , let $R_{\mathbf{x}}$ be the set $\{u\in U\mid\mathbf{x}_{u}=1\}$ and note that the condition $B_{\mathcal{G}}^{T}\mathbf{x}=0$ is equivalent to the statement $\sum_{u\in R_{\mathbf{x}}}r_{u}=0$ .

For any set $S\subseteq U$ and a bijection $\pi:U\rightarrow V$ , we write $\pi(S)$ to denote the image of $S$ under $\pi$ . Say that $S$ is a zero-sum set if $S$ is non-empty with $\sum_{u\in S}r_{u}=0$ . If $S$ is a zero-sum set, it must be the case that any vertex $v$ in $\pi_{1}(S)\cup\pi_{2}(S)\cup\pi_{3}(S)$ has exactly two neighbours in $S$ . If $|S|=m$ , this implies that $|\pi_{1}(S)\cap\pi_{2}(S)|=|\pi_{2}(S)\cap\pi_{3}(S)|=|\pi_{1}(S)\cap\pi_{3}(S)|=m/2$ . In particular $m$ is even, and $m\leq 2n/3$ .

We now estimate the expected number of zero-sum sets of size $m=2l$ in the random graph $\mathcal{G}$ . Fix a set $S\subseteq U$ of size $m$ and choose three permutations $\pi_{1},\pi_{2},\pi_{3}$ of $[n]$ independently at random. Note that a random choice of $\pi_{1}$ means that $\pi_{1}(S)$ is a uniformly random subset of $V$ of size $m$ . Thus, the probability that $|\pi_{1}(S)\cap\pi_{2}(S)|=m/2=l$ is:

p(n,l):=\frac{{{2l}\choose l}{{n-2l}\choose l}}{{n\choose{2l}}}.

For $S$ to be a zero-sum set, we further require that $\pi_{3}(S)=\pi_{1}(S)\triangle\pi_{2}(S)$ . The probability that a randomly chosen $\pi_{3}$ gives exactly this set is $1/{n\choose{2l}}$ . Summing over the ${n\choose{2l}}$ choices of the set $S$ , we get that the expected number of sets of size $m=2l$ satisfying the condition $|\pi_{1}(S)\cap\pi_{2}(S)|=|\pi_{2}(S)\cap\pi_{3}(S)|=|\pi_{1}(S)\cap\pi_{3}(S)|=m/2$ , taken over all choices of $\pi_{1},\pi_{2},\pi_{3}$ is $p(n,l)$ . Now, the probability that two random permutations are not disjoint is the same as the probability that a random permutation contains a fixed-point, which is well known to tend to $1/e$ from above as $n$ goes to infinity. Hence we see that the probability that all three permutations are disjoint is at least $1/e^{3}$ and therefore the expected number of zero-sum sets in $\mathcal{G}$ of size $2l$ is at most $e^{3}p(n,l)$ .

Hence the total expectation of the number of zero-sum sets is

\text{E}[\mathbf{X}]=\sum_{1\leq l\leq n/3}e^{3}p(n,l).

Since $p(n,l)<1$ for all $n$ and $l$ , we get $\text{E}[\mathbf{X}]<e^{3}n/3$ . By Markov’s inequality, it follows that the probability that $\mathbf{X}$ exceeds $2e^{3}n/3$ is less than $1/2$ . Since the dimension of the null space of $B_{\Gamma}$ is $\log(\mathbf{X}+1)$ , the theorem follows. ∎

To complete the construction, we show that if $\Gamma$ is a $3$ -regular bipartite graph with $n\times n$ biadjacency matrix $B_{\Gamma}$ and $\text{{rk}}(B)<n$ , then under mild assumptions satisfied by almost all such graphs, we can edit $\Gamma$ to get $\Gamma^{\prime}$ so that $\text{{rk}}(B_{\Gamma^{\prime}})>\text{{rk}}(B_{\Gamma})$ and $\Gamma^{\prime}$ is at least $(k-4)$ -well-connected if $\Gamma$ is $k$ -well-connected.

Assume then, that $\Gamma$ is a $3$ -regular graph on two sets $U$ and $V$ of $n$ vertices each and $B$ is its biadjacency matrix. As before, we write $r_{u}^{B}$ to denote the row of $B$ indexed by $u\in U$ . We drop the superscript $B$ where it is clear from context. We always treat these rows as vectors in $\mathbb{F}_{2}^{V}$ .

We say that a pair of edges $e_{1}=\{u_{1},v_{1}\}$ and $e_{2}=\{u_{2},v_{2}\}$ of $\Gamma$ with $u_{1},u_{2}\in U$ and $v_{1},v_{2}\in V$ are switchable if they are disjoint and neither of $\{u_{2},v_{1}\}$ nor $\{u_{1},v_{2}\}$ is an edge. For a switchable pair $e_{1},e_{2}$ we denote by $\tilde{\Gamma}_{e_{1},e_{2}}$ the graph obtained from $\Gamma$ by exchanging the two edges $e_{1}$ and $e_{2}$ . That is, $\tilde{\Gamma}_{e_{1},e_{2}}$ is the bipartite graph on the vertices $U$ , $V$ with edge set

E(\Gamma)\setminus\{e_{1},e_{2}\}\cup\{\{u_{1},v_{2}\},\{u_{2},v_{1}\}\}.

Note that $\tilde{\Gamma}_{e_{1},e_{2}}$ is also a $3$ -regular bipartite graph. We write $\tilde{B}_{e_{1},e_{2}}$ for the biadjacency matrix of $\tilde{\Gamma}_{e_{1},e_{2}}$ .

Assume now that $\text{{rk}}(B)<n$ . Then $B$ has a zero-sum set, i.e. a set $S\subseteq U$ such that $\sum_{u\in S}r_{u}=0$ . Moreover, by the $3$ -regularity of $\Gamma$ , we have $2\leq|S|\leq 2n/3$ . We can now state the lemma we aim to prove.

Lemma 5.10.

If $B$ has a zero-sum set $S$ with $|S|<2n/3$ , then there are switchable edges $e_{1},e_{2}\in E(\Gamma)$ so that $\text{{rk}}(\tilde{B}_{e_{1},e_{2}})>\text{{rk}}(B)$ .

Proof.

Fix a zero-sum set $S$ in $B$ with $|S|<2n/3$ . Let $N(S)\subseteq V$ denote the set of elements of $V$ which are neighbours in $\Gamma$ of vertices in $S$ . The assumption on the size of $S$ implies that $|N(S)|<n$ and so $V\setminus N(S)\neq\emptyset$ .

For $i,j\in V$ , write $t_{ij}$ for the vector in $\mathbb{F}_{2}^{V}$ which is $1$ at positions $i$ and $j$ and $0$ everywhere else. Now consider the following set of vectors in $\mathbb{F}_{2}^{V}$ :

T=\{t_{ij}\mid i\in N(S)\text{ and }j\in V\setminus N(S)\}.

Observe that $\mathrm{span}(T)=E$ where $E$ is the subspace of $\mathbb{F}_{2}^{V}$ consisting of vectors with even Hamming weight. To see this, first observe that $t_{ij}\in\mathrm{span}(T)$ for all pairs $i,j\in V$ . When $i\in N(S)$ and $j\not\in N(S)$ , this is true by definition of $T$ . For $i,j\in N(S)$ , choose any $k\not\in N(S)$ and note that $t_{ik}+t_{jk}=t_{ij}$ . Similarly, if $i,j\not\in N(S)$ , we can pick a $k\in N(S)$ and again $t_{ki}+t_{kj}=t_{ij}$ . Since the collection of vectors $\{t_{ij}\mid i,j\in V\}$ clearly spans $E$ , we are done.

Let $R=\{r_{u}\mid u\in U\}$ be the set of rows of $B$ . Since each $r_{u}$ has Hamming weight $3$ , $r_{u}\not\in E$ so $\mathrm{span}(R)\not\subseteq E$ . Since $\dim(E)=n-1\geq\text{{rk}}(B)=\dim(\mathrm{span}(R))$ , we conclude that $T\not\subseteq\mathrm{span}(R)$ . Let us fix a $t_{ij}\in T$ such that $t_{ij}\not\in\mathrm{span}(R)$ .

Since $i\in N(S)$ and $S$ is a zero-sum set, $i$ has exactly two neighbours in $S$ and one in $U\setminus S$ . On the other hand, $j$ has three neighbours, all in $U\setminus S$ . Thus, we can choose a $k\in S$ which is a neighbour of $i$ but not $j$ and an $l\in U\setminus S$ which is a neighbour of $j$ but not of $i$ . Let $e_{1}=\{i,k\}$ and $e_{2}=\{j,l\}$ and observe that this pair is switchable by construction.

To prove the lemma, it then suffices to prove that $\text{{rk}}(\tilde{B}_{e_{1},e_{2}})>\text{{rk}}(B)$ . We do this by establishing the following two facts.

1.

$\sum_{u\in S}r_{u}^{\tilde{B}_{e_{1},e_{2}}}\neq 0$ ; and
2.

for any $S^{\prime}\subseteq U$ , if $\sum_{u\in S^{\prime}}r_{u}^{\tilde{B}_{e_{1},e_{2}}}=0$ , then $\sum_{u\in S^{\prime}}r_{u}^{B}=0$ .

Together these establish that the null space of $\tilde{B}_{e_{1},e_{2}}$ is strictly smaller than that of $B$ and hence the claim.

Note that for all $u\in U\setminus\{k,l\}$ , we have $r_{u}^{\tilde{B}_{e_{1},e_{2}}}=r_{u}^{B}$ , while $r_{k}^{\tilde{B}_{e_{1},e_{2}}}=r_{k}^{B}+t_{ij}$ and $r_{l}^{\tilde{B}_{e_{1},e_{2}}}=r_{l}^{B}+t_{ij}$ .

Thus, to prove the first fact, just note that since $k$ is in $S$ and $l$ is not, $\sum_{u\in S}r_{u}^{\tilde{B}_{e_{1},e_{2}}}=\sum_{u\in S}r_{u}^{B}+t_{ij}=t_{ij}$ .

To prove the second fact, consider any set $S^{\prime}\subseteq U$ such that $\sum_{u\in S^{\prime}}r_{u}^{\tilde{B}_{e_{1},e_{2}}}=0$ . We consider the following cases.

•

If neither $k$ nor $l$ is in $S^{\prime}$ , then $\sum_{u\in S^{\prime}}r_{u}^{\tilde{B}_{e_{1},e_{2}}}=\sum_{u\in S^{\prime}}r_{u}^{B}$ , so since the former sum is $0$ , so is the latter.
•

If both $k$ and $l$ are in $S^{\prime}$ , then $\sum_{u\in S^{\prime}}r_{u}^{\tilde{B}_{e_{1},e_{2}}}=\sum_{u\in S^{\prime}}r_{u}^{B}+2t_{ij}=\sum_{u\in S^{\prime}}r_{u}^{B}$ , and the same argument applies.
•

If exactly one of $k$ and $l$ is in $S^{\prime}$ , then $\sum_{u\in S^{\prime}}r_{u}^{\tilde{B}_{e_{1},e_{2}}}=\sum_{u\in S^{\prime}}r_{u}^{B}+t_{ij}$ . Hence, if $\sum_{u\in S^{\prime}}r_{u}^{\tilde{B}_{e_{1},e_{2}}}=0$ , we must have $\sum_{u\in S^{\prime}}r_{u}^{B}=t_{ij}$ . However, $i$ and $j$ were chosen so that $t_{ij}\not\in\mathrm{span}(R)$ , so this is impossible.

∎

Proof of Theorem 5.7.

By Lemma 5.8, for large enough values of $n$ , the random $3$ -regular graph $\mathcal{G}$ is $\tau n$ -well-connected for some constant $\tau>0$ with probability tending to $1$ . Thus, with high probability, the first three conditions are satisfied. If the biadjacency matrix $B_{\Gamma}$ of the resulting graph $\Gamma$ has rank $n$ , we are done.

If $\text{{rk}}(B_{\Gamma})<n$ , then with probability at least $1/2$ , we have $\text{{rk}}(B_{\Gamma})\geq n-\epsilon\log n$ by Lemma 5.9. Moreover, since the expected number of zero-sum sets of size exactly $2n/3$ is at most $e^{3}p(n,n/3)$ by the argument in the proof of Lemma 5.9, and this value tends to $0$ as $n$ grows, with high probability, $B_{\Gamma}$ has no zero-sum sets of this size. Hence, with positive probability, $\mathcal{G}$ satisfies the pre-conditions of Lemma 5.10.

Note that if $\Gamma$ satisfies the conditions of Lemma 5.10, then any zero-sum set in the graph $\tilde{\Gamma}_{e_{1},e_{2}}$ is also a zero-sum set in $\Gamma$ . Hence, if $\Gamma$ contains no zero-sum sets of size exactly $2n/3$ , the same is true of $\tilde{\Gamma}_{e_{1},e_{2}}$ . We can thus repeatedly apply the construction of Lemma 5.10 to obtain a graph $\Gamma^{\prime}$ such that $\text{{rk}}(B_{\Gamma})=n$ . It then follows that $\Gamma^{\prime}$ has an odd number of perfect matchings. It remains to argue that $\Gamma^{\prime}$ is still well-connected. Note that since $\text{{rk}}(B_{\Gamma})\geq n-\epsilon\log n$ , we get $\Gamma^{\prime}$ from $\Gamma$ by at most $\epsilon\log n$ applications of Lemma 5.10. At each step edges incident at most $4$ vertices are modified. Thus, if $S$ is a balanced separator in $\Gamma^{\prime}$ , then adding these four vertices to it gives us a balanced separator in $\Gamma$ . Thus, if $\Gamma$ is $k$ -well-connected, then $\Gamma^{\prime}$ is at least $(k-4)$ -well-connected and the result is proved.

∎

5.3 Playing the Game

Suppose $\Gamma$ is a bipartite $3$ -regular graph on two sets $U$ and $V$ of $m$ vertices each that is $(k+3)$ -well-connected. Let $\hat{\Gamma}$ be the CFI-graph constructed from $\Gamma$ as described in Section 5.1, and $M$ and $N$ be two biadjacency matrices for $\hat{\Gamma}$ where $N$ is obtained from $M$ by interchanging exactly one pair of columns. We aim to prove that Duplicator has a winning strategy in the $(\text{{Alt}}_{n}\times\text{{Alt}}_{n},k)$ -bijection game played on $M$ and $N$ .

Recall that $\hat{\Gamma}$ is a bipartite graph on two sets $X$ and $Y$ of $n=10m$ vertices each with $X=\bigcup_{u\in U}I_{u}\cup\bigcup_{v\in V}O_{v}$ and $Y=\bigcup_{v\in V}I_{u}\cup\bigcup_{u\in U}O_{u}$ . To say that Duplicator has a winning strategy in the $(\text{{Alt}}_{n}\times\text{{Alt}}_{n},k)$ -bijection game played on $M$ and $N$ is the same as saying that Duplicator has a winning strategy in the $(\text{{Alt}}_{n}\times\text{{Alt}}_{n},k)$ -bijection game played on the pair of graphs $\hat{\Gamma}$ and $\hat{\Gamma}^{\prime}$ where the latter is obtained from $\hat{\Gamma}$ by swapping two elements $y,y^{\prime}\in Y$ . It does not matter which two elements $y,y^{\prime}$ we choose as any swap can be obtained from $(yy^{\prime})$ by composing with a permutation in $\text{{Alt}}_{Y}$ . So, for some $u\in U$ , fix two vertices $x_{0},x_{1}\in O_{u}$ which form a single pair in the gadget corresponding to $u$ and let $\alpha=(x_{0}x_{1})$ . We write $\alpha\hat{\Gamma}$ for the graph on $X\cup Y$ which is exactly the same as $\hat{\Gamma}$ except the neighbours of $x_{0}$ in $\alpha\hat{\Gamma}$ are exactly the neighbours of $x_{1}$ in $\hat{\Gamma}$ and the neighbours of $x_{1}$ in $\alpha\hat{\Gamma}$ are exactly the neighbours of $x_{0}$ in $\hat{\Gamma}$ .

Lemma 5.11.

Duplicator has a winning strategy in the $(\text{{Alt}}_{X}\times\text{{Alt}}_{Y},k)$ -bijection game played on the graphs $\hat{\Gamma}$ and $\alpha\hat{\Gamma}$ .

To prove this lemma, we first introduce some notation and some observations about $\hat{\Gamma}$ . Consider permutations of the vertices $I_{v}\cup O_{v}$ in the gadget in Figure 1 which fix each of the sets $\{x_{0},x_{1}\}$ , $\{y_{0},y_{1}\}$ and $\{z_{0},z_{1}\}$ setwise. It is easily checked (and this is the key property of the gadget) that for any two of these three sets there is an autormorphism of the gadget that swaps the two vertices inside the two sets while leaving the two vertices in the third set fixed. For instance, there is an automorphism which we denote $\beta^{v}_{xy}$ , that exchanges $x_{0}$ with $x_{1}$ and $y_{0}$ with $y_{1}$ . The action of this automorphism on $I_{v}$ is to swap $v_{\emptyset}$ with $v_{\{x,y\}}$ and $v_{\{x,z\}}$ with $v_{\{y,z\}}$ . Each such automorphism consists of two swaps in $I_{v}$ and two in $O_{v}$ and so is a permutation in $\text{{Alt}}_{X}\times\text{{Alt}}_{Y}$ .

We can compose such automorphisms of the individual gadgets to get certain automorphisms of $\hat{\Gamma}$ . Let $C=v_{1}\cdots v_{l}$ be a simple cycle in the graph $\Gamma$ . That is, there are edges from $v_{1}$ to $v_{i+1}$ for each $i$ with $1\leq i<l$ and an edge from $v_{l}$ to $v_{1}$ . We define the permutation $\beta_{C}$ of $X\cup Y$ as the composition

\beta^{v_{1}}_{v_{l}v_{2}}\beta^{v_{2}}_{v_{1}v_{3}}\cdots\beta^{v_{i}}_{v_{i-1}v_{i+1}}\cdots\beta^{v_{l}}_{v_{l-1}v_{1}}.

This is easily seen to be an automorphism of $\hat{\Gamma}$ . Since it is the compostion of permutations each of which is in $\text{{Alt}}_{X}\times\text{{Alt}}_{Y}$ , $\beta_{C}\in\text{{Alt}}_{X}\times\text{{Alt}}_{Y}$ .

Now, say that a permutation $\beta$ of $X\cup Y$ is coherent if for each $v\in U\cup V$ , $\beta(I_{v})=I_{v}$ and $\beta(O_{v})=O_{v}$ . We are only interested in coherent permutations. Let vertices $u\in U$ and $v\in V$ be neighbours in $\Gamma$ and let $v_{0},v_{1}$ be the pair of vertices in $O_{u}$ which connect to vertices in $O_{v}$ . We say that a coherent permutation $\beta$ is good bar $uv$ if composing it with the swap $(v_{0}v_{1})$ yields an automorphism of $\hat{\Gamma}$ . We are now ready to describe the Duplicator winning strategy.

Proof of Lemma 5.11.

We describe Duplicator’s winning strategy. The position at any point of the game with Spoiler to move, consists of up to $k$ pebbled vertices of $\hat{\Gamma}$ along with a permutation $\beta$ of $X\cup Y$ which is obtained from the initial permutation $\alpha$ by means of composing it with an element of $\text{{Alt}}_{X}\times\text{{Alt}}_{Y}$ . If $x$ is the vertex covered by pebble $i$ , let $p_{i}\in U\cup V$ be the vertex of $\Gamma$ such that $x\in I_{p_{i}}\cup O_{p_{i}}$ . In other words $p_{1},\ldots,p_{k}$ enumerate the vertices of $\Gamma$ whose gadgets in $\hat{\Gamma}$ contain the pebbled vertices. Note that by the connectedness assumption, $\Gamma\setminus\{p_{1},\ldots,p_{k}\}$ has a component $\Delta$ which contains more than half of the vertices of $\Gamma$ , and $\Delta$ is $2$ -connected. We call $\Delta$ the large component at this game position.

We show that Duplicator can play to maintain the following invariant:

(*): there is an edge $\{u,v\}$ of $\Gamma$ such that $u\in U$ and $v\in V$ are both in the large component and $\beta$ is a coherent permutation that is good bar $uv$ .

In particular, this guarantees that $\beta$ is a partial isomorphism on the pebbled positions. Indeed, $\beta$ is a partial isomorphism on the graph $\hat{\Gamma}$ excluding $I_{u}\cup O_{u}\cup I_{v}\cup O_{v}$ , and none of these vertices is pebbled. Thus, if Duplicator can maintian the invariant (*) it is a winning strategy.

It is clear that the initial bijection $\alpha$ satisfies (*) as no vertices are pebbled.

At each subsequent move, Spoiler places a pebble on a vertex $x$ . Duplicator chooses any edge $\{u^{\prime},v^{\prime}\}$ in the large component (this could be $\{u,v\}$ if they are still in the large component). We then distinguish two cases.

1.

If $x\not\in\{v_{0},v_{1}\}$ then Duplicator’s response is to compose $\beta$ with $(v_{0}v_{1})(v_{0}^{\prime}v_{1}^{\prime})$ . This is a valid move as none of the four vertices is pebbled and the pemutation $(v_{0}v_{1})(v_{0}^{\prime}v_{1}^{\prime})$ is in $\text{{Alt}}_{X}\times\text{{Alt}}_{Y}$ since all four of $v_{0},v_{1},v_{0}^{\prime},v_{1}^{\prime}$ are in $Y$ . Moreover, the fact that $\beta$ is good bar $uv$ implies that composing it with $(v_{0}v_{1})$ yields an automorphism of $\hat{\Gamma}$ . Composing this automorphism with $(v_{0}^{\prime}v_{1}^{\prime})$ gives us a permutation that is good bar $u^{\prime}v^{\prime}$ . Thus, the invariant (*) is maintained.
2.

If $x\in\{v_{0},v_{1}\}$ Duplicator must compose $\beta$ with a permutation that fixes $x$ , so in fact fixes both $v_{0}$ and $v_{1}$ . By the fact that the large component $\Delta$ before the pebble is placed on $x$ is $3$ -connected, we know that there is a path in $\Delta$ from $u$ to $v$ that does not use the edge $e=\{u,v\}$ . Combining this with $e$ we obtain a simple cycle $C$ . Then, $\beta_{C}$ is an automorphism of $\hat{\Gamma}$ which, in particular, swaps $v_{0}$ and $v_{1}$ . Duplicator’s move is to compose $\beta$ with the permutation $(v_{0},v_{1})\beta_{C}(v_{0}^{\prime},v_{1}^{\prime})$ . Call the resulting permutation $\beta^{\prime}$ . Observe first that this is a valid move, as $\beta_{C}\in\text{{Alt}}_{X}\times\text{{Alt}}_{Y}$ and we are composing it with two swaps of elements of $Y$ . It remains to argue that $\beta^{\prime}$ is good bar $u^{\prime}v^{\prime}$ . By assumption composing $\beta$ with $(v_{0}v_{1})$ yields an automorphism of $\hat{\Gamma}$ and $\beta_{C}$ is also an automorphism of $\hat{\Gamma}$ . Thus, $\beta(v_{0},v_{1})\beta_{C}$ is an automorphism of $\hat{\Gamma}$ and $\beta^{\prime}$ is just the composition of this with $(v_{0}^{\prime},v_{1}^{\prime})$ .

∎

5.4 Bringing it Together

We pull things together to prove Theorem 5.1

Proof of Theorem 5.1.

We have from Theorem 5.7 that for each $n\in\mathbb{N}$ there exists a $3$ -regular balanced bipartite graph $\Gamma_{n}$ with $2n$ vertices that is $k(n)$ -well-connected for $k(n)=\Omega(n)$ and has an odd number of perfect matchings. Let $\hat{\Gamma}_{n}$ be the CFI-graph constructed from $\Gamma_{n}$ as described in Section 5.1, and $M_{n}$ and $N_{n}$ be two biadjacency matrices for $\hat{\Gamma}_{n}$ where $N_{n}$ is obtained from $M_{n}$ by interchanging exactly one pair of columns. From Lemma 5.11 we have that Duplicator has a winning strategy for the $(\text{{Alt}}_{n}\times\text{{Alt}}_{n},k(n)-3)$ -bijection game on $M_{n}$ and $N_{n}$ . From Lemma 5.6 and the fact that $\Gamma_{n}$ has an odd number of perfect matchings, it follows that $\det(M_{n})\neq 0$ and so, since $\det(M_{n})=-\det(N_{n})$ , we have $\det(M_{n})\neq\det(N_{n})$ . The result now follows from Theorem 4.13. ∎

Theorem 5.1 is stated and proved specifically for fields of characteristic zero. We could prove the result for fields of characteristic $p$ , when $p$ is an odd prime, provided that the matrices $M_{n}$ we construct have non-zero determinant modulo $p$ . As noted in the proof of Lemma 5.6, $\det(M_{n})=2^{4m}\sum_{F}\text{sgn}(F)$ , where the sum is over all $2$ -factors (or equivalently over all perfect matchings) of the bipartite graph $\Gamma_{n}$ . Of course, $\sum_{F}\text{sgn}(F)$ is just the determinant of the bi-adjacency matrix $B_{\Gamma_{n}}$ of $\Gamma_{n}$ . Hence, for odd prime $p$ , $\det{M_{n}}\not\equiv 0\pmod{p}$ provided that $\det(B_{\Gamma_{n}})\not\equiv\pmod{p}$ . Now, the construction in Section 5.2 is aimed at ensuring that $\det(B_{\Gamma_{n}})\not\equiv 0\pmod{2}$ . It seems plausible that we could just as well ensure that $\det(B_{\Gamma_{n}})\not\equiv 0\pmod{p}$ for some odd prime $p$ . This would immediately yield the analogue of Theorem 5.1 in characteristic $p$ . We leave this extension to future work.

6 Lower Bound for the Permanent

We previously established in [11] lower bounds on symmetric circuits for the permanent showing that there are no subexponential square-symmetric circuits computing the permanent of an $n\times n$ matrix in any field of characteristic zero, along with a similar result for matrix-symmetric circuits in any field of characteristic other than two.

The two bounds are consequences of the same construction: we give, for each $k$ , a pair of bipartite graphs $X_{k}$ and $\tilde{X}_{k}$ on which Duplicator has a winning strategy in the $k$ -pebble bijection game and which have different numbers of perfect matchings. The graphs $X_{k}$ and $\tilde{X}_{k}$ are on two sets $A$ and $B$ of $n=O(k)$ vertices each and the difference between the number of perfect matchings in $X_{k}$ and $\tilde{X}_{k}$ is a power of $2$ . The $k$ -pebble bijection game for which a Duplicator winning strategy is shown is essentially the $(\text{{Sym}}_{A}\times\text{{Sym}}_{B},k)$ game. This shows that the biadjacency matrices of $X_{k}$ and $\tilde{X}_{k}$ cannot be distinguished by $\text{{Sym}}_{A}\times\text{{Sym}}_{B}$ -symmetric circuits of subexponential size and also that the adjacency matrices of $X_{k}$ and $\tilde{X}_{k}$ cannot be distinguished by $\text{{Sym}}_{A\cup B}$ -symmetric circuits of subexponential size. Since $X_{k}$ and $\tilde{X}_{k}$ have different numbers of perfect matchings, their biadjacency matrices $B_{X_{k}}$ and $B_{\tilde{X}_{k}}$ have distinct permanents. Since the number of perfect matchings differ by a power of $2$ , they have distinct permanents modulo $p$ for any odd prime $p$ . Moreover, since the permanent of the adjacency matrix of a bipartite graph is the square of the permanent of its biadjacency matrix, we have that the adjacency matrices $A_{X_{k}}$ and $A_{\tilde{X}_{k}}$ also have distinct permanents. Together, these give us the stated lower bounds for circuits computing the permanent in the second and fourth columns of Table 1. To establish the lower bound in the third column, it suffices to observe that Duplicator has a winning strategy on $X_{k}$ and $\tilde{X}_{k}$ even in the restricted game $(\text{{Alt}}_{A}\times\text{{Alt}}_{B},k)$ . To see this, we give a brief account of the construction, and it is instructive to contrast it with the graphs we use in Section 5.1.

The construction of the graph $\hat{\Gamma}$ from $\Gamma$ given in Section 5.1 is essentially the original construction given by Cai et al. [6]. Their construction gives from $\Gamma$ a pair of non-isomorphic graphs which are not distinguishable by $k$ -dimensional Weisfeiler-Leman equivalence. We use only one graph $\hat{\Gamma}$ from the pair, as our game is played on two different biadjacency matrices of the same graph and our main concern is that this matrix has non-zero determinant. Of course, the different biadjacency matrices have the same permanent, and for a lower bound for the latter, we do need to play the game on a pair of non-isomorphic graphs. So, we look again at the pairs of graphs given by the CFI construction. The other graph in the pair would be obtained from $\hat{\Gamma}$ by “twisting” one of the edges. That is, for some edge $e=\{u,v\}$ of $\Gamma$ we replace the two edges $e_{0}=\{u_{0},v_{0}\}$ and $e_{1}=\{u_{1},v_{1}\}$ by the edges $\{u_{0},v_{1}\}$ and $\{u_{1},v_{0}\}$ . It can be verified that the number of perfect matchings in the two graphs is the same. Thus, the permanents of the biadjacency matrices of the two graphs are the same and they cannot be used directly to establish the lower bounds we want. In the construction we presented in [11] we adapted the CFI construction in two ways. The graph $X(\Gamma)$ is obtained from $\hat{\Gamma}$ by first, for each edge $e=\{u,v\}$ of $\Gamma$ , contracting the two edges $e_{0}=\{u_{0},v_{0}\}$ and $e_{1}=\{u_{1},v_{1}\}$ in $\hat{\Gamma}$ and secondly, for each vertex $v$ of $\Gamma$ , adding a new vertex $v_{b}$ to $\hat{\Gamma}$ which is adjacent to all four vertices in $I_{v}$ . Overall, this is equivalent to replacing each vertex $v$ of $\Gamma$ with incident edges $f,g,h$ with the gadget in Figure 4 where the dashed lines indicate edges whose endpoints are in other gadgets.

The resulting graph $X(\Gamma)$ is a $4$ -regular bipartite graph and $\tilde{X}(\Gamma)$ is obtained from it by taking one vertex $v$ of $\Gamma$ and in the corresponding gadget, for one edge $e$ incident on $v$ , interchanging the connections of $e_{0}$ and $e_{1}$ . The fact that $X(\Gamma)$ and $\tilde{X}(\Gamma)$ have biadjacency matrices with different permanents is proved in [11]. It is interesting to note, however, that these matrices have determinant zero, so do not yield a lower bound on the determinant.

Finally, we briefly describe a winning strategy for Duplicator in the $(\text{{Alt}}_{A}\times\text{{Alt}}_{B},k)$ -bijection game played on $X(\Gamma)$ and $\tilde{X}(\Gamma)$ . The winning strategy in the ordinary $k$ -pebble bijection game, is described in [9] and is based on the fact that the graph $\Gamma$ has tree-width greater than $k$ and so there is a winning strategy for robber in the $k$ -cops-and-robbers game played on $\Gamma$ . This is lifted to a winning strategy for Duplicator in the $k$ -bijection game which sees Duplicator maintain a bijection $\beta$ that is an isomorphism except at one gadget corresponding to a vertex $v$ of $\Gamma$ . The position $v$ is given as a robber winning position in a $k$ -cops-and-robbers game played on the graph $\Gamma$ . At each move, Duplicator takes a path from $v$ to $v^{\prime}$ in $\Gamma$ describing a robber move and changes $\beta$ to a bijection $\beta^{\prime}$ by composing it with automorphisms for the gadgets corresponding to the vertices along the path. This results in $\beta^{\prime}$ being an isomorphism except at the gadget corresponding to $v^{\prime}$ . It can be easily checked that $\beta^{\prime}$ is obtained from $\beta$ by composing with a permutation in $\text{{Alt}}_{A}\times\text{{Alt}}_{B}$ provided that the path from $v$ to $v^{\prime}$ is of even length. It is easy to ensure that the graph $\Gamma$ of tree-width greater than $k$ used is bipartite and robber has a winning strategy in the $k$ -cops and robber game in which robber always ends up on one side of the bipartition. This gives us the desired result.

Theorem 6.1.

Let $\mathbb{F}$ be a field of characteristic other than $2$ . There is no family of $\text{{Alt}}_{n}\times\text{{Alt}}_{n}$ -symmetric circuits $(C_{n})_{n\in\mathbb{N}}$ over $\mathbb{F}$ of size $2^{o(n)}$ computing the permanent over $\mathbb{F}$ .

7 Concluding Discussion

The study of the complexity of symmetric circuits began in the context of logic. Specifications of decision problems on graphs (or similar structures) formulated in formal logic translate naturally into algorithms that respect the symmetries of the graphs. This yields a restricted model of computation based on symmetric circuits for which we are able to prove concrete lower bounds, in a fashion similar to the restriction to monotone circuits. Methods developed in the realm of logic for proving inexpressibility results can be reinterpreted as circuit lower bound results.

One step in this direction was the connection established in [1] between polynomial-size Boolean threshold circuits on the one hand and fixed-point logic with counting on the other. This shows that the power of symmetric Boolean threshold circuits to decide graph properties is delimited by the counting width of those properties. In particular, this shows that a number of $\mathrm{NP}$ -complete graph problems including $3$ -colourability and Hamiltonicity cannot be decided by polynomial-size symmetric Boolean threshold circuits. This is particularly interesting as the power of such symmetric circuits has been shown to encompass many strong algorithmic methods based on linear and semidefinite programming (see, for instance, [2]). This methodology was extended to graph parameters beyond decision problems, and to arithmetic circuits rather than Boolean circuits in [11]. Together these extensions established that no subexponential size square symmetric (i.e. unchanged by simultaneous row and column permutations) arithmetic circuits could compute the permanent.

The permanent of a $\{0,1\}$ matrix $M$ has a natural interpretation as a graph invariant and so lends itself easily to methods for proving lower bounds on graph parameters. The situation with the determinant of $M$ is more subtle. If $M$ is a symmetric $n\times n$ matrix, then we can see at as the adjacency matrix of a graph $\Gamma$ on $n$ vertices and the determinant is a graph invariant, that is to say it only depends on the isomorphism class of $\Gamma$ . Moreover, it is a graph invariant that can be computed efficiently by symmetric circuits (at least in characteristic zero) as was shown for Boolean circuits in [18] and for arithmetic circuits in [11]. When $M$ is not a symmetric matrix, we could think of it as the adjacency matrix of a directed graph. In this case, the symmetries that a circuit must preserve are still the permutations of $n$ , so simultaneous permutations of the rows and columns of $M$ and the upper bounds obtained still apply. But, we can also think of $M$ as a biadjacency matrix of a bipartite graph $\Gamma$ , and now the determinant of $M$ is not an invariant of $\Gamma$ . We have a richer set of symmetries, and methods for proving bounds on the complexity of graph parameters do not directly apply.

What we have sought to do in the present paper is to develop the methods for proving lower bounds on the counting width of graph parameters to a general methodology for proving circuit lower bounds for polynomials or more generally functions invariant under certain permutations of their input variables. To do this, we prove a support theorem for circuits which is for a more general collection of symmetry groups than proved in prior literature; we adapt the Spoiler-Duplicator bijection game to work for more general invariance groups and more general structured inputs than those arising as symmetries of graph matrices; and we show a direct relationship between these games and orbit size of circuits that bypasses connections with width measures on graphs. This methodology is then applied to arithmetic circuits computing the determinant and we are able to prove an exponential lower bound for circuits symmetric under the full permutation group $D_{n}\leq\text{{Sym}}_{n}\times\text{{Sym}}_{n}$ that fixes the determinant of $M$ . Indeed, we do this for the smaller group $\text{{Alt}}_{n}\times\text{{Alt}}_{n}$ . The application requires considerable work in constructing the example matrices and applying the bijection games.

We see one main contribution to be establishing the general methodology for proving circuit lower bounds under various notions of symmetry. There are many ways in which this could be pushed further. First, our proof of the support theorem requires the presence of large alternating groups in the symmetry group under consideration. Perhaps more sophisticated notions of support could be developed which would allow us to consider smaller groups. Secondly, while we state our main result for the group $\text{{Alt}}_{n}\times\text{{Alt}}_{n}$ , the bijection game itself uses rather fewer symmetries. It would be interesting to establish tighter bounds on the symmetry group for which we get exponential lower bounds. Indeed, the results can be seen as giving a trade-off between circuit size and symmetries and this suggests an interesting terrain in which to explore the symmetry requirements of the circuit as a resource.

Acknowledgements

We are grateful to Benedikt Pago, whose comments resulted in a much improved version of the definitions and results in Section 4.1. We are also grateful to Albert Atserias for useful discussions on the construction in Section 5.2.

References

[1] M. Anderson and A. Dawar. On symmetric circuits and fixed-point logics. Theory Comput. Syst., 60(3):521–551, 2017.
[2] A. Atserias, A. Dawar, and J. Ochremiak. On the power of symmetric linear programs. J.ACM, 2021. to appear. arxiv:1901.07825.
[3] Béla Bollobás. The distribution of the maximum degree of a random graph. Discret. Math., 32:201–203, 1980.
[4] G. Brito, I. Dumitriu, and K. D. Harris. Spectral gap in random bipartite biregular graphs and applications. Combinatorics, Probability and Computing, 2021. to appear. arXiv:1804.07808.
[5] P. Bürgisser, M. Clausen, and M. A. Shokrollahi. Algebraic Complexity Theory. Springer, Berlin, Heidelberg, 1 edition, 1997.
[6] J-Y. Cai, M. Fürer, and N. Immerman. An optimal lower bound on the number of variables for graph identification. Combinatorica, 12(4):389–410, 1992.
[7] A. Dawar. On symmetric and choiceless computation. In Mohammad Taghi Hajiaghayi and Mohammad Reza Mousavi, editors, Topics in Theoretical Computer Science, pages 23–29, Cham, 2016. Springer International Publishing.
[8] A. Dawar. Symmetric computation (invited talk). In 28th EACSL Annual Conference on Computer Science Logic, CSL 2020, 2020.
[9] A. Dawar and D. Richerby. The power of counting logics on restricted classes of finite structures. In CSL 2007:Computer Science Logic, volume 4646 of LNCS, pages 84–98. Springer, 2007.
[10] A. Dawar and G. Wilsenach. Symmetric circuits for rank logic. In 27th EACSL Annual Conference on Computer Science Logic, CSL 2018, pages 20:1–20:16, 2018. Full version at arXiv:1804.02939.
[11] A. Dawar and G. Wilsenach. Symmetric Arithmetic Circuits. In 47th International Colloquium on Automata, Languages, and Programming (ICALP 2020), volume 168 of Leibniz International Proceedings in Informatics (LIPIcs), pages 36:1–36:18, 2020. Full version at arXiv:2002.06451.
[12] A. Dawar and G. Wilsenach. Lower bounds for symmetric circuits for the determinant. In 13th Innovations in Theoretical Computer Science Conference, ITCS, volume 215 of LIPIcs, pages 52:1–52:22. Schloss Dagstuhl - Leibniz-Zentrum für Informatik, 2022.
[13] J.D. Dixon and B. Mortimer. Permutation Groups. Graduate Texts in Mathematics. Springer New York, 1996.
[14] D. Grigoriev and M. Karpinski. An exponential lower bound for depth 3 arithmetic circuits. In Proceedings of the Thirtieth Annual ACM Symposium on the Theory of Computing, pages 577–582, 1998.
[15] M. Grohe. Descriptive Complexity, Canonisation, and Definable Graph Structure Theory. Lecture Notes in Logic. Cambridge University Press, 2017.
[16] F. Harary. Determinants, permanents and bipartite graphs. Mathematics Magazine, 42:146–148, 1969.
[17] L. Hella. Logical hierarchies in PTIME. Information and Computation, 129(1):1 – 19, 1996.
[18] B. Holm. Descriptive Complexity of Linear Algebra. PhD thesis, University of Cambridge, 2010.
[19] S. Hoory, N. Linial, and A. Wigderson. Expander graphs and their applications. Bulletin of the American Mathematical Society, 43(4):439–561, 2006.
[20] M. Jerrum and M. Snir. Some exact complexity results for straight-line computations over semirings. J. ACM, 29:874–897, 1982.
[21] N. Kayal and R. Saptharishi. A selection of lower bounds for arithmetic circuits. In M. Agrawal and V. Arvind, editors, Perspectives in Computational Complexity. Birkhäuser Basel, 2014.
[22] J.M. Landsberg and N. Ressayre. Permanent v. determinant: An exponential lower bound assuming symmetry. In Proc. ACM Conference on Innovations in Theoretical Computer Science, pages 29–35. ACM, 2016.
[23] M. Molloy, H. D. Robalewska, R. W. Robinson, and N. C. Wormald. 1-factorizations of random regular graphs. Random Struct. Algorithms, 10:305–321, 1997.
[24] H.J. Ryser. Combinatorial Mathematics, volume 14. Mathematical Association of America, 1 edition, 1963.
[25] P.D. Seymour and R. Thomas. Graph searching and a min-max theorem for tree-width. Journal of Combinatorial Theory, Series B, 58(1):22–33, 1993.
[26] A. Shpilka and A. Yehudayoff. Arithmetic circuits: A survey of recent results and open questions. Foundations and Trends in Theoretical Computer Science, 5(3-4):207–388, 2010.
[27] H. Vollmer. Introduction to Circuit Complexity - A Uniform Approach. Texts in Theoretical Computer Science. An EATCS Series. Springer, 1999.

Lower Bounds for Symmetric Circuits for the Determinant††thanks: Research funded by EPSRC grant EP/S03238X/1. A preliminary version of this paper appeared as [12]

Abstract

1 Introduction

2 Background

2.1 Groups

2.2 Matrices

2.3 Graphs

Theorem 2.1 ([25]).

2.4 Circuits

Definition 2.2 (Circuit).

3 Symmetric Circuits

3.1 Group Actions and Symmetric Functions

Definition 3.1.

Example 3.2.

3.2 Symmetric Circuits

Definition 3.3 (Circuit Automorphism).

Definition 3.4 (Symmetric Circuit).

Proposition 3.5.

4 Games and Supports

4.1 Bijection Games on Indexed function

Definition 4.1.

Definition 4.2.

Lemma 4.3.

Proof.

Definition 4.4.

Definition 4.5.

4.2 Supports

Definition 4.6.

Definition 4.7.

4.3 Playing Games on Circuits

Theorem 4.8.

Proof.

Claim 4.9.

Proof.

4.4 Bounds on Supports

Theorem 4.10 ([13], Theorem 5.2B).

Theorem 4.11.

Proof.

Corollary 4.12.

Proof.

Theorem 4.13.

Proof.

5 Lower Bound for the Determinant

Theorem 5.1 (Main Theorem).

5.1 Constructing the Graph

CFI graphs and Determinants

Perfect Matchings

Lemma 5.2.

Proof.

Uniform Perfect Matchings.

Lemma 5.3.

Proof.

Lemma 5.4.

Lemma 5.5.

Proof.

Lemma 5.6.

Proof.

5.2 Graphs with Odd Number of Perfect Matchings

Theorem 5.7.

Lemma 5.8.

Lemma 5.9.

Proof.

Lemma 5.10.

Proof.

Proof of Theorem 5.7.

5.3 Playing the Game

Lemma 5.11.

Proof of Lemma 5.11.

5.4 Bringing it Together

Proof of Theorem 5.1.

6 Lower Bound for the Permanent

Theorem 6.1.

7 Concluding Discussion

Acknowledgements

References

Lower Bounds for Symmetric Circuits for the Determinant^†^†thanks: Research funded by EPSRC grant EP/S03238X/1. A preliminary version of this paper appeared as [12]