Unweighted Code Sparsifiers and Thin Subgraphs
Abstract
We show that for every -dimensional linear code there exists a set of size at most such that the projection of onto has distance at least . As a consequence we show that any connected graph with edges and vertices has at least many -thin subgraphs.
1 Introduction
Given a code over , its distance is defined as
where is the number of nonzero coordinates of .
We say is a linear code if for any , .
We say is -dimensional if it has codewords.
For convenience, we identify the set with its indicator vector (thus is identified with ).
The projection of onto is the code obtained by taking the coordinates of every codeword in . Recently, Khanna-Putterman-Sudan [KPS24] initiated the study of code sparsification where they proved that any linear code of dimension over has a βweightedβ code sparsifier which projects onto at most -coordinates and such that the weighted hamming weight of every code word is preserved (up to a multiplicative error).
In our main theorem, we show that every linear code has unweighted sparsifier such that and . In other words, (assuming , we can increase the rate of the code by a factor of 2 while preserving almost the same distance-rate). Our main theorem is the following.
Theorem 1.1.
For any linear code , there is a set of size such that for any , . In particular we have .
We remark that the proof of the theorem is not algorithmic.
We also briefly explain applications of this theorem to thin subgraphs.
Given an unweighted (undirected) graph with vertices and edges,
we say a set is a -thin w.r.t. if for any nonempty set ,
i.e., has at most -fraction of the edges of every cut. Recall that a graph is -edge-connected if every cut in has at least edges. The following thin tree conjecture is proposed by Goddyn two decades ago [God04] and has been a subject of intense study since then [Asa+10, OS11, HO14, AO15, MP19, Mou, Alg23, KO23].
Conjecture 1.2 (Thin Tree Conjecture).
For any , there exists such that any -edge-connected graph has a spanning tree that is -thin.
We remark that there has been several βspectralβ constructions of thin subsets but it remained an open problem whether one can construct thin subsets combinatorially without appealing to eigenvalue arguments (this is specially motivated to address the thin tree conjecture, since -edge-connected graphs do not necessarily have spectrally thin trees, see [AO15]).
Corollary 1.3.
For any connected graph with vertices and edges, there exists a -thin set with .
Proof.
The main observation is that if we identify every cut with the indicator vector of the set of edges of that cut, we obtain a linear code (over ) of dimension . Having said that to prove the statement it is enough to use the set promised in 1.1 which satisfies ; then is a -thin subgraph of and has size . β
The following stronger statement also follows from our proof.
Theorem 1.4.
Any graph with edges and vertices has at least many -thin subgraphs.
The bound is tight, as a spanning tree only has a single -thin subgraph, the empty-set.
We remark that although the existence of -thin subgraphs follows by spectral arguments such as [BSS14], we are not aware of any exponential lower-bound on the number of spectrally thin subgraphs.
2 Main Proof
Definition 2.1 (codeword flip).
Given a code word , the flip corresponding to is the map sending to .
When is linear, this defines an equivalence relation on subsets of . We say if can be obtained from by a codeword flip, i.e., . The transitivity of this relation follows from the linearity of .
Note that the collection of equivalence classes is the quotient space (this is a vector space over ). Since there are precisely equivalence classes.
Lemma 2.2.
Fix an equivalence class . Let be a set with the largest size in . Then for any
Proof.
We prove this by contradiction. Suppose that there exists a codeword such that
Let . By definition . But the above equation implies which is a contradiction. β
For an equivalence class , let be a set of the largest size in . LemmaΒ 2.2 implies that the collection contains distinct subsets s.t. .
Now, it follows by the pigeon-hole principle that there must be a set such that has distance at least half of the distance of .
Proof of 1.1.
Let
be the equivalence class whose largest set is the smallest. The set is the smallest among distinct sets. Hence by the following lemma, we must have . This finishes the proof. β
Lemma 2.3.
For every the number of subsets of size is less than .
Proof.
Let be independent Bernoulli random variable. Then, by Hoeffdingβs bound
Equivalently, the number of subsets of of size is at most . Letting we get . β
References
- [Alg23] Mahtab Alghasi βCombinatorially Thin Trees and Spectrally Thin Trees in Structured Graphsβ, 2023
- [AO15] Nima Anari and Shayan Oveis Gharan βEffective-Resistance-Reducing Flows, Spectrally Thin Trees, and Asymmetric TSPβ In FOCS IEEE Computer Society, 2015, pp. 20β39
- [Asa+10] Arash Asadpour et al. βAn -approximation Algorithm for the Asymmetric Traveling Salesman Problemβ In SODA, 2010, pp. 379β389
- [BSS14] Joshua Batson, Daniel A. Spielman and Nikhil Srivastava βTwice-Ramanujan Sparsifiersβ In SIAM Review 56.2, 2014, pp. 315β334
- [God04] Luis A. Goddyn βSome Open Problems I Likeβ available at http://www.math.sfu.ca/Β goddyn/Problems/problems.html, 2004
- [HO14] Nicholas J. A. Harvey and Neil Olver βPipage Rounding, Pessimistic Estimators and Matrix Concentrationβ In SODA, 2014, pp. 926β945
- [KO23] Nathan Klein and Neil Olver βThin Trees for Laminar Familiesβ In FOCS IEEE, 2023, pp. 50β59
- [KPS24] Sanjeev Khanna, Aaron (Louie) Putterman and Madhu Sudan βCode Sparsification and its Applicationsβ In SODA, 2024, pp. 5145β5168
- [Mou] βThin trees in 8-edge-connected planar graphsβ In Information Processing Letters 143, 2019, pp. 51β55
- [MP19] Martin Merker and Luke Postle βBounded diameter arboricityβ In Journal of Graph Theory 90.4, 2019, pp. 629β641
- [OS11] Shayan Oveis Gharan and Amin Saberi βThe Asymmetric Traveling Salesman Problem on Graphs with Bounded Genusβ In SODA, 2011, pp. 967β975