Sharp Bounds for Sets with Distinct Subset Products

Rushil Raghavan

Abstract

Let $A\subseteq[N]$ be such that for any pair of distinct subsets $B,C\subseteq A$ , the products $\prod_{b\in B}b$ and $\prod_{c\in C}c$ are distinct. We prove that $|A|\leq\pi(N)+\pi(N^{1/2})+o(\pi(N^{1/2}))$ , where $\pi$ is the prime counting function, answering a question of Erdős.

1 Introduction

For $N\in\mathbb{N}$ , let $f(N)$ denote the size of the largest subset $A$ of $[N]$ such that for any pair of distinct subsets $B,C\subseteq A$ , $\prod_{b\in B}b\neq\prod_{c\in C}c$ . We say that a set $A$ satisfying these conditions has distinct subset products. Erdős [3] initiated the study of the quantity $f(N)$ , proving¹¹1See Definition 1.9 for the asymptotic notation used in this paper.

f(N)\leq\pi(N)+O(\pi(N^{1/2})),

where $\pi(x)=|\{n\leq x:n\text{ is prime}\}|$ is the prime counting function.

He also produced the following example, establishing that the above bound is sharp up to the implicit constant:

Example 1.1.

Let $A=\{p\leq N:p\text{ is prime}\}\cup\{p^{2}:p\text{ is prime and }p\leq N^{1/2}\}.$ Then $A$ has distinct subset products, so $f(N)\geq|A|=\pi(N)+\pi(N^{1/2})$ .

He then asked [3], [4] whether this estimate is optimal up to lower order terms. This problem is also listed at https://www.erdosproblems.com/795.

Question 1.2 (Erdős #795).

Is $f(N)=\pi(N)+\pi(N^{1/2})+o(\pi(N^{1/2}))?$

We answer this question affirmatively:

Theorem 1.3.

$f(N)=\pi(N)+\pi(N^{1/2})+O(N^{5/12+o(1)})$ .

In view of the above estimate, it is natural to ask for more precise information about the lower-order term. Erdős also considered this in [3], and speculated that a refinement of Example 1.1 may be optimal. To be more specific, for $k\in\mathbb{N}$ , let $g(k)$ be the smallest element of $\mathbb{N}$ for which there is an $E_{k}\subseteq[g(k)]$ of size $k$ such that for any distinct $F,F^{\prime}\subseteq E_{k}$ , $\sum_{n\in F}n\neq\sum_{n\in F^{\prime}}n$ . In other words, let $g(k)$ be the smallest possible maximal element in a set of size $k$ with distinct subset sums. Then, by setting

A=\bigcup_{k=1}^{\infty}\bigcup_{n\in E_{k}}\{p^{n}:p\in(N^{1/g(k+1)},N^{1/g(k)}]\text{ is prime}\},

we see that the subset products of $A$ are distinct, and thus

f(N)\geq\sum_{k=1}^{\infty}\pi(N^{1/g(k)}).

Since $g(1)=1$ , $g(2)=2$ , $g(3)=4$ , and $g(4)=7$ (with $E_{4}=\{3,5,6,7\}$ ), Erdős established $f(N)\geq\pi(N)+\pi(N^{1/2})+\pi(N^{1/4})+\pi(N^{1/7})$ , and speculated that the above infinite sum may be best possible. However, we produce an example that improves upon this.

Theorem 1.4.

$f(N)\geq\pi(N)+\pi(N^{1/2})+\frac{1}{3}\pi(N^{1/3})-O(1)$ .

It is also natural to consider the additive variant of Question 1.2, or in other words, to determine the asymptotic behavior of the function $g(k)$ . This is another problem of Erdős, which can be seen at https://www.erdosproblems.com/1. Although our methods do not address this problem, the interested reader may consult [2], [5] for the best-known lower bounds on $g(k)$ and some history.

Since squares feature so prominently in Example 1.1, one may ask about what kinds of estimates can be obtained in the absence of squares. Our method answers this question as well, with a somewhat smaller second-order term.

Theorem 1.5.

Let $h(N)$ be the maximal size of a subset of $[N]$ consisting of squarefree integers with distinct subset products. Then $h(N)\leq\pi(N)+\frac{1}{2}\pi(N^{1/2})+O(N^{5/12+o(1)})$ .

This estimate is also sharp up to the error term.

Theorem 1.6.

$h(N)\geq\pi(N)+\frac{1}{2}\pi(N^{1/2})+o(\pi(N^{1/2}))$ .

1.1 Notation and strategy of proof

Definition 1.7 (Subset Product Set).

Given a finite set $S\subseteq\mathbb{N}$ , we define the subset product set $\Pi(S)=\{\prod_{t\in T}t:T\subseteq S\}$ .

Throughout the proof, we will use the fact that an element of $[N]$ can be divisible by at most one prime in $(N^{1/2},N]$ , and at most two primes (with multiplicity) in $(N^{1/3},N]$ . We thus define

Definition 1.8 (Small, Medium, and Large Primes, Valuations).

Let $\mathcal{P}_{\text{small}}$ denote the set of primes in $[N^{1/3}],$ $\mathcal{P}_{\text{med}}$ the set of primes in $(N^{1/3},N^{1/2}]$ , and $\mathcal{P}_{\text{large}}$ the set of primes in $(N^{1/2},N]$ . We will also use the terms “small primes”, “medium primes”, and “large primes” to refer to elements of $\mathcal{P}_{\text{small}}$ , $\mathcal{P}_{\text{med}}$ , and $\mathcal{P}_{\text{large}}$ , respectively.

For a fixed prime $p\in[N]$ , and $n\in\mathbb{N}$ , let $V_{p}(n)$ denote the valuation of $n$ at $p$ , i.e., the largest nonnegative integer $r$ such that $p^{r}$ divides $n$ . Then define functions

\mathbf{V}_{\text{small}}:\mathbb{N}\to\mathbb{Z}^{\mathcal{P}_{\text{small}}},\quad\mathbf{V}_{\text{med}}:\mathbb{N}\to\mathbb{Z}^{\mathcal{P}_{\text{med}}},\quad\mathbf{V}_{\text{large}}:\mathbb{N}\to\mathbb{Z}^{\mathcal{P}_{\text{large}}}\quad\text{by}

\mathbf{V}_{\text{small}}(n)=(V_{p}(n))_{p\in\mathcal{P}_{\text{small}}},\quad\mathbf{V}_{\text{med}}(n)=(V_{p}(n))_{p\in\mathcal{P}_{\text{med}}},\quad\mathbf{V}_{\text{large}}(n)=(V_{p}(n))_{p\in\mathcal{P}_{\text{large}}}.

We will also define $\mathbf{V}_{\text{large}}\times\mathbf{V}_{\text{med}}:\mathbb{N}\to\mathbb{Z}^{\mathcal{P}_{\text{large}}\cup\mathcal{P}_{\text{med}}}$ as $(\mathbf{V}_{\text{large}}\times\mathbf{V}_{\text{med}})(n)=(V_{p}(n))_{p\in\mathcal{P}_{\text{large}}\cup\mathcal{P}_{\text{med}}}$ .

We will also use some standard asymptotic notation:

Definition 1.9 (Big-O and Little-o Notation).

Given functions $F$ , $G$ , and $H$ from $\mathbb{N}$ to $[0,\infty)$ , we say

•

$F(n)=O(G(n))$ if there is a constant $C>0$ such that $F(n)\leq CG(n)$ for all $n\in\mathbb{N}$ ,
•

$F(n)=o(G(n))$ if for all $c>0$ , there is an $N_{0}\in\mathbb{N}$ such that for all $n\geq N_{0}$ , $F(n)\leq cG(N)$ ,
•

$F(n)=G(n)+O(H(n))$ if $|F(n)-G(n)|=O(H(n))$ .

To prove Theorem 1.3, we will use a graph-theoretic approach. Our graphs will be simple, i.e., containing no loops or multiple edges. We will need to pay attention to certain subgraphs in our analysis:

Definition 1.10 (Paths, Cycles, Circuits).

Let $G=(V,E)$ be a graph.

•

A path of length $k$ is a collection of edges of the form

$\{\{v_{1},v_{2}\},\{v_{2},v_{3}\},\dots,\{v_{k},v_{k+1}\}\}\subseteq E$ , where $v_{1},\dots,v_{k+1}$ are distinct vertices in $V$ .
•

A cycle of length $k$ is a collection of edges of the form

$\{\{v_{1},v_{2}\},\{v_{2},v_{3}\},\dots,\{v_{k-1},v_{k}\},\{v_{k},v_{1}\}\}\subseteq E$ , where $v_{1},\dots,v_{k}$ are distinct vertices in $V$ .
•

A circuit of length $k$ is a collection of edges of the form

$\{\{v_{1},v_{2}\},\{v_{2},v_{3}\},\dots,\{v_{k-1},v_{k}\},\{v_{k},v_{1}\}\}\subseteq E$ , where $v_{1},\dots,v_{k}$ are (not necessarily distinct) vertices in $V$ .

In [3], to prove $f(N)\leq\pi(N)+O(\pi(N^{1/2}))$ , Erdős counted the possible number of prime factorizations in elements of $\pi(A)$ , where $A\subseteq[N]$ has distinct subset products. His proof is a counting argument based on the following three observations:

•

An element of $[N]$ is divisible by at most one large prime, and at most two large or medium primes (with multiplicity).
•

The range of $\mathbf{V}_{\text{small}}$ on $\Pi([N])$ is small (see Proposition 2.1).
•

There are at most $\pi(N)$ elements $a$ of $A$ such that there is a prime $p$ for which $p$ divides $a$ , but no other element of $A$ is a multiple of $p$ .

Optimizing his argument, one can obtain the bound $f(N)\leq\pi(N)+22\pi(N^{1/2})$ . We will utilize each of these observations in our approach, but we take into account some more refined information as well. For example, if $p,q$ are large primes and $r,s$ are medium primes, the argument from [3] does not use the fact that $A$ cannot contain the elements $pr$ , $qr$ , $ps$ , and $qs$ (although it does show that $A$ cannot contain this configuration for many primes $p,q,r,s$ ).

The strategy of our proof is as follows. Given a set $A$ with distinct subset products, we can produce a graph whose vertices correspond to large and medium primes, and whose edges correspond to (most) elements of $A$ . We call this graph the prime factorization graph of $A$ . The condition that $A$ has distinct subset products restricts the number of short cycles that can appear in this graph. We can exploit this fact to prove an upper bound on the number of edges in this graph, and thus an upper bound on $|A|$ . In Section 2, we show how to construct this prime factorization graph. In Section 3, we use the condition that $A$ has distinct subset products to remove circuits and cycles from the prime factorization graph without removing too many edges. In Section 4, we estimate the number of edges in a graph without any short cycles, and finally establish Theorems 1.3 and 1.5.

In Section 5, we produce some new examples of nearly maximal sets with distinct subset products and prove Theorems 1.4 and 1.6.

Acknowledgements

The author would like to thank Terence Tao for many helpful conversations and for introducing the author to this problem.

2 Constructing the Prime Factorization Graph

Given a set $A$ with distinct subset products, we will construct a graph that encodes the large and medium prime factors of (most) elements of $A$ . For this to be effective, we first need to control how much information is lost by ignoring small primes.

Proposition 2.1.

Let $\Pi([N])$ denote the subset product set of $[N]$ . Then

|\mathbf{V}_{\text{small}}(\Pi([N]))|\leq\exp(O(N^{1/3+o(1)})).

Proof.

For a fixed $p\in\mathcal{P}_{\text{small}}$ and $n\in[N]$ , $V_{p}(n)\in[0,\log_{2}(N)]$ . Since an element of $\Pi([N])$ is a product of at most $N$ elements of $[N]$ , for a fixed $n\in\Pi([N])$ , $V_{p}(n)\in[0,N\log_{2}(N)]$ . Thus $|\mathbf{V}_{\text{small}}(\Pi([N]))|\leq$

\prod_{p\in\mathcal{P}_{\text{small}}}(N\log_{2}(N)+1)=\exp(|\mathcal{P}_{\text{small}}|\log(N\log_{2}(N)+1))=\exp(O(N^{1/3+o(1)})).\qed

In view of this proposition, if $A$ has distinct subset products, there cannot be many elements of $\Pi(A)$ with the same valuations at medium and large primes. In fact, the same is true for $A$ .

Proposition 2.2.

Given $\mathbf{r}\in\mathbb{Z}^{\mathcal{P}_{\text{large}}\cup\mathcal{P}_{\text{med}}}$ , let $A_{\mathbf{r}}=\{a\in A:(\mathbf{V}_{\text{large}}\times\mathbf{V}_{\text{med}})(a)=\mathbf{r}\}$ . Let $R=\{\mathbf{r}\in\mathbb{Z}^{\mathcal{P}_{\text{large}}\cup\mathcal{P}_{\text{med}}}:|A_{\mathbf{r}}|\geq 2\}$ . Then

\sum_{\mathbf{r}\in R}|A_{\mathbf{r}}|=O(N^{1/3+o(1)}).

Proof.

For each $\mathbf{r}\in R$ , let $B_{\mathbf{r}}\subseteq A_{\mathbf{r}}$ and $C_{\mathbf{r}}\subseteq A_{\mathbf{r}}$ each have size $\lfloor|A_{\mathbf{r}}|/2\rfloor$ . Then

\prod_{\mathbf{r}\in R}\prod_{b\in B_{\mathbf{r}}}b\quad\text{and}\quad\prod_{\mathbf{r}\in R}\prod_{c\in C_{\mathbf{r}}}c

each have the same valuations at all large and medium primes. There are thus at most $\exp(O(N^{1/3+o(1)}))$ elements of $\Pi(A)$ of the form $\prod_{\mathbf{r}\in R}\prod_{b\in B_{\mathbf{r}}}b$ . There are at least

\prod_{\mathbf{r}\in R}\binom{|A_{\mathbf{r}}|}{\lfloor|A_{\mathbf{r}}|/2\rfloor}\geq\prod_{\mathbf{r}\in R}(1.1)^{|A_{\mathbf{r}}|}

elements of this form, where we used the inequality $\binom{n}{\lfloor n/2\rfloor}\geq(1.1)^{n}$ for all $n\geq 2$ . Thus

(1.1)^{\sum_{\mathbf{r}\in R}|A_{\mathbf{r}}|}\leq\exp(O(N^{1/3+o(1)})),

and taking logarithms gives the desired inequality. ∎

Corollary 2.3.

Given $A\subseteq[N]$ with distinct subset products, one can remove $O(N^{1/3+o(1)})$ elements of $A$ to get a set $A^{\prime}\subseteq A$ such that $\mathbf{V}_{\text{large}}\times\mathbf{V}_{\text{med}}$ is injective when restricted to $A^{\prime}$ . By removing at most one more element, we may assume $(\mathbf{V}_{\text{large}}\times\mathbf{V}_{\text{med}})(a)\neq\mathbf{0}$ for all $a\in A^{\prime}.$

Because of this corollary, we will assume here and in the next section that $\mathbf{V}_{\text{large}}\times\mathbf{V}_{\text{med}}$ is injective and nonzero on $A$ . Observe also that for $a\in A$ , $a$ is divisible by at most one large prime, and at most two large or medium primes (with multiplicity). Thus the tuple $(\mathbf{V}_{\text{large}}\times\mathbf{V}_{\text{med}})(a)$ either consists of:

•

a 1 at one index, with 0 at all other indices,
•

a 1 at each of two indices, at most one of them from $\mathcal{P}_{\text{large}}$ , and a 0 at all other indices, or
•

a 2 at an index from $\mathcal{P}_{\text{med}}$ and a 0 at all other indices.

Under these conditions, we may define the prime factorization graph of a set $A$ .

Definition 2.4 (Prime Factorization Graph).

The prime factorization graph $G(A)$ associated to $A$ has vertex set $\mathcal{P}_{\text{large}}\cup\mathcal{P}_{\text{med}}\cup\{1\}$ . We connect $1$ to a vertex $p\in\mathcal{P}_{\text{large}}\cup\mathcal{P}_{\text{med}}$ by an edge if there is an $a\in A$ such that $V_{p}(a)=1$ and $V_{q}(a)=0$ for all other primes $q\in\mathcal{P}_{\text{large}}\cup\mathcal{P}_{\text{med}}$ . We connect $p,q\in\mathcal{P}_{\text{large}}\cup\mathcal{P}_{\text{med}}$ by an edge if there is an element of $A$ divisible by both $p$ and $q$ .

By the aforementioned discussion, there is a bijection between the edges of $G(A)$ and the elements of $A$ which are not divisible by the square of any medium prime. We will still need to consider those elements, so we define the following.

Definition 2.5 ( $\mathcal{P}_{\square}$ , $\mathcal{P}_{\not\square}$ ).

We define $\mathcal{P}_{\square}=\{p\in\mathcal{P}_{\text{med}}:p^{2}\mid a\text{ for some }a\in A\}$ and $\mathcal{P}_{\not\square}=\mathcal{P}_{\text{med}}\setminus\mathcal{P}_{\square}$ .

Then $|A|$ is the sum of the number of edges in $G(A)$ and $|\mathcal{P}_{\square}|$ .

Example 2.6.

Suppose $N=121$ , so $\mathcal{P}_{\text{small}}=\{2,3\}$ , $\mathcal{P}_{\text{med}}=\{5,7,11\}$ , and $\mathcal{P}_{\text{large}}$ is the remaining set of primes in $[N]$ . On the set $A=\{50,105,77,55,65,26,51\}=\{2*5^{2},3*5*7,7*11,5*11,5*13,2*13,3*17\}$ , $\mathbf{V}_{\text{large}}\times\mathbf{V}_{\text{med}}$ is injective and nonzero. Its prime factorization graph is (omitting the isolated vertices corresponding to the primes in $(17,121]$ ):

Each element of $A$ corresponds to an edge in the graph except for $50$ , since $50$ is a multiple of $5^{2}$ and $5\in\mathcal{P}_{\text{med}}$ .

Remark. Suppose $a\in A$ is divisible by $p^{2}$ for some $p\in\mathcal{P}_{\text{med}}$ . In our definition of $G(A)$ , we do not include an edge corresponding to $a$ . We could alternatively define the prime factorization graph so that $a$ would correspond to a loop from $p$ to itself. One can still make sense of the arguments in the following sections that way, but we found it clearer to express our graph-theoretic arguments using only simple graphs.

Having defined the necessary sets of primes, we will state our main estimate.

Theorem 2.7.

Let $A\subseteq[N]$ have distinct subset products and let $\mathcal{P}_{\square}=\\ \{p\in\mathcal{P}_{\text{med}}:p^{2}\mid a\text{ for some }a\in A\}$ . Then

|A|\leq\pi(N)+\frac{1}{2}\pi(N^{1/2})+\frac{1}{2}|\mathcal{P}_{\square}|+O(N^{5/12+o(1)}).

Since $\mathcal{P}_{\square}=\emptyset$ if the elements of $A$ are all squarefree and $|\mathcal{P}_{\square}|\leq\pi(N^{1/2})$ for any set $A$ , Theorems 1.3 and 1.5 follow immediately from Theorem 2.7.

3 Cycle Removal

Throughout this section, $A\subseteq[N]$ will have distinct subset products and be such that $\mathbf{V}_{\text{large}}\times\mathbf{V}_{\text{med}}$ is injective and nonzero on $A$ . Unless otherwise specified, the graph $G$ will denote the prime factorization graph of $A$ .

Lemma 3.1.

Let $C_{1},\dots,C_{n}$ be the sets of edges in a maximal collection of edge-disjoint even circuits in $G$ , each with length at most $2N^{1/12}$ . Then $n=O(N^{1/3+o(1)})$ , so $\sum_{k=1}^{n}|C_{k}|=O(N^{5/12+o(1)})$ .

Proof.

For each $k\in[n]$ , let $\{v_{1}^{k},\dots,v_{|C_{k}|}^{k}\}$ be the set of vertices in the cycle $C_{k}$ indexed so that $\{\{v_{1}^{k},v_{2}^{k}\},\{v_{2}^{k},v_{3}^{k}\},\dots,\{v_{|C_{k}|}^{k},v_{1}^{k}\}\}=C_{k}$ . Let $A_{0}^{k}$ be the subset of $A$ corresponding to the edges $\{v_{1}^{k},v_{2}^{k}\},\{v_{3}^{k},v_{4}^{k}\},\dots,\{v_{|C_{k-1}|}^{k},v_{|C_{k}|}^{k}\}$ , and let $A_{1}^{k}$ be the subset of $A$ corresponding to all the other edges in $C_{k}$ . Then

(\mathbf{V}_{\text{large}}\times\mathbf{V}_{\text{med}})\left(\prod_{a\in A_{0}^{k}}a\right)=(\mathbf{V}_{\text{large}}\times\mathbf{V}_{\text{med}})\left(\prod_{a\in A_{1}^{k}}a\right).

Thus, for any choice of $\mathbf{\epsilon}\in\{0,1\}^{n}$ ,

\prod_{k=1}^{n}\prod_{a\in A_{\epsilon_{k}}^{k}}a

has the same valuation at all large and medium primes. By Proposition 2.1, we thus have $2^{n}=\exp(O(N^{1/3+o(1)}))$ , as desired. ∎

Example 3.2.

Suppose $A=\{15,55,84,154,221,247,323,551,437,667\}$ , $\mathcal{P}_{\text{small}}=\{2\}$ , and $\mathcal{P}_{\text{med}}\cup\mathcal{P}_{\text{large}}=\{3,5,7,11,13,17,19,23,29$ }. Below is the prime factorization graph of $A$ , containing two even circuits. The product of the elements on the blue edges and the product of the elements on the red edges have the same valuations at all large and medium primes. By swapping the red-blue labeling in either circuit, we can produce four subsets whose products have the same valuations at all large and medium primes.

Corollary 3.3.

By removing $O(N^{5/12+o(1)})$ edges from $G$ , we may assume that $G$ has no even circuits of length at most $2N^{1/12}$ .

Proof.

Let $C_{1},\dots,C_{n}$ be the sets of edges in a maximal collection of edge-disjoint even circuits in $G$ , each with length at most $2N^{1/12}$ . We then have $\sum_{k=1}^{n}|C_{k}|=O(N^{5/12+o(1)})$ , so we may remove all the edges from $\bigcup_{k=1}^{n}C_{k}$ from $G$ . After doing so, if there remains an even circuit of length at most $2N^{1/12}$ in $G$ , this would contradict the maximality of $\{C_{1},\dots,C_{n}\}$ . ∎

From here forward, we will assume that $G$ contains no even circuits of length at most $2N^{1/12}$ . Having removed short even circuits from $G$ , we now turn our focus to cycles with odd length. We first have the following proposition for a general graph $G$ .

Proposition 3.4.

Let $G$ be a graph with no even circuits of length at most $2M$ . Then any two odd cycles in $G$ of length at most $M$ are vertex-disjoint.

Proof.

We will prove the contrapositive. Suppose $G$ contains two odd cycles with vertex sets $V_{1}=\{v_{1},\dots,v_{n}\}$ and $V_{2}=\{w_{1},\dots,w_{m}\}$ , respectively, such that $v_{1}=w_{1}$ . Supposing also that $n+m\leq 2M$ , we will show that $G$ has an even circuit of length at most $2M$ . Let $E_{1}=\{\{v_{1},v_{2}\},\dots,\{v_{n},v_{1}\}\}$ and $E_{2}=\{\{w_{1},w_{2}\},\dots,\{w_{m},w_{1}\}\}$ be their respective edge sets. We consider two cases.

First, assume $E_{1}\cap E_{2}=\emptyset.$ Then $\{\{v_{1},v_{2}\},\dots,\{v_{n},v_{1}\},\{w_{1},w_{2}\},\dots,\{w_{m},w_{1}\}\}$ form a circuit of length $n+m$ , which is even and at most $2M$ .

On the other hand, suppose $E_{1}\cap E_{2}\neq\emptyset$ . Since $E_{1}\neq E_{2}$ , we may assume without loss of generality that $v_{1}=w_{1}$ , $\{v_{1},v_{n}\}=\{w_{1},w_{m}\}$ , but $\{v_{1},v_{2}\}\neq\{w_{1},w_{2}\}$ . Let $j$ be the maximal index such that for all $1\leq i\leq j-1$ , $\{v_{i},v_{i+1}\}\notin E_{2}$ . Let $P_{1}$ and $P_{2}$ be the edge sets from the two paths in $E_{2}$ from $v_{1}$ to $v_{j}$ . Since $|P_{1}|+|P_{2}|$ is odd, we can combine one of $P_{1}$ or $P_{2}$ with the edges from $\{\{v_{1},v_{2}\},\dots,\{v_{j-1},v_{j}\}\}$ to form an even circuit of length at most $n+m\leq 2M$ . ∎

Lemma 3.5.

Let $C_{1},\dots,C_{n}$ be the sets of edges in a maximal collection of edge-disjoint odd circuits in $G$ , each with length at most $N^{1/12}$ and each having a vertex from $\mathcal{P}_{\square}$ . Then $n=O(N^{1/3+o(1)})$ , so $\sum_{k=1}^{n}|C_{k}|=O(N^{5/12+o(1)}).$

Proof.

For each $k\in[n]$ , let $\{v_{1}^{k},\dots,v_{|C_{k}|}^{k}\}$ be the set of vertices in the cycle $C_{k}$ indexed so that $v_{1}^{k}\in\mathcal{P}_{\square}$ and $\{\{v_{1}^{k},v_{2}^{k}\},\{v_{2}^{k},v_{3}^{k}\},\dots,\{v_{|C_{k}|}^{k},v_{1}^{k}\}\}=C_{k}$ . For each $k$ , let $p_{k}$ be the prime corresponding to $v_{1}^{k}$ , and let $a_{k}$ be the element of $A$ such that $p_{k}^{2}$ divides $a_{k}$ . Note that by Proposition 3.4, the elements $a_{k}$ are distinct for distinct $k$ . Let $A_{0}^{k}$ be the set of elements of $A$ corresponding to the edges $\{v_{1}^{k},v_{2}^{k}\},\{v_{3}^{k},v_{4}^{k}\},\dots,\{v_{|C_{k}|-1}^{k},v_{|C_{k}|^{k}}\}$ , and let $A_{1}^{k}$ be the set of elements of $A$ corresponding to the other edges in $C_{k}$ , union $\{a_{k}\}$ .

Then

(\mathbf{V}_{\text{large}}\times\mathbf{V}_{\text{med}})\left(\prod_{a\in A_{0}^{k}}a\right)=(\mathbf{V}_{\text{large}}\times\mathbf{V}_{\text{med}})\left(\prod_{a\in A_{1}^{k}}a\right).

Thus, for any choice of $\mathbf{\epsilon}\in\{0,1\}^{n}$ ,

\prod_{k=1}^{n}\prod_{a\in A_{\epsilon_{k}}^{k}}a

has the same valuation at all large and medium primes. By Proposition 2.1, we thus have $2^{n}=\exp(O(N^{1/3+o(1)}))$ , as desired. ∎

By a similar argument to Corollary 3.3, by removing $O(N^{5/12+o(1)})$ edges, we may assume without loss of generality that $G$ has no odd cycles of length at most $N^{1/12}$ with a vertex from $\mathcal{P}_{\square}$ .

Example 3.6.

Suppose $A=\{15,65,84,154,143,9\}$ , $\mathcal{P}_{\text{small}}=\{2\}$ , $\mathcal{P}_{\text{med}}\cup\mathcal{P}_{\text{large}}=\{3,5,7,11,13\}$ , and $\mathcal{P}_{\square}=\{3\}$ . Below is the prime factorization graph of $A$ . The product of the red elements (including $9$ ) and the product of the blue elements have the same valuations at all large and medium primes.

Lemma 3.7.

Let $E$ denote the edge set of $G$ . There is a set of edges $E^{\prime}\subseteq E$ of size at most $\frac{1}{2}(|\mathcal{P}_{\not\square}|+1)$ such that the graph with edge set $E\setminus E^{\prime}$ contains no cycles of length at most $N^{1/12}$ .

Proof.

Let $C_{1},\dots,C_{n}$ be the sets of edges of all cycles of length at most $N^{1/12}$ in $G$ . By Proposition 3.4, we may assume that these sets are disjoint and that they share no vertices. None of these cycles have a vertex from $\mathcal{P}_{\square}$ , and no vertices in $\mathcal{P}_{\text{large}}$ are adjacent, so each $C_{k}$ contains an edge between two vertices from $\mathcal{P}_{\not\square}\cup\{1\}$ . Since the vertex sets are disjoint, we thus have $2n\leq|\mathcal{P}_{\not\square}\cup\{1\}|$ . We may take $E^{\prime}$ to consist of one edge from each cycle $C_{k}$ . ∎

4 Proof of Theorems 1.3 and 1.5

Having removed all cycles of length at most $N^{1/12}$ from the prime factorization graph, we are ready to estimate the number of edges in it. We first record the following general estimate. This can be deduced from a result of Alon, Hoory, and Linial [1], but it is easier in our case, so we present a self-contained proof. The proof uses a standard breadth-first search argument; a similar argument can be found in [6], Theorem 1.6.5, for example.

Lemma 4.1.

Let $G$ be a graph with $n$ vertices and at least $(1+c)n$ edges. Then $G$ has a cycle of length at most $\frac{2(c+1)}{c}(\log_{2}(n)+1)$ .

Proof.

Without loss of generality, we may assume that $G$ is connected, since we may pass to a component satisfying $|V|\geq(1+c)|E|$ . We can remove degree $1$ vertices and their incident edges while preserving the inequality $|V|\geq(1+c)|E|$ , so we may assume without loss of generality that each vertex has degree at least 2. If $G$ contains a path of length greater than $\frac{c+1}{c}$ , then we can remove all internal edges and vertices from that path while preserving the inequality $|V|\geq(1+c)|E|$ , so we may assume that $G$ has no such paths.

With all these assumptions set, we fix a vertex $v_{0}\in V$ and consider a breadth-first search starting at $v_{0}$ . For each $k\in\mathbb{N}$ , let $V_{k}=\{v\in V:\text{ the shortest path from $v$ to $v_{0}$ has length $k$}\}$ . Fix $k\in\mathbb{N}$ with $1\leq k\leq\frac{c+1}{c}(\log_{2}(n)-1)$ and let $\ell$ be the least integer greater than $k+\frac{c+1}{c}$ .

If there is a $v\in v_{\ell}$ such that there are two distinct paths of length $\ell$ from $v$ to $v_{0}$ , then we are done, since $v$ must thus contain a cycle of length at most $2\ell$ . Otherwise, for each $v\in v_{\ell}$ , let $\varphi(v_{\ell})\in V_{k}$ be the vertex from $V_{k}$ on the shortest path from $v_{\ell}$ to $v_{0}$ . For each $v\in V_{k}$ , we must have $|\varphi^{-1}(v)|\geq 2$ , since otherwise the path from $v$ to the only element in $\varphi^{-1}(v)$ would be a path of length greater than $\frac{c+1}{c}$ consisting only of degree two vertices. We thus have $|V_{\ell}|\geq 2|V_{k}|$ . Letting $m$ be the least integer greater than $\frac{c+1}{c}\log_{2}(n)$ , we must thus have $|V_{m}|>2^{\log_{2}(n)}=|V|$ , so there must be a $v\in V_{m}$ with at least two distinct paths from $v$ to $v_{0}$ . There is thus a cycle of length at most $2m$ in $G$ , as desired. ∎

We will apply Lemma 4.1 with $c=O(N^{-1/12+o(1)}).$ The following fact is required to ensure that the error term from Lemma 4.1 is multiplied by $\pi(N^{1/2})$ instead of $\pi(N)$ in our final estimate.

Lemma 4.2.

Let $G$ be the prime factorization graph of $A$ , containing no even circuits of length at most $2N^{1/12}$ . The set of vertices from $\mathcal{P}_{\text{large}}$ with degree at least $2$ has size at most $(1+O(N^{-1/12+o(1)}))\pi(N^{1/2})$ .

Proof.

Let $\mathcal{Q}\subseteq\mathcal{P}_{\text{large}}$ be the set of vertices of degree at least $2$ in $G$ . Recall that no pair of vertices in $\mathcal{P}_{\text{large}}$ is connected by an edge, so if $p\in\mathcal{Q}$ , then there are distinct $v_{1},v_{2}\in\mathbf{V}_{\text{med}}\cup\{1\}$ which are each connected to $p$ by an edge.

Consider the graph with vertex set $\{1\}\cup\mathcal{P}_{\text{med}}$ , where $v_{1}$ is adjacent to $v_{2}$ if there is a path of length $2$ from $v_{1}$ to $v_{2}$ in $G$ with the middle vertex in $Q$ . By construction, there is a surjection from the set of edges in this graph to $\mathcal{Q}$ . Moreover, this graph is simple since $G$ contains no $4$ -cycles.

Any cycle of length $m$ in this graph corresponds to a circuit of length $2m$ in $G$ . This graph thus does not contain any cycles of length at most $N^{1/12}$ . Applying Lemma 4.1 with $c=\frac{4\log_{2}(N)}{N^{1/12}}$ , we thus find that the number of edges is at most $(1+O(N^{-1/12+o(1)}))(|\mathcal{P}_{\text{med}}|+1)$ , so $|\mathcal{Q}|\leq(1+O(N^{-1/12+o(1)}))\pi(N^{1/2})$ . ∎

We are now ready to prove Theorem 2.7, from which Theorems 1.3 and 1.5 follow immediately.

Proof of Theorem 2.7.

Let $A\subseteq[N]$ have distinct subset products. By Corollary 2.3, we may remove $O(N^{1/3+o(1)})$ elements from $A$ to ensure that $\mathbf{V}_{\text{large}}\times\mathbf{V}_{\text{med}}$ is injective and nonzero when restricted to $A$ . We may now let $G=(V,E)$ be the prime factorization graph of $A$ .

By Corollary 3.3 and Lemma 3.5, we may remove $O(N^{5/12+o(1)})$ elements from $E$ to ensure that $A$ has no even circuits of length at most $2N^{1/12}$ , and to ensure that $A$ has no odd cycles of length at most $N^{1/12}$ with a vertex from $\mathcal{P}_{\square}$ . Then, we may apply Lemma 3.7 and remove $\frac{1}{2}|\mathcal{P}_{\not\square}|+O(1)$ edges to remove all odd cycles of length at most $N^{1/12}$ from $G$ .

Let $\mathcal{Q}$ be the set of all vertices in $\mathcal{P}_{\text{large}}$ with degree at least two. We remove all vertices from $\mathcal{P}_{\text{large}}\setminus\mathcal{Q}$ and their incident edges. Let $V^{\prime}$ and $E^{\prime}$ be the remaining sets of vertices and edges after all these removals. We have

|V^{\prime}|\leq 1+\pi(N^{1/2})+|\mathcal{Q}|

and

|E^{\prime}|\geq|E|-(|\mathcal{P}_{\text{large}}|-|\mathcal{Q}|)-\frac{1}{2}|\mathcal{P}_{\not\square}|-O(N^{5/12+o(1)}).

The graph $(V^{\prime},E^{\prime})$ has no cycles of length at most $N^{1/12}$ . Applying Lemma 4.1 with $c=\frac{4\log_{2}(N)}{N^{1/12}}$ , we find

|E^{\prime}|\leq(1+O(N^{-1/12+o(1)})|V^{\prime}|.

This results in the inequality

|E|-(|\mathcal{P}_{\text{large}}|-|\mathcal{Q}|)-\frac{1}{2}|\mathcal{P}_{\not\square}|\leq(1+O(N^{-1/12+o(1)}))(\pi(N^{1/2})+|\mathcal{Q}|)+O(N^{5/12+o(1)})=

\pi(N^{1/2})+|\mathcal{Q}|+O(N^{5/12+o(1)}),

where we used Lemma 4.2 so that $O(N^{-1/12+o(1)})|\mathcal{Q}|=O(N^{5/12+o(1)})$ .

Finally, using the equation $|A|=|E|+|\mathcal{P}_{\square}|+O(N^{1/3+o(1)})$ , we find

|A|\leq\pi(N^{1/2})+|\mathcal{Q}|+|\mathcal{P}_{\text{large}}|-|\mathcal{Q}|+\frac{1}{2}|\mathcal{P}_{\not\square}|+|\mathcal{P}_{\square}|+O(N^{5/12+o(1)})

\leq\pi(N)+\frac{1}{2}\pi(N^{1/2})+\frac{1}{2}|\mathcal{P}_{\square}|+O(N^{5/12+o(1)}),

since $\mathcal{P}_{\text{large}}=\pi(N)-\pi(N^{1/2})$ and $|\mathcal{P}_{\square}|+|\mathcal{P}_{\not\square}|\leq\pi(N^{1/2})$ . ∎

5 Examples

In this section, we will discuss some examples of large sets with distinct subset products.

Our graph-theoretic perspective enables us to produce some new examples of sets $A\subseteq[N]$ with distinct subset products and $|A|\approx\pi(N)+\pi(N^{1/2})$ .

Example 5.1.

Let $G$ be a tree with vertex set $\mathcal{P}_{\text{med}}\cup\mathcal{P}_{\text{small}}$ . Let $A=\mathcal{P}_{\text{large}}\cup\{p^{2}:p\in\mathcal{P}_{\text{med}}\cup\mathcal{P}_{\text{small}}\}\cup\{pq:p,q\in\mathcal{P}_{\text{med}}\cup\mathcal{P}_{\text{small}}\text{ are connected by an edge in $G$}\}$ . Then $|A|=\pi(N)+\pi(N^{1/2})-1$ and $A$ has distinct subset products.

Next, we will produce an example of a set $|A|$ with distinct subset products and $|A|\geq\pi(N)+\pi(N^{1/2})+\frac{1}{3}\pi(N^{1/3})-O(1)$ , establishing Theorem 1.4.

Proof of Theorem 1.4.

Construct a set $A$ as follows. For any prime $p\in(N^{1/3},N]$ , let $p\in A$ . For any prime $p\in(N^{1/3},N^{1/2}]$ , let $p^{2}\in A$ . Let $p,q,r$ be distinct primes in $[N^{1/3}]$ . The set $\{p^{2}q,p^{2}r,p^{2},qr,p^{3},q^{3},r^{3}\}$ is a subset of $[N]$ with distinct subset products. We may thus divide the primes from $[N^{1/3}]$ into disjoint subsets of size $3$ (perhaps leaving one or two primes out) and for each triple $\{p,q,r\}$ , add the $7$ elements $\{p^{2}q,p^{2}r,p^{2},qr,p^{3},q^{3},r^{3}\}$ to $A$ . The number of elements of $A$ is thus at least

\pi(N)-\pi(N^{1/2})+2(\pi(N^{1/2})-\pi(N^{1/3}))+7\left(\frac{1}{3}\pi(N^{1/3})-O(1)\right)=

\pi(N)+\pi(N^{1/2})+\frac{1}{3}\pi(N^{1/3})-O(1).\qed

Finally, we may use our graph-theoretic perspective to produce a set of squarefree integers with distinct subset products and nearly $\pi(N)+\frac{1}{2}\pi(N^{1/2})$ elements, establishing Theorem 1.6.

Proof of Theorem 1.6.

Let $\epsilon>0$ , and let $N$ be sufficiently large (depending on $\epsilon$ ). By the prime number theorem, for every $k\in[1,1/\epsilon-2]$ , the number of primes $p$ satisfying the inequality

N^{1/2}-(k+1)\epsilon N^{1/2}\leq p<N^{1/2}-k\epsilon N^{1/2}

is less than twice the number of primes $p$ satisfying

N^{1/2}+(k-1)\epsilon N^{1/2}<p\leq N^{1/2}+k\epsilon N^{1/2}.

We may thus list all but at most $3\epsilon N^{1/2}$ of the primes in $[1,N^{1/2}]$ as $q_{1},r_{1},q_{2},r_{2},\dots,q_{n},r_{n}$ so that there are distinct primes $p_{1},\dots,p_{n}$ in $(N^{1/2},N]$ with $p_{i}q_{i}\leq N$ , $p_{i}r_{i}\leq N$ for all $i$ .

Let $\mathcal{Q}$ be the set of large primes not in $\{p_{1},\dots,p_{n}\}$ . Let

A=\mathcal{Q}\cup\bigcup_{i=1}^{n}\{p_{i}q_{i},p_{i}r_{i},q_{i}r_{i}\}\cup\bigcup_{i=1}^{n}\{r_{i}q_{i+1}\}.

Then $|A|\geq\pi(N)+\frac{1}{2}\pi(N^{1/2})-3\epsilon\pi(N^{1/2})-2$ and $A$ has distinct subset products.

The prime factorization graph of $A$ is as follows (omitting the vertices from $\mathcal{Q}$ ); it is constructed so that the argument from Lemma 3.7 is sharp.

We have established that for all $\epsilon>0$ , there is an $N_{0}\in\mathbb{N}$ such that for all $N\geq N_{0}$ , $h(N)\geq\pi(N)+\frac{1}{2}\pi(N^{1/2})-3\epsilon\pi(N^{1/2})-2$ . Thus, $h(N)\geq\pi(N)+\frac{1}{2}\pi(N^{1/2})-o(\pi(N^{1/2}))$ . ∎

References

[1] Noga Alon, Shlomo Hoory, and Nathan Linial. The Moore bound for irregular graphs. Graphs and Combinatorics, 18:53–57, 2002.
[2] Quentin Dubroff, Jacob Fox, and Max Wenqiang Xu. A note on the Erdős distinct subset sums problem. SIAM Journal on Discrete Mathematics, 35(1):322–324, 2021.
[3] Paul Erdős. Extremal problems in number theory II. Mat. Lapok, 17:135–155, 1966.
[4] Paul Erdős. Some applications of graph theory to number theory. In The Many Facets of Graph Theory: Proceedings of the Conference held at Western Michigan University, Kalamazoo/MI., October 31–November 2, 1968, pages 77–82. Springer, 1969.
[5] Stefan Steinerberger. Some remarks on the Erdős distinct subset sums problem. International Journal of Number Theory, 19(08):1783–1800, 2023.
[6] Yufei Zhao. Graph Theory and Additive Combinatorics: Exploring Structure and Randomness. Cambridge University Press, 2023.

Sharp Bounds for Sets with Distinct Subset Products

Abstract

1 Introduction

Example 1.1.

Question 1.2 (Erdős #795).

Theorem 1.3.

Theorem 1.4.

Theorem 1.5.

Theorem 1.6.

1.1 Notation and strategy of proof

Definition 1.7 (Subset Product Set).

Definition 1.8 (Small, Medium, and Large Primes, Valuations).

Definition 1.9 (Big-O and Little-o Notation).

Definition 1.10 (Paths, Cycles, Circuits).

Acknowledgements

2 Constructing the Prime Factorization Graph

Proposition 2.1.

Proof.

Proposition 2.2.

Proof.

Corollary 2.3.

Definition 2.4 (Prime Factorization Graph).

Definition 2.5 (𝒫□\mathcal{P}_{\square}, 𝒫□̸\mathcal{P}_{\not\square}).

Example 2.6.

Theorem 2.7.

3 Cycle Removal

Lemma 3.1.

Proof.

Example 3.2.

Corollary 3.3.

Proof.

Proposition 3.4.

Proof.

Lemma 3.5.

Proof.

Example 3.6.

Lemma 3.7.

Proof.

4 Proof of Theorems 1.3 and 1.5

Lemma 4.1.

Proof.

Lemma 4.2.

Proof.

Proof of Theorem 2.7.

5 Examples

Example 5.1.

Proof of Theorem 1.4.

Proof of Theorem 1.6.

References

Definition 2.5 ( $\mathcal{P}_{\square}$ , $\mathcal{P}_{\not\square}$ ).