1. Introduction
Let be a cubic form in variables.
Let be the hypersurface in .
Let be a smooth, compactly supported function.
Let be the closure of the set in .
Assume that is smooth and (for convenience) that .
For reals , let
(1.1) |
|
|
|
Let be the set of vector spaces over of dimension with .
The “Hardy–Littlewood model” for can fail via if .
But based on [hooley1986some]*Conjecture 2, [franke1989rational], [vaughan1995certain]*Appendix, [peyre1995hauteurs], et al., one may conjecture
(1.2) |
|
|
|
for a certain predicted constant and integer , where if .
Here is the rank of the Picard group of .
For further discussion and references on (1.2), see [bombieri2009problems]*§2 or [wang2022thesis]*§6.5.
The natural analog of (1.2) for rational points is closely related to (1.2), and is in fact equivalent when , but the precise sieve-theoretic relationship is delicate when ;
cf. [heath1996new]*paragraph preceding Corollary 2 and [heath1996new]*Theorem 8 versus Corollary 2 on quadrics.
In the present paper, we seek to organically detect, within the delta method of [duke1993bounds, heath1996new], the locus featured in (1.2).
In the delta method, as in [kloosterman1926representation]’s circle method,
one averages over numerators to a given denominator (modulus), and often then invokes Poisson summation.
Set as on [heath1998circle]*p. 676; then
(1.3) |
|
|
|
(for all ) by [heath1996new]*Theorem 2, (1.2), up to easy manipulations from §3,
where
(1.4) |
|
|
|
for a certain fixed smooth function
usually left in the background
(see [heath1998circle]*(2.3) for the precise definition of ),
and where
(1.5) |
|
|
|
Here , and in the variable runs over .
See Proposition 5.1 for the basic analytic properties
of .
Given ,
the sum is multiplicative in ,
and roughly governed by
the solutions to over .
Let be the intersection in .
There is a classical discriminant polynomial associated to the family .
The equation cuts out the projective dual variety of .
See §2 for details on , .
It is natural to analyze the right-hand side of (1.3) separately over and .
In another paper ([wang2023_HLH_vs_RMT]), we conditionally address for certain , .
In the present (self-contained) paper, we focus on .
From this locus in (1.3),
our main theorem unconditionally isolates , for certain :
Theorem 1.1.
Suppose is diagonal and .
For as above, we have
(1.6) |
|
|
|
For ,
Theorem 1.1
follows from [heath1998circle]*Lemmas 7.2 and 8.1,
with a better error of .
As we will explain in §8,
the case seems to present new difficulties, perhaps most easily resolved using our new, more geometric, strategy.
But [heath1998circle] does bound the left-hand side of (1.6) by for ,
giving us a good foundation to build on.
In view of [glas2022question], one might also hope to adapt our results to function fields.
Our main results assume is diagonal, but we do also discuss non-singular to a nontrivial extent.
In particular, we will isolate explicit technical ingredients
(listed in Remark 1.6)
that—if true more generally—would
allow one to generalize Theorem 1.1.
If (as in Theorem 1.1), the set is known to be finite
for general reasons (recalled in §3 below).
Furthermore, we can relate (1.6) to (1.2) by inclusion-exclusion:
|
|
|
For each , if we define as in (4.1), we also have (for all )
(1.7) |
|
|
|
If is diagonal,
has an explicit classical description:
see Proposition 3.6.
In general, linear subvarieties of cubics
are closely tied to the -invariant introduced by [davenport1962exponential, davenport1964non].
(See [dietmann2017h]*Lemma 1.1 for a more general relationship.)
In fact,
the present §7 relies on
a convenient choice of
“-decompositions of ”
corresponding to the elements of .
We believe that with enough work, the power saving in Theorem 1.1 could be improved.
It would be very interesting to go past for .
Theorem 1.1 has the following corollary, which we need for subsequent work ([wang2023_HLH_vs_RMT]).
Corollary 1.2.
Let .
Define , as in §6.
Then in the setting of Theorem 1.1,
|
|
|
Proof.
Combine (1.6) (on ) with
(6.5) (on ).
Note that .
∎
Theorem 1.1 and Corollary 1.2 are completely unconditional.
When and is diagonal,
these results let us reformulate
the conjecture (1.2)
as a statement purely about cancellation in
the sum .
A similar reformulation might be possible much more generally.
But at least when ,
subtleties in the constant in (1.2)
would likely demand
a recipe
beyond “restriction to ” in (1.3).
Before proceeding, we make some convenient definitions.
Let
(1.8) |
|
|
|
then in particular,
.
The following is also convenient:
Definition 1.3.
Given a vector space over ,
let be the orthogonal complement of
with respect to .
Then let and .
We now sketch the proof of Theorem 1.1.
The proof starts generally,
observing that for all
(see Proposition 3.1).
Conversely, at least for diagonal ,
most ’s on the left-hand side of (1.6)
are in fact linear, in the sense of the following definition:
Definition 1.4.
Call a solution to linear
if .
We expect “typical” linear ’s to be the simplest.
Proposition 3.1(2), which establishes a vanishing baseline for the jets over ,
thus inspires the following definition:
Definition 1.5.
Call unsurprising if uniformly over reals ,
the box contains at most points in the union of the following two sets:
(1.9) |
|
|
|
We prove (in §8) that when is diagonal,
is unsurprising,
so that if , then almost all solutions to are linear with nonzero -jet.
A weaker version of Definition 1.5, with in place of , would suffice for qualitative purposes.
For the “least degenerate” linear ’s,
Lemma 7.11 isolates an explicit positive bias
(1.10) |
|
|
|
for most primes ,
with and .
The resulting reduction in arithmetic complexity of lets us
dramatically simplify
by Poisson summation over various individual residue classes with dividing .
When , we must also carefully separate from ;
we use Lemma 6.1 (decay of the singular series over large moduli).
Eventually, appears, yielding (1.6).
As the positivity of for linear ’s might suggest,
we do not need cancellation over in (1.6).
The deepest result we use on -functions (when ) is
the (purely local) Weil bound for hyperelliptic curves of genus .
However, it might be possible to improve the error term in Theorem 1.1 using deeper results on -functions.
The following remark proposes axioms that would let us go beyond diagonal .
In Remark 1.6,
we expect (4) to be the most tractable in general out of (1), (3), and (4).
In fact, [wang2023dichotomous, BGW2024forthcoming] already make progress on (4).
Also, [salberger2023counting] could be helpful for (1), at least if .
Furthermore, the case where and is diagonal might be fully accessible, and might provide insight on secondary terms in the circle method.
This would be similar in spirit to [getz2018secondary],
where secondary terms were obtained for quadrics in an even number of variables.
The most mysterious axiom might be (3), but it does at least hold for diagonal , and could plausibly follow from some undiscovered geometric stratification.
Proposition 1.7.
Suppose is diagonal, and is arbitrary.
Then 1.6(3) holds.
Proof sketch.
Let .
One can prove (1.12) and (1.13) using (5.2).
(See [wang2023_large_sieve_diagonal_cubic_forms]*§7;
(1.12) holds with in place of .)
Also, Observation 3.10 implies (1.11).
∎
We now outline the rest of the paper.
§2 collects some classical algebraic geometry used
as a black box in parts of §3.
§3 suggests a geometric backbone in (1.3) for
the eventual harmonic detection of .
§4 proves some identities
to be used in §8 to obtain the expected “main terms” in (1.6).
§5 provides general upper bounds to be used at several points in §8.
§6 analyzes the contribution in (1.3),
while also proving Lemma 6.1.
§7 proves some new asymptotic formulas for (see (1.10) and Lemma 7.11).
§8 uses Lemmas 6.1 and 7.11,
plus results from §§3–5 and [heath1998circle],
to prove Theorem 1.1.
1.1. Conventions
We let denote the discriminant of , so that if and only if is smooth.
We always assume is smooth, unless specified otherwise.
We define to be if holds, and if does not hold.
We let ,
and .
Also, we use the following notation for integrals:
|
|
|
We let (if is finite and nonempty).
If , then we assume (after scaling if necessary) that
(1.14) |
|
|
|
For a vector space , we let denote the projectivization of .
We let denote the usual Hessian matrix of .
We let .
We let ;
we let (for ) and (for ).
We let
(1.15) |
|
|
|
denote the -jet of ,
recording all partial derivatives of of order .
We write , or , to mean “ for some real ”.
We let denote a quantity that is .
We write if .
When making estimates, we think of , , as fixed constants.
2. Algebraic geometry background
We do algebraic geometry in the language of schemes, with the symbols and interpreted scheme-theoretically unless specified otherwise.
We call a reduced, locally closed subscheme of projective space over a field a variety.
Ultimately, geometry lets us cut out subsets of , and count points over finite rings, in rigorous ways (see e.g. the key Lemma 7.11).
Most of our work can be interpreted classically, in terms of points over algebraically closed fields.
The rest of this section reviews the geometry of the gradient map
|
|
|
For diagonal ,
a more explicit analysis is possible
(see §3.2).
Since is smooth (and ), the gradient defines a morphism .
(We will often identify the dual projective space with , using as projective coordinates.)
The map , known as the polar map of , is finite surjective of degree , by dimension and intersection theory;
cf. [dolgachev2012classical]*p. 29.
Since is smooth, must then be flat (by “miracle flatness”).
Definition 2.1.
Let be the scheme-theoretic image of under .
Since is a smooth projective hypersurface, is the dual variety of .
Upon restricting to ,
we get the finite surjective Gauss map .
Here is geometrically irreducible, so must be too.
Since is finite, must therefore be a geometrically integral hypersurface, i.e. the zero scheme of some absolutely irreducible form .
The definition of then implies that if , then if and only if the projective scheme over is singular.
(At least for diagonal ,
one can explicitly compute ;
see (3.3) in §3.2.)
The polar map of is a rational map defined away from , the set of singular points of .
(Here is a proper closed subset of .
It is known that is singular, since ; see e.g. [wang2023dichotomous]*Proposition 4.4.)
The reflexivity theorem says that .
The biduality theorem says that if and are smooth points,
then if and only if .
(Both facts are on [dolgachev2012classical]*p. 30.)
For us, is smooth,
so biduality implies that the polar maps , restrict to
inverse morphisms between and .
It is known that [dolgachev2012classical]*p. 33, (1.47).
Since is a finite surjective morphism of smooth varieties (and in particular, is generically étale),
its ramification theory is well-behaved.
Following [stacks-project]*Tag 0BWJ,
let be the closed subscheme of cut out by the different ideal of .
Following [stacks-project]*Tag 0BW8 and Tag 0BWA, let be the norm of (or equivalently, the discriminant of ).
In our setting, is an effective Cartier divisor in , and is thus an effective Cartier divisor in .
The points of are precisely those at which is unramified.
Furthermore, we have a set-theoretic equality .
Definition 2.3.
Call and the ramification divisor and branch divisor of , respectively.
Let and be homogeneous polynomials defining in and in , respectively.
Since , one can show that and are nonempty, and thus hypersurfaces.
In fact,
by [dolgachev2012classical]*p. 29, Proposition 1.2.1,
,
where denotes the subscheme of .
Proposition 2.4.
Say we let vary over the locus .
Then one can choose and to be polynomials in and the coefficients of .
Proof.
This is possible by standard “universal” constructions compatible with our definitions of and .
For , one can appeal to [stacks-project]*Tag 0BD2; norms respect base change (and thus vary nicely in families).
For ,
see e.g. Remark 2.2.
∎
It is known that [hooley1988nonary]*Lemma 1.
How about after applying ?
Question 2.5.
Is it necessarily true that ?
Question 2.5 comes up in Proposition 3.1,
but we happen to be able to sidestep it there.
5. General upper bounds
We first recall some background on (1.3).
Let .
Recall (1.8).
Proposition 5.1 (See e.g. [wang2022thesis]*§3.1,
or [heath1998circle]*(2.3)–(3.7)).
The following hold:
-
(1)
vanishes over , for some positive independent of .
-
(2)
For any fixed , we have over .
-
(3)
We have
|
|
|
Now fix a set and a real .
Suppose has size for all reals .
Let
(5.1) |
|
|
|
At several points
in §8,
Lemma 5.2 will let us cleanly discard for various choices of .
Lemma 5.2.
Assume is diagonal.
Suppose .
Assume and .
Then .
The work [heath1998circle]*pp. 688–689 proves a simplified version of Lemma 5.2
with ,
and with instead of .
The method of [heath1998circle] should directly extend to Lemma 5.2.
However, we transpose Heath-Brown’s argument a bit in order to highlight an intermediate result that one may hope to generalize: Proposition 5.3.
Proposition 5.3 offers a potential partial alternative to axiom (3) in Remark 1.6.
Let denote the usual -adic valuation.
For integers ,
let and ; and for convenience, let and .
A positive integer is said to be square-full if ,
and cube-full if .
In the absence of a deeper algebro-geometric understanding of ,
one relies heavily on the following bound of Hooley and Heath-Brown, valid for diagonal (for all and ):
(5.2) |
|
|
|
(See [wang2023_large_sieve_diagonal_cubic_forms]*Proposition 4.9 or [wang2022thesis]*Proposition 3.3.3 for precise references.)
Suppose , and assume is diagonal.
Let .
For each , the coefficient of the homogeneous polynomial is nonzero, by (3.3).
Suppose with ;
then for at least two indices .
In particular,
(5.3) |
|
|
|
by [heath1998circle]*p. 688, (7.3).
We also have the following result (for , as above):
Proposition 5.3.
Here .
Proof.
Let .
Write with , so that is primitive.
For each prime ,
the equation implies, via (3.3), that .
The bound (5.2) for now implies
|
|
|
(since ,
and whenever ).
Thus
(5.4) |
|
|
|
However, for any integer and real , the sum is
|
|
|
Since , the right-hand side of (5.4) is therefore .
But , so Proposition 5.3 follows.
∎
Proof of Lemma 5.2.
By Proposition 5.1, (5.3), Proposition 5.3, and dyadic decomposition of , the quantity (see (5.1)) is
|
|
|
Our hypotheses on imply
(5.5) |
|
|
|
But the Dirichlet series converges absolutely for .
Since , the right-hand side of (5.5) is therefore .
So
(5.6) |
|
|
|
The total exponent of in (5.6) is ,
and the total exponent of in (5.6) is (since for ).
It follows that
|
|
|
Since , we get .
∎
Now drop the earlier assumptions “” and “ is diagonal”.
The axioms in Remark 1.6 would allow us to prove a version of Lemma 5.2 without assuming is diagonal.
Lemma 5.4.
Assume 1.6(2)–(3).
Say .
Then .
When , this beats the square-root threshold , corresponding to linear subspaces.
(Some degenerate ranges of ,
seem to prevent us from handling .)
Proof sketch for Lemma 5.4.
Under 1.6(2),
it is known that
(5.7) |
|
|
|
see e.g. [hooley2014octonary]*p. 252, (31).
It is also known that if is cube-free, then
(5.8) |
|
|
|
see e.g. [hooley2014octonary]*Lemmas 8–9.
Fix .
For integers , let
|
|
|
By (1.13), plus Cauchy on (1.12) over , it follows that if is sufficiently large, then
|
|
|
for some choice of with and .
Optimizing over yields
.
But (since ) and , so
|
|
|
Let , and plug in the bounds and , to get
|
|
|
But ,
so .
Here and , since .
∎
7. Establishing bias in exponential sums
In this section, we will realize (1.10) from §1.
We will start with general theory, and gradually impose restrictions.
It will be convenient to work over the -adic integers, .
Let be a prime.
Certain hyperplane sections govern the behavior of for , as ranges over or more generally, .
Let and denote the closed subschemes of
defined by the equations and , respectively.
Throughout the present §7 only,
let and ,
so that and
live over (not ).
We now recall some standard background recorded (with references) in [wang2022thesis]*§3.2.
Let .
In terms of the point counts and over , let
|
|
|
Then let
and .
By Deligne’s resolution of the Weil conjectures,
(7.1) |
|
|
|
Furthermore,
whenever , we have , or equivalently,
(7.2) |
|
|
|
For prime powers with , a different flavor of geometry, based on Hensel lifting, comes into play; cf. [hooley1986HasseWeil]*pp. 65–66.
For the sake of other work ([wang2023_HLH_vs_RMT]), we prove more than we presently need;
we encourage the reader to skip ahead to Corollary 7.5 on a first reading.
We start by identifying a clean source of cancellation in (1.5).
For each integer , let
(7.3) |
|
|
|
Lemma 7.1.
Let .
Let and .
Then , and
(7.4) |
|
|
|
Proof.
If , then ;
if , then .
Thus .
On the left-hand side of (7.4), write (with running over a complete set of residues modulo ).
Trivially, .
Also, since is homogeneous of degree , Taylor expansion (using ) gives
|
|
|
Furthermore,
(since is “homogeneous of degree ”,
and ),
whence (by (7.3)).
If , then the left-hand side of (7.4) directly vanishes.
Now suppose .
Then for each in (7.4), we have , whence the sum of
over (i.e. over ) vanishes.
Summing over gives (7.4).
∎
We now show that to understand for , it suffices to understand
(7.5) |
|
|
|
Before proceeding, note that by (1.5), (7.5), we have
(7.6) |
|
|
|
Lemma 7.2.
Fix a tuple and an integer .
Then
equals
-
(1)
if ,
and
-
(2)
if .
In particular,
if ,
then .
Proof.
Let ;
then Lemma 7.1 applies whenever .
Summing (7.4) over , we get (by (7.6))
|
|
|
Here ,
and ,
so (by (7.3)).
So if , then , which suffices (since if , and if ).
Now suppose .
If ,
then ,
so .
Now suppose .
Then , and .
Now write to get
|
|
|
Now let .
Then is determined by .
Therefore
|
|
|
which equals by (1.5).
This completes the proof.
∎
The general study of needs some setup.
For any vector , let .
Given and integers , let
(7.7) |
|
|
|
Let denote the usual Haar measure on , so that for all ,
we have
(7.8) |
|
|
|
Lemma 7.3.
Suppose and .
Let and be integers.
Then
(7.9) |
|
|
|
|
(7.10) |
|
|
|
|
Proof.
Let be integers.
Using (7.7) and the congruence
|
|
|
(valid since ), we find that
-
(1)
is invariant under addition by any element of (since for all , by (7.3)), and therefore
-
(2)
is invariant under .
Case 1: and .
Then and .
-
•
If , then the inclusion descends to a map
;
and this map has fibers of size , so (7.9) holds.
-
•
If , then the inclusion descends to a map
;
and this map has fibers of size , so (7.9) holds.
Case 2: .
Then , and for all .
So the inclusion descends to a map
|
|
|
This map has fibers of size , since for all and , we have and the “lifting congruence”
|
|
|
(This congruence holds because .)
Thus (7.10) holds.
∎
The next result synthesizes a lot of old and new Hensel work.
Proposition 7.4.
Let , and let .
Let if is smooth, and let if is singular.
Let .
Then
(7.11) |
|
|
|
Proof.
Lemma 7.1 applies, since .
Summing (7.4) over gives
|
|
|
Replacing with for , and summing over , we get
(via the scalar symmetries and that follow from (7.5) and (7.3), respectively)
(7.12) |
|
|
|
by a short calculation using (for ) and (7.8).
Let .
Before proceeding, we prove (by casework) that
(7.13) |
|
|
|
Case 1: .
Then is smooth, so .
Since , we conclude by (7.3), (7.7) that .
So both sides of (7.13) are empty.
Case 2: .
Then is smooth, and .
So by (7.7), we have (7.13).
Case 3: .
Then for all .
So by (7.7), we have (7.13).
Having established (7.13) in all cases, we now return to (7.12).
Since and , we may apply (7.10) (with and ) and (7.13) to get
|
|
|
But since and , we may use (7.9) to get
|
|
|
for all such that .
Thus the right-hand side of (7.12) equals , which simplifies to the right-hand side of (7.11).
So (7.12) implies (7.11).
∎
Corollary 7.5.
Suppose is primitive and is smooth.
Let .
Then
|
|
|
Proof.
Here , so (7.8) implies for .
Plug this into Proposition 7.4 (with );
then note that by Lemma 7.2.
∎
Now fix .
We will build up to Lemma 7.11 (realizing (1.10) from §1).
As Remark 1.6 suggests,
Lemma 7.11
might extend to more general .
But to maximize the accessibility of §7,
we focus on the diagonal case.
We will use an ad hoc change of coordinates,
highlighting specific features (of diagonal forms)
that may be of independent interest.
So for the rest of §7, assume is diagonal with .
Let be a permissible pairing corresponding to in Proposition 3.6.
Since is permissible, there exist unique cube-free integers (for ) such that is an integer cube for all and .
Suppose lies in or more generally, .
Assume
(7.14) |
|
|
|
In particular,
by (1.14),
we have
(7.15) |
|
|
|
Proposition 7.6.
Under (7.14) and (7.15), the following hold:
-
(1)
Each lies in .
-
(2)
If ,
then .
In particular,
, hence ,
for each .
Also,
when .
-
(3)
has exactly singular -points.
Proof.
(1):
By Definition 3.5,
for all .
Here by (7.15).
Next,
we use some results of §3.2,
carried over from to via Remark 3.11.
(2):
Use (7.14) and the -analog of Observation 3.10(1).
(3):
by Corollary 3.9,
carried over to .
But by (7.14).
So by the -analogs of Proposition 3.7 and (3.5), the scheme has
at least, but also at most, singular -points.
(Note that each singular -point of has all coordinates nonzero, by the Jacobian criterion, since by (2).)
∎
Assume, until further notice, that
(7.16) |
|
|
|
for all and .
For convenience, assume for each .
Then implies .
Let , so that
(7.17) |
|
|
|
Now consider the equations and defining .
These equations become
(7.18) |
|
|
|
after a linear change of variables over .
Explicitly,
if with ,
then we take and ,
so that the equation cuts out .
(We use the letter “” in analogy with
van der Corput or Weyl differencing.
The definition of is compatible with that in §4.)
Geometrically, over ,
the space
parameterizes projective -planes containing the fixed -plane .
Over this -space, we have the “quadratic fibration”
(related to the blow-up of along )
|
|
|
Concretely,
each slice
consists of and
a (possibly singular) quadric hypersurface of dimension ,
where is the fiber of over .
Below, let denote the Legendre symbol (if is odd), and write .
Lemma 7.7.
Under (7.14), (7.15), and (7.16),
we have .
Proof.
Let denote the affine cone of :
the subscheme of .
Then .
We must show that .
We count solutions to
using the coordinates in (7.18).
The locus contributes solutions to (7.18).
Let
|
|
|
and by slight abuse of notation, define the corresponding cones with origins removed:
|
|
|
Recall (7.17).
Since for all (by Proposition 7.6(2)),
the equation implies that if ,
and that , , are pairwise disjoint if .
Suppose first that .
Then is covered by a single affine chart, say with and .
The (remaining) defining equation becomes
|
|
|
Since (by Proposition 7.6(2)),
we get by comparing with .
Thus .
So , which is better than satisfactory.
Suppose next that .
Plugging into (7.18) identifies as the product of (with coordinate ) with a cone of the shape “ for ” (which has points by the previous paragraph).
Thus for each ,
and in total over .
For , first consider an individual with .
The equation (a ternary quadratic in ) has
solutions .
Indeed, if , then is an affine cone over a smooth conic ;
otherwise, is a non-degenerate affine quadric (and is thus the complement of a smooth conic in a smooth quadric in ).
Now identify with its affine chart .
The equation becomes .
Thus equals the sum of over (where we restrict so that ):
(7.19) |
|
|
|
where .
The count is times the right-hand side of (7.19).
Here ,
since by Proposition 7.6(2).
By a routine computer calculation,
the discriminant of the quintic polynomial
simplifies—up to a harmless “unit monomial” in , , —to
,
which lies in (by Proposition 7.6(2)).
Thus defines an affine hyperelliptic curve over of genus .
All in all, we have
(by the Weil bound for )
|
|
|
which simplifies to
,
as desired.
∎
Lemma 7.9.
Let be an integer.
Under (7.14), (7.15), and (7.16),
we have
(7.20) |
|
|
|
Proof.
By (1.8), the desired formula (7.20) is equivalent to
(7.21) |
|
|
|
By Proposition 7.6 and (7.17), we have
for all .
In particular, , so Corollary 7.5 applies, since is smooth by (7.15).
Consider the map
(7.22) |
|
|
|
Whenever lies over a smooth point of , the fiber of (7.22) over has size exactly (by Hensel’s lemma).
Therefore, Corollary 7.5 simplifies to
(7.23) |
|
|
|
where denotes the subset of lying over the singular locus of .
If ,
i.e. ,
for some ,
then :
in fact,
there are no with
, linearly dependent over .
So unless
(7.24) |
|
|
|
holds, we have by (7.23).
So, from now on, assume (7.24) holds.
By the conditions (7.24) and , there exists such that
for all .
Say for some choices ;
write when .
Then is the set of -points with .
But by Proposition 7.6(3),
the scheme has exactly singular points ,
which—by the Jacobian criterion, and the fact that for all —must
all lie in .
Explicitly,
these points arise from
the sign choices for which .
We now seek to count for .
Recall (7.18), expressing in terms of the coordinates.
Fix a point ,
say given in coordinates by and .
Upon writing ,
the system (7.18) modulo becomes
(7.25) |
|
|
|
Let be the set of solutions to (7.25) lying over our fixed .
Fix an affine chart (i.e. representatives in )
by setting identically over .
Write and
(with for now,
but all to be relevant below),
so .
Then (7.25) becomes
|
|
|
So ,
where is the (non-homogeneous, affine) system
|
|
|
Fix .
Clearly ,
while is isomorphic to a cone over a smooth
quadric in variables (i.e. in , of even dimension )
with discriminant in ,
so .
For ,
the origin of the cone
contributes points to ,
while points away from the origin (i.e. smooth points!)
lift uniformly to a total of points of .
Thus
(7.26) |
|
|
|
for and .
(The same holds for ,
provided we interpret .)
By induction on
(with base cases ),
we immediately find that is independent of
the choice of and the -adic square roots ;
furthermore,
for all ,
i.e. there is no -dependence or -dependence!
Finally,
by symmetry,
for all .
(For ,
recall .)
Thus (7.23) gives
|
|
|
for .
To prove (7.21),
it remains to show that
|
|
|
for .
To this end,
we compute (using (7.26) if ) to get
-
(1)
,
i.e. ,
for ;
-
(2)
,
i.e. ,
for ;
-
(3)
for ,
since .
By induction on ,
we are done,
since .
∎
For the rest of the paper, drop the assumption (7.16).
We can finally state and prove the main result of §7.
Let .
Lemma 7.11.
Assume is diagonal, with .
Let .
Assume (7.14).
Then
(7.27) |
|
|
|
Also, for all .
Finally,
(7.28) |
|
|
|
for all integers .
The implied constants in (7.27) and (7.28) depend only on .
Proof.
As we noted earlier, (7.14) implies (7.15).
Now consider the unique invertible -linear map such that for all and .
This map transforms into , where for all and .
If we let denote the dual linear map, then the following hold:
-
•
if , then , defined using as in (1.5), equals , defined using in place of ;
-
•
one can define the polynomial to be
times a power of ;
-
•
the vector space corresponding to is still associated to ;
and
-
•
we have for all .
By (7.15), we may thus assume (7.16) (when proving Lemma 7.11).
The claim (7.27) now follows upon plugging Lemma 7.7 and (7.1) into (7.2).
For the claim ,
see Proposition 7.6(1)–(2).
Finally, Lemma 7.9 implies (7.28) for .
∎
8. Main delta-method analysis
Fix in Theorem 1.1.
For each , recall , from Definition 1.3.
By Proposition 3.1, for all .
(Recall, from Definition 1.4, that we call such ’s linear.)
Since is diagonal,
Proposition 3.6
characterizes the linear ’s via certain pairings introduced in Definition 3.5.
More precisely,
the identification defines a bijection between
and the set of equivalence classes of permissible pairings of .
Consider the left-hand side of (1.6).
Recall the sets , from (1.9).
Proposition 8.1.
For reals , we have .
Proof.
This follows from the combinatorics of [heath1998circle]*p. 687.
(In Heath-Brown’s notation, any exponent coming from must lie in if , and in if .)
∎
Proposition 8.2.
For reals , we have .
Proof.
Let .
Let .
Suppose and .
Then by Observation 3.10(2),
there exist distinct with .
Fix and ;
then .
So ,
whence there exists with such that and .
Since , it follows (upon summing over all possibilities for , , , , ) that has size
.
∎
Propositions 8.1 and 8.2 imply that is
unsurprising (in the sense of Definition 1.5).
Lemma 5.2 then gives
the useful bound .
(Recall from (5.1).)
Corollary 8.3.
The equality (1.6) holds,
provided that for each , we have
(8.1) |
|
|
|
Proof.
Assume (8.1) for .
Since ,
the relation (8.1) implies
|
|
|
Upon summing over the finite set
(handling intersections using Lemma 5.2),
we obtain
|
|
|
By (1.7), (1.8), and the bound ,
the desired (1.6) follows.
∎
So (8.1) would imply Theorem 1.1.
The rest of §8 is devoted to the proof of (8.1).
Fix , and recall Proposition 3.6.
We first explain why Heath-Brown’s approach for in [heath1998circle] does not seem to directly extend to ;
we then describe our approach.
Using Lemma 4.6 (with and ) and Proposition 4.7, one can show that (in terms of certain quantities we briefly discussed in §4)
(8.2) |
|
|
|
cf. [heath1998circle]*p. 692, Poisson summation underlying Lemma 8.2.
When ,
Heath-Brown proves that in (8.2) captures the “-diagonal” contribution to (1.1),
and that the locus in (8.2) forms an “error term” of .
When and ,
the -diagonal in (1.1) strictly dominates the contribution to (8.2).
When and , however,
in (8.2) is comparable in size to the -diagonal in (1.1),
so that in (8.2) is likely no longer an error term.
Perhaps for typical ,
the sums can be analyzed in terms of -functions,
but it is not clear where the contribution to (8.2) would arise for .
To push (8.2) further—perhaps by considering small and large separately—thus seems technical and possibly delicate,
though it could be enlightening.
Our approach to (8.1) delays Poisson summation to the “endgame”, thus sidestepping
(8.2).
Consider the left-hand side of (8.1).
We open not with Poisson summation over , but with local geometry (Lemma 7.11).
Lemma 7.11 exposes a uniform bias in over ,
allowing us to decompose into simpler pieces (see (8.11)).
Even then, tricky issues remain (especially regarding the excised loci and ), but Lemma 7.11 is undoubtedly the driving force in our argument.
(However, if we could compute to greater precision when ,
that might reduce our reliance on Lemma 7.11.)
For , consider the Dirichlet series .
Let
|
|
|
be the Dirichlet series for .
By Lemma 7.11,
should typically resemble , to leading order.
So divide by to define the “error series”
(8.3) |
|
|
|
Since and , it follows from (8.3) that
(8.4) |
|
|
|
For convenience,
let .
The multiplicativity of , in leads to Euler products for , ,
and then to multiplicativity of .
We will need some basic properties of as a function of and .
Proposition 8.4.
The quantity is a function of and .
Also,
(8.5) |
|
|
|
Proof.
By the first sentence of Proposition 4.7,
depends at most on and .
But depends at most on the list of values ,
hence at most on and .
Yet by (8.3), the Dirichlet series identity holds for any .
Averaging formally (coefficient-wise) over , we get
(8.6) |
|
|
|
But by the final sentence of Proposition 4.7,
the left-hand side of (8.6) equals .
So (8.5) follows
formally by division.
∎
We now provide some bounds on for .
By (8.4), we have
(8.7) |
|
|
|
Let .
For any ,
the triangle inequality on (8.7), followed by an application of (5.8) to cube-free divisors of , yields
(8.8) |
|
|
|
If and , then (8.8) and (7.28) imply
(8.9) |
|
|
|
If , then (8.4) and (7.27) imply
(8.10) |
|
|
|
The bounds (8.8), (8.9), (8.10) are most useful in conjunction with multiplicativity.
Let
|
|
|
for each .
For each integer , let
|
|
|
Lemma 8.5 (Cf. [wang2023_large_sieve_diagonal_cubic_forms]*Lemma 3.4).
Let be integers.
Then .
Also, if , then
.
Proof.
The first bound is familiar.
It remains to prove the second.
Say .
Then the vector is nonzero, and thus has a nonzero coordinate .
But (by Rankin’s trick), and .
∎
Lemma 8.6.
Let and .
Suppose .
Then
|
|
|
Proof.
Any integer can be written (uniquely) as , where , , are pairwise coprime integers satisfying
,
,
.
Upon writing , and applying (8.10) to primes , (8.9) to primes , and (8.8) to , we get (by dyadic summation over , , )
|
|
|
Upon summing over , , (for each fixed ), we get, by Lemma 8.5,
|
|
|
But for .
And .
So Lemma 8.6 follows.
∎
Given in (8.1),
we may use (8.3) to decompose as a Dirichlet convolution:
(8.11) |
|
|
|
We will study the ranges and separately, for a parameter to be chosen in (8.16).
We first handle the range , using Lemma 8.6.
It turns out we will not need the full strength of Lemma 8.6
(which might however still be useful in the future).
Definition 8.7.
For a real number and a set , let
|
|
|
Similarly define (by replacing with ).
Using (8.11), one may rewrite the left-hand side of (8.1) as
(8.12) |
|
|
|
Lemma 8.8.
Uniformly over reals , we have
(8.13) |
|
|
|
Proof.
We proceed somewhat crudely.
By Proposition 5.1,
the absolute value of the left-hand side of (8.13)
is at most plus the quantity
(8.14) |
|
|
|
We now examine an individual .
By Observation 3.10(1), .
So
(8.15) |
|
|
|
by [heath1998circle]*Lemma 3.2.
Upon inserting (8.15) into (8.14), dyadically decomposing , and then applying Lemma 8.6, we find that the quantity (8.14) is
|
|
|
But (5.2) yields
(when ).
Plugging this in, and noting that by Lemma 8.5, we find that (8.14) is
|
|
|
in the notation of Definition 3.5.
But by Lemma 8.5 (or a calculation with Euler products).
So (8.14) is
|
|
|
Plugging in leads to (8.13).
∎
In terms of from Lemma 4.6, let
(8.16) |
|
|
|
We now turn to the range in (8.12).
Recall from Definition 8.7.
Lemma 8.10.
Uniformly over reals , we have
(8.17) |
|
|
|
Proof.
Suppose are integers with .
By the first part of Proposition 8.4,
only depends on .
Because , Lemma 4.6 therefore implies
|
|
|
By (8.5) and the equality ,
we conclude that
|
|
|
The left-hand side of (8.17) (see Definition 8.7) thus simplifies to
|
|
|
which equals by Proposition 8.11 (below).
∎
Proposition 8.11.
.
Proof.
By [heath1998circle]*final paragraph on p. 692, and second paragraph on p. 676, we have (in terms of the function defined on [heath1998circle]*p. 676)
|
|
|
Furthermore,
by [heath1996new]*Lemma 4,
so
|
|
|
Proposition 8.11 follows upon writing .
∎
Lemma 8.12.
Uniformly over reals , we have
(8.18) |
|
|
|
Proof.
The main subtlety here is that we must treat and separately.
First, given ,
the bound (8.8), applied directly to for each ,
implies
(8.19) |
|
|
|
Inserting (8.19) into (see Definition 8.7), and recalling (5.1), we get
|
|
|
where by Proposition 8.2 and Lemma 5.2.
Similarly, (8.19) gives
|
|
|
which is by (6.1) and Lemma 6.1 (summed over ).
But , since .
∎
By (8.13), (8.16), (8.17), and (8.18),
the quantity (8.12) simplifies to
|
|
|
matching
the right-hand side of (8.1).
So (8.1) holds, thus concluding §8.