Existence of Two View Chiral Reconstructions
Abstract.
A fundamental question in computer vision is whether a set of point pairs is the image of a scene that lies in front of two cameras. Such a scene and the cameras together are known as a chiral reconstruction of the point pairs. In this paper we provide a complete classification of point pairs for which a chiral reconstruction exists. The existence of chiral reconstructions is equivalent to the non-emptiness of certain semialgebraic sets. We describe these sets and develop tools to certify their non-emptiness. For up to three point pairs, we prove that a chiral reconstruction always exists while the set of five or more point pairs that do not have a chiral reconstruction is Zariski-dense. We show that for five generic point pairs, the chiral region is bounded by line segments in a Schläfli double six on a cubic surface with real lines. Four point pairs have a chiral reconstruction unless they belong to two non-generic combinatorial types, in which case they may or may not.
1. Introduction
A fundamental question in computer vision is whether a set of point pairs is the image of a set of world points that are visible in two cameras. If we ignore the constraint (as is commonly done) that the points need to lie in front of the cameras, we get a projective reconstruction [12]. In reality though, cameras can only see points in front of them. A reconstruction that obeys this additional constraint is known as a chiral reconstruction [1, 11]. The aim of this paper is to give a complete answer to the question: Given a set of point pairs , when does admit a chiral reconstruction?
Under the assumption that the points in each image are distinct, we prove the following facts.
-
(1)
A set of at most three point pairs always has a chiral reconstruction.
-
(2)
A set of four point pairs has a chiral reconstruction unless the configurations are of two specific non-generic types, in which case a chiral reconstruction may not exist.
-
(3)
Five or more point pairs can fail to have a chiral reconstruction with positive probability (in particular, even if they are in general position).
-
(4)
For five sufficiently generic point pairs, the problem translates to finding points in semialgebraic regions on a real cubic surface whose boundaries are segments of lines in a Schläfli double six of real lines, creating an unexpected bridge to classical results in algebraic geometry.
The study of chirality was initiated in [11] with follow up work by Werner, Pajdla and others [21, 22, 23, 24]. There is no agreement on the name for reconstructions that are chiral. Hartley in [11] and Werner & Pajdla in [24] call them strong realizations. Werner in [24, 22, 21] calls them oriented projective reconstructions. Here and in our previous work [1], we prefer the term chiral reconstruction. In [11], Hartley shows that chiral reconstructions are a special class of quasi-affine reconstructions. See [1, Section 5] for a detailed account of quasi-affine reconstructions in the context of our approach to chirality. Hartley’s work was done using projective geometry. In later work, such as [21], [22] and [23], the authors use oriented projective geometry [16], which also explains their use of the term oriented projected reconstructions. Following Hartley, our work uses projective geometry. Justifications of our choice of framework can be found in [1, Section 1]. We now comment briefly on how our results relate to the existing literature in computer vision and give more detailed citations in later sections.
A specific example of five point pairs in general linear position that do not admit a chiral reconstruction was given in [22] and appears again in [21]. Our result (3) shows that, in fact, the set of five point pairs without a chiral reconstruction is Zariski dense. The case of four point pairs covered in result (2) is the most involved since it does not assume genericity, and covers all possible configurations of four point pairs. We show that chiral reconstructions can fail in this case only in degenerate situations. To the best of our knowledge, both results (2) and (3) are novel. Result (1) says that three point pairs admit a chiral reconstruction unconditionally. While this is not too hard to prove, it also does not appear in the literature and completes the story.
Result (4) establishes a new connection between chiral reconstructions of five generic point pairs and the classical theory of cubic surfaces from algebraic geometry. The cubic surface in arises naturally in our set up and the two camera planes, modeled as , can be obtained by blowing down this surface. The papers [22] and [21] study chirality in the setting of using tools from oriented matroids. It was shown in [22] that the chiral regions are bounded by certain conics. Our results show that these conics are obtained by blowing down a Schläfli double six of real lines on the cubic surface, allowing the chiral regions to be seen as semialgebraic subsets of the cubic surface bounded by lines. Both frameworks produce a sixth pair of points whose existence was derived in [22] using the conics in . Our results show that the cubic surface is the blow up of the six points or in each . The blow up/blow down maps create a new interpretation of the results in [22] while also offering a unified picture of chirality that transfers seamlessly between the space of camera epipoles and the space of fundamental matrices of . We draw from, and build on, methods from the above mentioned papers from computer vision, and our own paper [1]. Our main tools are centered in complex and real algebraic geometry as well as semialgebraic geometry.
This paper is organized as follows. Formal definitions of projective and chiral reconstructions can be found in Section 3. In Section 4, we introduce the inequalities imposed by chirality and develop tools to certify them. Along the way we prove that a set of at most three point pairs always has a chiral reconstruction.
In Section 5, we prove that a set of four point pairs has a chiral reconstruction when the point configurations in each view have sufficiently similar geometry, and in particular, when they are in general position. The bad cases fall into two non-generic combinatorial types. In particular, the probability of choosing four point pairs that fail to have a chiral reconstruction is zero.
In Section 6 we show that when , point pairs that are in general linear position may not have a chiral reconstruction. Specific examples of this type when were known to Werner [22] and as mentioned before, there are close connections between our work and that of Werner’s [21, 22, 23, 24]. We make two new contributions for the case . In Section 6 we show that one can decide the existence of a chiral reconstruction when and is generic, by checking discrete points. We use this test to show that the set of five point pairs that do not admit a chiral reconstruction is Zariski-dense. In other words, five point pairs do not have a chiral reconstruction with positive probability. A set of six or more point pairs can have a chiral reconstruction only if any subset of five point pairs among them have one. Hence, for any value of , there will be point configurations without a chiral reconstruction. Our second contribution in Section 7 is to show that the case of is intimately related to the theory of cubic surfaces from classical algebraic geometry. Indeed, creates a Schläfli double six of lines on a cubic surface, all of whose lines are real. These lines determine the boundary of the semialgebraic regions corresponding to chiral reconstructions.
Acknowledgements. The question addressed in this paper is a natural follow up to our previous work on chirality in [1]. We thank Sameer Agarwal for many helpful conversations. We also thank Tomáš Pajdla for pointing us to several chirality papers from the literature.
2. Background and Notation
We now introduce some background and notation that will be needed in the paper. Let and denote -dimensional projective space over complex and real numbers respectively. More generally, we write for the projective space over a vector space , which is the set of lines in . We write if and are the same points in projective space, and reserve to mean coordinate-wise equality. A projective camera is a matrix in of rank three, i.e., a real matrix defined only up to scaling, hence naturally a point in the projective space over the vector space . Usually, an affine representative of a projective camera is fixed and we block-partition such a matrix as where and . The center of is the unique point such that . The camera is a rational map from the “world” to the “camera plane” sending a “world point” to its “image” . It is defined everywhere except at . Consider the hyperplane at infinity in , , as an oriented hyperplane in with fixed normal . The camera is said to be finite if is a finite point, i.e., . A special representative of a camera center can be obtained by Cramer’s rule where the th coordinate of is the determinant of the submatrix of obtained by dropping the th column. In particular, for a finite camera , the Cramer’s rule center is . Throughout this paper we use the Cramer’s rule representation of . The camera is finite if and only if , and all cameras in this paper will be finite.
The principal plane of a finite camera is the hyperplane where is the third row of , i.e. it is the set of points in that image to infinite points in . Note that the camera center lies on . We regard as an oriented hyperplane in with normal vector , which we call the principal ray of . The factor ensures that the normal vector of the principal plane does not flip sign under a scaling of . The depth of a finite point in a finite camera is defined as (see [12])
(1) |
Note that the sign of is unaffected by scaling and . The depth of a finite point in a finite camera defined in Equation 1 is zero if and only if which happens if and only if lies on the principal plane . Otherwise, and is either positive or negative. It is then natural to say that a finite point is in front of the camera if , see [11]. Since only the sign of matters, we refer to this sign as the chirality of in , denoted as , which is either or .
The above notion of chirality was introduced by Hartley in the seminal paper [11], where he was concerned with a pair of cameras, see also [12, Chapter 21]. In [1], the definition of chirality was extended to all points in , finite and infinite, and defined for an arrangement of cameras. Here is the two camera version we need.
Definition 2.1.
Let be a pair of finite projective cameras. Then the chiral domain of , is the Euclidean closure in of the set
A point is said to have chirality 1 with respect to , denoted as , if and only if lies in the chiral domain of .
In this paper we will be concerned with a pair of finite non-coincident cameras by which we mean that their centers are distinct. We will see that one can always take , and then the conditions of finite and non-coincident imply that where and . The pair gives rise to the unique (up to scale) real, rank two fundamental matrix where
The skew-symmetric matrix represents the cross product with as a linear map; that means that for we have . Also, if and only if . We are only ever interested in properties of fundamental matrices (mostly rank and kernels) that remain unaffected by scaling and will therefore mostly consider them up to scaling which is to say as a point of . We chose not to introduce a different notation to distinguish the matrix itself from the line it spans.
The epipole pair of the cameras is where is the image of the center in , and is the image of the center in . The line joining the centers and is called the baseline of the camera pair . Note that all world points on the baseline (with the exception of the respective camera centers) image to the epipole in each camera.
3. Projective and Chiral Reconstructions
Throughout this paper our input is a collection of point pairs . Each (and ) is the homogenization of a point in by adding a last coordinate one. Hence is a pair of finite points in with a fixed representation. We will also assume that all (and all ) are distinct.
In this section we formally define projective and chiral reconstructions of , and characterize their existence. We then set up the geometric framework within which these reconstructions will be studied in this paper.
3.1. Projective reconstructions
Definition 3.1.
A projective reconstruction of consists of a pair of projective cameras , , world points and non-zero scalars such that and for . If the cameras and world points are all finite, then is called a finite projective reconstruction of .
The basics of projective reconstructions can be found in [12, Chapters 9 & 10]. Theorem 3.1 in [17] proves that if has a projective reconstruction then it also has a finite projective reconstruction with .
We now recall the necessary and sufficient conditions for the existence of a finite projective reconstruction of . For a point , let denote the set of lines joining to each . The following geometric characterization is well-known [12, 18, 23, 22].
Theorem 3.2.
The points in the above theorem are the epipoles of the camera pair in the reconstruction. We now give a second characterization of the existence of a projective reconstruction in terms of fundamental matrices: Below, we will abuse notation and not distinguish between the matrix and the line it spans in the space of matrices (or in other words the corresponding point in projective space). All relevant properties of the matrices are invariant under scaling.
Definition 3.3.
-
(1)
A fundamental matrix of is a rank two matrix such that
(2) The linear equations (2) in are called the epipolar equations of .
-
(2)
Given a rank two matrix and a pair , such that , we say that is -regular if if and only if . i.e., and simultaneously generate the right and left kernels of , or neither generate a kernel.
-
(3)
A -regular fundamental matrix is a fundamental matrix of that is -regular for each point pair in .
It is commonly believed that has a projective reconstruction if and only if it has a fundamental matrix. However, a bit more care is needed as in the following theorem.
Theorem 3.4.
[17, Theorem 4.6] There exists a finite projective reconstruction of with two non-coincident cameras, and , if and only if there exists a -regular fundamental matrix.
Remarks 3.5.
-
(1)
Let and be the two finite non-coincident cameras in the projective reconstruction. Then recall that , , and the cameras correspond to the unique fundamental matrix up to scale. The epipoles of the cameras are and which generate the right and left kernels of . Further, defines a homography of that sends to , and the line to the line . Hence encodes the epipolar line homography of Theorem 3.2.
-
(2)
Conversely, any (rank two) fundamental matrix of can be factored as for some and and yield a pair of cameras whose epipoles generate the left and right kernels of . See [12, Section 9.5] for more details on the correspondence between fundamental matrices and camera pairs. These cameras can then be used to reconstruct a set of world points if is -regular. The resulting projective reconstruction is said to be associated to .
-
(3)
A fundamental matrix is -regular if and only if for each , either and are both epipoles of the cameras or neither are. Indeed, this is a necessary condition for a projective reconstruction since if , then its reconstruction lies on the baseline of the cameras and hence images to in camera requiring . This subtlety is often overlooked, and it is common to equate the existence of a projective reconstruction of to the existence of a fundamental matrix of .
- (4)
We now discuss the geometry encoded in Theorem 3.4 which will set the foundation for the work in this paper. Even though fundamental matrices are real, we will work over to allow for methods from complex algebraic geometry, and will specialize to as needed. A matrix can be identified with a point in by concatenating its rows. Under this identification we let be the determinantal hypersurface of matrices in of rank at most two, and be its subvariety of rank one matrices. As projective subvarieties of , and while, and . For a point pair , let denote the hyperplane in :
where denotes the Frobenius inner product on matrices. Let . Generically, is a linear space in of codimension .
Definition 3.6.
The variety in is the epipolar variety of .
Under sufficient genericity of , and . Hence, the epipolar variety is empty when , consists of three points when , and infinitely many points when .
For , consider the following five-dimensional linear spaces of that are in fact in :
Their intersections with the epipolar variety are the linear spaces: and , each of which generically has dimension since .
Definition 3.7.
The linear space (resp. ) will be called the (resp. ) wall and the intersection will be called the corner.
When , a corner has dimension generically since there are at most independent equations among . Thus a wall has codimension one and a corner has codimension two in the epipolar variety, generically. For non-generic data , all the dimensions computed above may be larger.
The second condition in Theorem 3.4 can now be rephrased as the existence of a real rank two matrix in the epipolar variety of such that for each , is either in the corner or in the complement of . Note that a rank two can lie in at most one corner because the points (and ) are pairwise distinct.
Going forward, we will work both in , the space of fundamental matrices, and in , the space of epipoles. These spaces are related by the adjoint map,
where and is the cofactor matrix of . If then and thus, if , then all non-zero rows (resp. columns) of are multiples of each other and generate the left (resp. right) kernel of . Since generators of the right and left kernels of a fundamental matrix represent epipoles, the adjoint map provides a convenient connection between epipole space and fundamental matrix space.
3.2. Chiral reconstructions
A physical constraint on a true reconstruction is that the reconstructed world points in must lie in front of the cameras and . Recall from the Introduction that this means we require to lie in the chiral domain of or equivalently, for all . A full development of multiview chirality can be found in [1]. For this paper, we use the following inequality description of the chiral domain for two views from [1, Theorem 1] as a definition. The cited result shows that these inequalities cut out the Euclidean closure of the set in Definition 2.1 (under the mild assumption that it has non-empty interior).
Definition 3.8.
A chiral reconstruction of is a projective reconstruction of with finite non-coindent cameras such that for all ,
where and is the principal ray of .
Recall that two projective reconstructions and are projectively equivalent if they are related by a homography of , i.e., there is a such that and . A projective reconstruction which is not chiral can sometimes be transformed into a chiral reconstruction by a homography [11, 1, 23]. We recall the conditions under which this is possible.
Theorem 3.9.
Consider a finite projective reconstruction of with non-coincident cameras , , and world points . Then the following are equivalent.
Proof.
The equivalence of (1) and (2) is Theorem 8 in [1]. The equivalence of (1) and (3) is Theorem 17 in [11]. The epipoles of the given cameras are and , and hence if no world point lies on the baseline, (equivalently, ) and for any . Also, since are non-coincident and , Theorem 3.2 and Item 1 imply that are collinear. Therefore, and so for all . The equivalence of (3) and (4) can then be derived from the same arguments as in Lemma 7 in [1]. ∎
If a world point lies on the baseline, then its images in are the epipoles of the cameras, and the expression in (4) becomes zero. However, since any point on the baseline can serve as the world point imaging to , we can control the sign of as shown next.
Lemma 3.10.
For a pair of non-coincident cameras whose baseline is not contained in either principal plane and , there exist and on the baseline such that and .
Proof.
Since , . On the other hand, since the baseline is not contained in and , , and so . By continuity, there exist perturbations and of on the baseline such that and . ∎
Remarks 3.11.
For reconstructions where both epipoles are finite, the hypothesis of Lemma 3.10 is satisfied. Indeed, if for instance the baseline was contained in the principal plane then and so would be an infinite point.
We now have a necessary and sufficient condition for the existence of a chiral reconstruction.
Lemma 3.12.
There exists a chiral reconstruction of if and only if there exist and such that is a -regular fundamental matrix and
(3) |
Proof.
Suppose is a chiral reconstruction of with non-coincident finite cameras where . Then for some and , and by Theorem 3.4, is a -regular fundamental matrix associated to and . For all such that is not on the baseline, has the same sign by Theorem 3.9. If some world point is on the baseline, then its image is the pair of epipoles , and hence . Therefore, the inequalities in (3) hold.
Conversely, suppose there exist and such that is a -regular fundamental matrix and the inequalities (3) hold. By Theorem 3.4, there exist world points such that is a projective reconstruction of . Let be the set of world points not on the baseline of and . Since the inequalities (3) hold, the quadruple products have the same sign for all . By Theorem 3.9, there exists a constant such that for all .
If some point lies on the baseline, then images to the pair of epipoles and in the two cameras and hence . By Lemma 3.10, we may replace by some world point on the baseline such that . Let be the modification of obtained by replacing all world points on the baseline as above, but keeping all other world points intact. By construction, for all . The transformed reconstruction is projectively equivalent to a chiral reconstruction by Theorem 3.9. ∎
Lemma 3.12 implies that for a chiral reconstruction to exist, there must be -regular fundamental matrices that satisfy the inequalities (3). In the next section, we examine these inequalities to understand the regions of the epipolar variety in which fundamental matrices that lead to chiral reconstructions live.
4. Chiral tools
In this section we develop tools to prove the existence of chiral reconstructions. In Section 4.1, we describe the semialgebraic chiral epipolar region of fundamental matrices associated to chiral reconstructions of . In Section 4.2, we show how inequalities defining the chiral epipolar region can be checked in epipole space. Even if not stated explicitly, we are working over when dealing with inequalities. We combine these tools in Section 4.3 to prove that a set of three point pairs has a chiral reconstruction. In Section 4.4, we show how the walls and corners of the epipolar variety can be used to decide if a set of more than three point pairs has a chiral reconstruction.
4.1. The chiral epipolar region
By Lemma 3.12, a fundamental matrix must satisfy (3) to yield a chiral reconstruction. In this section, we describe the strict subset of the epipolar variety satisfying these constraints.
Definition 4.1.
Let denote a real -matrix up to scaling. For each point pair , the -th chiral polynomial is
where . The set of all are called the chiral epipolar inequalities of . Here, the same representative for the left-kernel of must be used in and .
The -th chiral polynomial is, strictly speaking, not a polynomial because there is no way to write a generator of the left kernel of a matrix as a polynomial expression that works for every matrix of rank . To be technically precise, it is a section of a line bundle on the quasi-projective variety of matrices of rank exactly two. We avoid these technicalities and argue that the chiral epipolar inequalities are well-defined in an elementary way using the adjoint: Writing with in terms of is the composition of the adjoint map , whose image is in its Segre embedding, with the projection . So locally, is given as a row of the adjoint matrix (but only on the open set where that row is non-zero). The entries of the matrix are polynomials of degree three in the entries of . This shows that the inequalities are locally of degree six in the entries of . Also, if two rows and of are non-zero and differ by a negative multiple , i.e. , the sign of does not change because it essentially differs by . Therefore the sign of is well-defined for every real matrix of rank two up to scaling, i.e. for every fundamental matrix. The set of real matrices in for which is a semi-algebraic subset of in the following sense: There is an open affine cover of (by sets on which we can write as a polynomial function of ), such that the inequalities become polynomial and hence define a semi-algebraic set in each open subset of (the real points in) this cover. On the intersection of any two open sets in the cover, the regions cut out by these inequalities agree.
We now show that the chiral polynomial records the quadruple product from Lemma 3.12.
Lemma 4.2.
If , then for each .
Proof.
∎
The next theorem, which is analogous to Theorem 3 in [23], now follows from Lemma 3.12 and Lemma 4.2.
Theorem 4.3.
There exists a chiral reconstruction of if and only if there exists a -regular fundamental matrix such that for all .
Definition 4.4.
The chiral epipolar region of is the set of -regular fundamental matrices such that for all .
The chiral epipolar region of is contained in the semialgebraic subset of the real part of the epipolar variety that is cut out by the chiral epipolar inequalities. It is not necessarily equal to this set because the chiral epipolar region additionally requires the fundamental matrices to be -regular. However, since -regularity only fails on a proper algebraic subset, if the chiral epipolar region has non-empty interior, the boundary of the interior is determined by the points where the chiral epipolar inequalities change sign, which we study next.
Lemma 4.5.
Let be a fundamental matrix of . Then if and only if or .
Proof.
Clearly, if . If , then and are collinear and therefore , which implies . For the other implication, we know that and , where the three vectors , , and are real and non-zero. We assume that and show that . We know is orthogonal to and . Therefore, it must be collinear with , which is the same as . We also know that , which implies that is also orthogonal to , hence also to . The dot product , i.e. . The Cauchy-Schwarz inequality implies that and are collinear, which implies the claim . ∎
The goal of the paper is to understand when the chiral epipolar region of is non-empty, or equivalently, when has a chiral reconstruction. When , generically consists of three points and it is easy to check if the real points lie in the chiral epipolar region of . Therefore, our focus will be on values of .
4.2. Translating to epipole space
In this section, we show how we can check the validity of chiral epipolar inequalities in , the space of epipoles. Consider the -homogeneous quadratic polynomial
(4) |
where . Note that if and only if either factor is zero which is if and only if are collinear or are collinear. Werner uses the quantities to impose an orientation constraint on the epipolar line homography described in our Theorem 3.2, see [22, Section 5] and [21, Section 6.5]. We will show that is closely related to the products of chiral polynomials where and generate the right and left kernels of a fundamental matrix . We rely on the following well known identity [5, 21].
Lemma 4.6.
Suppose . Let be a finite camera with Cramer’s rule center . Then .
Lemma 4.7.
Consider a projective reconstruction of , i.e., and where . Suppose where is the Cramer’s rule center of . Then
Proof.
Expand as follows.
(5) | ||||
(6) | ||||
(7) | ||||
(8) | ||||
(9) |
Equation 7 follows from Equation 6 by applying Lemma 4.6 to both determinants in the product. Since
by assumption, we conclude that .
∎
Lemma 4.8.
Let be a fundamental matrix of . Suppose
where . Then
.
Proof.
Write for and some . We know that and generate the one-dimensional left and right kernels of , respectively. Since , we know that , is not collinear with and , and is not collinear with and . In particular, this means that neither nor is the right kernel of and neither nor is the left kernel of . It follows that is regular and regular and neither is the epipole pair. By Theorem 3.4, there exists a finite projective reconstruction of such that the world points are not on the baseline.
Lemma 4.7 implies that . By Theorem 3.9, if and only if . Combining this fact with Lemma 4.2, it follows that . Finally note that and which is a positive multiple of . Indeed,
(10) |
Substituting for and for , the result follows. ∎
The computation in the previous proof, in particular (10), shows that if is a non-zero generator of the left kernel of a fundamental matrix then is a non-zero generator of the right kernel of .
Note that can be zero without or being the epipole . Indeed, by Lemma 4.6 this happens whenever are coplanar. On the other hand, Lemma 4.5 implies that vanishes at if and only if or is an epipole of . Therefore, may vanish even when .
Lemma 4.8 shows that knowing the specific generators of the kernels of , i.e., and , respectively, is enough to compute the sign of the chiral epipolar inequalities. Note that a choice of generator for the left kernel of determines a signed generator for the right kernel. When does not vanish, we can use it to infer the validity of chiral epipolar inequalities via Lemma 4.8, and hence argue for the existence of a chiral reconstruction of . We now identify a situation where we can use any generators of the kernels of in .
Definition 4.9.
Suppose is a fundamental matrix of . Define to be the set of indices such that , i.e., the index set of inactive chiral polynomials at . Let be the subset of point pairs in indexed by .
Theorem 4.10.
Let be a fundamental matrix of where and generate the right and left kernels of , respectively. Suppose , and for all . Then there exists a chiral reconstruction of associated to if and only if has the same sign for all .
Proof.
Suppose there exists a chiral reconstruction of associated to . Then by Theorem 4.3, for all . In fact, for all since if for some while , we would contradict Lemma 4.8. Indeed, if for some kernel generators , it remains non-zero for any other pair of kernel generators. By Theorem 4.3, has the same sign for all , and since and are (non-zero) generators of the right and left kernels of , the result follows.
Conversely, suppose has the same non-zero sign for all where and generate the right and left kernels of , respectively. Then for some non-zero by (10). By Lemma 4.8, we know
for all . This shows that has the same sign for all . Since , this common sign cannot be negative and hence for all . These strict inequalities also imply that is -regular. Then by Theorem 4.3 there is a chiral reconstruction of associated to . ∎
We remark that does not have a well-defined sign on because it is linear in and . To get an inequality description of chirality in epipole space, we can take pairwise products which are quadratic in each factor. If , then for any fundamental matrix with epipoles and . However, since may vanish even when does not, we observe that for all triples is not equivalent to and . Due to this subtlety, we primarily study chirality using in as opposed to in .
4.3. Three point pairs always have a chiral reconstruction
In this section, we apply the tools developed so far to show that there is always a chiral reconstruction when , and hence also when since can have a chiral reconstruction only if all its subsets have one. We begin with two technical lemmas.
Lemma 4.11.
Suppose are three non-collinear points in . Then for each of the eight elements in , there is an such that are in general position (no three in a line) and
Further, for each , the corresponding choices of come from an open polyhedral cone in .
Proof.
The expression is the linear form whose kernel is the span of and . Since are non-collinear, the hyperplanes cut out by divide into eight regions, each of which is a polyhedral cone. The interiors of these cones correspond to the eight sign patterns . ∎
For , let be the convex cone spanned by and .
Lemma 4.12.
Suppose are points in on an affine line . If , then for all , .
Proof.
If , then either or . Suppose . Since and , we know . Write and where . Since , . The result follows from direct computation using that :
(11) |
Similar reasoning applies if . ∎
Theorem 4.13.
If then has a chiral reconstruction.
Proof.
We break the proof into two parts:
-
(1)
Suppose or is in general position, say is non-collinear. Choose not on the line spanned by and for any , so that for all . By Lemma 4.11, there exists an such that has the same non-zero sign for all . Since and and are chosen from open regions, contains at least one rank two matrix . By construction, this is a -regular fundamental matrix with epipoles and . By Theorem 4.10, yields a chiral reconstruction of .
-
(2)
Suppose both and are collinear and consider the affine lines and in spanned by these points, which all have last coordinate . Let be the furthest left and right points on the line, so that the third point lies strictly between . Similarly let be the furthest left and right points on the line. Let and choose such that and . Define . Since are collinear for all , the th epipolar equation is satisfied. Since the chosen epipoles for do not coincide with any data points, is a -regular fundamental matrix. By construction for each . Combining Lemma 4.2 and Lemma 4.12, it follows that for each , and there is a chiral reconstruction of associated to by Theorem 4.3.
∎
4.4. Walls and Corners
To understand the existence of chiral reconstructions when , we need one more tool that we now develop. Recall that the chiral epipolar region of is the set of -regular fundamental matrices that live in the semialgebraic subset of the real epipolar variety cut out by the chiral epipolar inequalities. Lemma 4.5 implies that the chiral epipolar region is bounded by the , walls. The fundamental matrices on walls are generally -irregular and do not correspond to a reconstruction. However, we show that -irregular fundamental matrices that are smooth points of the epipolar variety and yield partial chiral reconstructions, can be perturbed to -regular fundamental matrices that yield chiral reconstructions of .
Lemma 4.14.
Suppose is irreducible. If is a smooth fundamental matrix that is -irregular, then there is a tangent direction such that the directional derivative .
Proof.
Suppose is a smooth fundamental matrix of . Smoothness implies that the tangent space at to the epipolar variety has the same dimension as the variety. If is -irregular for some , then is in exactly one of or . Since is irreducible, each wall must be an embedded component of strictly smaller dimension. This means that the wall’s tangent space is strictly contained in the tangent space of the epipolar variety at . Therefore, we can choose a direction tangent to the epipolar variety at which is not tangent to the wall which contains . Lemma 4.5 implies that vanishes on the real part of the epipolar variety only on the walls. By construction . ∎
The following lemma is needed for Theorem 4.16 below, but its proof might be best understood after Section 7.
Lemma 4.15.
Suppose and is irreducible. If a wall (or ) contains a matrix of rank two, then a generic point on the wall is a smooth point of .
Proof.
We reduce to the case of as follows. If the wall contains a smooth point, then so will its intersection with generic data planes . Therefore, cutting with sufficiently many of these, using Bertini’s Theorem, we can assume that has dimension three, is an irreducible cubic surface in , and is a line on it. Suppose for contradiction that is singular at every point in .
The cubic surfaces which are singular along a line have been classified, see e.g. [3, in particular Case E]. We show that cannot be any of these types, essentially because it contains too many intersecting lines. Indeed, intersects as long as because the equations and impose at most three additional conditions on the three dimensional . Additionally, the assumption that the wall contains a matrix of rank two implies that this wall does not coincide with for .
The first examples of cubic surfaces singular along a line are the cones over a singular plane cubic curve. Our epipolar variety cannot be such a surface because it contains intersecting lines with distinct intersection points, which these cones do not. There are only two other types of cubic surfaces (up to change of coordinates) that are singular along a line.
The next type that is singular along a line, contains a one-dimensional family of lines, and one more line. A representative is given by the equation , which contains the lines , and a family of lines that form a twisted cubic in the Plücker quadric of lines in . The lines in the family are pairwise skew. This type of singular cubic surface only contains two lines that intersect lines in the family, which is inconsistent with the intersection pattern of lines on . The last relevant type is represented by the equation . It is singular along the line and it contains a one-dimensional family of lines. This family of lines is a twisted cubic curve in and they are mutually skew, which also does not fit the intersection pattern on our epipolar variety.
This discussion of cases shows that the epipolar variety cannot be singular along an entire line if it is an irreducible cubic surface in . ∎
Recall and from Definition 4.9.
Theorem 4.16.
Suppose and is irreducible. Then there exists a chiral reconstruction of if and only if has a chiral reconstruction associated to some smooth .
Proof.
Suppose there exists a chiral reconstruction of . By Theorem 4.3, there exists a fundamental matrix , which is -regular for all and for all . If for all , then the semialgebraic subset of the epipolar variety described by the chiral epipolar inequalities has non-empty interior. Since every non-empty, open semialgebraic set in an algebraic variety intersects its smooth locus, there is a smooth associated to a chiral reconstruction. Otherwise, -regularity implies that at most one can be the epipole pair of , i.e., at most one can vanish at . Without loss of generality, suppose so that . By Lemma 4.15, a generic point in is a smooth point of . We can move away from the corner along the wall to a smooth point of without changing the signs of any of the chiral epipolar inequalities, but keeping . This gives a chiral reconstruction of associated to a smooth fundamental matrix, finishing the proof of the first implication.
Conversely, suppose there is a chiral reconstruction of associated to a smooth . If , there is nothing to show. Otherwise, we have for some , which means that lies on some walls by Lemma 4.5. Smoothness of implies that must have rank two, so its left and right kernels are one-dimensional. Since the ’s and ’s are distinct, this means may lie on at most one wall and at most one wall . In other words, must be one of or .
Suppose . We argue locally: Choose an affine chart around such that for all , which is possible because we know that for all pairs . If , it is -regular and is associated to a chiral reconstruction. Otherwise, Lemma 4.14 implies that there is a tangent direction such that . Then we have because . Since in our affine chart, the sign of the directional derivative of is determined by , which is independent of . By flipping the sign of , if necessary, we can assume that . By Taylor approximation, there is a nearby smooth point such that for all . By Theorem 4.3, there is a chiral reconstruction of corresponding to .
Suppose . Choose an affine chart around such that for all , which is possible because we know that for all pairs . Choose linearly independent tangent vectors such that the directional derivatives and are non-zero. Choose a linear combination such that and . Since in our affine chart, the sign of the directional derivatives and at are determined by and , respectively. By Taylor approximation, there is a nearby smooth point such that for all . By Theorem 4.3, there is a chiral reconstruction of associated to . ∎
To apply Theorem 4.16, it is useful to understand the smooth locus of the epipolar variety which we describe next. For a more general discussion of tangent spaces to rank varieties we refer to [10, Example 14.16].
Lemma 4.17.
Suppose is a rank two matrtix such that spans the right kernel of and spans the left kernel of . Then is a smooth point of if and only if does not lie in the span of .
Proof.
The gradient of the determinant of is the cofactor matrix of , which is . Since and are collinear, the tangent hyperplane to at is
Since it is a hyperplane, it intersects transversely if and only if , or equivalently, if and only if does not lie in the span of . ∎
5. Four Point Pairs
In the previous section we saw that any set of three point pairs will always have a chiral reconstruction. In this section we completely analyze the case of four point pairs. In Theorem 5.13 we prove that if the point configurations in the two views have the same rank, then they admit a chiral reconstruction. Otherwise, a chiral reconstruction can fail to exist (Theorem 5.14). Our main tool will be Theorem 4.16 which requires understanding when the epipolar variety is reducible. We do this in Section 5.1 which leads to our chirality results in Section 5.2.
5.1. Irreducibility of the epipolar variety
Suppose the epipolar variety is reducible. Then is not contained in since otherwise, which is irreducible. Therefore, is a proper subvariety of . Further, any irreducible component of has codimension one in because is the only additional condition to those defining . Conversely, if is a proper subvariety of of codimension one in , then is an irreducible component of . Indeed, if lies in some irreducible component of of dimension larger than , then has codimension at least two in , which is a contradiction. Lastly, since has degree three, it must have a linear component if it is reducible and this linear component is a subspace of . Thus to understand whether is reducible, we need to understand the linear subspaces of that have codimension one in .
A subspace is said to have rank at most if is the maximum rank of matrices in . The following definition is from [8] for matrices.
Definition 5.1.
Let be a subspace of rank at most . Then is called a compression space if there exists a subspace of codimension and a subspace of dimension such that , and every maps into (compresses into ). We refer to as a -compression space.
Note that every subspace of a -compression space is again a -compression space. The following theorem from [8] attributed to Atkinson [2] tells us what subspaces of look like.
Theorem 5.2.
[8, Theorem 1.1] A vector space of matrices in of rank at most two is either a compression space or a subspace of the linear space of skew-symmetric matrices.
Theorem 5.3.
Suppose . Then the epipolar variety is reducible if and only if contains a compression space of rank at most two and codimension one.
Proof.
By the above discussion, if is reducible, then it has a linear component that is a subspace of and has codimension one in . Since when , and the space of skew-symmetric matrices has dimension two, by Theorem 5.2, must be a compression space of rank at most two. Conversely, if is a compression space in of rank at most two and codimension one, then , and since has codimension one in , must be an irreducible component of . ∎
Example 5.4.
Consider the three point pairs given by the columns of the following matrices:
Using Macaulay2 one can compute that where has degree two and has degree one. The ideal of is , so every has the form
Since and , has codimension one in . The linear space is a -compression space of rank at most two; each compresses the line spanned by the columns of into the orthogonal complement of the line spanned by the columns of .
Suppose we now enlarge the set of point pairs to
Now has a linear component consisting of matrices of the form
This subspace is clearly a subspace of the compression space and has codimension one in the new . The linear condition arises from the epipolar equation imposed by the new point pair.
The main result of this subsection is the following which will allow us to use Theorem 4.16 to establish chirality results for four point pairs in the next subsection. A set of points in is said to be in general (linear) position if no three of them are collinear.
Theorem 5.5.
Suppose . If for all triples of distinct indices , or are in general position, then is irreducible.
Theorem 5.5 will follow from Theorem 5.3 if under the assumptions of the theorem, there are no compression spaces of rank at most two and codimension one in . We will now prove that this is indeed the case by understanding the orthogonal complements of compression spaces of rank at most two.
Two vector spaces and in are equivalent if they have the same linear transformations up to a change of bases in the domain and codomain , i.e., if there exists such that . The following classification is straightforward.
Lemma 5.6.
Let be a -compression space of rank at most two in .
-
(1)
If then is equivalent to a vector space of matrices with a zero last column.
-
(2)
If then is equivalent to a vector space of matrices with a zero last row.
-
(3)
If then is equivalent to a vector space of matrices of the form
Call a -compression space maximal if it is not a proper subspace of another -compression space. In particular, a -compression space is maximal if the equations cutting it out are precisely the conditions that impose the zeros guaranteed in their standard forms. Non-maximal compression spaces satisfy further conditions on their potentially non-zero coordinates. The following lemmas help us understand maximal compression spaces.
Lemma 5.7.
Suppose . Then if and only if and .
Proof.
Suppose . Then multiplying on the right by , we get that . Similarly, multiplying on the left by , we get that . Conversely, if and , then it is immediate that . Note that for this last statement to hold, it was important that all possible outerproducts of the form are present in . ∎
Lemma 5.8.
Let be a maximal -compression space of rank at most two and let be its orthogonal complement.
-
(1)
If , then where and are linearly independent vectors in . Similarly if , then where and are linearly independent vectors in . In both cases, as a projective space, and all matrices in have rank one.
-
(2)
If , then , , , where are linearly independent and are linearly independent vectors in . As a projective space, and the rank one matrices in it cut out a variety isomorphic to .
Proof.
-
(1)
Let be the maximal -compression space in standard coordinates, consisting of all matrices with a zero last column. Then is is spanned by , . By Lemma 5.6, a general -compression space for a pair of fixed invertible matrices . Therefore, for all . Taking the first dot product:
Setting and , we get that . Since is invertible, are independent, and no row of is all zero. Hence, is spanned by independent rank one matrices and as a projective space, . Further, any linear combination of these three rank one matrices looks like which is a rank one matrix. Therefore, all matrices in have rank one. The proof for -compression spaces is analogous.
-
(2)
By the same reasoning as in the previous case, if is a -compression space, then . Since is maximal, as a projective space, and hence . Since the linear combinations
are rank one matrices, there is a worth of rank matrices in which forms a subvariety of dimension two. By Lemma 5.7, there are no more rank one matrices in .
∎
From now on we will denote a maximal compression space of rank at most two by and a subspace of it by . If a non-maximal compression space appears as an irreducible component of , then will be spanned by the generators of along with some data matrices as in Example 5.4. In particular a basis of is the union of the basis for from Lemma 5.8 and some data matrices.
Lemma 5.9.
Suppose for all distinct indices , or are in general position. Let be a -compression space contained in a maximal -compression space .
-
(1)
If is of type or , then contains at most one data matrix.
-
(2)
If is of type , then contains at most two data matrices.
Proof.
- (1)
-
(2)
If is a (1,1) compression space, then by Lemma 5.8 for some linearly independent and . By Lemma 5.7, if and only if and . If there are three data matrices in , say corresponding to indices , then and . However this contradicts our general position assumption, so we conclude that may contain at most two data matrices.
∎
We need one final lemma to show that the general position assumption in Theorem 5.5 prevents the existence of codimension one compression spaces of rank at most two in when . The following simple fact about our input point pairs will be useful.
Lemma 5.10.
If for , then and . Similarly, if for , then and .
Proof.
Since the ’s have last coordinate 1, if , the last columns of the two matrices are and . Therefore, and hence . The second statement is proved similarly. ∎
Lemma 5.11.
Suppose and for all distinct indices , or are in general position. Let be a -compression space of rank at most two and codimension one in , and let be a maximal -compression space containing .
-
(1)
If is of type or , then contains at least two data matrices.
-
(2)
If is of type , then contains at least three data matrices.
Proof.
By the general position assumption, at least three of the data matrices are linearly independent, and hence or .
-
(1)
Suppose is a maximal -compression space of rank at most two and codimension one in . Then by Lemma 5.8, and . This implies that which is a contradiction.
So suppose is a non-maximal -compression space of rank at most two and codimension one in , and is a maximal -compression space containing . Define
-
(a)
: Then and has a basis of cardinality four. We may assume without loss of generality that . By Lemma 5.7, is not a multiple of . Since , , and since has codimension one in , and so We need to argue that for , . Suppose for , equals the linear combination
(12) If then we are done, so suppose . If all the ’s are zero, then is a multiple of which is impossible by Lemma 5.10. If at least one , then (12) has rank two unless . In this case, (12) looks like . Again, by Lemma 5.10, this cannot equal for any . Therefore, it must be that for , .
-
(b)
: In this case, , and . Without loss of generality, . We need to argue that . Suppose for , is a linear combination of the form
(13) If all the ’s are 0 then lies in the span of the other two data matrices which contradicts that . If all ’s are then we are done. So assume that at least one and one are non-zero. If only one is non-zero, the claim follows from the same argument as in the previous case. So suppose . By Lemma 5.7, neither nor are multiples of which means that the second matrix in the product has rank at least two. The first matrix also has rank at least two since and are linearly independent and hence (13) has rank at least two and cannot equal for .
The case is argued similarly.
-
(a)
-
(2)
Suppose has an irreducible component which is a -compression space.
-
(a)
: In this case, and is a maximal compression space. Therefore, and . The matrix now has rows . Since it already has rank four, it must be that all the data matrices are in . They are in fact in the subvariety of containing all the rank one matrices.
-
(b)
: In this case which means that without loss of generality, . Also, and hence . We need to argue that these matrices in fact lie in . Suppose for , is a linear combination of the form
(14) with . As before, some must also be non-zero. By Lemma 5.7, or . Suppose . Then the first matrix in (14) has rank two and hence (14) has rank two unless all rows of the second matrix are multiples of . Then as in the previous cases, (14) cannot equal for any . If then the second matrix in (14) has rank two and hence the linear combination has rank two unless all columns of the first matrix are multiples of . Again by Lemma 5.10, (14) cannot be for any .
-
(a)
∎
Proof of Theorem 5.5.
If is reducible, then by Theorem 5.3 there is a codimension one compression space of rank at most two in . However, if for all distinct indices , or are in general position, then Lemma 5.9 and Lemma 5.11 together prohibit the existence of a codimension one compression space of rank at most two in , proving the theorem. ∎
5.2. Chiral Reconstructions for Four Point Pairs
We now have the tools to determine when a set of four distinct point pairs has a chiral reconstruction. Up to permuting the two images, every set of four point pairs belong to one of the combinatorial types shown in Figure 1.






We can further group these types by the rank of the points in and .
Definition 5.12.
We say that if and only if the matrix with columns has rank . Equivalently, if and only if the matrix with columns has rank .
Rank captures the geometry of and . Indeed, if , then all are coincident as points in . If , then all are collinear in , and if , then some three are non-collinear in . The assumption that points in and are distinct implies that . The spirit of Theorem 5.13, the main result of this section, is that when and have similar geometry, then has a chiral reconstruction. In particular, a chiral reconstruction exists for the combinatorial types in Figure 1(a), Figure 1(b), and Figure 1(c) where and for the type in Figure 1(d) where . We will present examples of configurations of the types in Figure 1(e) and Figure 1(f) for which a chiral reconstruction can fail to exist.
Theorem 5.13.
If and , then has a chiral reconstruction.
Proof.
We break the proof into three parts:
-
(1)
Suppose . Then the points in both and are collinear as in Figure 1(d), even as points in . Assume without loss of generality that the points appear in the order along the affine line they span in . The ordering of induces an ordering of the . Let be such that for all , , and define such that and . This forces for all . For a , by Lemma 4.12. Then, Lemma 4.2 implies that for all , proving the result.
-
(2)
Suppose and for all , or are in general position. Then by Theorem 5.5, is irreducible, and hence by Theorem 4.16, it suffices to show that there is a smooth rank two matrix for which has a chiral reconstruction.
We first claim that there is an ordering of point pairs such that are in general position and does not lie on any of the lines , , and where is the line spanned by and . Indeed, since , there is an ordering of the point pairs so that does not lie on any of the lines , , or . If are in general position for this ordering, we are done. Otherwise, collinear and so must be in general position by assumption, and hence are in general position. Also, since , must be non-collinear. Since cannot be on the lines , swapping point pairs with indices 3 and 4 gives the desired result.
Reorder point pairs so that are in general position and does not lie on any of , , and . Then choosing forces to all be non-zero. Removing from leaves three points in general position, so we may pick by Lemma 4.11 so that for all . Since the positivity of these determinant products is an open condition, there is a open polyhedral cone from which such an can be chosen. Consider the system
which consists of at most eight linearly independent equations. Therefore, this system has a solution . From Lemma 4.17, we know the tangent space to at has normal and is a smooth point of the epipolar variety if does not lie in . If , there is a minimal subset of the five matrices, including that are dependent. Placing the vectorizations of these matrices in the rows of a matrix, we get that all its maximal minors are zero. Each of these minors, set to zero, is a linear equation in and together they cut out a subspace in . Since we were able to choose from an open polyhedral cone, there is a choice of that avoids this subspace and the rank one variety. Hence, can be chosen (by choosing appropriately) to be a smooth rank two fundamental matrix and we are done by Theorem 4.16.
-
(3)
Suppose and there exist for which both and are collinear. Without loss of generality we may assume that , and that appear in this order on the affine line they span in . Let be such that , and define by , , and . Then for a , set . Since, and are collinear for each , satisfies all the epipolar equations . Also, since for , by Lemma 4.2 and Lemma 4.12. Finally, and so strictly satisfies all chiral epipolar inequalities and has a chiral reconstruction by Theorem 4.3.
∎
We conclude by showing that for four point pairs, Theorem 5.13 is best possible in the following sense.
Theorem 5.14.
For the two combinatorial types where , there are instances of without a chiral reconstruction.
We give examples to prove Theorem 5.14. Recall that the epipolar line homography of Theorem 3.2 cannot send coincident lines to distinct lines or vice versa. Therefore if is a -regular fundamental matrix with right and left kernels generated by and , respectively, then are collinear if and only if are collinear.
Example 5.15.
Consider the arrangement in Figure 2 where .


Let be the line in spanned by and . Suppose there is a fundamental matrix for with , and suppose and . Then (equivalently, ) because then the set of lines consists of a repeated line, while the set of lines contains at least two distinct lines for any choice of . If , then consists of four distinct lines, and hence must also have four distinct lines. In particular, , or equivalently, . These restrictions imply that are all non-zero. It is then clear that they cannot all have the same sign. Therefore, by Theorem 4.10, there is no chiral reconstruction of .
The epipolar variety in Example 5.15 is reducible; is a linear component containing the wall . Let and define such that and . Then is a fundamental matrix of . Indeed, and and therefore satisfies the second and fourth epipolar equations. It is straightforward to check that are collinear and are collinear, so also satisfies the first and third epipolar equations. By construction, , and by Lemma 4.2 and Lemma 4.12, . Thus, is a fundamental matrix of that satisfies all the chiral epipolar inequalities for all . However, lies in the corner and is not -regular and all neighboring matrices to on are also -irregular. Therefore, we cannot use Theorem 4.16 to perturb to a regular fundamental matrix.
Example 5.15 illustrates why the irreducibility of the epipolar variety is needed in Theorem 4.16. The next example shows that a chiral reconstruction may not exist even when the epipolar variety is irreducible.
Example 5.16.
Consider the arrangement in Figure 3 where .


By Theorem 5.5, this epipolar variety is irreducible. Suppose has a fundamental matrix with left epipole and right epipole . By Theorem 3.2, cannot be on the line spanned by the points since there is no for which the set of lines consists of coincident lines. For any not on the line spanned by the points, the vector
which has sign pattern (up to sign) since , and for any ,
We can guarantee a chiral reconstruction if
However, this implies that which is a contradiction. Similarly, we cannot achieve the sign pattern either. What might still be possible is to choose so that some of the determinants are zero, or equivalently, lies on the line joining two of the points. This will not work since then two coincident rays will have to map to two non-coincident rays under the homography in Theorem 3.2. Therefore, has no chiral reconstruction.
Unlike in Example 5.15, there is no fundamental matrix for Example 5.16 that satisfies all chiral epipolar inequalities. Indeed, Theorem 4.16 implies that there cannot be a smooth such that has a chiral reconstruction associated to . This rules out the possibility that there is an irregular with respect to some index that satisfies the chiral epipolar inequalities.
6. Five Point Pairs
We have seen so far that three point pairs unconditionally have a chiral reconstruction as do four point pairs unless their geometry is special and dissimilar. In this section we will see that five point pairs can fail to have a chiral reconstruction even if both sets of points are in general position. Specific examples of such point pairs were known to Werner [22]. We will show that, in fact, five point pairs do not have a chiral reconstruction with positive probability. We will use Theorem 4.16 to devise a simple test for chirality in this section, and connect to the classical theory of cubic surfaces in the next section.
Throughout this section, let be a set of five generic point pairs in the sense that is a generic subspace in of codimension five. In this case, misses the four-dimensional variety , and by Bertini’s Theorem, the epipolar variety is a smooth, irreducible cubic surface. Each wall or is a line on . We first see how these lines intersect.
Lemma 6.1.
Let be a set of five generic point pairs. Then
-
(1)
and , and similarly and , do not intersect for all .
-
(2)
and do not intersect for all .
-
(3)
The corner consists of a unique real rank two matrix for .
Proof.
-
(1)
Recall that for , , and so there is no rank two such that .
-
(2)
Since , is the intersection of the three-dimensional linear space with the generic codimension four linear space in , and hence empty.
-
(3)
When , is the intersection of the three-dimensional linear space with the codimension three linear space in . This intersection is zero-dimensional and since the defining equations are linear with real coefficients, it consists of a single real matrix . Generically, the intersection will miss , so we conclude that is a real rank two matrix.
∎
Remarks 6.2.
Concretely, when , the rank two matrix is where is the unique homography sending to and to for all . It is easy to verify that this satisfies the epipolar equations and is the unique point in .
Definition 6.3.
Let be the set of all wall intersections on the epipolar variety:
We call the points in the corners of the epipolar variety.
Lemma 6.1 shows that consists of 20 distinct fundamental matrices of . However, they do not correspond to full projective reconstructions of because necessarily the corner is neither regular nor regular. Despite this fact, we argue that checking the signs of the chiral epipolar inequalities at these corners suffices to determine whether has a chiral reconstruction.
Theorem 6.4.
Let be a generic set of five point pairs. Then has a chiral reconstruction if and only if has a chiral reconstruction for one of the 20 corners .
Proof.
Suppose has a chiral reconstruction. Then Theorem 4.3 implies that there is a -regular fundamental matrix such that for all . By Lemma 6.1, the epipolar variety does not contain the corner for any . Therefore regularity with respect to all image pairs implies for all , meaning for all . Let be the connected component of the semialgebraic subset of the epipolar variety described by that contains . The chiral epipolar inequalities only change sign along walls. Either is the entire epipolar variety or its boundary is contained in the walls. Every wall contains four corners and every interval on a wall that is contained in is bounded by corners (or unbounded, in which case it contains all four corners on the wall). So contains corners and every corner lies in and corresponds to a chiral reconstruction of the subset of point pairs.
Conversely, suppose has a chiral reconstruction associated to some . The epipolar variety is smooth, irreducible, and consists only of rank two matrices. Hence the points in are smooth fundamental matrices and Theorem 4.16 implies there is a chiral reconstruction of . ∎
Theorem 4.10 allows us to infer the sign of all non-zero chiral epipolar inequalities at a corner directly using the input point pairs. The next corollary makes this precise.
Corollary 6.5.
Let be a set of five generic point pairs. Then has a chiral reconstruction if and only if there is some corner such that and have the same sign for .
Proof.
Since is generic, no three are collinear and no three are collinear. Hence, the expressions and are all non-zero for pairwise distinct . There is some corner such that and have the same sign if and only if has a chiral reconstruction associated to (Theorem 4.10) if and only if has a chiral reconstruction (Theorem 6.4). ∎
Corollary 6.5 gives an algorithm to check whether five generic point pairs have a chiral reconstruction. We evaluate the sign of three polynomials on the input point pairs at each of the 20 corners. The signs are the same at a corner if and only if it lies in the boundary of the chiral epipolar region of . We illustrate the procedure on an example.
Example 6.6.
Let
(21) |
and where is the th column of and is the th column of V. By Corollary 6.5 and the following table, has no chiral reconstruction.
Now consider the following modification of the above example obtained by perturbing :
(28) |
We compute the products at all 20 corners and find that
and hence the corner lies in the boundary of the chiral epipolar region, so these point pairs have a chiral reconstruction. We point out that the same-signness property is not symmetric in the sense that if it holds for , it need not hold for . Indeed,
The point pairs in (21) are a slight modification of [22, Figure 1] for which the epipolar variety is singular. Our modification makes the variety smooth, a property of interest in the next section. Furthermore, since sufficiently small perturbations of point pairs do not change the signs of , , and at any corner, our methods show that having no chiral reconstruction is an open condition. This leads to the following conclusion.
Theorem 6.7.
The set of five point pairs without a chiral reconstruction is Zariski dense in the space of all five point pairs.
Finally, embedding the point pairs in (21) into instances of six or more point pairs one can construct larger configurations with no chiral reconstruction.
7. Connections to Classical Algebraic Geometry
In this section, we discuss the connection between five point pairs and the classical theory of real cubic surfaces in projective -space. This point of view is yet another way to study the space of epipoles, which goes back to Klein and Segre [20]. General references for cubic surfaces are still mostly classical books like [14],[15], or [20]. More modern accounts of classical facts about cubic surfaces can be found in [7] or [19]. Some of the history going back to Cayley and Salmon is discussed in [6].
As in Section 6, we consider sets of five point pairs that are generic though some of our discussion below might generalize to more singular situations. In the first three subsections, we use the following as a running example.
Example 7.1.
Consider the point correspondences where is the th column of and is the th column of shown below:
7.1. From two images to a cubic surface
Given five generic point pairs , the epipolar variety is a smooth cubic surface and its defining equation comes with a determinantal representation. Indeed, is defined by ; to get the equation of the intersection with the linear space , we compute a basis of and plug a general linear combination of this basis into the equation to obtain
where the entries of are linear forms in .
Example 7.2.
The variety is cut out by where
The coefficient matrices in the linear matrix polynomial form a basis of .
7.2. The 27 lines on a cubic surface
The cubic surface contains special lines coming from the input point pairs, namely the walls and . From Lemma 6.1, we know the lines do not intersect each other and the lines do not intersect each other. Additionally the lines intersect the lines in corners, but they do not all intersect pairwise. We have lines in a very special configuration: It turns out that this is only possible if these lines are contained in a real Schläfli double six on .
Definition 7.3.
A Schläfli double six on a smooth cubic surface is a pair of six-tuples of lines
on such that the six lines in each tuple are pairwise disjoint and is non-empty if and only if .
Every smooth cubic surface contains complex lines in total and they can be organized into different Schläfli double six configurations. The combinatorics behind this has been studied extensively. A discussion of the Schläfli double sixes is summarized in [4, Section 2]. For a development via the blow-up construction, see [13, Chapter V, Section 4]. With this approach, it is straightforward to check that a pair of five tuples of lines on a smooth cubic surface with , (for all ), and if and only if , uniquely determines a Schläfli double six.
The fact that our cubic surfaces, that appear as epipolar varieties, always contain a Schläfli double six consisting of real lines, means that they are all of the same real topological type. Schläfli, Klein, and Zeuthen classified the real topologies of cubic surfaces. There are five possible types. The real classification of cubic surfaces is summarized in [4, Theorem 5.4] and discussed in detail in [9, pp. 40–55]. Epipolar varieties are always of type containing real lines. Indeed, cubic surfaces of type , , or do not contain enough real lines to have a real Schläfli double six (namely only seven, three, and three). Type surfaces contain real lines, which would be enough for a Schläfli double six. These surfaces are the blow-up of in four real points and one conjugate pair of complex points. The lines are the four real exceptional divisors over the four real points, the strict transforms of the six real lines joining any pair of the four real points, the strict transform of the one real line joining the conjugate pair of complex points and the strict transforms of the four conics passing through all sets of five points that is complementary to one of the real points. It is straightforward to check that there are no six mutually skew lines among these real lines.
Every smooth cubic surface that arises as the epipolar variety of five point pairs is the blow-up of the real projective plane in six real points in general position. (The other types are obtained by blowing up in six complex points that are invariant under complex conjugation with different numbers of fixed points, namely , , and . The last remaining type is not isomorphic to a blow-up of over the reals.) This is curious because we start with five point pairs that specify lines on the epipolar variety, but these lines determine two other lines via the unique Schläfli double six containing the . This prescribes one more point in every image, as we will see below. In [22], Werner shows one way to construct this sixth point in each image using the five points pairs. We discuss two other ways to explain the sixth point (see Example 7.4 and Example 7.5).
Example 7.4.
The four-dimensional linear space intersects the four-dimensional, degree six variety in six points. The sixth point is where and .
The Schläfli double six specified by the lines and for accounts for twelve of the 27 lines on the epipolar variety. For , and span a tritangent plane, . The tritangent plane intersects the epipolar variety in , and one additional line
The plane coincides with , hence the line coincides with for all . The distinct lines for account for the remaining 15 real lines on the epipolar variety.
7.3. The determinantal representations of a cubic surface
From the classical point of view, we can start with six real points in the real projective plane and obtain a cubic surface of the correct topological type. Each of these surfaces is the epipolar variety for five generic points pairs but every cubic surface is compatible with many second images. To determine the second image, we go via a determinantal representation; different choices of a determinantal representation result in different second images (even modulo projective transformations).
Given six real points in , we get a cubic surface that is the blow-up of in these six points by considering the rational map given by the four-dimensional space of cubics that vanish at the six points. Fixing a basis of this space, the map is given by and is defined on . The closure of the image of this map is the cubic surface in (which is determined up to change of coordinates on by the six points in ).
We cannot determine the second image from this surface alone. However, this information is specified by a determinantal representation of the surface. Determinantal representations are closely related to the Hilbert-Burch matrix as is explained in [4, Section 3] (see also [9]). The vanishing ideal of six points in in general linear position is generated by the four-dimensional linear space of cubics vanishing on it. Its minimal free resolution looks like
where is a free, graded -module of rank three and the map from to is given by a Hilbert-Burch matrix with entries linear in . Every such matrix gives a determinantal representation of via a simple computation: Determine the matrix with entries in such that
and then . Since is also a determinantal representation of , we get another Hilbert-Burch matrix with linear entries such that
Therefore a determinantal representation determines six real points in the first image cut out by the minors of and six real points in the second image cut out by the minors of .
Example 7.5.
The Hilbert Burch matrix of is
while the Hilbert-Burch matrix of is
We can compute the six real points in each image from the Hilbert-Burch matrices, because the minors vanish exactly at these points. The zeros of these ideals of minors are where and where , respectively. This gives yet another way to compute and .
7.4. From a determinantal representation to epipoles
Given a determinantal representation of a cubic surface , consider the map , . Since the matrix has rank two for every point , the image lies indeed inside embedded via the standard Segre embedding in . With its given determinantal representation, the image of the cubic surface under the adjoint map is the space of epipoles in two images consistent with .
From our study of the chiral epipolar region in Section 6, we are interested in the lines and on the cubic surface corresponding to . We compute the images of these lines under the adjoint map, connecting our work with Werner’s results in epipole space [22]. The image of under the adjoint map is , where is a curve of degree two, namely the conic passing through for . Indeed, for every matrix , the vector is in the right kernel, by definition, whereas the left kernel varies. This left kernel varies along a conic in because every entry of the adjoint matrix of a matrix is a quadric. So intersecting the image of under the adjoint map with a line in translates, by pulling back via the adjoint map, to the intersection of a quadric hypersurface in with the line , which therefore consists of two intersection points. Symmetrically, the image of is , where is the conic passing through for . This behaviour is special and occurs only for the walls in the cubic surface. The image of the other lines under the adjoint is contained in .
The conics in the first image and in the second image are described in Section 4 of [22]. These conics divide each image into cells of possible epipoles with piecewise conic boundaries. Each cell is uniquely characterized by the subset of conics that participate to form its boundary or equivalently by the subset of points that belong to its boundary. Werner’s test for the existence of a chiral reconstruction involves looking for “allowed segments” of the conics and . In our work, we translate this question to studying the intersections of lines, i.e., the corners in the cubic surface in the preimage of the adjoint. In the following example, we show how our chirality test on corners relates to Werner’s allowed segments in the space of epipoles.
Example (6.6 continued).
Consider the five point pairs from (28):
We find that the same sign condition of Corollary 6.5 holds at the following corners: , , , , and . These corners are in the boundary of the chiral epipolar region of which lives in the cubic surface . Blowing down with the adjoint map, we get the non-empty region of epipoles in that correspond to a chiral reconstruction. The boundary of the region in the first image is defined by and at , and at , and and at . The boundary of the region in the second image is defined by and at , and at , and and at . See Figure 4.


We remark that our test for chirality requires more than just the cubic surface determined by six real points in one image . For different determinantal representations of , the six points in the other image may differ. In order to check the chiral epipolar inequalities, it is necessary to fix the second image, for example, by fixing a determinantal representation of .
It is also necessary to specify five out of the six point pairs to test for chirality. As the next example shows, while some sets of five point pairs may have a chiral reconstruction, another set associated with the same determinantal representation may not. These observations point out the intricate dependence of chirality on arithmetic information.
Example (6.6 continued).
Consider the five point pairs from (21):
We compute the sixth associated point pair and . Note that the need for all point pairs to have last coordinate one fixes the representatives of and . By computing the relevant products at the 20 corners, we find that does not have a chiral reconstruction. However, suppose we consider a different subset of five point pairs
Some corners now satisfy the same sign condition of Corollary 6.5, hence has a chiral reconstruction. In fact, the subset of five point pairs which omits index has a chiral reconstruction when and has no chiral reconstruction when . The cubic surface is the same for all and the determinantal representations are all equivalent, but whether a chiral reconstruction exists or not depends on which five point pairs we choose.
References
- [1] S. Agarwal, A. Pryhuber, R. Sinn, and R. R. Thomas, Multiview chirality, 2020.
- [2] M. Atkinson, Primitive spaces of matrices of bounded rank. II, Journal of the Australian Mathematical Society, 34 (1983), pp. 306–315.
- [3] J. W. Bruce and C. T. C. Wall, On the classification of cubic surfaces, J. London Math. Soc. (2), 19 (1979), pp. 245–256.
- [4] A. Buckley and T. Košir, Determinantal representations of smooth cubic surfaces, Geometriae Dedicata, 125 (2007), pp. 115–140.
- [5] O. Chum, T. Werner, and T. Pajdla, Joint orientation of epipoles, in Proceedings of the British Machine Vision Conference, 2003.
- [6] I. V. Dolgachev, Luigi Cremona and cubic surfaces, in Luigi Cremona (1830–1903) (Italian), vol. 36 of Incontr. Studio, Istituto Lombardo di Scienze e Lettere, Milan, 2005, pp. 55–70.
- [7] , Classical Algebraic Geometry: A Modern View, Cambridge University Press, Cambridge, 2012.
- [8] D. Eisenbud and J. Harris, Vector spaces of matrices of low rank, Advances in Mathematics, 70 (1988), pp. 135–155.
- [9] A. V. Geramita, Lectures on the nonsingular cubic surface in , in The Curves Seminar at Queen’s, Vol. VI (Kingston, ON, 1989), vol. 83 of Queen’s Papers in Pure and Appl. Math., Queen’s Univ., Kingston, ON, 1989, pp. Exp. No. A, 106. With appendixes by the author and E. Kani and by Robert McCann.
- [10] J. Harris, Algebraic Geometry: A First Course, Graduate Texts in Mathematics, Springer-Verlag New York, 1995.
- [11] R. I. Hartley, Chirality, Int. J. Comput. Vision, 26 (1998), pp. 41–61.
- [12] R. I. Hartley and A. Zisserman, Multiple View Geometry in Computer Vision, Cambridge University Press, second ed., 2004.
- [13] R. Hartshorne, Algebraic Geometry, Springer, 1977.
- [14] A. Henderson, The Twenty-Seven Lines Upon the Cubic Surface, Reprinting of Cambridge Tracts in Mathematics and Mathematical Physics, No. 13, Hafner Publishing Co., New York, 1960.
- [15] D. Hilbert and S. Cohn-Vossen, Geometry and the Imagination, Chelsea Publishing Company, New York, N. Y., 1952. Translated by P. Neményi.
- [16] S. Laveau and O. Faugeras, Oriented projective geometry for computer vision, in Proceedings of the European Conference on Computer Vision, Springer, 1996.
- [17] H. leung Lee, On the existence of a projective reconstruction, CoRR, abs/1608.05518 (2016).
- [18] S. Maybank, Theory of Reconstruction from Image Motion, Springer-Verlag, 1993.
- [19] M. Reid, Undergraduate Algebraic Geometry, vol. 12 of London Mathematical Society Student Texts, Cambridge University Press, Cambridge, 1988.
- [20] B. Segre, The Non-singular Cubic Surfaces, Oxford University Press, Oxford, 1942.
- [21] T. Werner, Combinatorial constraints on multiple projections of a set of points., in ICCV, 2003, pp. 1011–1016.
- [22] , Constraint on five points in two images, in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2003.
- [23] T. Werner and T. Pajdla, Cheirality in epipolar geometry, in Proceedings of the IEEE International Conference on Computer Vision, 2001.
- [24] , Oriented matching constraints., in Proceedings of the British Machine Vision Conference, 2001.