Maximum Absolute Determinants of Upper Hessenberg Bohemian Matrices
Abstract
A matrix is called Bohemian if its entries are sampled from a finite set of integers. We determine the maximum absolute determinant of upper Hessenberg Bohemian Matrices for which the subdiagonal entries are fixed to be and upper triangular entries are sampled from , extending previous results for and and proving a recent conjecture of Fasi & Negri Porzio [8]. Furthermore, we generalize the problem to non-integer-valued entries.
keywords:
Upper Hessenberg, Bohemian Matrix, Maximum Absolute Determinant.11C20, 15A15, 15B36.
1 Introduction
Matrices whose entries are from a small subset of the integers are said to be Bohemian, an abbreviation of BOunded HEight Matrix of integers. These matrices appear in many different contexts, including adjacency matrices of graphs [10], Hadamard matrices [1, 2], random discrete matrices [4] and alternating sign matrices [3]. They have been studied for over a century and remain a subject of active research. The website [7] provides a comprehensive overview of recent results and open problems.
Recently, Chan et al. investigated several properties of the characteristic polynomials of upper Hessenberg Bohemian matrices [5], and Thornton et al. obtained a number of results concerning the distribution of their eigenvalues, characteristic heights, and maximum absolute determinant values [6]. These papers state several conjectures on the values of the determinants of Bohemian matrices [7], many of which have recently been solved and generalised by Massimiliano Fasi and Gian M. N. Porzio [8].
One of these conjectures, which is a refinement of a result of Li Ching [9], was until now lacking a solution that could be generalised. We here provide a generalisable solution for that problem and explore an extension of it. Specifically, we focus on the maximum absolute determinant of upper Hessenberg matrices with fixed subdiagonal entries and a given population of upper triangular entries. Our calculations provide a more comprehensive understanding of the behaviour of the determinants of these kind of matrices and include special cases that had previously been solved by other approaches.
2 Results
In 1993, Ching proved the following theorem.
Theorem 2.1.
The maximum absolute determinant of an upper-Hessenberg matrix with upper triangular entries from and subdiagonal entries fixed at is given by the Fibonacci sequence. [9]
Recently Fasi and Porzio have established the following theorem, proving a result conjectured by Thornton [7]:
Theorem 2.2.
The maximum absolute determinant of an upper-Hessenberg matrix with upper triangular entries from and subdiagonal entries fixed at is given by the following sequence. Let denote the maximum absolute determinant among these matrices, then , , and for all . [8]
It is a natural question what happens if the upper triangular population is . Fasi and Porzio stated the following conjecture in this context.
Conjecture 2.3.
The maximum absolute determinant of an upper-Hessenberg matrix with upper triangular entries from and subdiagonal entries fixed at is given by the following generalized Fibonacci sequence. Let denote the maximum absolute determinant among these matrices, then , , for all . [8]
We discuss the following, further generalisation of this problem.
Problem 2.4.
What is the maximum absolute determinant of an upper-Hessenberg matrix with upper triangular entries drawn from , , and subdiagonal entries taking a fixed value ?
If is negative the problem is relatively straightforward and the result can be stated as the following theorem, whose proof may be found in [8].
Theorem 2.5.
The maximum absolute determinant of an upper-Hessenberg matrix with upper triangular entries from and subdiagonal entries fixed at is given by for all .
However the proof of this theorem does not extend to .
We here solve Problem 2.4 in various regimes when . The first case we consider is when , the second is (where is a sufficiently small positive number depending on the dimension of the matrix) and the third case is . Our main results are contained in the following theorems:
Theorem 2.6.
The maximum absolute determinant of an upper-Hessenberg matrix with entries from the interval and subdiagonal entries fixed at , such that , is given by the following sequence. Let denote the maximum absolute determinant among these matrices; then , and for all in .
Remark 2.7.
Theorem 2.8.
The maximum absolute determinant of an upper-Hessenberg matrix with entries from the interval , subdiagonal entries fixed at and such that where is a sufficiently small positive number depending on is given by the following sequence. Let denote the maximum absolute determinant among these matrices; then , and for all in .
Theorem 2.9.
The maximum absolute determinant of an upper-Hessenberg matrix with entries from the interval and subdiagonal entries fixed at such that is given by .
Throughout this paper we use the following definitions and notation.
Definition 2.10.
The set of all upper Hessenberg matrices whose subdiagonal entries are fixed at and upper triangular entries are from the set is denoted by .
Definition 2.11.
For a given matrix , denote the determinant of the bottom-right part of by , and for convenience set for any matrix .
Notation 2.12.
For a given matrix , when referring to the matrix itself we use square brackets and when referring to the determinant of we use straight brackets. For instance, , . To avoid confusion, we use for absolute value sometimes.
3 Case I
Firstly we deal with the case of Problem 2.4. To prove Theorem 2.6 we introduce several lemmas. For convenience, set .
Lemma 3.1.
, .
Proof 3.2.
Suppose that maximum absolute determinant for matrices is attained by a matrix . Then the following inequality holds trivially
For the next lemma, we start by writing the determinant for any matrix using Laplace expansion twice, firstly for the first column of the matrix and secondly for the first rows of the resulting two matrices.
(3.1) |
(The last term depends on the parity of , if is even, it is .)
Now we state a new lemma which is inspired by the previous expansion.
Lemma 3.3.
For all k in we have the following inequality:
(3.2) |
and also, for the case when is even:
(3.3) |
Proof 3.4.
For (3.2), it is enough to show the following:
(3.4) |
It suffices to check four extreme cases of and , i.e,
Using , the triangle inequality, the definition of , and Lemma 3.1 we get the following inequalities for these four cases:
-
1.
:
-
2.
:
-
3.
:
-
4.
:
-
4.1
If :
-
4.2
If :
-
4.1
Using the triangle inequality and Lemma 3.3 in (3.1) yields the following inequality for :
(3.5) |
Note as well that we have , , .
Define a new sequence in the following way: , , and , for all . We state two simple lemmas concerning this sequence.
Lemma 3.5.
for all .
Proof 3.6.
We use induction on . For and , the equality is trivial. It is straightforward to verify that if the statement holds for and then it holds for where . By using the induction assumption:
Lemma 3.7.
for all .
Proof 3.8.
Lemma 3.9.
for all .
Proof 3.10.
By induction, it is clear that , .
Lemma 3.11.
for all .
Proof 3.12.
It suffices to give an example for which the absolute determinant value is equal to . Define an matrix as follows:
(3.6) |
i.e., if and is even; if and is odd. Note that , and by using Laplace expansion it is easy to see that
(3.7) |
which is the same recurrence relation as for . Hence, for all positive integers , we have .
In the next section we discuss what happens when . We are able to say something in two regimes.
4 Case II
We consider first the case when , i.e., is slightly greater than .
Define permutation matrices and in the following way: Let be obtained by interchanging the first two rows of the identity matrix and be obtained by interchanging the last two columns of the identity matrix. And then define in for as follows:
(4.1) |
We can determine the relation between the determinants of these matrices and the determinant of in the following way.
Using the Laplace expansion for the first column of we obtain
(4.2) |
and similarly
(4.3) |
for all . And again using the Laplace expansion for both the first column and the last row of we get
(4.4) |
for all , defining for convenience.
Proposition 4.1.
For the statement of Theorem 2.6, is the exact upper bound for . More explicitly, for any and , the matrix that gives the maximum absolute determinant is not .
Proof 4.2.
Proposition 4.3.
Proof 4.4.
Suppose that for an the maximizing matrix changes from to a matrix at . Then the absolute value of the determinant of must be equal to the absolute value of the determinant of . In [9], Li Ching proved that if and only if . Hence .
In view of the fact that has become an important matrix in our investigation, we establish the recurrence relation satisfied by its determinant in the following proposition.
Proposition 4.5.
, and satisfies the same recurrence relation as :
(4.7) |
for any .
Remark 4.7.
5 Case III
The third case we consider is when , i.e. is sufficiently larger than , depending on .
We start with an important observation which illustrates the reason why we are interested in instead of .
Observation 5.1.
The case when is excessively general to reach a conclusion on . More explicitly, for this case, there is no fixed matrix structure that gives the maximum absolute determinant for all values of .
Proof 5.2.
Consider the case when and and . By using MATLAB and testing all possible cases, it is not difficult to check that the maximum absolute determinant for this case is and this value is attained by only two matrices:
If there is a unique matrix that maximizes the absolute determinant value for all in , then it must be the matrix by Proposition 4.3. Hence must be the same with either or , but clearly it is not. So, the maximizing matrix still depends on the ratio .
The next theorem describes the case in Problem 2.4.
Theorem 5.3.
The maximum absolute determinant of an upper-Hessenberg matrix with entries from the interval and subdiagonal entries fixed at such that (i.e. can be taken sufficiently larger than for any case) is given by . (We shall go on afterwards to establish a precise lower bound for such that this statement holds.)
Define again. The proof is in two parts; the first is giving an example to show that , and the second showing that this example has the maximum absolute determinant. We start with the first part. Define a matrix for as follows:
(5.1) |
Proposition 5.4.
for all .
Proof 5.5.
For , it is clear. We use induction on , specifically, we assume that the statement is valid for and show that it then follows for . By using Laplace expansion for the last row, and the induction assumption:
Now, we define a new sequence of matrices depending on the parity of .
(5.2) |
Notice that contains a block full of ’s whereas contains a block; and note that if we define such that it would contain a block of ’s instead of a block it would not change the determinant value. For later convenience we denote the alternative version
Proposition 5.6.
for all .
Proof 5.7.
We need to show and . We are going to show only one, because the proofs in both cases are essentially identical.
Using Laplace expansion for the last row of (and by iteratively – exactly times – doing it to the last row of each the resulting matrices), and by using Proposition 5.4 we get:
Proposition 5.8.
for all .
Proof 5.9.
Using the previous proposition, this follows straightforwardly by induction.
As a corollary of the last proposition, we have
(5.3) |
for all .
So the first part of the proof is done. Next, we prove the reverse implication.
Since we are looking for the maximum absolute determinant it suffices to check instead of .
Notice that the determinant value of any is a polynomial in and . More explicitly, the determinant of is an element of the following set:
Recall that we assumed and already know (5.3). Then
(5.4) |
This means that the sign of the determinant which gives the maximum absolute determinant cannot be . So, its sign is .
By (5.3) and using again,
(5.5) |
So, the maximum absolute determinant must contain the term , i.e., the maximum absolute determinant is of the form .
Moreover, we also have the following
(5.6) |
This means that the maximum absolute determinant cannot contain the term , i.e., the maximum absolute determinant is of the form .
Again by and (5.3), we can state:
(5.7) |
Therefore, the coefficient of in the maximum absolute determinant must be at least .
Lemma 5.10.
For any matrix , if the coefficient of in is zero, then the absolute value of the coefficient of is at most .
Proof 5.11.
Consider the determinant as a polynomial with variable . It is easy to see that for
the determinant of can be expressed as follows:
(5.8) |
Notice that we can express the coefficient of in (5.11) as follows:
(5.11) |
The first equality at (5.10) means that there are at least zeros among the set
Suppose that there are and zeros in and respectively. Note that we can see the expansion (5.11) as an upper triangular half of an chessboard by observing that setting corresponds to colouring the row (from the top) black, and setting corresponds to colouring the column (from the right) black for and . (Notice that we do not colour for the cases and ) (See Figure 1).
The number of black squares is less than or equal to the number of zero terms in the expansion (5.11). We know that the total number of colored rows and columns is at least since . It is clear that to minimize the number of the black squares, coloured columns must be the leftmost ones and coloured rows must be lowermost ones, and must be or (depending on the parity of ). See Figure 2 for the minimizing example in the case .
It is easy to calculate that the minimum number of black squares is at least
(5.12) |
if is odd, and
(5.13) |
if is even.
Hence, in the expansion (5.11), we know that at least or terms are zero. And the number of terms in (5.11), is .
Now we are going to consider the entries of the matrix that makes the coefficient of equal to .
If is odd, say , we have a half chessboard, and we colour rows and columns in total. To make the coefficient , we colour exactly rows (the lowermost ones) and columns (the leftmost ones). This means we set
(5.16) |
(5.17) |
Additionally all the nonzero terms in (5.11) must be , which means
(5.18) |
(5.19) |
(5.20) |
because there is a term that corresponds to a white square in the chessboard representation for each .
We already know that the coefficient of in (5.11) is not zero, so
(5.22) |
For the case when is even, we have the same, but according to the choice of the difference between the number of zero terms in the sets and , i.e. , as or ; we are going to have or as defined at . From now on, we just consider the odd case, the even case can be done in exactly the same way.
Recall that we know that the maximum absolute determinant is
Hence, the coefficient of must be to have the maximum absolute determinant. Now we are going to show that this fact forces all entries with a question mark in (5.23) to be filled with .
Proposition 5.12.
Recall that we have defined in (5.1). If any of the entries of the triangular block of ’s in the upper half is instead of , then the determinant contains the term .
Proof 5.13.
Consider the following permutation:
This permutation gives the term , and because all the terms with have the same sign, does not vanish.
Proposition 5.14.
Consider the matrix in (5.23), if there is in any of the entries that are filled with a question mark, then the determinant has the term .
Proof 5.15.
Let be odd, the other case can be done by the same way. Suppose that the is placed in the top-left triangular block of ’s WLOG. Then consider the determinant as follows:
The determinant of the upper-left square contains the term by Proposition 5.12. And as we have boxed, there is a permutation that gives a term from the bottom-right square. When we consider these two together we get the term which is exactly what we are looking for.
As a corollary, we can state that all the entries with question mark must be filled with to have the maximum absolute determinant. Therefore the matrix which has the maximum absolute determinant is the matrix as we have defined at (5.2). QED.
Now we can discuss the condition . We have used this in our proof in the following ways in (5), (5.5), (5), (5) and (5.24). We can rewrite these inequalities setting :
(5.25) | ||||
(5.26) | ||||
(5.27) | ||||
(5.28) | ||||
(5.29) |
Note that except in the last inequality, these allow us to take asymptotic to . However, the last one requires . Fortunately we can overcome this issue using another way to tackle the problem. Recall that we deduced (5.29) from (5.24). Before stating (5.24), we found that the maximum absolute determinant has the form and the matrix with this determinant has the form (5.23)
Lemma 5.16.
Let
where . And let
(5.30) |
Then, for all .
Proof. Recall that in the permutation definition, if we consider the determinant as a function of , the absolute value of the coefficient of is:
(5.31) |
Then
(5.32) |
and
(5.33) |
Clearly some of the terms are going to vanish such as the ones starting with because is already determined as . Now we are going to define a function from non-vanishing terms in (5.33) to non-vanishing terms in (5.32). (From now on we consider them not as numbers, but as sequences of ’s) Define the function
as follows:
-
•
If , we have ,
-
•
If , then , we have ,
We are going to show that the preimage of any element in the range has cardinality less than or equal to .
Consider the preimage of an arbitrary element in the range of ,
Define a set if , otherwise
Similarly define if , otherwise
It is not difficult to see that
and for the cardinality of we have
Therefore, we get that the preimage of any element in has cardinality less than or equal to and this means .
Note that:
and similarly
Hence
So having is sufficient for the last case, namely (5.29) is not a requirement anymore if . Then there are four inequalities left that we still need to deal with: (5.25)-(5.28).
Lemma 5.17.
If these inequalities hold for .
Proof 5.18.
For the third case (5.27),
Remark 5.19.
Remark 5.20.
6 Concluding Remarks
We found the maximum absolute determinants (and the matrices giving the corresponding values) for the cases (Theorem 2.9) and (Theorem 2.6); in addition we already know the case (Theorem 2.5). Furthermore we showed that is the exact upper bound of for Theorem 2.6 (Proposition 4.1), and found the first maximizing matrix after (Proposition 4.3) with the recurrence relation satisfied by its absolute determinant value (Proposition 4.5). Nevertheless, for the most part the problem as to what happens if still remains open. Note that as goes to from , the matrix which gives the maximum absolute determinant changes from having a localized to a much more uniform structure. In view of these observations, further investigations might be based on the following questions:
-
1.
Which matrices maximize the absolute determinant as goes to from , in other words, what are the transition forms?
-
2.
It is reasonable to work on the case and as a first step to understanding the transition form. Is the maximizing matrix in this case when ?
-
3.
How many times do transitions occur depending on the size of the matrices?
-
4.
We know that for the first transition occurs at from to . When does the second transition occur, in other words what is the maximum possible value of in Remark 4.7?
-
5.
What should be the precise lower bound in Theorem 5.3? Is it asymptotic to ?
-
6.
When do other transitions occur?
- 7.
-
8.
Moreover, there is no alternating sign among the nonzero permutations in the determinants for any of these three matrices. Is this condition valid for all maximizing matrices?
Acknowledgements
We thank Professor N. J. Higham for drawing our attention to questions concerning Bohemian Matrices. The work reported here was carried out as part of an undergraduate summer research project by Mr Ahmet Abdullah Keleş at the University of Bristol, June–July 2019. Mr Keleş is grateful for financial support and the hospitality of the School of Mathematics at the University of Bristol during his visit. JPK was supported by a Royal Society Wolfson Research Merit Award, EPSRC Programme Grant EP/K034383/1 LMF: -Functions and Modular Forms, and by ERC Advanced Grant 740900 (LogCorRM).
References
- [1] Sylvester J. J. “Thoughts on inverse orthogonal matrices, simultaneous sign successions, and tessellated pavements in two or more colours, with applications to Newton’s rule, ornamental tile-work, and the theory of numbers.” Philos Mag. 1867; 34: 461-475.
- [2] Hadamard J. “Resolution d’une question relative aux determinants.” Bull Sci Math. 1893; 17: 240– 246.
- [3] Bressoud, D. Proofs and Confirmations: The Story of the Alternating Sign Matrix Conjecture. Cambridge, England: Cambridge University Press, 1999.
- [4] Vu V. “Random discrete matrices.” Bolyai Soc Math Stud 17 (2008), 257-280.
- [5] Chan, E. Y. S., Corless. R. M., Gonzalez-Vega, L., Sendra, J. R., Sendra, J. and Thornton, S.E. “Bohemian Upper Hessenberg Matrices.” arXiv:1809.10653, September 2018.
- [6] Thornton, Steven E. “Algorithms for Bohemian Matrices” (2019). Electronic Thesis and Dissertation Repository. 6069. https://ir.lib.uwo.ca/etd/6069
- [7] Thornton, Steven E. “The Characteristic Polynomial Database.” published electronically at http://bohemianmatrices.com/cpdb, 08.07.2019.
- [8] Fasi, Massimiliano and Negri Porzio, Gian Maria. “Determinants of Normalized Bohemian Upper Hessemberg Matrices.” MIMS EPrint 2019.7, Manchester Institute for Mathematical Sciences, The University of Manchester, UK, 2019. To appear in Electron. J. Linear Algebra.
- [9] Ching L. “The maximum determinant of an lower Hessenberg matrix.” Linear Algebra and its Applications, 183 (1993): 147-153.
- [10] Harary, F. “The determinant of the adjacency matrix of a graph.” SIAM Rev., 4 (1962): 202-210.