To appear, Journal of Fractal Geometry
Constrained quantization for the Cantor distribution
Abstract.
The theory of constrained quantization has been recently introduced by Pandey and Roychowdhury. In this paper, they have further generalized their previous definition of constrained quantization and studied the constrained quantization for the classical Cantor distribution. Toward this, they have calculated the optimal sets of -points, th constrained quantization errors, the constrained quantization dimensions, and the constrained quantization coefficients, taking different families of constraints for all . The results in this paper show that both the constrained quantization dimension and the constrained quantization coefficient for the Cantor distribution depend on the underlying constraints. It also shows that the constrained quantization coefficient for the Cantor distribution can exist and be equal to the constrained quantization dimension. These facts are not true in the unconstrained quantization for the Cantor distribution.
Key words and phrases:
Cantor distribution, constrained quantization error, optimal sets of -points, constrained quantization dimension, constrained quantization coefficient2010 Mathematics Subject Classification:
Primary 28A80; Secondary 94A34, 60Exx.1. Introduction
Real-life problems, such as information theory, data compression, signal processing, etc., consist of a large number of data that is not easy to handle. In order to deal with such a data set, the theory of quantization comes into play (see, for instance, [1, 2, 3, 4, 5, 6, 7]). Quantization is a process of discretization, in other words, to represent a set with a large number of elements, discrete or continuous, by a set with a smaller number of elements. Several mathematical theories have been introduced in the literature concerning the process of quantization. Graf and Luschgy gave the rigorous mathematical treatment in [2]. In [8], Graf and Luschgy studied the quantization problem for the canonical probability measure on the classical Cantor set.
Recently, in [9], the authors introduced the concept of constrained quantization. A quantization without a constraint is referred to as an unconstrained quantization, which traditionally in the literature is referred to as a quantization, as mentioned in the previous paragraph. The theory of constrained quantization is a fascinating area of research, and it invites a lot of new areas to work with a number of applications. With the introduction of constrained quantization, quantization now has two classifications: constrained quantization and unconstrained quantization. In this paper, the authors have further generalized the definition of constrained quantization given in [9], and study the concept of constrained quantization for the canonical probability measure on the classical Cantor set.
Definition 1.1.
Let be a Borel probability measure on space equipped with a metric induced by a norm on , and . Let , where denotes the set of all natural numbers, be a family of closed sets with nonempty. Then, for , the th constrained quantization error for , of order with respect to the family of constraints , is defined by
(1) |
where represents the cardinality of the set .
The number
is called the distortion error for , of order , with respect to a set . The sets are the constraints in the constrained quantization error. We assume that to make sure that the infimum in (1) exists (see [9]). A set for which the infimum in (1) exists and does not contain more than elements is called an optimal set of -points for . Elements of an optimal set are called optimal elements.
Remark 1.2.
In Definition 1.1 of constrained quantization error, if all for are identical, then it reduces to the definition of constrained quantization error introduced by Pandey and Roychowdhury in [9]. Furthermore, if for all , then it reduces to the definition of th unconstrained quantization error, which traditionally in the literature is referred to as the th quantization error (see [2]). For some recent work in the direction of unconstrained quantization, one can see [2, 8, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20].
Let be a strictly decreasing sequence, and write . Then, the number defined by
(2) |
if it exists, is called the constrained quantization dimension of of order . The constrained quantization dimension measures the speed at which the specified measure of the constrained quantization error converges as tends to infinity. For any , the number
(3) |
if it exists, is called the -dimensional constrained quantization coefficient for of order .
Remark 1.3.
In unconstrained quantization, . Hence, in unconstrained quantization, i.e., when , the definitions of constrained quantization dimension and the -dimensional constrained quantization coefficient defined by (2) and (3), respectively, reduce to the corresponding definitions in unconstrained scenario (see [2]).
This paper deals with and , and the metric on as the Euclidean metric induced by the Euclidean norm . Instead of writing and we will write them as and , i.e., is omitted in the subscript as throughout the paper. Let us take the family , that occurs in Definition 1.1 as follows:
(4) |
Let be two contractive similarity mappings such that and . Then, there exists a unique Borel probability measure on such that , where denote the image measures of with respect to for (see [21]). If , and , then we call a word of length over the alphabet , and denote it by . By , we denote the set of all words, including the empty word . Notice that the empty word has a length zero. For any word , we write
Then, the set is known as the Cantor set generated by the two mappings and , and equals the support of the probability measure , where can be written as
For this probability measure , Graf and Luschgy determined the optimal sets of -means and the th quantization errors for all (see [8]). They also showed that the unconstrained quantization dimension of the measure exists and equals , which is the Hausdorff dimension of the Cantor set , and the unconstrained quantization coefficient does not exist. In fact, in [8], they showed that the lower and the upper unconstrained quantization coefficients exist as finite positive numbers.
1.4. Delineation
In this paper, first, we have determined the optimal sets of -points and the th constrained quantization errors for all for the Borel probability measure with support the Cantor set . Then, we have calculated the constrained quantization dimension and the constrained quantization coefficient. We have shown that both the constrained quantization dimension and the -dimensional constrained quantization coefficient exist and are equal to one, i.e., they coincide. Then, in the last section, taking different families of constraints for all , we investigate the optimal sets of -points, th constrained quantization errors, the constrained quantization dimensions, and the constrained quantization coefficients. From work in this paper, it can be seen that the constrained quantization dimension of the Cantor distribution depends on the family of constraints , i.e., the constrained quantization dimension is not always equal to the Hausdorff dimension of the Cantor set as it occurs in the case of unconstrained quantization (see [8]). In the unconstrained quantization, the -dimensional quantization coefficient does not exist (see [8]). But from work in the last section, we see that the -dimensional constrained quantization coefficient also depends on the constraints; it may or may not exist.
2. Preliminaries
In this section, we give some basic notations and definitions which we have used throughout the paper. As defined in the previous section, let be an alphabet. For any two words and in , by , we mean the word obtained from the concatenation of the two words and . For , is called an extension of if for some word . The mappings such that and are the generating maps of the Cantor set , which is the support of the probability measure on given by . For , write , where is a composition mapping. Notice that . Then, for any , as mentioned before, we have
The elements of the set are the intervals in the th level in the construction of the Cantor set , and are known as the basic intervals at the th level. The intervals , , into which is split up at the th level are called the children of .
With respect to a finite set , by the Voronoi region of an element , it is meant the set of all elements in which are nearest to among all the elements in , and is denoted by . For any two elements and in , we write
which gives the squared Euclidean distance between the two elements and . Let and be two elements that belong to an optimal set of -points for some positive integer . Then, and are called adjacent elements if they have a common boundary in their own Voronoi regions. Let be an element on the common boundary of the Voronoi regions of the adjacent elements and . Since the common boundary of the Voronoi regions of any two adjacent elements is the perpendicular bisector of the line segment joining the elements, we have
We call such an equation a canonical equation. Notice that any element can be identified as an element . Thus, the nonnegative real-valued function on defined by
represents the squared Euclidean distance between an element and an element . Let such that for any denote the projection mapping. For a random variable with distribution , let represent the expected value, and represent the variance of .
The following lemmas are well-known (see [8]).
Lemma 2.1.
Let be Borel measurable and . Then,
Lemma 2.2.
Let be a random variable with probability distribution . Then, . Moreover, for any , we have
Remark 2.3.
For words in , by we mean the conditional expectation of the random variable given i.e.,
Recall Lemma 2.1, for each , since is a similarity mapping, we have
Definition 2.4.
For with , let be the unique natural number with and . For with card let be the set such that
Proposition 2.5.
Proof.
If , then the subset can be chosen in different ways, and so, the number of such sets is given by , and the corresponding distortion error is obtained as
Thus, the proof of the proposition is complete. ∎
In the next sections, we give the main results of the paper.
3. Optimal sets of -points for all
In this section, we calculate the optimal sets of -points and the th constrained quantization errors for all . For , we have the constraints as
For all , the perpendiculars on the constraints passing through the points intersect at the points , where . Thus, for each , there exists a one-one correspondence between the element on and the element on . Thus, for , there exist bijective functions
(5) |
Hence, the inverse functions are defined as
Remark 3.1.
Proposition 3.2.
An optimal set of one-point is with constrained quantization error .
Proof.
Let be an optimal set of one-point. Since , we have . Now, the distortion error for with respect to the set is give by
the minimum value of which is and it occurs when . Thus, the proof of the proposition is complete. ∎
The following lemma plays an important role in the paper.
Lemma 3.3.
Let be an optimal set of -points for such that
where . Then, and , where are the Voronoi regions of the elements with respect to the set for .
Proof.
Let , as given in the statement of the lemma, be an optimal set of -points. Take any . Since , we can assume that , i.e., for some . Since the Voronoi region of , i.e., has positive probability, contains some basic intervals from that generates the Cantor set . Let , where is an index set, be the family of all basic intervals that are contained in . Now, the distortion error contributed by in its Voronoi region is given by
The above expression is minimum if both and
are minimum, i.e., when
Since is chosen arbitrarily, the proof of the lemma is complete. ∎
Remark 3.4.
Remark 3.5.
Lemma 3.3 implies that if is an optimal set of -points for , then for all .
Proposition 3.6.
The set forms an optimal set of two-points with constrained quantization error .
Proof.
Due to symmetry, the distortion error due to the set is given by
Let , where , be an optimal set of two-points. Since is the constrained quantization error for two-points, we have . We first show that . Suppose that . Then,
which leads to a contradiction. Suppose that . Then, the Voronoi region of does not contain any element from . For the sake of contradiction, assume that the Voronoi region of contains elements from . Then, we must have implying , which gives a contradiction. Thus, we see that is contained in the Voronoi region of . Hence,
which gives a contradiction. Assume that . Then, the distortion error is obtained as
which yields a contradiction. Hence, we can assume that , i.e., . Similarly, we can show that , i.e., . Hence, for . Notice that then the Voronoi region of does not contain any element from , and the Voronoi region of does not contain any element from yielding
with constrained quantization error . Thus, the proof of the proposition is complete. ∎
Lemma 3.7.
Let be an optimal set of three-points. Then, for , and does not contain any element from the open line segment joining and .
Proof.
The distortion error due to the set is given by
Let be an optimal set of three-points such that . Since is the constrained quantization error for three-points, we have . Let us first show that , i.e., . We prove it by contradiction. Notice that
if . Choose a number , and consider the following cases:
Case 1. .
Then,
which gives a contradiction.
Case 2. .
Then, we have implying . Hence,
which leads to a contradiction.
Case 3. .
Then, we have implying . Hence,
which gives a contradiction.
Taking into account the above cases, we see that a contradiction arises. Hence, . Reflecting the above arguments with respect to the line , we can show that . Thus, for . We now show that does not contain any element from the open line segment joining and . For the sake of contradiction, assume that contains an element from the open line segment joining and . Since for , we must have . The following cases can happen:
Case I. .
Then, we have implying . Again, . Hence,
which yields a contradiction.
Case II. .
This case is the reflection of Case I with respect to the line . Hence, a contradiction also arises in this case.
Considering Case I and Case II, we can deduce that does not contain any element from the open line segment joining and . Thus, the proof of the lemma is complete. ∎
Proposition 3.8.
Let be an optimal set of -points for all . Then, for , and does not contain any element from the open line segment joining and .
Proof.
For all , let us first prove that . By Proposition 3.6 and Lemma 3.7, it is true for . Using a similar technique as Lemma 3.7, we can also prove that it is true for any . However, here we give a general proof for all . The distortion error due to the set is given by
Since is the constrained quantization error for -points with , and is a decreasing sequence, we have . Let be an optimal set of -points such that . For the sake of contradiction, assume that . Then, , and so,
which is a contradiction. Hence, we can assume that . Similarly, we can show . Thus, the proof of the first part of the proposition is complete. We now show that does not contain any element from the open line segment joining and . For the sake of contradiction, assume that contains an element from the open line segment joining and . Let
Since , we have . Thus, we see that . Again, recall that the Voronoi regions of the elements in an optimal set of -points must have positive probability. Hence, we have .
The following cases can happen:
Case 1. .
Then, implying . Hence,
which gives a contradiction.
Case 2. .
Then, implying . Again, implying . Hence,
which leads to a contradiction.
Case 3. .
Then, implying . Hence,
which gives a contradiction.
Considering the above all possible cases, we see that a contradiction arises. Hence, does not contain any element from the open line segment joining and . Thus, the proof of the proposition is complete. ∎
Corollary 3.9.
Let . Then, the Voronoi region of any element in does not contain any element from , and the Voronoi region of any element in does not contain any element from .
Proof.
Let be an optimal set of -points such that . By Proposition 3.8, we see that contains elements from both and , and does not contain any element from the open line segment joining and . Let
Then, . For the sake of contradiction, assume that the Voronoi region contains an element from . Then, yielding , which is a contradiction. Similarly, we can show that if the Voronoi region contains an element from , then a contradiction arises. Thus, the proof of the corollary is complete. ∎
Lemma 3.10.
For let be an optimal set of -points. Set , , and . Then, is an optimal set of -points, is an optimal set of -points, and
Proof.
By Proposition 3.8, we have . Proceeding in a similar way as Lemma 3.3, we can show that
Hence,
implying
(6) |
If is not an optimal set of -points for , then there exists a set with such that . But then, is a set of cardinality . being the constrained quantization error for -points, we have
(7) |
Notice that
(8) |
Hence, by (6), (7), and (3), we have
which leads to a contradiction. Hence, is an optimal set of -points. Similarly, we see that is an optimal set of -points, and hence,
Thus, the proof of the lemma is complete. ∎
In view of Lemma 3.10, we give the following example.
Example 3.11.
Let us state and prove the following theorem, which gives the optimal sets of -points for all .
Theorem 3.12.
Let be a unique Borel probability measure on with support the Cantor set generated by the two contractive similarity mappings and for all . Then, the set given by Definition 2.4 forms an optimal set of -points for with the corresponding constrained quantization error , where is given by Proposition 2.5.
Proof.
We will proceed by induction on . If , then the theorem is true by Proposition 3.6. Let us assume that the theorem is true for all , where and . We now show that the theorem is true if . Let be an optimal set of -points for such that . Set , , , and . Then, by Lemma 3.10, we have
(9) |
Without any loss of generality, we can assume that . Let be such that
(10) |
We will show that . Since , we have and . Hence, and . If is the distortion error due to the set
where with card, then by Proposition 2.5, we have
Thus, by the induction hypothesis, we have , and then Equation (9) implies that
i.e.,
which is the same as Equation 5.9 in [8]. Thus, proceeding in a similar way as [8], we have . By Lemma 3.10, is an optimal set of -points, is an optimal set of -points. Moreover, we have proved , and . Hence, by the induction hypothesis,
where with card; and
where with card. Then, notice that
and
Take , and then
Hence,
Thus, we have
and using Equation (9), we have the constrained quantization error as
Thus, the theorem is true if . Hence, by the induction principle, the proof of the theorem is complete. ∎
4. Constrained quantization dimension and constrained quantization coefficient
Let be a unique Borel probability measure on with support the Cantor set generated by the two contractive similarity mappings and for all . Since the Cantor set under investigation satisfies the strong separation condition, with each having a contracting factor of , the Hausdorff dimension of the Cantor set is equal to the similarity dimension. Hence, from the equation , we have . It is known that the unconstrained quantization dimension of the probability measure exists and equals the Hausdorff dimension of the Cantor set (see [8]). The work in this section shows that it is not true in the constrained case, i.e., the constrained quantization dimension of the probability measure , though it exists, is not necessarily equal to the Hausdorff dimension of the Cantor set. In this section, we show that the constrained quantization dimension exists and equals one. We further show that the -dimensional constrained quantization coefficient exists as a finite positive number and equals , which is also not true in the unconstrained quantization for the Cantor distribution.
Theorem 4.1.
The constrained quantization dimension of the probability measure exists, and .
Proof.
For with , let be the unique natural number such that . Then, . By Theorem 3.12, we see that and as , and so as , i.e., . We can take large enough so that . Then,
yielding
Notice that
Hence, , i.e., the constrained quantization dimension of the probability measure exists and . Thus, the proof of the theorem is complete. ∎
Theorem 4.2.
The -dimensional constrained quantization coefficient for is a finite positive number and equals the constrained quantization dimension .
Proof.
For with , let be the unique natural number such that . Then, and . Hence,
which implies
yielding
Hence, by the squeeze theorem, we have
Again, as shown in the proof of Theorem 4.1, we have . Thus, we deduce that
i.e., the -dimensional constrained quantization coefficient for exists as a finite positive number and equals the constrained quantization dimension , which is the theorem. ∎
5. Constrained quantization with some other families of constraints
As defined in the previous sections, let be the unique Borel probability measure on with support the Cantor set generated by the two contractive similarity mappings and for all . In this section, in the following subsections, we give the optimal sets of -points and the th constrained quantization errors for different families of constraints. Then, for each family, we investigate the constrained quantization dimension and the constrained quantization coefficient.
5.1. Constrained quantization when the family is for .
Using the similar arguments as Lemma 3.3, it can be shown that if is an optimal set of -points for , then for all . Let us first state the following theorem, the proof of which is similar to Theorem 3.12.
Theorem 5.1.1.
For with let be the unique natural number with . For with card let be the set such that
Then, forms an optimal set of -points for with constrained quantization error
where is the variance.
5.2. Constrained quantization when the family is for .
Proceeding in a similar way as Theorem 3.12, we can show that the following theorem is true.
Theorem 5.2.1.
For with let be the unique natural number with . For with card let be the set such that
Then, forms an optimal set of -points for with constrained quantization error
where is the variance.
Notice that here . If is the Hausdorff dimension of the Cantor set , then, . Then, the following lemma and theorems are equivalent to the lemma and theorems that appear in the last section in [8]. For the proofs, one can consult [8].
Theorem 5.2.2.
The set of accumulation points of the sequence equals
where is such that .
Lemma 5.2.3.
Let . Then,
Theorem 5.2.4.
The constrained quantization dimension of equals the Hausdorff dimension of the Cantor set, i.e.,
Remark 5.2.5.
Thus, in this case, we see that the constrained quantization dimension exists and equals the Hausdorff dimension of the Cantor set, but the constrained quantization coefficient does not exist.
5.3. Constrained quantization when the family is for .
Using the similar arguments as Lemma 3.3, it can be shown that if is an optimal set of -points for , then for all . Let us first state the following theorem, the proof of which is similar to Theorem 3.12.
Theorem 5.3.1.
For with let be the unique natural number with . For with card let be the set such that
Then, forms an optimal set of -points for with constrained quantization error
where is the variance.
Remark 5.3.2.
By Theorem 5.3.1, we see that for the family , where , the optimal sets of -points and the corresponding constrained quantization error coincide with the optimal sets of -means and the corresponding quantization error for the Cantor distribution that occurs in [8]. Thus, all the results in [8] are also true here.
Acknowledgement
The first author is grateful to her supervisor Professor Tanmoy Som of the IIT(BHU), Varanasi, India, for his constant support in preparing this manuscript.
Declaration
Conflicts of interest. We do not have any conflict of interest.
Data availability: No data were used to support this study.
Code availability: Not applicable
Authors’ contributions: Each author contributed equally to this manuscript.
References
- [1] A. Gersho and R. M. Gray. Vector quantization and signal compression, volume 159. Springer Science & Business Media, 2012.
- [2] S. Graf and H. Luschgy. Foundations of quantization for probability distributions. Springer, 2007.
- [3] A. Gyorgy and T. Linder. On the structure of optimal entropy-constrained scalar quantizers. IEEE Transactions on Information Theory, 48(2):416–427, 2002.
- [4] R. M. Gray and D. L. Neuhoff. Quantization. IEEE Transactions on Information Theory, 44(6):2325–2383, 1998.
- [5] D. Pollard. Quantization and the method of k-means. IEEE Transactions on Information theory, 28(2):199–205, 1982.
- [6] P. Zador. Asymptotic quantization error of continuous signals and the quantization dimension. IEEE Transactions on Information Theory, 28(2):139–149, 1982.
- [7] R. Zamir. Lattice Coding for Signals and Networks: A Structured Coding Approach to Quantization, Modulation, and Multiuser Information Theory. Cambridge University Press, 2014.
- [8] S. Graf and H. Luschgy. The quantization of the Cantor distribution. Mathematische Nachrichten, 183(1):113–133, 1997.
- [9] M. Pandey and M.K. Roychowdhury. Constrained quantization for probability distributions. arXiv preprint arXiv:2305.11110, 2023.
- [10] Q. Du, V. Faber, and M. Gunzburger. Centroidal voronoi tessellations: Applications and algorithms. SIAM review, 41(4):637–676, 1999.
- [11] C. P. Dettmann and M.K. Roychowdhury. Quantization for uniform distributions on equilateral triangles. Real Analysis Exchange, 2017.
- [12] S. Graf and H. Luschgy. Quantization for probability measures with respect to the geometric mean error. In Mathematical Proceedings of the Cambridge Philosophical Society, volume 136, pages 687–717. Cambridge University Press, 2004.
- [13] M. Kesseböhmer, A. Niemann, and S. Zhu. Quantization dimensions of compactly supported probability measures via rényi dimensions. Transactions of the American Mathematical Society, 2023.
- [14] Eugen Mihailescu and M.K. Roychowdhury. Quantization coefficients in infinite systems. Kyoto Journal of Mathematics, 2015.
- [15] G. Pena, H. Rodrigo, M.K. Roychowdhury, J. Sifuentes, and E. Suazo. Quantization for uniform distributions on hexagonal, semicircular, and elliptical curves. Journal of Optimization Theory and Applications, 188:113–142, 2021.
- [16] K. Pötzelberger. The quantization dimension of distributions. In Mathematical Proceedings of the Cambridge Philosophical Society, volume 131, pages 507–519. Cambridge University Press, 2001.
- [17] M.K. Roychowdhury. Quantization and centroidal Voronoi tessellations for probability measures on dyadic Cantor sets. Journal of Fractal Geometry, 4(2):127–146, 2017.
- [18] M.K. Roychowdhury. Least upper bound of the exact formula for optimal quantization of some uniform Cantor distribution. Discrete & Continuous Dynamical Systems: Series A, 38(9), 2018.
- [19] M.K. Roychowdhury. Optimal quantization for the Cantor distribution generated by infinite similutudes. Israel Journal of Mathematics, 231:437–466, 2019.
- [20] M.K. Roychowdhury. Optimal quantization for mixed distributions. Real Analysis Exchange, 46(2), 2021.
- [21] J. E. Hutchinson. Fractals and self similarity. Indiana University Mathematics Journal, 30(5):713–747, 1981.