Asymptotic distribution-free change-point detection for data with repeated observations
Abstract
In the regime of change-point detection, a nonparametric framework based on scan statistics utilizing graphs representing similarities among observations is gaining attention due to its flexibility and good performances for high-dimensional and non-Euclidean data sequences, which are ubiquitous in this big data era. However, this graph-based framework encounters problems when there are repeated observations in the sequence, which often happens for discrete data, such as network data. In this work, we extend the graph-based framework to solve this problem by averaging or taking union of all possible optimal graphs resulted from repeated observations. We consider both the single change-point alternative and the changed-interval alternative, and derive analytic formulas to control the type I error for the new methods, making them fast applicable to large datasets. The extended methods are illustrated on an application in detecting changes in a sequence of dynamic networks over time. All proposed methods are implemented in an R package gSeg available on CRAN.
keywords:
and
1 Introduction
1.1 Background
Change-point analysis plays a significant role in various fields when a sequence of observations is collected. In general, the problem concerns testing whether or not a change has occurred, or several changes might have occurred, and identifying the locations of any such changes. In this paper, we consider the offline change-point detection problem where a sequence of independent observations is completely observed at the time when data analysis is conducted. Here, is the length of the sequence, and is the time index or other meaningful index depending on the specific application. We consider testing the null hypothesis
(1) |
against the single change-point alternative
(2) |
or the changed-interval alternative
(3) |
where and are two different distributions.
This problem has been extensively studied for univariate data; see Chen & Gupta (2011) for a survey. However, in many modern applications, is high-dimensional or even non-Euclidean. For high-dimensional data, most methods are based on parametric models; see for example Zhang et al. (2010); Wang et al. (2017); Wang & Samworth (2018). To apply these methods, the data sequence needs to follow specific parametric models. In the nonparametric domain, Harchaoui et al. (2009) made use of kernel methods, Lung-Yut-Fong et al. (2011) utilized marginal rankings, and Matteson & James (2014) made use of distances among observations. These methods can be applied to a wider range of problems than parametric methods. However, it is in general difficult to conduct theoretical analysis on nonparametric methods and none of these nonparametric methods provides analytic formulas for false discovery control, making them difficult to be applied to large datasets.
1.2 Graph-based change-point methods
Recently, Chen & Zhang (2015) and Chu & Chen (2019) developed a graph-based framework to detect the change-point for high-dimensional and non-Euclidean data sequences. This framework is based on a similarity graph , such as a minimum spanning tree (MST), which is a spanning tree that connects all observations with the sum of distances of the edges in the tree minimized, constructed on the sample space. Based on over , test statistics rely on three basic quantities, , , and , where for each is the number of edges connecting observations before and after , is the number of edges connecting observations prior to , and is the number of edges connecting observations after . Then, four scan statistics were studied: the original edge-count scan statistic , the weighted edge-count scan statistic , the generalized edge-count scan statistic , and the max-type edge-count scan statistic that can be applied to various alternatives. For detailed comparisons, see Chu & Chen (2019).
While the methods proposed by Chen & Zhang (2015) and Chu & Chen (2019) work well for continuous data, they are problematic for data with repeated observations, which is common for discrete data, such as network data. The reason is that these methods depend on the similarity graph constructed on observations. When there are repeated observations, the similarity graph is in general not uniquely defined and these methods are troublesome. For example, Chen & Zhang (2015) analyzed a phone-call network dataset and the task was to test whether there is any change in the dynamics of the networks over time. In this dataset, a few networks in the sequence are exactly the same. Chen & Zhang (2015) used the MST as the similarity graph. More specifically, in the phone-call network dataset, there are in total 330 networks in the sequence and among them are 290 distinct networks. For example, , , are exactly the same. All repeated observations are listed in Supplement A. Due to this, there are numerous ways in assigning edges in the MST with repeated observations. Hence, the MST is not uniquely defined and existing graph-based methods are not reliable since they are formulated by the unique similarity graph on pooled observations.
MST #1 | MST #2 | MST #3 | |
0.09 (2.32) | 0.91 (0.92) | 0.51 (1.57) | |
0.04 (13.61) | 0.08 (12.31) | 0.01 (16.36) | |
0.44 (2.11) | 0.02 (3.49) | 0.88 (1.54) | |
0.09 (3.05) | 0.02 (3.49) | 0.05 (3.27) |
Table 1 lists test statistics and their corresponding -values of four testing procedures proposed in Chen & Zhang (2015) and Chu & Chen (2019) on three randomly chosen MSTs. We see that the -value depends heavily on the choice of the MST: It could be very small under one MST, but very large on another MST, leading to completely different conclusions on whether the sequence is homogeneous or not. Moreover, since the number of possible MSTs is huge, it is impractical to compute the test statistics on all possible MSTs directly to get a summary.
1.3 Our contribution
We extend the methods in Chen & Zhang (2013) and Zhang & Chen (2017) to the change-point settings and propose new graph-based testing procedures that can deal with repeated observations properly. This work fills the gap for the graph-based framework in dealing with discrete data. We show that the new tests are asymptotically distribution-free under the null hypothesis of no change and reveal that the limiting distributions for two approaches in Chen & Zhang (2013) are the same, even for continuous data. We also derive analytic formulas to approximate permutation -values for those modified test statistics, making them fast applicable to real datasets. To improve the analytical -value approximations for finite sample sizes, skewness correction is also performed. We show that the proposed tests work well to detect the change when the data has repeated observations. We illustrate the new testing procedures through an analysis on phone-call network dataset. The new methods are implemented in an R package gSeg available on CRAN.
2 Notations and related existing works
For data with repeated observations, we represent the data using a contingency table for each . Suppose that there are in total observations and distinct values, which we also refer to as categories in the following. Each divides the sequence into two groups, before or at time (Group 1) and after time (Group 2). Let be the number of observations in group and category and be the number of observations in category . Notice that , , , and .
Index of distinct values | 1 | 2 | Total | ||
---|---|---|---|---|---|
Group 1 | |||||
Group 2 | |||||
Total |
In Chen & Zhang (2013) and Zhang & Chen (2017), the authors studied ways to extend the underlying graph-based two-sample tests to accommodate data with repeated observations under the two-sample testing framework. When data has repeated observations, the similarity graph is not uniquely defined based on common optimization criteria, such as the MST, leading to multiple optimal graphs. The authors considered two ways to incorporate information from these graphs: averaging and union. To be more specific, they first construct the similarity graph on the distinct values, denoted by . Here, could be the MST on all distinct values, the nearest neighbor link, the union of all possible MSTs on the distinct values, when the MST on the distinct values is not unique, or some other meaningful graphs. Then, the optimal graph initiated from is defined in the following way: For each pair of edges , randomly choose an observation with the value indexed by and an observation with the value indexed by , and connect these two observations; then, for each , if there are more than one observation (repeated observations) with the value indexed by , connect them by a spanning tree (any edges in this spanning tree has distance 0). More detail explanations for are provided in Supplement B. Based on these optimal graphs, averaging statistic is defined by averaging the test statistic over all optimal graphs and union statistic is defined by calculating the test statistic on the union of all optimal graphs.
3 Proposed tests
3.1 Extended test statistics for data with repeated observations
Here, we focus on extending the weighted, generalized, and max-type test statistics for repeated observations, which will turn out to be the asymptotic distribution-free tests. Details for extending the original edge-count test is in Supplement E. Based on the two-sample test statistics in Chen & Zhang (2013) and Zhang & Chen (2017), we could define the extended basic quantities at time under the averaging approach as follows:
(4) | ||||
(5) |
and under the union approach as follows:
(6) | ||||
(7) |
These are discrete data version of and to address infeasibility of computing test statistics in data with repeated observations. Hence, relatively large value of the sum of and or and could be the evidence against the null hypothesis.
Under the null hypothesis (1), the null distribution is defined to be the permutation distribution, which places probabilities on each of the permutations of . With no further specification, pr, , var, and cov denote the probability, the expectation, the variance, and the covaraince, respectively, under the permutation null distribution. Then, analytic formulas for the expectation and the variance of extended basic quantities can be calculated through combinatorial analysis and detailed expressions and proof can be found in Supplement C.
For any candidate value of , the extended test statistics can be defined as
where
with . Relatively large values of test statistics are the evidence against the null.
3.2 New scan statistics
Based on the extended statistics, we could define the scan statistics for the single change-point alternative to handle data with repeated observations as follows:
-
1.
Extended weighted edge-count scan statistic: & ,
-
2.
Extended generalized edge-count scan statistic: & ,
-
3.
Extended max-type edge-count scan statistic: & ,
where and are set to be pre-specified values. For example, we can set and in order to contain enough observations to represent the distribution.
Each scan statistic has its own characteristics aimed for different situations (see Section 5 for a comparison of them). For example, the extended weighted edge-count test is useful when a change-point occurs away from the middle of the sequence. The extended generalized edge-count test is effective under both location and scale alternatives. The extended max-type edge-count test is similar but gives more accurate -value approximation. The null hypothesis is rejected if the maxima is greater than a certain threshold. How to choose the threshold to control the type I error rate is described in Section 4.

For illustration, Figure 1 plots the processes of and for the first 50 observation generated from Multinomial and the second 50 observations generated from Multinomial with the nearest neighbor link constructed on the Euclidean distance. We see that both and peak at the true change-point (left panel). On the other hand, when there is no change-point, and have random fluctuations with smaller maximum values (right panel). Illustrations of other test statistics are provided in Supplement D.
For testing the null (1) against the changed-interval alternative (3), test statistics can be derived in a similar way to the single change-point case. For example, the extended generalized edge-count scan statistics are
for the averaging and union approaches, respectively, where and are the extended generalized edge-count statistics on the two samples: observations within and observations outside . The details of all statistics for the changed-interval alternative are in Supplement F.
4 Analytical -value approximation
4.1 Asymptotic distributions of the stochastic processes
We first consider the averaging approach. We are concerned with the tail distribution of the scan statistics under . Take the extended generalized edge-count scan statistic as an example, we want to compute
(8) |
for the single change-point alternative, and
(9) |
for the changed-interval alternative.
Under the null hypothesis, the scan statistics are defined as the permutation distribution. For small sample size , we can directly sample from the permutation distribution to compute the permutation -value. However, when is large, one needs to draw a large number of random permutations to get a good estimate of the -value, which is very time consuming. Hence, we seek to derive analytic approximations to these tail probabilities.
By the definition of , , and , stochastic processes , , and boil down to two pairs of basic processes: and for the single change-point case and and for the changed-interval case in the similar way. Therefore, we first study the properties of these basic stochastic processes. Let be the subgraph of containing all edges that connect to node , be the set of edges in that contains at least one node in , and and be the number of edges in the sets. To derive the asymptotic behavior of the stochastic processes in averaging approach, we work under the following conditions:
Condition 4.1.
, .
Condition 4.2.
.
Condition 4.3.
.
Condition 4.4.
.
Let denotes the largest integer that is no larger than .
Theorem 4.5.
The proof for this theorem uses the technique developed in Chen & Zhang (2015) that utilizes the Stein’s method (Chen & Shao, 2005). The details of the proof are in Supplement G.
Let and . The next theorem states explicit expressions for the covariance functions of the limiting Gaussian process, and . It is proved through combinatorial analysis and details are given in Supplement H.
Theorem 4.6.
The covariance functions of the Gaussian processes , and have the following expressions:
where and .
For the union approach, let be the set of graphs that the union of all possible optimal graphs between observations , be the subgraph of containing all edges that connect to node , and be the number of edges in the set. We work under
Condition 4.7.
.
Condition 4.8.
.
Condition 4.9.
Condition 4.10.
.
Theorem 4.11.
The details of the proof are in Supplement I.
Let and . The next theorem states explicit expressions for the covariance functions of the limiting Gaussian processes, , and . Its proof is in Supplement J.
Theorem 4.12.
The covariance functions of the Gaussian processes , and have the follwoing expressions:
Remark 1.
By Theorems 4.6, 4.12, we see that the limiting distributions for and are the same and do not depend on the graph at all. The same story for ’s. In addition, their covariance functions in Theorem 4.6 and 4.12 are the same as Theorem 4.3 in Chu & Chen (2019). Hence, limiting distributions of the extended graph-based tests based on , , are exactly the same as their corresponding versions for continuous data. On the other hand, the limiting distributions of the extended original edge-count scan statistics differ from their corresponding versions under the continuous setting (see details in Supplement E).
Remark 2.
Conditions 4.1–4.10 all constrain the number of repeated observations. Conditions 4.1 and 4.7 can usually be satisfied with an appropriate choice of . Conditions 4.2, 4.3, 4.8 and 4.9 constrain the degrees of nodes in the graph such that they cannot be too large. Condition 4.4 ensures that does not degenerate asymptotically so that is well defined. Similar for Condition 4.10.
We check these conditions through simulation studies (Supplement L) and see that some of them could be violated even when the -value approximation still works well. Zhu & Chen (2021) recently studied graph-based two-sample tests for continuous data and they proposed much more relaxed conditions than those in Chu & Chen (2019). They checked their conditions under both sparse and dense graphs and the conditions hold well. We believe that conditions for data with repeated observations under the change-point setting can also be relaxed. This requires substantial work and we leave this to our future research.
4.2 Asymptotic -value approximation
We now examine the asymptotic behavior of tail probabilities. Following similar arguments in the proof for Proposition 3.4 in Chen & Zhang (2015), we can obtain the foundation for analytical approximations to the probabilities. Assume that in a way such that for some and , .
Based on Theorem 4.5 and 4.11, as , for both averaging and union approaches, we have
where (Siegmund & Yakir, 2007) with and being the standard normal cumulative density function and probability density function, respectively, and
It can be shown that and .
Since and are independent and and are independent, for both averaging and union approaches, we have
In addition, following similar arguments in the proof for Proposition 4.4 in Chu & Chen (2019), we obtain the analytical -value approximations for the extended generalized edge-count test. Assume that in a way such that for some and , . Then, as , for both averaging and union approaches, we have
where . The analytical -value approximations for the changed-interval are in Supplement M.
Remark 3.
In practice, we use in place of , where is the finite-sample equivalent of . That is,
with . The explicit expression for is
It is clear from above expression that does not depend on the graph as well and it is easy to show that . The finite-sample equivalent of is exact the same as , that is,
where .
4.3 Skewness correction
The analytic -value approximations based on asymptotic results give ballpark estimates of the -values. However, they are in general not accurate enough if we set and close to the two ends and when the dimension is high (see the table in Section 4.4). The inaccuracy is largely attributed to the fact that the convergence of to the Gaussian distribution is slow when is close to 0 or 1.
To improve the analytical -value approximation, we add extra terms in the analytic formulas to correct for skewness. In our problem, the extent of the skewness depends on the value of . Hence, we adopt a skewness correction approach discussed in Chen & Zhang (2015) where different amount of the correction is done for different : In particular, this approach utilizes better approximation of the marginal probability by using the third moment, .
After skewness correction, the analytical -value approximations for averaging approach are
(10) |
where
(11) |
where
and , whose analytic expressions are provided in Supplement K. The skewness corrected analytical -value approximations for union approach and the changed-interval can be derived in the similar manner and details are provided in Supplement M.
Remark 4.
By jointly correcting for the marginal probabilities of and , we can derive skewness corrected -value approximations for (Chu & Chen, 2019). However, the integrand could be easily non-finite, so the method heavily relies on extrapolation. We thus do not perform skewness correction on and .
4.4 Checking analytical -value approximations under finite
We check the performance of analytical -value approximations obtained in Section 4.2 and 4.3. In particular, we compare the critical values for 0.05 -value threshold through analytical -value approximations based on asymptotic results and skewness correction to those obtained from doing 10,000 permutations under various simulation settings to check how analytical approximation works well for finite samples. Here, we focus on the extended max-type scan statistic for the single change-point alternative. The results for other scan statistics and the changed-interval alternative are provided in Supplement N.
We consider three distributions with different dimensions (Multinomial with equal probabilities (C1), Gaussian with repeated observations (C2), Multinomial with equal probabilities (C3)) and let be the nearest neighbor link constructed on Euclidean distance. The analytic approximations depend on constraints, and , on the region where the change-point is searched. To make things simple, we set .
Since analytical -value approximations without skewness correction do not depend on in the extended weighted, generalized, and max-type tests, the critical value is determined by , , and only. Notice that analytical -value approximations without skewness correction provide the same result in both averaging and union approaches. On the other hand, the skewness corrected approximated -values depend on certain characteristics of the graph structure. The structure of the nearest neighbor link depends on the underlying dataset, and thus the critical values vary by simulation runs.
Table 3 shows results of the extended max-type scan statistics. The first table labeled ‘A1’ presents the analytical critical values without skewness correction. ‘A2 (a)’ and ‘A2 (u)’ represent skewness corrected analytical critical values in averaging and union approaches, respectively, and ‘Per ’ and ‘Per ’ represent permutation critical values in averaging and union cases, respectively. We also show results for 2 randomly simulated sequences in each setting. We see that the asymptotic -value approximation is doing reasonably well. As window size decreases, the analytical critical values become less precise. However, skewness corrected approximation performs much better than the approximation without skewness correction. When the dimension is not too high, such as (C1), the skewness corrected analytical approximation is doing reasonably well for as low as 25. When the dimension is high, such as (C2) and (C3), the approximation performs well when .
A1 | 3.24 | 3.28 | 3.32 | 3.38 |
Critical Values | ||||||||
A2 | Per | A2 | Per | A2 | Per | A2 | Per | |
(C1) | 3.30 | 3.30 | 3.36 | 3.37 | 3.43 | 3.43 | 3.54 | 3.58 |
3.30 | 3.30 | 3.35 | 3.36 | 3.43 | 3.46 | 3.55 | 3.62 | |
(C2) | 3.36 | 3.34 | 3.44 | 3.45 | 3.56 | 3.59 | 3.72 | 3.98 |
3.34 | 3.36 | 3.42 | 3.47 | 3.53 | 3.64 | 3.76 | 4.03 | |
(C3) | 3.30 | 3.30 | 3.38 | 3.41 | 3.48 | 3.57 | 3.67 | 3.93 |
3.30 | 3.28 | 3.38 | 3.39 | 3.48 | 3.56 | 3.67 | 3.87 |
Critical Values | ||||||||
A2 | Per | A2 | Per | A2 | Per | A2 | Per | |
(C1) | 3.32 | 3.30 | 3.37 | 3.40 | 3.44 | 3.43 | 3.54 | 3.59 |
3.31 | 3.32 | 3.36 | 3.35 | 3.43 | 3.46 | 3.55 | 3.63 | |
(C2) | 3.35 | 3.36 | 3.42 | 3.43 | 3.51 | 3.52 | 3.62 | 3.80 |
3.34 | 3.39 | 3.40 | 3.46 | 3.48 | 3.55 | 3.67 | 3.84 | |
(C3) | 3.31 | 3.30 | 3.39 | 3.41 | 3.50 | 3.57 | 3.69 | 3.93 |
3.31 | 3.28 | 3.39 | 3.39 | 3.50 | 3.56 | 3.69 | 3.87 |
5 Performance of the new tests
We study the performance of the new tests in two aspects: (1) whether the test can reject the null hypothesis of homogenity when there is a change, and (2) if the test can reject , whether the test can estimate the location of the change-point accurately. We use the configuration model random graph to generate networks. Here, is the number of vertices and is a degree sequence on vertices, with being the degree of vertex . To generate configuration model random graphs, given a degree sequence, we choose a uniformly random matching on the degree stubs (half edges).
We generate a sequence of networks from the following model:
We explore two cases of the location of the change-point, in the middle and close to one end for vertices in this simulation. This dataset has repeated networks. Also, we consider two types of changes:
-
1.
An equal degree changes in the network (all elements in are 2 and 2 elements in are 4 and the rest are 2),
-
2.
A random degree changes in the network (all elements in are 2 and 2 elements in are randomly selected from 3 to 5 and the rest are 2).
That is, we present the results for the four combinations: an equal degree change at (S1), a random degree change at (S2), an equal degree change at (S3), and a random degree change at (S4).
For network at , we use an adjacency matrix to represent the network, with 1 for element if vertex and are connected, and 0 otherwise. We consider the dissimilarity defined as the number of different entries normalized by the geometric mean of the total edges in each of two networks, , where is the Frobenius norm of a matrix. We set to be the nearest neighbor link as a similarity graph for our new methods.
(S1) | 0.98 (0.96) | 0.96 (0.89) | 0.96 (0.94) | 0.95 (0.89) | 0.96 (0.95) | 0.95 (0.88) |
(S2) | 0.88 (0.83) | 0.89 (0.85) | 0.90 (0.84) | 0.91 (0.85) | 0.89 (0.83) | 0.90 (0.87) |
(S3) | 0.86 (0.83) | 0.65 (0.59) | 0.85 (0.83) | 0.85 (0.82) | 0.81 (0.80) | 0.70 (0.64) |
(S4) | 0.81 (0.81) | 0.73 (0.70) | 0.86 (0.84) | 0.93 (0.91) | 0.84 (0.81) | 0.86 (0.82) |
Table 4 shows the number of null rejection, out of 100, at 0.05 significance level for each method. For the accuracy of estimating the location of change-point, the count where the estimated change-point is within 20 from the true change-point is provided in parentheses when the null hypothesis is rejected. We see that all tests work well in the balanced equal degree changes case, while the extended generalized edge-count test outperforms in random degree changes case. In this simulation, equal degree changes would be considered the mean change and random degree changes would be considered the change in both location and scale. Hence, the extended generalized edge-count test and max-type edge-count test perform well in this general scenario. When the change-point is not in the center of the sequence, the extended weighted edge-count test outperforms, which complies with what we would expect. We see that the extended generalized edge-count test and max-type edge-count test work well when the change is in both mean and variance in the unbalnced sample size case.
6 Phone-call network data analysis
Here, we apply the new tests to the phone-call network dataset mentioned in Section 1 in details. The MIT Media Laboratory conducted a study following 87 subjects who used mobile phones with a pre-installed device that can record call logs. The study lasted for 330 days from July 2004 to June 2005 (Eagle et al., 2009). Given the richness of this dataset, one question of interest to answer is that whether there is any change in the phone-call pattern among subjects over time. This can be viewed as the change of friendship along time.
We bin the phone-calls by day and we construct of networks in total with 87 subjects as nodes. We encode each network by the adjacency matrix with value 1 for element if subject called on day and 0 otherwise. We define the distance measure as the number of different entries, i.e., , where is the Frobenius norm of a matrix. Due to the repeated observations, many equal distances among distinct values exist. We set to be the nearest neighbor link in this example.
We apply the single change-point detection method using the extended generalized scan statistic to the phone-call network dataset recursively so as to detect all possible change-points. As this dataset has a lot of noise, we focus on estimated change-points with -value less than 0.001. Figure 2 shows estimated change-points by averaging approach and union approach. We see that the two approaches produce quite a number of similar change-points. We define a change-point to be detected by both approaches if they each finds a change-point within the set . We then deem the location of the shared change-point to be the floor of the average of the two change-points detected by the two approaches.
Since we do not know the underlying distribution of the dataset, we perform more sanity check with the distance matrix of the whole period (Figure 3). It is evident that there are some signals in this dataset and they show comparably good match with our results from the new tests.

(a) | 53 | 68 | 97 | 140 | 164 | 247 | 289 | ||
---|---|---|---|---|---|---|---|---|---|
(u) | 28 | 52 | 66 | 79 | 140 | 164 | 247 | 293 | 323 |
Shared | 52 | 67 | 140 | 164 | 247 | 291 |

7 Discussion
In general, the two approaches work similarly; while they inevitably produce different results sometimes. A brief comparison of the two approaches is provided in Supplement O. The proposed methods detect the most significant single change-point or the changed-interval in the sequence. If the sequence has more than one change-point, the proposed methods can be applied recursively using techniques, such as binary segmentation, circular binary segmentation, or wild binary segmentation (Vostrikova, 1981; Olshen et al., 2004; Fryzlewicz et al., 2014).
Acknowledgement
Hoseung Song and Hao Chen are supported in part by NSF awards DMS-1513653 and DMS-1848579.
References
- Chen & Zhang (2015) Chen, H. & Zhang, N. (2015). Graph-based change-point detection. The Annals of Statistics 43, 139–176.
- Chen & Zhang (2013) Chen, H. & Zhang, N. R. (2013). Graph-based tests for two-sample comparisons of categorical data. Statistica Sinica , 1479–1503.
- Chen & Gupta (2011) Chen, J. & Gupta, A. K. (2011). Parametric statistical change point analysis: with applications to genetics, medicine, and finance. Springer Science & Business Media.
- Chen & Shao (2005) Chen, L. H. & Shao, Q.-M. (2005). Stein’s method for normal approximation. An introduction to Stein’s method 4, 1–59.
- Chu & Chen (2019) Chu, L. & Chen, H. (2019). Asymptotic distribution-free change-point detection for multivariate and non-euclidean data. The Annals of Statistics 47, 382–414.
- Eagle et al. (2009) Eagle, N., Pentland, A. S. & Lazer, D. (2009). Inferring friendship network structure by using mobile phone data. Proceedings of the national academy of sciences 106, 15274–15278.
- Fryzlewicz et al. (2014) Fryzlewicz, P. et al. (2014). Wild binary segmentation for multiple change-point detection. The Annals of Statistics 42, 2243–2281.
- Harchaoui et al. (2009) Harchaoui, Z., Moulines, E. & Bach, F. R. (2009). Kernel change-point analysis. In Advances in Neural Information Processing Systems.
- Lung-Yut-Fong et al. (2011) Lung-Yut-Fong, A., Lévy-Leduc, C. & Cappé, O. (2011). Homogeneity and change-point detection tests for multivariate data using rank statistics. arXiv preprint arXiv:1107.1971 .
- Matteson & James (2014) Matteson, D. S. & James, N. A. (2014). A nonparametric approach for multiple change point analysis of multivariate data. Journal of the American Statistical Association 109, 334–345.
- Olshen et al. (2004) Olshen, A. B., Venkatraman, E., Lucito, R. & Wigler, M. (2004). Circular binary segmentation for the analysis of array-based dna copy number data. Biostatistics 5, 557–572.
- Siegmund & Yakir (2007) Siegmund, D. & Yakir, B. (2007). The statistics of gene mapping. Springer Science & Business Media.
- Vostrikova (1981) Vostrikova, L. Y. (1981). Detecting “disorder” in multidimensional random processes. In Doklady Akademii Nauk, vol. 259. Russian Academy of Sciences.
- Wang et al. (2017) Wang, G., Zou, C. & Yin, G. (2017). Change-point detection in multinomial data with a large number of categories. Annals of Statistics .
- Wang & Samworth (2018) Wang, T. & Samworth, R. J. (2018). High dimensional change point estimation via sparse projection. Journal of the Royal Statistical Society: Series B (Statistical Methodology) 80, 57–83.
- Zhang & Chen (2017) Zhang, J. & Chen, H. (2017). Graph-based two-sample tests for discrete data. arXiv preprint arXiv:1711.04349 .
- Zhang et al. (2010) Zhang, N. R., Siegmund, D. O., Ji, H. & Li, J. Z. (2010). Detecting simultaneous changepoints in multiple sequences. Biometrika .
- Zhu & Chen (2021) Zhu, Y. & Chen, H. (2021). Limiting distributions of graph-based test statistics. arXiv preprint arXiv:2108.07446 .