A Model of Distributed Disorders Detection
Abstract.
The paper deals with disorders detection in the multivariate stochastic process. We consider the multidimensional Poisson process or the multivariate renewal process. This class of processes can be used as a description of the distributed detection system. The multivariate renewal process can be seen as the sequence of random vectors, where parts of its coordinates are holding times, others are the size of jumps and the index of stream, at which the new event appears. It is assumed that at each stream two kinds of changes are possible: in the holding time or in the size of jumps distribution. The various specific mutual relations between the change points are possible. The aim of the research is to derive the detectors which realize the optimal value of the specified criterion. The change point moment estimates have been obtained in some cases. The difficulties have appeared for the dependent streams with unspecified order of change points. The presented results suggest further research on the construction of detectors in the general model.
Key words and phrases:
Disorder problem, change-point problems, quickest detection, sequential procedure, stopping time, dynamic programming, Markov process.1991 Mathematics Subject Classification:
Primary: 93E10, 62L15; Secondary: 93E03, 60G40 .Krzysztof J. Szajowski∗
Wrocław University of Science and Technology
Faculty of Pure and Applied Mathematics
Wybrzeże Wyspiańskiego 27, 50-370 Wrocław, Poland
(Communicated by the associate editor Arnd Rösch)
1. Introduction
The subject of discussion is the model that describes the phenomenon of piecewise deterministic signals. Time-intervals between jumps are random variables and jumps’ size is also random. The modeled object, for a random period, is in a homogeneous state. In a random time the signal changes its nature and time between jumps, although further random, but have a different distribution, or the size of jumps changes its distribution. After the change is a time homogeneous process and at random time another change. In the present case, there may be one change in the distribution of time between notifications and one change in the distribution of jumps’ size. The aim is to locate the two changes in real time. The other change in the behavior of the process does not continue to alter. Signals of this nature appear in technical issues, medicine, and finance.
1.1. A preliminary consideration
In this paper the construction of the mathematical model of the phenomenon described above requires the determination of a probabilistic space in which all random quantities, random variables and stochastic processes, are defined. Let be fixed probability space. The consideration will focus on the renewal-reward model of change points detection. A renewal–reward process is a jump process with a general holding time distributions and general distribution of jumps (see Brémaud [Brémaud(1981)], Jacobsen [Jacobsen(2006)]). Let us denote a sequence of iid rv (rewards) with the finite expected value. The random variable , where is the renewal process, is called a renewal-reward process. On the turn, a renewal process is a pure jump process with a general distribution of the holding times. To describe this type of processes, let us consider a sequence of positive iid rv with the finite and positive expectation. The renewal process , where for each . The mentioned changes of distributions are assumed to appear on the sequences and at some random moments and , respectively. The general detectors of disorders take values at . In this approach the class of decision function will be restricted to the event (the jump) moments. This means that the moments of the decision can be identified with the index jumping moments in the process. The underlined models can be represented as the sequences of random vectors . The r.v. and do not need to be necessarily independent in contrast to the classical theory of the renewal–reward processes. The aim of the research is to formulate the rigorous model of the problem and to investigate them.
1.2. A motivation
The motivation for the project is the wide literature related to the optimal stopping problem and the scarce results for the multiple stopping settings. The related questions for the change point models are also stimulating. There is satisfactory literature describing the state of art in the disorder problem or the change-point problems in the off-line and on–line setting. Let us mention the monograph by Brodsky and Darkhovsky [Brodsky and Darkhovsky(1993)] and the story by Shiryaev [Shiryaev(2006)].
The disorder problem has a long history (see Shiryaev [Shiryaev(2006)], Proceedings of AMS-IMS-SIAM Summer Research Conference on ”Change point problems” E. Carlstein et al. (ed) [Carlstein et al.(1994)Carlstein, Múller, and Siegmund]). It is known from the early papers by Page [Page(1954), Page(1955)], Girshick and Rubin [Girshick and Rubin(1952)]. However, it was A.N. Kolmogorow who had formulated the realistic and mathematically precise model of rapid detection of a change point or disorder. His student, A.N. Shiryaev [Shiryaev(1961), Shiryaev(1963)] (see the history described in [Shiryaev(2006)]) published the first important results for the sequential problems of disorder detection in the Poisson distribution. This direction of research is the subject of intensive work of contemporary projects. Let us mention papers by Gal’chuk and Rozowski [Gal′čuk and Rozovskiĭ(1971)], Davis [Davis(1974)], Peskir and Shiryaev [Peskir and Shiryaev(2002)] (see also Gapeev [Gapeev(2005)], Bayraktar, Dayanik and Karatzas [Bayraktar and Dayanik(2006)]. The manifold experiment to formulate the adequate model of disorder for more complex processes which appear in the observed phenomena of nature and economy, showed extreme difficulties (see Fuh [Fuh(2004)], Ivanoff and Merzbach [Ivanoff and Merzbach(2010)], Szajowski [Szajowski(2011)]).
The work continues to carry out research described in the papers published by Sarnowski and the author [Sarnowski and Szajowski(2008)], [Sarnowski and Szajowski(2011)] on the change point problem for the undefined, the Markovian type processes before and after the disorder. Related results can be found in the papers by Bojdecki i Hosza [Bojdecki and Hosza(1984)], Szajowski [Szajowski(1992)], Yoshida [Yoshida(1983)], Yakir [Yakir(1994)] and Moustakides [Moustakides(1998)]. The most important guidelines for considering the results of this work are contained in the authors’ works [Szajowski(2011)] on multivariate disorders detection.
The key technique used in the work is based on a multiple optimal stopping a Markov process. The fundamental knowledge on the optimal stopping of random processes can be found in the monographies by Chow, Robbins and Siegmund [Chow et al.(1971)Chow, Robbins, and Siegmund], [Shiryaev(2008)] or Peskir and Shiryaev [Peskir and Shiryaev(2006)]. The first work devoted to the multiple stopping of the discrete time sequences processes was published by Haggstroma [Haggstrom(1967)] and for the Markov processes Nikolaev [Nikolaev(1981)] (see also Eidukjavicjus [Eidukjavicjus(1979)], Nikolaev[Nikolaev(1998)]). The extension of the model to semi-Markov processes has been made by Stadje [Stadje(1985)]. Recent results for continuous time processes have been presented by Kobylanski, Quenez and Rouy-Mironescu [Kobylanski et al.(2010)Kobylanski, Quenez, and Rouy-Mironescu]) and the various applications of such an approach are described in papers by Feng and Xiao [Feng and Xiao(2000b)], [Feng and Xiao(2000a)], Karpowicz and Szajowski [Karpowicz and Szajowski(2007a), Karpowicz and Szajowski(2012)]. For the risk process such extension can be found in papers by Karpowicz and Szajowski [Karpowicz and Szajowski(2007b)]. However, in the approach applied here there is no need to refer to the results for the semi-Markov processes. The analytical difficulties for such problems open various questions concerning the construction of algorithms, approximate solutions and Monte Carlo methods.
1.3. Variation of the basic problems
A general formulation starts from a random sequences and having segments that are the homogeneous Markov sequences. Each segment has its own transition probability law and the length of the segment is unknown and random. It means that there are random moments and , at which the source of observations is changed. The transition probabilities of each process at the segment are chosen from a given set of distributions. An a priori distributions of the disorder moments are given. The problem is to construct the detection algorithm of the disorders. The algorithm should detect the change points with the given precision and maximal probability.
To be more precise, let us see examples. It is easy to ascertain that in the renewal–reward process the distribution of holding time can be first changed in the moment , and next, in the moment the distribution of the random variables changes. One can formulate three such problems:
-
•
it is known that , ie holding time has the distribution change moment earlier than the rewards distribution;
-
•
the order of change points is different: the first the reward sequences are disordered;
-
•
there is no information which sequences will change distribution as the first.
In Bayesian model the last case needs additional a priori information about the chance that the disorder of rewards will appear before the holding time disorder.
If the process is a model of signals from the distributed sources then the disorders in different sources are combined to construct the final decision about the reason of non-homogeneous behavior of the process. Based on the idea of a simple game the model of the fusion center is proposed. The strategy of detection at each segment (source) of the process is defined as the equilibrium in a non-cooperative game between the selfish sensors (see [Szajowski(2011)]).
For the single process having structure similar to those of the generalized compound Poisson processes the temporal disorder is possible. It is a natural problem, mentioned by researchers (see [Bayraktar and Poor(2007)], [Yoshida(1983)] and others). When the model of the process is equivalent to the sequence of a vector of random variables, it is possible that each coordinate changes their distribution in some moments which could be different at each coordinate.
Our environment is described by the states of nature and they are viable over time. There are no objective boundaries in the state space defining safe or unsafe world. These borders are marked by a subjective knowledge. After the appointment of the task of maintaining a safe environment the aim is to observe and monitor the states in time to know their relation to the boundaries. Methodology presented in the paper allows to prepare the environment description and the detectors of the borders of the areas. For the illustration of a description it was selected variant environment in which states form the renewal–reward process. Areas of the state space, acceptable or not, describe the observed distributions of the process. Acceptable states of the system have a distribution, and unacceptable otherwise. Due to the multi-dimensional character of the process the specified areas are determined by the boundaries of the components of the state vector. The model is analysed and the boundaries are determine for the areas of warning states. To determine the boundaries of the detector indicates that to have the correct detection it is necessary a priori knowledge of the possible scenarios achieve a state of emergency. Efficient detection of threats is only possible if the nature implements provided by our scenario. Strategy to little knowledge about the possible scenario is far less effective and it is much more cumbersome to implement.
2. Random switching between Markov processes
Let us formulate the general detection for at most two change points problem or switching detection problem. et us consider an observable sequence of random variables with the homogeneous structure on the time intervals: , and . The parameters , are pairs of random variables with values in having distributions:
(1) | |||||
(2) |
where , , , .
In segments the distribution depends additionally on the parameter with values from the finite set . It is assumed that , and .
In the sequel if there is only one possible process in the segment then the second index will be abandoned.
Additionally, there are Markov processes , , , where -fields are the smallest -fields for which are adapted, respectively. The process is connected with the random variables , , and the Markov processes as follows:
The observable sequence of rv is defined on the space with values in Borel subset , with -additive measure . The measures on , , have the following representation:
for any , where are different and for , and all .
2.1. Finite dimensional distribution of process
For any , where , and any define
Let be the set of all stopping times with respect to , and .
2.2. Criteria of change–point detection
The aim of the DM is to indicate the moments of switching with given precision (Problem ). We want to determine a pair of stopping times such that for every
(4) | |||
The problem with the fixed transition distributions at each segment has been formulated in [Szajowski(1996)] and has been extended to the case in [Szajowski(2011)]. The investigated models here assume that between and , the distribution is chosen from a given set (for simplification having two elements only). The distribution is predetermined in two models and chosen randomly in one model.
Let us introduce the following notation:
where the convention for is used.
Let , and let us assume that and denote . For , we have by properties of the density function with respect to the measure
where , . Let us now define functions and and the sequence of functions as follows: and for :
Aditionally, we have
(6) |
For further calculation it is important to have -dimensional distribution for various configurations of disorders.
(9) | |||||
(10) |
Denote and .We have (cf. [Sarnowski and Szajowski(2008)], [Dube and Mazumdar(2001)])
Lemma 2.1.
For the function follows recursion
(11) | ||||
where for , and | ||||
(12) | ||||
Here and . One can formulate: . Let us assume and suppose that , and and denote . For , we have by properties of the density function with respect to the measure
2.3. -dimensional vs. -dimensional distributions
It is not too difficult to get the recursive formula for .
(17) | ||||
(18) | ||||
Lemma 2.2.
For the density function follows recursion
(19) | ||||
where | ||||
(20) |
Now we split the conditional probability of into the following parts
Let us calculate: . This probability can be calculated as follows:
3. On some a posteriori processes
For (4) the following a posteriori processes are crucial (cf. [Yoshida(1983)], [Szajowski(1992)]).
(21) | |||||
(22) | |||||
(23) | |||||
(24) | |||||
(25) | |||||
(26) | |||||
(27) |
for , , . Also .
3.1. Recursive form of posteriors
A posteriori processes: , , , can be calculated based on the following formula:
(28) | |||||
(29) | |||||
(30) | |||||
(31) |
for , , .
3.2. Markov processes based on observations
Let us definie in the recursive way.
(32) | |||||
For recursive representation of (28)–(3.2) we need the following functions:
The function and will be used in representations of the posteriors.
There is . In the sequel we adopt the following denotations: and .
4. The disorder problem vs. the stopping problem
The basic formulae used in the transformation of the disorder problems to the stopping problems are given in the following
Lemma 4.1.
For each and each Borel function the following formulae for , , , hold:
(34) | |||||
(35) | |||||
(36) |
with the boundary condition , , .
Lemma 4.2.
For the problem 4 the following formulae are valied:
-
(1)
;
-
(2)
;
-
(3)
;
-
(4)
;
-
(5)
.
Lemma 4.3.
For each and each Borel function the following equations are fulfilled.
(38) | |||||
(40) | |||||
(41) |
5. An equivalent issue – the double optimal stopping problem
A compound stopping variable is a pair of stopping times such that a.e.. Denote , and (, , ).
We define two-parameter stochastic sequence , where .
5.1. The optimal compound stopping variable
For every , , let us define the optimal stopping problem of on
.
A compound stopping variable is said to be optimal in if
(or in if ).
Let us define
(42) |
If we put , then
From the theory of optimal stopping for the double indexed processes (cf. [Haggstrom(1967)], [Nikolaev(1981)]) the sequence satisfies
If , then is optimal in and a.e.. Define for if , then is optimal in and a.e..
Lemma 5.1.
Stopping time is optimal for every stopping problem (42).
What is left is to consider the optimal stopping problem for on . For further consideration denote . The first stop payoff will be determined. Let us define:
(43) |
Then and we define .
Lemma 5.2 (5.2).
The strategy is the optimal strategy of the first stop.
5.2. Solution of the equivalent double stopping problem
For this presentation the case is considered. Let us construct multidimensional Markov chains such that and will be the functions of their states. By considering the a posteriori processes we get and for
(44) |
The vector for is a function of and . Besides the conditional distribution of given depends on , , and only.
These facts imply that form a homogeneous Markov process. This allows us to reduce the basic problem (42) for each to the optimal stopping problem of the Markov process with the reward function .
Lemma 5.3.
A solution of the optimal stopping problem (42) for has a form
where and the function satisfies the equation . The value of the problem is equal
where .
Based on the results of Lemma 5.3 and properties of the a posteriori process we have the optimal second moment
By lemmas 5.3 and 4.1 (formula (36)) the optimal stopping problem (43) has been transformed to the optimal stopping problem for the homogeneous Markov process with the reward function
Theorem 5.4.
The function , where ,
satisfies the equation
The value of the problem .
References
- [Bayraktar and Dayanik(2006)] E. Bayraktar and S. Dayanik. Poisson disorder problem with exponential penalty for delay. Math. Oper. Res., 31(2):217–233, 2006. 10.1287/moor.1060.0190.
- [Bayraktar and Poor(2007)] E. Bayraktar and H. Poor. Quickest detection of a minimum of two Poisson disorder times. SIAM J. Control Optim., 46(1):308–331, 2007. 10.1137/050630933.
- [Bojdecki and Hosza(1984)] T. Bojdecki and J. Hosza. On a generalized disorder problem. Stochastic Processes Appl., 18:349–359, 1984.
- [Brémaud(1981)] P. Brémaud. Point processes and queues. Springer-Verlag, New York-Berlin, 1981. ISBN 0-387-90536-7. Martingale dynamics, Springer Series in Statistics.
- [Brodsky and Darkhovsky(1993)] B. Brodsky and B. Darkhovsky. Nonparametric Methods in Change-Point Problems. Mathematics and its Applications (Dordrecht). 243. Dordrecht: Kluwer Academic Publishers. 224 p., Dordrecht, 1993.
- [Carlstein et al.(1994)Carlstein, Múller, and Siegmund] E. Carlstein, H. Múller, and D. Siegmund, editors. Change-point Problems, volume 23 of IMS Lecture Notes-Monograph Series, pages 78–92. Institute of Mathematical Statistics, Hayward, California, 1994.
- [Chow et al.(1971)Chow, Robbins, and Siegmund] Y. Chow, H. Robbins, and D. Siegmund. Great expectations: The theory of optimal stopping. Houghton Mifflin Company, Boston MA, USA, 1971.
- [Davis(1974)] M. Davis. A note on the Poisson disorder problem. In Proc. Int. Conf. on Control Theory, Zakopane, Poland, volume 1 of Banach Center Publications, pages 65–72, Warszawa, 1974. PWN.
- [Dube and Mazumdar(2001)] P. Dube and R. Mazumdar. A framework for quickest detection of traffic anomalies in networks. Technical report, Electrical and Computer Engineering, Purdue University, November 2001. citeseer.ist.psu.edu/506551.html.
- [Eidukjavicjus(1979)] R. Eidukjavicjus. Optimalna ostanovka markovskoj cepi dvumia momentami ostanovki. Lit. Mat. Sb., 13:181–183, 1979.
- [Feng and Xiao(2000a)] Y. Feng and B. Xiao. Optimal policies of yield management with multiple predetermined prices. Oper. Res., 48(2):332–343, 2000a.
- [Feng and Xiao(2000b)] Y. Feng and B. Xiao. A continuous-time yield management model with multiple prices and reversible price changes. Manage. Sci., 46(5):644–657, 2000b. ISSN 0025-1909.
- [Fuh(2004)] C.-D. Fuh. Asymptotic operating characteristics of an optimal change point detection in hidden Markov models. Ann. Stat., 32(5):2305–2339, 2004. 10.1214/009053604000000580.
- [Gal′čuk and Rozovskiĭ(1971)] L. I. Gal′čuk and B. L. Rozovskiĭ. The problem of “disorder” for a Poisson process. Teor. Verojatnost. i Primenen., 16:729–734, 1971. ISSN 0040-361x.
- [Gapeev(2005)] P. V. Gapeev. The disorder problem for compound Poisson processes with exponential jumps. Ann. Appl. Probab., 15(1A):487–499, 2005. ISSN 1050-5164. 10.1214/105051604000000981.
- [Girshick and Rubin(1952)] M. Girshick and H. Rubin. A Bayes approach to a quality control model. Ann. Math. Stat., 23:114–125, 1952.
- [Haggstrom(1967)] G. Haggstrom. Optimal sequential procedures when more then one stop is required. Ann. Math. Statist., 38:1618–1626, 1967.
- [Ivanoff and Merzbach(2010)] B. Ivanoff and E. Merzbach. Optimal detection of a change-set in a spatial Poisson process. Ann. Appl. Probab., 20(2):640–659, 2010. 10.1214/09-AAP629.
- [Jacobsen(2006)] M. Jacobsen. Point process theory and applications. Marked point and picewise deterministic processes., volume 7 of Probability and Its Applications. Birkhäuser, Boston, 2006. ISBN 978-0-8176-4215-0; 0-8176-4215-3.
- [Karpowicz and Szajowski(2007a)] A. Karpowicz and K. Szajowski. Double optimal stopping times and dynamic pricing problem: description of the mathematical model. Math. Methods Oper. Res., 66(2):235–253, 2007a. ISSN 1432-2994. 10.1007/s00186-006-0132-y.
- [Karpowicz and Szajowski(2007b)] A. Karpowicz and K. Szajowski. Double optimal stopping of a risk process. GSSR Stochastics: An International Journal of Probability and Stochastic Processes, 79(1-2):155–167, 2007b. 10.1080/17442500601084204.
- [Karpowicz and Szajowski(2012)] A. Karpowicz and K. Szajowski. Anglers’ fishing problem. In Advances in dynamic games, volume 12 of Ann. Internat. Soc. Dynam. Games, pages 327–349. Birkhäuser/Springer, New York, 2012.
- [Kobylanski et al.(2010)Kobylanski, Quenez, and Rouy-Mironescu] M. Kobylanski, M.-C. Quenez, and E. Rouy-Mironescu. Optimal double stopping time problem. C. R., Math., Acad. Sci. Paris, 348(1-2):65–69, 2010. 10.1016/j.crma.2009.11.020.
- [Moustakides(1998)] G. Moustakides. Quickest detection of abrupt changes for a class of random processes. IEEE Trans. Inf. Theory, 44(5):1965–1968, 1998.
- [Nikolaev(1981)] M. Nikolaev. O kriterii optimal′nosti obobshchennoĭ posledovatel′noĭ procedury. Litovskiĭ Matematicheskiĭ Sbornik, 21:75–82, 1981.
- [Nikolaev(1998)] M. Nikolaev. Optimal multi-stopping rules. Obozr. Prikl. Prom. Mat., 5(2):309–348, 1998.
- [Page(1954)] E. Page. Continuous inspection schemes. Biometrika, 41:100–115, 1954.
- [Page(1955)] E. Page. A test for a change in a parameter occurring at an unknown point. Biometrika, 42:523–527, 1955.
- [Peskir and Shiryaev(2006)] G. Peskir and A. Shiryaev. Optimal stopping and free-boundary problems. Lectures in Mathematics, ETH Zürich. Birkhäuser, Basel, 2006.
- [Peskir and Shiryaev(2002)] G. Peskir and A. N. Shiryaev. Solving the Poisson disorder problem. In P. Schönbucher and K. Sandmann, editors, Advances in finance and stochastics. Essays in honour of Dieter Sondermann, pages 295–312. Springer-Verlag, Heidelberg, 2002.
- [Sarnowski and Szajowski(2008)] W. Sarnowski and K. Szajowski. On-line detection of a part of a sequence with unspecified distribution. Stat. Probab. Lett., 78(15):2511–2516, 2008. doi:10.1016/j.spl.2008.02.040.
- [Sarnowski and Szajowski(2011)] W. Sarnowski and K. Szajowski. Optimal detection of transition probability change in random sequence. Stochastics, 83(4-6):569–581, 2011. 10.1080/17442508.2010.540015.
- [Shiryaev(1961)] A. Shiryaev. The problem of the most rapid detection of a disturbance in a stationary process. Sov. Math., Dokl., 2:795–799, 1961. translation from Dokl. Akad. Nauk SSSR 138, 1039-1042 (1961).
- [Shiryaev(1963)] A. Shiryaev. On optimal methods in the quickest detection problems. Teor. Veroyatn. Primen., 8(1):26–51, 1963. translation from Teor. Veroyatn. Primen. 8, 26-51 (1963).
- [Shiryaev(2006)] A. Shiryaev. From “disorder” to nonlinear filtering and martingale theory. In A. Bolibruch, Y. Osipov, Y. Sinai, V. I. Arnold, L. Faddeev, Y. I. Manin, V. Philippov, V. Tikhomirov, and A. M. Vershik, editors, Mathematical events of the twentieth century, pages 371–397. Springer, Berlin, 2006. Transl. from the Russian, publ. PHASIS Moscow.
- [Shiryaev(2008)] A. N. Shiryaev. Optimal stopping rules, volume 8 of Stochastic Modelling and Applied Probability. Springer-Verlag, Berlin, 2008. ISBN 978-3-540-74010-0. Translated from the 1976 Russian second edition by A. B. Aries, Reprint of the 1978 translation.
- [Stadje(1985)] W. Stadje. On multiple stopping rules. Optimization, 16:401–418, 1985.
- [Szajowski(1992)] K. Szajowski. Optimal on-line detection of outside observation. J.Stat. Planning and Inference, 30:413–426, 1992.
- [Szajowski(1996)] K. Szajowski. A two-disorder detection problem. Appl. Math., 24(2):231–241, 1996.
- [Szajowski(2011)] K. Szajowski. Multi-variate quickest detection of significant change process. In J. S. Baras, J. Katz, and E. Altman, editors, Decision and Game Theory for Security. Second international conference, GameSec 2011, College Park, MD, Maryland, USA, November 14–15, 2011, volume 7037 of Lecture Notes in Computer Science, pages 56–66, Berlin, 2011. Springer. 10.1007/978-3-642-25280-8_7.
- [Szajowski(2011)] K. Szajowski. On a random number of disorders. Probab. Math. Stat., 31(1):17–45, 2011. ISSN 0208-4147.
- [Yakir(1994)] B. Yakir. Optimal detection of a change in distribution when the observations form a Markov chain with a finite state space. In E. Carlstein, H.-G. Müller, and D. Siegmund, editors, Change-point Problems. Papers from the AMS-IMS-SIAM Summer Research Conference held at Mt. Holyoke College, South Hadley, MA, USA, July 11–16, 1992, volume 23 of IMS Lecture Notes-Monograph Series, pages 346–358, Hayward, California, 1994. Institute of Mathematical Statistics.
- [Yoshida(1983)] M. Yoshida. Probability maximizing approach for a quickest detection problem with complicated Markov chain. J. Inform. Optimization Sci., 4:127–145, 1983.
Received 31st of March 2019; revised 31st of January 2020.