Block Markov Chains on Trees

Abdessatar Souissi

Abstract

We introduce block Markov chains (BMCs) indexed by an infinite rooted tree. It turns out that BMCs define a new class of tree-indexed Markovian processes. We clarify the structure of BMCs in connection with Markov chains (MCs) and Markov random fields (MRFs). Mainly, show that probability measures which are BMCs for every root are indeed Markov chains (MCs) and yet they form a strict subclass of Markov random fields (MRFs) on the considered tree. Conversely, a class of MCs which are BMCs is characterized. Furthermore, we establish that in the one-dimensional case the class of BMCs coincides with MCs. However, a slight perturbation of the one-dimensional lattice leads to us to an example of BMCs which are not MCs appear.

1. Introduction

Markov random fields (MRFs) on lattice have become standard tools in several branches of science and technology including computer science, machine learning, graphical models, statistical physics. Namely, MRFs are known to provide pertinent models for interacting particles systems in statistical mechanics.
We notice that MRFs were introduced by Dobrushin in [?] for the multi-dimensional integer lattice, and developed then on trees [?], [?], [?]. QMFs consist multi-dimensional extensions of Markov chains [?] but with a deeper Markovian structure. In, fact even in the one dimensional case MRFs were shown to be distinct from MCs [?].

MRFs play a crucial role in many areas such as computer science, image recognition, graphical models, psychology and in an increasing number of biological and neurological models. The reader is referred to [?], [?], [?] and the references cited therein for further applications.
In the present paper we introduce the notion of block Markov chains indexed by the vertex-set of a rooted tree $T=(V,E)$ . The definition of this notion is quite natural. Since in the one-dimensional case $V=\mathbf{N}_{0}$ with distinguished vertex (root) $"o=0"$ , a Markov chain $(Z_{n})_{n\in\mathbf{N}}$ with (finite) state space $\Xi$ is defined through the well known Markov property

\mathbf{P}[Z_{n+1}=\xi_{n+1}\,\mid\,Z_{n}=\xi_{n},\cdots,Z_{0}=\xi_{0}]=\mathbf{P}[Z_{n+1}=\xi_{n+1}\,\mid\,Z_{n}=\xi_{n}].

The above property can be reformulated by means of the joined probability measure $\mu$ on $\Xi^{V}$ of the process $(Z_{u})_{u\in V}$ as follows

\mu[\xi(.)\,\,\hbox{on}\,\,S(x)\,\mid\,\xi(.)\,\,\hbox{on}\,\,V\setminus T^{{}^{\prime}}(x))]=\mu[\xi(.)\,\,\hbox{on}\,\,S_{(}x)\,\mid\,\,\xi(x)]

(1.1)

where $S(x)=\{x+1\}$ is the set of successors of the site $x\in V$ and $T^{\prime}(x)=\{x+1,x+2,\cdots\}$ it the set of successive descendants of the vertex $x$ w.r.t. the considered root $o$ . We emphasize a suitable natural generalization of the sets $S(x)$ and $T(x)$ for general trees. Roughly speaking, a BMC is a probability measure on $\Omega:=\Xi^{V}$ satisfying the Markov property (1.1) for a fixed root.
The main purpose of this paper is to clarify the structure of BMCs in connection with MCs and MRFs. Mainly, we show that a probability measure which is BMC for every root $o\in V$ is a MC in the sense of [?]. The correlation functions of BMCs are different from those of MCs and MRFs. Consequently, their Markov structure are also different. Namely, it turns out that some conditional independence conditions are necessary on a MC on the considered tree to be BMC.
On the other hand we show that in the one-dimensional case, the notions of MCs and QMCs coincide. This coincidence Makes BMCs strictly a sub-class of MRFs in the one dimensional case. However, we emphasize that a slight modification of the one-dimensional lattice leads to a counter-examples that confirms the huge difference between MCs and BMCs over multi-dimensional trees.
We notice that the natural hierarchical structure of rooted trees, due to the absence of loops, plays a crucial role in the mere definition of BMCs. Therefore, the results are no longer available on general graphs. We forecast that BMCs will play a crucial role in connection with Gibbs measures on trees and their associated phenomena of phase transitions (see [?], [?] and [?]). Namely, phenomena of phase transitions were associated with interesting p-adic models such as the Potts model and the Ising–Vannimenus model [?], [?]. In fact, a work under preparation is dedicated to the clarification of a bridge between BMCs and some p-adic models.
In [?], [?] we clarified the structure of quantum Markov states on a quasi-local algebra $\mathcal{A}$ trees in terms of classical Markovian measure and Gibbs measures on the spectrum of a maximal abelian subalgebra. We stress that this classical Markovian measure is indeed a BMC. This will makes a new bridge between classical and quantum Markov fields.
Let us mention the outlines of the paper. Section 2. is devoted to some notions and notions on rooted trees. In section 3., we recall the basic definition of MC and MRF on graphs. Section 4. is devoted to definition of BMCs as far as its correlation functions. Section 5. is dedicated to results related to the connection of BMCs with MCs and MRFs on trees. In section 6. we deal with the one-dimensional case for which the vertex set is the classical 1D integer lattice $\mathbf{Z}$ occupied with its natural tree structure. In section 7. we develop a counter-example for a BMC which is not a MC.

2. Rooted trees

Recall that [?] a tree is a connected graph with no cycles, i.e. a connected graph which becomes disconnected when each one of its edges is removed.
Let be given an infinite tree $T=(V,E)$ . First, we fix any vertex $o=x_{0}\in V$ as a ”root”. Recall that two vertices $x$ and $y$ are said to be nearest neighborsand we denote $x\sim y$ if they are joined through an edge (i.e. $<x,y>\in E$ ). A list of the vertices $x\sim x_{1}\sim\dots\sim x_{d-1}\sim y$ is called a path from the site $x$ to the site $y$ . The distance $d(x,y)$ on the tree is the length of the shortest path from $x$ to $y$ .
For $x\in V$ , its direct successors (children) is defined by

S^{o}(x):=\left\{y\in V\,\,:\,\,x\sim y\,\,\hbox{and}\,\,d(y,o)>d(x,o)\right\}

(2.2)

and its $k^{th}$ successors w.r.t. the root $o$ is defined by induction as follows

S_{1}^{o}(x):=S^{o}(x);

S^{o}_{k+1}(x)=S^{o}(S^{o}_{k}(x)),\,\,\forall k\geq 1.

The ”future” w.r.t. the vertex $x$ is defined by:

S^{o}_{[m,n]}(x)=\bigcup_{k=m}^{n}S^{o}_{k}(x);\quad T_{o}(x)=\bigcup_{k\geq 1}S^{o}_{k}(x);\quad T_{o}^{{}^{\prime}}(x)=T(x)\setminus\{x\}.

(2.3)

Note that in the homogeneous case, for which $|S_{o}(x)|=k$ is constant, the graph $T$ is the semi-infinite Cayley tree $\Gamma^{k}_{+}$ of order $k$ . Namely, for $k=1$ , the graph is reduced to the one-dimensional integer lattice $\mathbf{Z}$ .
Consider the map $r$ from $V$ into itself characterized by

r(o)=o,

r(y)=x\quad\hbox{if}\quad y\in S^{o}(x)

Let $x\in V$ . If $n=d(x,o)$ then

o=r^{n}(x)=x_{0}\sim r^{n-1}(x)\sim\cdots\sim r(x)\sim r^{0}(x)=x

(2.4)

is the minimal edge-path joining the root $o$ to the vertex $x$ , where $r^{k}=\underbrace{r\circ\cdots\circ r}_{k\,\,times}$ .
The set

R^{o}(x):=\{r(x),r^{2}(x),\cdots,r^{n}(x)=o\}

(2.5)

represents the ”past” of the vertex $x$ for the root $o$
The set of nearest-neighbors vertices of $x$ is given as follows:

N_{x}=\{y\in V\,\,:\,\,x\sim y\}

(2.6)

It is clear that $N_{x}=\{r(x)\}\cup S^{o}(x)$ .

In the sequel, the tree $T$ is assumed to be locally finite, i.e. $|N_{x}|<\infty$ for each $x\in V$ , in this case the integer $d_{x}:=|N_{x}|$ is called degree of $x$ .
The tree can be regarded as growing (upward) away from its fixed root $o$ . Each vertex $x\in V$ then has branches leading to its ”children”, which are represented here by $Sô(x)$ and $T^{{}^{\prime}}_{o}(x)$ . With the possibility of leaves, that is, vertices $x$ without children i.e. $S(x)=\mathchoice{\mbox{\normalsize\rm\O}}{\mbox{\normalsize\rm\O}}{\mbox{\rm\O}}{\mbox{\rm\O}}$ .

3. Some Reminders on Markov fields

Let $\Xi=\{1,\cdots,q\}$ . By a stochastic process we mean a family of random variables $(Z_{u})_{u\in V}$ defined on a probability space $(\Omega,\mathcal{F},\mathbf{P})$ and valued in a finite set $\Xi:=\{1,2,\cdots,q\}$ . The process $(Z_{u})_{u\in V}$ is defined through its joined probability measure $\mu$ on the Borel space $(\Xi^{V},\mathcal{B})$ where $\mathcal{B}_{V}$ is the cylindrical $\sigma$ -algebra, which is generated by the cylinder sets of the following form

C(a_{x},\;x\in\Lambda)=\left\{\xi\in\Xi^{V}\,\,:\,\,\xi(x)=a_{x},\,\,\forall x\in\Lambda\right\}

(3.7)

where $\Lambda\subset V$ finite and $(a_{x})_{x\in\Lambda}\in\Xi^{|\Lambda|}$ . For the sake of shortness we denote $\Omega$ instead of $\Omega_{V}$ and $\mathcal{B}$ instead of $\mathcal{B}_{V}$ . For $\Lambda\subset V$ , we denote $\Omega_{\Lambda}=\Xi^{\Lambda}$ . Recall that

\mu\left[\xi(.)\,\,\hbox{on}\,\,\Lambda\right]=\mathbf{P}\left[\bigcap_{u\in\Lambda}(Z_{u}=\xi(u))\right]

(3.8)

where $\xi\in\Omega_{\Lambda}$ .
The conditional probability is defined as follows

\mu\left[\xi(.)\,\,\hbox{on}\,\,\Lambda\,\,\mid\,\,\xi(.)\,\,\hbox{on}\,\,\Lambda^{{}^{\prime}}\right]=\frac{\mu\left[\xi(.)\,\,\hbox{on}\,\,\Lambda\cup\Lambda^{{}^{\prime}}\right]}{\mu\left[\xi(.)\,\,\hbox{on}\,\,\Lambda^{{}^{\prime}}\right]}

(3.9)

where $\Lambda,\Lambda^{{}^{\prime}}\subseteq V$ and $\xi\in\Xi^{V}$ such that

\mu\left[\xi(.)\,\,\hbox{on}\,\,\Lambda^{{}^{\prime}}\right]>0.

Denoting

\mathcal{F}_{u}:=\sigma(Z_{u})\,\,;\,\,\mathcal{F}_{\Lambda}=\sigma\left(Z_{u}\,\,;\,\,u\in\Lambda\right)

(3.10)

the $\sigma$ -algebra generated by $Z_{u}$ and $(Z_{v},\,v\in\Lambda),$ respectively.

DEFINITION 1

[?] A probability measure $\mu$ on $(\Omega,\mathcal{B})$ is said to be Markov random field (MRF) if it takes strictly positive values on finite cylinder sets of the form (3.7) and such that for every $\xi\in\Omega$

\mu\left[\xi(u)\,\mid\xi(.)\;\hbox{on}\;V\setminus\{u\}\right]=\mu\left[\xi(u)\,\mid\,\xi(.)\,\hbox{on}\,N_{u}\right].

(3.11)

The set of Markov random fields over $T$ will be denoted by $\mathcal{MF}(T)$ .

The conditional probabilities (3.11) are assumed to be invariant under graph isomorphism.

DEFINITION 2

[?]A probability measure $\mu$ on $(\Omega,\mathcal{B})$ is said to be Markov chain (MC) over the tree $T=(V,E)$ if for each subtree $T^{\prime}=(V^{\prime},E^{\prime})$ the restriction of $\mu$ on the measurable space $(\Omega_{V^{\prime}},\mathcal{B}_{V^{{}^{\prime}}})$ defines a Markov random field. i.e.

\mu[\xi(x)\,\,\mid\,\,\xi(.)\,\,\hbox{on}\,\,V^{\prime}\setminus\{x\}\,]=\mu[\xi(x)\,\,\mid\,\xi(.)\,\,\hbox{on}\,N_{x}\cap V^{{}^{\prime}}\,]

(3.12)

for all $x\in V^{\prime}$ and all $\xi\in\Omega_{V^{{}^{\prime}}}$ . The set of Markov chains over $T$ will be denoted by $\mathcal{MC}(T)$ .

Remark 1

The class $\mathcal{MC}(T)$ is clearly included in $\mathcal{MF}(T)$ . Conversely, in [?] it was proven that if the tail $\sigma$ -field is trivial then the considered Markov field is indeed a MC.

4. Structure of Block Markov chains on trees

In what follows, a root $o$ for the tree $T=(V,E)$ is fixed. For each $n\in\mathbf{N}$ , we denote $\Lambda_{n}:=S_{n}^{o}(o)$ the set of vertices whose distance to the root $o$ equals $n$ . Let $\Lambda_{n]}=S_{[0,n]}^{o}(o)=\bigcup_{k=0}^{n}\Lambda_{k}$ . For the sake of shortness, when confusion seems impossible we will use the notations $S(x),\,T(x),T^{{}^{\prime}}$ and $r(x)$ instead of $S_{o}(x),\,T_{o}(x),T_{o}^{{}^{\prime}}(x)$ and $r_{o}(x)$ , respectively.

Let us set a random enumeration for elements of $\Lambda_{n}$ as follows

\displaystyle{\Lambda_{n}}:=\left(x^{(1)}_{\Lambda_{n}},x^{(2)}_{\Lambda_{n}},\cdots,x^{(|\Lambda_{n}|)}_{\Lambda_{n}}\right)

where $|\Lambda_{n}|$ denotes the cardinality of $\Lambda_{n}$ .

DEFINITION 3

A measure $\mu$ on $(\Omega,\mathcal{B})$ is called o-block Markov chain (o-BMC) if it satisfies

\mu\left[\xi(.)\,\,\hbox{on}\;S(x)\,\,\big{|}\xi(.)\,\,\hbox{on}\;V\setminus T^{{}^{\prime}}(x)\right]=\mu\left[\xi(.)\,\,\hbox{on}\,\,S(x)\;\big{|}\xi(x)\right]

(4.13)

for all $x\in V$ and $\xi\in\Omega$ . The equation (4.13) will be referred as block Markov property. The set of $o$ -block Markov chains over the tree $T$ will be denoted $o-\mathcal{BMC}(T)$ .

In [?] a triplet of $\sigma$ -algebras $(\mathcal{F}_{1},\,\mathcal{F}_{2},\,\mathcal{F}_{3})$ such that

\mathbf{P}(A\,\mid\,\mathcal{F}_{1}\vee\mathcal{F}_{2})=\mathbf{P}(A\,\mid\,\mathcal{F}_{2}),\quad\forall A\in\mathcal{F}_{3}

(4.14)

was referred as Markov triple. In these notations (4.13) means that $(\mathcal{F}_{S(x)},\,\mathcal{F}_{V\setminus T(x)},\,\mathcal{F}_{x})$ is a Markov triple.

Remark 2

The word ”block” in Definition. 3 comes from the conditioning w.r.t. the $\sigma$ -algebra $\mathcal{F}_{V\setminus T(x)}$ rather then the $\sigma$ -algebra $\mathcal{F}_{R(x)}$ , while this latter represents the past of the vertex $x$ w.r.t. the root $o$ .

The following elementary formula for conditional probabilities will be used frequently in the sequel.

\mathbf{P}(A\cap B\mid C)=\mathbf{P}(A\mid B\cap C)\mathbf{P}(B\mid C).

(4.15)

Let $\mu$ is an $o$ -BMC. According to (4.15), we have

	$\displaystyle\mu[\xi(.)\,\hbox{on}\,\Lambda_{n]}]$	$\displaystyle=$	$\displaystyle\mu[\xi(.)\,\hbox{on}\,\Lambda_{n}\,\mid\,\xi(.)\,\hbox{on}\,\Lambda_{n-1]}]\times\mu[\xi(.)\,\hbox{on}\,\Lambda_{n-1]}]$
		$\displaystyle=$	$\displaystyle\mu[\xi(.)\,\hbox{on}\,\Lambda_{0}]\prod_{k=0}^{n-1}\mu[\xi(.)\,\hbox{on}\,\Lambda_{k+1}\,\mid\,\xi(.)\,\hbox{on}\,\Lambda_{k]}].$

For $k=1,\cdots,n-1$ , the same reason as above implies that

\mu[\xi(.)\,\hbox{on}\,\Lambda_{k+1}\,\mid\,\xi(.)\,\hbox{on}\,\Lambda_{k]}]=\prod_{j=1}^{|\Lambda_{k}|}\mu\bigl{[}\xi(.)\,\hbox{on}\,S(x_{\Lambda_{k}}^{(j)})\,\mid\,\xi(.)\,\hbox{on}\,\Lambda_{n-1]}\cup\bigcup_{i=j+1}^{|\Lambda_{k}|}S(x_{\Lambda_{k}}^{(i)})\bigr{]}.

Since $x_{\Lambda_{k}}^{(i)}\in\Lambda_{n-1]}\cup\bigcup_{i=j+1}^{|\Lambda_{k}|}S(x_{\Lambda_{k}}^{(i)})\subset V\setminus T^{{}^{\prime}}(x_{\Lambda_{k}}^{(i)})$ then the block Markov property (4.13) leads to

\mu\bigl{[}\xi(.)\,\hbox{on}\,S(x_{\Lambda_{k}}^{(j)})\,\mid\,\xi(.)\,\hbox{on}\,\Lambda_{n-1]}\cup\bigcup_{i=j+1}^{|\Lambda_{k}|}S(x_{\Lambda_{k}}^{(i)})\bigr{]}=\mu\bigl{[}\xi(.)\,\hbox{on}\,S(x_{\Lambda_{k}}^{(j)})\,\mid\,\xi(x_{\Lambda_{k}}^{(j)})\bigr{]}.

Therefore

\mu[\xi(.)\,\hbox{on}\,\Lambda_{n]}]=\mu[\xi(o)]\prod_{k=0}^{n-1}\prod_{x\in\Lambda_{k}}\mu\bigl{[}\xi(.)\,\hbox{on}\,S(x)\,\mid\,\xi(x)\bigr{]}.

(4.16)

Remark 3

The BMC $\mu$ is characterized by the initial distribution $\mu_{o}$ on $\Omega_{\{o\}}$ together with the family of transition probabilities $\mu\bigl{[}\xi(.)\,\hbox{on}\,S(x)\,\mid\,\xi(x)\bigr{]}$ . The $d\times(d^{|S(x)|})$ ”stochastic” matrices $\Pi_{x,S(x)}=\left(\mu[\xi^{{}^{\prime}}(.)\,\hbox{on}\,S(x)\,\mid\,\xi(x)]\right)_{\xi^{{}^{\prime}}\in\Xi^{S(x)},\xi\in\Xi^{\{x\}}}$ are clearly inhomogeneous. This lets the measure $\mu$ a multi-dimensional markovian process which is inhomogeneous both in space and time.

The following theorem extends the local Markov property (4.13) to a global one, which concerns the conditional independence of the $\sigma$ -algebras $\mathcal{F}_{T(x)}$ and $\mathcal{F}_{V\setminus T^{{}^{\prime}}(x)}$ given $\mathcal{F}_{x}$ .

THEOREM 1

Let $\mu$ be a block Markov chain on $(\Omega,\mathcal{B})$ . Then

\mu\left[\xi(.)\,\hbox{on}\,\,T^{{}^{\prime}}(x)\,\big{|}\,\xi(.)\,\hbox{on}\,\,V\setminus T^{{}^{\prime}}(x)\right]=\mu\left[\xi(.)\,\hbox{on}\,\,T^{{}^{\prime}}(x)\,\big{|}\,\xi(x)\right]

(4.17)

For all $\xi\in\Omega$ and all $x\in V$ .

Proof. If $T^{{}^{\prime}}(x)=\mathchoice{\mbox{\normalsize\rm\O}}{\mbox{\normalsize\rm\O}}{\mbox{\rm\O}}{\mbox{\rm\O}}$ then (4.17) holds true.
We will proceed by induction on $S_{\left[1,n\right]}(x):=\bigcup_{k=1}^{n}S_{k}(x)$ . One has

\mu\left[\xi(.)\,\,\hbox{on}\,\,S_{\left[1,n+1\right]}(x)\,\,\big{|}\,\,\xi(.)\,\,\hbox{on}\,\,V\setminus T^{{}^{\prime}}(x)\right]

=\mu\left[\xi(.)\,\,\hbox{on}\,\,S_{n+1}(x)\,\,\big{|}\,\,\xi(.)\,\,\hbox{on}\,\,S_{\left[1,n\right]}(x)\cup V\setminus T^{{}^{\prime}}(x)\right]

\times\mu\left[\xi(.)\,\,\hbox{on}\,\,S_{\left[1,n\right]}(x)\,\,\big{|}\,\,\xi(.)\,\,\hbox{on}\,\,V\setminus T^{{}^{\prime}}(x)\right].

Denoting $S_{n}(x)=\{x_{1}^{(n)},\cdots,x^{(n)}_{|S_{n}(x)|}\}$ , one has

\mu\left[\xi(.)\,\,\hbox{on}\,\,S_{n+1}(x)\,\,\big{|}\,\,\xi(.)\,\,\hbox{on}\,\,S_{\left[1,n\right]}(x)\cup V\setminus T^{{}^{\prime}}(x)\right]

=\prod_{i=1}^{|S_{n}(x)|}\mu\left[\xi(.)\,\,\hbox{on}\,\,S(x_{i}^{(n)})\,\,\big{|}\,\,\xi(.)\,\,\hbox{on}\,\,(\bigcup_{k=i+1}^{n}S(x_{k}^{(n)}))\cup S_{\left[1,n\right]}(x)\cup V\setminus T^{{}^{\prime}}(x)\right].

From (4.13), one gets

\mu\left[\xi(.)\,\,\hbox{on}\,\,S(x_{i}^{(n)})\,\,\big{|}\,\,\xi(.)\,\,\hbox{on}\,\,(\bigcup_{k=i+1}^{n}S(x_{k}^{(n)}))\cup S_{\left[1,n\right]}(x)\cup V\setminus T^{{}^{\prime}}(x)\right]

=\mu\left[\xi(.)\,\,\hbox{on}\,\,S(x_{i}^{(n)})\,\,\big{|}\,\,\xi(x_{i}^{(n)})\right].

Thus

\mu\left[\xi(.)\,\,\hbox{on}\,\,S_{n+1}(x)\,\,\big{|}\,\,\xi(.)\,\,\hbox{on}\,\,S_{\left[1,n\right]}(x)\cup V\setminus T^{{}^{\prime}}(x)\right]

=\prod_{i=1}^{|S^{(n)}(x)|}\mu\left[\xi(.)\,\,\hbox{on}\,\,S(x_{i}^{(n)})\,\,\big{|}\,\,\xi(x_{i}^{(n)})\right].

Using the same argument as above , we obtain

\mu\left[\xi(.)\,\,\hbox{on}\,\,S_{n+1}(x)\,\,\big{|}\,\,\xi(.)\,\,\hbox{on}\,\,S_{\left[1,n\right]}(x)\right]

(4.18)

=\prod_{i=1}^{|S_{n}(x)|}\mu\left[\xi(.)\,\,\hbox{on}\,\,S(x_{i}^{(n)})\,\,\big{|}\,\,\xi(x_{i}^{(n)})\right].

On the other hand, the induction’s hypthesis leads to

\mu\left[\xi(.)\,\,\hbox{on}\,\,S_{[1,n]}(x)\,\,\big{|}\,\,\xi(.)\,\,\hbox{on}\,\,V\setminus T^{{}^{\prime}}(x)\right]=\mu\left[\xi(.)\,\,\hbox{on}\,\,S_{[1,n]}(x)\,\,\big{|}\,\,\xi(x)\right].

Therefore

\mu\left[\xi(.)\,\,\hbox{on}\,\,S_{[1,n+1]}(x)\,\,\big{|}\,\,\xi(.)\,\,\hbox{on}\,\,V\setminus T^{{}^{\prime}}(x)\right]

=\mu\left[\xi(.)\,\,\hbox{on}\,\,S_{n+1}(x)\,\,\big{|}\,\,\xi(.)\,\,\hbox{on}\,\,S_{\left[1,n\right]}(x)\right]\times\mu\left[\xi(.)\,\,\hbox{on}\,\,S_{[1,n]}(x)\,\,\big{|}\,\,\xi(x)\right]

=\mu\left[\xi(.)\,\,\hbox{on}\,\,S_{[1,n+1]}(x)\,\,\big{|}\,\,\xi(x)\right].

Finally, one finds

			$\displaystyle\mu\left[\xi(.)\,\,\hbox{on}\,\,T^{{}^{\prime}}(x)\,\,\big{\|}\,\,\xi(.)\,\,\hbox{on}\,\,V\setminus T^{{}^{\prime}}(x)\right]$
		$\displaystyle=$	$\displaystyle\lim_{n\to[1,\infty)}\mu\left[\xi(.)\,\,\hbox{on}\,\,S_{[1,n+1]}(x)\,\,\big{\|}\,\,\xi(.)\,\,\hbox{on}\,\,V\setminus T^{{}^{\prime}}(x)\right];$
		$\displaystyle=$	$\displaystyle\lim_{n\to[1,\infty)}\mu\left[\xi(.)\,\,\hbox{on}\,\,S_{[1,n+1]}(x)\,\,\big{\|}\,\,\xi(x)\right];$
		$\displaystyle=$	$\displaystyle\mu\left[\xi(.)\,\,\hbox{on}\,\,T^{{}^{\prime}}(x)\,\,\big{\|}\,\,\xi(x)\right].$

COROLLARY 1

In the notations of Theorem 1, if $\Lambda\subseteq T^{{}^{\prime}}(x)$ then

\mu\left[\xi(.)\,\hbox{on}\,\,\Lambda\,\,\big{|}\,\xi(.)\,\hbox{on}\,\,V\setminus T^{{}^{\prime}}(x)\right]=\mu\left[\xi(.)\,\hbox{on}\,\,\Lambda\,\big{|}\,\xi(x)\right]

(4.19)

for all $\xi\in\Omega$ .

Proof. From Theorem 1, for each $\xi^{\prime}\in\Omega_{T^{{}^{\prime}}(x)\setminus\Lambda}$

\mu\left[\xi(.)\,\,\hbox{on}\,\,\Lambda,\,\,\xi^{{}^{\prime}}(.)\,\,\hbox{on}\,\,T^{{}^{\prime}}(x)\setminus\Lambda\,\,\big{|}\,\,\xi(.)\,\hbox{on}\,\,V\setminus T^{{}^{\prime}}(x)\right]

=\mu\left[\xi(.)\,\,\hbox{on}\,\,\Lambda,\,\,\xi^{{}^{\prime}}(.)\,\,\hbox{on}\,\,T^{{}^{\prime}}(x)\setminus\Lambda\,\,\big{|}\,\,\xi(x)\right].

Summing up on ${\xi^{\prime}\in\Omega_{T^{{}^{\prime}}(x)\setminus\Lambda}}$ , one finds (4.19).
The following result proposes a multi-dimensional analogue of the Chapmann-Kolmogorov equation.

THEOREM 2

Let $\mu$ be a BMC on $(\Omega,\mathcal{B})$ . Then for $x\in V$ and $m,n\in\mathbf{N}$ one has

\mu\left[\xi(.)\,\,\hbox{on}\,\,S_{n+m}(x)\,\,\big{|}\,\,\xi(x)\right]

(4.20)

=\sum_{\xi^{{}^{\prime}}\in\Xi^{S_{n}(x)}}\mu\left[\xi(.)\,\,\hbox{on}\,\,S_{n+m}(x)\,\,\big{|}\xi^{{}^{\prime}}(.)\,\,\hbox{on}\,\,S_{n}(x)\,\,\right]\times\mu\left[\xi^{{}^{\prime}}(.)\,\,\hbox{on}\,\,S_{n}(x)\,\,\big{|}\xi(x)\,\,\right]

for all $\xi\in\Omega$ .

Proof. For each $\xi^{{}^{\prime}}\in\Omega_{S_{n}(x)}$ , using the same reason as in (4.18), we get

\mu\left[\xi(.)\,\,\hbox{on}\,\,S_{n+m}(x)\,\,\big{|}\xi^{{}^{\prime}}(.)\,\,\hbox{on}\,\,S_{n}(x)\,\,\right]

=\mu\left[\xi(.)\,\,\hbox{on}\,\,S_{n+m}(x)\,\,\big{|}\xi^{{}^{\prime}}(.)\,\,\hbox{on}\,\,S_{n}(x)\,,\,\xi(x)\,\right].

Then

\mu\left[\xi(.)\,\,\hbox{on}\,\,S_{n+m}(x)\,\,\big{|}\xi^{{}^{\prime}}(.)\,\,\hbox{on}\,\,S_{n}(x)\,\,\right]\times\mu\left[\xi^{{}^{\prime}}(.)\,\,\hbox{on}\,\,S_{n}(x)\,\,\big{|}\,\,\xi(x)\,\,\right]

=\left[\xi(.)\,\,\hbox{on}\,\,S_{n+m}(x)\,,\,\xi^{{}^{\prime}}(.)\,\,\hbox{on}\,\,S_{n}(x)\,\,\big{|}\,\,\xi(x)\,\,\right].

Summing up, one gets (4.20).

5. Connection with MCs and MRFs

LEMMA 1

Let $x\in V$ . If $\Lambda$ is a subset of $S(x)$ then the subgraph of the tree $T=(V,E)$ , whose set of vertices is $\Lambda\cup(V\setminus T^{{}^{\prime}}(x))$ is itself a tree.

Proof. First, we see that if $y\in T^{{}^{\prime}}(x)$ then $T^{{}^{\prime}}(y)\subseteq T^{{}^{\prime}}(x)$ . This implies that for each $y\in V\setminus T^{{}^{\prime}}(x)$ is connected, the set of its roots $\{r^{k}(y),\,k=0,\cdots\}$ ( defined in (2.4)) is disjoint of the set $T^{{}^{\prime}}(x)$ . Then the path $y\sim r(y)\sim\cdots\sim o$ is in $V\setminus T^{{}^{\prime}}(x)$ . Therefore, the subgraph whose vertex set $V\setminus T^{{}^{\prime}}(x)$ is connected. Since every element of $S(x)$ is joined to $x$ , we conclude that the subgraph $(\Lambda\cup(V\setminus T^{{}^{\prime}}(x)),\sim)$ is connected. Taking into account that the fact that every connected subgraph of a tree is a subtree, the proof is complete.

THEOREM 3

Let $\mu$ be a Markov chain on $\Omega$ . Then for each $x\in V$ the following property holds true.

\mu\left[\xi(.)\,\,\hbox{on}\;S(x)\,\,\big{|}\,\,\xi(.)\,\,\hbox{on}\;V\setminus T^{{}^{\prime}}(x)\right]=\prod_{y\in S(x)}\mu\left[\xi(y)\,\,\mid\,\,\xi(x)\right].

(5.21)

If in addition, the $\sigma$ -algebras $(\mathcal{F}_{y})_{y\in S(x)}$ are conditionally independent given $\mathcal{F}_{x}$ then $\mu$ is an o-BMC.

Proof. First let us write $S(x):=\{y_{1},y_{2},\cdots,y_{|S(x)|}\}$ . According to (4.15), we have

\mu\left[\xi(.)\,\,\hbox{on}\;S(x)\,\,\big{|}\,\,\xi(.)\,\,\hbox{on}\;V\setminus T^{{}^{\prime}}(x)\right]

=\prod_{k=1}^{|S(x)|}\mu\left[\xi(y_{k})\,\,\mid\,\,\xi(.)\,\,\hbox{on}\;(V\setminus T^{{}^{\prime}}(x))\cup\{y_{k+1},\cdots,y_{n}\}\right].

By Lemma 1 the subgraph of $T$ whose set of vertices is $V^{{}^{\prime}}:=(V\setminus T^{{}^{\prime}}(x))\cup\{y_{k+1},\cdots,y_{n}\}$ is a tree. Then

			$\displaystyle\prod_{k=1}^{\|S(x)\|}\mu\left[\xi(y_{k})\,\,\mid\,\,\xi(.)\,\,\hbox{on}\;(V\setminus T^{{}^{\prime}}(x))\cup\{y_{k+1},\cdots,y_{n}\}\right]$
		$\displaystyle=$	$\displaystyle\prod_{k=1}^{\|S(x)\|}\mu\left[\xi(y_{k})\,\,\mid\,\,\xi(.)\,\,\hbox{on}\;(V\setminus\{y_{k}\})\cap V^{{}^{\prime}}\right]$
		$\displaystyle=$	$\displaystyle\prod_{k=1}^{\|S(x)\|}\mu\left[\xi(y_{k})\,\,\mid\,\,\xi(.)\,\,\hbox{on}\;N_{y_{k}}\cap V^{{}^{\prime}}\right].$

where the last equality derives from the fact that $\mu$ is a Markov chain in the sense of Definition. 2. Since $N_{y_{k}}\cap V^{{}^{\prime}}=\{x\}$ , we get (5.21). For the second part of the proof, the conditional independence of $\mathcal{F}_{y}:=\sigma(Z_{y}),y\in S(x)$ leads to

\prod_{k=1}^{|S(x)|}\mu\left[\xi(y)\,\,\mid\,\,\xi(x)\right]=\mu\left[\xi(.)\,\,\hbox{on}\;S(x)\,\mid\,\,\xi(x)\right].

Hence, (5.21) leads to (4.17). Therefore $\mu$ is a o-block Markov chain, for any root $o\in V$ . This achieves the proof.

LEMMA 2

If $\mu$ is an $o$ -BMC on $(\Omega,\mathcal{B})$ and $x\in V$ then

\mu[\xi(x)\,\,\mid\,\,\xi\,\,\hbox{on}\,\Lambda]=\mu[\xi(x)\,\,\mid\,\xi(.)\,on\,\{r(x)\}\cup(T^{{}^{\prime}}(x)\cap\Lambda)]

(5.22)

for all $\Lambda\subseteq V\setminus\{x\}$ containing $r(x)$ .

Proof. Since $x\notin\Lambda$ , then according to (4.17) one gets

$\displaystyle\mu\left[\xi(x)\,\,\big{\|}\,\,\xi(.)\,\,\hbox{on}\,\,\Lambda\right]$	$\displaystyle=$	$\displaystyle\frac{\mu\left[\xi(x)\,;\,\,\xi(.)\,\,\hbox{on}\,\,\Lambda\cap T^{{}^{\prime}}(x)\,;\;\xi(.)\,\,\hbox{on}\,\,\Lambda\setminus T^{{}^{\prime}}(x)\right]}{\mu\left[\xi(.)\,\,\hbox{on}\,\,\Lambda\cap T^{{}^{\prime}}(x)\,;\,\,\xi(.)\,\,\hbox{on}\,\,\Lambda\setminus T^{{}^{\prime}}(x)\right]},$
	$\displaystyle=$	$\displaystyle\frac{\mu\left[\xi(.)\,\,\hbox{on}\,\,\Lambda\cap T^{{}^{\prime}}(x)\,\,\big{\|}\,\,\xi(x)\right]\mu\left[\xi(x)\,\,\big{\|}\,\,\xi(.)\,\,\Lambda\setminus T^{{}^{\prime}}(x)\right]}{\mu\left[\xi(.)\,\,\hbox{on}\,\,\Lambda\cap T^{{}^{\prime}}(x)\,\,\big{\|}\,\,\xi(.)\,\,\hbox{on}\,\,\Lambda\setminus T^{{}^{\prime}}(x)\right]}$
	$\displaystyle=$	$\displaystyle\frac{\mu\left[\xi(.)\,\,\hbox{on}\,\,\Lambda\cap T^{{}^{\prime}}(x)\,\,\big{\|}\,\,\xi(x)\right]\mu\left[\xi(x)\,\,\big{\|}\,\,\xi(r(x))\right]}{\mu\left[\xi(.)\,\,\hbox{on}\,\,\Lambda\cap T^{{}^{\prime}}(x)\,\,\big{\|}\,\,\xi(r(x))\right]}.$

Again from (4.17), we have

\mu\left[\xi(.)\,\,\hbox{on}\,\,\Lambda\cap T^{{}^{\prime}}(x)\,\,\big{|}\,\,\xi(x)\right]=\mu\left[\xi(.)\,\,\hbox{on}\,\,\Lambda\cap T^{{}^{\prime}}(x)\,\,\big{|}\,\,\xi(x)\,\,;\,\,\xi(r(x))\right].

This leads to

	$\displaystyle\mu\left[\xi(x)\,\,\big{\|}\,\,\xi(.)\,\,\hbox{on}\,\,\Lambda\right]$	$\displaystyle=$	$\displaystyle\frac{\mu\left[\xi(x)\,\,;\,\,\xi(.)\,\,\hbox{on}\,\,\Lambda\cap T^{{}^{\prime}}(x)\,\,,\,\,\xi(r(x))\right]}{\mu\left[\xi(.)\,\,\hbox{on}\,\,\Lambda\cap T^{{}^{\prime}}(x)\,\,;\,\,\xi(r(x))\right]}$
		$\displaystyle=$	$\displaystyle\mu\left[\xi(x)\,\,\big{\|}\,\,\xi(.)\,\,\hbox{on}\,\,\{r(x)\}\cup\Lambda\cap T^{{}^{\prime}}(x)\right].$

This completes the proof.

Remark 4

Notice that Definition.2 extends the notion of Markov chain introduced in [?] and [?] into inhomogeneous trees and for inhomogeneous transition probabilities. It was shown [?] that the class of homogenous Markov chain is strictly included in the class of Markov random fields. In the inhomogeneous we have the following

THEOREM 4

Let $\mu$ be a probability measure on $(\Omega,\mathcal{B})$ . If $\mu$ is an $o$ -BMC for each $o\in V$ then it is a MC.

Proof. Consider a subtree $T^{{}^{\prime}}=(V^{{}^{\prime}},E^{{}^{\prime}})$ of $T$ . Let $x\in V^{{}^{\prime}}$ . If $V^{{}^{\prime}}\cap N_{x}=\mathchoice{\mbox{\normalsize\rm\O}}{\mbox{\normalsize\rm\O}}{\mbox{\rm\O}}{\mbox{\rm\O}}$ then $V^{{}^{\prime}}=\{x\}$ and (3.12) is trivial. Otherwise, let us denote $N_{x}\cap V^{\prime}=\{y_{1},y_{2},\cdots,y_{d}\}$ with $d=|N_{x}\cap V^{{}^{\prime}}|$ . Remark that if $o=y\in N_{x}\cap V^{{}^{\prime}}$ then $r_{o}(x)=y$ and $N_{x}\cap V^{{}^{\prime}}\setminus\{y\}\subseteq S(x)\cap V^{{}^{\prime}}\subseteq T^{{}^{\prime}}_{o}(x)\cap V^{{}^{\prime}}$ . As $\mu$ is an $y_{1}$ -BMC, by Lemma 2 we have

\mu[\xi(x)\,\,\mid\,\,\xi(.)\,\hbox{on}\,\,V^{{}^{\prime}}\setminus\{x\}]=\mu[\xi(x)\,\,\mid\,\,\xi(.)\,\hbox{on}\,\,\{y_{1}\}\cup(T_{y_{1}}^{{}^{\prime}}(x)\cap V^{{}^{\prime}})].

Since $\mu$ is an $y_{2}$ -BMC by Lemma 2 , we have

			$\displaystyle\mu[\xi(x)\,\,\mid\,\,\xi(.)\,\hbox{on}\,\,\{y_{1}\}\cup(T_{y_{1}}^{{}^{\prime}}(x)\cap V^{{}^{\prime}})]$
		$\displaystyle=$	$\displaystyle\mu[\xi(x)\,\,\mid\,\,\xi(.)\,\hbox{on}\,\,\{y_{2}\}\cup\bigl{(}\{y_{1}\}\cup T_{x_{1}}^{{}^{\prime}}(x)\cap T_{y_{2}}^{{}^{\prime}}(x)\cap V^{{}^{\prime}}\bigr{)}]$
		$\displaystyle=$	$\displaystyle\mu[\xi(x)\,\,\mid\,\,\xi(.)\,\hbox{on}\,\,\{y_{1},y_{2}\}\cup\bigl{(}T_{y_{1}}^{{}^{\prime}}(x)\cap T_{y_{2}}^{{}^{\prime}}(x)\cap V^{{}^{\prime}}\bigr{)}].$

Iterating this procedure, we get

	$\displaystyle\mu[\xi(x)\,\,\mid\,\,\xi(.)\,\hbox{on}\,\,V^{{}^{\prime}}\setminus\{x\}]$	$\displaystyle=$	$\displaystyle\mu[\xi(x)\,\,\mid\,\,\xi(.)\,\hbox{on}\,\,\{y_{1},y_{2},\cdots,y_{d}\}\cup\bigl{(}\bigcap_{i=1}^{d}T_{y_{i}}^{{}^{\prime}}(x)\cap V^{{}^{\prime}}\bigr{)}]$
		$\displaystyle=$	$\displaystyle\mu[\xi(x)\,\,\mid\,\,\xi(.)\,\hbox{on}\,\,N_{x}\cap V^{{}^{\prime}}]$

because

\bigcap_{i=1}^{d}T_{y_{i}}^{{}^{\prime}}(x)\cap V^{{}^{\prime}}=\bigcap_{y\in N_{x}}T_{y}^{{}^{\prime}}(x)\cap V^{{}^{\prime}}=\mathchoice{\mbox{\normalsize\rm\O}}{\mbox{\normalsize\rm\O}}{\mbox{\rm\O}}{\mbox{\rm\O}}.

(5.23)

Therefore, the measure $\mu$ satisfies (3.12). This finishes the proof, the verification of (5.23) being left to the reader.

COROLLARY 2

\bigcap_{o\in V}o-\mathcal{BMC}(T)\subseteq\mathcal{MC}(T)\subseteq\mathcal{MF}(T).

6. One-dimensional BMC

In this section we consider the one-dimensional lattice $V=\mathbf{Z}$ occupied with its natural structure of tree, where the edge set is $E=\{<k,k+1>,\quad k\in\mathbf{Z}\}$ . Here $\Omega=\Xi^{\mathbf{Z}}$ .

PROPOSITION 1

Let $\mu$ be a probability measure on $(\Omega,\mathcal{B})$ . The following assertions are equivalent:

: (i) $\mu$ is a o-BMC for each $o\in V$ ;
: (ii) $\mu$ is an o’-BMC, for some root $o^{\prime}\in\mathbf{Z}$ ;
: (iii) $\mu$ is a MC.

In particular, a probability measure on $(\Omega,\mathcal{B})$ is markovian for the backward direction if and only if it is for the forward direction.

Proof.
$(i)\Rightarrow(ii)$ straightforward.
$(ii)\Rightarrow(i)$ Let $o^{\prime}\in\mathbf{Z}$ , without loss of generality we can assume that $o<o^{\prime}$ . Observe that if $x\geq max(o,o^{\prime})$ or $x\leq min(o,o^{\prime})$ then $T_{0}^{{}^{\prime}}(x)=T_{o^{\prime}}^{{}^{\prime}}(x)$ . Then (4.13) is also true if we replace $o$ by $o^{\prime}$ .
Let us now examine the case $o<x<o^{\prime}$ then $S^{o^{\prime}}(x)=\{x-1\}$ and $T_{o^{\prime}}^{{}^{\prime}}(x)=(-\infty,x-1]$ . Let $m\in\mathbf{N}$ and $\xi\in\Omega$ .
Applying (4.13) to $y\geq x$ , we get

\mu[\xi(y)\,\,\mid\,\,\xi(y-1),\cdots,\xi(x)]=\mu[\xi(y)\,\,\mid\,\,\xi(y-1)]

because $\{x,\dots,y-1\}\subseteq(-\infty,y-1]=\mathbf{Z}\setminus T_{o}^{{}^{\prime}}(y)$ .
According to (4.15), it follows that

$\displaystyle\mu\left[\xi(x-1)\,\,\mid\,\,\xi(.)\,\,\hbox{on}\,\,[x,x+m]\right]$	$\displaystyle=$	$\displaystyle\frac{\mu[\xi(.)\,\,\hbox{on}\,\,[x-1,x+m]]}{\mu[\xi(.)\,\,\hbox{on}\,\,[x,x+m]]}$
	$\displaystyle=$	$\displaystyle\frac{\mu[\xi(x-1)]\prod_{k=x}^{x+m}\mu[\xi(k)\,\,\mid\,\,\xi(k-1)]}{\mu[\xi(x)]\prod_{k=x+1}^{x+m}\mu[\xi(k)\,\,\mid\,\,\xi(k-1)]}$
	$\displaystyle=$	$\displaystyle\frac{\mu[\xi(x)\,\,\mid\,\,\xi(x-1)]\times\mu[\xi(x-1)]}{\mu[\xi(x)]}$
	$\displaystyle=$	$\displaystyle\mu[\xi(x-1)\,\,\mid\,\,\xi(x)].$

Thus

\mu[\xi(x)\,\,\mid\,\xi(.)\,\,\hbox{on}\,\,\mathbf{Z}\setminus T_{o^{\prime}}^{{}^{\prime}}(x)]=\mu[\xi(x-1)\,\,\mid\,\,\xi(x)]

for all $x\in\mathbf{Z}$ . Hence $\mu$ is a $o^{{}^{\prime}}-BMC$ . $(ii)\Rightarrow(iii)$ Let $x\in\mathbf{Z}$ and $m\in\mathbf{N}$ . Since $\mu$ is $BMC$ then it is a $o-BMC$ for $o=x-1$ and $T^{{}^{\prime}}(x)=[x+1,\infty)$ . By (4.13), it follows that

\mu[\,\xi(x)\,\,\mid\,\,\xi(x-1),\cdots,\xi(x-m)]=\mu[\xi(x)\,\,\mid\,\,\xi(x-1)\,].

Therefore, $\mu$ is a Markov chain.
$(iii)\Rightarrow(i)$ If $\mu$ is a Markov chain then

\mu[\xi(x)\,\,\mid\,\,\xi(.)\,\,\hbox{on}\,\,(-\infty,x-1)].

By taking $x>0$ , this implies that $\mu$ is $0$ -block Markov chain, which completes the proof.

Remark 5

Proposition 1 may be summarized by saying that for each $x\in\mathbf{Z}$ the triple $(\mathcal{F}_{[x+1,\infty)},\mathcal{F}_{x},\mathcal{F}_{(-\infty,x-1]})$ is a Markov triple in the sense of (4.14). Namely, this result is still true by taking $\mathbf{N}$ instead of $\mathbf{Z}$ . However, a slight modification on the one dimensional lattice can provide a counter-example in the multi-dimensional case, in fact we have the following section.

7. Counter-example

Consider the sets $V=\mathbf{N}\times\{0\}\cup\{(0,1),(0,-1)\}\subset\mathbf{Z}^{2}$ and $E=\{\{x,y\}\in V\,;\,|x-y|=1\}$ where $|(a,b)|=|a|+|b|$ . We get then The tree $T=(V,E)$ (see Fig.LABEL:0RTgraph).

Consider a $\{0,1\}$ -valued Markov chain $(X_{n})_{n\geq 0}$ with initial measure $\mu_{0}=\frac{1}{2}(\delta_{0}+\delta_{1})$ and transition matrix $P=\left[\begin{array}[]{cc}1/2&1/2\\ 1&0\\ \end{array}\right]$ .

Define the $\{0,1\}$ -valued stochastic process $(Z_{u})_{u\in V}$ by

Z_{u}=\left\{\begin{array}[]{lll}X_{0},&\hbox{if}&u=(0,-1);\\ X_{1},&\hbox{if }&u=(0,0);\\ X_{2},&\hbox{if}&u=(0,1);\\ X_{n+2},&\hbox{if}&u=(n,0),\,n\geq 1.\end{array}\right.

Let $\Xi=\{0,1\}$ and $\mu$ be the probability measure on $\Xi^{V}$ associated with $(Z_{u})_{u\in V}$ . Let $o=(0,-1)$ and $o^{\prime}=(0,1)$ , it easy to check that $\mu$ is an $o$ -BMC. However, $\mu$ is not a $o^{\prime}$ -BMC. In fact, if $x=(0,0)$ we have $S_{o^{\prime}}(w)=\{(0,-1),(1,0)\},\,r(x)=(0,1)$ and $T_{o^{\prime}}^{{}^{\prime}}(x)=V\setminus\{x,r(x)\}$ . Let $\xi\equiv 0\in\Omega$

\mu[\xi(.)\,\,\hbox{on}\,S_{o^{\prime}}(x)\,\mid\,\xi(.)\,\,\hbox{on}\,V\setminus T^{{}^{\prime}}_{o^{\prime}}(x)]

=\mathbf{P}[Z_{(0,-1)}=0,\,Z_{(1,0)}=0\,\mid\,Z_{(0,0)}=0,\,Z_{(0,1)}=0]=\frac{1}{6}.

On the other hand

\mu[\xi(.)\,\,\hbox{on}\,\,S_{o^{\prime}}(x)\,\,\mid\,\,\xi(x)]=\mathbf{P}[Z_{(0,-1)}=0,Z_{(1,0)}=0\,\,\mid\,\,Z_{(0,0)}=0]=\frac{1}{4}.

This leads to

\mu[\xi(.)\,\,\hbox{on}\,S_{o^{\prime}}(x)\,\mid\,\xi(.)\,\,\hbox{on}\,V-T^{{}^{\prime}}_{o^{\prime}}(x)]\neq\mu[\xi(.)\,\,\hbox{on}S_{o^{\prime}}(x)\,\,\mid\,\,\xi(x)].

Hence $\mu$ is not an $o^{\prime}$ -BMC.
Furthermore, the probability measure $\mu$ is not a MC. In fact, by considering the subtree with vertex set $V_{0}=\{(0,1),(0,0),(1,0)\}$ . We get

\mu[\xi((1,0))\,\mid\,\xi((0,0)),\xi((0,1))]=\frac{1}{2}\neq\frac{3}{4}=\mu[\xi((1,o))\,\mid\,\xi((0,0))].

Bibliography

[1] Mukhamedov F., Souissi A., Quantum Markov States on Cayley trees, J. Math. Anal. Appl.473 (2019) 313-333.
[2] Mukhamedov F., Souissi A., Diagonalizability of quantum Markov States on trees, to appear J. stat. phys. (2020).
[3] H. Huilin, Y. Weiguo, S. Zhiyan The Shannon–McMillan theorem for Markov chains in Markovian environments indexed by homogeneous trees, Communications in Statistics - Theory and Methods 47:21, 5286-5297,(2018)
[4] R.L. Dobrushin, The Description of a Random Field by Means of Conditional Probabilities and Conditions of Its Regularity. Probab. Theory Appl. 13, 201-22 (1968).
[5] A. Spataru, Construction of a Markov Field on an infinite tree, Adv. Math. 81, 105-116 (1990).
[6] F. Spitzer, Markov random fields on an infinite tree , Ann. Probab.3, 387-398 (1987) .
[7] S. Zachary, Countable state space Markov random fields and Markov chains on trees, Ann. Probab. 4, 894-903 (1983).
[8] Georgi H.-O. Gibbs measures and phase transitions, de Gruyter Studies in Mathematics vol. 9, Walter de Gruyter, Berlin, 1988.
[9] N.Chandgotia, G. Han, B. Marcus, T. Meyerovitch, R. Pavlov One-dimensional Markov random fields, Markov chains and topological Markov fields, Am. Math. S 142, 1, 227-242 (2014).
[10] S.L. Lauritzen, Graphical Models, Oxford university press (1996)
[11] R. Lyons, Y. Peres, Probability on Trees and Networks, Combridge university press (2016).
[12] R. A. Minlos, E. A. Pecherskii, S. A. Pirogov, Gibbs Random Fields on a Lattice: Definitions, Existence, Uniqueness, and Phase Transitions, J. Comm. Tech. and Elec., 59, 6, 576-594 (2014).
[13] U. A. Rozikov, Gibbs Mesures on Cayley trees, World scientific (2013)
[14] Mukhamedov, F., Akın, H., Phase transitions for p-adic Potts model on the Cayley tree of order three. Journal of Statistical Mechanics: Theory and Experiment, 2013(07), 30
[15] Mukhamedov, F., Saburov, M., Khakimov, O., On p-adic Ising–Vannimenus model on an arbitrary order Cayley tree. Journal of Statistical Mechanics: Theory and Experiment, 2015(5), 26
[16] J. Cao, K. J. Worsley Applications of Random Fields in Human Brain Mapping, Spatial Statistics: Methodological Aspects and Applications. Lecture Notes in Statistics, vol 159. Springer, New York, NY (2001).
[17] C. Glymour, The Mind’s Arrows: Bayes Nets and Graphical Causal Models in Psychology, The MIT Press (2001).
[18] J. R. Norris, Markov chains, Cambridge Series in Statistical and Probabilistic Mathematics, vol. 2, Cambridge University Press, Cambridge, 1998. Reprint of 1997 original. MR1600720 (99c:60144).

			$\displaystyle\mu\left[\xi(.)\,\,\hbox{on}\,\,T^{{}^{\prime}}(x)\,\,\big{\|}\,\,\xi(.)\,\,\hbox{on}\,\,V\setminus T^{{}^{\prime}}(x)\right]$
		$\displaystyle=$	$\displaystyle\lim_{n\to[1,\infty)}\mu\left[\xi(.)\,\,\hbox{on}\,\,S_{[1,n+1]}(x)\,\,\big{\|}\,\,\xi(.)\,\,\hbox{on}\,\,V\setminus T^{{}^{\prime}}(x)\right];$
		$\displaystyle=$	$\displaystyle\lim_{n\to[1,\infty)}\mu\left[\xi(.)\,\,\hbox{on}\,\,S_{[1,n+1]}(x)\,\,\big{\|}\,\,\xi(x)\right];$
		$\displaystyle=$	$\displaystyle\mu\left[\xi(.)\,\,\hbox{on}\,\,T^{{}^{\prime}}(x)\,\,\big{\|}\,\,\xi(x)\right].$

			$\displaystyle\prod_{k=1}^{\|S(x)\|}\mu\left[\xi(y_{k})\,\,\mid\,\,\xi(.)\,\,\hbox{on}\;(V\setminus T^{{}^{\prime}}(x))\cup\{y_{k+1},\cdots,y_{n}\}\right]$
		$\displaystyle=$	$\displaystyle\prod_{k=1}^{\|S(x)\|}\mu\left[\xi(y_{k})\,\,\mid\,\,\xi(.)\,\,\hbox{on}\;(V\setminus\{y_{k}\})\cap V^{{}^{\prime}}\right]$
		$\displaystyle=$	$\displaystyle\prod_{k=1}^{\|S(x)\|}\mu\left[\xi(y_{k})\,\,\mid\,\,\xi(.)\,\,\hbox{on}\;N_{y_{k}}\cap V^{{}^{\prime}}\right].$