\stackMath

Finite Model Property and Bisimulation for LFD

Raoul Koudijs ILLC
Amsterdam, The Netherlands [email protected]

Abstract

Recently, Baltag & van Benthem introduced a decidable logic of functional dependence (LFD) that extends the logic of Cylindrical Relativized Set Algebras (CRS) with atomic local dependence statements. Its semantics can be given in terms of generalised assignment models or their modal counterparts, hence the logic is both a first-order and a modal logic. We show that LFD has the finite model property (FMP) using Herwig’s theorem on extending partial isomorphisms, and prove a bisimulation invariance theorem characterizing LFD as a fragment of first-order logic.

1 Introduction

Recently, Baltag & van Benthem introduced a decidable logic of functional dependence (LFD) that extends the logic of Cylindrical Relativized Set Algebras (CRS) [2] with atomic dependence statements. The semantics is given in terms of dependence models¹¹1These are just the generalised assignment models known from [2][5]., which are pairs $(M,A)$ of a first-order structure $M$ together with a fixed set of variable assignments (or ’team’) $A\subseteq M^{V}$ on $M$ , where $V$ is some (possibly finite) ambient set of variables. Formulas are evaluated at individual assignments $s\in A$ ; in particular the dependence atoms get the following semantics: $s\models D_{X}y$ if for all $t\in A$ , $s\restriction X=t\restriction X$ implies $s(y)=t(y)$ . This is in contrast with logics based on team semantics, where dependence formulas are evaluated at teams and this team is dynamically changed over the course of evaluation. Whereas most logics based on team semantics are undecidable, non-classical and have expressive going beyond FOL, LFD is decidable with a classical semantics and can be considered a fragment of FOL.

Many interesting notions of dependence (such as lineair dependence in vector spaces, temporal dependence in dynamical systems and strategic interaction in a multi-player game) can be formalized in LFD [3]. Moreover, LFD invites a natural epistemic interpretation where (sets of) variables may represent (groups of) agents, (joint) questions or objects.[3] The dependence modalities then capture distributed knowledge or the interrogative modality, while the dependence atoms capture epistemic superiority or inquisitive implication (or other ’mixed’ notions). More spectacularly, [4] introduces a complete and decidable dynamic-epistemic logic based on LFD with so-called ’reading events’ as well as a notion of ’common distributed knowledge’ that combines features of common knowledge and distributed knowledge.

Dependence models are closely related to relational databases: assignments are rows in the table and each variable represents a column, or attribute. Here is a simple numerical example of a dependence model, viewed as a database:

x	y	z
1	0	1
1	0	0
0	1	1
2	0	2

In this table we see that e.g. $y$ locally depends on $x$ in the first row, because the second row, which agrees on $x$ with the first, also agrees on $y$ with it (and no other rows agree on $x$ with the first row). In fact, this dependence holds at all rows, in which case we say that $y$ globally depends on $x$ . Conversely, $x$ does not depend globally on $y$ because it does not locally depend on $y$ at the first row: both $1$ and $2$ occur as $x$ -values of rows that share the current $y$ -value $0$ . Finally, because the fourth row is the only row with $z$ -value $2$ , all other variables locally depend on $z$ there.

The foregoing example witnesses the close connection between LFD and the study of dependence in databases, and indeed the Projection and Transitivity axioms of LFD recapture Armstrong’s Axioms for functional dependence [3]. Deeper connections with database theory as well as team semantics might arise by introducing dynamics on the level of teams, generalizing the semantics to dependence universes, i.e. families of dependence models [3]. In particular, dependence models

The decidability proof in [3] uses completeness of LFD w.r.t a purely syntactic ’type semantics’ resembling the ’quasi-models’ studied in connection with the Guarded Fragment [5][2]. The question whether LFD has the finite model property (FMP) w.r.t. dependence models remained an open problem [3]. Our main result is that LFD has the FMP, by a new application of Herwig’s theorem on extending partial isomorphisms. Moreover, we define dependence bisimulations and show that LFD can be characterized as the fragment of FOL that is invariant under this notion. Independently, another notion of bisimulation for LFD along more standard lines has been proposed in [6]. We show that these notions are equivalent, but that dependence bisimulations suggest a more efficient procedure for checking bisimilarity.

2 Preliminaries

We first introduce the language LFD, dependence models and type models. A pair $(V,\tau)$ , where $V$ is set of variables and $\tau$ is a relational language is called a vocabulary. When both $V$ and $\tau$ are finite, we say that $(V,\tau)$ is a finite vocabulary. We write $FOL[V,\tau]$ for the set of first-order formulas with variables in $V$ (both free and bound) and predicates in $\tau$ , and similarly for $LFD[V,\tau]$ . We assume that each vocabulary becomes equipped with an arity map $ar:\tau\to\mathbb{N}$ .

Definition 2.1.

(Syntax) The language $LFD[V,\tau]$ is recursively defined by:

\varphi::=P\mathbf{x}\;|\;\neg\varphi\;|\;\varphi\wedge\varphi\;|\;\mathbb{D}_{X}\varphi\;|\;D_{X}y

where $X\subseteq V$ is a finite set of variables, $y\in V$ an individual variable, $P\in\tau$ a predicate symbol and $\mathbf{x}=(x_{1},...,x_{n})\in V^{ar(P)}$ a finite string of variables.²²2LFD as a modal language is generated by the same definition, but where $D_{X}y(\;),P\mathbf{x}(\;)$ become unary predicates in $\tau$ . Fixing notation, for any $Y\subseteq V$ , we write $s\models D_{X}Y$ if $s\models D_{X}y$ holds for all $y\in Y$ . We also skip the set brackets for singletons, writing $D_{x}Y$ for $D_{\{x\}}Y$ , and $D_{x}y$ for $D_{\{x\}}\{y\}$ . For every $\varphi\in LFD$ , we define its free variables by:

•

$Free(Px_{1}...x_{n})=\{x_{1},...,x_{n}\}$
•

$Free(D_{X}y)=Free(\mathbb{D}_{X}\varphi)=X$
•

$Free(\neg\varphi)=Free(\varphi)$ , $Free(\varphi\wedge\psi)=Free(\varphi)\cup Free(\psi)$

Moreover, we let $V_{\varphi}$ denote the set of variables occurring in $\varphi$ . This is in general a superset of the free of variables, i.e. $V_{D_{X}y}=X\cup\{y\}$ . Further, we say that $\tau_{\varphi}:=\{P\in\tau\;|\;P\;\textrm{occurs in}\;\varphi\}$ .

Definition 2.2.

(Dependence Models) A dependence model (for the vocabulary $(V,\tau)$ ) $\mathbb{M}$ is a pair $\mathbb{M}=(M,A)$ of a relational structure $M$ for $\tau$ , together with a fixed team $A\subseteq O^{V}$ .³³3We use letters $M$ for first-order structures and blackboard bold letters $\mathbb{M}=(M,A)$ for dependence models. For each $X\subseteq V$ , we define an agreement relation $=_{X}$ on the team:

s=_{X}t\;\textrm{iff}\;s\restriction X=t\restriction X

Note that $V$ may be finite. We call a dependence model distinguished if all the assignments are injective.

Definition 2.3.

(Semantics) Truth of a formula $\varphi$ in a dependence model $\mathbb{M}=(M,A)$ at an assignment $s\in A$ is defined by the following clauses (the Boolean cases are defined as usual:

	$\displaystyle s\models P\mathbf{x}\;\textrm{iff}\;s(\mathbf{x})\in I^{\mathbb{M}}(P)$
	$\displaystyle s\models\mathbb{D}_{X}\varphi\;\textrm{iff}\;t\models\varphi\;\textrm{holds for all}\;t\in A\;\textrm{with}\;s=_{X}t$
	$\displaystyle s\models D_{X}y\;\textrm{iff}\;s=_{X}t\;\textrm{implies}\;s=_{y}t\;\textrm{for all}\;t\in A.$

Where $s(\mathbf{x})$ denotes the tuple $(s(x_{1}),...,s(x_{m}))$ for $\mathbf{x}=(x_{1},...,x_{m})$ . Clearly, for every dependence model $(M,A)$ and assignments $s,t\in A$ , there is a unique set $V^{s,t}:=\{v\in V\;|\;s=_{v}t\}$ that is the maximal set of variables on which $s,t$ agree. An important feature of the semantics is LFD satisfies Locality.

\textrm{{Locality}}:\;\;\textrm{If}\;\;s=_{X}t\;\textrm{and}\;Free(\varphi)\subseteq X,\;\;\textrm{then}\;\;s\models\varphi\;\textrm{iff}\;t\models\varphi

Next to dependence models, LFD is weakly complete w.r.t. a non-standard type semantics.[3] In other words, only LFD over finite vocabularies is complete for this semantics. Type models were used in [3] as a technical auxiliary to prove completeness and decidability. Types are defined relative to closures. We obtain the closure $\Psi:=Cl(\psi)$ of a formula $\psi$ by adding to $\{\psi\}$ all formulas $D_{X}y$ for $X\cup\{y\}\subseteq V_{\psi}$ and closing the resulting set under subformulas and single negation. ⁴⁴4For every non-negated formula $\varphi$ (i.e. a formula whose principal connective is not $\neg$ ) we add $\neg\varphi$ to the closure, and for negated formulas we we do nothing. The resulting closure set will not contain any formulas with double negations.

Definition 2.4.

( $\Psi$ -Types) Let $\Psi$ be a closure in $LFD[V,\tau]$ . A subset $\Sigma\subseteq\Psi$ is a $\Psi$ -type if it satisfies the following conditions (where all formulas mentioned run over $\Psi$ only):

(a)

$\neg\psi\in\Sigma$ iff $\psi\not\in\Sigma$
(b)

$(\psi\wedge\chi)\in\Sigma$ iff $\psi\in\Sigma$ and $\chi\in\Sigma$
(c)

if $\mathbb{D}_{X}\psi\in\Sigma$ , then $\psi\in\Sigma$
(d)

$D_{X}x\in\Sigma$ for all $x\in X\subseteq V$
(e)

$D_{X}Y,D_{Y}Z\in\Sigma$ implies $D_{X}Z\in\Sigma$

For $X\subseteq V_{\psi}$ , we define a relation $\sim_{X}$ on types $\Sigma,\Delta\subseteq\Psi$ :

\displaystyle\Sigma\sim_{X}\Delta\qquad\textrm{iff}\qquad

\displaystyle\{\phi\in\Sigma\;|\;Free(\phi)\subseteq D^{\Sigma}_{X}\}=\{\phi\in\Delta\;|\;Free(\phi)\subseteq D^{\Sigma}_{X}\}

where $D^{\Sigma}_{X}=\{y\in V_{\varphi}\;|\;D_{X}y\in\Sigma\}$ is the dependence-closure of $X$ w.r.t and $\Sigma$ . Observe that $\Sigma\sim_{X}\Delta$ implies $D^{\Sigma}_{X}=D^{\Delta}_{X}$ as $Free(D_{X}y)=X$ .

Definition 2.5.

(Type Models) A type model (for $\Psi$ ) is a family of $\Psi$ -types satisfying:

•

if $\neg\mathbb{D}_{X}\neg\psi\in\Sigma\in\mathfrak{M}$ , then there exists a $\Delta\in\mathfrak{M}$ , such that $\psi\in\Delta$ and $\Sigma\sim_{X}\Delta$ .
•

$\Sigma\sim_{\emptyset}\Delta$ holds for all $\Sigma,\Delta\in\mathfrak{M}$ .

Type models are always finite, as there are only finitely many $\Psi$ -types for a given closure $\Psi$ . This proves the decidability as LFD is (weakly) complete w.r.t. type models [3]. The semantic conditions for type models are given by membership:

\Delta\models\psi\qquad\textrm{iff}\qquad\psi\in\Delta

2.1 Tree Model Property

Every satisfiable LFD formula can be satisfied on a certain tree-like dependence model. This fact follows from the fact that dependence models and type models provide equivalent semantics for LFD, i.e. each type model can be represented as a dependence model and vice versa [3]. The interesting direction is representing arbitrary type models as dependence models by means of an unravelling construction in the sense of modal logic. To say what we mean by ’tree-like’ we need the graph-theoretic notion of a $k$ -tree (the definition is taken from [7]). Say that an $r$ -tuple of objects $\mathbf{a}$ from a $\tau$ -structure $M$ is live in $M$ , if there is some $r$ -ary $P\in\tau$ such that $M\models P\mathbf{a}$ .

Definition 2.6.

( $k$ -Tree) A $\tau$ -structure $M$ is a $k$ -tree if there exists a tree (i.e. an acyclic, connected graph) $T=(V,E)$ and a function $F:V\to\{M^{\prime}\subseteq M\;|\;|M^{\prime}|\leq k\}$ , assigning to every node $v\in V$ of $T$ a set $F(v)$ of at most $k$ elements of $M$ , such that the following two conditions hold.

(i)

For every live tuple $\mathbf{a}=(a_{1},...,a_{r})$ from $M$ , there is some node $v$ such that $\{a_{1},...,a_{r}\}\subseteq F(v)$ .
(ii)

For every element $a$ of $M$ , the set of nodes $\{v\in V\;|\;a\in F(v)\}$ is connected (and hence induces a subtree of $T$ ).

$M$ is of finite branching degree if $T$ is, that is if the set of neighbours of every node in $T$ is finite.

Theorem 2.1.

Representation of Type Models [3]
Let $\mathfrak{M}$ be a type model for $\Psi$ . There exists a dependence model $\mathbb{M}=(M,A)$ with $\mathfrak{M}=\{type_{\Psi}(s)\;|\;s\in A\}$ .

Proof.

Let $m=|\mathfrak{M}|$ and $k=|V|$ where $V$ is the set of variables occurring in formulas in $\Psi$ (i.e. $V=\bigcup\{V_{\psi}\;|\;\psi\in\Psi\}$ ). Fix a type $\Sigma_{0}\in\mathfrak{M}$ . A good path is a sequence $\pi=\langle\Sigma_{0},X_{1},...,X_{n},\Sigma_{n}\rangle$ with $n>0$ such that for each $i\leq n$ (a) $\Sigma_{i}\in\mathfrak{M},X_{i}\subseteq V$ and (b) $\Sigma_{i-1}\sim_{X_{i}}\Sigma_{i}$ . Write $last(\pi)=\Sigma_{n}$ for the last element of $\pi$ , and $lh(\pi)=n+1$ for the length of $\pi$ (not counting the variable sets). For each good path $\pi$ , we define the path assignment $v_{\pi}$ , assigning objects of the form $(\pi,v)$ to variables $v\in V$ :

	$\displaystyle v_{\pi}(v)=(\pi,v)\;\textrm{if}\;\pi\;\textrm{has length 1, i.e.}\;\pi=\langle\Sigma_{0}\rangle\;\textrm{is the root of our tree}.$		(1)
	$\displaystyle v_{\pi}(v)=v_{\pi^{\prime}}(v)\;\textrm{if}\;\pi=(\pi^{\prime},X,\Sigma)\;\textrm{with}\;v\in D^{last(\pi^{\prime})}_{X}$		(2)
	$\displaystyle v_{\pi}(v)=(\pi,v)\;\textrm{if}\;\pi=(\pi^{\prime},X,\Sigma)\;\textrm{with}v\not\in D^{last(\pi^{\prime})}_{X}$		(3)

So new objects are created whenever the value for a variable is not locally determined by the predecessor path. We obtain a team $A:=\{v_{\pi}\;|\;\pi\;\textrm{a good path}\}$ on the structure $M$ with domain $\bigcup_{v_{\pi}\in A}v_{\pi}[V]$ and where an $r$ -ary $P\in\tau$ holds of an $r$ -tuple $((\pi_{1},x_{1}),...,(\pi_{r},x_{r}))$ iff all paths $\pi_{i}$ are linearly ordered by initial segment and the formula $P\mathbf{x}\in last(\pi_{j})$ , where $\pi_{j}$ is the longest path amongst $\{\pi_{1},...,\pi_{r}\}$ .

This yields a distinguished dependence model $\mathbb{M}=(M,A)$ whose objects $(\pi,x)$ are typed by a unique variable $x$ . The set of all good paths, ordered by initial segment, forms a tree $T$ whose branching degree is bounded by $2^{k}\times m$ . Together with the map $\pi\mapsto v_{\pi}[V]$ , this shows that $M$ is a $k$ -tree of finite branching degree. Finally, we have the following crucial truth lemma [3]:

Lemma 2.1.

Truth Lemma
For all formulas $\psi\in\Psi$ and good paths $\pi:\;\;$ $\mathbb{M},v_{\pi}\models\psi$ iff $\psi\in last(\pi)$

This lemma implies that $\mathfrak{M}=\{type_{\Psi}(s)\;|\;s\in A\}$ : because every type $\Delta\in\mathfrak{M}$ occurs as the $last(\pi)$ for some unique good path of length 2 already, namely $\pi_{\Delta}:=(\Sigma_{0},\emptyset,\Delta)$ . Moreover, note that we are free to choose the initial fixed type $\Sigma_{0}$ from $\mathfrak{M}$ in the definition of good path, and hence, by the truth lemma, we can choose what type to be satisfied at the root. ∎

Corollary 2.1.

Tree Model Property
If $\psi\in LFD$ is satisfiable and $|V_{\psi}|=k$ , there is a dependence model $\mathbb{M}=(M,A)$ , where $M$ is $k$ -tree of finite branching degree, satisfying $\varphi$ at the root assignment.

Definition 2.7.

(First-Order Translation) Although interpreted over a generalised semantics, LFD in finitely many variables can be encoded back into FOL over standard structures. So let $V$ be a finite set of variables with enumeration $\mathbf{v}=(v_{1},...,v_{n})$ . We double the amount of variables, creating a set of copied variables $V^{\prime}$ from the variables in $V$ . We ensure that the relevant assignments agree on their values for variables $v$ and their copies $v^{\prime}$ by the conjunction $\mathbf{v}=\mathbf{v^{\prime}}$ .⁵⁵5This additional condition (it was not in the original formulation in [3]) is essential for encoding the semantics of the dependence atoms into FOL, which treats as variables as completely independent otherwise. Further, we introduce a new $n$ -ary predicate $A$ such that $A\mathbf{v}$ encodes the fact that the tuples of values assigned to $\mathbf{v}$ by the current assignment is the range of some admissible assignment from the team (this is a tuple because $V$ is finite). The first-order translation $tr:LFD[V,\tau]\to FOL[V\cup V^{\prime},\tau]$ is defined by [3]:

•

$tr(P\mathbf{x})=P\mathbf{x}$ and $tr$ commutes with Boolean connectives
•

$tr(\mathbb{D}_{X}\psi)=\forall\mathbf{z}(A\mathbf{v}\to tr(\psi))$ , where $\mathbf{v}$ is the enumeration of all the variables in $V$ and $\mathbf{z}$ is the enumeration of all the variables in $V-X$ .
•

$tr(D_{X}y):=\forall\mathbf{z}\forall\mathbf{z^{\prime}}((A\mathbf{v}\wedge A\mathbf{v}[\mathbf{z^{\prime}}/\mathbf{z}])\to y=y^{\prime})$ , where $\mathbf{v},\mathbf{z}$ are as in part (d), $\mathbf{z^{\prime}}$ and $y^{\prime}$ are the corresponding fresh $V^{\prime}$ -copies of $\mathbf{z}$ and $y$ respectively.⁶⁶6Furthermore, $A\mathbf{v}[\mathbf{z^{\prime}}/\mathbf{z}]$ denotes the formula that is obtained by replacing the variables $\mathbf{z}$ by $\mathbf{z^{\prime}}$ in the formula $A\mathbf{v}$ .

There is a one-to-one correspondence between dependence models and structures in this extended language. If $\mathbb{M}=(M,A)$ is a dependence model, $T(\mathbb{M})$ is the expansion of $M$ with the interpretation $I(A):=\{s(\mathbf{v})\;|\;s\in A\}$ . Conversely, given any $\tau\cup\{A\}$ -structure $M^{\prime}$ we obtain a team $A:\{s:V\to M^{\prime}\;|\;s(\mathbf{v})\in I^{M^{\prime}}(A)\}$ which together with a reduct of $M^{\prime}$ makes for the corresponding dependence model. We have the equivalence:

\mathbb{M},s\models\varphi\qquad\textrm{iff}\qquad T(\mathbb{M}),s^{+}\models\mathbf{v}=\mathbf{v^{\prime}}\to tr(\varphi)

for every $s\in A$ and all assignments $s^{+}\in M^{\mathrm{Var}}$ extending $s$ . This translation easily adapts to other local dependence atoms proposed in [6], e.g. $tr(x=y):=\;x=y$ and $tr(\mathbf{x}\in\mathbf{y}):=\;\exists\mathbf{v^{\prime}}(A\mathbf{v^{\prime}}\wedge\bigwedge_{i\leq|\mathbf{x}|}x_{i}=y^{\prime}_{i})$ .

3 Characterization

The original paper [3] left finding a bisimulation-invariance theorem characterizing LFD as an open problem. precisely which formulas in $FOL[V\cup V^{\prime},\tau\cup\{A\}]$ are equivalent to the $tr$ -translation of an LFD-formula over standard structures.⁷⁷7There is also a modal translation of LFD into FOL that extends the well-known standard translation of modal logic into the 2-variable fragment of FOL. A similar characterization theorem can be proved via this translation and the relational semantics for LFD, as our notion of bisimulation as well as the one proposed in [6] are naturally formulated on dependence models as well as their modal counterparts. The following notion of dependence bisimulation exactly characterizes $LFD$ as the largest fragment of $FOL$ invariant under this notion. Say that a set of variables $X$ is dependence-closed at $s^{\prime}$ if $D^{s^{\prime}}_{X}:=\{y\in V\;|\;;s^{\prime}\models D_{X}y\}=X$ , or equivalently if $s^{\prime}\models D_{X}y$ implies $y\in X$ .

Definition 3.1.

(Dependence Bisimulation) Let $\mathbb{M},\mathbb{M^{\prime}}$ be dependence models. We say that a non-empty relation $Z\subseteq A\times A^{\prime}$ is a dependence-bisimulation if for every $(s,s^{\prime})\in Z$ :

(Atom)

$s\models P\mathbf{x}$ iff $s^{\prime}\models P\mathbf{x}$
(Forth)

For every $t\in A$ , (i) the set $V^{s,t}$ is dependence-closed at $s^{\prime}$ and
there is some $t^{\prime}\in A$ such that (ii) $s^{\prime}=_{V^{s,t}}t^{\prime}$ and (iii) $(t,t^{\prime})\in Z$
(Back)

symmetric to the (Forth) clause

Dependence bisimulations are always total; every state is related to another by the bisimulation.

Proposition 3.1.

LFD-formulas are invariant under dependence bisimulations.

Proof.

Let $\mathbb{M},\mathbb{M^{\prime}}$ be dependence models and $Z\subseteq A\times A^{\prime}$ a dependence bisimulation with $(s,s^{\prime})\in Z$ and $\varphi\in$ LFD. We show that $s\models\varphi$ iff $s^{\prime}\models\varphi$ by induction on the complexity of $\varphi$ ; the atomic and Boolean cases are trivial. For the other cases, we show only one direction.

( $\mathbb{D}_{X}\psi)\quad$ Suppose that $s\models\mathbb{D}_{X}\psi$ and let $s^{\prime}=_{X}t^{\prime}$ , i.e. $X\subseteq V^{s^{\prime},t^{\prime}}$ , for some $t^{\prime}\in A^{\prime}$ . By the (Back)-clause there is some $t\in A$ such that $s=_{X}t$ and $(t,t^{\prime})\in Z$ . Hence $t\models\psi$ and so $t^{\prime}\models\psi$ by $(IH)$ .

( $D_{X}y)\quad$ Suppose that $s\models D_{X}y$ and let $s^{\prime}=_{X}t^{\prime}$ for some $t^{\prime}\in A^{\prime}$ . We want to show that $s^{\prime}=_{y}t^{\prime}$ , i.e. $y\in V^{s^{\prime},t^{\prime}}$ . By the (Back)-clause there is some $t\in A$ with $(t,t^{\prime})\in Z$ , $s=_{V^{s^{\prime},t^{\prime}}}t$ and $V^{s^{\prime},t^{\prime}}$ is dependence-closed at $s$ . As $X\subseteq V^{s^{\prime},t^{\prime}}$ , by monotonicity of dependence we have $s\models D_{V^{s^{\prime},t^{\prime}}}y$ . This shows that $y\in V^{s^{\prime},t^{\prime}}$ as $V^{{}^{\prime}s,t^{\prime}}$ is dependence-closed at $s$ . ∎

Dependence bisimulations in fact characterize LFD as a fragment of FOL. This can be shown by formulating an analogue of dependence bisimulations for structures of the form $T(\mathbb{M})$ , and showing that on $\omega$ -saturated structures of this form, LFD-equivalence implies dependence-bisimilarity.

Independently, another notion of bisimulation characterizing LFD has been proposed in [6] that treats dependence atoms like ordinary relational atoms. That is, instead of the dependence-closed condition they simply require that ” $s\models D_{X}y$ iff $s^{\prime}\models D_{X}y$ ” holds for all $X\cup\{y\}\subseteq V$ . It follows that proposition 3.1 shows that dependence bisimulations are also bisimulations in their sense. Conversely, ” $s\models D_{X}y$ iff $s^{\prime}\models D_{X}y$ ” clearly implies the dependence-closed condition, hence the two notions are equivalent. It follows that the proof given in [6] also shows that LFD is the dependence bisimulation-invariant fragment of FOL.

Theorem 3.1.

Van Benthem Characterization
$LFD$ is the largest fragment of $FOL$ that is invariant under dependence bisimulations.

Dependence bisimulations suggest a more efficient way to implement a bisimilarity-checking algorithm for LFD compared to the definition in [6]. For what proposition 3.1 shows is that, given $(M,A),(M^{\prime},A^{\prime})$ with $s\in A,s^{\prime}\in A^{\prime}$ , it actually suffices to check that ” $s\models D_{X}y$ iff $s^{\prime}\models D_{X}y$ for all $y\in V$ ” holds for all $X\in\{V^{s,t}\subseteq V\;|\;t\in A\}\cup\{V^{s^{\prime},t^{\prime}}\subseteq V\;|\;t^{\prime}\in A^{\prime}\}$ in order to conclude that ” $s\models D_{X}y$ iff $s^{\prime}\models D_{X}y$ for all $y\in V$ ” holds for all $X\subseteq V$ . This could be used to avoid an exponential blow-up in $|V|$ .

Dependence bisimulations generalise naturally to extensions of LFD. For instance, we can extend $LFD$ with the equality relation $=$ , yielding the logic $LFD^{=}$ which was shown to be a conservative reduction class of FOL and hence undecidable in [6]. Dependence bisimulations with an extended (Atom) clause that also ranges over equality can be shown to characterize $LFD^{=}$ as a fragment to FOL. Interestingly, over full dependence models (i.e. those $(M,A)$ with $A=M^{V}$ , which are standard first-order structures repackaged as dependence models), dependence bisimulations (for LFD over a finite vocabulary $(V,\tau)$ with $|V|=k$ ) coincides with $k$ -potential isomorphism, which characterizes first-order logic in $k$ variables.

4 Finite Model Property

We show that LFD has the FMP w.r.t the intended dependence model semantics, by an application of Herwig’s theorem similar to the one in [7]. Fix a satisfiable LFD-formula $\varphi$ , and let $\Phi:=Cl(\{\varphi\})$ . We let $(V,\tau):=(V_{\varphi},\tau_{\varphi})$ be the smallest vocabulary containing $\varphi$ and hence $\Phi$ . Note that $(V,\tau)$ is a finite vocabulary, so let $|V|=k$ . We know that there is a tree-like dependence model $\mathbb{M}=(M,A)$ , with associated tree $T$ of good paths, satisfying $\varphi$ at the root assignment. Furthermore, the degree of $T$ is bounded by $m\times 2^{k}$ , where $m$ is the number of distinct $\Phi$ -types. Our strategy is as follows: we will cut the underlying $k$ -tree $M$ to a finite structure, encode the dependence atoms in a richer language and finally use Herwig’s theorem to generate out of this a finite dependence model that is bisimilar to the original tree-model. Define a sub-team of $A$ by:

A_{cut}:=\{v_{\pi}\in A\;|\;lh(\pi)\leq 3\}

and let $M_{cut}$ be the submodel of $M$ induced by $\bigcup\{v_{\pi}[V]\subseteq M\;|\;v_{\pi}\in A_{cut}\}$ ; we call $\mathbb{M}_{cut}:=(M_{cut},A_{cut})$ the cut-off model. This is a finite model because the branching degree of $T$ is bounded and $V$ is finite. The truth lemma clearly no longer holds on this cut-off model, because some existential witnesses are missing for assignments of length 3.

We extend the language to include an $|X|$ -ary relation $R^{X,y}$ for each $X\cup\{y\}\subseteq V$ , and obtain the (still finite) richer language $\tau^{+}\supseteq\tau$ . We will use these relations to encode the semantics of the dependence atoms. We expand the structure $M_{cut}$ underlying the cut-off model to a $\tau^{+}$ structure by putting:

I^{M_{cut}}(R^{X,y}):=\{v_{\pi}(\mathbf{x})\;|\;D_{X}y\in last(\pi)\}

so that $\mathbb{M}_{cut},v_{\pi}\models R^{X,y}\mathbf{x}$ iff $D_{X}y\in last(\pi)$ . In the end, we want to show that $R^{x,y}\mathbf{x}\leftrightarrow D_{X}y$ holds on the Herwig extension, so that we can recover an appropriate dependence model from it. To show this, we will need the following restricted version of this claim on the cut-off model:

Proposition 4.1.

For each $v_{\pi}\in A_{cut}$ of length $lh(\pi)\leq 2:\quad v_{\pi}\models D_{X}y\to R^{X,y}\mathbf{x}$ .

Proof.

By contraposition, so suppose that $v_{\pi}\not\models R^{X,y}\mathbf{x}$ . This means that $D_{X}y\not\in last(\pi)$ , so for the good path $\pi^{+}:=(\pi,X,last(\pi))$ (it is a good path as $last(\pi)\sim_{X}last(\pi)$ trivially holds) we have that $v_{\pi}=_{X}v_{\pi^{+}}$ and $v_{\pi}\neq_{y}v_{\pi^{+}}$ , i.e. $v_{\pi}\not\models D_{X}y$ . ∎

Herwig’s theorem on extending partial isomorphism [8] is a result about first-order relational languages. It tells us that any finite structure with some set of partial isomorphisms on it has a finite extension in which all these partial isomorphisms extend to automorphisms. This theorem has already been used to show the FMP of the Guarded Fragment (GF) [7].

Theorem 4.1.

Herwig
Let $\sigma$ be a finite relational language, $C$ a finite $\sigma$ -structure and $\{p_{1},...,p_{k}\}$ a (finite) set of partial isomorphisms on $C$ . Then there exists a finite extension $C^{+}$ of $C$ that satisfies the following conditions:

(i)

Every $p_{i}$ extends to a unique automorphism $\widehat{p_{i}}$ of $C^{+}$ . This yields a subgroup $\langle\widehat{p_{1}},...,\widehat{p_{k}}\rangle$ of the automorphism group of $C^{+}$ .
(ii)

If a tuple $\mathbf{a}=(a_{1},....,a_{r})$ from $C^{+}$ is live or $r=1$ , then there exists an automorphism $f\in\langle\widehat{p_{1}},...,\widehat{p_{k}}\rangle$ such that for each $i\leq r$ , $f(a_{i})\in C$ .
(iii)

If $\exists f\in\langle\widehat{p_{1}},...,\widehat{p_{k}}\rangle$ and $a,b\in C$ such that $f(a)=b$ , then either $f=id$ or there is a unique $p\in\langle p_{1},...,p_{k}\rangle$ such that $\widehat{p}=f$ and $p(a)=b$ .

where $\langle p_{1},...,p_{k}\rangle$ is the collection of all partial isomorphisms that can be obtained by composing the $p_{i}$ with their inverses. Note that $\langle p_{1},...,p_{k}\rangle$ is strictly speaking not a group as it need not be the case that $p\circ p^{-1}$ is the identity on $C$ (in general, it is the identity on a subset of $C$ ).

Condition (iii) is in need of further clarification. In words, it says that elements in the submodel $C$ are only mapped to each other by some $f\in\langle f_{1},...,f_{n}\rangle$ if this is forced given the choice of partial isomorphisms. Uniqueness of $p$ in this condition is ensured by the fact that the map $\widehat{(\;)}$ extends to a bijective map $\widehat{(\;)}:\langle p_{1},...,p_{k}\rangle\to\langle\widehat{p_{1}},...,\widehat{p_{k}}\rangle$ that commutes with the operations $\circ,(\;)^{-1}$ (and the identity $id$ ). By condition (i), $\widehat{(\;)}$ is defined on the subset $\{p_{1},...,p_{k}\}$ . Set $\widehat{p^{-1}}:=\widehat{p}^{-1}$ and $\widehat{p\circ p^{\prime}}:=\widehat{p}\circ\widehat{p^{\prime}}$ ; so commutation follows by definition. It immediately follows that the map is injective. For surjectivity, let $f\in\langle\widehat{p_{1}},...,\widehat{p_{k}}\rangle$ . By definition $f=\widehat{p_{i_{1}}}^{\epsilon_{1}}\circ...\circ\widehat{p_{i_{m}}}^{\epsilon_{m}}$ for some $\{i_{1},...,i_{m}\}\subseteq\{1,...,k\}$ and $\epsilon_{j}\in\{-1,1\}$ for each $j\leq m$ . Define $p:=p_{i_{1}}^{\epsilon_{1}}\circ...\circ p_{i_{m}}^{\epsilon_{m}}\in\langle p_{1},...,p_{k}\rangle$ . Now observe:

\widehat{p}=\savestack{\tmpbox}{\stretchto{\scaleto{\scalerel*[\widthof{p_{i_{1}}^{\epsilon_{1}}\circ...\circ p_{i_{m}}^{\epsilon_{m}}}]{\kern-0.6pt\bigwedge\kern-0.6pt}{\rule[-505.89pt]{4.30554pt}{505.89pt}}}{}}{0.5ex}}\stackon[1pt]{p_{i_{1}}^{\epsilon_{1}}\circ...\circ p_{i_{m}}^{\epsilon_{m}}}{\tmpbox}=\widehat{p_{i_{1}}^{\epsilon_{1}}}\circ...\circ\widehat{p_{i_{m}}^{\epsilon_{m}}}=\widehat{p_{i_{1}}}^{\epsilon_{1}}\circ...\circ\widehat{p_{i_{m}}}^{\epsilon_{m}}=f

We proceed with specifying a choice of partial isomorphisms on the cut-off model. If $\pi$ is a good path of $lh(\pi)=3$ and $last(\pi)=\Delta$ , then there is a partial isomorphism $p_{\pi}:v_{\pi}[V_{\varphi}]\to v_{\pi_{\Delta}}[V_{\varphi}]$ such that $p_{\pi}\circ v_{\pi}=v_{\pi_{\Delta}}$ , where $\pi_{\Delta}:=\langle\Sigma_{0},\emptyset,\Delta\rangle$ so $lh(\pi_{\Delta})=2$ . We pick the finite set of partial isomorphisms $\{p_{\pi}\;|\;\pi\;\textrm{good path of}\;lh(\pi)=3\}=\{p_{1},...,p_{k}\}$ . The following proposition tells us what kind of partial isomorphisms are in $\langle p_{1},...,p_{k}\rangle$ .

Lemma 4.1.

If $p\in\langle p_{1},...,p_{k}\rangle$ with $pv_{\pi}=_{X}v_{\pi^{\prime}}$ , then there are $v_{\rho},v_{\rho^{\prime}}\in A_{cut}$ with $last(\rho)=last(\rho^{\prime})$ such that $v_{\rho}=_{X}v_{\pi}$ , $v_{\rho^{\prime}}=_{X}v_{\pi^{\prime}}$ and $pv_{\rho}=v_{\rho^{\prime}}$ .

Proof.

Let $p\in\langle p_{1},...,p_{k}\rangle$ such that $pv_{\pi}=_{X}v_{\pi^{\prime}}$ . By definition, $p=p_{i_{m}}^{\epsilon_{m}}\circ...\circ p_{i_{1}}^{\epsilon_{1}}$ for some $\{i_{1},...,i_{m}\}\subseteq\{1,...,k\}$ and $\epsilon_{j}\in\{-1,1\}$ for each $1\leq j\leq m$ . Note that for each $j\leq m$ we have that $p_{i_{j}}\in\{p_{1},...,p_{k}\}=\{p_{\pi}\;|\;\pi\;\textrm{a good path of}\;lh(\pi)=3\}$ , so $p_{i_{j}}^{\epsilon_{j}}\circ v_{\pi_{j-1}}=v_{\pi_{j}}$ ⁸⁸8More specifically $p_{i_{j}}^{\epsilon_{j}}\circ v_{\pi_{j-1}}=_{V}v_{\pi_{j}}$ , but this is the same as equality as $dom(v_{\pi_{j}})=dom(v_{\pi_{j_{1}}})=V$ . Another way of putting this is that $dom(p_{i_{j}}^{\epsilon_{j}})=v_{\pi_{j-1}}[V]$ and $cod(p_{i_{j}}^{\epsilon_{j}})=v_{\pi_{j}}[V]$ . for some $v_{\pi_{j-1}},v_{\pi_{j}}\in A_{cut}$ with $last(\pi_{j-1})=last(\pi_{j})$ . In particular, there are $v_{\pi_{0}},v_{\pi_{1}}\in A_{cut}$ such that $last(\pi_{0})=last(\pi_{1})$ and $p_{i_{1}}^{\epsilon_{1}}v_{\pi_{0}}=v_{\pi_{1}}$ . Set $\rho:=\pi_{0}$ . It follows that $v_{\pi}=_{X}v_{\pi_{0}}$ and so $pv_{\pi_{0}}=_{X}pv_{\pi}=_{X}v_{\pi^{\prime}}$ , i.e.

pv_{\pi_{0}}=p_{i_{m}}^{\epsilon_{m}}\circ...\circ p_{i_{1}}^{\epsilon_{1}}v_{\pi_{0}}=_{X}v_{\pi^{\prime}}

This was the base case for an inductive argument up to $m$ . So let $j\leq m$ and suppose that $v_{\pi_{j}}\in A_{cut}$ with $last(\pi_{j})=last(\pi_{0})$ and

pv_{\pi_{0}}=p_{i_{1}}^{\epsilon_{1}}\circ...\circ p_{i_{j+1}}^{\epsilon_{j+1}}v_{\pi_{j}}=_{X}v_{\pi^{\prime}}

Now recall that $p_{i_{j+1}}^{\epsilon_{j+1}}v_{\pi_{j}}=v_{\pi_{j+1}}$ for some $v_{\pi_{j+1}}\in A_{cut}$ with $last(\pi_{j+1})=last(\pi_{j})$ . Moreover, it follows that $p_{i_{1}}^{\epsilon_{1}}\circ...\circ p_{i_{j+2}}^{\epsilon_{j+2}}v_{\pi_{j+1}}=_{X}v_{\pi^{\prime}}$ . Hence by induction, there is some $v_{\pi_{m}}\in A_{cut}$ with $pv_{\pi_{0}}=v_{\pi_{m}}$ such that $last(\pi_{m})=last(\pi_{0})$ and

v_{\pi_{m}}=p_{i_{m}}^{\epsilon_{m}}\circ...\circ p_{i_{1}}^{\epsilon_{1}}v_{\pi_{0}}=pv_{\pi_{0}}=_{X}v_{\pi^{\prime}}

then for $\rho=\pi_{0}$ and $\rho^{\prime}=\pi_{m}$ we have proved the lemma ∎

The associated first-order structure $T(\mathbb{M}_{cut})$ of the Herwig extension is a finite model in a finite relational language $\tau^{+}\cup\{A\}$ , and $\{p_{1},...,p_{k}\}$ is a finite set of partial isomorphisms on it. Hence, by Herwig’s theorem, there exists a finite extension $T(\mathbb{M}_{cut})^{+}$ of this structure, the Herwig extension, satisfying conditions (i)-(iii) w.r.t $\{p_{1},...,p_{k}\}$ . It is easy to see that the Herwig extension corresponds in the canonical way (i.e. see the first-order translation above) to a dependence model $\mathbb{M}_{cut}^{+}:=(M_{cut}^{+},A_{cut}^{+})$ such that $T(\mathbb{M}_{cut}^{+})=T(\mathbb{M}_{cut})^{+}$ . Recall that we want to establish a bisimulation between the finite Herwig extension $\mathbb{M}_{cut}^{+}$ and the infinite tree model $\mathbb{M}$ . To do this, we will need the following lemmas.

Lemma 4.2.

Level 2 Lemma
For every $s\in A_{cut}^{+}$ there is an $f\in\langle\widehat{p_{1}},...,\widehat{p_{k}}\rangle$ such that $f\circ s=v_{\pi}\in A_{cut}$ where $lh(\pi)\leq 2$ .

Proof.

Let $s\in A_{cut}^{+}$ . Then the tuple $s(\mathbf{v})\in I(A)$ is live in $T(\mathbb{M}_{cut}^{+})$ . Hence by condition (ii) there is some automorphism $f\in\langle\widehat{p_{1}},...,\widehat{p_{k}}\rangle$ such that $fs(\mathbf{v})$ is a tuple of objects of the submodel $T(M_{cut})$ . As $f$ is an isomorphism, it follows that $fs(\mathbf{v})\in I(A)$ as well. But this can only be if $fs(\mathbf{v})=v_{\pi}(\mathbf{v})$ for some $v_{\pi}\in A_{cut}$ . Now suppose that $lh(\pi)=3$ , with $last(\pi)=\Delta$ , then by (i) there is an automorphism $\widehat{p_{\pi}}$ such that $\widehat{p_{\pi}}f\in\langle\widehat{p_{1}},...,\widehat{p_{k}}\rangle$ and $\widehat{p_{\pi}}fs=v_{\pi_{\Delta}}$ , where $lh(\pi_{\Delta})=2$ . Hence we may assume that there exists some $g\in\langle\widehat{p_{1}},...,\widehat{p_{k}}\rangle$ such that $gs=v_{\pi}$ for some path assignment $v_{\pi}\in A_{cut}$ of length $lh(\pi)\leq 2$ . ∎

Next, we generalise the notion of ’underlying type’ (i.e. $last(\pi)$ for a path assignment $v_{\pi}$ ) to all assignments in $A_{cut}^{+}$ . We define a function $type(\;):A_{cut}^{+}\to\{\Delta\subseteq\Phi\;|\;\Delta\;\textrm{is a}\;\Phi\textrm{-type}\}$ . Set $type(v_{\pi}):=last(\pi)$ for all $v_{\pi}\in A_{cut}\subset A_{cut}^{+}$ . For $s\in A_{cut}^{+}\setminus A_{cut}$ , by the level 2 lemma we know there is $f\in\langle\widehat{p_{1}},...,\widehat{p_{k}}\rangle$ such that $fs=v_{\pi}\in A_{cut}$ , and we set $type(s):=last(\pi)$ .

Proof.

Well-definedness of $type(\;)$
Let $f,g\in\langle\widehat{p_{1}},...,\widehat{p_{k}}\rangle$ be automorphisms with $fs=V_{\pi}\in A_{cut}$ and $gs=v_{\pi^{\prime}}\in A_{cut}$ . Observe that $f\circ g^{-1}$ is an automorphism in the subgroup $\langle\widehat{p_{1}},...,\widehat{p_{k}}\rangle$ that maps elements in $M_{cut}$ to each other, as $fg^{-1}\circ v_{\pi^{\prime}}=v_{\pi}$ . Hence by (iii) there must be a unique $p\in\langle p_{1},...,p_{k}\rangle$ such that $\widehat{p}=fg^{-1}$ and thus $pv_{\pi^{\prime}}=v_{\pi}$ . Lemma 4.1 tells us that there are assignments $v_{\rho},v_{\rho^{\prime}}\in A_{cut}$ with $v_{\pi}=_{V}v_{\rho}$ , $v_{\pi^{\prime}}=_{V}v_{\rho^{\prime}}$ and $last(\rho)=last(\rho^{\prime})$ . It is an easy consequence of the Truth Lemma (lemma 2.1) and Locality that $v_{\pi}=_{V}v_{\rho}$ implies that $last(\pi)=last(\rho)$ and similarly for $\pi^{\prime},\rho^{\prime}$ .⁹⁹9For suppose that $v_{\pi}=_{V}v_{\rho}$ . Observe that $D^{v_{\pi_{0}}}_{V}=V$ for any path assignment with $dom(v_{\pi_{0}})=V$ . By Locality the hypothesis gives that $\{\xi\;|\;v_{\pi}\models\xi\;\&\;Free(\xi)\subseteq V\}=\{\xi\;|\;v_{\rho}\models\xi\;\&\;Free(\xi)\subseteq V\}$ . By the Truth Lemma, this in turn implies that $\{\xi\;|\;\xi\in last(\pi)\;\&\;Free(\xi)\subseteq V\}=\{\xi\;|\;\xi\in last(\rho)\;\&\;Free(\xi)\subseteq V\}$ which says that $last(\pi)\sim_{V}last(\rho)$ , but this clearly implies that $last(\pi)=last(\rho)$ . Hence $last(\pi)=last(\rho)=last(\rho^{\prime})=last(\pi^{\prime})$ . ∎

This last fact used, i.e. that $v_{\pi}=_{X}v_{\pi^{\prime}}$ implies $last(\pi)\sim_{X}last(\pi^{\prime})$ , we will now generalise to all assignments in $s,t\in A_{cut}^{+}$ w.r.t their ’underlying types’ $type(s),type(t)$ .

Lemma 4.3.

Type Lemma
If $s,t\in A_{cut}^{+}$ with $s=_{X}t$ , then $type(s)\sim_{X}type(t)$ .

Proof.

Let $s,t\in A_{cut}^{+}$ with $s=_{X}t$ . By the level 2 lemma, there is $f\in\langle\widehat{p_{1}},...,\widehat{p_{k}}\rangle$ such that $fs=v_{\pi}\in A_{cut}$ with $lh(\pi)\leq 2$ , so $type(s)=last(\pi)$ . As $f$ is an isomorphism on $T(\mathbb{M}_{cut}^{+})$ , we know that $ft\in A_{cut}^{+}$ is an assignment as well, with $fs=v_{\pi}=_{X}ft$ . By applying the level 2 lemma again to $ft$ , we get a $g\in\langle\widehat{p_{1}},...,\widehat{p_{k}}\rangle$ such that $gft=v_{\pi^{\prime}}\in A_{cut}$ with $lh(\pi^{\prime})\leq 2$ , so $type(t)=last(\pi^{\prime})$ . Again, we know that $gv_{\pi}\in A_{cut}^{+}$ is also an assignment (though in general not one in $A_{cut}$ ) such that $gv_{\pi}=gfs=_{X}gft=v_{\pi^{\prime}}$ . But observe that the automorphism $g$ maps $v_{\pi}(x)\mapsto v_{\pi^{\prime}}(x)$ for all $x\in X$ , hence by condition (iii) there must be a unique $p\in\langle p_{1},...,p_{k}\rangle$ such that $\widehat{p}=g$ and so $pv_{\pi}=_{X}v_{\pi^{\prime}}$ . By Lemma 4.1, there are $v_{\rho},v_{\rho^{\prime}}\in A_{cut}$ such that $v_{\pi}=_{X}v_{\rho}$ , $v_{\pi^{\prime}}=_{X}v_{\rho^{\prime}}$ and $last(\rho)=last(\rho^{\prime})$ . Invoking the Truth Lemma and Locality as before this implies that $last(\pi)\sim_{X}last(\rho)$ and $last(\pi^{\prime})\sim_{X}last(\rho^{\prime})$ . Concatenating these facts we see that

type(s)=last(\pi)\sim_{X}last(\rho)=last(\rho^{\prime})\sim_{X}last(\pi^{\prime})=type(t)

∎

Lemma 4.4.

Encoding Lemma
For all $s\in A_{cut}^{+}$ and all $R^{X,y}\in\tau^{+}:\quad s\models R^{X,y}\mathbf{x}\leftrightarrow D_{X}y$

Proof.

( $\leftarrow$ ) By the level 2 lemma, there is $f\in\langle\widehat{p_{1}},...,\widehat{p_{k}}\rangle$ such that $fs=v_{\pi}\in A_{cut}$ with $lh(\pi)\leq 2$ . Applying the first-order translation to proposition 3.1 we get that $T(\mathbb{M}_{cut}),v_{\pi}\models tr(\neg R^{X,y}\mathbf{x}\to\neg D_{X}y)$ . But observe that

	$\displaystyle tr(\neg R^{X,y}\mathbf{x}\to\neg D_{X}y)\;=\;\neg R^{X,y}\mathbf{x}\to tr(\neg D_{X}y)\;\equiv\;$	$\displaystyle R^{X,y}\mathbf{x}\vee\exists\mathbf{z},\mathbf{z^{\prime}}(A\mathbf{v}\wedge A\mathbf{v^{\prime}}[\mathbf{z}/\mathbf{z^{\prime}}]\wedge y\neq y^{\prime})$
	$\displaystyle\;\equiv\;$	$\displaystyle\exists\mathbf{z},\mathbf{z^{\prime}}(R^{X,y}\mathbf{x}\vee(A\mathbf{v}\wedge A\mathbf{v^{\prime}}[\mathbf{z}/\mathbf{z^{\prime}}]\wedge y\neq y^{\prime}))$

is an existential first-order formula. Hence by the dualized version of the Łoś-Tarski theorem, this still holds in the Herwig extension, i.e. $T(\mathbb{M}_{cut}^{+}),v_{\pi}\models\neg R^{X,y}\mathbf{x}\to tr(\neg D_{X}y)$ . As $f$ is an isomorphism on $T(\mathbb{M}_{cut}^{+})$ and $fs=v_{\pi}$ , we get that $T(\mathbb{M}_{cut}^{+}),s\models\neg R^{X,y}\mathbf{x}\to tr(\neg D_{X}y)$ , as desired.

( $\to$ ) Suppose that $s\models R^{X,y}\mathbf{x}$ , and let $s=_{X}t$ for some $t\in A_{cut}^{+}$ . The former fact implies that $D_{X}y\in type(s)$ and the latter by the Type Lemma implies that $type(s)\sim_{X}type(t)$ . It follows that $D_{X}y\in type(t)$ as well. Applying the level 2 lemma two times successively as before, we obtain automorphism $f,g\in\langle\widehat{p_{1}},...,\widehat{p_{k}}\rangle$ such that $fs=v_{\pi}\in A_{cut}^{+}$ , $gft=v_{\pi^{\prime}}\in A_{cut}^{+}$ with $V^{s,t}=V^{v_{\pi},ft}=V^{gv_{\pi},v_{\pi^{\prime}}}$ (recall the notation $V^{a,b}=\{v\in V\;|\;a=_{v}b\}$ ). As in the type lemma, we see that $g:v_{\pi}(x)\mapsto v_{\pi^{\prime}}(x)$ (i.e. $gv_{\pi}=_{X}v_{\pi^{\prime}}$ ) for all $x\in X$ and thus by condition (iii) there is a unique $p\in\langle p_{1},...,p_{k}\rangle$ such that $\widehat{p}=g$ and hence $pv_{\pi}=_{X}v_{\pi^{\prime}}$ .

We know by Lemma 4.1 that there must be $v_{\rho},v_{\rho^{\prime}}\in A_{cut}$ with $last(\rho)=last(\rho^{\prime})$ such that $v_{\pi}=_{X}v_{\rho}$ , $v_{\pi^{\prime}}=_{X}v_{\rho^{\prime}}$ and $pv_{\rho}=v_{\rho^{\prime}}$ . By fact 4.9 from [3], this implies that there is a path $last(\pi)\sim_{X}....\sim_{X}last(\rho)$ and similarly for $\pi^{\prime},\rho^{\prime}$ . We saw that $D_{X}y\in type(s)\cap type(t)=last(\pi)\cap last(\pi^{\prime})$ , so in fact $D_{X}y$ must be in all the types along these paths. But then it follows from condition (2) of the recursive definition of path assignments (in the proof of theorem 2.1) that $v_{\pi}=_{y}v_{\rho}$ and $v_{\pi^{\prime}}=_{y}v_{\rho^{\prime}}$ . But recall that $pv_{\rho}=v_{\rho^{\prime}}$ so:

ft=g^{-1}gft=g^{-1}v_{\pi^{\prime}}=\widehat{p^{-1}}v_{\pi^{\prime}}=_{y}\widehat{p^{-1}}v_{\rho^{\prime}}=v_{\rho}

But then $v_{\pi}=_{y}v_{\rho}=_{y}ft$ so by transitivity $y\in V^{v_{\pi},ft}=V^{s,t}$ and we conclude that $s=_{y}t$ . ∎

Theorem 4.2.

The dependence models $\mathbb{M}$ and $\mathbb{M}_{cut}^{+}$ are dependence-bisimilar.

Proof.

We show that the relation $Z\subseteq A_{cut}^{+}\times A$ defined by $Z:=\{(s,v_{\pi})\;|\;type(s)=last(\pi)\}$ is an LFD-bisimulation in the sense of [6] and hence, by our remark above, also a dependence bisimulation. Pick an arbitrary pair $(s,v_{\pi})\in Z$ . By the level 2 lemma, there is some $f\in\langle\widehat{p_{1}},...,\widehat{p_{k}}\rangle$ such that $fs=v_{\pi^{\prime}}\in A_{cut}$ with $lh(\pi^{\prime})\leq 2$ , hence $type(s)=v_{\pi^{\prime}}$ . As $type(\;)$ is well-defined, it follows that $last(\pi)=last(\pi^{\prime})$ . We show that the pair $(s,v_{\pi})$ satisfies (Atom) (i.e. the one which also ranges over dependence atoms [6]) and is closed under the (Back) & (Forth) clauses (without the dependence-closedness condition).

(Atom) Observe that the chain of equivalences:

s\models_{\mathbb{M}_{cut}^{+}}P\mathbf{x}\quad\textrm{iff}\quad v_{\pi^{\prime}}\models_{\mathbb{M}_{cut}^{+}}P\mathbf{x}\quad\textrm{iff}\quad P\mathbf{x}\in last(\pi^{\prime})=last(\pi)\quad\textrm{iff}\quad v_{\pi}\models_{\mathbb{M}}P\mathbf{x}

holds for every $P\in\tau^{+}$ (i.e. including the relations $R^{X,y}$ !) by the fact that $f$ is an isomorphism with $fs=v_{\pi^{\prime}}$ and the way we have specified the interpretation $I(P)$ on both models. Invoking the encoding lemma, this implies that $\mathbb{M}_{cut}^{+},s\models D_{X}y$ iff $\mathbb{M},v_{\pi}\models D_{X}y$ .

(Forth) Let $t\in A_{cut}^{+}$ be some assignment in the Herwig extension, and let $V^{s,t}$ be the maximal set of variables on which $s$ and $t$ agree. By the Type Lemma $type(s)\sim_{V^{s,t}}type(t)$ . But $last(\pi)=last(\pi^{\prime})=type(s)$ , so it follows that $\pi^{+}:=(\pi,V^{s,t},type(t))$ is a good path. Clearly $v_{\pi^{+}}\in A$ with $v_{\pi}=_{V^{s,t}}v_{\pi^{+}}$ , and lastly $(t,v_{\pi^{+}})\in Z$ as $type(t)=last(\pi^{+})$ .

(Back) Let $v_{\pi^{\prime\prime}}\in A$ , with $V^{\pi,\pi^{\prime\prime}}=\{v\in V\;|\;v_{\pi}=_{v}v_{\pi^{\prime\prime}}\}$ the maximal set ot variables on which $v_{\pi},v_{\pi^{\prime\prime}}$ agree. By a now familiar argument involving the Truth lemma and Locality (i.e. the analogue of the Type Lemma for $\mathbb{M}$ ), it follows that $last(\pi)\sim_{V^{s,t}}last(\pi^{\prime\prime})$ . As $type(s)=last(\pi^{\prime})=last(\pi)$ , we see that $\pi^{\prime}_{+}:=(\pi^{\prime},V^{\pi,\pi^{\prime\prime}},last(\pi^{\prime\prime}))$ is a good path. Moreover, we know that $lh(\pi^{\prime})\leq 2$ which implies that $lh(\pi^{\prime}_{+})=lh(\pi^{\prime})+1\leq 2+1=3$ and so $v_{\pi^{\prime}_{+}}\in A_{cut}$ is in the cut-off model. Clearly $v_{\pi^{\prime}}=_{V^{\pi,\pi^{\prime\prime}}}v_{\pi^{\prime}_{+}}$ . Set $t:=f^{-1}v_{\pi^{\prime}_{+}}$ , then $s=_{V^{s,t}}t$ as $fs=v_{\pi^{\prime}}$ and moreover $(t,v_{\pi^{\prime\prime}})\in Z$ since $type(t)=last(\pi^{\prime+})=last(\pi^{\prime\prime})$ . ∎

Corollary 4.1.

Bounded Model Property
Every satisfiable $\varphi$ in LFD has a finite model whose size is bounded by a computable function of $\varphi$ . ¹⁰¹⁰10Any formula $\varphi$ determines a unique smallest finite vocabulary $(V,\tau)$ such that $Cl(\varphi)$ belongs to $LFD[V,\tau]$ ; the computable function takes as input $|V|$ , the maximal arity $r$ of relations in $\tau$ , and the number of distinct $\Phi$ -types $m$ .

Proof.

Let $\varphi$ be a satisfiable LFD-formula with closure $\Phi$ in the language $(V,\tau)$ and $|V|=k$ . By the tree model property, there is a $k$ -tree $M$ and a team $A$ such that $\mathbb{M}=(M,A)$ is a dependence model satisfying $\varphi$ at the root assignment. We cut this tree at length 3 and obtain the cut-off model whose size is upper bounded by $k(b+b^{2}+b^{3})$ , where $b\in\mathbb{N}$ is the branching degree of the $k$ -tree $\mathbb{M}$ . Note that $b$ itself has $m\times 2^{k}$ as upper bound, where $m:=|\{\Delta\subseteq\Phi\;|\;\Delta\;\textrm{is a}\;\Phi\textrm{-type}\}|$ and $\Phi=Cl(\varphi)$ . It follows that the size of the cut-off model is already exponential in the size of the variables $|V|$ .

Now construct the Herwig extension $\mathbb{M}_{cut}^{+}=(M_{cut}^{+},A_{cut}^{+})$ as above. Using the bound given in [8], we get that $|M_{cut}^{+}|\leq itexp(2r-1,p(|M_{cut}|)$ is upper bounded by an iterated exponential of a polynomial function $p$ of degree $r$ of $|M_{cut}|$ , where $r$ is the maximal arity of predicates in $\tau$ . By theorem 4.2, $\mathbb{M}$ and $\mathbb{M}_{cut}^{+}$ are-bisimilar. As dependence bisimulations are always total, there is some assignment $s\in A_{cut}^{+}$ with $(s,v_{\langle\Sigma_{0}\rangle})\in Z$ . By the invariance result above (proposition 3.1), it follows that $\mathbb{M}_{cut}^{+},s\models\varphi$ . ∎

5 Conclusion

We have introduced dependence bisimulations and have shown that this notion characterizes LFD as a fragment of FOL. Furthermore, we have shown that LFD has the finite (or bounded) model property, by a new application of Herwig’s theorem and a tree-model property established in [3]. The same strategy can be used to carry out a direct proof of the FMP through the equivalent modal semantics.¹¹¹¹11The proof of this can be found in an extended version of this paper (arXiv:2107.06042). With minor adaptations, the proof goes through, though we need to appeal to a more general version of Herwig’s theorem (theorem 5 in [8]) to ensure that the Herwig extension is a tree in order to obtain a standard relational model from it. By reducing the maximal arity $r$ to $2$ , going through the modal semantics significantly lowers the upper bound on the size of the Herwig extension to being singly exponential in the size of the cut-off model.

While LFD only adds local dependence atoms $D_{X}y$ to CRS, extensions of CRS with other local versions of atomic dependency properties have been considered in [6].¹²¹²12We will consider only the logics defined in [6] that are closed under negation, i.e. those $L[\Omega]$ for which $\Omega$ is closed under negation. The authors show that LFD extended with either equality or inclusion is undecidable and that the extension of CRS with both inclusion and equality is contained in GF. CRS with independence atoms was shown undecidable in [3], resulting in a complete characterization of the satisfiability problems of such logics. The same paper also studies the model-checking problem for such logics, and shows it to be PTIME-complete in restriction to finitely many variables. However, this tight bound is only obtained on the assumption that the local atoms considered (i.e. inclusion, dependence, independence and equality) are all efficiently checkable.

One open problem is to determine the computational complexity of the satisfiability problem for LFD. It seems that, with a few adaptations, the satisfiability test for GF given in [7] can be used for the case of LFD. Indeed, the ’witnesses for satisfiability’ defined there closely resemble type models. A more conceptual challenge is connecting the qualitative notion of dependence studied by LFD to probabilistic, i.e. quantitative notions of correlation and dependence.

References

[1]
[2] Hajnal Andréka, István Németi & Johan van Benthem (1998): Modal Languages and Bounded Fragments of Predicate Logic. J. Philos. Log. 27(3), pp. 217–274, 10.1023/A:1004275029985.
[3] Alexandru Baltag & Johan van Benthem (2021): A Simple Logic of Functional Dependence. Journal of Philosophical Logic, 10.1007/s10992-020-09588-z.
[4] Alexandru Baltag & Sonja Smets (2020): Learning What Others Know. In Elvira Albert & Laura Kovacs, editors: LPAR23. LPAR-23: 23rd International Conference on Logic for Programming, Artificial Intelligence and Reasoning, EPiC Series in Computing 73, EasyChair, pp. 90–119, 10.29007/plm4. Available at https://easychair.org/publications/paper/V8Jp.
[5] Johan van Benthem (2001): Exploring Logical Dynamics. Studia Logica 67(1), pp. 111–114, 10.1023/A:1017389612557.
[6] Erich Grädel & Phil Pützstück (2021): Logics of Dependence and Independence: The Local Variants.
[7] Erich Grädel (1999): On the Restraining Power of Guards. Journal of Symbolic Logic 64(4), p. 1719–1742, 10.2307/2586808.
[8] B. Herwig (1998): Extending partial isomorphisms for the small index property of many $\omega$ -categorical structures. Israel Journal of Mathematics 107, pp. 93–123, 10.1007/BF02764005.