This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

\stackMath

Finite Model Property and Bisimulation for LFD

Raoul Koudijs ILLC
Amsterdam, The Netherlands [email protected]
Abstract

Recently, Baltag & van Benthem introduced a decidable logic of functional dependence (LFD) that extends the logic of Cylindrical Relativized Set Algebras (CRS) with atomic local dependence statements. Its semantics can be given in terms of generalised assignment models or their modal counterparts, hence the logic is both a first-order and a modal logic. We show that LFD has the finite model property (FMP) using Herwig’s theorem on extending partial isomorphisms, and prove a bisimulation invariance theorem characterizing LFD as a fragment of first-order logic.

1 Introduction

Recently, Baltag & van Benthem introduced a decidable logic of functional dependence (LFD) that extends the logic of Cylindrical Relativized Set Algebras (CRS) [2] with atomic dependence statements. The semantics is given in terms of dependence models111These are just the generalised assignment models known from [2][5]., which are pairs (M,A)(M,A) of a first-order structure MM together with a fixed set of variable assignments (or ’team’) AMVA\subseteq M^{V} on MM, where VV is some (possibly finite) ambient set of variables. Formulas are evaluated at individual assignments sAs\in A; in particular the dependence atoms get the following semantics: sDXys\models D_{X}y if for all tAt\in A, sX=tXs\restriction X=t\restriction X implies s(y)=t(y)s(y)=t(y). This is in contrast with logics based on team semantics, where dependence formulas are evaluated at teams and this team is dynamically changed over the course of evaluation. Whereas most logics based on team semantics are undecidable, non-classical and have expressive going beyond FOL, LFD is decidable with a classical semantics and can be considered a fragment of FOL.

Many interesting notions of dependence (such as lineair dependence in vector spaces, temporal dependence in dynamical systems and strategic interaction in a multi-player game) can be formalized in LFD [3]. Moreover, LFD invites a natural epistemic interpretation where (sets of) variables may represent (groups of) agents, (joint) questions or objects.[3] The dependence modalities then capture distributed knowledge or the interrogative modality, while the dependence atoms capture epistemic superiority or inquisitive implication (or other ’mixed’ notions). More spectacularly, [4] introduces a complete and decidable dynamic-epistemic logic based on LFD with so-called ’reading events’ as well as a notion of ’common distributed knowledge’ that combines features of common knowledge and distributed knowledge.

Dependence models are closely related to relational databases: assignments are rows in the table and each variable represents a column, or attribute. Here is a simple numerical example of a dependence model, viewed as a database:

x y z
1 0 1
1 0 0
0 1 1
2 0 2

In this table we see that e.g. yy locally depends on xx in the first row, because the second row, which agrees on xx with the first, also agrees on yy with it (and no other rows agree on xx with the first row). In fact, this dependence holds at all rows, in which case we say that yy globally depends on xx. Conversely, xx does not depend globally on yy because it does not locally depend on yy at the first row: both 11 and 22 occur as xx-values of rows that share the current yy-value 0. Finally, because the fourth row is the only row with zz-value 22, all other variables locally depend on zz there.

The foregoing example witnesses the close connection between LFD and the study of dependence in databases, and indeed the Projection and Transitivity axioms of LFD recapture Armstrong’s Axioms for functional dependence [3]. Deeper connections with database theory as well as team semantics might arise by introducing dynamics on the level of teams, generalizing the semantics to dependence universes, i.e. families of dependence models [3]. In particular, dependence models

The decidability proof in [3] uses completeness of LFD w.r.t a purely syntactic ’type semantics’ resembling the ’quasi-models’ studied in connection with the Guarded Fragment [5][2]. The question whether LFD has the finite model property (FMP) w.r.t. dependence models remained an open problem [3]. Our main result is that LFD has the FMP, by a new application of Herwig’s theorem on extending partial isomorphisms. Moreover, we define dependence bisimulations and show that LFD can be characterized as the fragment of FOL that is invariant under this notion. Independently, another notion of bisimulation for LFD along more standard lines has been proposed in [6]. We show that these notions are equivalent, but that dependence bisimulations suggest a more efficient procedure for checking bisimilarity.

2 Preliminaries

We first introduce the language LFD, dependence models and type models. A pair (V,τ)(V,\tau), where VV is set of variables and τ\tau is a relational language is called a vocabulary. When both VV and τ\tau are finite, we say that (V,τ)(V,\tau) is a finite vocabulary. We write FOL[V,τ]FOL[V,\tau] for the set of first-order formulas with variables in VV (both free and bound) and predicates in τ\tau, and similarly for LFD[V,τ]LFD[V,\tau]. We assume that each vocabulary becomes equipped with an arity map ar:τar:\tau\to\mathbb{N}.

Definition 2.1.

(Syntax) The language LFD[V,τ]LFD[V,\tau] is recursively defined by:

φ::=P𝐱|¬φ|φφ|𝔻Xφ|DXy\varphi::=P\mathbf{x}\;|\;\neg\varphi\;|\;\varphi\wedge\varphi\;|\;\mathbb{D}_{X}\varphi\;|\;D_{X}y

where XVX\subseteq V is a finite set of variables, yVy\in V an individual variable, PτP\in\tau a predicate symbol and 𝐱=(x1,,xn)Var(P)\mathbf{x}=(x_{1},...,x_{n})\in V^{ar(P)} a finite string of variables.222LFD as a modal language is generated by the same definition, but where DXy(),P𝐱()D_{X}y(\;),P\mathbf{x}(\;) become unary predicates in τ\tau. Fixing notation, for any YVY\subseteq V, we write sDXYs\models D_{X}Y if sDXys\models D_{X}y holds for all yYy\in Y. We also skip the set brackets for singletons, writing DxYD_{x}Y for D{x}YD_{\{x\}}Y , and DxyD_{x}y for D{x}{y}D_{\{x\}}\{y\}. For every φLFD\varphi\in LFD, we define its free variables by:

  • Free(Px1xn)={x1,,xn}Free(Px_{1}...x_{n})=\{x_{1},...,x_{n}\}

  • Free(DXy)=Free(𝔻Xφ)=XFree(D_{X}y)=Free(\mathbb{D}_{X}\varphi)=X

  • Free(¬φ)=Free(φ)Free(\neg\varphi)=Free(\varphi),  Free(φψ)=Free(φ)Free(ψ)Free(\varphi\wedge\psi)=Free(\varphi)\cup Free(\psi)

Moreover, we let VφV_{\varphi} denote the set of variables occurring in φ\varphi. This is in general a superset of the free of variables, i.e. VDXy=X{y}V_{D_{X}y}=X\cup\{y\}. Further, we say that τφ:={Pτ|Poccurs inφ}\tau_{\varphi}:=\{P\in\tau\;|\;P\;\textrm{occurs in}\;\varphi\}.

Definition 2.2.

(Dependence Models) A dependence model (for the vocabulary (V,τ)(V,\tau)) 𝕄\mathbb{M} is a pair 𝕄=(M,A)\mathbb{M}=(M,A) of a relational structure MM for τ\tau, together with a fixed team AOVA\subseteq O^{V}.333We use letters MM for first-order structures and blackboard bold letters 𝕄=(M,A)\mathbb{M}=(M,A) for dependence models. For each XVX\subseteq V, we define an agreement relation =X=_{X} on the team:

s=XtiffsX=tXs=_{X}t\;\textrm{iff}\;s\restriction X=t\restriction X

Note that VV may be finite. We call a dependence model distinguished if all the assignments are injective.

Definition 2.3.

(Semantics) Truth of a formula φ\varphi in a dependence model 𝕄=(M,A)\mathbb{M}=(M,A) at an assignment sAs\in A is defined by the following clauses (the Boolean cases are defined as usual:

sP𝐱iffs(𝐱)I𝕄(P)\displaystyle s\models P\mathbf{x}\;\textrm{iff}\;s(\mathbf{x})\in I^{\mathbb{M}}(P)
s𝔻Xφifftφholds for alltAwiths=Xt\displaystyle s\models\mathbb{D}_{X}\varphi\;\textrm{iff}\;t\models\varphi\;\textrm{holds for all}\;t\in A\;\textrm{with}\;s=_{X}t
sDXyiffs=Xtimpliess=ytfor alltA.\displaystyle s\models D_{X}y\;\textrm{iff}\;s=_{X}t\;\textrm{implies}\;s=_{y}t\;\textrm{for all}\;t\in A.

Where s(𝐱)s(\mathbf{x}) denotes the tuple (s(x1),,s(xm))(s(x_{1}),...,s(x_{m})) for 𝐱=(x1,,xm)\mathbf{x}=(x_{1},...,x_{m}). Clearly, for every dependence model (M,A)(M,A) and assignments s,tAs,t\in A, there is a unique set Vs,t:={vV|s=vt}V^{s,t}:=\{v\in V\;|\;s=_{v}t\} that is the maximal set of variables on which s,ts,t agree. An important feature of the semantics is LFD satisfies Locality.

Locality:Ifs=XtandFree(φ)X,thensφifftφ\textrm{{Locality}}:\;\;\textrm{If}\;\;s=_{X}t\;\textrm{and}\;Free(\varphi)\subseteq X,\;\;\textrm{then}\;\;s\models\varphi\;\textrm{iff}\;t\models\varphi

Next to dependence models, LFD is weakly complete w.r.t. a non-standard type semantics.[3] In other words, only LFD over finite vocabularies is complete for this semantics. Type models were used in [3] as a technical auxiliary to prove completeness and decidability. Types are defined relative to closures. We obtain the closure Ψ:=Cl(ψ)\Psi:=Cl(\psi) of a formula ψ\psi by adding to {ψ}\{\psi\} all formulas DXyD_{X}y for X{y}VψX\cup\{y\}\subseteq V_{\psi} and closing the resulting set under subformulas and single negation. 444For every non-negated formula φ\varphi (i.e. a formula whose principal connective is not ¬\neg) we add ¬φ\neg\varphi to the closure, and for negated formulas we we do nothing. The resulting closure set will not contain any formulas with double negations.

Definition 2.4.

(Ψ\Psi-Types) Let Ψ\Psi be a closure in LFD[V,τ]LFD[V,\tau]. A subset ΣΨ\Sigma\subseteq\Psi is a Ψ\Psi-type if it satisfies the following conditions (where all formulas mentioned run over Ψ\Psi only):

  • (a)

    ¬ψΣ\neg\psi\in\Sigma iff ψΣ\psi\not\in\Sigma

  • (b)

    (ψχ)Σ(\psi\wedge\chi)\in\Sigma iff ψΣ\psi\in\Sigma and χΣ\chi\in\Sigma

  • (c)

    if 𝔻XψΣ\mathbb{D}_{X}\psi\in\Sigma, then ψΣ\psi\in\Sigma

  • (d)

    DXxΣD_{X}x\in\Sigma for all xXVx\in X\subseteq V

  • (e)

    DXY,DYZΣD_{X}Y,D_{Y}Z\in\Sigma implies DXZΣD_{X}Z\in\Sigma

For XVψX\subseteq V_{\psi}, we define a relation X\sim_{X} on types Σ,ΔΨ\Sigma,\Delta\subseteq\Psi:

ΣXΔiff\displaystyle\Sigma\sim_{X}\Delta\qquad\textrm{iff}\qquad {ϕΣ|Free(ϕ)DXΣ}={ϕΔ|Free(ϕ)DXΣ}\displaystyle\{\phi\in\Sigma\;|\;Free(\phi)\subseteq D^{\Sigma}_{X}\}=\{\phi\in\Delta\;|\;Free(\phi)\subseteq D^{\Sigma}_{X}\}

where DXΣ={yVφ|DXyΣ}D^{\Sigma}_{X}=\{y\in V_{\varphi}\;|\;D_{X}y\in\Sigma\} is the dependence-closure of XX w.r.t and Σ\Sigma. Observe that ΣXΔ\Sigma\sim_{X}\Delta implies DXΣ=DXΔD^{\Sigma}_{X}=D^{\Delta}_{X} as Free(DXy)=XFree(D_{X}y)=X.

Definition 2.5.

(Type Models) A type model (for Ψ\Psi) is a family of Ψ\Psi-types satisfying:

  • if ¬𝔻X¬ψΣ𝔐\neg\mathbb{D}_{X}\neg\psi\in\Sigma\in\mathfrak{M}, then there exists a Δ𝔐\Delta\in\mathfrak{M}, such that ψΔ\psi\in\Delta and ΣXΔ\Sigma\sim_{X}\Delta.

  • ΣΔ\Sigma\sim_{\emptyset}\Delta holds for all Σ,Δ𝔐\Sigma,\Delta\in\mathfrak{M}.

Type models are always finite, as there are only finitely many Ψ\Psi-types for a given closure Ψ\Psi. This proves the decidability as LFD is (weakly) complete w.r.t. type models [3]. The semantic conditions for type models are given by membership:

ΔψiffψΔ\Delta\models\psi\qquad\textrm{iff}\qquad\psi\in\Delta

2.1 Tree Model Property

Every satisfiable LFD formula can be satisfied on a certain tree-like dependence model. This fact follows from the fact that dependence models and type models provide equivalent semantics for LFD, i.e. each type model can be represented as a dependence model and vice versa [3]. The interesting direction is representing arbitrary type models as dependence models by means of an unravelling construction in the sense of modal logic. To say what we mean by ’tree-like’ we need the graph-theoretic notion of a kk-tree (the definition is taken from [7]). Say that an rr-tuple of objects 𝐚\mathbf{a} from a τ\tau-structure MM is live in MM, if there is some rr-ary PτP\in\tau such that MP𝐚M\models P\mathbf{a}.

Definition 2.6.

(kk-Tree) A τ\tau-structure MM is a kk-tree if there exists a tree (i.e. an acyclic, connected graph) T=(V,E)T=(V,E) and a function F:V{MM||M|k}F:V\to\{M^{\prime}\subseteq M\;|\;|M^{\prime}|\leq k\}, assigning to every node vVv\in V of TT a set F(v)F(v) of at most kk elements of MM, such that the following two conditions hold.

  • (i)

    For every live tuple 𝐚=(a1,,ar)\mathbf{a}=(a_{1},...,a_{r}) from MM, there is some node vv such that {a1,,ar}F(v)\{a_{1},...,a_{r}\}\subseteq F(v).

  • (ii)

    For every element aa of MM, the set of nodes {vV|aF(v)}\{v\in V\;|\;a\in F(v)\} is connected (and hence induces a subtree of TT).

MM is of finite branching degree if TT is, that is if the set of neighbours of every node in TT is finite.

Theorem 2.1.

Representation of Type Models [3]
Let 𝔐\mathfrak{M} be a type model for Ψ\Psi. There exists a dependence model 𝕄=(M,A)\mathbb{M}=(M,A) with 𝔐={typeΨ(s)|sA}\mathfrak{M}=\{type_{\Psi}(s)\;|\;s\in A\}.

Proof.

Let m=|𝔐|m=|\mathfrak{M}| and k=|V|k=|V| where VV is the set of variables occurring in formulas in Ψ\Psi (i.e. V={Vψ|ψΨ}V=\bigcup\{V_{\psi}\;|\;\psi\in\Psi\}). Fix a type Σ0𝔐\Sigma_{0}\in\mathfrak{M}. A good path is a sequence π=Σ0,X1,,Xn,Σn\pi=\langle\Sigma_{0},X_{1},...,X_{n},\Sigma_{n}\rangle with n>0n>0 such that for each ini\leq n (a) Σi𝔐,XiV\Sigma_{i}\in\mathfrak{M},X_{i}\subseteq V and (b) Σi1XiΣi\Sigma_{i-1}\sim_{X_{i}}\Sigma_{i}. Write last(π)=Σnlast(\pi)=\Sigma_{n} for the last element of π\pi, and lh(π)=n+1lh(\pi)=n+1 for the length of π\pi (not counting the variable sets). For each good path π\pi, we define the path assignment vπv_{\pi}, assigning objects of the form (π,v)(\pi,v) to variables vVv\in V:

vπ(v)=(π,v)ifπhas length 1, i.e.π=Σ0is the root of our tree.\displaystyle v_{\pi}(v)=(\pi,v)\;\textrm{if}\;\pi\;\textrm{has length 1, i.e.}\;\pi=\langle\Sigma_{0}\rangle\;\textrm{is the root of our tree}. (1)
vπ(v)=vπ(v)ifπ=(π,X,Σ)withvDXlast(π)\displaystyle v_{\pi}(v)=v_{\pi^{\prime}}(v)\;\textrm{if}\;\pi=(\pi^{\prime},X,\Sigma)\;\textrm{with}\;v\in D^{last(\pi^{\prime})}_{X} (2)
vπ(v)=(π,v)ifπ=(π,X,Σ)withvDXlast(π)\displaystyle v_{\pi}(v)=(\pi,v)\;\textrm{if}\;\pi=(\pi^{\prime},X,\Sigma)\;\textrm{with}v\not\in D^{last(\pi^{\prime})}_{X} (3)

So new objects are created whenever the value for a variable is not locally determined by the predecessor path. We obtain a team A:={vπ|πa good path}A:=\{v_{\pi}\;|\;\pi\;\textrm{a good path}\} on the structure MM with domain vπAvπ[V]\bigcup_{v_{\pi}\in A}v_{\pi}[V] and where an rr-ary PτP\in\tau holds of an rr-tuple ((π1,x1),,(πr,xr))((\pi_{1},x_{1}),...,(\pi_{r},x_{r})) iff all paths πi\pi_{i} are linearly ordered by initial segment and the formula P𝐱last(πj)P\mathbf{x}\in last(\pi_{j}), where πj\pi_{j} is the longest path amongst {π1,,πr}\{\pi_{1},...,\pi_{r}\}.

This yields a distinguished dependence model 𝕄=(M,A)\mathbb{M}=(M,A) whose objects (π,x)(\pi,x) are typed by a unique variable xx. The set of all good paths, ordered by initial segment, forms a tree TT whose branching degree is bounded by 2k×m2^{k}\times m. Together with the map πvπ[V]\pi\mapsto v_{\pi}[V], this shows that MM is a kk-tree of finite branching degree. Finally, we have the following crucial truth lemma [3]:

Lemma 2.1.

Truth Lemma
For all formulas ψΨ\psi\in\Psi and good paths π:\pi:\;\; 𝕄,vπψ\mathbb{M},v_{\pi}\models\psi iff ψlast(π)\psi\in last(\pi)

This lemma implies that 𝔐={typeΨ(s)|sA}\mathfrak{M}=\{type_{\Psi}(s)\;|\;s\in A\}: because every type Δ𝔐\Delta\in\mathfrak{M} occurs as the last(π)last(\pi) for some unique good path of length 2 already, namely πΔ:=(Σ0,,Δ)\pi_{\Delta}:=(\Sigma_{0},\emptyset,\Delta). Moreover, note that we are free to choose the initial fixed type Σ0\Sigma_{0} from 𝔐\mathfrak{M} in the definition of good path, and hence, by the truth lemma, we can choose what type to be satisfied at the root. ∎

Corollary 2.1.

Tree Model Property
If ψLFD\psi\in LFD is satisfiable and |Vψ|=k|V_{\psi}|=k, there is a dependence model 𝕄=(M,A)\mathbb{M}=(M,A), where MM is kk-tree of finite branching degree, satisfying φ\varphi at the root assignment.

Definition 2.7.

(First-Order Translation) Although interpreted over a generalised semantics, LFD in finitely many variables can be encoded back into FOL over standard structures. So let VV be a finite set of variables with enumeration 𝐯=(v1,,vn)\mathbf{v}=(v_{1},...,v_{n}). We double the amount of variables, creating a set of copied variables VV^{\prime} from the variables in VV. We ensure that the relevant assignments agree on their values for variables vv and their copies vv^{\prime} by the conjunction 𝐯=𝐯\mathbf{v}=\mathbf{v^{\prime}}.555This additional condition (it was not in the original formulation in [3]) is essential for encoding the semantics of the dependence atoms into FOL, which treats as variables as completely independent otherwise. Further, we introduce a new nn-ary predicate AA such that A𝐯A\mathbf{v} encodes the fact that the tuples of values assigned to 𝐯\mathbf{v} by the current assignment is the range of some admissible assignment from the team (this is a tuple because VV is finite). The first-order translation tr:LFD[V,τ]FOL[VV,τ]tr:LFD[V,\tau]\to FOL[V\cup V^{\prime},\tau] is defined by [3]:

  • tr(P𝐱)=P𝐱tr(P\mathbf{x})=P\mathbf{x} and trtr commutes with Boolean connectives

  • tr(𝔻Xψ)=𝐳(A𝐯tr(ψ))tr(\mathbb{D}_{X}\psi)=\forall\mathbf{z}(A\mathbf{v}\to tr(\psi)), where 𝐯\mathbf{v} is the enumeration of all the variables in VV and 𝐳\mathbf{z} is the enumeration of all the variables in VXV-X.

  • tr(DXy):=𝐳𝐳((A𝐯A𝐯[𝐳/𝐳])y=y)tr(D_{X}y):=\forall\mathbf{z}\forall\mathbf{z^{\prime}}((A\mathbf{v}\wedge A\mathbf{v}[\mathbf{z^{\prime}}/\mathbf{z}])\to y=y^{\prime}), where 𝐯,𝐳\mathbf{v},\mathbf{z} are as in part (d), 𝐳\mathbf{z^{\prime}} and yy^{\prime} are the corresponding fresh VV^{\prime}-copies of 𝐳\mathbf{z} and yy respectively.666Furthermore, A𝐯[𝐳/𝐳]A\mathbf{v}[\mathbf{z^{\prime}}/\mathbf{z}] denotes the formula that is obtained by replacing the variables 𝐳\mathbf{z} by 𝐳\mathbf{z^{\prime}} in the formula A𝐯A\mathbf{v}.

There is a one-to-one correspondence between dependence models and structures in this extended language. If 𝕄=(M,A)\mathbb{M}=(M,A) is a dependence model, T(𝕄)T(\mathbb{M}) is the expansion of MM with the interpretation I(A):={s(𝐯)|sA}I(A):=\{s(\mathbf{v})\;|\;s\in A\}. Conversely, given any τ{A}\tau\cup\{A\}-structure MM^{\prime} we obtain a team A:{s:VM|s(𝐯)IM(A)}A:\{s:V\to M^{\prime}\;|\;s(\mathbf{v})\in I^{M^{\prime}}(A)\} which together with a reduct of MM^{\prime} makes for the corresponding dependence model. We have the equivalence:

𝕄,sφiffT(𝕄),s+𝐯=𝐯tr(φ)\mathbb{M},s\models\varphi\qquad\textrm{iff}\qquad T(\mathbb{M}),s^{+}\models\mathbf{v}=\mathbf{v^{\prime}}\to tr(\varphi)

for every sAs\in A and all assignments s+MVars^{+}\in M^{\mathrm{Var}} extending ss. This translation easily adapts to other local dependence atoms proposed in [6], e.g. tr(x=y):=x=ytr(x=y):=\;x=y and tr(𝐱𝐲):=𝐯(A𝐯i|𝐱|xi=yi)tr(\mathbf{x}\in\mathbf{y}):=\;\exists\mathbf{v^{\prime}}(A\mathbf{v^{\prime}}\wedge\bigwedge_{i\leq|\mathbf{x}|}x_{i}=y^{\prime}_{i}).

3 Characterization

The original paper [3] left finding a bisimulation-invariance theorem characterizing LFD as an open problem. precisely which formulas in FOL[VV,τ{A}]FOL[V\cup V^{\prime},\tau\cup\{A\}] are equivalent to the trtr-translation of an LFD-formula over standard structures.777There is also a modal translation of LFD into FOL that extends the well-known standard translation of modal logic into the 2-variable fragment of FOL. A similar characterization theorem can be proved via this translation and the relational semantics for LFD, as our notion of bisimulation as well as the one proposed in [6] are naturally formulated on dependence models as well as their modal counterparts. The following notion of dependence bisimulation exactly characterizes LFDLFD as the largest fragment of FOLFOL invariant under this notion. Say that a set of variables XX is dependence-closed at ss^{\prime} if DXs:={yV|;sDXy}=XD^{s^{\prime}}_{X}:=\{y\in V\;|\;;s^{\prime}\models D_{X}y\}=X, or equivalently if sDXys^{\prime}\models D_{X}y implies yXy\in X.

Definition 3.1.

(Dependence Bisimulation) Let 𝕄,𝕄\mathbb{M},\mathbb{M^{\prime}} be dependence models. We say that a non-empty relation ZA×AZ\subseteq A\times A^{\prime} is a dependence-bisimulation if for every (s,s)Z(s,s^{\prime})\in Z:

  • (Atom)

    sP𝐱s\models P\mathbf{x} iff sP𝐱s^{\prime}\models P\mathbf{x}

  • (Forth)

    For every tAt\in A, (i) the set Vs,tV^{s,t} is dependence-closed at ss^{\prime} and
    there is some tAt^{\prime}\in A such that (ii) s=Vs,tts^{\prime}=_{V^{s,t}}t^{\prime} and (iii) (t,t)Z(t,t^{\prime})\in Z

  • (Back)

    symmetric to the (Forth) clause

Dependence bisimulations are always total; every state is related to another by the bisimulation.

Proposition 3.1.

LFD-formulas are invariant under dependence bisimulations.

Proof.

Let 𝕄,𝕄\mathbb{M},\mathbb{M^{\prime}} be dependence models and ZA×AZ\subseteq A\times A^{\prime} a dependence bisimulation with (s,s)Z(s,s^{\prime})\in Z and φ\varphi\in LFD. We show that sφs\models\varphi iff sφs^{\prime}\models\varphi by induction on the complexity of φ\varphi; the atomic and Boolean cases are trivial. For the other cases, we show only one direction.

(𝔻Xψ)\mathbb{D}_{X}\psi)\quad Suppose that s𝔻Xψs\models\mathbb{D}_{X}\psi and let s=Xts^{\prime}=_{X}t^{\prime}, i.e. XVs,tX\subseteq V^{s^{\prime},t^{\prime}}, for some tAt^{\prime}\in A^{\prime}. By the (Back)-clause there is some tAt\in A such that s=Xts=_{X}t and (t,t)Z(t,t^{\prime})\in Z. Hence tψt\models\psi and so tψt^{\prime}\models\psi by (IH)(IH).

(DXy)D_{X}y)\quad Suppose that sDXys\models D_{X}y and let s=Xts^{\prime}=_{X}t^{\prime} for some tAt^{\prime}\in A^{\prime}. We want to show that s=yts^{\prime}=_{y}t^{\prime}, i.e. yVs,ty\in V^{s^{\prime},t^{\prime}}. By the (Back)-clause there is some tAt\in A with (t,t)Z(t,t^{\prime})\in Z, s=Vs,tts=_{V^{s^{\prime},t^{\prime}}}t and Vs,tV^{s^{\prime},t^{\prime}} is dependence-closed at ss. As XVs,tX\subseteq V^{s^{\prime},t^{\prime}}, by monotonicity of dependence we have sDVs,tys\models D_{V^{s^{\prime},t^{\prime}}}y. This shows that yVs,ty\in V^{s^{\prime},t^{\prime}} as Vs,tV^{{}^{\prime}s,t^{\prime}} is dependence-closed at ss. ∎

Dependence bisimulations in fact characterize LFD as a fragment of FOL. This can be shown by formulating an analogue of dependence bisimulations for structures of the form T(𝕄)T(\mathbb{M}), and showing that on ω\omega-saturated structures of this form, LFD-equivalence implies dependence-bisimilarity.

Independently, another notion of bisimulation characterizing LFD has been proposed in [6] that treats dependence atoms like ordinary relational atoms. That is, instead of the dependence-closed condition they simply require that ”sDXys\models D_{X}y iff sDXys^{\prime}\models D_{X}y” holds for all X{y}VX\cup\{y\}\subseteq V. It follows that proposition 3.1 shows that dependence bisimulations are also bisimulations in their sense. Conversely, ”sDXys\models D_{X}y iff sDXys^{\prime}\models D_{X}y” clearly implies the dependence-closed condition, hence the two notions are equivalent. It follows that the proof given in [6] also shows that LFD is the dependence bisimulation-invariant fragment of FOL.

Theorem 3.1.

Van Benthem Characterization
LFDLFD is the largest fragment of FOLFOL that is invariant under dependence bisimulations.

Dependence bisimulations suggest a more efficient way to implement a bisimilarity-checking algorithm for LFD compared to the definition in [6]. For what proposition 3.1 shows is that, given (M,A),(M,A)(M,A),(M^{\prime},A^{\prime}) with sA,sAs\in A,s^{\prime}\in A^{\prime}, it actually suffices to check that ”sDXys\models D_{X}y iff sDXys^{\prime}\models D_{X}y for all yVy\in V” holds for all X{Vs,tV|tA}{Vs,tV|tA}X\in\{V^{s,t}\subseteq V\;|\;t\in A\}\cup\{V^{s^{\prime},t^{\prime}}\subseteq V\;|\;t^{\prime}\in A^{\prime}\} in order to conclude that ”sDXys\models D_{X}y iff sDXys^{\prime}\models D_{X}y for all yVy\in V” holds for all XVX\subseteq V. This could be used to avoid an exponential blow-up in |V||V|.

Dependence bisimulations generalise naturally to extensions of LFD. For instance, we can extend LFDLFD with the equality relation ==, yielding the logic LFD=LFD^{=} which was shown to be a conservative reduction class of FOL and hence undecidable in [6]. Dependence bisimulations with an extended (Atom) clause that also ranges over equality can be shown to characterize LFD=LFD^{=} as a fragment to FOL. Interestingly, over full dependence models (i.e. those (M,A)(M,A) with A=MVA=M^{V}, which are standard first-order structures repackaged as dependence models), dependence bisimulations (for LFD over a finite vocabulary (V,τ)(V,\tau) with |V|=k|V|=k) coincides with kk-potential isomorphism, which characterizes first-order logic in kk variables.

4 Finite Model Property

We show that LFD has the FMP w.r.t the intended dependence model semantics, by an application of Herwig’s theorem similar to the one in [7]. Fix a satisfiable LFD-formula φ\varphi, and let Φ:=Cl({φ})\Phi:=Cl(\{\varphi\}). We let (V,τ):=(Vφ,τφ)(V,\tau):=(V_{\varphi},\tau_{\varphi}) be the smallest vocabulary containing φ\varphi and hence Φ\Phi. Note that (V,τ)(V,\tau) is a finite vocabulary, so let |V|=k|V|=k. We know that there is a tree-like dependence model 𝕄=(M,A)\mathbb{M}=(M,A), with associated tree TT of good paths, satisfying φ\varphi at the root assignment. Furthermore, the degree of TT is bounded by m×2km\times 2^{k}, where mm is the number of distinct Φ\Phi-types. Our strategy is as follows: we will cut the underlying kk-tree MM to a finite structure, encode the dependence atoms in a richer language and finally use Herwig’s theorem to generate out of this a finite dependence model that is bisimilar to the original tree-model. Define a sub-team of AA by:

Acut:={vπA|lh(π)3}A_{cut}:=\{v_{\pi}\in A\;|\;lh(\pi)\leq 3\}

and let McutM_{cut} be the submodel of MM induced by {vπ[V]M|vπAcut}\bigcup\{v_{\pi}[V]\subseteq M\;|\;v_{\pi}\in A_{cut}\}; we call 𝕄cut:=(Mcut,Acut)\mathbb{M}_{cut}:=(M_{cut},A_{cut}) the cut-off model. This is a finite model because the branching degree of TT is bounded and VV is finite. The truth lemma clearly no longer holds on this cut-off model, because some existential witnesses are missing for assignments of length 3.

We extend the language to include an |X||X|-ary relation RX,yR^{X,y} for each X{y}VX\cup\{y\}\subseteq V, and obtain the (still finite) richer language τ+τ\tau^{+}\supseteq\tau. We will use these relations to encode the semantics of the dependence atoms. We expand the structure McutM_{cut} underlying the cut-off model to a τ+\tau^{+} structure by putting:

IMcut(RX,y):={vπ(𝐱)|DXylast(π)}I^{M_{cut}}(R^{X,y}):=\{v_{\pi}(\mathbf{x})\;|\;D_{X}y\in last(\pi)\}

so that 𝕄cut,vπRX,y𝐱\mathbb{M}_{cut},v_{\pi}\models R^{X,y}\mathbf{x} iff DXylast(π)D_{X}y\in last(\pi). In the end, we want to show that Rx,y𝐱DXyR^{x,y}\mathbf{x}\leftrightarrow D_{X}y holds on the Herwig extension, so that we can recover an appropriate dependence model from it. To show this, we will need the following restricted version of this claim on the cut-off model:

Proposition 4.1.

For each vπAcutv_{\pi}\in A_{cut} of length lh(π)2:vπDXyRX,y𝐱lh(\pi)\leq 2:\quad v_{\pi}\models D_{X}y\to R^{X,y}\mathbf{x}.

Proof.

By contraposition, so suppose that vπ⊧̸RX,y𝐱v_{\pi}\not\models R^{X,y}\mathbf{x}. This means that DXylast(π)D_{X}y\not\in last(\pi), so for the good path π+:=(π,X,last(π))\pi^{+}:=(\pi,X,last(\pi)) (it is a good path as last(π)Xlast(π)last(\pi)\sim_{X}last(\pi) trivially holds) we have that vπ=Xvπ+v_{\pi}=_{X}v_{\pi^{+}} and vπyvπ+v_{\pi}\neq_{y}v_{\pi^{+}}, i.e. vπ⊧̸DXyv_{\pi}\not\models D_{X}y. ∎

Herwig’s theorem on extending partial isomorphism [8] is a result about first-order relational languages. It tells us that any finite structure with some set of partial isomorphisms on it has a finite extension in which all these partial isomorphisms extend to automorphisms. This theorem has already been used to show the FMP of the Guarded Fragment (GF) [7].

Theorem 4.1.

Herwig
Let σ\sigma be a finite relational language, CC a finite σ\sigma-structure and {p1,,pk}\{p_{1},...,p_{k}\} a (finite) set of partial isomorphisms on CC. Then there exists a finite extension C+C^{+} of CC that satisfies the following conditions:

  • (i)

    Every pip_{i} extends to a unique automorphism pi^\widehat{p_{i}} of C+C^{+}. This yields a subgroup p1^,,pk^\langle\widehat{p_{1}},...,\widehat{p_{k}}\rangle of the automorphism group of C+C^{+}.

  • (ii)

    If a tuple 𝐚=(a1,.,ar)\mathbf{a}=(a_{1},....,a_{r}) from C+C^{+} is live or r=1r=1, then there exists an automorphism fp1^,,pk^f\in\langle\widehat{p_{1}},...,\widehat{p_{k}}\rangle such that for each iri\leq r, f(ai)Cf(a_{i})\in C.

  • (iii)

    If fp1^,,pk^\exists f\in\langle\widehat{p_{1}},...,\widehat{p_{k}}\rangle and a,bCa,b\in C such that f(a)=bf(a)=b, then either f=idf=id or there is a unique pp1,,pkp\in\langle p_{1},...,p_{k}\rangle such that p^=f\widehat{p}=f and p(a)=bp(a)=b.

where p1,,pk\langle p_{1},...,p_{k}\rangle is the collection of all partial isomorphisms that can be obtained by composing the pip_{i} with their inverses. Note that p1,,pk\langle p_{1},...,p_{k}\rangle is strictly speaking not a group as it need not be the case that pp1p\circ p^{-1} is the identity on CC (in general, it is the identity on a subset of CC).

Condition (iii) is in need of further clarification. In words, it says that elements in the submodel CC are only mapped to each other by some ff1,,fnf\in\langle f_{1},...,f_{n}\rangle if this is forced given the choice of partial isomorphisms. Uniqueness of pp in this condition is ensured by the fact that the map ()^\widehat{(\;)} extends to a bijective map ()^:p1,,pkp1^,,pk^\widehat{(\;)}:\langle p_{1},...,p_{k}\rangle\to\langle\widehat{p_{1}},...,\widehat{p_{k}}\rangle that commutes with the operations ,()1\circ,(\;)^{-1} (and the identity idid). By condition (i), ()^\widehat{(\;)} is defined on the subset {p1,,pk}\{p_{1},...,p_{k}\}. Set p1^:=p^1\widehat{p^{-1}}:=\widehat{p}^{-1} and pp^:=p^p^\widehat{p\circ p^{\prime}}:=\widehat{p}\circ\widehat{p^{\prime}}; so commutation follows by definition. It immediately follows that the map is injective. For surjectivity, let fp1^,,pk^f\in\langle\widehat{p_{1}},...,\widehat{p_{k}}\rangle. By definition f=pi1^ϵ1pim^ϵmf=\widehat{p_{i_{1}}}^{\epsilon_{1}}\circ...\circ\widehat{p_{i_{m}}}^{\epsilon_{m}} for some {i1,,im}{1,,k}\{i_{1},...,i_{m}\}\subseteq\{1,...,k\} and ϵj{1,1}\epsilon_{j}\in\{-1,1\} for each jmj\leq m. Define p:=pi1ϵ1pimϵmp1,,pkp:=p_{i_{1}}^{\epsilon_{1}}\circ...\circ p_{i_{m}}^{\epsilon_{m}}\in\langle p_{1},...,p_{k}\rangle. Now observe:

p^=\savestack\tmpbox\stretchto\scaleto\scalerel[pi1ϵ1pimϵm] 0.5ex\stackon[1pt]pi1ϵ1pimϵm\tmpbox=pi1ϵ1^pimϵm^=pi1^ϵ1pim^ϵm=f\widehat{p}=\savestack{\tmpbox}{\stretchto{\scaleto{\scalerel*[\widthof{p_{i_{1}}^{\epsilon_{1}}\circ...\circ p_{i_{m}}^{\epsilon_{m}}}]{\kern-0.6pt\bigwedge\kern-0.6pt}{\rule[-505.89pt]{4.30554pt}{505.89pt}}}{}}{0.5ex}}\stackon[1pt]{p_{i_{1}}^{\epsilon_{1}}\circ...\circ p_{i_{m}}^{\epsilon_{m}}}{\tmpbox}=\widehat{p_{i_{1}}^{\epsilon_{1}}}\circ...\circ\widehat{p_{i_{m}}^{\epsilon_{m}}}=\widehat{p_{i_{1}}}^{\epsilon_{1}}\circ...\circ\widehat{p_{i_{m}}}^{\epsilon_{m}}=f

We proceed with specifying a choice of partial isomorphisms on the cut-off model. If π\pi is a good path of lh(π)=3lh(\pi)=3 and last(π)=Δlast(\pi)=\Delta, then there is a partial isomorphism pπ:vπ[Vφ]vπΔ[Vφ]p_{\pi}:v_{\pi}[V_{\varphi}]\to v_{\pi_{\Delta}}[V_{\varphi}] such that pπvπ=vπΔp_{\pi}\circ v_{\pi}=v_{\pi_{\Delta}}, where πΔ:=Σ0,,Δ\pi_{\Delta}:=\langle\Sigma_{0},\emptyset,\Delta\rangle so lh(πΔ)=2lh(\pi_{\Delta})=2. We pick the finite set of partial isomorphisms {pπ|πgood path oflh(π)=3}={p1,,pk}\{p_{\pi}\;|\;\pi\;\textrm{good path of}\;lh(\pi)=3\}=\{p_{1},...,p_{k}\}. The following proposition tells us what kind of partial isomorphisms are in p1,,pk\langle p_{1},...,p_{k}\rangle.

Lemma 4.1.

If pp1,,pkp\in\langle p_{1},...,p_{k}\rangle with pvπ=Xvπpv_{\pi}=_{X}v_{\pi^{\prime}}, then there are vρ,vρAcutv_{\rho},v_{\rho^{\prime}}\in A_{cut} with last(ρ)=last(ρ)last(\rho)=last(\rho^{\prime}) such that vρ=Xvπv_{\rho}=_{X}v_{\pi}, vρ=Xvπv_{\rho^{\prime}}=_{X}v_{\pi^{\prime}} and pvρ=vρpv_{\rho}=v_{\rho^{\prime}}.

Proof.

Let pp1,,pkp\in\langle p_{1},...,p_{k}\rangle such that pvπ=Xvπpv_{\pi}=_{X}v_{\pi^{\prime}}. By definition, p=pimϵmpi1ϵ1p=p_{i_{m}}^{\epsilon_{m}}\circ...\circ p_{i_{1}}^{\epsilon_{1}} for some {i1,,im}{1,,k}\{i_{1},...,i_{m}\}\subseteq\{1,...,k\} and ϵj{1,1}\epsilon_{j}\in\{-1,1\} for each 1jm1\leq j\leq m. Note that for each jmj\leq m we have that pij{p1,,pk}={pπ|πa good path oflh(π)=3}p_{i_{j}}\in\{p_{1},...,p_{k}\}=\{p_{\pi}\;|\;\pi\;\textrm{a good path of}\;lh(\pi)=3\}, so pijϵjvπj1=vπjp_{i_{j}}^{\epsilon_{j}}\circ v_{\pi_{j-1}}=v_{\pi_{j}} 888More specifically pijϵjvπj1=Vvπjp_{i_{j}}^{\epsilon_{j}}\circ v_{\pi_{j-1}}=_{V}v_{\pi_{j}}, but this is the same as equality as dom(vπj)=dom(vπj1)=Vdom(v_{\pi_{j}})=dom(v_{\pi_{j_{1}}})=V. Another way of putting this is that dom(pijϵj)=vπj1[V]dom(p_{i_{j}}^{\epsilon_{j}})=v_{\pi_{j-1}}[V] and cod(pijϵj)=vπj[V]cod(p_{i_{j}}^{\epsilon_{j}})=v_{\pi_{j}}[V]. for some vπj1,vπjAcutv_{\pi_{j-1}},v_{\pi_{j}}\in A_{cut} with last(πj1)=last(πj)last(\pi_{j-1})=last(\pi_{j}). In particular, there are vπ0,vπ1Acutv_{\pi_{0}},v_{\pi_{1}}\in A_{cut} such that last(π0)=last(π1)last(\pi_{0})=last(\pi_{1}) and pi1ϵ1vπ0=vπ1p_{i_{1}}^{\epsilon_{1}}v_{\pi_{0}}=v_{\pi_{1}}. Set ρ:=π0\rho:=\pi_{0}. It follows that vπ=Xvπ0v_{\pi}=_{X}v_{\pi_{0}} and so pvπ0=Xpvπ=Xvπpv_{\pi_{0}}=_{X}pv_{\pi}=_{X}v_{\pi^{\prime}}, i.e.

pvπ0=pimϵmpi1ϵ1vπ0=Xvπpv_{\pi_{0}}=p_{i_{m}}^{\epsilon_{m}}\circ...\circ p_{i_{1}}^{\epsilon_{1}}v_{\pi_{0}}=_{X}v_{\pi^{\prime}}

This was the base case for an inductive argument up to mm. So let jmj\leq m and suppose that vπjAcutv_{\pi_{j}}\in A_{cut} with last(πj)=last(π0)last(\pi_{j})=last(\pi_{0}) and

pvπ0=pi1ϵ1pij+1ϵj+1vπj=Xvπpv_{\pi_{0}}=p_{i_{1}}^{\epsilon_{1}}\circ...\circ p_{i_{j+1}}^{\epsilon_{j+1}}v_{\pi_{j}}=_{X}v_{\pi^{\prime}}

Now recall that pij+1ϵj+1vπj=vπj+1p_{i_{j+1}}^{\epsilon_{j+1}}v_{\pi_{j}}=v_{\pi_{j+1}} for some vπj+1Acutv_{\pi_{j+1}}\in A_{cut} with last(πj+1)=last(πj)last(\pi_{j+1})=last(\pi_{j}). Moreover, it follows that pi1ϵ1pij+2ϵj+2vπj+1=Xvπp_{i_{1}}^{\epsilon_{1}}\circ...\circ p_{i_{j+2}}^{\epsilon_{j+2}}v_{\pi_{j+1}}=_{X}v_{\pi^{\prime}}. Hence by induction, there is some vπmAcutv_{\pi_{m}}\in A_{cut} with pvπ0=vπmpv_{\pi_{0}}=v_{\pi_{m}} such that last(πm)=last(π0)last(\pi_{m})=last(\pi_{0}) and

vπm=pimϵmpi1ϵ1vπ0=pvπ0=Xvπv_{\pi_{m}}=p_{i_{m}}^{\epsilon_{m}}\circ...\circ p_{i_{1}}^{\epsilon_{1}}v_{\pi_{0}}=pv_{\pi_{0}}=_{X}v_{\pi^{\prime}}

then for ρ=π0\rho=\pi_{0} and ρ=πm\rho^{\prime}=\pi_{m} we have proved the lemma ∎

The associated first-order structure T(𝕄cut)T(\mathbb{M}_{cut}) of the Herwig extension is a finite model in a finite relational language τ+{A}\tau^{+}\cup\{A\}, and {p1,,pk}\{p_{1},...,p_{k}\} is a finite set of partial isomorphisms on it. Hence, by Herwig’s theorem, there exists a finite extension T(𝕄cut)+T(\mathbb{M}_{cut})^{+} of this structure, the Herwig extension, satisfying conditions (i)-(iii) w.r.t {p1,,pk}\{p_{1},...,p_{k}\}. It is easy to see that the Herwig extension corresponds in the canonical way (i.e. see the first-order translation above) to a dependence model 𝕄cut+:=(Mcut+,Acut+)\mathbb{M}_{cut}^{+}:=(M_{cut}^{+},A_{cut}^{+}) such that T(𝕄cut+)=T(𝕄cut)+T(\mathbb{M}_{cut}^{+})=T(\mathbb{M}_{cut})^{+}. Recall that we want to establish a bisimulation between the finite Herwig extension 𝕄cut+\mathbb{M}_{cut}^{+} and the infinite tree model 𝕄\mathbb{M}. To do this, we will need the following lemmas.

Lemma 4.2.

Level 2 Lemma
For every sAcut+s\in A_{cut}^{+} there is an fp1^,,pk^f\in\langle\widehat{p_{1}},...,\widehat{p_{k}}\rangle such that fs=vπAcutf\circ s=v_{\pi}\in A_{cut} where lh(π)2lh(\pi)\leq 2.

Proof.

Let sAcut+s\in A_{cut}^{+}. Then the tuple s(𝐯)I(A)s(\mathbf{v})\in I(A) is live in T(𝕄cut+)T(\mathbb{M}_{cut}^{+}). Hence by condition (ii) there is some automorphism fp1^,,pk^f\in\langle\widehat{p_{1}},...,\widehat{p_{k}}\rangle such that fs(𝐯)fs(\mathbf{v}) is a tuple of objects of the submodel T(Mcut)T(M_{cut}). As ff is an isomorphism, it follows that fs(𝐯)I(A)fs(\mathbf{v})\in I(A) as well. But this can only be if fs(𝐯)=vπ(𝐯)fs(\mathbf{v})=v_{\pi}(\mathbf{v}) for some vπAcutv_{\pi}\in A_{cut}. Now suppose that lh(π)=3lh(\pi)=3, with last(π)=Δlast(\pi)=\Delta, then by (i) there is an automorphism pπ^\widehat{p_{\pi}} such that pπ^fp1^,,pk^\widehat{p_{\pi}}f\in\langle\widehat{p_{1}},...,\widehat{p_{k}}\rangle and pπ^fs=vπΔ\widehat{p_{\pi}}fs=v_{\pi_{\Delta}}, where lh(πΔ)=2lh(\pi_{\Delta})=2. Hence we may assume that there exists some gp1^,,pk^g\in\langle\widehat{p_{1}},...,\widehat{p_{k}}\rangle such that gs=vπgs=v_{\pi} for some path assignment vπAcutv_{\pi}\in A_{cut} of length lh(π)2lh(\pi)\leq 2. ∎

Next, we generalise the notion of ’underlying type’ (i.e. last(π)last(\pi) for a path assignment vπv_{\pi}) to all assignments in Acut+A_{cut}^{+}. We define a function type():Acut+{ΔΦ|Δis aΦ-type}type(\;):A_{cut}^{+}\to\{\Delta\subseteq\Phi\;|\;\Delta\;\textrm{is a}\;\Phi\textrm{-type}\}. Set type(vπ):=last(π)type(v_{\pi}):=last(\pi) for all vπAcutAcut+v_{\pi}\in A_{cut}\subset A_{cut}^{+}. For sAcut+Acuts\in A_{cut}^{+}\setminus A_{cut}, by the level 2 lemma we know there is fp1^,,pk^f\in\langle\widehat{p_{1}},...,\widehat{p_{k}}\rangle such that fs=vπAcutfs=v_{\pi}\in A_{cut}, and we set type(s):=last(π)type(s):=last(\pi).

Proof.

Well-definedness of type()type(\;)
Let f,gp1^,,pk^f,g\in\langle\widehat{p_{1}},...,\widehat{p_{k}}\rangle be automorphisms with fs=VπAcutfs=V_{\pi}\in A_{cut} and gs=vπAcutgs=v_{\pi^{\prime}}\in A_{cut}. Observe that fg1f\circ g^{-1} is an automorphism in the subgroup p1^,,pk^\langle\widehat{p_{1}},...,\widehat{p_{k}}\rangle that maps elements in McutM_{cut} to each other, as fg1vπ=vπfg^{-1}\circ v_{\pi^{\prime}}=v_{\pi}. Hence by (iii) there must be a unique pp1,,pkp\in\langle p_{1},...,p_{k}\rangle such that p^=fg1\widehat{p}=fg^{-1} and thus pvπ=vπpv_{\pi^{\prime}}=v_{\pi}. Lemma 4.1 tells us that there are assignments vρ,vρAcutv_{\rho},v_{\rho^{\prime}}\in A_{cut} with vπ=Vvρv_{\pi}=_{V}v_{\rho}, vπ=Vvρv_{\pi^{\prime}}=_{V}v_{\rho^{\prime}} and last(ρ)=last(ρ)last(\rho)=last(\rho^{\prime}). It is an easy consequence of the Truth Lemma (lemma 2.1) and Locality that vπ=Vvρv_{\pi}=_{V}v_{\rho} implies that last(π)=last(ρ)last(\pi)=last(\rho) and similarly for π,ρ\pi^{\prime},\rho^{\prime}.999For suppose that vπ=Vvρv_{\pi}=_{V}v_{\rho}. Observe that DVvπ0=VD^{v_{\pi_{0}}}_{V}=V for any path assignment with dom(vπ0)=Vdom(v_{\pi_{0}})=V. By Locality the hypothesis gives that {ξ|vπξ&Free(ξ)V}={ξ|vρξ&Free(ξ)V}\{\xi\;|\;v_{\pi}\models\xi\;\&\;Free(\xi)\subseteq V\}=\{\xi\;|\;v_{\rho}\models\xi\;\&\;Free(\xi)\subseteq V\}. By the Truth Lemma, this in turn implies that {ξ|ξlast(π)&Free(ξ)V}={ξ|ξlast(ρ)&Free(ξ)V}\{\xi\;|\;\xi\in last(\pi)\;\&\;Free(\xi)\subseteq V\}=\{\xi\;|\;\xi\in last(\rho)\;\&\;Free(\xi)\subseteq V\} which says that last(π)Vlast(ρ)last(\pi)\sim_{V}last(\rho), but this clearly implies that last(π)=last(ρ)last(\pi)=last(\rho). Hence last(π)=last(ρ)=last(ρ)=last(π)last(\pi)=last(\rho)=last(\rho^{\prime})=last(\pi^{\prime}). ∎

This last fact used, i.e. that vπ=Xvπv_{\pi}=_{X}v_{\pi^{\prime}} implies last(π)Xlast(π)last(\pi)\sim_{X}last(\pi^{\prime}), we will now generalise to all assignments in s,tAcut+s,t\in A_{cut}^{+} w.r.t their ’underlying types’ type(s),type(t)type(s),type(t).

Lemma 4.3.

Type Lemma
If s,tAcut+s,t\in A_{cut}^{+} with s=Xts=_{X}t, then type(s)Xtype(t)type(s)\sim_{X}type(t).

Proof.

Let s,tAcut+s,t\in A_{cut}^{+} with s=Xts=_{X}t. By the level 2 lemma, there is fp1^,,pk^f\in\langle\widehat{p_{1}},...,\widehat{p_{k}}\rangle such that fs=vπAcutfs=v_{\pi}\in A_{cut} with lh(π)2lh(\pi)\leq 2, so type(s)=last(π)type(s)=last(\pi). As ff is an isomorphism on T(𝕄cut+)T(\mathbb{M}_{cut}^{+}), we know that ftAcut+ft\in A_{cut}^{+} is an assignment as well, with fs=vπ=Xftfs=v_{\pi}=_{X}ft. By applying the level 2 lemma again to ftft, we get a gp1^,,pk^g\in\langle\widehat{p_{1}},...,\widehat{p_{k}}\rangle such that gft=vπAcutgft=v_{\pi^{\prime}}\in A_{cut} with lh(π)2lh(\pi^{\prime})\leq 2, so type(t)=last(π)type(t)=last(\pi^{\prime}). Again, we know that gvπAcut+gv_{\pi}\in A_{cut}^{+} is also an assignment (though in general not one in AcutA_{cut}) such that gvπ=gfs=Xgft=vπgv_{\pi}=gfs=_{X}gft=v_{\pi^{\prime}}. But observe that the automorphism gg maps vπ(x)vπ(x)v_{\pi}(x)\mapsto v_{\pi^{\prime}}(x) for all xXx\in X, hence by condition (iii) there must be a unique pp1,,pkp\in\langle p_{1},...,p_{k}\rangle such that p^=g\widehat{p}=g and so pvπ=Xvπpv_{\pi}=_{X}v_{\pi^{\prime}}. By Lemma 4.1, there are vρ,vρAcutv_{\rho},v_{\rho^{\prime}}\in A_{cut} such that vπ=Xvρv_{\pi}=_{X}v_{\rho}, vπ=Xvρv_{\pi^{\prime}}=_{X}v_{\rho^{\prime}} and last(ρ)=last(ρ)last(\rho)=last(\rho^{\prime}). Invoking the Truth Lemma and Locality as before this implies that last(π)Xlast(ρ)last(\pi)\sim_{X}last(\rho) and last(π)Xlast(ρ)last(\pi^{\prime})\sim_{X}last(\rho^{\prime}). Concatenating these facts we see that

type(s)=last(π)Xlast(ρ)=last(ρ)Xlast(π)=type(t)type(s)=last(\pi)\sim_{X}last(\rho)=last(\rho^{\prime})\sim_{X}last(\pi^{\prime})=type(t)

Lemma 4.4.

Encoding Lemma
For all sAcut+s\in A_{cut}^{+} and all RX,yτ+:sRX,y𝐱DXyR^{X,y}\in\tau^{+}:\quad s\models R^{X,y}\mathbf{x}\leftrightarrow D_{X}y

Proof.

(\leftarrow) By the level 2 lemma, there is fp1^,,pk^f\in\langle\widehat{p_{1}},...,\widehat{p_{k}}\rangle such that fs=vπAcutfs=v_{\pi}\in A_{cut} with lh(π)2lh(\pi)\leq 2. Applying the first-order translation to proposition 3.1 we get that T(𝕄cut),vπtr(¬RX,y𝐱¬DXy)T(\mathbb{M}_{cut}),v_{\pi}\models tr(\neg R^{X,y}\mathbf{x}\to\neg D_{X}y). But observe that

tr(¬RX,y𝐱¬DXy)=¬RX,y𝐱tr(¬DXy)\displaystyle tr(\neg R^{X,y}\mathbf{x}\to\neg D_{X}y)\;=\;\neg R^{X,y}\mathbf{x}\to tr(\neg D_{X}y)\;\equiv\; RX,y𝐱𝐳,𝐳(A𝐯A𝐯[𝐳/𝐳]yy)\displaystyle R^{X,y}\mathbf{x}\vee\exists\mathbf{z},\mathbf{z^{\prime}}(A\mathbf{v}\wedge A\mathbf{v^{\prime}}[\mathbf{z}/\mathbf{z^{\prime}}]\wedge y\neq y^{\prime})
\displaystyle\;\equiv\; 𝐳,𝐳(RX,y𝐱(A𝐯A𝐯[𝐳/𝐳]yy))\displaystyle\exists\mathbf{z},\mathbf{z^{\prime}}(R^{X,y}\mathbf{x}\vee(A\mathbf{v}\wedge A\mathbf{v^{\prime}}[\mathbf{z}/\mathbf{z^{\prime}}]\wedge y\neq y^{\prime}))

is an existential first-order formula. Hence by the dualized version of the Łoś-Tarski theorem, this still holds in the Herwig extension, i.e. T(𝕄cut+),vπ¬RX,y𝐱tr(¬DXy)T(\mathbb{M}_{cut}^{+}),v_{\pi}\models\neg R^{X,y}\mathbf{x}\to tr(\neg D_{X}y). As ff is an isomorphism on T(𝕄cut+)T(\mathbb{M}_{cut}^{+}) and fs=vπfs=v_{\pi}, we get that T(𝕄cut+),s¬RX,y𝐱tr(¬DXy)T(\mathbb{M}_{cut}^{+}),s\models\neg R^{X,y}\mathbf{x}\to tr(\neg D_{X}y), as desired.

(\to) Suppose that sRX,y𝐱s\models R^{X,y}\mathbf{x}, and let s=Xts=_{X}t for some tAcut+t\in A_{cut}^{+}. The former fact implies that DXytype(s)D_{X}y\in type(s) and the latter by the Type Lemma implies that type(s)Xtype(t)type(s)\sim_{X}type(t). It follows that DXytype(t)D_{X}y\in type(t) as well. Applying the level 2 lemma two times successively as before, we obtain automorphism f,gp1^,,pk^f,g\in\langle\widehat{p_{1}},...,\widehat{p_{k}}\rangle such that fs=vπAcut+fs=v_{\pi}\in A_{cut}^{+}, gft=vπAcut+gft=v_{\pi^{\prime}}\in A_{cut}^{+} with Vs,t=Vvπ,ft=Vgvπ,vπV^{s,t}=V^{v_{\pi},ft}=V^{gv_{\pi},v_{\pi^{\prime}}} (recall the notation Va,b={vV|a=vb}V^{a,b}=\{v\in V\;|\;a=_{v}b\}). As in the type lemma, we see that g:vπ(x)vπ(x)g:v_{\pi}(x)\mapsto v_{\pi^{\prime}}(x) (i.e. gvπ=Xvπgv_{\pi}=_{X}v_{\pi^{\prime}}) for all xXx\in X and thus by condition (iii) there is a unique pp1,,pkp\in\langle p_{1},...,p_{k}\rangle such that p^=g\widehat{p}=g and hence pvπ=Xvπpv_{\pi}=_{X}v_{\pi^{\prime}}.

We know by Lemma 4.1 that there must be vρ,vρAcutv_{\rho},v_{\rho^{\prime}}\in A_{cut} with last(ρ)=last(ρ)last(\rho)=last(\rho^{\prime}) such that vπ=Xvρv_{\pi}=_{X}v_{\rho}, vπ=Xvρv_{\pi^{\prime}}=_{X}v_{\rho^{\prime}} and pvρ=vρpv_{\rho}=v_{\rho^{\prime}}. By fact 4.9 from [3], this implies that there is a path last(π)X.Xlast(ρ)last(\pi)\sim_{X}....\sim_{X}last(\rho) and similarly for π,ρ\pi^{\prime},\rho^{\prime}. We saw that DXytype(s)type(t)=last(π)last(π)D_{X}y\in type(s)\cap type(t)=last(\pi)\cap last(\pi^{\prime}), so in fact DXyD_{X}y must be in all the types along these paths. But then it follows from condition (2) of the recursive definition of path assignments (in the proof of theorem 2.1) that vπ=yvρv_{\pi}=_{y}v_{\rho} and vπ=yvρv_{\pi^{\prime}}=_{y}v_{\rho^{\prime}}. But recall that pvρ=vρpv_{\rho}=v_{\rho^{\prime}} so:

ft=g1gft=g1vπ=p1^vπ=yp1^vρ=vρft=g^{-1}gft=g^{-1}v_{\pi^{\prime}}=\widehat{p^{-1}}v_{\pi^{\prime}}=_{y}\widehat{p^{-1}}v_{\rho^{\prime}}=v_{\rho}

But then vπ=yvρ=yftv_{\pi}=_{y}v_{\rho}=_{y}ft so by transitivity yVvπ,ft=Vs,ty\in V^{v_{\pi},ft}=V^{s,t} and we conclude that s=yts=_{y}t. ∎

Theorem 4.2.

The dependence models 𝕄\mathbb{M} and 𝕄cut+\mathbb{M}_{cut}^{+} are dependence-bisimilar.

Proof.

We show that the relation ZAcut+×AZ\subseteq A_{cut}^{+}\times A defined by Z:={(s,vπ)|type(s)=last(π)}Z:=\{(s,v_{\pi})\;|\;type(s)=last(\pi)\} is an LFD-bisimulation in the sense of [6] and hence, by our remark above, also a dependence bisimulation. Pick an arbitrary pair (s,vπ)Z(s,v_{\pi})\in Z. By the level 2 lemma, there is some fp1^,,pk^f\in\langle\widehat{p_{1}},...,\widehat{p_{k}}\rangle such that fs=vπAcutfs=v_{\pi^{\prime}}\in A_{cut} with lh(π)2lh(\pi^{\prime})\leq 2, hence type(s)=vπtype(s)=v_{\pi^{\prime}}. As type()type(\;) is well-defined, it follows that last(π)=last(π)last(\pi)=last(\pi^{\prime}). We show that the pair (s,vπ)(s,v_{\pi}) satisfies (Atom) (i.e. the one which also ranges over dependence atoms [6]) and is closed under the (Back) & (Forth) clauses (without the dependence-closedness condition).

(Atom) Observe that the chain of equivalences:

s𝕄cut+P𝐱iffvπ𝕄cut+P𝐱iffP𝐱last(π)=last(π)iffvπ𝕄P𝐱s\models_{\mathbb{M}_{cut}^{+}}P\mathbf{x}\quad\textrm{iff}\quad v_{\pi^{\prime}}\models_{\mathbb{M}_{cut}^{+}}P\mathbf{x}\quad\textrm{iff}\quad P\mathbf{x}\in last(\pi^{\prime})=last(\pi)\quad\textrm{iff}\quad v_{\pi}\models_{\mathbb{M}}P\mathbf{x}

holds for every Pτ+P\in\tau^{+} (i.e. including the relations RX,yR^{X,y}!) by the fact that ff is an isomorphism with fs=vπfs=v_{\pi^{\prime}} and the way we have specified the interpretation I(P)I(P) on both models. Invoking the encoding lemma, this implies that 𝕄cut+,sDXy\mathbb{M}_{cut}^{+},s\models D_{X}y iff 𝕄,vπDXy\mathbb{M},v_{\pi}\models D_{X}y.

(Forth) Let tAcut+t\in A_{cut}^{+} be some assignment in the Herwig extension, and let Vs,tV^{s,t} be the maximal set of variables on which ss and tt agree. By the Type Lemma type(s)Vs,ttype(t)type(s)\sim_{V^{s,t}}type(t). But last(π)=last(π)=type(s)last(\pi)=last(\pi^{\prime})=type(s), so it follows that π+:=(π,Vs,t,type(t))\pi^{+}:=(\pi,V^{s,t},type(t)) is a good path. Clearly vπ+Av_{\pi^{+}}\in A with vπ=Vs,tvπ+v_{\pi}=_{V^{s,t}}v_{\pi^{+}}, and lastly (t,vπ+)Z(t,v_{\pi^{+}})\in Z as type(t)=last(π+)type(t)=last(\pi^{+}).

(Back) Let vπ′′Av_{\pi^{\prime\prime}}\in A, with Vπ,π′′={vV|vπ=vvπ′′}V^{\pi,\pi^{\prime\prime}}=\{v\in V\;|\;v_{\pi}=_{v}v_{\pi^{\prime\prime}}\} the maximal set ot variables on which vπ,vπ′′v_{\pi},v_{\pi^{\prime\prime}} agree. By a now familiar argument involving the Truth lemma and Locality (i.e. the analogue of the Type Lemma for 𝕄\mathbb{M}), it follows that last(π)Vs,tlast(π′′)last(\pi)\sim_{V^{s,t}}last(\pi^{\prime\prime}). As type(s)=last(π)=last(π)type(s)=last(\pi^{\prime})=last(\pi), we see that π+:=(π,Vπ,π′′,last(π′′))\pi^{\prime}_{+}:=(\pi^{\prime},V^{\pi,\pi^{\prime\prime}},last(\pi^{\prime\prime})) is a good path. Moreover, we know that lh(π)2lh(\pi^{\prime})\leq 2 which implies that lh(π+)=lh(π)+12+1=3lh(\pi^{\prime}_{+})=lh(\pi^{\prime})+1\leq 2+1=3 and so vπ+Acutv_{\pi^{\prime}_{+}}\in A_{cut} is in the cut-off model. Clearly vπ=Vπ,π′′vπ+v_{\pi^{\prime}}=_{V^{\pi,\pi^{\prime\prime}}}v_{\pi^{\prime}_{+}}. Set t:=f1vπ+t:=f^{-1}v_{\pi^{\prime}_{+}}, then s=Vs,tts=_{V^{s,t}}t as fs=vπfs=v_{\pi^{\prime}} and moreover (t,vπ′′)Z(t,v_{\pi^{\prime\prime}})\in Z since type(t)=last(π+)=last(π′′)type(t)=last(\pi^{\prime+})=last(\pi^{\prime\prime}). ∎

Corollary 4.1.

Bounded Model Property
Every satisfiable φ\varphi in LFD has a finite model whose size is bounded by a computable function of φ\varphi. 101010Any formula φ\varphi determines a unique smallest finite vocabulary (V,τ)(V,\tau) such that Cl(φ)Cl(\varphi) belongs to LFD[V,τ]LFD[V,\tau]; the computable function takes as input |V||V|, the maximal arity rr of relations in τ\tau, and the number of distinct Φ\Phi-types mm.

Proof.

Let φ\varphi be a satisfiable LFD-formula with closure Φ\Phi in the language (V,τ)(V,\tau) and |V|=k|V|=k. By the tree model property, there is a kk-tree MM and a team AA such that 𝕄=(M,A)\mathbb{M}=(M,A) is a dependence model satisfying φ\varphi at the root assignment. We cut this tree at length 3 and obtain the cut-off model whose size is upper bounded by k(b+b2+b3)k(b+b^{2}+b^{3}), where bb\in\mathbb{N} is the branching degree of the kk-tree 𝕄\mathbb{M}. Note that bb itself has m×2km\times 2^{k} as upper bound, where m:=|{ΔΦ|Δis aΦ-type}|m:=|\{\Delta\subseteq\Phi\;|\;\Delta\;\textrm{is a}\;\Phi\textrm{-type}\}| and Φ=Cl(φ)\Phi=Cl(\varphi). It follows that the size of the cut-off model is already exponential in the size of the variables |V||V|.

Now construct the Herwig extension 𝕄cut+=(Mcut+,Acut+)\mathbb{M}_{cut}^{+}=(M_{cut}^{+},A_{cut}^{+}) as above. Using the bound given in [8], we get that |Mcut+|itexp(2r1,p(|Mcut|)|M_{cut}^{+}|\leq itexp(2r-1,p(|M_{cut}|) is upper bounded by an iterated exponential of a polynomial function pp of degree rr of |Mcut||M_{cut}|, where rr is the maximal arity of predicates in τ\tau. By theorem 4.2, 𝕄\mathbb{M} and 𝕄cut+\mathbb{M}_{cut}^{+} are-bisimilar. As dependence bisimulations are always total, there is some assignment sAcut+s\in A_{cut}^{+} with (s,vΣ0)Z(s,v_{\langle\Sigma_{0}\rangle})\in Z. By the invariance result above (proposition 3.1), it follows that 𝕄cut+,sφ\mathbb{M}_{cut}^{+},s\models\varphi. ∎

5 Conclusion

We have introduced dependence bisimulations and have shown that this notion characterizes LFD as a fragment of FOL. Furthermore, we have shown that LFD has the finite (or bounded) model property, by a new application of Herwig’s theorem and a tree-model property established in [3]. The same strategy can be used to carry out a direct proof of the FMP through the equivalent modal semantics.111111The proof of this can be found in an extended version of this paper (arXiv:2107.06042). With minor adaptations, the proof goes through, though we need to appeal to a more general version of Herwig’s theorem (theorem 5 in [8]) to ensure that the Herwig extension is a tree in order to obtain a standard relational model from it. By reducing the maximal arity rr to 22, going through the modal semantics significantly lowers the upper bound on the size of the Herwig extension to being singly exponential in the size of the cut-off model.

While LFD only adds local dependence atoms DXyD_{X}y to CRS, extensions of CRS with other local versions of atomic dependency properties have been considered in [6].121212We will consider only the logics defined in [6] that are closed under negation, i.e. those L[Ω]L[\Omega] for which Ω\Omega is closed under negation. The authors show that LFD extended with either equality or inclusion is undecidable and that the extension of CRS with both inclusion and equality is contained in GF. CRS with independence atoms was shown undecidable in [3], resulting in a complete characterization of the satisfiability problems of such logics. The same paper also studies the model-checking problem for such logics, and shows it to be PTIME-complete in restriction to finitely many variables. However, this tight bound is only obtained on the assumption that the local atoms considered (i.e. inclusion, dependence, independence and equality) are all efficiently checkable.

One open problem is to determine the computational complexity of the satisfiability problem for LFD. It seems that, with a few adaptations, the satisfiability test for GF given in [7] can be used for the case of LFD. Indeed, the ’witnesses for satisfiability’ defined there closely resemble type models. A more conceptual challenge is connecting the qualitative notion of dependence studied by LFD to probabilistic, i.e. quantitative notions of correlation and dependence.

References

  • [1]
  • [2] Hajnal Andréka, István Németi & Johan van Benthem (1998): Modal Languages and Bounded Fragments of Predicate Logic. J. Philos. Log. 27(3), pp. 217–274, 10.1023/A:1004275029985.
  • [3] Alexandru Baltag & Johan van Benthem (2021): A Simple Logic of Functional Dependence. Journal of Philosophical Logic, 10.1007/s10992-020-09588-z.
  • [4] Alexandru Baltag & Sonja Smets (2020): Learning What Others Know. In Elvira Albert & Laura Kovacs, editors: LPAR23. LPAR-23: 23rd International Conference on Logic for Programming, Artificial Intelligence and Reasoning, EPiC Series in Computing 73, EasyChair, pp. 90–119, 10.29007/plm4. Available at https://easychair.org/publications/paper/V8Jp.
  • [5] Johan van Benthem (2001): Exploring Logical Dynamics. Studia Logica 67(1), pp. 111–114, 10.1023/A:1017389612557.
  • [6] Erich Grädel & Phil Pützstück (2021): Logics of Dependence and Independence: The Local Variants.
  • [7] Erich Grädel (1999): On the Restraining Power of Guards. Journal of Symbolic Logic 64(4), p. 1719–1742, 10.2307/2586808.
  • [8] B. Herwig (1998): Extending partial isomorphisms for the small index property of many ω\omega-categorical structures. Israel Journal of Mathematics 107, pp. 93–123, 10.1007/BF02764005.