This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

\hideLIPIcs

University of Helsinki, Department of Mathematics and Statistics, Helsinki, Finlandemailhttps://orcid.org/0000-0003-0115-5154Funded by grants 308712 and 338259 of the Academy of Finland Leibniz Universität Hannover, Institut für Theoretische Informatik, Hannover, [email protected]://orcid.org/0000-0002-8061-5376Funded by the German Research Foundation (DFG), project ME4279/1-2 Leibniz Universität Hannover, Institut für Theoretische Informatik, Hannover, [email protected]://orcid.org/0000-0002-5651-5391Funded by the German Research Foundation (DFG), project ME4279/1-2 \CopyrightJuha Kotinen, Arne Meier, and Yasir Mahmood {CCSXML} <ccs2012> <concept> <concept_id>10003752.10003790.10003800</concept_id> <concept_desc>Theory of computation Higher order logic</concept_desc> <concept_significance>500</concept_significance> </concept> <concept> <concept_id>10003752.10003777.10003779</concept_id> <concept_desc>Theory of computation Problems, reductions and completeness</concept_desc> <concept_significance>500</concept_significance> </concept> </ccs2012> \ccsdesc[500]Theory of computation Higher order logic \ccsdesc[500]Theory of computation Problems, reductions and completeness

A Parameterized View on the Complexity of Dependence Logic

Juha Kontinen    Arne Meier    Yasir Mahmood
Abstract

In this paper, we investigate the parameterized complexity of model checking for Dependence Logic which is a well studied logic in the area of Team Semantics. We start with a list of nine immediate parameterizations for this problem, namely: the number of disjunctions (i.e., splits)/(free) variables/universal quantifiers, formula-size, the tree-width of the Gaifman graph of the input structure, the size of the universe/team, and the arity of dependence atoms. We present a comprehensive picture of the parameterized complexity of model checking and obtain a division of the problem into tractable and various intractable degrees. Furthermore, we also consider the complexity of the most important variants (data and expression complexity) of the model checking problem by fixing parts of the input.

keywords:
Team Semantics, Dependence Logic, Parameterized Complexity, Model Checking

1 Introduction

In this article, we explore the parameterized complexity of model checking for dependence logic (𝒟\mathcal{D}). We give a concise classification of this problem and its standard variants (expression and data complexity) with respect to several syntactic and structural parameters. Our results lay down a solid foundation for a systematic study of the parameterized complexity of team-based logics.

The introduction of Dependence Logic [27] in 2007 marks also the birth of the general semantic framework of team semantics that has enabled a systematic study of various notions of dependence and independence during the past decade. Team semantics differs from Tarski’s semantics by interpreting formulas by sets of assignments instead of a single assignment as in first-order logic. Syntactically, dependence logic is an extension of first-order logic by new dependence atoms 𝖽𝖾𝗉(𝐱;𝐲){\mathsf{dep}}({\mathbf{x}};{\mathbf{y}}) expressing that the values of variables 𝐱\mathbf{x} functionally determine values of the variables 𝐲\mathbf{y} (in the team under consideration). Soon after the introduction of dependence logic many other interesting team-based logics and atoms were introduced such as inclusion, exclusion, and independence atoms that are intimately connected to the corresponding inclusion, exclusion, and multivalued dependencies studied in database theory [13, 9]. Furthermore, the area has expanded, e.g., to propositional, modal and probabilistic variants (see [15, 19, 14] and the references therein).

For the applications, it is important to understand the complexity theoretic aspects of dependence logic and its variants. In fact, during the past few years, these aspects have been addressed in several studies. For example, on the level of sentences dependence logic and independence logic are equivalent to existential second-order logic while inclusion logic corresponds to positive greatest fixed point logic and thereby captures P over finite (ordered) structures [11]. Furthermore, there are (non-parameterized) studies that restrict the syntax and try to pin the intractability of a problem to a particular (set of) connective(s). For instance, Durand and Kontinen [5] characterize the data complexity of fragments of dependence logic with bounded arity of dependence atoms/number of universal quantifiers, and Grädel [12] characterizes the combined and the expression complexity of the model checking problem of dependence logic. These studies will be of great help in developing our parameterized approach.

A formalism to enhance the understanding of the inherent intractability of computational problems is brought by the framework of parameterized complexity [4]. Initiated by the founding fathers Downey and Fellows, in this area within computational complexity theory one strives for more structure within the darkness of intractability. Essentially, one tries to identify so-called parameters of a considered problem Π\Pi to find algorithms solving Π\Pi with runtimes of the form f(k)|x|O(1)f(k)\cdot|x|^{O(1)} for inputs xx, corresponding parameter values kk, and a computable function ff. These kind of runtimes are called FPT-runtimes (from fixed-parameter tractable; short FPT) and tame the combinatoric explosion of the solution space to a function ff in the parameter. As a very basic example in this vein, we can consider the propositional satisfiability problem SAT\mathrm{SAT}. An immediate parameter that pulls the problem into the class FPT is the number of variables, as one can solve SAT\mathrm{SAT} in time 2k|φ|2^{k}\cdot|\varphi| if kk is the number of variables of a given propositional formula φ\varphi. Yet, this parameter is not very satisfactory as it neither is seen fixed nor slowly growing in its practical instances. However, there are several interesting other parameters under which SAT\mathrm{SAT} becomes fixed-parameter tractable, e.g., the so-called treewidth of the underlying graph representations of the considered formula [26]. This term was coined by Robertson and Seymour in 1984 [25] and established a profound position (currently DBLP lists 812 papers with treewidth in its title) also in the area of parameterized complexity in the last years [3, 4].

Coming back to fpt-runtimes, a runtime of a very different quality (yet still polynomial for fixed parameters) than FPT is summarized by the complexity class XP: |x|f(k)|x|^{f(k)} for inputs xx, corresponding parameter values kk, and a computable function ff. Furthermore, analogously as XP but on nondeterministic machines, the class XNP will be of interest in this paper. Further up in the hierarchy, classes of the form para𝒞\textbf{para}\mathcal{C} for a classical complexity class 𝒞{NP,PSPACE,NEXP}\mathcal{C}\in\{\textbf{NP},\textbf{PSPACE},\textbf{NEXP}\} play a role in this paper. Such classes intuitively capture all problems that are in the complexity class 𝒞\mathcal{C} after fpt-time preprocessing. In Fig. 1 an overview of these classes and their relations are depicted (for further details see, e.g., the work of Elberfeld et al. [7]).

paraNEXPparaPSPACEXNPW[P]paraNPXPFPTDTM: f(k)poly(|x|)f(k)\cdot\text{poly}(|x|) timeNTM: f(k)poly(|x|)f(k)\cdot\text{poly}(|x|) timeDTM: poly(|x|)f(k)\text{poly}(|x|)^{f(k)} timeDTM: f(k)poly(|x|)f(k)\cdot\text{poly}(|x|) spaceNTM: poly(|x|)f(k)\text{poly}(|x|)^{f(k)} timeNTM: f(k)2poly(|x|)f(k)\cdot 2^{\text{poly}(|x|)} time
Figure 1: Landscape showing relations of relevant parameterized complexity classes with machine definitions.

Recently, the propositional variant of dependence logic (𝒫𝒟\mathcal{PDL}) has been investigated regarding its parameterized complexity [23, 20]. Moreover, propositional independence and inclusion logic have also been studied from the perspective of parameterized complexity [21]. In this paper, we further pursue the parameterized journey through the world of team logics and will visit the problems of first-order dependence logic 𝒟\mathcal{D}. As this paper is the first one that investigates 𝒟\mathcal{D} from the parameterized point of view, we need to gather the existing literature and revisit many results particularly from this perspective. As a result, this paper can be seen as a systematic study with some of the result following in a straightforward manner from the known non-parameterized results and some shedding light also on the non-parameterized view of model checking.

We give an example below to illustrate how the concept of dependence arises as a natural phenomenon in the physical world.

Example 1.1.
Flight Destination Gate Date Time
FIN-70 HEL – FI C1 04.10.2021 09:55
SAS-475 OSL – NO C3 04.10.2021 12:25
SAS-476 HAJ – DE C2 04.10.2021 12:25
FIN-80 HEL – FI C1 04.10.2021 19:55
KLM-615 ATL – USA A5 05.10.2021 11:55
QR-70 DOH – QR B6 05.10.2021 12:25
THY-159 IST – TR A1 05.10.2021 15:55
FIN-80 HEL – FI C1 05.10.2021 19:55
Table 1: An example flight departure screen at an airport

The database in Table 1 presents a screen at an airport for showing details about departing flights. Alternatively, it can be seen as a team TT over attributes in the top row as variables. Clearly, T𝖽𝖾𝗉(Flight,Date,Time;Destination,Gate)T\models{\mathsf{dep}}({\texttt{Flight,Date,Time}};{\texttt{Destination,Gate}}), as well as T𝖽𝖾𝗉(Gate,Date,Time;Destination, Flight)T\models{\mathsf{dep}}({\texttt{Gate,Date,Time}};{\texttt{Destination, Flight}}). Whereas, T⊧̸𝖽𝖾𝗉(Destination,Gate;Time)T\not\models{\mathsf{dep}}({\texttt{Destination,Gate}};{\texttt{Time}}) as witnessed by the pair (FIN-7070, HEL – FI, C11 , 04.10.202104.10.2021, 09:5509:55) and (FIN-8080, HEL – FI, C11 , 04.10.202104.10.2021, 19:5519:55).

Contribution.

Our classification is two-dimensional:

  1. 1.

    We consider the model checking problem of 𝒟\mathcal{D} under various parameterizations: number of split-junctions in a formula #𝗌𝗉𝗅𝗂𝗍𝗌\#\mathsf{splits}, the length of the formula |Φ||\Phi|, number of free variables #𝖿𝗋𝖾𝖾-𝗏𝖺𝗋𝗂𝖺𝖻𝗅𝖾𝗌\#\mathsf{free\text{-}variables}, the treewidth of the structure 𝗍𝗐(𝒜)\mathsf{tw}(\mathcal{A}), the size of the structure |𝒜|{|\mathcal{A}|}, the size of the team |T||T|, the number of universal quantifiers in the formula #\#\forall, the arity of the dependence atoms 𝖽𝖾𝗉-𝖺𝗋𝗂𝗍𝗒\mathsf{dep\text{-}arity}, as well as the total number of variables #𝗏𝖺𝗋𝗂𝖺𝖻𝗅𝖾𝗌\#\mathsf{variables}.

  2. 2.

    We distinguish between expression complexity 𝖾𝖼\mathsf{ec} (the input structure is fixed), data complexity 𝖽𝖼\mathsf{dc} (the formula is fixed), and combined complexity 𝖼𝖼\mathsf{cc}.

The results are summarized in Table 2. For instance, the parameters #,𝖽𝖾𝗉-𝖺𝗋𝗂𝗍𝗒,\#\forall,\mathsf{dep\text{-}arity}, and #𝗏𝖺𝗋𝗂𝖺𝖻𝗅𝖾𝗌\#\mathsf{variables} impact in lowering the complexity for 𝖾𝖼\mathsf{ec} (and not for 𝖼𝖼\mathsf{cc} or 𝖽𝖼\mathsf{dc}), while the parameter |𝒜|{|\mathcal{A}|} impacts for 𝖽𝖼\mathsf{dc} but not for 𝖼𝖼\mathsf{cc} or 𝖾𝖼\mathsf{ec}.

Related work.

The parameterized complexity analyses in the propositional setting [23, 20, 21] have considered the combined complexity of model checking and satisfiability as problems of interest. On the 𝖼𝖼\mathsf{cc}-level, the picture there is somewhat different, e.g., team size as a parameter for propositional dependence logic enabled a FPT algorithm while in our setting it has no effect on the complexity (paraNEXP). Grädel [12] studied the expression and the combined complexity for 𝒟\mathcal{D} in the classical setting, whereas the data complexity was considered by Kontinen [16].

Organization of the paper.

In Section 2, we introduce the foundational concepts of dependence logic as well as parameterized complexity. In Section 3 our results are presented while Section 4 concludes the article.

2 Preliminaries

We require standard notions from classical complexity theory [24]. We encounter the classical complexity classes P,NP,PSPACE,NEXP\textbf{P},\textbf{NP},\textbf{PSPACE},\textbf{NEXP} and their respective completeness notions, employing polynomial time many-one reductions (mP\leq^{\textbf{P}}_{m}).

Parameterized Complexity Theory.

A parameterized problem (PP) PΣ×P\subseteq\Sigma^{*}\times\mathbb{N} is a subset of the crossproduct of an alphabet and the natural numbers. For an instance (x,k)Σ×(x,k)\in\Sigma^{*}\times\mathbb{N}, kk is called the (value of the) parameter. A parameterization is a polynomial-time computable function that maps a value from xΣx\in\Sigma^{*} to its corresponding kk\in\mathbb{N}. The problem PP is said to be fixed-parameter tractable (or in the class FPT) if there exists a deterministic algorithm 𝒜\mathcal{A} and a computable function ff such that for all (x,k)Σ×(x,k)\in\Sigma^{*}\times\mathbb{N}, algorithm 𝒜\mathcal{A} correctly decides the membership of (x,k)P(x,k)\in P and runs in time f(k)|x|O(1)f(k)\cdot|x|^{O(1)}. The problem PP belongs to the class XP if 𝒜\mathcal{A} runs in time |x|f(k)|x|^{f(k)} on a deterministic machine, whereas XNP is the non-deterministic counterpart of XP. Abusing a little bit of notation, we write 𝒞\mathcal{C}-machine for the type of machines that decide languages in the class 𝒞\mathcal{C}, and we will say a function ff is “𝒞\mathcal{C}-computable” if it can be computed by a machine on which the resource bounds of the class 𝒞\mathcal{C} are imposed.

Also, we work with classes that can be defined via a precomputation on the parameter.

Definition 2.1.

Let 𝒞\mathcal{C} be any complexity class. Then para𝒞\textbf{para}\mathcal{C} is the class of all PPs PΣ×P\subseteq\Sigma^{*}\times\mathbb{N} such that there exists a computable function π:Δ\pi\colon\mathbb{N}\to\Delta^{*} and a language L𝒞L\in\mathcal{C} with LΣ×ΔL\subseteq\Sigma^{*}\times\Delta^{*} such that for all (x,k)Σ×(x,k)\in\Sigma^{*}\times\mathbb{N} we have that (x,k)P(x,π(k))L(x,k)\in P\Leftrightarrow(x,\pi(k))\in L.

Notice that paraP=FPT\textbf{para}\textbf{P}=\textbf{FPT}. The complexity classes 𝒞{NP,PSPACE,NEXP}\mathcal{C}\in\{\textbf{NP},\textbf{PSPACE},\textbf{NEXP}\} are used in the para𝒞\textbf{para}\mathcal{C} context by us.

A problem PP is in the complexity class W[P], if it can be decided by a NTM running in time f(k)|x|O(1)f(k)\cdot|x|^{O(1)} steps, with at most g(k)g(k)-many non-deterministic steps, where f,gf,g are computable functions. Moreover, W[P] is contained in the intersection of paraNP and XP (for details see the textbook of Flum and Grohe [8]).

Let cc\in\mathbb{N} and PΣ×P\subseteq\Sigma^{*}\times\mathbb{N} be a PP, then the cc-slice of PP, written as PcP_{c} is defined as Pc:={(x,k)Σ×k=c}P_{c}:=\{\,(x,k)\in\Sigma^{*}\times\mathbb{N}\mid k=c\,\}. Notice that PcP_{c} is a classical problem then. Observe that, regarding our studied complexity classes, showing membership of a PP PP in the complexity class para𝒞\textbf{para}\mathcal{C}, it suffices to show that for each slice Pc𝒞P_{c}\in\mathcal{C} is true.

Definition 2.2.

Let PΣ×,QΓP\subseteq\Sigma^{*}\times\mathbb{N},Q\subseteq\Gamma^{*} be two PPs. One says that PP is fpt-reducible to QQ, PFPTQP\leq^{\textbf{FPT}}Q, if there exists an FPT-computable function f:Σ×Γ×f\colon\Sigma^{*}\times\mathbb{N}\to\Gamma^{*}\times\mathbb{N} such that

  • for all (x,k)Σ×(x,k)\in\Sigma^{*}\times\mathbb{N} we have that (x,k)Pf(x,k)Q(x,k)\in P\Leftrightarrow f(x,k)\in Q,

  • there exists a computable function g:g\colon\mathbb{N}\to\mathbb{N} such that for all (x,k)Σ×(x,k)\in\Sigma^{*}\times\mathbb{N} and f(x,k)=(x,k)f(x,k)=(x^{\prime},k^{\prime}) we have that kg(k)k^{\prime}\leq g(k).

Finally, in order to show that a problem PP is para𝒞\textbf{para}\mathcal{C}-hard (for some complexity class 𝒞\mathcal{C}) it is enough to prove that for some cc\in\mathbb{N}, the slice PcP_{c} is 𝒞\mathcal{C}-hard in the classical setting.

Dependence Logic.

We assume basic familiarity with predicate logic [6]. We consider first-order vocabularies τ\tau that are sets of function symbols and relation symbols with an equality symbol ==. Let VAR\mathrm{VAR} be a countably infinite set of first-order variables. Terms over τ\tau are defined in the usual way, and the set of well-formed formulas of first order logic (𝒪\mathcal{FO}) is defined by the following BNF:

ψt1=t2R(t1,,tk)¬R(t1,,tk)ψψψψxψxψ,\psi\Coloneqq t_{1}=t_{2}\mid R(t_{1},\dots,t_{k})\mid\lnot R(t_{1},\dots,t_{k})\mid\psi\land\psi\mid\psi\lor\psi\mid\exists x\psi\mid\forall x\psi,

where tit_{i} are terms 1ik1\leq i\leq k, RR is a kk-ary relation symbol from σ\sigma, kk\in\mathbb{N}, and xVARx\in\mathrm{VAR}. If ψ\psi is a formula, then we use VAR(ψ)\mathrm{VAR}(\psi) for its set of variables, and Fr(ψ)\mathrm{Fr}(\psi) for its set of free variables. We evaluate 𝒪\mathcal{FO}-formulas in τ\tau-structures, which are pairs of the form 𝒜=(A,τ𝒜)\mathcal{A}=(A,\tau^{\mathcal{A}}), where AA is the domain of 𝒜\mathcal{A} (when clear from the context, we write AA instead of dom(𝒜)\mathrm{dom}(\mathcal{A})), and τ𝒜\tau^{\mathcal{A}} interprets the function and relational symbols in the usual way (e.g., t𝒜s=s(x)t^{\mathcal{A}}\langle s\rangle=s(x) if t=xVARt=x\in\mathrm{VAR}). If 𝐭=(t1,,tn)\mathbf{t}=(t_{1},\dots,t_{n}) is a tuple of terms for nn\in\mathbb{N}, then we write 𝐭𝒜s\mathbf{t}^{\mathcal{A}}\langle s\rangle for (t1𝒜s,,tn𝒜s)(t_{1}^{\mathcal{A}}\langle s\rangle,\dots,t_{n}^{\mathcal{A}}\langle s\rangle).

Dependence logic (𝒟\mathcal{D}) extends 𝒪\mathcal{FO} by dependence atoms of the form 𝖽𝖾𝗉(𝐭;𝐮){\mathsf{dep}}({\mathbf{t}};{\mathbf{u}}) where 𝐭\mathbf{t} and 𝐮\mathbf{u} are tuples of terms. The semantics is defined through the concept of a team. Let 𝒜\mathcal{A} be a structure and XVARX\subseteq\mathrm{VAR}, then an assignment ss is a mapping s:XAs\colon X\rightarrow A.

Definition 2.3.

Let XVARX\subseteq\mathrm{VAR}. A team TT in 𝒜\mathcal{A} with domain XX is a set of assignments s:XAs\colon X\to A.

For a team TT with domain XYX\supseteq Y define its restriction to YY as TY{sYsT}T\upharpoonright Y\coloneqq\{\,s\upharpoonright Y\mid s\in T\,\}. If s:XAs\colon X\to A is an assignment and xVARx\in\mathrm{VAR} is a variable, then sax:X{x}As^{x}_{a}\colon X\cup\{x\}\to A is the assignment that maps xx to aa and yX{x}y\in X\setminus\{x\} to s(y)s(y). Let TT be a team in 𝒜\mathcal{A} with domain XX. Then we define f:T𝒫(A){}f\colon T\to\mathcal{P}(A)\setminus\{\emptyset\} as the supplementing function of TT. This is used to extend or modify TT to the supplementing team Tfx{saxsT,af(s)}T^{x}_{f}\coloneqq\{\,s^{x}_{a}\mid s\in T,a\in f(s)\,\}. For the case f(s)=Af(s)=A is the constant function we simply write T𝒜xT^{x}_{\mathcal{A}} for TfxT^{x}_{f}. The semantics of 𝒟\mathcal{D}-formulas is defined as follows.

Definition 2.4.

Let τ\tau be a vocabulary, 𝒜\mathcal{A} be a τ\tau-structure and TT be a team over 𝒜\mathcal{A} with domain XVARX\subseteq\mathrm{VAR}. Then,

(𝒜,T)t1=t2\displaystyle(\mathcal{A},T)\models t_{1}=t_{2} iff sT:t1𝒜s=t2𝒜s\displaystyle\forall s\in T:t_{1}^{\mathcal{A}}\langle s\rangle=t_{2}^{\mathcal{A}}\langle s\rangle
(𝒜,T)R(t1,,tn)\displaystyle(\mathcal{A},T)\models R(t_{1},\ldots,t_{n}) iff sT:(t1𝒜s,,tn𝒜s)R𝒜\displaystyle\forall s\in T:(t_{1}^{\mathcal{A}}\langle s\rangle,\ldots,t_{n}^{\mathcal{A}}\langle s\rangle)\in R^{\mathcal{A}}
(𝒜,T)¬R(t1,,tn)\displaystyle(\mathcal{A},T)\models\neg R(t_{1},\ldots,t_{n}) iff sT:(t1𝒜s,,tn𝒜s)R𝒜\displaystyle\forall s\in T:(t_{1}^{\mathcal{A}}\langle s\rangle,\ldots,t_{n}^{\mathcal{A}}\langle s\rangle)\not\in R^{\mathcal{A}}
(𝒜,T)𝖽𝖾𝗉(𝐭;𝐮)\displaystyle(\mathcal{A},T)\models{\mathsf{dep}}({\mathbf{t}};{\mathbf{u}}) iff s1,s2T:𝐭𝒜s1=𝐭𝒜s2𝐮𝒜s1=𝐮𝒜s2\displaystyle\forall s_{1},s_{2}\in T:\mathbf{t}^{\mathcal{A}}\langle s_{1}\rangle=\mathbf{t}^{\mathcal{A}}\langle s_{2}\rangle\implies\mathbf{u}^{\mathcal{A}}\langle s_{1}\rangle=\mathbf{u}^{\mathcal{A}}\langle s_{2}\rangle
(𝒜,T)ϕ0ϕ1\displaystyle(\mathcal{A},T)\models\phi_{0}\land\phi_{1} iff (𝒜,T)ϕ0 and (𝒜,T)ϕ1\displaystyle(\mathcal{A},T)\models\phi_{0}\quad\text{ and }\quad(\mathcal{A},T)\models\phi_{1}
(𝒜,T)ϕ0ϕ1\displaystyle(\mathcal{A},T)\models\phi_{0}\lor\phi_{1} iff T0T1:T0T1=T and (𝒜,Ti)ϕi for i=0,1\displaystyle\exists T_{0}\exists T_{1}:T_{0}\cup T_{1}=T\quad\text{ and }\quad(\mathcal{A},T_{i})\models\phi_{i}\,\text{ for }i=0,1
(𝒜,T)xϕ\displaystyle(\mathcal{A},T)\models\exists x\phi iff (𝒜,Tfx)ϕ for some f:T𝒫(A){}\displaystyle(\mathcal{A},T^{x}_{f})\models\phi\text{ for some }f\colon T\to\mathcal{P}(A)\setminus\{\emptyset\}
(𝒜,T)xϕ\displaystyle(\mathcal{A},T)\models\forall x\phi iff (𝒜,T𝒜x)ϕ\displaystyle(\mathcal{A},T^{x}_{\mathcal{A}})\models\phi

Notice that we only consider formulas in negation normal form (NNF) as any formula of dependence logic can be transformed into logically equivalent NNF-form. Further note that (𝒜,T)ϕ(\mathcal{A},T)\models\phi for all ϕ\phi when T=T=\emptyset (this is also called the empty team property). Furthermore, 𝒟\mathcal{D}-formulas are local, that is, for a team TT in 𝒜\mathcal{A} over domain XX and a 𝒟\mathcal{D}-formula ϕ\phi, we have that (𝒜,T)ϕ(\mathcal{A},T)\models\phi if and only if (𝒜,TFr(ϕ))ϕ(\mathcal{A},T\upharpoonright\mathrm{Fr}(\phi))\models\phi. Finally, every 𝒟\mathcal{D}-formula ϕ\phi, if (𝒜,T)ϕ(\mathcal{A},T)\models\phi then (𝒜,P)ϕ(\mathcal{A},P)\models\phi for every PTP\subseteq T. This property is known as the downwards closure.

Definition 2.5 (Gaifman graph).

Given a vocabulary τ\tau and a τ\tau-structure 𝒜\mathcal{A}, the Gaifman graph G𝒜=(A,E)G_{\mathcal{A}}=(A,E) of 𝒜\mathcal{A} is defined as

E:={{u,v}|\displaystyle E\mathrel{\mathop{:}}=\big{\{}\,\{u,v\}\;\big{|}\; if there is an Rnτ and 𝐚An with R𝒜(𝐚) and u,v𝐚}.\displaystyle\text{ if there is an }R^{n}\in\tau\text{ and }\mathbf{a}\in A^{n}\text{ with }R^{\mathcal{A}}(\mathbf{a})\text{ and }u,v\in\mathbf{a}\,\big{\}}.

That is, there is a relation RτR\in\tau of arity nn such that uu and vv appear together in R𝒜R^{\mathcal{A}}.

Intuitively, the Gaifman graph of a structure 𝒜\mathcal{A} is an undirected graph with the universe of 𝒜\mathcal{A} as vertices and connects two vertices when they share a tuple in a relation (see also Fig. 2).

Definition 2.6 (Treewidth).

The tree decomposition of a given graph G=(V,E)G=(V,E) is a tree T=(B,ET)T=(B,E_{T}), where the vertex set B𝒫(V)B\subseteq\mathcal{P}(V) is the collection of bags and ETE_{T} is the edge relation such that the following is true.

  • bB=V\bigcup_{b\in B}=V,

  • for every {u,v}E\{u,v\}\in E there is a bag bBb\in B with u,vbu,v\in b, and

  • for all vVv\in V the restriction of TT to vv (the subset with all bags containing vv) is connected.

The width of a given tree decomposition T=(B,ET)T=(B,E_{T}) is the size of the largest bag minus one: maxbB|b|1\max_{b\in B}|b|-1. The treewidth of a given graph GG is the minimum over all widths of tree decompositions of GG.

Observe that if GG is a tree then the treewidth of GG is one. Intuitively, one can say that treewidth accordingly is a measure of tree-likeness of a given graph.

Example 2.7.
Flight Gate Time
FIN-70 C1 09:55
SAS-475 C3 12:25
SAS-476 C2 12:25
FIN-80 C1 19:55
F7C109F819S6C212S5C3
F7,F8,C1F7,C1,9F8,C1,19S5,S6,12S6,12,C2S5,12,C3
Figure 2: An 𝒪\mathcal{FO}-structure 𝒜=(A,S𝒜,R𝒜)\mathcal{A}=(A,S^{\mathcal{A}},R^{\mathcal{A}}) (Left) with the Gaifman graph G𝒜G_{\mathcal{A}} (Middle) and a possible treedecomposition of G𝒜G_{\mathcal{A}} (Right) of Example 2.7. For brevity, universe elements are written in short forms.

Consider the database form our previous example. Recall that the universe AA consists of entries in each row. Let τ={S2,R3}\tau=\{\mathrm{S}^{2},\mathrm{R}^{3}\} include a binary relation S\mathrm{S} (S(x,y):\mathrm{S}(x,y): flights xx and yy are owed by the same company) and a ternary relation R\mathrm{R} (R(x,y,z):\mathrm{R}(x,y,z): the gate xx is reserved by the flight yy at time zz). For simplicity, we only consider first four rows with the corresponding three columns from Table 3, see Figure 2 for an explanation. Since the largest bag size in our decomposition is 33, the treewidth of this decomposition is 22. Furthermore, the presence of cycles of length 33 suggests that there is no better decomposition. As a consequence the given structure has treewidth 22.

The decision problem to determine whether the treewidth of a given graph 𝒢=(V,E)\mathcal{G}=(V,E) is at most kk, is NP-complete [1]. See Bodlaender’s Guide [2] for an overview of algorithms that compute tree decompositions. When considering the parameter treewidth, one usually assumes it as a given value and does not need to compute it. We consider only the model checking problem (MC\mathrm{MC}) and two variants in this paper. First, let us define the most general version.

Problem: 𝖼𝖼\mathsf{cc} (combined complexity of model checking)
Input: a structure 𝒜\mathcal{A}, team TT and a 𝒟\mathcal{D}-formula Φ\Phi.
Question: (𝒜,T)Φ(\mathcal{A},T)\models\Phi?

We further consider the following two variants of the model checking problem.

Problem: 𝖽𝖼\mathsf{dc} (data complexity of model checking, Φ\Phi is fixed)
Input: a structure 𝒜\mathcal{A}, team TT.
Question: (𝒜,T)Φ(\mathcal{A},T)\models\Phi?
Problem: 𝖾𝖼\mathsf{ec} (expression complexity of model checking, 𝒜,T\mathcal{A},T are fixed)
Input: a 𝒟\mathcal{D}-formula Φ\Phi.
Question: (𝒜,T)Φ(\mathcal{A},T)\models\Phi?

List of Parameterizations.

Now let us turn to the parameters that are under investigation in this paper. We study the model checking problem of 𝒟\mathcal{D} under nine various parameters that naturally occur in an MC\mathrm{MC}-instance. Let 𝒜,T,Φ\langle\mathcal{A},T,\Phi\rangle be an instance of MC\mathrm{MC}, where Φ\Phi is a 𝒟\mathcal{D}-formula, 𝒜\mathcal{A} is a structure and TT is a team over 𝒜\mathcal{A}. The parameter #𝗌𝗉𝗅𝗂𝗍𝗌\#\mathsf{splits} denotes the number of occurrences of the split operator (\lor), #\#\forall is the number of universal quantifiers in Φ\Phi. Moreover, #𝗏𝖺𝗋𝗂𝖺𝖻𝗅𝖾𝗌\#\mathsf{variables} (resp., #𝖿𝗋𝖾𝖾-𝗏𝖺𝗋𝗂𝖺𝖻𝗅𝖾𝗌\#\mathsf{free\text{-}variables}) denotes the total number of (free) variables in Φ\Phi. The parameter |Φ||\Phi| is the size of the input formula Φ\Phi, and similarly the two other size parameters are |𝒜||\mathcal{A}| and |T||T|. The treewidth of the structure 𝒜\mathcal{A} (see Def. 2.6) is defined as the treewidth of G𝒜G_{\mathcal{A}} and denoted by 𝗍𝗐(𝒜)\mathsf{tw}(\mathcal{A}). Note that for formulas using the dependence atom 𝖽𝖾𝗉(𝐱;𝐲){\mathsf{dep}}({\mathbf{x}};{\mathbf{y}}), one can translate to a formula using only dependence atoms where |𝐲|=1|\mathbf{y}|=1 (via conjunctions). That is why the arity of a dependence atom 𝖽𝖾𝗉(𝐱;𝐲){\mathsf{dep}}({\mathbf{x}};{\mathbf{y}}) is defined as |𝐱||\mathbf{x}| and 𝖽𝖾𝗉-𝖺𝗋𝗂𝗍𝗒\mathsf{dep\text{-}arity} is the maximum arity of any dependence atom in Φ\Phi.

Let kk be any parameterization and P{𝖽𝖼,𝖾𝖼,𝖼𝖼}P\in\{\mathsf{dc},\mathsf{ec},\mathsf{cc}\}, then by kk-PP we denote the problem PP when parameterized by kk. If more than one parameterization is considered, then we use ‘++’ as a separator and write these parameters in brackets, e.g., (|Φ|+#𝖿𝗋𝖾𝖾-𝗏𝖺𝗋𝗂𝖺𝖻𝗅𝖾𝗌)-𝖽𝖼(|\Phi|+\#\mathsf{free\text{-}variables})\text{-}\mathsf{dc} as the problem 𝖽𝖼\mathsf{dc} with parameterization |Φ|+#𝖿𝗋𝖾𝖾-𝗏𝖺𝗋𝗂𝖺𝖻𝗅𝖾𝗌|\Phi|+\#\mathsf{free\text{-}variables}. Finally, notice that since the formula Φ\Phi is fixed for 𝖽𝖼\mathsf{dc} this implies that |Φ|-𝖽𝖼{|\Phi|}\text{-}\mathsf{dc} is nothing but 𝖽𝖼\mathsf{dc}. That is, bounding the parameter does not make sense for 𝖽𝖼\mathsf{dc} as the problem 𝖽𝖼\mathsf{dc} remains NP-complete.

3 Complexity results

Parameter 𝖼𝖼\mathsf{cc} 𝖽𝖼\mathsf{dc} 𝖾𝖼\mathsf{ec}
#𝗌𝗉𝗅𝗂𝗍𝗌\#\mathsf{splits} paraPSPACE-hL3.14\textbf{para}\textbf{PSPACE}\text{-h}^{L\ref{ec-splits}} paraNPL3.5\textbf{para}\textbf{NP}^{L\ref{dc-many}} paraPSPACE-hL3.14\textbf{para}\textbf{PSPACE}\text{-h}^{L\ref{ec-splits}}
|Φ||\Phi| paraNPL3.16\textbf{para}\textbf{NP}^{L\ref{cc-formula}} paraNPR3.7\textbf{para}\textbf{NP}^{R\ref{rem:dc-formulasize}} FPT3.18\textbf{FPT}^{\ref{ec-formula}}
#𝖿𝗋𝖾𝖾-𝗏𝖺𝗋𝗂𝖺𝖻𝗅𝖾𝗌\#\mathsf{free\text{-}variables} paraNEXPL3.12\textbf{para}\textbf{NEXP}^{L\ref{cc-many}} paraNPL3.5\textbf{para}\textbf{NP}^{L\ref{dc-many}} paraNEXPL3.12\textbf{para}\textbf{NEXP}^{L\ref{cc-many}}
𝗍𝗐(𝒜)\mathsf{tw}(\mathcal{A}) paraNEXPL3.12\textbf{para}\textbf{NEXP}^{L\ref{cc-many}} paraNPP3.4\textbf{para}\textbf{NP}^{P\ref{dc-all}} paraNEXPL3.12\textbf{para}\textbf{NEXP}^{L\ref{cc-many}}
|𝒜|{|\mathcal{A}|} paraNEXPL3.12\textbf{para}\textbf{NEXP}^{L\ref{cc-many}} FPTL3.8\textbf{FPT}^{L\ref{dc-strucsize}} paraNEXPL3.12\textbf{para}\textbf{NEXP}^{L\ref{cc-many}}
|T||T| paraNEXPL3.12\textbf{para}\textbf{NEXP}^{L\ref{cc-many}} paraNPL3.10\textbf{para}\textbf{NP}^{L\ref{dc-teamsize}} paraNEXPL3.12\textbf{para}\textbf{NEXP}^{L\ref{cc-many}}
#\#\forall paraNP-hL3.23\textbf{para}\textbf{NP}\text{-h}^{L\ref{cc-universal}} paraNPL3.5\textbf{para}\textbf{NP}^{L\ref{dc-many}} paraNPL3.20\textbf{para}\textbf{NP}^{L\ref{ec-universal}}
𝖽𝖾𝗉-𝖺𝗋𝗂𝗍𝗒\mathsf{dep\text{-}arity} paraPSPACE-hL3.28\textbf{para}\textbf{PSPACE}\text{-h}^{L\ref{cc-arity}} paraNPL3.5\textbf{para}\textbf{NP}^{L\ref{dc-many}} paraPSPACEL3.25\textbf{para}\textbf{PSPACE}^{L\ref{ec-arity}}
#𝗏𝖺𝗋𝗂𝖺𝖻𝗅𝖾𝗌\#\mathsf{variables} paraNPL3.31\textbf{para}\textbf{NP}^{L\ref{cc-variables}} paraNPL3.5\textbf{para}\textbf{NP}^{L\ref{dc-many}} FPTL3.33\textbf{FPT}^{L\ref{ec-variables}}
Table 2: Complexity classification overview. A suffix -h represents the hardness result, whereas other results are completeness. The numbers in the exponent point to the corresponding result (LxLx means Lemma xx, PxPx means Proposition xx, RxRx means Remark xx). Fig. 3 on page 3 is a graphical presentation of this table with a different angle.

We begin by proving relationships between various parameterizations.

Lemma 3.1.

The following relations among parameters hold.

  1. 1.

    |Φ|k|\Phi|\geq k for any k{#𝗌𝗉𝗅𝗂𝗍𝗌,#,𝖽𝖾𝗉-𝖺𝗋𝗂𝗍𝗒,#𝖿𝗋𝖾𝖾-𝗏𝖺𝗋𝗂𝖺𝖻𝗅𝖾𝗌,#𝗏𝖺𝗋𝗂𝖺𝖻𝗅𝖾𝗌}k\in\{\,\#\mathsf{splits},\#\forall,\mathsf{dep\text{-}arity},\#\mathsf{free\text{-}variables},\#\mathsf{variables}\,\},

  2. 2.

    |𝒜|𝗍𝗐(𝒜){|\mathcal{A}|}\geq\mathsf{tw}(\mathcal{A}). Moreover, for 𝖽𝖼\mathsf{dc}, |𝒜|O(1)|T|{|\mathcal{A}|}^{O(1)}\geq|T|,

  3. 3.

    For 𝖾𝖼\mathsf{ec}, #𝖿𝗋𝖾𝖾-𝗏𝖺𝗋𝗂𝖺𝖻𝗅𝖾𝗌\#\mathsf{free\text{-}variables} is constant.

Proof 3.2.
  1. 1.

    Clearly, the size of the formula limits all parts of it including the parameters mentioned in the list.

  2. 2.

    Notice that for data complexity, the formula Φ\Phi and consequently the number of free variables in Φ\Phi is fixed. Moreover, due to locality of 𝒟\mathcal{D} it holds that TArT\subseteq A^{r}, where rr is the number of free variables in Φ\Phi. That is, the team TT can be considered only over the free variables of Φ\Phi. This implies that teamsize is polynomially bounded by the universe size, as |T||𝒜|r|T|\leq|\mathcal{A}|^{r}. Finally, the result for 𝗍𝗐(𝒜)\mathsf{tw}(\mathcal{A}) follows due to Definition 2.6. This is due to the reason that in the worst case all universe elements belong to one bag in the decomposition and 𝗍𝗐(𝒜)=|𝒜|1\mathsf{tw}(\mathcal{A})=|\mathcal{A}|-1.

  3. 3.

    Notice that the team TT is fixed in 𝖾𝖼\mathsf{ec}. Together with the locality of 𝒟\mathcal{D}-formulas (see Def. 2.4), this implies that the domain of TT (which is same as the set of free variables in the formula Φ\Phi) is also fixed and as a result, of constant size.

Remark 3.3.

If the number of free variables (#𝖿𝗋𝖾𝖾-𝗏𝖺𝗋𝗂𝖺𝖻𝗅𝖾𝗌\#\mathsf{free\text{-}variables}) in a formula Φ\Phi is bounded then the total number of variables (#𝗏𝖺𝗋𝗂𝖺𝖻𝗅𝖾𝗌\#\mathsf{variables}) is not necessarily bounded, on the other hand, bounding #𝗏𝖺𝗋𝗂𝖺𝖻𝗅𝖾𝗌\#\mathsf{variables} also bounds #𝖿𝗋𝖾𝖾-𝗏𝖺𝗋𝗂𝖺𝖻𝗅𝖾𝗌\#\mathsf{free\text{-}variables}.

3.1 Data complexity (𝖽𝖼\mathsf{dc})

Classically, the data complexity of model checking for a fixed 𝒟\mathcal{D}-formula Φ\Phi is NP-complete [27].

Proposition 3.4.

For a fixed formula, the problem whether an input structure 𝒜\mathcal{A} and a team TT satisfies the formula is NP-complete. That is, the data complexity of dependence logic is NP-complete.

In this section we prove that none of the considered parameter lowers this complexity, except |𝒜||\mathcal{A}|. The proof relies on the fact that the complexity of model checking for already a very simple formula (see below) is NP-complete.

Lemma 3.5.

Let k{#𝗌𝗉𝗅𝗂𝗍𝗌,#𝖿𝗋𝖾𝖾-𝗏𝖺𝗋𝗂𝖺𝖻𝗅𝖾𝗌,#𝗏𝖺𝗋𝗂𝖺𝖻𝗅𝖾𝗌,#,𝖽𝖾𝗉-𝖺𝗋𝗂𝗍𝗒,𝗍𝗐(𝒜)}k\in\{\#\mathsf{splits},\#\mathsf{free\text{-}variables},\#\mathsf{variables},\#\forall,\mathsf{dep\text{-}arity},\mathsf{tw}(\mathcal{A})\}. Then the problem k-𝖽𝖼{k}\text{-}\mathsf{dc}, is paraNP-c.

Proof 3.6.

The upper bound follows from Proposition 3.4. Kontinen [16, Theorem 4.9] proves that the data complexity for a fixed 𝒟\mathcal{D}-formula of the form 𝖽𝖾𝗉(x;y)𝖽𝖾𝗉(u;v)𝖽𝖾𝗉(u;v){\mathsf{dep}}({x};{y})\lor{\mathsf{dep}}({u};{v})\lor{\mathsf{dep}}({u};{v}) is already NP-complete. For clearity, we briefly sketch the reduction presented by Kontinen [16]. Let ϕ=im(i,1i,2i,3)\phi=\bigwedge\limits_{i\leq m}(\ell_{i,1}\lor\ell_{i,2}\lor\ell_{i,3}) be an instance of 3-SAT3\text{-}\mathrm{SAT}. Consider the structure 𝒜\mathcal{A} over the empty vocabulary, that is, τ=\tau=\emptyset. Let A=Var(ϕ){0,1,,m}A=\mathrm{Var}(\phi)\cup\{0,1,\ldots,m\}. The team TT is constructed over variables {x,y,u,v}\{x,y,u,v\} that take values from AA. As an example, the clause (p1¬p2¬p3)(p_{1}\lor\neg p_{2}\lor\neg p_{3}) gives rise to assignments in Table 3.

x=x= ‘variable’ y=y= ‘parity’ u=u= ‘clause’ v=v= ‘position’
p1p_{1} 11 11 0
p2p_{2} 0 11 11
p3p_{3} 0 11 22
Table 3: An example team for (p1¬p2¬p3)(p_{1}\lor\neg p_{2}\lor\neg p_{3})

Notice that, a truth assignment θ\theta for ϕ\phi is constructed using the division of TT according to each split. That is, T𝖽𝖾𝗉(x;y)𝖽𝖾𝗉(u;v)𝖽𝖾𝗉(u;v)T\models{\mathsf{dep}}({x};{y})\lor{\mathsf{dep}}({u};{v})\lor{\mathsf{dep}}({u};{v}) if and only if P0,P1,P2\exists P_{0},P_{1},P_{2} such that iPi=T\cup_{i}P_{i}=T for i2i\leq 2 and each PiP_{i} satisfies iith dependence atom. Let P0P_{0} be such that P0𝖽𝖾𝗉(x;y)P_{0}\models{\mathsf{dep}}({x};{y}), then we let θ(pj)=1sP, s.t. s(x)=pj\theta(p_{j})=1\iff\exists s\in P,\text{ s.t. }s(x)=p_{j} and s(y)=1s(y)=1. That is, one literal in each clause must be chosen in such a way that satisfies this clause, whereas, the remaining two literals per each clause are allowed to take values that does not satisfy it. As a consequence, each clause is satisfied by the variables chosen in this way, which proves correctness.

This implies that the 22-slice (for #𝗌𝗉𝗅𝗂𝗍𝗌-𝖽𝖼{\#\mathsf{splits}}\text{-}\mathsf{dc}), 44-slice (for #𝖿𝗋𝖾𝖾-𝗏𝖺𝗋𝗂𝖺𝖻𝗅𝖾𝗌-𝖽𝖼{\#\mathsf{free\text{-}variables}}\text{-}\mathsf{dc} as well as #𝗏𝖺𝗋𝗂𝖺𝖻𝗅𝖾𝗌-𝖽𝖼{\#\mathsf{variables}}\text{-}\mathsf{dc}), 0-slice (for #-𝖽𝖼{\#\forall}\text{-}\mathsf{dc}), and 11-slice (for 𝖽𝖾𝗉-𝖺𝗋𝗂𝗍𝗒-𝖽𝖼{\mathsf{dep\text{-}arity}}\text{-}\mathsf{dc}) are NP-complete. Consequently, the paraNP-hardness for these cases follow. Finally, the case for 𝗍𝗐(𝒜)\mathsf{tw}(\mathcal{A}) also follows due to the reason that the vocabulary of the reduced structure is empty. As a consequence, our definition 2.6 yields a tree decomposition of width 11 trivially as no elements of the universe are related.

This completes the proof to our lemma.

Remark 3.7.

Recall that |Φ||\Phi| as a parameter for 𝖽𝖼\mathsf{dc} does not make sense as the input consists of 𝒜,T\langle\mathcal{A},T\rangle. That is, the formula Φ\Phi is already fixed which is stronger than fixing the size of Φ\Phi.

We now prove the only tractable case for the data complexity.

Lemma 3.8.

|𝒜|-𝖽𝖼FPT{{|\mathcal{A}|}}\text{-}\mathsf{dc}\in\textbf{FPT}.

Proof 3.9.

Notice first that restricting the universe size |𝒜||\mathcal{A}| polynomially bounds the teamsize |T||T|, due to Lemma 3.1. This implies that the size of whole input is (polynomially) bounded by the parameter |𝒜||\mathcal{A}|. The result follows trivially because any PP PP is FPT when the input size is bounded by the parameter [8].

Lemma 3.10.

|T|-𝖽𝖼{|T|}\text{-}\mathsf{dc} is paraNP-complete.

Proof 3.11.

For a fixed sentence Φ𝒟\Phi\in\mathcal{D} (that is, with no free variables) and for all models 𝒜\mathcal{A} and team TT we have that (𝒜,T)Φ(𝒜,{})Φ(\mathcal{A},T)\models\Phi\iff(\mathcal{A},\{\emptyset\})\models\Phi. As a result, the problem FPT\leq^{\textbf{FPT}}-reduces to the model checking problem with |T|=1|T|=1. Consequently, 1-slice of |T|-𝖽𝖼{|T|}\text{-}\mathsf{dc} is NP-complete because model checking for a fixed 𝒟\mathcal{D}-sentence is also NP-complete [27]. This gives paraNP-hardness.

For the membership, note that given a structure 𝒜\mathcal{A} and a team TT then for a fixed formula Φ\Phi the question whether (𝒜,T)Φ(\mathcal{A},T)\models\Phi is in NP. Consequently, giving paraNP-membership.

A comparison with the propositional dependence logic (𝒫𝒟\mathcal{PDL}) at this point might be interesting. If the formula size is a parameter then the model checking for 𝒫𝒟\mathcal{PDL} can be solved in FPT-time [20]. However, this is not the case for 𝒟\mathcal{D} even if the formula is fixed in advance.

3.2 Expression and Combined Complexity (𝖾𝖼,𝖼𝖼\mathsf{ec},\mathsf{cc})

Now we turn towards the expression and combined complexity of model checking for 𝒟\mathcal{D}. Here again, in most cases the problem is still intractable for the combined complexity. However, expression complexity when parameterized by the formula size (|Φ||\Phi|) and the total number of variables (#𝗏𝖺𝗋𝗂𝖺𝖻𝗅𝖾𝗌\#\mathsf{variables}) yields membership in FPT. Similar to the previous section, we first present results that directly translate from the known reductions for proving the NEXP-completeness for 𝒟\mathcal{D}.

Lemma 3.12.

Let k{|𝒜|,𝗍𝗐(𝒜),|T|,#𝖿𝗋𝖾𝖾-𝗏𝖺𝗋𝗂𝖺𝖻𝗅𝖾𝗌}k\in\{\,{|\mathcal{A}|},\mathsf{tw}(\mathcal{A}),|T|,\#\mathsf{free\text{-}variables}\,\}. Then both k-𝖼𝖼{k}\text{-}\mathsf{cc} and k-𝖾𝖼{k}\text{-}\mathsf{ec} are paraNEXP-complete.

Proof 3.13.

In the classical setting, NEXP-completeness of the expression and the combined complexity for 𝒟\mathcal{D} was shown by Grädel [12, Theorem 5.1]. This immediately gives membership in paraNEXP. Interestingly, the universe in the reduction consists of {0,1}\{0,1\} with empty vocabulary and the formula obtained is a 𝒟\mathcal{D}-sentence. This implies that 22-slice (for |𝒜||\mathcal{A}|), 11-slice (for 𝗍𝗐(𝒜)\mathsf{tw}(\mathcal{A})), 11-slice (for |T||T|), and 0-slice (for the number of free variables) are NEXP-complete. As a consequence, paraNEXP-hardness for the mentioned cases follows and this completes the proof.

For the number of splits as a parameterization, we only know that this is also highly intractable, with the precise complexity open for now.

Lemma 3.14.

#𝗌𝗉𝗅𝗂𝗍𝗌-𝖾𝖼{\#\mathsf{splits}}\text{-}\mathsf{ec} and #𝗌𝗉𝗅𝗂𝗍𝗌-𝖼𝖼{\#\mathsf{splits}}\text{-}\mathsf{cc} are both paraPSPACE-h.

Proof 3.15.

Consider the equivalence of {,,}-𝒪-MC\{\exists,\forall,\land\}\text{-}\mathcal{FO}\text{-}\mathrm{MC} to quantified constraint satisfaction problem (QCSP) [22, p. 418]. That is, the fragment of 𝒪\mathcal{FO} with only operations in {,,}\{\exists,\forall,\land\} allowed. Then QCSP asks, whether the conjunction of quantified constraints (𝒪\mathcal{FO}-relations) is true in a fixed 𝒪\mathcal{FO}-structure 𝒜\mathcal{A}. This implies that already in the absence of a split operator (even when there are no dependence atoms), the model checking problem is PSPACE-hard. Consequently, the mentioned results follow.

The formula size as a parameter presents varying behaviour depending upon if we consider the expression or the combined complexity.

Lemma 3.16.

|Φ|-𝖼𝖼{|\Phi|}\text{-}\mathsf{cc} is paraNP-complete.

Proof 3.17.

Notice that, due to Lemma 3.1, the size kk of a formula Φ\Phi also bounds the maximum number of free variables in any subformula of Φ\Phi. This gives the membership in conjunction with [12, Theorem 5.1]. That is, the combined complexity of 𝒟\mathcal{D} is NP-complete if maximum number of free variables in any subformuala of Φ\Phi is fixed. The lower bound follows because of the construction by Kontinen [16] (see also Lemma 3.5) since for a fixed formula (of fixed size), the problem is already NP-complete.

Lemma 3.18.

|Φ|-𝖾𝖼{|\Phi|}\text{-}\mathsf{ec} is in FPT.

Proof 3.19.

Recall that in expression complexity, the team TT and the structure 𝒜\mathcal{A} are fixed. Whereas, the size of the input formula Φ\Phi is a parameter. The result follows trivially because any PP PP is FPT when the input size is bounded by the parameter.

The expression complexity regarding the number of universal quantifiers as a parameter drops down to paraNP-completeness, which is still intractable but much lower than paraNEXP-completeness. However, regarding the combined complexity we can only prove the membership in XNP, with paraNP-lower bound.

Lemma 3.20.

#-𝖾𝖼{\#\forall}\text{-}\mathsf{ec} is paraNP-complete.

Proof 3.21.

We first prove the lower bound through a reduction form the satisfiability problem for propositional dependence logic (𝒫𝒟\mathcal{PDL}). That is, given a 𝒫𝒟\mathcal{PDL}-formula ϕ\phi, whether there is a team TT such that TϕT\models\phi? Let ϕ\phi be a 𝒫𝒟\mathcal{PDL}-formula over propositional variables p1,,pnp_{1},\ldots,p_{n}. For ini\leq n, let xix_{i} denote a variable corresponding to the proposition pip_{i}. Let 𝒜={0,1}\mathcal{A}=\{0,1\} be the structure over empty vocabulary. Clearly ϕ\phi is satisfiable iff p1pnϕ\exists p_{1}\ldots\exists p_{n}\phi is satisfiable iff (𝒜,{})x1xnϕ(\mathcal{A},\{\emptyset\})\models\exists x_{1}\ldots\exists x_{n}\phi^{\prime}, where ϕ\phi^{\prime} is a 𝒟\mathcal{D}-formula obtained from ϕ\phi by simply replacing each proposition pip_{i} by the variable xix_{i}. Notice that the reduced formula does not have any universal quantifier, that is #(ϕ)=0\#\forall(\phi^{\prime})=0. This gives paraNP-hardness since the satisfiability for 𝒫𝒟\mathcal{PDL} is NP-complete [18].

For membership, notice that a 𝒟\mathcal{D}-sentence Φ\Phi with kk universal quantifiers can be reduced in P-time to an 𝒮𝒪\mathcal{ESO}-sentence Ψ\Psi of the form f1frx1xkψ\exists f_{1}\ldots\exists f_{r}\forall x_{1}\ldots\forall x_{k}\psi [5, Cor. 3.9], where ψ\psi is a quantifier free 𝒪\mathcal{FO}-formula, rr\in\mathbb{N}, and each function symbol fif_{i} is at most kk-ary for 1ir1\leq i\leq r. Finally, (𝒜,{})Φ𝒜f1frx1xkψ(\mathcal{A},\{\emptyset\})\models\Phi\iff\mathcal{A}\models\bigvee\limits_{f_{1}}\ldots\bigvee\limits_{f_{r}}\forall x_{1}\ldots\forall x_{k}\psi^{\prime}. Where the latter question can be solved by guessing an interpretation for each function symbol fif_{i} and iri\leq r. This requires r|𝒜|kr\cdot|\mathcal{A}|^{k} guessing steps, and can be achieved in paraNP-time for a fixed structure 𝒜\mathcal{A} (as we consider expression complexity). Consequently, the membership in paraNP follows. Notice that the arity of function symbols in the paraNP-membership above is bounded by kk if Φ\Phi is a 𝒟\mathcal{D}-sentence. However, if Φ\Phi is a 𝒟\mathcal{D}-formulas with mm free variables then the arity of function symbols as well as the number of universal quantifiers in the reduction, both are bounded by k+mk+m where k=#(Φ)k=\#\forall(\Phi) and m=#𝖿𝗋𝖾𝖾-𝗏𝖺𝗋𝗂𝖺𝖻𝗅𝖾𝗌(Φ)m=\#\mathsf{free\text{-}variables}(\Phi). Nevertheless, recall that for 𝖾𝖼\mathsf{ec}, the team is also fixed. Moreover, due to Lemma 3.1 the collection of free variables in Φ\Phi has constant size. This implies that the reduction above provides an 𝒮𝒪\mathcal{ESO}-sentence with k+mk+m universal quantifiers as well as function symbols of arity k+mk+m at most. Finally, guessing the interpretation for functions still takes paraNP-steps (because mm is constant) and consequently, we get paraNP-membership for open formulas as well.

The following corollary immediately follows from the proof above.

Corollary 3.22.

(#+#𝖿𝗋𝖾𝖾-𝗏𝖺𝗋𝗂𝖺𝖻𝗅𝖾𝗌)-𝖾𝖼{(\#\forall+\#\mathsf{free\text{-}variables})}\text{-}\mathsf{ec} is paraNP-complete.

Lemma 3.23.

#-𝖼𝖼{\#\forall}\text{-}\mathsf{cc} is paraNP-hard. Moreover, for sentences of 𝒟\mathcal{D}, #-𝖼𝖼{\#\forall}\text{-}\mathsf{cc} is in XNP.

Proof 3.24.

The paraNP-lower bound follows due to the fact that the expression complexity of 𝒟\mathcal{D} is already paraNP-complete when parameterized by #\#\forall (Lemma 3.20).

For sentences, similar to the proof in Lemma 3.20, a 𝒟\mathcal{D}-sentence Φ\Phi can be translated to an equivalent 𝒮𝒪\mathcal{ESO}-sentence Ψ\Psi in polynomial time. However, if the structure is not fixed as for expression complexity, then the computation of interpretations for functions can no longer be done in paraNP-time, but requires non-deterministic |𝒜|k|\mathcal{A}|^{k}-time for each guessed function, where k=#k=\#\forall. Consequently, we reach only membership in XNP for sentences.

For open formulas, we do not know if #-𝖼𝖼{\#\forall}\text{-}\mathsf{cc} is also in XNP. Our proof technique does not immediately settle this case as the team is not fixed for 𝖼𝖼\mathsf{cc}.

Similar to the case of universal quantifiers, the arity as a parameter also reduces the complexity but not as much as the universal quantifiers. Moreover, the precise combined complexity when parameterized by the arity is also open.

Lemma 3.25.

𝖽𝖾𝗉-𝖺𝗋𝗂𝗍𝗒-𝖾𝖼{\mathsf{dep\text{-}arity}}\text{-}\mathsf{ec} is paraPSPACE-complete.

Proof 3.26.

Notice that a 𝒟\mathcal{D}-sentence Φ\Phi with kk-ary dependence atoms can be reduced in P-time to an 𝒮𝒪\mathcal{ESO}-sentence Ψ\Psi of the form f1frψ\exists f_{1}\ldots\exists f_{r}\psi [5, Thm. 3.3], where ψ\psi is an 𝒪\mathcal{FO}-formula and each function symbol fif_{i} is at most kk-ary for 1ir1\leq i\leq r. Finally, 𝒜Φ𝒜f1frψ\mathcal{A}\models\Phi\iff\mathcal{A}\models\bigvee\limits_{f_{1}}\ldots\bigvee\limits_{f_{r}}\psi^{\prime}. That is, one needs to guess the interpretation for each function symbol fif_{i}, which can be done in paraNP-time. Finally, evaluating an 𝒪\mathcal{FO}-formula ψ\psi^{\prime} for a fixed structure 𝒜\mathcal{A} can be done in PSPACE-time. This yields membership in paraPSPACE. Moreover, if Φ\Phi is an open 𝒟\mathcal{D}-formula then the result follows due to a similar discussion as in the prof of Lemma 3.20.

For hardness, notice that the expression complexity of 𝒪\mathcal{FO} is PSPACE-complete. This implies that already in the absence of any dependence atoms, the complexity remains PSPACE-hard, as a consequence, the 0-slice of 𝖽𝖾𝗉-𝖺𝗋𝗂𝗍𝗒-𝖾𝖼{\mathsf{dep\text{-}arity}}\text{-}\mathsf{ec} is PSPACE-hard.

This proves the desired result.

The combination (𝖽𝖾𝗉-𝖺𝗋𝗂𝗍𝗒+#𝖿𝗋𝖾𝖾-𝗏𝖺𝗋𝗂𝖺𝖻𝗅𝖾𝗌\mathsf{dep\text{-}arity}+\#\mathsf{free\text{-}variables}) also does not lower the expression complexity as discussed before in the case of #\#\forall.

Corollary 3.27.

(𝖽𝖾𝗉-𝖺𝗋𝗂𝗍𝗒+#𝖿𝗋𝖾𝖾-𝗏𝖺𝗋𝗂𝖺𝖻𝗅𝖾𝗌)-𝖾𝖼{(\mathsf{dep\text{-}arity}+\#\mathsf{free\text{-}variables})}\text{-}\mathsf{ec} is paraPSPACE-complete.

Lemma 3.28.

𝖽𝖾𝗉-𝖺𝗋𝗂𝗍𝗒-𝖼𝖼{\mathsf{dep\text{-}arity}}\text{-}\mathsf{cc} is paraPSPACE-hard.

Proof 3.29.

Consider the fragment of 𝒟\mathcal{D} with only dependence atoms of the form 𝖽𝖾𝗉(;x){\mathsf{dep}}({};{x}), the so-called constancy logic. The combined complexity of constancy logic is PSPACE-complete [12, Theorem 5.3]. This implies that the 0-slice of 𝖽𝖾𝗉-𝖺𝗋𝗂𝗍𝗒-𝖼𝖼{\mathsf{dep\text{-}arity}}\text{-}\mathsf{cc} is PSPACE-hard, proving the result.

The combined complexity of model checking for constancy logic is PSPACE [12, Thm. 5.3]. Aiming for an paraPSPACE-upper bound via squeezing the fixed arity of dependence atoms (in some way) into constancy atoms is unlikely to happen as 𝒟\mathcal{D} captures 𝒮𝒪\mathcal{ESO} whereas constancy logic for sentences (and also open formulas) collapses to 𝒪\mathcal{FO} [10].

Notice that a similar reduction as in the proof of Lemma 3.20 holds from 𝒫\mathcal{PL}, in which both parameters (#\#\forall and 𝖽𝖾𝗉-𝖺𝗋𝗂𝗍𝗒\mathsf{dep\text{-}arity}) are bounded. This implies that there is no hope for tractability even when both parameters are considered together. That is, the complexity of expression complexity remains paraNP-complete when parameterized by the combination of parameters (#\#\forall, 𝖽𝖾𝗉-𝖺𝗋𝗂𝗍𝗒\mathsf{dep\text{-}arity}).

Corollary 3.30.

(#+𝖽𝖾𝗉-𝖺𝗋𝗂𝗍𝗒)-𝖾𝖼{(\#\forall+\mathsf{dep\text{-}arity})}\text{-}\mathsf{ec} is also paraNP-complete.

Finally, for the parameter total number of variables, the expression complexity drops to FPT whereas, the combined complexity drops to paraNP-completeness. The case of expression complexity is particularly interesting. This is due to the reason that it was posed as an open question in [28] whether the expression complexity of the fixed variable fragment of dependence logic (𝒟k\mathcal{D}^{k}) is NP-complete similar to the case of the combined complexity therein. We answer this negatively by stating FPT-membership for #𝗏𝖺𝗋𝗂𝖺𝖻𝗅𝖾𝗌-𝖾𝖼{\#\mathsf{variables}}\text{-}\mathsf{ec}, which as a corollary proves that the expression complexity of 𝒟k\mathcal{D}^{k} is in P for each k1k\geq 1.

Lemma 3.31.

#𝗏𝖺𝗋𝗂𝖺𝖻𝗅𝖾𝗌-𝖼𝖼{\#\mathsf{variables}}\text{-}\mathsf{cc} is paraNP-complete.

Proof 3.32.

Notice that if the total number of variables in Φ\Phi is fixed, then the number of free variables in any subformula ψ\psi of Φ\Phi is also fixed. This implies the membership in paraNP due to [12, Theorem 5.1]. On the other hand, by [28, Theorem 3.9.6] we know that the combined complexity of 𝒟k\mathcal{D}^{k} is NP-complete. This implies that for each kk, the kk-slice of the problem is NP-hard. This gives the desired lower bound.

Lemma 3.33.

#𝗏𝖺𝗋𝗂𝖺𝖻𝗅𝖾𝗌-𝖾𝖼{\#\mathsf{variables}}\text{-}\mathsf{ec} is FPT.

Proof 3.34.

Given a formula Φ\Phi of dependence logic with kk variables, we can construct an equivalent formula Ψ\Psi of 𝒮𝒪k+1\mathcal{ESO}^{k+1} in polynomial time [28, Theorem 3.3.17]. Moreover, since the structure 𝒜\mathcal{A} is fixed, there exists a reduction of Ψ\Psi to an 𝒪\mathcal{FO}-formula ψ\psi with k+1k+1 variables (big disjunction on the universe elements for each second order existential quantifier). Finally, the model checking for 𝒪\mathcal{FO}-formulas with kk variables is solvable in time O(|ψ||A|k)O(|\psi|\cdot|A|^{k}) [17, Prop 6.6]. This implies the membership in FPT.

Corollary 3.35.

The expression complexity of 𝒟k\mathcal{D}^{k} is in P for every k1k\geq 1.

Proof 3.36.

Since both, the number of variables and the universe size is fixed. The runtime of the form O(|ψ||A|k)O(|\psi|\cdot|A|^{k}) in Lemma 3.33 implies membership in P.

4 Conclusion

data paraNPFPTcombinedparaNEXPparaPSPACEparaNP expression paraNEXPparaPSPACEparaNPFPTFPTstructural𝗍𝗐(𝒜)\mathsf{tw}(\mathcal{A})𝖽𝖾𝗉-𝖺𝗋𝗂𝗍𝗒\mathsf{dep\text{-}arity}quantitative#𝖿𝗋𝖾𝖾-𝗏𝖺𝗋𝗂𝖺𝖻𝗅𝖾𝗌\#\mathsf{free\text{-}variables}#𝗌𝗉𝗅𝗂𝗍𝗌\#\mathsf{splits}#\#\forall#𝗏𝖺𝗋𝗂𝖺𝖻𝗅𝖾𝗌\#\mathsf{variables}size|T||T||𝒜|{|\mathcal{A}|}|Φ||\Phi|
Figure 3: Complexity classification overview for model checking problem of dependence logic, that takes grouping of parameters (quantitative, size, structural) and complexity classes into account.

In this paper, we started the parameterized complexity classification of model checking for dependence logic 𝒟\mathcal{D} with respect to nine different parameters (see Table 2 for an overview of the results). In Fig. 3 we depict a different kind of presentation of our results that also takes the grouping of parameters into quantitative, size related, and structural into account. The data complexity of 𝒟\mathcal{D} shows a dichotomy (FPT vs./ paraNP-complete), where surprisingly there is only one case (|𝒜|{|\mathcal{A}|}) where one can reach FPT. This is even more surprising in the light of the fact that the expression (𝖾𝖼\mathsf{ec} and the combined (𝖼𝖼\mathsf{cc}) complexities under the same parameter are still highly intractable. Furthermore, there are parameters when 𝖼𝖼\mathsf{cc} and 𝖾𝖼\mathsf{ec} vary in the complexity (#𝗏𝖺𝗋𝗂𝖺𝖻𝗅𝖾𝗌\#\mathsf{variables}). The combined complexity of 𝒟\mathcal{D} stays intractable under any of the investigated parameterizations. It might be interesting to study combination of parameters and see their joint effect on the complexity (yet, Corollaries 3.223.273.30 tackle already some cases).

We want to close this presentation with some further questions:

  • What other parameters could be meaningful (e.g., number of conjunction, number of existential quantifiers, treewidth of the formula)?

  • What is the exact complexity of #\#\forall-𝖼𝖼\mathsf{cc}, #𝗌𝗉𝗅𝗂𝗍𝗌\#\mathsf{splits}-𝖾𝖼\mathsf{ec}/-𝖼𝖼\mathsf{cc}, 𝖽𝖾𝗉-𝖺𝗋𝗂𝗍𝗒\mathsf{dep\text{-}arity}-𝖼𝖼\mathsf{cc}?

  • The parameterized complexity analysis for other team-based logics, such as independence logic and inclusion logic.

References

  • [1] Stefan Arnborg, Derek G. Corneil, and Andrzej Proskurowski. Complexity of finding embeddings in a kk-tree. SIAM Journal on Algebraic Discrete Methods, 2(8):277––284, 1987. doi:10.1137/0608024.
  • [2] Hans L. Bodlaender. A tourist guide through treewidth. Acta Cybern., 11(1-2):1–21, 1993.
  • [3] Hans L. Bodlaender. Discovering treewidth. In SOFSEM, volume 3381 of Lecture Notes in Computer Science, pages 1–16. Springer, 2005.
  • [4] Rodney G. Downey and Michael R. Fellows. Fundamentals of Parameterized Complexity. Texts in Computer Science. Springer, 2013. doi:10.1007/978-1-4471-5559-1.
  • [5] Arnaud Durand and Juha Kontinen. Hierarchies in dependence logic. ACM Transactions on Computational Logic (TOCL), 13(4):31, 2012. doi:10.1145/2362355.2362359.
  • [6] Heinz-Dieter Ebbinghaus and Jörg Flum. Finite model theory. Perspectives in Mathematical Logic. Springer, 1995.
  • [7] Michael Elberfeld, Christoph Stockhusen, and Till Tantau. On the space and circuit complexity of parameterized problems: Classes and completeness. Algorithmica, 71(3):661–701, 2015.
  • [8] Jörg Flum and Martin Grohe. Parameterized Complexity Theory. Texts in Theoretical Computer Science. An EATCS Series. Springer, 2006. doi:10.1007/3-540-29953-X.
  • [9] Pietro Galliani. Inclusion and exclusion dependencies in team semantics: On some logics of imperfect information. Annals of Pure and Applied Logic, 163(1):68 – 84, 2012. doi:10.1016/j.apal.2011.08.005.
  • [10] Pietro Galliani. On strongly first-order dependencies. In Dependence Logic, pages 53–71. Springer, 2016.
  • [11] Pietro Galliani and Lauri Hella. Inclusion Logic and Fixed Point Logic. In Simona Ronchi Della Rocca, editor, Computer Science Logic 2013 (CSL 2013), volume 23 of Leibniz International Proceedings in Informatics (LIPIcs), pages 281–295, Dagstuhl, Germany, 2013. Schloss Dagstuhl–Leibniz-Zentrum fuer Informatik. URL: http://drops.dagstuhl.de/opus/volltexte/2013/4203, doi:10.4230/LIPIcs.CSL.2013.281.
  • [12] Erich Grädel. Model-checking games for logics of imperfect information. Theor. Comput. Sci., 493:2–14, 2013. doi:10.1016/j.tcs.2012.10.033.
  • [13] Erich Grädel and Jouko Väänänen. Dependence and independence. Studia Logica, 101(2):399–410, 2013. doi:10.1007/s11225-013-9479-2.
  • [14] Miika Hannula, Juha Kontinen, Jan Van den Bussche, and Jonni Virtema. Descriptive complexity of real computation and probabilistic independence logic. In LICS, pages 550–563. ACM, 2020.
  • [15] Miika Hannula, Juha Kontinen, Jonni Virtema, and Heribert Vollmer. Complexity of propositional logics in team semantic. ACM Trans. Comput. Log., 19(1):2:1–2:14, 2018.
  • [16] Jarmo Kontinen. Coherence and computational complexity of quantifier-free dependence logic formulas. Studia Logica, 101(2):267–291, 2013. doi:10.1007/s11225-013-9481-8.
  • [17] Leonid Libkin. Elements of Finite Model Theory. Texts in Theoretical Computer Science. An EATCS Series. Springer, 2004. URL: http://www.cs.toronto.edu/%7Elibkin/fmt, doi:10.1007/978-3-662-07003-1.
  • [18] Peter Lohmann and Heribert Vollmer. Complexity results for modal dependence logic. Stud Logica, 101(2):343–366, 2013. doi:10.1007/s11225-013-9483-6.
  • [19] Martin Lück. Canonical models and the complexity of modal team logic. Log. Methods Comput. Sci., 15(2), 2019.
  • [20] Yasir Mahmood and Arne Meier. Parameterised complexity of model checking and satisfiability in propositional dependence logic. In Foundations of Information and Knowledge Systems - 11th International Symposium, FoIKS 2020, Dortmund, Germany, February 17-21, 2020, Proceedings, pages 157–174, 2020. doi:10.1007/978-3-030-39951-1\_10.
  • [21] Yasir Mahmood and Jonni Virtema. Parameterised complexity of propositional logic in team semantics. CoRR, abs/2105.14887, 2021.
  • [22] Barnaby Martin. First-order model checking problems parameterized by the model. In CiE, volume 5028 of Lecture Notes in Computer Science, pages 417–427. Springer, 2008.
  • [23] Arne Meier and Christian Reinbold. Enumeration complexity of poor man’s propositional dependence logic. In FoIKS, volume 10833 of Lecture Notes in Computer Science, pages 303–321. Springer, 2018.
  • [24] Christos H. Papadimitriou. Computational complexity. Addison-Wesley, 1994.
  • [25] Neil Robertson and Paul D. Seymour. Graph minors. III. planar tree-width. J. Comb. Theory, Ser. B, 36(1):49–64, 1984.
  • [26] Marko Samer and Stefan Szeider. Fixed-parameter tractability. In Handbook of Satisfiability, volume 185 of Frontiers in Artificial Intelligence and Applications, pages 425–454. IOS Press, 2009.
  • [27] Jouko A. Väänänen. Dependence Logic - A New Approach to Independence Friendly Logic, volume 70 of London Mathematical Society student texts. Cambridge University Press, 2007. URL: http://www.cambridge.org/de/knowledge/isbn/item1164246/?site_locale=de_DE.
  • [28] Jonni Virtema. Approaches to Finite Variable Dependence: Expressiveness and Computational Complexity. PhD thesis, School of Information Sciences of the University of Tampere, 2014. Available online at https://trepo.tuni.fi/handle/10024/95328.