Graphon-valued processes with
vertex-level fluctuations

Peter Braunsteins Korteweg-de-Vries Instituut, Universiteit van Amsterdam, PO Box 94248, 1090 GE Amsterdam, The Netherlands [email protected] , Frank den Hollander Mathematisch Instituut, Universiteit Leiden, PO Box 9512, 2300 RA Leiden, The Netherlands [email protected] and Michel Mandjes Korteweg-de Vries Instituut, Universiteit van Amsterdam, PO Box 94248, 1090 GE Amsterdam, The Netherlands [email protected]

(Date: March 3, 2025)

Abstract.

We consider a class of graph-valued stochastic processes in which each vertex has a type that fluctuates randomly over time. Collectively, the paths of the vertex types up to a given time determine the probabilities that the edges are active or inactive at that time. Our focus is on the evolution of the associated empirical graphon in the limit as the number of vertices tends to infinity, in the setting where fluctuations in the graph-valued process are more likely to be caused by fluctuations in the vertex types than by fluctuations in the states of the edges given these types. We derive both sample-path large deviation principles and convergence of stochastic processes. We demonstrate the flexibility of our approach by treating a class of stochastic processes where the edge probabilities depend not only on the fluctuations in the vertex types but also on the state of the graph itself.

Key words. Graphs, graphons, dynamics, sample paths, process convergence, large deviations, optimal paths.
MSC2010. 05C80, 60C05, 60F10.
Acknowledgment. The work in this paper was supported by the Netherlands Organisation for Scientific Research (NWO) through Gravitation-grant NETWORKS-024.002.003.

1. Introduction

1.1. Background

Graphons arise as a powerful tool for characterising the limit of a sequence of dense graphs, i.e., graphs in which the number of edges scales as the square of the number of vertices. The theory describing these graphons (see e.g. [21], [22], [4], [5], [20]) focuses on the limiting properties of large dense graphs in terms of their subgraph densities. The literature covers both typical and atypical behaviour, a notable result being the large deviation principle (LDP) for homogeneous Erdős-Rényi random graphs and associated graphons [9], and their inhomogeneous counterparts [12].

While most of the existing theory focuses on static random graphons, the attention has gradually shifted to dynamic random graphons (see e.g. [26], [7], [1]). Two notable contributions in this area are [1], which presents a stochastic process limit in the space of graphons for a class of processes where the edges evolve in a dependent manner, and [6], which extends the LDP of [9] to a sample-path LDP for a dynamic random graph in which the edges switch on and off independently in a random fashion. In [6] the authors leave open the question whether a sample-path large deviation principle can be established for processes where the edges switch on and off in a dependent manner, such as in [1].

1.2. Motivation

The goal of the present paper is two-fold: (1) to answer the open question raised in [6] by establishing a sample-path large deviation principle for a class of processes in which edges evolve in a dependent manner; (2) to strengthen the results in [1] while working in a more general framework.

In the class of processes we consider, each vertex is assigned a type that changes randomly over time, and fluctuations in the types of the vertices determine how the edges interact with each other while switching on and off. Specifically, the paths of the types of all the vertices up to time $t$ determine the probability that the edges in the random graph are active at time $t$ . Collectively, these paths are called the driving process.

Our results generalise those of [1] in a number of directions:

(i)

We establish sample-path large deviations (Theorem 3.5), whereas [1] restricts attention to diffusion limits.
(ii)

We consider a general driving process and a general edge-switching dynamics, whereas [1] restricts attention to a specific driving process (the multi-type Moran model) and to a specific edge-switching dynamics (modulated by a fitness function).
(iii)

We establish stochastic process convergence in the space of $({\mathscr{W}},d_{\square})$ -valued càdlàg paths (Theorem 3.10), whereas [1] works in the space of $(\tilde{\mathscr{W}},\delta_{\square})$ -valued Skorokhod paths. (For the definition of these two spaces, see Section 2.1 below.)
(iv)

We allow for processes in which the probabilities that edges are active depend not only on the fluctuations in the types but also on the state of the graph itself, i.e., on which edges are active or not (Section 4.1).

On the way to proving our results for graphon-valued processes, we also prove a new large deviation principle for static random graphs (Theorem 2.6). This result can be viewed as a generalisation of the large deviation principle for inhomogeneous Erdős-Rényi random graphs in which each vertex is assigned a random type. Our proofs rely on concentrations estimates, coupling arguments, and continuous mapping. Along the way, several examples are presented.

The models analysed in this paper have a one-way dependence: the states of the edges depend on the types of the vertices, but the types of the vertices do not depend on the states of the edges. There are many natural models that fall into this framework, coming from statistical physics, population genetics and the social sciences, where the strengths of the interactions between particles, alleles or individuals generally depend on the type they carry. It is much harder to analyse models that exhibit a two-way dependence. Such models capture the evolution of spins, infections or opinions on dynamic random networks with mutual feedback, an area that so far remains largely unexplored.

1.3. Outline

In Section 2 we recall basic LDPs for graphons, and present three LDPs for what we call inhomogeneous random graphs with type dependence (IRGTs), which are static random objects. In Section 3 we look at their dynamic counterparts, which are graph-valued processes, the main result being a sample-path LDP in graphon space. We illustrate our results via a running example, and derive convergence of the graph-valued process to a graphon process. In Section 4 we describe various applications, and discuss possible extensions. Section 5 contains the proofs of our main theorems. Appendix A identifies the rate function in the LDP of the underlying driving process.

2. Large deviations for static random graphs

While the goal of the present paper is to study a specific class of dynamic random graphs, we begin by analyzing their static counterparts. The reason is that the marginal distributions of the dynamic random graphs to be considered (introduced in Section 3) at any given time $t\geq 0$ corresponds to a random graph with type dependence (introduced in Section 2.3).

In Section 2.1 we recall a few basic definitions related to graphons. In Section 2.2 we introduce inhomogeneous Erdős-Rényi random graphs and recall the large deviation principle for their associated empirical graphons. In Section 2.3 we describe a generalisation of inhomogeneous Erdős-Rényi random graphs, referred to as inhomogeneous random graphs with type dependence (IRGT), which motivate the definition of the class of graph-valued stochastic processes that we will be working with from Section 3 onwards. In Section 2.4 we state a number of key assumptions that are needed along the way. In Section 2.5 we establish the large deviation principle for the associated empirical graphon processes under the assumption that the driving process satisfies the LDP. The latter assumption is investigated in detail in Appendix A.

2.1. Graphs and graphons

Let $\mathscr{W}$ be the space of functions $h\colon\,[0,1]^{2}\to[0,1]$ such that $h(x,y)=h(y,x)$ for all $(x,y)\in[0,1]^{2}$ , formed after taking the quotient with respect to the equivalence relation of almost everywhere equality. A finite simple graph $G$ on $n$ vertices can be represented as a graphon $h^{G}\in\mathscr{W}$ by setting

h^{G}(x,y):=\begin{cases}1&\quad\text{if there is an edge between vertex }\lceil nx\rceil\text{ and vertex }\lceil ny\rceil,\\ 0&\quad\text{otherwise.}\end{cases}

(2.1)

This object is referred to as an empirical graphon and has a block structure. The space of graphons $\mathscr{W}$ is endowed with the cut distance

d_{\square}(h_{1},h_{2}):=\sup_{S,T\subseteq[0,1]}\left|\int_{S\times T}\mathrm{d}x\,\mathrm{d}y\,[h_{1}(x,y)-h_{2}(x,y)]\right|,\quad h_{1},h_{2}\in\mathscr{W}.

(2.2)

It is noted that the space $(\mathscr{W},d_{\square})$ is not compact [19, Example F.6].

On $\mathscr{W}$ there is a natural equivalence relation, referred to as ‘ $\sim$ ’. Letting $\mathscr{M}$ denote the set of measure-preserving bijections $\sigma\colon\,[0,1]\to[0,1]$ , we write $h_{1}\sim h_{2}$ when there exists a $\sigma\in\mathscr{M}$ such that $h_{1}(x,y)=h_{2}(\sigma(x),\sigma(y))$ for all $(x,y)\in[0,1]^{2}$ . This equivalence relation induces the quotient space $(\tilde{\mathscr{W}},\delta_{\square})$ , where $\delta_{\square}$ is the cut metric defined by

\delta_{\square}(\tilde{h}_{1},\tilde{h}_{2}):=\inf_{\sigma_{1},\sigma_{2}\in\mathscr{M}}d_{\square}(h_{1}^{\sigma_{1}},h_{2}^{\sigma_{2}}),\quad\tilde{h}_{1},\tilde{h}_{2}\in\tilde{\mathscr{W}}.

(2.3)

Notably, the space $(\tilde{\mathscr{W}},\delta_{\square})$ is compact [21, Lemma 8].

2.2. Inhomogeneous Erdős-Rényi random graph

Let $r\in\mathscr{W}$ be a reference graphon. Fix $n\in\mathbb{N}$ and consider a random graph $\widehat{G}_{n}$ with vertex set $[n]:=\{1,\dots,n\}$ , where the pair of vertices $i,j\in[n]$ , $i\neq j$ , is connected by an edge with probability $r(\frac{i}{n},\frac{j}{n})$ , independently of other pairs of vertices. Write $\mathbb{P}_{n}$ to denote the law of $\widehat{G}_{n}$ . Use the same symbol to denote the law on $\mathscr{W}$ induced by the map that associates with the graph $\widehat{G}_{n}$ its graphon $h^{\widehat{G}_{n}}$ . Write $\tilde{\mathbb{P}}_{n}$ to denote the law of $\tilde{h}^{\widehat{G}_{n}}$ , the equivalence class associated with $h^{\widehat{G}_{n}}$ .

The following theorem is an extension of the LDP for homogeneous Erdős-Rényi random graphs established in [9]. It was first stated in [12] under additional assumptions. These assumptions were subsequently relaxed in [24], [3], [13]. The following version of the LDP corresponds to [13, Theorem 4.1].

Theorem 2.1.

Suppose that $r\log r$ and $(1-r)\log(1-r)$ are integrable. Then the sequence of probability measures $(\tilde{\mathbb{P}}_{n})_{n\in\mathbb{N}}$ satisfies the LDP on $(\tilde{\mathscr{W}},\delta_{\square})$ with rate ${n\choose 2}$ and with rate function $\tilde{I}_{r}$ , i.e.,

		$\displaystyle\limsup_{n\to\infty}\frac{1}{{n\choose 2}}\log\tilde{\mathbb{P}}_{n}(\mathcal{C})\leq-\inf_{\tilde{h}\in\mathcal{C}}\tilde{I}_{r}(\tilde{h})\qquad\forall\,\,\mathcal{C}\subseteq\tilde{\mathscr{W}}\text{ closed,}$		(2.4)
		$\displaystyle\liminf_{n\to\infty}\frac{1}{{n\choose 2}}\log\tilde{\mathbb{P}}_{n}(\mathcal{O})\geq-\inf_{\tilde{h}\in\mathcal{O}}\tilde{I}_{r}(\tilde{h})\qquad\forall\,\,\mathcal{O}\subseteq\tilde{\mathscr{W}}\text{ open,}$		(2.4)

where

\tilde{I}_{r}(\tilde{h})=\inf_{\sigma\in\mathscr{M}}I_{r}(h^{\sigma}),

(2.5)

$h$ is any representative of $\tilde{h}$ , and

I_{r}(h):=\int_{[0,1^{2}]}{\rm d}x\,{\rm d}y\,\,\mathcal{R}(h(x,y)\,|\,r(x,y)),

(2.6)

with

\mathcal{R}(a\,|\,b):=a\log\frac{a}{b}+(1-a)\log\frac{1-a}{1-b}.

(2.7)

2.3. Inhomogeneous random graphs with type dependence

Consider the following generalisation of the inhomogeneous Erdős-Rényi random graph defined in Section 2.2. Suppose that each vertex $i\in[n]$ is assigned a (possibly random) type $X_{i}^{(n)}\in[0,1]$ . Denote the empirical type measure by

\mu_{n}=\frac{1}{n}\sum_{i=1}^{n}\delta_{X_{i}^{(n)}}

(2.8)

and the empirical type distribution

F_{n}({\boldsymbol{X}}^{(n)},x)=\frac{1}{n}\sum_{i=1}^{n}\mathbbm{1}\{X_{i}^{(n)}\leq x\},\qquad x\in[0,1],

(2.9)

where $\mathbbm{1}\{A\}$ is the indicator function of the event $A$ and ${\boldsymbol{X}}^{(n)}\equiv(X_{1}^{(n)},\ldots,X_{n}^{(n)})$ . Let $\mathcal{M}([0,1])$ denote the space of measures on $[0,1]$ endowed with the topology of weak convergence.

The way the graph is constructed is as follows. Whether or not an edge $(i,j)$ is active depends on local properties, namely, the types $X_{i}^{(n)}$ and $X_{j}^{(n)}$ , as well as global properties, namely, the empirical type distribution $F_{n}({\boldsymbol{X}}^{(n)},\cdot\,)$ . Concretely, we let edge $ij$ be active with probability

H\Big{(}X_{i}^{(n)},X_{j}^{(n)},F_{n}\Big{)}\equiv H\Big{(}X_{i}^{(n)},X_{j}^{(n)},F_{n}({\boldsymbol{X}}^{(n)},\cdot\,)\Big{)},

(2.10)

where $H\colon\,[0,1]^{2}\times\mathcal{M}([0,1])\to[0,1]$ is symmetric in its first two inputs. Given ${\boldsymbol{X}}^{(n)}$ , the edge placement is independent for all vertex pairs. We label the resulting sequence of random graphs as $\{G_{n}\}_{n\in\mathbb{N}}$ , and refer to them as inhomogeneous random graphs with type dependence (IRGT).

Two relations with inhomogeneous Erdős-Rényi random graphs are worth mentioning:

$\circ$

Observe that if

$X_{i}^{(n)}=\frac{i}{n}\quad\forall\,n\in\mathbb{N},\,i\in[n],\qquad H(x,y,F)=r(x,y)\quad\forall\,x,y\in[0,1],$ (2.11)

then the IRGT is equivalent to the inhomogeneous Erdős-Rényi random graph defined in Section 2.2.
$\circ$

Let $\bar{F}$ denote the right-continuous generalised inverse of a distribution function $F$ with support $[0,1]$ , which is defined in the usual way as

$\bar{F}(u):=\inf\{x\in[0,1]\colon\,F(x)>u\},\qquad u\in[0,1).$ (2.12)

For $F\in\mathcal{M}([0,1])$ , define the induced reference graphon $g^{[F]}\in\mathscr{W}$ by setting

$g^{[F]}(x,y)=H\big{(}\bar{F}(x),\bar{F}(y),F\big{)}.$ (2.13)

Given the type distribution $F_{n}$ , $\tilde{h}^{G_{n}}$ has the same distribution as the inhomogeneous Erdős-Rényi random graph with reference graphon $g^{[F_{n}]}$ . In other words, we have

$\tilde{h}^{G_{n}}\,|\,F_{n}\stackrel{{\scriptstyle\rm d}}{{=}}\tilde{h}^{\widehat{G}_{n}},\,r=g^{[F_{n}]}.$ (2.14)

This observation is central to the LDP for IRGTs stated in Theorem 2.6 below.

2.4. Key assumptions

Before stating Theorem 2.6, we make a number of assumptions.

Assumption 2.2.

The sequence of type distributions $(F_{n}({\boldsymbol{X}}^{(n)},\cdot\,))_{n\in\mathbb{N}}$ satisfies the LDP on $\mathcal{M}([0,1])$ with rate $\ell(n)$ and with rate function $K$ . $\diamondsuit$

When $X^{(n)}_{i}$ , $i\in\mathbb{N}$ , are fixed and $\mu_{n}\to\mu$ in $\mathcal{M}([0,1])$ as $n\to\infty$ , as in (2.11), then Assumption 2.2 is satisfied with $\ell(n)=\infty$ . Assumption 2.2 holds, for instance, when $\{X^{(n)}_{i}\}_{n\in\mathbb{N},i\in[n]}$ are i.i.d. random variables with distribution $f$ , in which case $\ell(n)=n$ and $K(f^{\circ})={\mathscr{H}}(f^{\circ}\,|\,f)$ is the relative entropy (or Kullback-Leibler divergence) of $f^{\circ}$ with respect to $f$ . Assumption 2.2 may also hold with $\ell(n)$ not scaling linearly in $n$ .

To provide an example, we extend the setup in [13, Example 2.5]. Let $p>0$ , $f$ be a probability distribution on $\mathbb{R}$ with bounded support, and $\{Y_{i}^{(n)}\}_{i\in[\lfloor n^{p}\rfloor]}$ be i.i.d. random variables with distribution $f$ . Let $s_{n}$ be such that

s_{n}(x)=Y_{i}^{(n)},\qquad x\in\left(\frac{i-1}{\lfloor n^{p}\rfloor},\frac{i}{\lfloor n^{p}\rfloor}\right),

(2.15)

and identify $s_{n}$ with its periodic extension to $\mathbb{R}$ . Let $\rho$ be a smooth convolution kernel with compact support, and define

X_{i}^{(n)}=\int_{\mathbb{R}}{\rm d}y\,\rho\left(\frac{i}{\lfloor n^{p}\rfloor}-y\right)s_{n}(y).

(2.16)

In this case $\ell(n)=n^{p}$ and

K(v)=\inf_{f^{\circ}}\left\{{\mathscr{H}}(f^{\circ}\,|\,f):v(x)=\int_{\mathbb{R}}{\rm d}y\,\rho(x-y)f^{\circ}(y),\,x\in\mathbb{R}\right\}.

(2.17)

We refer the reader to [13, Example 2.5] for the arguments underlying this result.

Assumption 2.3.

The function $F\mapsto g^{[F]}$ defined in (2.13) is a continuous mapping from $\mathcal{M}([0,1])$ to $(\mathscr{W},\lVert\cdot\rVert_{L_{1}})$ . $\diamondsuit$

Assumption 2.3 holds, for example, when $H(x,y,F)\equiv H^{*}(x,y)$ (i.e., there is no dependence on the type distribution) and $H^{*}\colon\,[0,1]^{2}\to[0,1]$ is a continuous function. Assumption 2.3 also holds when, in addition, $f\colon\,\mathcal{M}([0,1])\to[0,1]$ and $h\colon\,[0,1]^{2}\to[0,1]$ are continuous functions, and

H(x,y;F)=h(H^{*}(x,y),f(F))\qquad\forall\,\,[x,y]\in[0,1]^{2},\,F\in\mathcal{M}([0,1]).

(2.18)

In certain settings we require two further assumptions that are of a more technical nature.

Assumption 2.4.

For all $F\in\mathcal{M}([0,1])$ , the induced graphon $g^{[F]}$ is away from the boundary, i.e.,

\eta\leq g^{[F]}(x,y)\leq 1-\eta\qquad\forall\,\,(x,y)\in[0,1]^{2},

(2.19)

for some $\eta>0$ . $\diamondsuit$

Assumption 2.5.

The rate function $K$ has a unique zero, labelled $F^{*}$ . $\diamondsuit$

2.5. LDP for IRGTs

Let

J(\tilde{h})=\inf_{F\in\mathcal{M}([0,1]):\tilde{g}^{[F]}=\tilde{h}}K(F)

(2.20)

and recall that $(r,\tilde{h})\mapsto I_{r}(\tilde{h})$ is a function from $\mathscr{W}\times\tilde{\mathscr{W}}$ to $\mathbb{R}_{+}$ . We are now ready to state our LDP for IRGTs.

Theorem 2.6.

Subject to Assumptions 2.2 and 2.3 the following hold:

(i)

If $\ell(n)=o\left({n\choose 2}\right)$ , then $\{\tilde{h}^{\widehat{G}_{n}}\}$ satisfies the LDP with rate $\ell(n)$ and with rate function $I^{*}(\tilde{h})=J(\tilde{h})$ .
(ii)

If $\lim_{n\to\infty}\ell(n)/{n\choose 2}=c$ and Assumption 2.4 holds as well, then $\{\tilde{h}^{\widehat{G}_{n}}\}$ satisfies the LDP with rate ${n\choose 2}$ and with rate function $I^{*}(\tilde{h})=\inf_{g\in\tilde{\mathscr{W}}}[cJ(\tilde{g})+I_{g}(\tilde{h})]$ , where $g$ is any representative of $\tilde{g}$ .
(iii)

If ${n\choose 2}=o(\ell(n))$ and Assumptions 2.4 and 2.5 hold as well, then $\{\tilde{h}^{\widehat{G}_{n}}\}$ satisfies the LDP with rate ${n\choose 2}$ and with rate function $I^{*}(\tilde{h})=I_{g^{[F^{*}]}}(\tilde{h})$ .

To understand where Theorem 2.6 comes from, it is instructive to realize that two random mechanisms play a role, and that the dominant mechanism determines the rare event behavior. Concretely, think of simulating outcomes of $\tilde{h}^{\widehat{G}_{n}}$ in two steps:

$\circ$

Simulate the types of the vertices, i.e., simulate the type distribution $F_{n}$ .
$\circ$

Simulate the edges given $F_{n}$ , i.e., simulate $\tilde{h}^{\widehat{G}_{n}}$ given the induced reference graphon $g^{[F_{n}]}$ .

Due to Assumption 2.2, large fluctuations in Step 1 are governed by the LDP with rate $\ell(n)$ and with rate function $K(\cdot)$ , whereas due to (2.14) large fluctuations in Step 2 are governed by the LDP with rate ${n\choose 2}$ and with rate function $I_{g^{[F_{n}]}}$ . In particular, this implies that when $\ell(n)=o\left({n\choose 2}\right)$ large fluctuations in $\tilde{h}^{\widehat{G}_{n}}$ are most likely to be caused by a rare event in Step 1, whereas when ${n\choose 2}=o(\ell(n))$ large fluctuations in $\tilde{h}^{\widehat{G}_{n}}$ are most likely to be caused by a rare event in Step 2. The regime $\ell(n)\asymp{n\choose 2}$ can be viewed as ‘balanced’, in the sense that large fluctuations in $\tilde{h}^{\widehat{G}_{n}}$ are most likely to be caused by a combination of rare events in both Steps 1 and 2. When $\ell(n)=o\left({n\choose 2}\right)$ we say that the IRGT exhibits vertex-level fluctuations, whereas when ${n\choose 2}=o(\ell(n))$ we say that it exhibits edge-level fluctuations.

The IRGT is of interest in its own right. However, our primary motivation for introducing the IRGT is that it can be generalized in a natural way to a stochastic process. The rough idea behind its dynamic counterpart is that at each point in time the distribution of the process corresponds to an IRGT. We will focus primarily on processes that exhibit vertex-level fluctuations.

3. Graphon-valued processes

In Section 3.1 we first introduce the graph-valued process of interest, which can be viewed as the dynamic counterpart of the model discussed in Section 2. Section 3.2 describes an illustrative example. Section 3.3 states the sample-path LDP for the graph-valued process under the assumption that the driving process satisfies an LDP. Section 3.4 states the stochastic process limit for the graph-valued process under the assumption that the driving process satisfies a stochastic process limit. The latter assumption is investigated in Appendix A.

3.1. The model

For a given time horizon $T$ , let $(G_{n}(t))_{t\in[0,T]}$ denote our graph-valued process. This process is constructed as follows. Suppose that each vertex $i\in[n]$ has a type $X^{(n)}_{i}(t)$ that may fluctuate randomly over time. Let $(\mu_{n}(t))_{t\in[0,T]}$ denote the process of empirical type measures defined by

\mu_{n}(t)=\frac{1}{n}\sum_{i=1}^{n}\delta_{X_{i}^{(n)}(t)},

(3.1)

i.e., the dynamic version of (2.8). This process evolves autonomously, i.e., independently of the graph-valued process $(G_{n}(t))_{t\in[0,T]}$ .

In addition, let $(F_{n}(t;\cdot))_{t\in[0,T]}$ denote the process of empirical type distribution defined by

F_{n}(t;x)=\frac{1}{n}\sum_{i=1}^{n}\mathbbm{1}\{X_{i}^{(n)}(t)\leq x\},\qquad x\in[0,1],

(3.2)

i.e., the dynamic counterpart of (2.9). The process $(\mu_{n}(t))_{t\in[0,T]}$ lives on $D(\mathcal{M}([0,1]),[0,T])$ , the space of $\mathcal{M}([0,1])$ -valued càdlàg paths. We suppose that, at any given time $t$ , edge $ij$ is active with probability

H\big{(}t;X^{(n)}_{i}(t),X^{(n)}_{j}(t),(F_{n}(t;\cdot))_{t\in[0,T]}\big{)},

(3.3)

independently of all other edges at time $t$ , of $X^{(n)}_{i}(t)$ , $X^{(n)}_{j}(t)$ , and of $(F_{n}(t;\cdot))_{t\in[0,T]}$ , where

H\colon\,[0,T]\times[0,1]^{2}\times D(\mathcal{M}([0,1]),[0,T])\mapsto[0,1].

(3.4)

This function $H$ gives rise to the induced reference graphon process $g^{[F]}$ , which, for $F\in D(\mathcal{M}([0,1]),[0,T])$ , is characterised by

g^{[F]}(t;x,y)=H\big{(}t;\bar{F}(t;x),\bar{F}(t;y),(F(t;\cdot)_{t\in[0,T]})\big{)}.

(3.5)

Observe that, for any $t\in[0,T]$ , given the outcome of the empirical type distribution $F_{n}(t,\cdot\,)$ , the distribution of $\tilde{h}^{G_{n}(t)}$ corresponds to that of an inhomogeneous Erdős-Rényi random graph with reference graphon $g^{[F_{n}]}(t;\cdot,\cdot)$ . In other words, for any $t\in[0,T]$ ,

h^{G_{n}(t)}|F_{n}\stackrel{{\scriptstyle\rm d}}{{=}}h^{\widehat{G}_{n}},\qquad r=g^{[F_{n}]}(t;\cdot),

(3.6)

where $\widehat{G}_{n}$ is the inhomogeneous Erdős-Rényi random graph defined in Section 2.2. We make the following assumption on the function $F\mapsto g^{[F]}$ , which due to (3.5) is an assumption on $H$ .

Assumption 3.1.

The map $F\mapsto g^{[F]}$ from $D(\mathcal{M}([0,1]),[0,T],)$ to $D((\mathscr{W},\lVert\cdot\rVert_{L_{1}}),[0,T])$ is continuous. $\diamondsuit$

3.2. An illustrative example

Suppose that $(G_{n}(t))_{t\in[0,T]}$ is characterised by the following dynamics:

•

$G_{n}(0)$ is the empty graph.
•

Each vertex is assigned an independent Poisson clock with rate $\gamma$ , i.e., the time intervals between two consecutive ring times are exponentially distributed with parameter $\gamma$ . Each time the clock attached to vertex $v$ rings, all the edges are adjacent to $v$ become inactive.
•

If the edge $ij$ is inactive, then it becomes active at rate $\lambda$ , independently of anything else.

We first describe the associated driving process. Let $\{\tau_{k}(v)\}_{k\in\mathbb{N}}$ denote the sequence of times at which the Poisson clock attached to vertex $v$ rings, and let

Y_{v}(t):=t-\max_{k}\{\tau_{k}(v)\colon\,\tau_{k}(v)\leq t\}\vee 0

(3.7)

denote the time since the clock last rung. The value of $Y_{v}(t)$ can be thought of as the age of vertex $v$ at time $t$ : each time the clock associated with $v$ rings, it dies and all its adjacent edges are lost. Recalling that we assumed that types take values in $[0,1]$ , we write

X_{v}(t):=F^{{\rm exp}}(Y_{v}(t))=1-\mathrm{e}^{-\gamma Y_{v}(t)}

(3.8)

to denote the type of vertex $v$ at time $t$ , where $F^{{\rm exp}}(\cdot)$ can be interpreted as the distribution function of an exponential random variable with rate $\gamma$ .

The function $H(t;u,v,F)$ can now also be identified. The probability that there is an active edge between vertices of ages $\bar{u}$ and $\bar{v}$ is $1-\exp\{-\lambda(\bar{u}\wedge\bar{v})\}$ . Putting $u=F^{{\rm exp}}(\bar{u})$ and $v=F^{{\rm exp}}(\bar{v})$ , we obtain, using that $\bar{u}=-\log(1-u)/\gamma$ and $\bar{v}=-\log(1-v)/\gamma$ ,

H(t;u,v,F)=1-\exp\left(\lambda\left(\frac{1}{\gamma}\log(1-u\wedge v\right)\right)=1-(1-u\wedge v)^{\lambda/\gamma}.

(3.9)

Because $H(t;u,v,F)$ is a continuous function of $u$ and $v$ , and is independent of $t$ and $F$ , it is straightforward to verify that Assumption 3.1 holds. A more involved example is given in Section 4.1.

3.3. Sample-path large deviations

Similarly as in Section 2.5, we assume that the driving process satisfies the LDP (which for the above illustrative example is established in Lemma A.1).

Assumption 3.2.

$\{F_{n}\}_{n\in\mathbb{N}}$ satisfies the LDP on $D(\mathcal{M}([0,1]),[0,T])$ with rate $\ell(n)=o({n\choose 2})$ and with rate function $K$ . $\diamondsuit$

To establish the sample-path LDP for the graphon-valued process $\{(\tilde{h}^{\widehat{G}_{n}(t)})_{t\geq 0}\}_{n\in\mathbb{N}}$ , we need to:

(I)

establish the LDP in the pointwise topology;
(II)

strengthen this topology by establishing exponential tightness.

Step (I) is settled by the following result. For $\tilde{h}\in D((\tilde{\mathscr{W}},\delta_{\square}),[0,T])$ , let

J(\tilde{h})=\inf_{F\in D(\mathcal{M}([0,1]),[0,T]):\tilde{g}^{[F]}=\tilde{h}}K(F).

(3.10)

Proposition 3.3.

If Assumptions 3.1 and 3.2 hold, then the sequence $((\tilde{h}^{G_{n}(t)})_{t\geq 0})_{n\in\mathbb{N}}$ satisfies the LDP in the pointwise topology with rate $\ell(n)$ and with rate function $J(\tilde{h})$ .

Note that Proposition 3.3 does not refer to any edge-switching dynamics. Specifically, if two process $\{G_{n}\}_{n\in\mathbb{N}}$ and $\{G^{*}_{n}\}_{n\in\mathbb{N}}$ have a common sequence of types $((X_{i}(t))_{t\geq 0})_{i\in[n]}$ and a common edge-connection function $H$ , then the marginal distributions are equivalent, i.e.,

\tilde{h}^{G_{n}(t)}\stackrel{{\scriptstyle\rm d}}{{=}}\tilde{h}^{G^{*}_{n}(t)},\qquad t\in[0,T].

(3.11)

However, this does not necessarily mean that the joint distributions are equivalent, i.e., we may have

\big{(}\tilde{h}^{G_{n}(t)}\big{)}_{t\in[0,T]}\stackrel{{\scriptstyle\rm d}}{{\neq}}\big{(}\tilde{h}^{G^{*}_{n}(t)}\big{)}_{t\in[0,T]},

(3.12)

because these depend on the specific edge-switching dynamics. Nonetheless, Proposition 3.3 implies that both $\{G_{n}\}_{n\in\mathbb{N}}$ and $\{G^{*}_{n}\}_{n\in\mathbb{N}}$ satisfy equivalent LDPs in the pointwise topology, i.e., the rate function depends only on the marginal distributions of the process and not on the specific edge-switching dynamics. In Sections 4.1 and 4.2 we provide examples of processes with equivalent marginals and different edge-switching dynamics.

The specific edge-switching dynamics do need to be taken into consideration when we want to perform step (II), i.e., strengthen the topology of the LDP in Proposition 3.3 by establishing exponential tightness. We next provide a condition that can be used to verify that $\{\tilde{h}^{G_{n}}\}_{n\in\mathbb{N}}$ are exponentially tight. Let

E^{(n)}_{ij}(t)=\begin{cases}1,\quad&\text{if edge $ij$ is active at time $t$,}\\ 0,\quad&\text{otherwise},\end{cases}

(3.13)

and define

C_{n}(t,\delta)=\sum_{1\leq i<j\leq n}\sup_{t\leq u\leq v\leq t+\delta}|E^{(n)}_{ij}(u)-E^{(n)}_{ij}(v)|.

(3.14)

In other words, $C_{n}(t,\delta)$ is the number of edges that change (i.e., go from active to inactive or vice versa) at some time between $t$ and $t+\delta$ .

Proposition 3.4.

If, for all $t\in[0,T]$ and $\varepsilon>0$ ,

\displaystyle\lim_{\delta\downarrow 0}\limsup_{n\to\infty}\frac{1}{\ell(n)}\log\mathbb{P}\left(C_{n}(t,\delta)>\varepsilon{n\choose 2}\right)=-\infty,

(3.15)

then $((\tilde{h}^{G_{n}(t)})_{t\geq 0})_{n\in\mathbb{N}}$ is exponentially tight.

Combining the above two propositions, we obtain the following.

Theorem 3.5.

If the conditions of Propositions 3.3 and 3.4 are satisfied, then the sequence of processes $(\tilde{h}^{G_{n}(t)})_{t\geq 0})_{n\in\mathbb{N}}$ satisfies the LDP on $D(\tilde{\mathscr{W}},[0,T])$ with rate $\ell(n)$ and with rate function $J(\tilde{h})$ .

In view of Lemma A.1, the conditions of Theorem 3.5 can be readily verified for the illustrative example. In Theorem 3.10 we establish a sample-path LDP for a class of processes that includes the illustrative example.

3.4. Stochastic process convergence

In the sequel, $\Rightarrow$ denotes convergence in distribution, and $\stackrel{{\scriptstyle\rm fdd}}{{\Rightarrow}}$ convergence of the associated finite-dimensional distributions. We assume that the empirical type distribution has a stochastic process limit.

Assumption 3.6.

Suppose that $F_{n}\Rightarrow F$ as $n\to\infty$ on $D(\mathcal{M}([0,1]),[0,T])$ . $\diamondsuit$

We establish the stochastic process limit of $(h^{G_{n}(t)})_{t\in[0,T]}$ as $n\to\infty$ on $D((\mathscr{W},d_{\square}),[0,T])$ , i.e., we no longer take the quotient with respect to the equivalence relation $\sim$ . To establish a stochastic process limit in this finer topology, as explained below, we need to ensure that the labels of the vertices update dynamically.

Assumption 3.7.

At any time $t\in[0,1]$ the labels of the vertices are such that

X_{1}(t)\leq\dots\leq X_{n}(t).

(3.16)

$\diamondsuit$

The importance of Assumption 3.7 is illustrated in Figure 3.4, where we consider the illustrative example from Section 3.2 with $t=1$ , $\lambda=6$ and $\gamma=3$ . The left panel shows an outcome of $h^{G_{100}(1)}$ with a static labelling, where vertices are labelled arbitrarily at time $t=0$ and their labels do not change over time. We observe that $h^{G_{100}(1)}$ has no discernible structure, and so with probability 1 the sequence $(h^{G_{n}(1)})_{n\in\mathbb{N}}$ does not converge to a limit in $(\mathscr{W},d_{\square})$ . The center panel of Figure 3.4 illustrates an outcome of $h^{G_{100}(1)}$ under the dynamic labelling given in Assumption 3.7. Under this assumption $h^{G_{100}(1)}$ has a discernible structure and $h^{G_{n}(1)}\to g$ in $(\mathscr{W},d_{\square})$ as $n\to\infty$ , where $g$ is the smooth graphon illustrated in the right panel of Figure 3.4.

Refer to caption — Figure 1. . An illustration of $h^{G_{100}(1)}$ with static labelling (left panel) and dynamic labelling (center panel), and the corresponding limit for the dynamic labelling in $(\mathcal{W},d_{\square})$ (right panel). In this figure black corresponds to the value 1 and white corresponds to the value 0.

$\displaystyle H(t;x_{i},x_{j},F)$	$\displaystyle=\int_{t-x_{i}\wedge x_{j}}^{t}{\rm d}s\,\lambda(s,x_{i}-t+s,x_{j}-t+s,F(s;\cdot))$	(4.3)
	$\displaystyle\qquad\times\exp\bigg{\{}-\int_{s}^{t}{\rm d}a\,[\mu(a,x_{i}-t+a,x_{j}-t+a,F(a;\cdot))$
	$\displaystyle\qquad\qquad+\lambda(a,x_{i}-t+a,x_{j}-t+a,F(a;\cdot))]\bigg{\}};$

$\displaystyle H(t;x_{i},x_{j},F)$	$\displaystyle=\int_{t-x_{i}\wedge x_{j}}^{t}{\rm d}s\,\lambda(s,x_{i}-t+s,x_{j}-t+s,F(s;\cdot),\tilde{h}^{G(s)})$	(4.6)
	$\displaystyle\qquad\times\exp\bigg{\{}-\int_{s}^{t}{\rm d}a\,[\mu(a,x_{i}-t+a,x_{j}-t+a,F(a;\cdot),\tilde{h}^{G(a)})$
	$\displaystyle\qquad\qquad+\lambda(a,x_{i}-t+a,x_{j}-t+a,F(a;\cdot),\tilde{h}^{G(a)})]\bigg{\}}.$

$\displaystyle g^{(F)}(t;x,y)$	$\displaystyle=\int_{t-\bar{F}(t;x)\wedge\bar{F}(t;y)}^{t}{\rm d}s\,\lambda(s,\bar{F}(s;x)-t+s,\bar{F}(s;y)-t+s,F(s;\cdot),\tilde{g}^{(F)}(s))$	(4.7)
	$\displaystyle\quad\times\exp\bigg{\{}-\int_{s}^{t}{\rm d}a\,[\mu(a,\bar{F}(a;x)-t+a,\bar{F}(a;y)-t+a,F(a;\cdot),\tilde{g}^{(F)}(a))$
	$\displaystyle\quad\quad+\lambda(a,\bar{F}(a;x)-t+a,\bar{F}(a;y)-t+a,F(a;\cdot),\tilde{g}^{(F)}(a))]\bigg{\}}.$

	$\displaystyle I_{r}(h)$	$\displaystyle=\frac{1}{2}\int_{[0,1]^{2}}{\rm d}x\,{\rm d}y\left[h(x,y)\log\left(\frac{h(x,y)}{r(x,y)}\right)+(1-h(x,y))\log\left(\frac{1-g_{n}(x,y)}{1-r(x,y)}\right)\right]$		(5.4)
		$\displaystyle\geq\int_{[0,1]^{2}}{\rm d}x\,{\rm d}y\,(h(x,y)-r(x,y))^{2}\geq\lVert h-r\rVert_{L_{1}}^{2}\geq d_{\square}(h,r)^{2}\geq\varepsilon^{2}.$		(5.4)

		$\displaystyle\liminf_{n\to\infty}\frac{1}{{n\choose 2}}\log\mathbb{P}(\tilde{h}^{G_{n}}\in\mathcal{O})$		(5.14)
		$\displaystyle\geq\lim_{\varepsilon\downarrow 0}\liminf_{n\to\infty}\frac{1}{{n\choose 2}}\bigg{[}\log\mathbb{P}(F_{n}\in F(r,\varepsilon))+\log\mathbb{P}(\tilde{h}^{\widehat{G}_{n}}\in\mathcal{O}\|F_{n}\in F(r,\varepsilon))\bigg{]}$
		$\displaystyle\geq-\lim_{\varepsilon\downarrow 0}\left[cK(F(r,\varepsilon))+\sup_{F\in F(r,\varepsilon)}\inf_{\tilde{h}\in\mathcal{O}}I_{r^{[F]}}(\tilde{h})\right]$
		$\displaystyle=-[cJ(\tilde{r})+\inf_{\tilde{h}\in\mathcal{O}}I_{r}(\tilde{h})].$

$\displaystyle\limsup_{n\to\infty}$	$\displaystyle\frac{1}{{n\choose 2}}\log\mathbb{P}(\tilde{h}^{\widehat{G}_{n}}\in\mathcal{C})$	(5.16)
	$\displaystyle\leq\lim_{\varepsilon\downarrow 0}\limsup_{n\to\infty}\frac{1}{{n\choose 2}}\log\bigg{[}\sum_{F\in F[\varepsilon]}\mathbb{P}(F_{n}\in B_{L}(F,\varepsilon))\mathbb{P}(\tilde{h}^{\widehat{G}_{n}}\in\mathcal{C}\,\|\,F_{n}\in B_{L}(F,\varepsilon))\bigg{]}$
	$\displaystyle\leq-\lim_{\varepsilon\downarrow 0}\min_{F\in F[\varepsilon]}[cK(B_{L}(F,\varepsilon))+\inf_{F^{\star}\in B_{L}(F,\varepsilon)}\inf_{\tilde{h}\in\mathcal{C}}I_{g^{[F^{\star}]}}(\tilde{h})].$
	$\displaystyle\leq-\lim_{\varepsilon\to 0}\min_{F\in\mathcal{M}([0,1])}[cK(B_{L}(F,\varepsilon))+\inf_{F^{\star}\in B_{L}(F,\varepsilon)}\inf_{\tilde{h}\in\mathcal{C}}I_{g^{[F^{\star}]}}(\tilde{h})]$
	$\displaystyle=-\min_{F\in\mathcal{M}([0,1])}[cK(F)+\inf_{\tilde{h}\in\mathcal{C}}\tilde{I}_{r}^{[F]}(\tilde{h})]$
	$\displaystyle=\inf_{\tilde{h}\in\mathcal{C}}\big{\{}\min_{F\in\mathcal{M}([0,T])}[cK(F)+\tilde{I}_{r}^{[F]}(\tilde{h})]\big{\}}.$

		$\displaystyle\liminf_{n\to\infty}\frac{1}{\ell(n)}\log\mathbb{P}\left(\tilde{h}^{G_{n}(t_{i})}\in\mathcal{O}_{i},\,\forall i=1,\dots,k\right)$		(5.18)
		$\displaystyle\geq\lim_{\varepsilon\downarrow 0}\liminf_{n\to\infty}\frac{1}{\ell(n)}\log\bigg{[}\mathbb{P}\left(\tilde{g}^{[F_{n}]}(t_{i})\in\mathcal{O}^{(-\varepsilon)}_{i},\,\forall i=1,\dots,k\right)$
		$\displaystyle\qquad\qquad\qquad\qquad\qquad+\bigg{(}1-\sum_{i=1}^{k}\mathbb{P}(\delta_{\square}(\tilde{h}^{G_{n}(t_{i})},\tilde{g}^{[F_{n}]}(t_{i}))>\varepsilon\bigg{)}\bigg{]}$
		$\displaystyle\geq\inf_{\tilde{h}:\tilde{h}(t_{i})\in\mathcal{O}_{i},\,\forall i=1,\dots,k}J(\tilde{h}),$

	$\displaystyle d(g^{[F_{n}]}(\cdot),h^{G_{n}(\cdot)})$	$\displaystyle\leq d\left(g^{[F_{n}]}(\cdot),g_{\delta}^{[F_{n}]}(\cdot)\right)+d\left(g_{\delta}^{[F_{n}]}(\cdot),h^{G_{n}(\cdot)}\right)$		(5.30)
		$\displaystyle\leq d\left(g^{[F_{n}]}(\cdot),g_{\delta}^{[F_{n}]}(\cdot)\right)+\sup_{t\in[0,T]}d_{\square}\left(g_{\delta}^{[F_{n}]}(t),h^{G_{n}(t)}\right)$		(5.30)

$\displaystyle\lim_{\delta\downarrow 0}$	$\displaystyle\lim_{n\to\infty}\mathbb{P}\left(\sup_{t\in[0,T]}d_{\square}\left(g_{\delta}^{[F_{n}]}(t),h^{G_{n}(t)}\right)>2\varepsilon\right)$	(5.33)
	$\displaystyle\leq\lim_{\delta\downarrow 0}\lim_{n\to\infty}\sum^{T/\delta}_{i=1}\left\{\mathbb{P}\left(d_{\square}\left(g_{\delta}^{[F_{n}]}(\delta i),h^{G_{n}(\delta i)}\right)>\varepsilon\right)+\mathbb{P}\left(C_{n}(t,\delta)>\varepsilon{n\choose 2}\right)\right\}$
	$\displaystyle=0,$

	$\displaystyle\mathbb{P}$	$\displaystyle\left(\left\lVert\tilde{h}^{G_{n}(t)}-\tilde{h}^{G^{*}_{n}(t)}\right\rVert_{L_{1}}>\eta,\,\text{for some }t\in[0,T]\right)$		(5.42)
		$\displaystyle\leq\mathbb{P}(Z^{(\beta)}_{n}(T)\leq\eta n^{2}/2)+\mathbb{P}\left(\delta_{\square}\left(\tilde{g}^{(F_{n})}(t;\cdot,\cdot),\tilde{h}^{G^{*}_{n}(t)}\right)>\beta,\,\text{for some }t\in[0,T]\right),$		(5.42)

	$\displaystyle\varphi(s)$	$\displaystyle:=\mathbb{E}\left(\mathrm{e}^{s\sum_{k=1}^{Y^{(i)}}X_{0}^{(i,k)}}\right)=\mathbb{E}\left(\mathbb{E}\left(\mathrm{e}^{sX^{(i,k)}_{0}}\right)^{Y^{(i)}}\right)$		(5.47)
		$\displaystyle=\exp\left\{C^{}\beta T\left(\frac{\mathrm{e}^{-C^{}T+s}}{1-(1-\mathrm{e}^{-C^{*}T})\mathrm{e}^{s}}-1\right)\right\}$		(5.47)

$\displaystyle\Delta_{n}$	$\displaystyle(t+{\rm d}t):=\left\lVert g^{[F]}(t+{\rm d}t;\cdot,\cdot)-g^{[F_{n}]}(t+{\rm d}t;\cdot,\cdot))\right\rVert_{L_{1}}$	(5.50)
	$\displaystyle\leq\left\lVert g^{[F]}(t;\cdot,\cdot)-g^{[F_{n}]}(t;\cdot,\cdot)\right\rVert_{L_{1}}$
	$\displaystyle+{\rm d}t\int_{[0,1]^{2}}{\rm d}x{\rm d}y\,\bigg{\|}\lambda\left(t,\bar{F}(t;x^{\prime}),\bar{F}(t;y^{\prime}),F(t;\cdot),\tilde{g}^{(F)}(t;\cdot)\right)(1-g^{(F)}(t;x^{\prime},y^{\prime}))$
	$\displaystyle\quad\hskip 22.76219pt-\lambda\left(t,\bar{F}_{n}(t;x^{\prime}_{n}),\bar{F}_{n}(t;y^{\prime}_{n}),F_{n}(t;\cdot),\tilde{g}^{(F_{n})}(t;\cdot)\right)(1-g^{(F_{n})}(t;x^{\prime}_{n},y^{\prime}_{n}))\bigg{\|}$
	$\displaystyle+{\rm d}t\int_{[0,1]^{2}}{\rm d}x{\rm d}y\,\bigg{\|}\mu\left(t,\bar{F}(t;x^{\prime}),\bar{F}(t;y^{\prime}),F(t;\cdot),\tilde{g}^{(F)}(t;\cdot)\right)g^{(F)}(t;x^{\prime},y^{\prime})$
	$\displaystyle\quad\hskip 22.76219pt-\mu\left(t,\bar{F}_{n}(t;x^{\prime}_{n}),\bar{F}_{n}(t;y^{\prime}_{n}),F_{n}(t;\cdot),\tilde{g}^{(F_{n})}(t;\cdot)\right)g^{(F_{n})}(t;x^{\prime}_{n},y^{\prime}_{n})\bigg{\|}.$

		$\displaystyle\bigg{\|}\lambda\left(t,\bar{F}(t;x^{\prime}),\bar{F}(t;y^{\prime}),F(t;\cdot),\tilde{g}^{(F)}(t;\cdot)\right)$		(5.51)
		$\displaystyle\qquad-\lambda\left(t,\bar{F}_{n}(t;x^{\prime}_{n}),\bar{F}_{n}(t;y^{\prime}_{n}),F_{n}(t;\cdot),\tilde{g}^{(F_{n})}(t;\cdot)\right)\bigg{\|}\leq\bar{K}(\Delta_{n}(t)+o(1)),$		(5.51)

		$\displaystyle\Delta_{n}(t+{\rm d}t)\leq$		(5.52)
		$\displaystyle\Delta_{n}(t)+2{\rm d}t\bigg{[}\bar{K}(\Delta_{n}(t)+o(1))+\widehat{K}\int_{[0,1]^{2}}{\rm d}x\,{\rm d}y\,\left\|g^{(F_{n})}(t;x_{n}^{\prime},y^{\prime}_{n})-g^{(F)}(t;x^{\prime},y^{\prime})\right\|\bigg{]}.$		(5.52)

		$\displaystyle\Delta_{n}(t+{\rm d}t)\leq$		(5.53)
		$\displaystyle 2\int_{0}^{t+{\rm dt}}{\rm d}s\bigg{[}\bar{K}(\Delta_{n}(s)+o(1))+\widehat{K}\int_{[0,1]^{2}}{\rm d}x\,{\rm d}y\,\left\|g^{(F_{n})}(s;x_{n}^{\prime},y^{\prime}_{n})-g^{(F)}(s;x^{\prime},y^{\prime})\right\|\bigg{]}.$		(5.53)

		$\displaystyle\Delta_{n}(t+{\rm d}t)\leq$		(5.54)
		$\displaystyle 2\int_{0}^{t+{\rm dt}}{\rm d}s\bigg{[}\bar{K}(\Delta_{n}(s)+o(1))+(\widehat{K}+o(1))\int_{[0,1]^{2}}{\rm d}x\,{\rm d}y\,\left\|g^{(F_{n})}(s;x,y)-g^{(F)}(s;x,y)\right\|\bigg{]}$
		$\displaystyle=2\int_{0}^{t+{\rm dt}}{\rm d}s\Delta_{n}(s)(\bar{K}+\widehat{K}+o(1)).$

		$\displaystyle\lim_{\delta\downarrow 0}\limsup_{n\to\infty}\frac{1}{n}\log\mathbb{P}\left(\sup_{t\in[0,T]}C_{n}(t,\delta)>\varepsilon{n\choose 2}\right)$		(5.55)
		$\displaystyle\qquad\leq\lim_{\delta\downarrow 0}\limsup_{n\to\infty}\frac{1}{n}\log\mathbb{P}\left(\max_{i\in\{0,\dots,\lfloor\frac{T}{\delta}\rfloor\}}C_{n}(i\delta,2\delta)>\varepsilon{n\choose 2}\right)$
		$\displaystyle\qquad\leq\lim_{\delta\downarrow 0}\limsup_{n\to\infty}\frac{1}{n}\log\left\{\sum_{i=0}^{\lfloor\frac{T}{\delta}\rfloor}\max_{i\in\{0,\dots,\lfloor\frac{T}{\delta}\rfloor}\mathbb{P}\left(C_{n}(i\delta,2\delta)>\varepsilon{n\choose 2}\right)\right\}.$

	$\displaystyle\bigg{[}\min_{u\in[0,2\delta]}H(t+u,X_{i}(t)+u,$	$\displaystyle X_{j}(t)+u,F_{n}(t+u)),$		(5.56)
		$\displaystyle\qquad\max_{u\in[0,2\delta]}H(t+u,X_{i}(t)+u,X_{j}(t)+u,F_{n}(t+u)\bigg{]}.$		(5.56)

	$\displaystyle\max_{u\in[0,2\delta]}H(t+u,$	$\displaystyle X_{i}(t)+u,X_{j}(t)+u,F_{n}(t+u))$		(5.57)
		$\displaystyle-\min_{u\in[0,2\delta]}H(t+u,X_{i}(t)+u,X_{j}(t)+u,F_{n}(t+u))\leq 4K\delta.$		(5.57)

		$\displaystyle\lim_{\delta\downarrow 0}\limsup_{n\to\infty}\frac{1}{n}\log\left\{\sum_{i=0}^{\lfloor\frac{T}{\delta}\rfloor}\max_{i\in\{0,\dots,\lfloor\frac{T}{\delta}\rfloor}\mathbb{P}\left(C_{n}(i\delta,2\delta)>\varepsilon{n\choose 2}\right)\right\}$		(5.58)
		$\displaystyle\leq\lim_{\delta\downarrow 0}\limsup_{n\to\infty}\frac{1}{n}\log\left\{\sum_{i=0}^{\lfloor\frac{T}{\delta}\rfloor}\max_{i\in\{0,\dots,\lfloor\frac{T}{\delta}\rfloor}\mathbb{P}\left(C^{(v)}_{n}(i\delta,2\delta)>\frac{\varepsilon}{2}{n\choose 2}\right)+\mathbb{P}\left(C^{(e)}_{n}(i\delta,2\delta)>\frac{\varepsilon}{2}{n\choose 2}\right)\right\}$
		$\displaystyle\leq\lim_{\delta\downarrow 0}\limsup_{n\to\infty}\frac{1}{n}\log\frac{T}{\delta}\left\{\mathbb{P}\left(X_{n}^{(v)}(\delta)>\frac{\varepsilon}{2}{n\choose 2}\right)+\mathbb{P}\left(X_{n}^{(e)}(\delta)>\frac{\varepsilon}{2}{n\choose 2}\right)\right\}$
		$\displaystyle\leq\lim_{\delta\downarrow 0}\limsup_{n\to\infty}\frac{1}{n}\log\frac{T}{\delta}\left\{\exp\left({-n\frac{\varepsilon}{2}\left(\log\frac{\varepsilon}{1-\mathrm{e}^{-\gamma\delta}}-1\right)}\right)+\exp\left({-{n\choose 2}\frac{\varepsilon}{2}\left(\log\frac{\varepsilon}{4K\delta}-1\right)}\right)\right\}$
		$\displaystyle=-\infty,$

		$\displaystyle I^{(\boldsymbol{t})}_{v}(\mu_{1},\dots,\mu_{r})$		(A.6)
		$\displaystyle=\sup_{f_{1},\dots,f_{r}\in C_{b}(\mathbb{R}_{+})^{r}}\bigg{[}\sum_{i=1}^{r}\int_{\mathbb{R}_{+}}\mu_{i}({\rm d}z)f_{i}(z)$
		$\displaystyle\qquad\qquad-\int_{\mathbb{R}_{+}}v({\rm d}x)\log\int_{\mathbb{R}_{+}^{r}}P^{(\boldsymbol{t})}_{x}({\rm d}y^{(1)},\dots,{\rm d}y^{(r)})\exp\left(\sum_{i=1}^{r}f_{i}(y^{(i)})\right)\bigg{]},$

	$\displaystyle I_{v}^{(t)}(\mu)$	$\displaystyle=\sup_{f\in C_{b}([0,\infty))}\left[\int_{0}^{\infty}\mu({\rm d}z)f(z)-\int_{0}^{\infty}v({\rm d}x)\log\left(\int_{0}^{\infty}P^{(t)}_{x}({\rm d}y)\mathrm{e}^{f(y)}\right)\right]$
		$\displaystyle=\sup_{f\in C_{b}[0,\infty)}\left[\int_{0}^{\infty}\mu({\rm d}z)f(z)-\int_{0}^{\infty}v({\rm d}x)\log\left(\mathrm{e}^{-\gamma t+f(x+t)}+\int_{0}^{t}{\rm d}y\,\gamma\mathrm{e}^{-\gamma y+f(y)}\right)\right].$		(A.8)

	$\displaystyle\int_{0}^{t-}\mu({\rm d}z)\,f(z)-\int_{0}^{\infty}[v({\rm d}x)-\mu({\rm d}(x+t))]\log\left(\int^{t}_{0}{\rm d}z\,\gamma\mathrm{e}^{-\gamma z+f(z)}\right)$		(A.13)
	$\displaystyle\qquad=\int_{0}^{t-}\mu({\rm d}z)\,f(z)-\left(1-\int_{t+}^{\infty}\mu({\rm d}x)\right)\log\left(\int^{t}_{0}{\rm d}z\,\gamma\mathrm{e}^{-\gamma z+f(z)}\right),$		(A.14)

	$\displaystyle\eqref{JEq}$	$\displaystyle=\int_{0}^{\infty}\mu({\rm d}(x+t))\log\left(\frac{\mu({\rm d}(x+t))}{[v({\rm d}x)-\mu({\rm d}(x+t))]\,\mathrm{e}^{-\gamma t}}\right)$
		$\displaystyle\quad-\int_{0}^{\infty}v({\rm d}x)\log\left(\frac{v({\rm d}x)}{v({\rm d}x)-\mu({\rm d}(x+t))}\right)$
		$\displaystyle\quad+\int^{t-}_{0}\mu({\rm d}z)\log\left(\frac{\mu({\rm d}z)}{{\rm d}z\,\gamma\,\mathrm{e}^{-\gamma z}}\right)-\int^{t-}_{0}\mu({\rm d}z)\,\log\left(\int^{t-}_{0}\mu({\rm d}z)\right)$
		$\displaystyle=\int_{0}^{\infty}[v({\rm d}x)-\mu({\rm d}(x+t))]\log\left(v({\rm d}x)-\mu({\rm d}(x+t))\right)$
		$\displaystyle\quad+\int^{\infty}_{0}\mu({\rm d}(x+t))\log\left(\frac{\mu({\rm d}(x+t))}{\mathrm{e}^{-\gamma t}}\right)$
		$\displaystyle\quad-\int^{\infty}_{0}v({\rm d}x)\log(v({\rm d}x))-\int^{t-}_{0}\mu({\rm d}z)\,\log\left(\int^{t-}_{0}\mu({\rm d}z)\right)$
		$\displaystyle\quad+\int^{t-}_{0}\mu({\rm d}z)\,\log\left(\frac{\mu({\rm d}z)}{{\rm d}x\,\gamma\,\mathrm{e}^{-\gamma z}}\right)$
		$\displaystyle=\int^{\infty}_{0}v({\rm d}x)\left[\frac{v({\rm d}x)-\mu({\rm d}(x+t))}{v({\rm d}x)}\log\left(\frac{v({\rm d}x)-\mu({\rm d}(x+t))}{v({\rm d}x)}\right)\right.$
		$\displaystyle\quad\quad\left.+\frac{\mu({\rm d}(x+t))}{v({\rm d}x)}\log\left(\frac{\mu({\rm d}(x+t))}{\mathrm{e}^{-\gamma t}v({\rm d}x)}\right)\right]$
		$\displaystyle\quad-\left(\int^{t-}_{0}\mu({\rm d}z)\right)\log\left(\int^{t-}_{0}\mu({\rm d}z)\right)+\int^{t-}_{0}\mu({\rm d}z)\log\left(\frac{\mu({\rm d}z)}{{\rm d}z\,\gamma\,\mathrm{e}^{-\gamma z}}\right).$

	$\displaystyle I_{v}^{(t)}(\mu)$	$\displaystyle=\mu({\rm d}t)\log\left(\frac{\mu({\rm d}t)}{\mathrm{e}^{-\gamma t}}\right)+(1-\mu({\rm d}t))\log\left(\frac{1-\mu({\rm d}t)}{1-\mu({\rm d}t)}\right)+\int^{t-}_{0}\mu({\rm d}z)\log\left(\frac{\mu({\rm d}z}{{\rm d}z\,\gamma\mathrm{e}^{-\gamma z}}\right)$
		$\displaystyle=\mu({\rm d}t)\log\left(\frac{\mu({\rm d}t)}{\mathrm{e}^{-\gamma t}}\right)+\int^{t-}_{0}\mu({\rm d}z)\log\left(\frac{\mu({\rm d}z)}{{\rm d}z\,\gamma\mathrm{e}^{-\gamma z}}\right),$

Graphon-valued processes with vertex-level fluctuations

Abstract.

1. Introduction

1.1. Background

1.2. Motivation

1.3. Outline

2. Large deviations for static random graphs

2.1. Graphs and graphons

2.2. Inhomogeneous Erdős-Rényi random graph

Theorem 2.1.

2.3. Inhomogeneous random graphs with type dependence

2.4. Key assumptions

Assumption 2.2.

Assumption 2.3.

Assumption 2.4.

Assumption 2.5.

2.5. LDP for IRGTs

Theorem 2.6.

3. Graphon-valued processes

3.1. The model

Assumption 3.1.

3.2. An illustrative example

3.3. Sample-path large deviations

Assumption 3.2.

Proposition 3.3.

Proposition 3.4.

Theorem 3.5.

3.4. Stochastic process convergence

Assumption 3.6.

Assumption 3.7.

Proposition 3.8.

Assumption 3.9.

Theorem 3.10.

4. Applications and extensions

4.1. Beyond conditional independence of edges

4.1.1. Model and LDP

Theorem 4.1.

Proposition 4.2.

4.1.2. Numerical illustration

4.2. Different edge-switching dynamics, equivalent sample-path LDP

Proposition 4.3.

4.3. The most likely path to an unusually small edge density

4.3.1. Most likely state of the process at time TT

Proposition 4.4.

5. Proofs

5.1. Proofs of the results in Section 2

5.1.1. Inhomogeneous Erdős–Rényi random graphs

Lemma 5.1.

Proof.

Lemma 5.2.

Proof.

5.1.2. Inhomogeneous random graphs with type dependence

Lemma 5.3.

Proof.

Lemma 5.4.

Proof.

5.2. Proofs of the results in Section 3

5.2.1. Large deviations

Lemma 5.5.

Proof.

5.2.2. Weak convergence

Lemma 5.6.

Proof.

5.3. Proofs of the results in Section 4

5.3.1. Proofs of the results in Section 4.1

Lemma 5.7.

Proof.

Lemma 5.8.

Proof.

5.3.2. Proofs of the results in Section 4.2

5.3.3. Proofs of the results in Section 4.3

Appendix A Rate function for the driving process

A.1. LDP for driving process in Section 4

Proposition A.1.

Proof.

Lemma A.2.

Lemma A.3.

Lemma A.4.

Proof.

Lemma A.5.

Graphon-valued processes with
vertex-level fluctuations

4.3.1. Most likely state of the process at time $T$

$\displaystyle I^{(\boldsymbol{t})}_{v}(\boldsymbol{\mu})$	$\displaystyle=\sum^{k}_{i=1}I_{\mu_{i-1}}^{(t_{i}-t_{i-1})}(\mu_{i})$	(A.20)
	$\displaystyle=\sum_{i=1}^{k}\bigg{[}\int^{\infty}_{0}\mu_{i}({\rm d}(x+\Delta))\log\left(\frac{\mu_{i}({\rm d}(x+\Delta))}{\mu_{i-1}({\rm d}x)e^{-\gamma\Delta}}\right)$	(A.21)
	$\displaystyle\qquad+\int^{\infty}_{0}[\mu_{i-1}({\rm d}x)-\mu_{i}({\rm d}(x+\Delta))]\log\left(\frac{\mu_{i-1}({\rm d}x)-\mu_{i}({\rm d}(x+\Delta))}{\mu_{i-1}({\rm d}x)\int^{\Delta-}_{0}\mu_{i}({\rm d}z)}\right)$	(A.22)
	$\displaystyle\qquad+\int^{\Delta-}_{0}\mu_{i}({\rm d}z)\log\left(\frac{\mu_{i}({\rm d}z)}{{\rm d}z\,\gamma e^{-\gamma z}}\right)\bigg{]}.$	(A.23)