Algorithmic Randomness, Effective Disintegrations, and Rates of Convergence to the Truth

Simon M. Huttegger
Department of Logic and Philosophy of Science
5100 Social Science Plaza
University of California, Irvine
Irvine, CA 92697-5100, U.S.A.
http://faculty.sites.uci.edu/shuttegg/

Sean Walsh
Department of Philosophy
University of California, Los Angeles
390 Portola Plaza, Dodd Hall 321
Los Angeles, CA 90095-1451
http://philosophy.ucla.edu/person/sean-walsh/

Francesca Zaffora Blando
Department of Philosophy
Carnegie Mellon University
Baker Hall 161
5000 Forbes Avenue
Pittsburgh, PA 15213

Abstract.

Lévy’s Upward Theorem says that the conditional expectation of an integrable random variable converges with probability one to its true value with increasing information. In this paper, we use methods from effective probability theory to characterise the probability one set along which convergence to the truth occurs, and the rate at which the convergence occurs. We work within the setting of computable probability measures defined on computable Polish spaces and introduce a new general theory of effective disintegrations. We use this machinery to prove our main results, which (1) identify the points along which certain classes of effective random variables converge to the truth in terms of certain classes of algorithmically random points, and which further (2) identify when computable rates of convergence exist. Our convergence results significantly generalize earlier results within a unifying novel abstract framework, and there are no precursors of our results on computable rates of convergence. Finally, we make a case for the importance of our work for the foundations of Bayesian probability theory.

2010 Mathematics Subject Classification:

Primary: 03D32; Secondary: 03A10, 03D78, 03F60, 60A10, 60B05, 60G48

Many thanks to Jeremy Avigad, Peter Cholak, Johanna Franklin, Alexander Kastner, Josiah Lopez-Wild, Christopher Porter, Michael Rescorla, and Jason Rute for discussion and feedback.

1. Introduction

Measure-theoretic probability was developed in the early twentieth century in response to pressing problems in statistical physics, astronomy, and pure mathematics, and today it is used throughout the mathematical sciences. (Footnote 1: For a historical survey, see [73].) What proved to be an especially significant conceptual advance was the ability to say that certain properties are true with probability one. Early examples include Borel’s strong law of large numbers, irrational rotations of the unit interval, Birkhoff’s ergodic theorem, and Poincaré’s recurrence theorem. It is often unclear, however, what the corresponding probability one sets are. That is to say, measure-theoretic results only assert the existence of certain sets of probability one but fail to characterise the points that are elements of those sets. This was pointed out as early as 1916 by Weyl, who insisted that a deeper understanding of the sets involved in zero-one laws was necessary in order to interpret the results of measure-theoretic probability. (Footnote 2: [74].)

The theory of algorithmic randomness involves a fine-grained classification of different measure one sets, with the primary exemplars being the Martin-Löf random points, the Schnorr random points, and the Kurtz random points. (Footnote 3: The original papers of Martin-Löf, Schnorr, and Kurtz are: [41], [42], [64], [65], [36]. There are now several comprehensive references on algorithmic randomness, including [39], [50], [15], [68].) Originally this was done for the uniform “fair coin” measure on Cantor space (the space of infinite sequences of 0’s and 1’s) and the famous results pertained to algorithmic incompressibility and the Turing degrees. (Footnote 4: For instance, the Levin-Schnorr characterisation of Martin-Löf randomness in terms of initial segment complexity, and the Kučera-Gács proof that every Turing degree is below the degree of a Martin-Löf random. See, e.g., [15, Theorem 6.3.10 p. 239, Theorem 8.3.2 p. 326] for statement and references.) However, the theory has recently been developed for a more general class of computable probability measures on computable spaces, by authors such as Gács, Hoyrup and Rojas, Reimann, Rute, and Miyabe. (Footnote 5: [22], [29], [30], [56], [61], [46], [32], listed in rough chronological order.) A related recent trend has been showing that effectivized versions of classical theorems on almost sure convergence prove convergence exactly on various classes of algorithmically random points. (Footnote 6: [7], [47]. The latter is, in part, a survey and contains many further references.) This arguably contributes to the deeper understanding along the lines suggested by Weyl. Further, this recent trend suggests reconceiving the various notions of algorithmic randomness less as rival conceptual analyses of a pre-theoretic phenomenon, à la the Church-Turing thesis, and more as delineations of extensionally and conceptually distinct kinds of probability one events. (Footnote 7: This point is due to [54].) Or, if one puts the point in terms of the corresponding null sets, the various notions demarcate different types of impossibility that occur throughout measure-theoretic mathematics and its many applications.

Our main theorems (Theorems 1.5, 1.6, 1.8, 1.9, 1.11) contribute to this recent literature by characterising, in terms of algorithmic randomness, the points at which Lévy’s Upward Theorem holds for various classes of effective random variables, as well as providing information about the rates of convergence to the truth.

Let us recall the classical statement of Lévy’s Theorem. (Footnote 8: [75, p. 134], [38, §41 pp. 128 ff].) Suppose $(X,\mathscr{F},\nu)$ is a probability triple. Let $\mathscr{F}_{1},\mathscr{F}_{2},\ldots$ be an increasing sequence of sub- $\sigma$ -algebras of $\mathscr{F}$ whose union generates $\mathscr{F}$ . Then Lévy’s Upward Theorem states that one has $\mathbb{E}_{\nu}[f\mid\mathscr{F}_{n}]\rightarrow f$ both $\nu$ -a.s. and in $L_{1}(\nu)$ , for any $\mathscr{F}$ -measurable function $f$ in $L_{1}(\nu)$ . In this, $\mathbb{E}_{\nu}[f\mid\mathscr{G}]$ denotes the conditional expectation of $f$ relative to $\mathscr{G}$ , which, recall, is defined as the $\nu$ -a.s. unique $\mathscr{G}$ -measurable function $g$ such that $\int_{A}g\;d\nu=\int_{A}f\;d\nu$ for all events $A$ in the sub- $\sigma$ -algebra $\mathscr{G}$ of $\mathscr{F}$ .
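As a rough illustration of the statement just recalled, the following sketch computes conditional expectations exactly in a finite truncation of Cantor space with the uniform measure, where $\mathscr{F}_{n}$ is generated by the first $n$ bits. The truncation depth `DEPTH`, the random variable `f`, and the sample point `omega` are illustrative assumptions and do not come from the text; the sketch only shows the averaging that conditional expectation performs in this simple setting.

```python
from fractions import Fraction
from itertools import product

DEPTH = 10  # hypothetical truncation depth: sample points are bit strings of this length


def f(omega):
    """Illustrative random variable: the dyadic rational 0.omega_1 omega_2 ... in binary."""
    return sum(Fraction(bit, 2 ** (i + 1)) for i, bit in enumerate(omega))


def cond_exp(g, omega, n):
    """E[g | F_n](omega) under the uniform measure: the average of g over
    all points that agree with omega on the first n bits."""
    prefix = tuple(omega[:n])
    extensions = [prefix + tail for tail in product((0, 1), repeat=DEPTH - n)]
    return sum(g(w) for w in extensions) / len(extensions)


omega = (1, 0, 1, 1, 0, 0, 1, 0, 1, 1)  # an arbitrary sample point
# Distance between the estimate after n observed bits and the true value f(omega).
errors = [abs(cond_exp(f, omega, n) - f(omega)) for n in range(DEPTH + 1)]
```

In this toy case the error after $n$ bits is at most $2^{-(n+1)}$ and vanishes once all `DEPTH` bits are observed; nothing about exceptional null sets is visible at finite depth.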

The convergence in Lévy’s Upward Theorem is one of the cornerstones of Bayesian epistemology. (Footnote 9: [18, pp. 144 ff], [28, pp. 28-29].) The random variable $f$ can be thought of as a quantity that a Bayesian agent, whose degrees of belief are captured by the underlying probability measure $\nu$ , is trying to estimate by repeatedly performing an experiment. The quantity $\mathbb{E}_{\nu}[f\mid\mathscr{F}_{n}]$ can be seen as encoding the agent’s opinions regarding the value of $f$ after having observed the outcomes of the first $n$ experiments. Lévy’s Upward Theorem then implies that, with probability one, the Bayesian agent’s opinions regarding the value of $f$ will converge to $f$ ’s true value in the limit.

To be able to characterise the $\nu$ -measure one set on which $\mathbb{E}_{\nu}[f\mid\mathscr{F}_{n}]\rightarrow f$ , one needs to choose versions of $\mathbb{E}_{\nu}[f\mid\mathscr{F}_{n}]$ and $f$ . It seems natural to focus attention on classes of effective random variables $f$ defined relative to spaces $X$ and probability measures $\nu$ which are themselves computable. For computability seems like a natural constraint to place on our Bayesian agent, and many of the examples of probability measures and random variables that occur in practice and applications are computable. However, the Bayesian perspective recommends few other general constraints on what is eligible to be a credence or a prior. Hence it is important to develop the theory for a maximally broad class of computable spaces and probability measures. (Footnote 10: The distinctive status of the computability hypothesis which we are suggesting raises a host of interesting and complex conceptual questions, ranging from the nature of cognition to the character of inductive inference. We put these issues aside here.)

1.1. Effective probability and algorithmic randomness

The computable Polish spaces with computable probability measures are such an appropriately general class of spaces and measures. In this brief section we collect together the few definitions we need about their theory. The reader already familiar with these concepts can easily skip to the next section (§1.2).

A Polish space is a topological space which is separable and completely metrizable. All the paradigmatic spaces such as the reals and their products and their closed and open subspaces are Polish, and similarly for Cantor space. Descriptive set theory takes as its subject matter the Borel and projective subsets of Polish spaces. (Footnote 11: [34], [48].) When topological considerations are salient or when one needs i.i.d. sequences with prescribed distributions, it is often assumed in contemporary probability theory that the sample space is a Polish space or a Borel subset thereof. (Footnote 12: A Borel subset of a Polish space together with its Borel subsets is known as a standard Borel space in descriptive set theory (cf. [34, Definition 12.5, Corollary 13.4]). For representative examples of standard Borel spaces within probability, see e.g. [17, p. 51], [33, p. 7, pp. 561 ff]. For a classic probability text that foregrounds standard Borel spaces, see [51, Chapter 1].)

A computable Polish space $X$ is a Polish space with a distinguished countable dense set $x_{0},x_{1},\ldots$ and a distinguished complete compatible metric $d$ such that the distance $d(x_{i},x_{j})$ between any two elements of the countable dense set is a computable real, uniformly in $i,j\geq 0$ . (Footnote 13: A standard reference for computable Polish spaces is [48, Chapter 3]. A comparison to the Weihrauch approach to computable analysis is given in [24]. One can view the treatment of metric spaces in [70] as an axiomatization of Polish spaces and reals which are computable in an oracle. Finally, the study of computable Polish spaces in and of themselves is distinct from effective descriptive set theory, which usually refers to techniques for proving results about all Borel sets by first proving them for lightface Borel sets in Baire space; the most famous example of this is the Glimm-Effros dichotomy (cf. [23, Chapter 6]).) (The enumeration of the distinguished countable dense set can contain repetitions, and will need to do so in finite spaces.)

In a computable Polish space, an open set $U$ is c.e. open if there is a computable function $n(\cdot)$ which enumerates a subsequence $x_{n(i)}$ of the countable dense set and a computable sequence $r_{i}$ of rational radii such that $U=\bigcup_{i}B_{d}(x_{n(i)},r_{i})$ . In this, $B_{d}(x,r)$ denotes the open ball with centre $x$ and radius $r$ relative to metric $d$ (when the metric $d$ is clear from context, we just write $B(x,r)$ ). The name “c.e. open” is chosen since the natural numbers are a computable Polish space with the discrete metric, and the c.e. opens in this space are precisely the computably enumerable sets of natural numbers, one of the canonical objects of the contemporary theory of computation. (Footnote 14: See [71], a standard reference.) Further, many of the elementary methods of studying c.e. sets (e.g., universal enumerations) extend to c.e. open sets. The complements of c.e. open sets are called effectively closed sets (cf. §2.1).

In a computable Polish space, we say that a sequence $x_{n}\rightarrow x$ at geometric rate $b$ if $d(x,x_{n})\leq b^{-n}$ for all $n\geq 0$ . We say that a sequence $x_{n}\rightarrow x$ fast if $x_{n}\rightarrow x$ at geometric rate $b=2$ . We then say that a point $x$ is computable if there is a computable function $n(\cdot)$ which enumerates a subsequence $x_{n(i)}$ of the countable dense set such that $x_{n(i)}\rightarrow x$ fast. This subsequence is called a witness to the computability of $x$ . In Cantor space with its usual metric, the computable points are precisely the computable subsets of natural numbers.
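To make the notion of a witness concrete, here is a minimal sketch for the real number $\sqrt{2}$, taking the rationals as the countable dense set. The function name `sqrt2_approx` is a hypothetical choice for this illustration; it returns rationals $q_{n}$ with $|\sqrt{2}-q_{n}|\leq 2^{-n}$, which is convergence at geometric rate $b=2$ in the sense just defined.

```python
from fractions import Fraction
from math import isqrt


def sqrt2_approx(n):
    """Return a rational q_n with |sqrt(2) - q_n| <= 2**-n."""
    # isqrt(2 * 4**n) is the floor of sqrt(2) * 2**n, so dividing by 2**n
    # gives a dyadic rational lying within 2**-n below sqrt(2).
    return Fraction(isqrt(2 * 4 ** n), 2 ** n)
```

The bound can be checked without floating point: $q_{n}^{2}\leq 2<(q_{n}+2^{-n})^{2}$ holds for each $n$.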

In the real numbers, the countable dense set is the rationals, and the above definition of computable points is precisely how Turing defined computable real numbers at the outset of the theory of computation nearly a century ago. (Footnote 15: [72].) An equivalent formalisation of computable reals is by Dedekind cuts. A real $x$ is left-c.e. (resp. right-c.e.) if its left Dedekind cut $\{q\in\mathbb{Q}:q<x\}$ in the rationals is a c.e. set (resp. if its right Dedekind cut $\{q\in\mathbb{Q}:x<q\}$ in the rationals is a c.e. set). One can show that a real is computable iff it is both left-c.e. and right-c.e., and uniformly so. (An example of a left-c.e. real that is not computable is $\sum_{n}2^{-f(n)}$ , where $f:\mathbb{N}\rightarrow\mathbb{N}$ is an injective computable function with non-computable range.) (Footnote 16: These and other effective aspects of real numbers are treated extensively in e.g. [57], [15, Chapter 5].)
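The shape of a left-c.e. approximation can be sketched as follows: the partial sums of $\sum_{n}2^{-f(n)}$ form an increasing computable sequence of rationals converging to the real from below. The function `f` below is only a stand-in with computable range, chosen so that the limit ($2/3$) is known and the code can be checked; in the non-computable example from the text, the partial sums are computed the same way, but no computable bound on the distance to the limit is available.

```python
from fractions import Fraction


def f(n):
    """Hypothetical stand-in for an injective computable function."""
    return 2 * n + 1


def lower_approx(stage):
    """Partial sum of sum_n 2**-f(n) over n < stage: a rational below the limit."""
    return sum((Fraction(1, 2 ** f(n)) for n in range(stage)), Fraction(0))
```

For this particular `f`, the gap between the limit and the approximation at a given stage is exactly $\tfrac{2}{3}\cdot 4^{-\text{stage}}$.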

With these preliminaries in place, one can quickly define the required core notions from algorithmic randomness and effective probability. These are all needed in order to formally state our main theorems, but one might restrict oneself to (1)-(8) on a first pass and come back to the others as needed.

Definition 1.1.

(Core notions)

(1)

A function $f:X\rightarrow(-\infty,\infty]$ is lower semi-computable (abbreviated lsc) if for all rational $q$ , the set $f^{-1}(q,\infty]$ is uniformly c.e. open.
(2)

A function $f:X\rightarrow[-\infty,\infty)$ is upper semi-computable (abbreviated usc) if for all rational $q$ , the set $f^{-1}[-\infty,q)$ is uniformly c.e. open.
(3)

A probability measure $\nu$ is computable if $\nu(U)$ is uniformly left-c.e. as $U$ ranges over c.e. opens.
(4)

Given a computable probability measure $\nu$ and a computable real $p\geq 1$ , a function $f:X\rightarrow[0,\infty]$ is an $L_{p}(\nu)$ Schnorr test if it is lsc and if $\|f\|_{p}$ is a computable real, where this denotes the $p$ -norm $\|f\|_{p}=(\int\left|f\right|^{p}\;d\nu)^{\frac{1}{p}}$ .
(5)

Given a computable probability measure $\nu$ and a computable real $p\geq 1$ , a function $f:X\rightarrow[0,\infty]$ is an $L_{p}(\nu)$ Martin-Löf test if it is lsc and $\|f\|_{p}<\infty$ .
(6)

A point $x$ in $X$ is Kurtz random relative to $\nu$ (abbreviated $\mathsf{KR}^{\nu}(X)$ ) if $x$ is in every c.e. open $U$ with $\nu(U)=1$ .
(7)

A point $x$ in $X$ is Schnorr random relative to $\nu$ (abbreviated $\mathsf{SR}^{\nu}(X)$ ) if $f(x)<\infty$ for any $L_{1}(\nu)$ Schnorr test $f$ (equivalently, for any $L_{p}(\nu)$ Schnorr test, for $p\geq 1$ computable).
(8)

A point $x$ in $X$ is Martin-Löf random relative to $\nu$ (abbreviated $\mathsf{MLR}^{\nu}(X)$ ) if $f(x)<\infty$ for any $L_{1}(\nu)$ Martin-Löf test $f$ (equivalently, for any $L_{p}(\nu)$ Martin-Löf test, for $p\geq 1$ computable).
(9)

A computable basis $\mathscr{B}$ for $X$ is a computable sequence of c.e. opens such that every c.e. open can be effectively written as a union of elements in $\mathscr{B}$ .
(10)

If $\nu$ is a computable probability measure, then a $\nu$ -computable basis $\mathscr{B}$ for $X$ is a computable basis such that (i) finite unions from $\mathscr{B}$ uniformly have $\nu$ -computable measure, and (ii) each c.e. open in $\mathscr{B}$ is uniformly paired with an effectively closed superset of the same $\nu$ -measure. If $\nu$ is clear from context, we simply say measure computable basis instead of $\nu$ -computable basis.
(11)

A sub- $\sigma$ -algebra $\mathscr{F}$ of the Borel sets on $X$ is $\nu$ -effective if it is generated by a computable sequence of events $\{A_{m}:m\geq 0\}$ from the algebra $\mathscr{A}$ generated by a $\nu$ -computable basis $\mathscr{B}$ . (Footnote 17: When working with $\mathscr{A}$ , we assume that we are working with the codes for Boolean combinations of elements of $\mathscr{B}$ , and only by extension with the sets that they define. This is because there are some spaces where the Boolean algebra structure on the quotient is not computable.) We say that $\mathscr{F}$ is generated by $\{A_{m}:m\geq 0\}$ .
(12)

A full $\nu$ -effective filtration $\mathscr{F}_{n}$ (resp. almost-full $\nu$ -effective filtration) is an increasing sequence $\mathscr{F}_{n}$ of uniformly $\nu$ -effective sub- $\sigma$ -algebras generated by a uniformly computable sequence $\{A_{n,m}:m\geq 0\}$ from the algebra $\mathscr{A}$ generated by a $\nu$ -computable basis $\mathscr{B}$ , which is further equipped with a uniform procedure for going from a c.e. open $U$ to a computable sequence $A_{n_{i},m_{i}}$ such that $U=\bigcup_{i}A_{n_{i},m_{i}}$ (resp. $U=\bigcup_{i}A_{n_{i},m_{i}}$ on $\mathsf{KR}^{\nu}(X)$ ).
(13)

If $x$ is a point of the computable Polish space $X$ and $Y$ is a subset of Baire space (the space of all functions from natural numbers to natural numbers), then $x$ weakly computes an element of $Y$ if, for every sequence $x_{n(i)}$ from the countable dense set of $X$ such that $x_{n(i)}\rightarrow x$ fast, there is $y$ in $Y$ which is Turing reducible to the function $i\mapsto n(i)$ . If $Y=\{y\}$ , then we just say that $x$ weakly computes $y$ . (Footnote 18: One can extend Turing reducibility from a relation between sets of natural numbers to a relation between closed subsets of Baire space. In this setting, the notion of weak computation is called Muchnik reducibility, and is contrasted to a strong uniform notion called Medvedev reducibility. See [69], [27] for introduction and references, although this theory is usually focused on effectively closed sets. Given a point $x$ , the set of functions $i\mapsto n(i)$ such that $x_{n(i)}\rightarrow x$ at a fixed rate, in the sense of (15), is a closed subset of Baire space.) (Footnote 19: If $X$ is Cantor space or the reals, then for each point $x$ of the space, there is a sequence $i\mapsto n(i)$ of least Turing degree such that $x_{n(i)}\rightarrow x$ fast. In these settings, computational properties of the point of the space usually refer to those of this sequence. However, there are spaces for which there are points with no sequence of least Turing degree. See Miller [43].)
(14)

If $x$ is a point of the computable Polish space $X$ , and $\mathcal{C}$ is any collection of Turing degrees (equivalence classes of elements of Baire space under Turing reducibility), then we say that $x$ is in $\mathcal{C}$ if there is some $i\mapsto n(i)$ whose Turing degree is in $\mathcal{C}$ such that $x_{n(i)}\rightarrow x$ fast, where $x_{j}$ again enumerates the countable dense set. In the case where $\mathcal{C}$ just consists of the computable degree, note that $x$ is in $\mathcal{C}$ iff $x$ is computable as a point of $X$ .
(15)

Suppose $y_{n},y$ are elements in a metric space $Y$ such that $y_{n}\rightarrow y$ . Then a rate of convergence for $y_{n}\rightarrow y$ is a function $m:\mathbb{Q}^{>0}\rightarrow\mathbb{N}$ such that for all rational $\epsilon>0$ and all $n\geq m(\epsilon)$ one has $d(y_{n},y)<\epsilon$ . (Footnote 20: If $y_{n}\rightarrow y$ at geometric rate $b>1$ in a computable Polish space, then one defines a rate in the sense of (15) by setting $m(\epsilon)=n$ for the least $n$ such that $b^{-n}<\epsilon$ .) Often in practice we use the case where $Y$ is the reals and $y_{n}=f_{n}(x)$ and $y=f(x)$ , where $f_{n},f$ are real-valued functions. A synonym for rate is modulus, and so we often use the $m$ variable for rates.
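The passage from a geometric rate to a rate of convergence in the sense of item (15) can be sketched as follows, with $m(\epsilon)$ the least $n$ such that $b^{-n}<\epsilon$. The helper name `geometric_rate` is an assumption made for this illustration, and the base $b$ is taken to be rational so that all comparisons are exact.

```python
from fractions import Fraction


def geometric_rate(b):
    """Return m with m(eps) = least n such that b**-n < eps, for rational b > 1."""
    def m(eps):
        n = 0
        # Search upward; the loop terminates because b**-n tends to 0 when b > 1.
        while Fraction(1) / Fraction(b) ** n >= eps:
            n += 1
        return n
    return m


m = geometric_rate(2)  # the rate attached to "fast" convergence (b = 2)
```

If $d(y_{n},y)\leq b^{-n}$ for all $n$, then for $n\geq m(\epsilon)$ one gets $d(y_{n},y)\leq b^{-n}\leq b^{-m(\epsilon)}<\epsilon$, as item (15) requires.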

For the algorithmic randomness notions in (6)-(8), we just write $\mathsf{KR}^{\nu}$ instead of $\mathsf{KR}^{\nu}(X)$ when $X$ is clear from context; and similarly for $\mathsf{SR}^{\nu}$ and $\mathsf{MLR}^{\nu}$ . For $\sigma$ -algebras $\mathscr{F}$ , it is always understood that they are sub- $\sigma$ -algebras of the Borel $\sigma$ -algebra, and when $\nu$ is clear from context we just say effective instead of $\nu$ -effective.

Algorithmic randomness is often formulated in terms of effective null sets, called sequential tests. But the definitions given above in terms of integral tests are easier to work with in our setting and are known to be equivalent to the sequential definitions, by theorems of Levin and Miyabe. (Footnote 21: [37], [45, Theorem 3.5].)

Before turning to disintegrations, it is helpful to introduce notational conventions regarding versions of integrable functions vs. equivalence classes thereof. In the following definition, the $\sigma$ -algebra on $[-\infty,\infty]$ is simply $\{B\cup C:B\subseteq\mathbb{R}\mbox{ Borel},C\subseteq\{-\infty,\infty\}\}$ , and similarly for $[0,\infty]$ .

Definition 1.2.

(Conventions on functions defined pointwise vs. functions defined up to $\nu$ -a.s. equivalence)

Suppose $X$ is a Polish space and suppose that $\nu$ is a finite non-negative measure on the Borel events of $X$ . Then we define:

$\mathbb{L}_{p}(\nu)$ is the set of pointwise defined Borel measurable functions $f:X\rightarrow[-\infty,\infty]$ such that $\|f\|_{p}<\infty$ .

$\mathbb{L}^{+}_{p}(\nu)$ is the set of pointwise defined Borel measurable functions $f:X\rightarrow[0,\infty]$ such that $\|f\|_{p}<\infty$ .

$L_{p}(\nu)$ is the set of equivalence classes of elements of $\mathbb{L}_{p}(\nu)$ under $\nu$ -a.s. equivalence. That is, $L_{p}(\nu)$ is the classical Banach space with norm $\|\cdot\|_{p}$ .

$L^{+}_{p}(\nu)$ is the set of equivalence classes of elements of $\mathbb{L}^{+}_{p}(\nu)$ under $\nu$ -a.s. equivalence. That is, $L^{+}_{p}(\nu)$ is a positive cone in the Banach space $L_{p}(\nu)$ .

Then $\mathbb{L}_{p}(\nu)$ projects onto $L_{p}(\nu)$ by sending a function to its equivalence class, and likewise $\mathbb{L}^{+}_{p}(\nu)$ projects onto $L^{+}_{p}(\nu)$ . Note that $L_{p}(\nu)$ Schnorr tests and Martin-Löf tests from Definition 1.1(4)-(5) are elements of $\mathbb{L}_{p}^{+}(\nu)$ , and they are elements of $L_{p}^{+}(\nu)$ only after passing to the equivalence class.

1.2. Classical and effective disintegrations

We use disintegrations for the versions $\mathbb{E}_{\nu}[f\mid\mathscr{F}_{n}]$ of conditional expectation. While, classically, conditional expectation is defined only $\nu$ -a.s., in order to characterise algorithmic randomness notions in terms of Lévy’s Upward Theorem, we need to select specific versions of conditional expectation. The concept of a disintegration provides a very general way of making such selections. It is due to Rohlin (Footnote 22: [58], [59], [60].) and is routinely used today in ergodic theory and optimal transport (Footnote 23: It is often used in the proof of the Ergodic Decomposition Theorem and the Gluing Lemma. See [19, 154], [63, 182].), and it is closely related to conditional probability distributions. (Footnote 24: [11], [53, §5.3].)

Suppose that $X$ is a Polish space, $\nu$ is a probability measure on the Borel sets of $X$ and $\mathscr{F}$ is a countably generated sub- $\sigma$ -algebra of the Borel $\sigma$ -algebra. Define the equivalence relation $\sim_{\mathscr{F}}$ on $X$ by $x\sim_{\mathscr{F}}x^{\prime}$ iff, for all $A$ in $\mathscr{F}$ , one has $x$ in $A$ iff $x^{\prime}$ in $A$ , and let $[x]_{\mathscr{F}}$ be the corresponding equivalence class. (Footnote 25: Since we are focused on Lévy’s Upward Theorem, we are focusing on countably generated sub- $\sigma$ -algebras $\mathscr{F}$ of the Borel $\sigma$ -algebra. Note that this has the consequence that the relation $\sim_{\mathscr{F}}$ is a smooth Borel equivalence relation (cf. [23, §5.4]). More complicated Borel equivalence relations occur naturally in nearby topics. For instance, Rute examines Lévy’s Downward Theorem (cf. [61, Theorem 11.2], [75, Theorem 14.4]), which in Cantor space results naturally in the sub- $\sigma$ -algebra of $E_{0}$ -invariant events, where $E_{0}$ is the Borel equivalence relation featuring in the Glimm-Effros dichotomy (cf. [23, Definition 6.1.1]).)

Let $\mathcal{M}^{+}(X)$ be the Polish space of non-negative Borel measures on $X$ (cf. §2.2). For Borel measurable $\rho:X\rightarrow\mathcal{M}^{+}(X)$ , whose action is written as $x\mapsto\rho_{x}$ , we define the partial map

$$\mathbb{E}_{\nu}[\cdot\mid\mathscr{F}](\cdot):\mathbb{L}_{1}(\nu)\times X\dashrightarrow[-\infty,\infty]\quad\mbox{ by }\quad\mathbb{E}_{\nu}[f\mid\mathscr{F}](x)=\int f(v)\;d\rho_{x}(v)\tag{1.1}$$

This map is a version, that is, it is partially defined on all pairs $(f,x)$ . Note that it is totally defined on $\mathbb{L}_{1}^{+}(\nu)\times X$ , with range $[0,\infty]$ , (Footnote 26: Indeed, it is totally defined on all pairs $(f,x)$ where $f$ is non-negative Borel measurable. But for our purpose of defining a version of $\mathbb{E}_{\nu}[\cdot\mid\mathscr{F}]$ we only need to pay attention to when $f$ is in $\mathbb{L}_{1}(\nu)$ .) and it is totally defined and finite on all simple functions. It is further helpful to keep in mind that whether $\mathbb{E}_{\nu}[f\mid\mathscr{F}](x)$ is finite depends on whether the element $f$ of $\mathbb{L}_{1}(\nu)$ is additionally in $\mathbb{L}_{1}(\rho_{x})$ : that is, it is integrability with respect to $\rho_{x}$ rather than $\nu$ which is at issue.

For a Polish space $X$ , one says that a map $\rho:X\rightarrow\mathcal{M}^{+}(X)$ is the disintegration of $\mathscr{F}$ with respect to $\nu$ if both of the following hold: (Footnote 27: We are following the treatment of Einsiedler-Ward [19, 135]. Since the main examples of disintegrations involve products (cf. Appendices A-B), alternative definitions of disintegrations often involve maps that axiomatize the role that the projection operators play in the paradigmatic examples. For an example of definitions along these lines, see [11], [53, §5.3].)

–

For all $f$ in $\mathbb{L}_{1}(\nu)$ , one has that $\mathbb{E}_{\nu}[f\mid\mathscr{F}]$ is a version of the conditional expectation of $f$ with respect to $\mathscr{F}$ and $\nu$ . (Footnote 28: Hence it is in $\mathbb{L}_{1}(\nu)$ , and thus it is defined and finite for $\nu$ -a.s. many $x$ from $X$ .)
–

For $\nu$ -a.s. many $x$ from $X$ , one has $\rho_{x}(X)=1$ and $\rho_{x}([x]_{\mathscr{F}})=1$ .

A disintegration of $\mathscr{F}$ with respect to $\nu$ exists for any countably generated sub- $\sigma$ -algebra $\mathscr{F}$ of the Borel $\sigma$ -algebra on $X$ . (Footnote 29: [19, 135]. Indeed, a little more is true: one can replace $X$ by one of its Borel subsets. Further, one can relax the assumption that $\mathscr{F}$ is countably generated, provided that one does not insist on $\rho_{x}([x]_{\mathscr{F}})=1$ . Further, it is possible to be more agnostic about the codomain of $\rho$ outside the $\nu$ -measure one set on which it outputs probability measures.) See Appendix A for two classical examples of disintegrations.
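The two clauses of the classical definition can be checked by hand in a finite toy case, where $\mathscr{F}$ is generated by a partition and $\rho_{x}$ is $\nu$ conditioned on the block $[x]_{\mathscr{F}}$. The sketch below is only an illustration under these assumptions: the six-point space, the weights in `nu`, and the partition `blocks` are hypothetical choices, and `cond_exp` implements formula (1.1) as a finite sum.

```python
from fractions import Fraction

# Hypothetical finite sample space with a probability measure nu.
X = range(6)
nu = {0: Fraction(1, 12), 1: Fraction(1, 4), 2: Fraction(1, 6),
      3: Fraction(1, 6), 4: Fraction(1, 4), 5: Fraction(1, 12)}
# The sub-sigma-algebra F is generated by this partition; its blocks are the classes [x]_F.
blocks = [frozenset({0, 1}), frozenset({2, 3, 4}), frozenset({5})]


def block_of(x):
    """Return the equivalence class [x]_F."""
    return next(b for b in blocks if x in b)


def rho(x):
    """rho_x: nu conditioned on the block [x]_F, as a dict of point masses."""
    b = block_of(x)
    mass = sum(nu[v] for v in b)
    return {v: (nu[v] / mass if v in b else Fraction(0)) for v in X}


def cond_exp(f, x):
    """E_nu[f | F](x), computed as the integral of f with respect to rho_x."""
    r = rho(x)
    return sum(f(v) * r[v] for v in X)
```

Since every event of $\mathscr{F}$ is a union of blocks here, it suffices to verify $\int_{A}\mathbb{E}_{\nu}[f\mid\mathscr{F}]\;d\nu=\int_{A}f\;d\nu$ on each block $A$, together with $\rho_{x}(X)=\rho_{x}([x]_{\mathscr{F}})=1$.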

Here is our key definition of effective disintegration:

Definition 1.3.

(Effective disintegrations).

Let $X$ be a computable Polish space. Let $\nu$ be a computable probability measure on $X$ . Let $\mathscr{F}$ be a $\nu$ -effective $\sigma$ -algebra. Let $\mathsf{XR}^{\nu}$ be a $\nu$ -measure one subset of $\mathsf{KR}^{\nu}(X)$ . Then the map $\rho:X\rightarrow\mathcal{M}^{+}(X)$ is an $\mathsf{XR}^{\nu}$ disintegration of $\mathscr{F}$ with respect to $\nu$ if each of the following holds:

(1)

For all $f$ in $\mathbb{L}_{1}(\nu)$ , one has that $\mathbb{E}_{\nu}[f\mid\mathscr{F}]$ is a version of the conditional expectation of $f$ with respect to $\mathscr{F}$ and $\nu$ .
(2)

For all $x$ in $\mathsf{XR}^{\nu}$ , one has that $\rho_{x}(X)=1$ and $\rho_{x}([x]_{\mathscr{F}}\cap\mathsf{XR}^{\nu})=1$ .
(3)

For c.e. open $U$ , the map $x\mapsto\rho_{x}(U)$ is uniformly lsc from $X$ to $[0,\infty)$ .

We further define:

A Kurtz disintegration is simply a $\mathsf{KR}^{\nu}$ disintegration.

A Schnorr disintegration is a map which is both a $\mathsf{KR}^{\nu}$ disintegration and a $\mathsf{SR}^{\nu}$ disintegration.

A Martin-Löf disintegration is a map which is a $\mathsf{KR}^{\nu}$ disintegration and a $\mathsf{SR}^{\nu}$ disintegration and a $\mathsf{MLR}^{\nu}$ disintegration.

Technically, it appears possible to, e.g., be a $\mathsf{SR}^{\nu}$ disintegration but not a $\mathsf{KR}^{\nu}$ disintegration. This is due to the universal quantifier over $\mathsf{XR}^{\nu}$ at the outset of (2). But this possibility does not appear to occur naturally among examples.

Due to space constraints, we have opted to focus on theory in the body of the text, and have put a brief discussion of the many interesting examples of effective disintegrations in Appendix B.

Finally, we can define:

Definition 1.4.

Let $\mathscr{F}_{n}$ be an almost-full effective filtration, equipped uniformly with Kurtz disintegrations $\rho^{(n)}$ . A point $x$ in $X$ is said to be density random with respect to $\rho$ , abbreviated $\mathsf{DR}^{\nu}_{\rho}(X)$ , if $x$ is in $\mathsf{MLR}^{\nu}(X)$ and $\lim_{n}\rho_{x}^{(n)}(U)=\delta_{x}(U)$ for every c.e. open $U$ .

In this, $\delta_{x}$ is the Dirac measure centred at $x$ . With the limit written as such, by the Portmanteau Theorem one sees that it is a strengthening of the weak convergence of the measures $\rho_{x}^{(n)}\rightarrow\delta_{x}$ . Since we use the disintegration to define the conditional expectation, as in equation (1.1) above, the limit in Definition 1.4 can be written equivalently as $\lim_{n}\mathbb{E}_{\nu}[I_{U}\mid\mathscr{F}_{n}](x)=I_{U}(x)$ . With respect to the canonical filtration of length $n$ -strings on Cantor space and its natural disintegration (cf. Example B.1), density randomness has been a focal topic in recent literature on algorithmic randomness. (Footnote 30: [3], [47], [35].) In this setting with $\nu$ being the uniform measure, it is known that $\mathsf{DR}^{\nu}_{\rho}$ is a proper subset of $\mathsf{MLR}^{\nu}$ . One example witnessing that the inclusion is proper is an element $\omega$ of $\mathsf{MLR}^{\nu}$ such that $\{\omega^{\prime}:\omega^{\prime}<_{lex}\omega\}$ is c.e. open, where $<_{lex}$ is the lexicographic order. Definition 1.4 is our suggestion for how to generalise this to the setting of arbitrary effective disintegrations.
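For the canonical filtration on Cantor space with the uniform measure, the quantity $\mathbb{E}_{\nu}[I_{U}\mid\mathscr{F}_{n}](\omega)$ is the conditional measure of $U$ given the cylinder of the first $n$ bits of $\omega$. The sketch below computes it exactly when $U$ is a finite union of cylinders; the generator list `U_GENERATORS` and the function names are hypothetical choices for illustration. For such a clopen $U$ the limit equals $I_{U}(\omega)$ at every point, so the sketch shows only the computation itself, not the delicate behaviour that arises for general c.e. open sets.

```python
from fractions import Fraction
from itertools import product

# Hypothetical open set U: the union of the cylinders [sigma] for sigma listed here.
U_GENERATORS = [(0, 0), (1, 0, 1)]


def in_U(omega):
    """Indicator of U at a point given by a long enough initial segment."""
    return any(tuple(omega[:len(s)]) == s for s in U_GENERATORS)


def cond_measure(tau):
    """nu(U | [tau]) for the uniform measure nu and a finite bit string tau."""
    # Refine to a depth at which membership in U is decided by the string alone.
    depth = max(len(tau), max(len(s) for s in U_GENERATORS))
    tails = list(product((0, 1), repeat=depth - len(tau)))
    hits = sum(in_U(tuple(tau) + t) for t in tails)
    return Fraction(hits, len(tails))


omega = (1, 0, 1, 1, 0)  # initial segment of a point lying in U
trajectory = [cond_measure(omega[:n]) for n in range(len(omega) + 1)]
```

Here `trajectory` starts at $\nu(U)=3/8$ and is constantly $1$ from $n=3$ onward, matching $I_{U}(\omega)=1$.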

1.3. Statement of main results

Our first main theorem is the following:

Theorem 1.5.

(Effective Upward Lévy Theorem for Schnorr Randomness). Suppose that $X$ is a computable Polish space and $\nu$ is a computable probability measure. Suppose that $\mathscr{F}_{n}$ is an almost-full effective filtration, equipped uniformly with Kurtz disintegrations.

If $p\geq 1$ is computable, then the following four items are equivalent for $x$ in $X$ :

(1)

$x$ is in $\mathsf{SR}^{\nu}(X)$ .
(2)

$x$ is in $\mathsf{KR}^{\nu}(X)$ and $\lim_{n}\mathbb{E}_{\nu}[f\mid\mathscr{F}_{n}](x)=f(x)$ for all $L_{p}(\nu)$ Schnorr tests $f$ .
(3)

$x$ is in $\mathsf{KR}^{\nu}(X)$ and $\lim_{n}\mathbb{E}_{\nu}[f\mid\mathscr{F}_{n}](x)$ exists for all $L_{p}(\nu)$ Schnorr tests $f$ and $\lim_{n}\mathbb{E}_{\nu}[I_{U}\mid\mathscr{F}_{n}](x)=I_{U}(x)$ for all c.e. opens $U$ with $\nu(U)$ computable.
(4)

$x$ is in $\mathsf{KR}^{\nu}(X)$ and $\lim_{n}\mathbb{E}_{\nu}[f\mid\mathscr{F}_{n}](x)$ exists for all $L_{p}(\nu)$ Schnorr tests $f$ .

In condition (2), $\lim_{n}\mathbb{E}_{\nu}[f\mid\mathscr{F}_{n}](x)=f(x)$ means that the limit of $\mathbb{E}_{\nu}[f\mid\mathscr{F}_{n}](x)$ exists and is finite and equal to $f(x)$ . Likewise in (3)-(4), the existence of the limit means that it is finite. Note that (3) implies that $\mathsf{SR}^{\nu}$ already proves the analogue of density randomness where we restrict to c.e. open $U$ with $0<\nu(U)<1$ computable.

For rates of convergence, we have:

Theorem 1.6.

(Rates for Upward Lévy Theorem for Schnorr Randomness). For all $X,\nu,\mathscr{F}_{n}$ as in Theorem 1.5, one has:

(1)

For all $x$ in $\mathsf{SR}^{\nu}(X)$ and all computable $p\geq 1$ and all $L_{p}(\nu)$ Schnorr tests $f$ one has that $x$ weakly computes a rate of convergence for $\mathbb{E}_{\nu}[f\mid\mathscr{F}_{n}](x)\rightarrow f(x)$ .
(2)

For all $x$ in $\mathsf{SR}^{\nu}(X)$ of computably dominated degree and all computable $p\geq 1$ and all $L_{p}(\nu)$ Schnorr tests $f$ one has that there is a computable rate for the convergence $\mathbb{E}_{\nu}[f\mid\mathscr{F}_{n}](x)\rightarrow f(x)$ .

The notion in Theorem 1.6(2) is a classical notion from the theory of computation: a Turing degree is computably dominated if any function from natural numbers to natural numbers that is computable from the degree is dominated by a computable function, in the sense that the computable function is eventually above it.³¹ [71, 124], [50, 27]. A more traditional name for this concept is “of hyperimmune-free degree.” This more traditional name comes from an equivalent definition that emerged in the context of Post’s Problem (cf. [71, 133 ff]). For many but not all computable Polish spaces $X$ and $\nu$ in $\mathcal{P}(X)$ computable, there are non-atoms in $\mathsf{MLR}^{\nu}$ (and hence in $\mathsf{SR}^{\nu}$ ) of computably dominated degree. This is a consequence of the existence of universal tests for $\mathsf{MLR}^{\nu}$ and the Computably Dominated Basis Theorem (cf. discussion at Proposition 2.18, Example 2.19, Question 2.20).

It is unknown to us whether Theorem 1.6(2) can be improved, in the sense of an affirmative answer to the following question:

Question 1.7.

For all $x$ in $\mathsf{SR}^{\nu}$ that are not of computably dominated degree and all computable $p\geq 1$ and for all $L_{p}(\nu)$ Schnorr tests $f$ is there a computable rate for the convergence $\mathbb{E}_{\nu}[f\mid\mathscr{F}_{n}](x)\rightarrow f(x)$ ?

The following is the simplest concrete version of the question (cf. Example B.1):

If $\nu$ is uniform measure on Cantor space, and $\omega$ in $\mathsf{SR}^{\nu}$ is not of computably dominated degree, and if $U$ is c.e. open with $0<\nu(U)<1$ computable and $\omega$ not in $U$ , then does the convergence $\nu(U\mid[\omega\upharpoonright n])\rightarrow 0$ have a computable rate?³² For such $U$ , the set $\{\omega:\nu(U\mid[\omega\upharpoonright n])\rightarrow I_{U}(\omega)\}$ can be rather complex. In particular, Carotenuto-Nies [8] show that it is $\Pi^{0}_{3}$ -complete when $U$ is dense. It is not clear to us whether this complexity is located among the $\mathsf{SR}^{\nu}$ ’s or the non-computably dominated $\mathsf{SR}^{\nu}$ ’s, or whether it is reflected in their rates of convergence.
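To make the quantity $\nu(U\mid[\omega\upharpoonright n])$ concrete, the following is a minimal sketch under a strong simplifying assumption: $U$ is a finite union of pairwise disjoint cylinders, hence clopen, whereas the question concerns arbitrary c.e. opens given by an infinite enumeration. The function name `cond_measure` and the example set are hypothetical.

```python
from fractions import Fraction

def cond_measure(cylinders, prefix):
    """nu(U | [prefix]) for the uniform measure on Cantor space.

    U is the union of the cylinders [s] for s in `cylinders`, which is
    assumed to be a finite prefix-free list of binary strings, so that
    the cylinders are pairwise disjoint.
    """
    total = Fraction(0)
    for s in cylinders:
        if s.startswith(prefix):
            # [s] lies inside [prefix]; its relative measure is 2^-(|s|-|prefix|).
            total += Fraction(1, 2 ** (len(s) - len(prefix)))
        elif prefix.startswith(s):
            # [prefix] lies inside [s], so U has conditional measure 1.
            return Fraction(1)
    return total

# Hypothetical example: U = [1] u [01] u [001], and omega = 000... is not in U.
U = ["1", "01", "001"]
omega = "0" * 8
values = [cond_measure(U, omega[:n]) for n in range(5)]
```

In this clopen case the conditional measures reach $0$ after finitely many steps, so a rate is trivially computable. The difficulty in the question comes from c.e. opens $U$ whose enumeration never terminates.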

Under uniform measure on Cantor space, the points which are not of computably dominated degree have measure one, a result due to Martin.³³ Martin’s paper [40] is unpublished, but his proof has subsequently appeared in other sources, such as [15, Theorem 8.21.1 p. 381], [13, Theorem 1.2]. One way to negatively resolve the question would be to show that the non-computable-domination in Martin’s proof (or a variation on it) could be witnessed by a rate of convergence associated to an $L_{p}(\nu)$ Schnorr test, or perhaps even to an indicator function of a c.e. open $U$ with $0<\nu(U)<1$ computable.

We prove Theorems 1.5-1.6 in §8. Theorem 1.5 extends and unifies prior work by Pathak, Rojas, and Simpson, and by Rute (see discussion in §1.4 below), while Theorem 1.6 is entirely new.

Our next theorem pertains to convergence along Martin-Löf tests:

Theorem 1.8.

(Effective Upward Lévy Theorem for Density Randomness $p>1$ ).

Suppose that $X$ is a computable Polish space and $\nu$ in $\mathcal{P}(X)$ is computable. Suppose that $\mathscr{F}_{n}$ is an almost-full effective filtration, equipped uniformly with Kurtz disintegrations $\rho^{(n)}$ .

If $p>1$ is computable, then the following three items are equivalent for $x$ in $X$ :

(1)

$x$ is in $\mathsf{DR}_{\rho}^{\nu}(X)$ .
(2)

$x$ is in $\mathsf{KR}^{\nu}(X)$ and $\lim_{n}\mathbb{E}_{\nu}[f\mid\mathscr{F}_{n}](x)=f(x)$ for all $L_{p}(\nu)$ Martin-Löf tests $f$ .
(3)

$x$ is in $\mathsf{KR}^{\nu}(X)$ and $\lim_{n}\mathbb{E}_{\nu}[f\mid\mathscr{F}_{n}](x)$ exists for all $L_{p}(\nu)$ Martin-Löf tests $f$ and $\lim_{n}\mathbb{E}_{\nu}[I_{U}\mid\mathscr{F}_{n}](x)=I_{U}(x)$ for every c.e. open $U$ .

In contrast to Theorem 1.6(2), one has the following, whose proof is a traditional diagonalization argument deploying the halting set:

Theorem 1.9.

(Rates for Upward Lévy Theorem for Density Randomness).

There are $X,\nu,\mathscr{F}_{n},\rho^{(n)}$ as in Theorem 1.8 which have the property that for every computable $p>1$ and every $x$ in $\mathsf{DR}_{\rho}^{\nu}(X)$ there is an $L_{p}(\nu)$ Martin-Löf test $f$ such that the convergence $\mathbb{E}_{\nu}[f\mid\mathscr{F}_{n}](x)\rightarrow f(x)$ has no computable rate.

Hence, once we shift the tests from Schnorr tests to Martin-Löf tests, we never have points which possess computable rates for all tests. In one sense, Question 1.7 is asking whether there is some way to emulate a halting-set-like construction among the non-computably dominated $\mathsf{SR}^{\nu}$ ’s.

We prove Theorems 1.8-1.9 in §9. Theorem 1.8 extends work from a paper of Miyabe, Nies, and Zhang, which we discuss in the next section, while Theorem 1.9 is entirely new.

Given Theorem 1.5 and Theorem 1.8, it is natural to try to understand whether there is a convergence to the truth characterisation of $\mathsf{MLR}^{\nu}$ , or at least some nearby superset of it (by contrast, $\mathsf{DR}^{\nu}_{\rho}$ is a subset of $\mathsf{MLR}^{\nu}$ ). We thus isolate a class of Martin-Löf tests $f$ which have approximations $f_{s}$ such that $f_{s}\rightarrow f$ in $L_{p}(\nu)$ at an exponential rate, but not a rate that can necessarily be computed. Hence we define the following, where clauses (1)-(3) mimic the canonical approximations of $L_{p}(\nu)$ Schnorr tests (cf. Proposition 2.16, Lemma 3.2), and where clause (4) pertains to exponential rates:

Definition 1.10.

Suppose that $p\geq 1$ is computable.

An $L_{p}(\nu)$ maximal Doob test $f:X\rightarrow[0,\infty]$ is an lsc function in $L_{p}(\nu)$ such that there is a uniformly computable sequence $f_{s}$ of $L_{p}(\nu)$ Schnorr tests satisfying

(1)

$0\leq f_{s}\leq f_{s+1}$ on $\mathsf{KR}^{\nu}$ and $f=\sup_{s}f_{s}$ on $\mathsf{KR}^{\nu}$ .
(2)

$f-f_{s}$ is equal on $\mathsf{KR}^{\nu}$ to a non-negative lsc function.
(3)

$f_{t}-f_{s}$ for $t>s$ is equal on $\mathsf{KR}^{\nu}$ to an $L_{p}(\nu)$ Schnorr test, uniformly in $t>s$ .
(4)

For all $k\geq 0$ , $\sum_{s}\|f-f_{s}\|_{p}\cdot(s+1)^{k}<\infty$ .³⁴ By taking $k=0$ , we have $f_{s}\rightarrow f$ in $L_{p}(\nu)$ , and so $f$ is an $L_{p}(\nu)$ Martin-Löf test, and hence in conjunction with (2) we have that $f-f_{s}$ is equal on $\mathsf{KR}^{\nu}$ to an $L_{p}(\nu)$ Martin-Löf test.

A point $x$ is $p$ -maximal Doob random relative to $\nu$ , abbreviated $\mathsf{MDR}^{\nu,p}(X)$ , if $f(x)<\infty$ for all $L_{p}(\nu)$ maximal Doob tests.

Our theorem on this is the following:

Theorem 1.11.

(Effective Upward Lévy Theorem for Maximal Doob Randomness, $p>1$ ).

If $p>1$ is computable, then the following three items are equivalent for $x$ in $X$ :

(1)

$x$ in $\mathsf{MDR}^{\nu,p}$ .
(2)

$x$ is in $\mathsf{KR}^{\nu}$ and $f(x)=\lim_{n}\mathbb{E}_{\nu}[f\mid\mathscr{F}_{n}](x)$ for all $L_{p}(\nu)$ maximal Doob tests $f$ .
(3)

$x$ is in $\mathsf{KR}^{\nu}$ and $\lim_{n}\mathbb{E}_{\nu}[f\mid\mathscr{F}_{n}](x)$ exists for all $L_{p}(\nu)$ maximal Doob tests $f$ .

We prove Theorem 1.11 in §10. The name “Maximal Doob” in Theorem 1.11 and Definition 1.10 comes from the role played by Doob’s Maximal Inequality (cf. Lemma 5.1(1)) in the proofs in §10. In Proposition 10.1, we note that $\mathsf{MLR}^{\nu}\subseteq\mathsf{MDR}^{\nu,p}\subseteq\mathsf{SR}^{\nu}$ . But we do not know the answer to the following question:

Question 1.12.

Are the inclusions $\mathsf{MLR}^{\nu}\subseteq\mathsf{MDR}^{\nu,p}\subseteq\mathsf{SR}^{\nu}$ proper?

We suspect that $\mathsf{MDR}^{\nu,p}$ is a proper subset of $\mathsf{SR}^{\nu}$ , and that one could show this by establishing the analogue of Theorem 1.9.

We add that we do not know the answer to the following:

Question 1.13.

Do Theorem 1.8 and Theorem 1.11 hold for $p=1$ ?

The proof of the former uses Hölder at one place (cf. equation (9.1)), and the latter uses Doob’s Maximal Inequality (cf. Lemma 5.1(1)).
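For orientation, the classical form of Doob's Maximal Inequality indicates why the case $p=1$ is delicate. The following is the standard textbook statement for a non-negative $f$ in $L_{p}(\nu)$ with maximal function $f^{\ast}=\sup_{n}\mathbb{E}_{\nu}[f\mid\mathscr{F}_{n}]$ ; it is offered as a sketch, and the exact form used in Lemma 5.1 may differ:

```latex
% Classical Doob inequalities (textbook form, not quoted from Lemma 5.1).
% Strong-type bound, valid for p > 1:
\|f^{\ast}\|_{p} \;\leq\; \frac{p}{p-1}\,\|f\|_{p}
% Weak-type bound, valid for p >= 1 and all \lambda > 0:
\nu\bigl(\{x : f^{\ast}(x) > \lambda\}\bigr) \;\leq\; \frac{1}{\lambda}\,\|f\|_{1}
```

The constant $\frac{p}{p-1}$ is unbounded as $p$ decreases to $1$ , and at $p=1$ only the weak-type bound remains, which controls $f^{\ast}$ in measure but not in $L_{1}(\nu)$ .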

1.4. Relation to previous work

Theorem 1.5 generalises the result of Pathak, Rojas, and Simpson, who show it for the specific case of $p=1$ and $X=[0,1]^{k}$ , $\nu$ being the $k$ -fold product of Lebesgue measure on $[0,1]$ with itself, and with $\mathscr{F}_{n}$ being given by dyadic partitions.³⁵ [52]. They state their result not in terms of $L_{p}(\nu)$ Schnorr tests, but in terms of computable points of $L_{p}(\nu)$ . See §11. Under this guise, Lévy’s Upward Theorem just is the Lebesgue Differentiation Theorem. Their proof goes through Tarski’s decidability results on the first-order theory of the reals, and so seems in certain key steps specific to the reals with Lebesgue measure.³⁶ Such as at [52, Lemma 3.3 p. 339]. However, see Proposition 4.1 below, which generalises rather directly from their setting to the general setting.

In conjunction with the properties of effective disintegrations (cf. §7), one can derive the equivalence of (1)-(2) in Theorem 1.5 from results of Rute.³⁷ In particular, for the (1) to (2) direction of Theorem 1.5, see Rute’s “Effective Levy 0/1 law” [61, Theorem 6.3 p. 31]. Our Proposition 7.5 and Proposition 2.4 imply that if $f$ is an $L_{1}(\nu)$ Schnorr test, then $\mathbb{E}_{\nu}[f\mid\mathscr{F}_{n}]$ is a computable point of $L_{1}(\nu)$ , and so Rute’s Theorem 6.3 applies, once one internalises how to translate back and forth between $L_{p}(\nu)$ Schnorr tests and computable points of $L_{p}(\nu)$ (cf. §11). For the (2) to (1) direction of Theorem 1.5, see Rute’s [61, Example 12.1 p. 31]. As for Schnorr randomness, our work then expands on Rute’s primarily by finding a large class of versions of conditional expectations to which his results apply, and by identifying the information on rates of convergence in Theorem 1.6. More generally, the theory we develop is organised around the elementary concept of an integral Schnorr test, and so we hope might be of value to others by virtue of being accessible.³⁸ In particular, we can avoid appeal to Rute’s theory of a.e. convergence, which is an alternative way to organise effective convergence in $L_{0}(\nu)$ (cf. §2.4). See [61, Proposition 3.15 p. 15] and his Convergence Lemma [61, Lemma 3.19 p. 17].

Further, we are able to strengthen what is, in our view, one of the more foundationally significant parts of Rute’s work. He notes that traditionally “algorithmic randomness is more concerned with success than convergence” and that “only computable randomness has a well-known characterisation in terms of martingale convergence instead of martingale success.”³⁹ [61, p. 7]. He is referring to what is called a “folklore” characterisation of computable randomness on Cantor space with the uniform measure in [15, Theorem 7.1.3 p. 270]. In Cantor space with the uniform measure, Rute has a characterisation of Schnorr randomness in terms of convergence of $L_{2}(\nu)$ martingales.⁴⁰ See items (1), (4) in his Example 1.5, immediately below the preceding quotation. We have been able to generalise this to all computable measures on computable Polish spaces: see Theorem 12.2. This proof follows Rute’s $L_{2}(\nu)$ Hilbert space proof in broad outline. It seems to us that keeping track of the maximal function, which we can then use in DCT arguments, has been helpful here.

In the setting of Cantor space with the uniform measure and the natural filtration of length $n$ -strings and the natural disintegration (cf. Example B.1), our Theorem 1.8 was already known for $p=1$ and hence all computable $p\geq 1$ . This result is in a paper of Miyabe, Nies, and Zhang, where it is attributed to the Madison group of Andrews, Cai, Diamondstone, Lempp and Miller.⁴¹ [47, Theorem 3.3 p. 312]. Their proof is a little more general, in that it just concerns martingale convergence rather than martingales associated to random variables. Their proof, in the Cantor space setting, also can be modified to give not only convergence but convergence to the truth for random variables. Their argument goes through an auxiliary test notion of Madison test. While we only have it for computable $p>1$ , our proof of Theorem 1.8 goes through first principles about density randomness and effective disintegrations. It is not presently clear to us whether the Cantor space proof using Madison tests can be generalised to arbitrary computable probability measures on computable Polish spaces equipped with effective disintegrations.⁴² As a final remark about the previous literature, we should mention that Lévy’s Upward Theorem has also been studied in the context of Shafer and Vovk’s game-theoretic probability ([67], [66, Chapter 8]). Their approach conceives of martingales primarily as game-theoretic strategies, and does not treat computational matters explicitly. By contrast, here we are focusing on the martingales $\mathbb{E}_{\nu}[f\mid\mathscr{F}_{n}]$ and on effective properties of them conceived of as sequences of random variables. Discerning the relation between our approach and their approach would involve, as a first step, carefully going through their approach and ascertaining the exact levels of effectivity needed to secure their results, and secondly translating back and forth between the strategy and random variable paradigms.

We hope our efforts bring this prior important work on algorithmic randomness to the attention of a broader audience. Bayesianism is an important and increasingly dominant framework in a variety of disciplines, and this prior work and our work hopefully make vivid the way in which computability theory and algorithmic randomness bear directly on the question of when and how fast Bayesian inductive methods converge to the truth. As its proof makes clear, the non-computable rate of convergence to the truth in Theorem 1.9 is another presentation of the halting set, and so this proof gives the theory of computation a central role in a limitative theorem of inductive inference, similar to its central role in the great limitative theorems of deductive inference like the Incompleteness Theorems. Further, the existence of Schnorr random worlds that are computably dominated, as in Theorem 1.6(2), shows that a central kind of randomness is entirely compatible with there being effective ways of determining how close we are to the truth. This optimistic inductive possibility is not one that would be visible in the absence of recent work in algorithmic randomness.

Internal to the discussion about Bayesianism within philosophy, authors such as Belot have voiced the concern that the classical theory only tells us that worlds at which we fail to converge to the truth have probability zero, but otherwise tells us little about when and where the failure happens.⁴³ [2]. From the perspective of Theorems 1.5, 1.8, 1.11, the probability zero event of non-convergence is not arbitrary, so long as one is insisting on convergence along a broad enough class of effective random variables. Namely, the sequences along which convergence to the truth fails for some element of this class are exactly those that are not random with respect to the underlying computable prior probability measure. In other words, those sequences can be determined by effective means to be atypical from the agent’s point of view.

Finally, we should emphasise that ours is not the only perspective on conditional expectations and their effectivity that one could adopt. In focusing on disintegrations, we are presupposing a framework where pointwise there is a single “formula” for the conditional expectation, namely the one displayed in equation (1.1) (and again see Appendices A-B for examples). Likewise, the effectivity constraints in Definition 1.3 have the consequence that the conditional expectation operator is a continuous computable function (cf. Proposition 7.5), and so sends computable points to computable points (cf. Proposition 2.4). Both of these presuppositions constrain the applicability of our framework. For instance, Rao points out that conditional expectations are used throughout econometrics, but there one often uses the Dynkin-Doob Lemma as definitional of the conditional expectation,⁴⁴ [55, 376]. For an example, see the presentation of conditional expectation in [20, Chapter 7]. and there is no more hope of having a single formula come out of it than there is of having all variables expressible in linear terms of one another. Likewise, conditional expectations and martingales can be used to prove theorems like the Radon-Nikodym Theorem,⁴⁵ [75, p. 145-146]. which is “computably false” in that there are computable absolutely continuous probability measures with no computable Radon-Nikodym derivative.⁴⁶ [70, p. 396], [77], [31]. That, of course, is not to say that these are not of interest or that determining how non-effective they are is not of interest, but just to say they will not be available in a framework like ours where we restrict to computable continuous conditional expectation operators.⁴⁷ The paper Ackerman et al. [1] is an important recent paper studying how non-effective, in general, it is to have disintegrations. Our Definition 1.3, by contrast, restricts attention to those disintegrations that are highly effective. This will not be all of them, and we do not claim that it would be all of the interesting ones.

1.5. Outline of paper

The paper is organised as follows. In §2, we begin with a brief discussion of some aspects of effectively closed sets and computable continuous functions and lsc functions which we need for our proofs, and then we go over relevant aspects of the three computable Polish spaces which are central for effective probability theory:

–

The computable Polish space $\mathcal{M}^{+}(X)$ of non-negative finite Borel measures on $X$ and its computable Polish subspace $\mathcal{P}(X)$ of probability measures.
–

For each computable $\nu$ in $\mathcal{M}^{+}(X)$ and each computable $p\geq 1$ , the computable Polish space $L_{p}(\nu)$ .
–

For each computable $\nu$ in $\mathcal{M}^{+}(X)$ the computable Polish space $L_{0}(\nu)$ of Borel measurable functions which are finite $\nu$ -a.s. and whose topology is given by convergence in measure.

The space $L_{0}(\nu)$ is needed since when $f$ is in $L_{p}(\nu)$ , the maximal function $f^{\ast}=\sup_{n}\mathbb{E}_{\nu}[f\mid\mathscr{F}_{n}]$ is in $L_{0}(\nu)$ , and is guaranteed to be in $L_{p}(\nu)$ iff $p>1$ . In addition to being needed in the proofs of the main theorems, the material in §2 also can serve to contextualise many components of Definition 1.1. For instance, we mention in §2.2 a result of Hoyrup-Rojas that an element of $\mathcal{P}(X)$ is computable in the sense of Definition 1.1(3) iff it is computable as an element of the Polish space $\mathcal{P}(X)$ . Likewise, in §2.3 we mention a result saying that an $L_{p}(\nu)$ Schnorr test is simply a non-negative lsc function whose equivalence class is a computable element of $L_{p}(\nu)$ (cf. Proposition 2.16). Finally, towards the close of §2.4, we define an $L_{0}(\nu)$ Schnorr test and prove a new characterisation of $\mathsf{SR}^{\nu}$ in terms of these tests (cf. Definition 2.28, Proposition 2.29).

In §3 we present two lemmas on Schnorr randomness. The second of these, called the Self-location Lemma (3.3) is a distinctive feature of Schnorr randomness (vis-à-vis the other algorithmic randomness notions), and is central to our proof of Theorem 1.6. In §4 we present some results on recovering the pointwise values of effective random variables on $\mathsf{SR}^{\nu}$ . In §5, we review various classical features of the maximal function which we shall need later. In §6 we present an abstract treatment of Theorem 1.5 in terms of various effective constraints that a version of the conditional expectation may satisfy. In §7 we develop the fundamental properties of effective disintegrations. In §8, we prove Theorems 1.5-1.6, and in §9 we prove Theorems 1.8-1.9 and in §10 we prove Theorem 1.11. In §11, we show how Miyabe’s translation method allows us to recast Theorem 1.5 in terms of computable points of $L_{p}(\nu)$ . In §12, we develop the theory of martingales in $L_{2}(\nu)$ and prove the aforementioned generalisation of Rute’s result characterising Schnorr randomness in terms of martingale convergence. In Appendix A we briefly exposit two classical examples of disintegrations, and in Appendix B we present several examples of effective disintegrations.

In a sequel to this paper, we present a similar analysis of the Blackwell-Dubins Theorem,⁴⁸ [5] which is also a “convergence to the truth” result, but wherein the pair “agent and world” is replaced with a pair of agents whose credences are variously absolutely continuous with respect to one another.

2. Computable Polish spaces for effective probability theory

2.1. Effectively closed, computable continuous, and lsc

In this section, we briefly describe two further concepts from the theory of computable Polish spaces: namely effectively closed subsets and computable continuous functions, and we close by mentioning a few brief aspects of lsc functions.

Before we do that, we mention one elementary proposition on computable Polish spaces which is worth having in hand (e.g. for the Self-location Lemma 3.3):

Proposition 2.1.

Let $X$ be a computable Polish space with metric $d$ and countable dense set $x_{0},x_{1},\ldots$ . Suppose the map $i\mapsto n(i)$ is such that $x_{n(i)}\rightarrow x$ fast. Then

(1)

The set $\{(j,q)\in\mathbb{N}\times\mathbb{Q}^{>0}:x\in B(x_{j},q)\}$ is c.e. in the graph of $i\mapsto n(i)$ .
(2)

The point $x$ is computable iff the set $\{(j,q)\in\mathbb{N}\times\mathbb{Q}^{>0}:x\in B(x_{j},q)\}$ is c.e.

Proof.

For (1), since the distances between the points of the countable dense set are uniformly computable, they are also uniformly right-c.e. Hence, it suffices to note that $d(x_{j},x)<q$ iff there is $i\geq 0$ with $d(x_{j},x_{n(i)})<q-2^{-i}$ .

For (2), if $x$ is computable, then we can choose $i\mapsto n(i)$ computable, and then are done by (1). Conversely, if the set is c.e., given $i\geq 0$ , enumerate it until one finds a pair $(j,q)$ with $q\leq 2^{-i}$ , and set $n(i)=j$ . ∎
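The equivalence used for (1) can be read as a search procedure. The following is a minimal sketch, assuming $X$ is the real line with the rationals as countable dense set and $x=\sqrt{2}$ ; the names `sqrt2_name` and `confirm_in_ball` and the stage budget `max_stage` are illustrative and not from the text.

```python
from fractions import Fraction
from math import isqrt

def sqrt2_name(i):
    """Rational within 2^-i of sqrt(2): floor(sqrt(2) * 2^i) / 2^i."""
    return Fraction(isqrt(2 * 4 ** i), 2 ** i)

def confirm_in_ball(name, center, q, max_stage=64):
    """Search for a stage i with d(center, name(i)) < q - 2^-i.

    Returns True once such an i is found, which certifies d(center, x) < q.
    Returns False if no such i is found within the budget. That outcome is
    not a proof that x lies outside the ball, since the set is only c.e.
    """
    for i in range(max_stage):
        if abs(center - name(i)) < q - Fraction(1, 2 ** i):
            return True
    return False

# sqrt(2) lies in B(3/2, 1/10) but not in B(3/2, 1/20).
inside = confirm_in_ball(sqrt2_name, Fraction(3, 2), Fraction(1, 10))
unconfirmed = confirm_in_ball(sqrt2_name, Fraction(3, 2), Fraction(1, 20))
```

The asymmetry between the two outcomes reflects that membership in the ball is enumerable, while non-membership is in general not confirmed at any finite stage.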

As mentioned in §1, the complement of a c.e. open set is called an effectively closed set. In Cantor space and Baire space, the effectively closed sets can be represented as paths through computable trees.⁴⁹ [9, p. 41]. The following provides a simple example on the real line of a classically closed set which is not effectively closed:

Example 2.2.

Suppose that $c<d$ and $c$ is right-c.e. and $d$ is left-c.e. but neither of $c,d$ is computable. Then $[c,d]$ is a computable Polish space. Further, $[c,d]$ is a classically closed subset of the reals which is not an effectively closed subset of the reals.

It is a computable Polish space since its countable dense set $(c,d)\cap\mathbb{Q}$ is c.e., being the intersection of the right Dedekind cut of $c$ and the left Dedekind cut of $d$ . If $[c,d]$ were effectively closed in the reals then $U=(-\infty,c)\cup(d,\infty)$ would be c.e. open in the reals. Then by choosing a rational $r$ in $(c,d)$ , one has that $\{q\in\mathbb{Q}:q<c\}=\{q\in U\cap\mathbb{Q}:q<r\}$ is c.e., contrary to hypothesis.

Effectively closed subsets of a computable Polish space need not themselves have the structure of a computable Polish space, since one in addition needs to produce an enumeration of a countable dense set where the distance between the points is uniformly computable.⁵⁰ By contrast, classically, the Polish subspaces of a Polish space are precisely the $G_{\delta}$ subsets. See [34, p. 17]. Hence, we define: a computable Polish subspace $Y$ of $X$ is given by an effectively closed subset $Y$ of $X$ and a countable sequence of points $y_{0},y_{1},\ldots$ which are uniformly computable points of $X$ and which are dense in $Y$ . One can check that the c.e. opens relative to $Y$ are just the c.e. opens of the space $X$ intersected with $Y$ , and further any effectively closed subset of $Y$ is also an effectively closed subset of $X$ . Similarly, one can check that a computable point of $Y$ is just a computable point of $X$ which happens to be in $Y$ . As a simple example of a Polish subspace which is not a computable Polish subspace, one has:

Example 2.3.

Suppose that $a<b$ and $a$ is left-c.e. and $b$ is right-c.e. but neither of $a,b$ is computable. Then the closed interval $[a,b]$ is an effectively closed subset of the reals which is not a computable Polish subspace of the reals.

It is effectively closed since $a$ being left-c.e. and $b$ being right-c.e. implies that $(-\infty,a)$ and $(b,\infty)$ are c.e. open. And if $[a,b]$ were a computable Polish subspace of the reals, and if $y_{0},y_{1},\ldots$ were a sequence of uniformly computable reals dense in $[a,b]$ , then for a rational $q$ we would have $a<q$ iff there is $i$ such that $y_{i}<q$ , which is a c.e. condition and so $a$ would be right-c.e. and thus computable.

An effectively closed set $C$ is computably compact if there is a partial computable procedure which, when given an index for a computable sequence of c.e. opens $U_{0},U_{1},\ldots$ in $X$ which covers $C$ , returns a natural number $n\geq 0$ such that $U_{0},\ldots,U_{n}$ covers $C$ . We further say that $C$ is strongly computably compact if there is a partial computable procedure which, when given an index for a computable sequence of c.e. opens $U_{0},U_{1},\ldots$ in $X$ halts iff this is a cover of $C$ , and when it halts returns a natural number $n\geq 0$ such that $U_{0},\ldots,U_{n}$ covers $C$ . If $X$ itself is strongly computably compact, then so are all of its effectively closed sets. If $c<d$ is computable, then $[c,d]$ is strongly computably compact. Likewise, Cantor space is strongly computably compact, and if $f:\mathbb{N}\rightarrow\mathbb{N}$ is computable then the computably bounded set $\{\omega\in\mathbb{N}^{\mathbb{N}}:\forall\;n\;\omega(n)\leq f(n)\}$ is a computable Polish subspace of Baire space which is strongly computably compact. Example 2.2 is an example of a compact computable Polish space which is not computably compact, since if it were then we could compute the endpoints using maxs and mins of the centres of finite coverings with fast decreasing radii. For another example of a compact computable Polish space which is not computably compact, one can take the paths through a computable subtree of Baire space which is not computably bounded.⁵¹ See [10, Example 2.1.5 p. 59].

If $X,Y$ are two computable Polish spaces, then a function $f:X\rightarrow Y$ is computable continuous if inverse images of c.e. opens are uniformly c.e. open.⁵² See Moschovakis [48, 110]. Simpson [70, Exercise II.6.9 p. 88] notes that it is equivalent to his preferred definition at [70, 85]. The following characterisation usefully parameterises each continuous computable function by a single c.e. set, where it is assumed for the sake of simplicity that both countable dense sets are identified with the natural numbers:⁵³ This is from [26, 1169]. It can be seen as a simplification of Simpson’s definition in [70, 85].

–

A function $f:X\rightarrow Y$ is computable continuous iff there is a c.e. set $I\subseteq\mathbb{N}\times\mathbb{Q}^{>0}\times\mathbb{N}\times\mathbb{Q}^{>0}$ such that both (i) if $(i,p,j,q)$ is in $I$ then $B(i,p)\subseteq f^{-1}(B(j,q))$ and (ii) for all $x$ in $X$ and all $\epsilon>0$ there is $(i,p,j,q)$ in $I$ with $x$ in $B(i,p)$ and $q<\epsilon$ .

Computable continuous maps are also computable continuous when restricted to computable Polish subspaces. The computable continuous maps preserve computability of points:

Proposition 2.4.

If $f:X\rightarrow Y$ is computable continuous and $x$ in $X$ is computable, then $f(x)$ in $Y$ is computable.

Proof.

Suppose that $x$ is computable. Then $\{(i,q)\in\mathbb{N}\times\mathbb{Q}^{>0}:d(i,x)<q\}$ is c.e. by Proposition 2.1 (1). For each $n\geq 0$ , by (ii) above, search in $I$ for a tuple $(i_{n},p_{n},j_{n},q_{n})$ with $x$ in $B(i_{n},p_{n})$ and $q_{n}<2^{-n}$ . Then by (i) above, $f(x)$ is in $f(B(i_{n},p_{n}))\subseteq B(j_{n},q_{n})$ and so $j_{n}\rightarrow f(x)$ fast. ∎

This proposition is important because many arguments for the computability of points in effective analysis and probability can be seen as the result of applying computable continuous functions to computable points.

There is a partial converse to the previous proposition in the uniformly continuous setting. A computable modulus of uniform continuity for a uniformly continuous function $f:X\rightarrow Y$ is a computable function $m:\mathbb{Q}^{>0}\rightarrow\mathbb{Q}^{>0}$ such that $d(x,x^{\prime})<m(\epsilon)$ implies $d(f(x),f(x^{\prime}))<\epsilon$ for all $\epsilon$ in $\mathbb{Q}^{>0}$ . For instance, if $c>0$ is rational, then a $c$ -Lipschitz function is just a function with linear modulus of uniform continuity $m(\epsilon)=\frac{\epsilon}{c}$ , since $d(x,x^{\prime})<\frac{\epsilon}{c}$ then gives $d(f(x),f(x^{\prime}))\leq c\cdot d(x,x^{\prime})<\epsilon$ . The partial converse to Proposition 2.4 is the following:

Proposition 2.5.

Suppose $X,Y$ are computable Polish spaces and that $f:X\rightarrow Y$ has a computable modulus of uniform continuity. Suppose that the image of the countable dense set in $X$ under $f$ is uniformly computable in $Y$ . Then $f$ is computable continuous.

Proof.

Suppose $x_{n}$ is the countable dense set in $X$ . Suppose that $m:\mathbb{Q}^{>0}\rightarrow\mathbb{Q}^{>0}$ is the computable modulus of uniform continuity. Suppose that $y_{n,i}\rightarrow f(x_{n})$ fast, where $y_{n,i}$ is a uniformly computable sequence from the countable dense set in $Y$ . Then define the c.e. set $I=\{(x_{n},m(2^{-i}),y_{n,i},2^{-i+1}):n,i\geq 0\}$ . First we show that $B(x_{n},m(2^{-i}))\subseteq f^{-1}(B(y_{n,i},2^{-i+1}))$ . For, suppose that $x$ is in $B(x_{n},m(2^{-i}))$ . Then $d(x,x_{n})<m(2^{-i})$ . Then $d(f(x),f(x_{n}))<2^{-i}$ . Further since $y_{n,i}\rightarrow f(x_{n})$ fast, we have $d(f(x_{n}),y_{n,i})\leq 2^{-i}$ , from which we obtain $d(f(x),y_{n,i})<2^{-i+1}$ by triangle inequality. Second suppose that $x$ is in $X$ and $\epsilon>0$ . Let $i\geq 0$ be such that $2^{-i+1}<\epsilon$ . Since $x_{n}$ is an enumeration of the countable dense set, there is $x_{n}$ such that $d(x,x_{n})<m(2^{-i})$ . Then $x$ is in $B(x_{n},m(2^{-i}))$ , and the tuple $(x_{n},m(2^{-i}),y_{n,i},2^{-i+1})$ is in $I$ and $2^{-i+1}<\epsilon$ . ∎
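The construction of $I$ in this proof, together with the search from the proof of Proposition 2.4, can be sketched as follows. This is a minimal illustration under simplifying assumptions: $X=Y$ is the real line with the rationals as dense set, $f(x)=\frac{x}{2}+1$ is a hypothetical example whose values on rationals are exactly rational (so one can take $y_{n,i}=f(x_{n})$ ), and $m(\epsilon)=\epsilon$ serves as its modulus. All function names are illustrative.

```python
from fractions import Fraction
from math import isqrt

def f_dense(r):
    """Hypothetical example f(x) = x/2 + 1, evaluated exactly on rationals."""
    return r / 2 + 1

def modulus(eps):
    """|x - x'| < eps implies |f(x) - f(x')| = |x - x'| / 2 < eps."""
    return eps

def tuple_in_I(center, i):
    """The tuple (x_n, m(2^-i), y_{n,i}, 2^{-i+1}) from the definition of I."""
    eps = Fraction(1, 2 ** i)
    return (center, modulus(eps), f_dense(center), 2 * eps)

def sqrt2_name(s):
    """Fast Cauchy name of sqrt(2): a rational within 2^-s of it."""
    return Fraction(isqrt(2 * 4 ** s), 2 ** s)

def evaluate(name, n):
    """Return a rational within 2^-n of f(x), where x is given by `name`."""
    i = n + 2                        # output radius 2^{-i+1} is below 2^-n
    radius = modulus(Fraction(1, 2 ** i))
    s = 0
    while Fraction(1, 2 ** s) >= radius:
        s += 1                       # now d(x, name(s)) <= 2^-s < radius
    center, p, y, q = tuple_in_I(name(s), i)
    return y                         # f(x) is in B(y, q) and q < 2^-n

approx = evaluate(sqrt2_name, 10)
```

Here the search in $I$ is short-circuited by taking the centre to be a term of the fast Cauchy name of $x$ , which is one way to guarantee that $x$ lies in the chosen ball.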

This proposition is widely applicable in our context since many operators in functional analysis are uniformly continuous, and since one often in practice has good control over what happens with the countable dense set (see the proofs of Proposition 2.16 and Proposition 2.21 for representative examples).

Finally, recall the core notion of lsc from Definition 1.1(1), which is the effectivization of the classical notion of lower semi-continuous. This class of functions has some paradigmatic examples and useful closure conditions which we briefly enumerate without proof:

Proposition 2.6.

Constant functions that are left-c.e. reals are lsc. Indicator functions of c.e. opens are lsc.

Lsc functions are closed under addition, maxs and mins. Non-negative lsc functions are closed under multiplication.

Sups of uniformly lsc functions are lsc. Infinite sums of non-negative lsc functions are lsc. Compositions of lsc functions with computable continuous functions are lsc.

Since $f$ is lsc iff $-f$ is usc, one can use this proposition to obtain examples and closure conditions for usc functions as well.

The analogue of Proposition 2.4 for lsc functions is that they send computable points to left-c.e. reals.

2.2. The space of probability measures

If $X$ is a Polish space, then the space of real-valued finite signed Borel measures on $X$ is written as $\mathcal{M}(X)$ . Recall that the weak$^{\ast}$-topology on $\mathcal{M}(X)$ is the smallest topology such that all the linear maps $\nu\mapsto\int_{X}f\;d\nu$ are continuous, where $f$ ranges over bounded continuous functions on the space.⁵⁴ Or, equivalently, as $f$ ranges over all bounded uniformly continuous functions on the space ([34, 110]). Unless the space $X$ is finite, the weak$^{\ast}$-topology on $\mathcal{M}(X)$ is not metrizable.⁵⁵ [6, 17, 102] However, when $X$ is a Polish space, the space $\mathcal{P}(X)$ of all probability Borel measures on $X$ with the weak$^{\ast}$-topology is a Polish space, as is the space $\mathcal{M}^{+}(X)$ of all finite non-negative Borel measures on $X$ .⁵⁶ [34, §17.E pp. 109 ff]. Convergence in $\mathcal{P}(X)$ is characterised by the Portmanteau Theorem.⁵⁷ [34, Theorem 17.20 p. 111], [4, Theorem 2.1 p. 16].

A natural countable dense set on the spaces $\mathcal{P}(X)$ and $\mathcal{M}^{+}(X)$ is given by the finite averages of Dirac measures associated to points from the countable dense set on $X$, with rational values for the weights. These spaces can be completely metrized by the metric of Prohorov. However, when working on $\mathcal{P}(X)$, it is often more useful to work with the Wasserstein metric, and further, when the metric on $X$ is unbounded, it is more convenient to work with the Kantorovich-Rubinshtein metric on $\mathcal{P}(X)$ (see [6, 104, 111]):

d_{KR}(\nu,\mu)=\sup\{\left|\mathbb{E}_{\nu}f-\mathbb{E}_{\mu}f\right|\;:f\mbox{ is $1$-Lipschitz}\;\&\;\|f\|_{\infty}\leq 1\}

In this, $\|f\|_{\infty}=\sup_{x\in X}\left|f(x)\right|$. Hoyrup and Rojas prove that $\mathcal{P}(X)$ with the metric of Prohorov or of Wasserstein is a computable Polish space, and their proof extends naturally to the Kantorovich-Rubinshtein metric. Likewise, their proof shows that $\mathcal{M}^{+}(X)$ is a computable Polish space and has $\mathcal{P}(X)$ as a computable Polish subspace ([29, 49], [30, 838]). Further, Hoyrup and Rojas characterise the computable points in $\mathcal{P}(X)$ as follows, and their proof extends naturally to $\mathcal{M}^{+}(X)$ ([29, 52], [30, 839]):

Proposition 2.7.

A point $\nu$ in $\mathcal{M}^{+}(X)$ is computable iff $\nu(X)$ is computable and $\nu(U)$ is uniformly left-c.e. for c.e. opens $U$ in $X$ .

This proposition helps motivate Definition 1.1(3).

To illustrate the utility of this proposition, consider $[c,d]$ from Example 2.2. This proposition implies that Lebesgue measure $m$ on $[c,d]$ is not a computable point of $\mathcal{M}^{+}(X)$ since $m([c,d])=d-c$ is left-c.e. but not computable. Likewise, $\frac{1}{d-c}m$ is not a computable point of $\mathcal{P}(X)$ since one can choose rationals $q,\epsilon$ such that $(q-\epsilon,q+\epsilon)\subseteq(c,d)$, and then one has that $\frac{1}{d-c}m((q-\epsilon,q+\epsilon))=\frac{2\epsilon}{d-c}$ is right-c.e. but not computable.

Recall the notions of a computable basis and a measure computable basis from Definition 1.1(9)-(10). Hoyrup and Rojas use an effective version of the Baire Category Theorem to prove that every computable point $\nu$ of $\mathcal{M}^{+}(X)$ has a $\nu$-computable basis. Moreover, the basis can be taken to be open balls $B(i,r_{j})$ with centres $i$ from the countable dense set and with radii given by a dense computable sequence $r_{j}$ of non-zero reals, with the closed balls $B[i,r_{j}]$ being the corresponding effectively closed supersets (see [30, Corollary 5.2.1 p. 844], [29, Theorem 2.2.1.2 p. 60]). Rute also employs this result of Hoyrup and Rojas ([61, pp. 13-14]), although he leaves out from the definition of a measure computable basis the pairing of each basis element $U$ with an effectively closed superset $C$ of the same measure. Hoyrup and Rojas include this pairing, but further require that $U\cup(X\setminus C)$ is dense (cf. [30, Definition 5.1.2 p. 842], [29, Definition 2.2.1.2 p. 58]).

The following proposition summarises some basic properties of $\nu$ -computable bases:

Proposition 2.8.

Suppose that $\nu$ is a computable point of $\mathcal{M}^{+}(X)$ .

(1)

Elements of the algebra generated by a $\nu$-computable basis uniformly have $\nu$-computable measure. Indeed, this holds for all sequences $B_{0},B_{1},\ldots$ of events such that finite unions of them have uniformly $\nu$-computable measure.
(2)

If a computable sequence of c.e. opens with uniformly computable $\nu$ -measure is added to a $\nu$ -computable basis, then finite unions from the resulting sequence have uniformly computable $\nu$ -measure, as do elements from the algebra generated by the resulting sequence. Indeed, this holds for all computable bases such that finite unions of them have uniformly $\nu$ -computable measure.
(3)

If a computable sequence of c.e. opens with uniformly computable $\nu$ -measure and with uniformly effectively closed supersets of the same $\nu$ -measure is added to a $\nu$ -computable basis, then the result is a $\nu$ -computable basis.
(4)

The $\nu$ -computable bases are closed under effective union.

Proof.

For (1), suppose that $B_{0},B_{1},\ldots$ is a sequence of events such that finite unions of them have uniformly $\nu$ -computable measure.

First note that finite intersections $B_{i_{0}}\cap\cdots\cap B_{i_{n}}$ have uniformly computable $\nu$-measure: this is an induction on $n\geq 1$, and for the induction step, use inclusion-exclusion solved for the full intersection, namely $\nu(B_{i_{0}}\cap\cdots\cap B_{i_{n}})=(-1)^{n}\big(\nu(B_{i_{0}}\cup\cdots\cup B_{i_{n}})-\sum_{\emptyset\neq J\subsetneq\{0,\ldots,n\}}(-1)^{\left|J\right|-1}\nu(\bigcap_{j\in J}B_{i_{j}})\big)$. For $n=1$ this is the familiar identity $\nu(B_{i_{0}}\cap B_{i_{1}})=\nu(B_{i_{0}})+\nu(B_{i_{1}})-\nu(B_{i_{0}}\cup B_{i_{1}})$.

Likewise, finite unions $A_{1}\cup\cdots\cup A_{m}$ of finite intersections $A_{j}=\bigcap_{k=1}^{\ell_{j}}B_{i_{j,k}}$ of members of $B_{i}$ have uniformly computable $\nu$ -measure: this is an induction on $m\geq 1$ , and for the induction step, use distribution $\nu(A_{1}\cup\cdots\cup A_{m+1})=\nu(A_{1}\cup\cdots\cup A_{m})+\nu(A_{m+1})-\nu((A_{1}\cap A_{m+1})\cup\cdots\cup(A_{m}\cap A_{m+1}))$ .

Finally, note that finite intersections of members of $B_{i}$ and complements of members of $B_{i}$ have uniformly computable $\nu$ -measure. This follows from the previous steps, the elementary identity $\nu(C\setminus D)=\nu(C)-\nu(C\cap D)$ and distribution as follows when $n>0$ : $\nu(B_{i_{0}}\cap\cdots\cap B_{i_{n-1}}\cap(X\setminus B_{i_{n}})\cap\cdots\cap(X\setminus B_{i_{n+m-1}}))=\nu(B_{i_{0}}\cap\cdots\cap B_{i_{n-1}})-\nu(B_{i_{0}}\cap\cdots\cap B_{i_{n-1}}\cap(B_{i_{n}}\cup\cdots\cup B_{i_{n+m-1}}))=\nu(B_{i_{0}}\cap\cdots\cap B_{i_{n-1}})-\nu(\bigcup_{j=n}^{n+m-1}B_{i_{0}}\cap\cdots\cap B_{i_{n-1}}\cap B_{i_{j}})$ , which is uniformly computable by the two previous paragraphs. If $n=0$ then note that $\nu((X\setminus B_{i_{n}})\cap\cdots\cap(X\setminus B_{i_{n+m-1}}))=\nu(X\cap(X\setminus B_{i_{n}})\cap\cdots\cap(X\setminus B_{i_{n+m-1}}))$ , and since $\nu(X)$ is computable, we can argue as in the case of $n=1$ with $X$ playing the role of $B_{i_{0}}$ .

For (2), suppose that $U_{0},U_{1},\ldots$ is a computable sequence of c.e. opens such that $\nu(U_{0}),\nu(U_{1}),\ldots$ is uniformly computable. Suppose that $B_{0},B_{1},\ldots$ is a computable basis such that finite unions of them have uniformly $\nu$-computable measure. We must show that finite unions from $B_{0},B_{1},\ldots,U_{0},U_{1},\ldots$ have uniformly $\nu$-computable measure. It suffices to consider the case where $U_{0},U_{1},\ldots$ just consists of a single c.e. open $U_{i}$, since by induction and (1) we may assume that the $U_{0},\ldots,U_{i-1}$ are already among the $B_{0},B_{1},\ldots$. Since $B_{0},B_{1},\ldots$ is a computable basis, write $U_{i}=\bigcup_{j}B_{m(j)}$, where $m$ is a computable function. Then $\nu(B_{1}\cup\cdots\cup B_{n}\cup U_{i})=\nu(\bigcup_{j}B_{1}\cup\cdots\cup B_{n}\cup B_{m(j)})=\lim_{k}\nu(\bigcup_{j<k}B_{1}\cup\cdots\cup B_{n}\cup B_{m(j)})$. Since this limit is increasing, and $\nu(\bigcup_{j<k}B_{1}\cup\cdots\cup B_{n}\cup B_{m(j)})$ is uniformly computable, we have that $\nu(B_{1}\cup\cdots\cup B_{n}\cup U_{i})$ is left-c.e. Similarly, $\nu((B_{1}\cup\cdots\cup B_{n})\cap U_{i})=\nu(\bigcup_{j}(B_{1}\cup\cdots\cup B_{n})\cap B_{m(j)})=\lim_{k}\nu(\bigcup_{j<k}(B_{1}\cup\cdots\cup B_{n})\cap B_{m(j)})$. Since this limit is increasing, and $\nu(\bigcup_{j<k}(B_{1}\cup\cdots\cup B_{n})\cap B_{m(j)})$ is uniformly computable by (1), we have that $\nu((B_{1}\cup\cdots\cup B_{n})\cap U_{i})$ is left-c.e. Then $\nu(B_{1}\cup\cdots\cup B_{n}\cup U_{i})=\nu(B_{1}\cup\cdots\cup B_{n})+\nu(U_{i})-\nu((B_{1}\cup\cdots\cup B_{n})\cap U_{i})$ is also right-c.e. and hence computable.

Finally, (3) follows from (2) and the definition of a $\nu$ -computable basis; and (4) follows directly from the uniformity in the proof of (3). ∎
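The first step in the proof of (1) can be illustrated on a toy example. The following is a minimal sketch, assuming a finite space with a rational-weighted counting measure in place of $\nu$ (the names `union_measure` and `intersection_measure` are hypothetical and not from the source). It recovers the measure of a finite intersection using only measures of finite unions, via inclusion-exclusion solved for the full intersection with the sign $(-1)^{n}$.

```python
from fractions import Fraction
from itertools import combinations

def union_measure(sets, weight):
    """Measure of a finite union under a toy weighted counting measure."""
    points = set().union(*sets)
    return sum(weight[x] for x in points)

def intersection_measure(sets, weight):
    """Measure of B_0 & ... & B_n, computed only from measures of unions.

    Inclusion-exclusion, solved for the full intersection:
      nu(cap) = (-1)^n * (nu(cup) - sum over proper nonempty J of
                (-1)^(|J|-1) * nu(cap_{j in J} B_j)),
    recursing on the strictly smaller intersections.
    """
    k = len(sets)  # k = n + 1 sets
    if k == 1:
        return union_measure(sets, weight)
    total = union_measure(sets, weight)
    for size in range(1, k):
        sign = (-1) ** (size - 1)
        for J in combinations(range(k), size):
            total -= sign * intersection_measure([sets[j] for j in J], weight)
    return (-1) ** (k - 1) * total

# Toy data (an assumption for illustration only).
weight = {0: Fraction(1, 2), 1: Fraction(1, 4), 2: Fraction(1, 8), 3: Fraction(1, 8)}
B = [{0, 1, 2}, {1, 2, 3}, {0, 2, 3}]
```

Here the three sets meet only in the point `2`, so the recursion should return its weight; the point of the sketch is that no intersection is ever measured directly.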

Many of the canonical computable bases are measure computable bases:

Example 2.9.

If a computable basis on $X$ consists of sets which are also uniformly effectively closed, then the basis is $\nu$ -computable for any computable point $\nu$ of $\mathcal{M}^{+}(X)$ . This point applies to the canonical computable basis of clopens on Baire space or Cantor space.

Example 2.10.

If a computable basis on $X$ consists of c.e. open sets $U$ such that $\overline{U}$ is uniformly effectively closed and $\overline{U}\setminus U$ is finite, then the basis is $\nu$-computable for any computable atomless $\nu$ in $\mathcal{M}^{+}(X)$. This point applies to the canonical atomless measures on $[a,b]$ for $a<b$ computable.

Here is an example of a computable basis that is not a measure computable basis:

Example 2.11.

Let $f:\mathbb{N}\rightarrow\mathbb{N}\setminus\{0\}$ be a computable injective function whose range is c.e. but not computable, so that $b=\sum_{i}2^{-f(i)}<1$ is left-c.e. but not computable. Let $q_{i}=1-2^{-(i+1)}$, which converges upwards to one, starting from $\frac{1}{2}$. Define a computable point $\nu$ of $\mathcal{P}([0,1])$ by $\nu=(\sum_{i}2^{-f(i)}\cdot\delta_{q_{i}})+(1-b)\cdot\delta_{1}$. A computable basis for $[0,1]$ is given by $(p,q)\cap[0,1]$ where $p<q$ are rationals. But this is not a $\nu$-computable basis since $\nu(0,1)=b$ is left-c.e. but not computable.
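The left-c.e. character of $\nu(0,1)$ in Example 2.11 can be seen in a toy computation. The sketch below is illustrative only: the stand-in `f` is a hypothetical computable injection whose range is in fact computable, so here the limit $b$ is the computable number $\frac{1}{3}$. In the example proper, the same increasing lower bounds exist, but no computable estimate of the remaining mass is available.

```python
from fractions import Fraction

def f(i):
    # Hypothetical stand-in: a computable injection into the positive integers.
    # Its range (the even numbers >= 2) IS computable, unlike in Example 2.11.
    return 2 * (i + 1)

def q(i):
    """Location q_i = 1 - 2^-(i+1) of the i-th atom."""
    return 1 - Fraction(1, 2 ** (i + 1))

def lower_bound_open_unit_interval(stage):
    """Mass of the atoms q_0, ..., q_{stage-1}: a lower bound on nu((0,1))."""
    return sum((Fraction(1, 2 ** f(i)) for i in range(stage)), Fraction(0))
```

Each stage adds the mass $2^{-f(i)}$ of one more atom, so the bounds increase to $b$ from below.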

To illustrate the utility of measure computable bases, consider the following approximation method. In its proof, we use the standard notation $W_{e}$ for the $e$-th c.e. set, and we use $W_{e,s}$ for the elements of $W_{e}$ which are enumerated by stage $s$ in the canonical enumeration (see [71, pp. 17-18, 47]).

Proposition 2.12.

Suppose $\nu$ is a computable point of $\mathcal{M}^{+}(X)$ .

From a rational $\epsilon>0$ and an index for a c.e. open $U$ with $\nu(U)$ computable, one can uniformly compute an index for an effectively closed set $C\subseteq U$ and an index for $\nu(C)$ as a computable real such that $\nu(U\setminus C)<\epsilon$ .

Proof.

We work with the $\nu$-computable basis $B(i,r_{j})$ as above (discussed immediately before Proposition 2.8). Let a rational $\epsilon>0$ be given. Suppose $U$ is c.e. open with $\nu(U)$ computable. Let $U=\bigcup_{k}B(i_{f(k)},r_{f(k)})$ where $f$ is a computable function. For each $m\geq 0$ let $U_{m}=\bigcup_{k<m}B(i_{f(k)},r_{f(k)})$. Note that $U_{m}$ has $\nu$-computable measure, uniformly in $m\geq 0$. Using this and the computability of $\nu(U)$, compute $m\geq 0$ such that $\nu(U)-\nu(U_{m})<\frac{\epsilon}{2}$. For each $k<m$, the set $W_{g(k)}=\{j:0<r_{j}<r_{f(k)}\}$ is c.e. and dense in the open interval $(0,r_{f(k)})$ and so $B(i_{f(k)},r_{f(k)})=\bigcup_{j\in W_{g(k)}}B(i_{f(k)},r_{j})=\bigcup_{j\in W_{g(k)}}B[i_{f(k)},r_{j}]$. Compute $s\geq 0$ such that $\nu(U_{m})-\nu(\bigcup_{k<m}\bigcup_{j\in W_{g(k),s}}B(i_{f(k)},r_{j}))<\frac{\epsilon}{2}$. Then $C=\bigcup_{k<m}\bigcup_{j\in W_{g(k),s}}B[i_{f(k)},r_{j}]$ is a finite union of effectively closed sets and so effectively closed; and further $\nu(C)$ is a computable real since $C$ has the same $\nu$-measure as a finite union of elements from the $\nu$-computable basis. Further $C\subseteq U$ and $\nu(U)-\nu(C)\leq\nu(U)-\nu(U_{m})+\nu(U_{m})-\nu(C)<\epsilon$. ∎
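The idea of the proof can be sketched in a simplified setting. The following is a toy version, assuming Lebesgue measure on the line and an open set already given as a finite union of rational open intervals, so that the first step (truncating the enumeration of $U$) is not needed and exact rational arithmetic replaces the search for the stage $s$. The function names are hypothetical. Each open interval is shrunk to a closed interval strictly inside it, with the total discarded measure kept below $\epsilon$.

```python
from fractions import Fraction

def lebesgue_of_union(intervals):
    """Lebesgue measure of a finite union of intervals given as (a, b) pairs."""
    total, right = Fraction(0), None
    for a, b in sorted(intervals):
        if right is None or a > right:
            total += b - a      # a new component starts
            right = b
        elif b > right:
            total += b - right  # the current component is extended
            right = b
    return total

def inner_closed_approximation(open_intervals, eps):
    """Shrink each open (a, b) to a closed [a + d, b - d] contained in it.

    With m intervals and d = eps / (4 m), each interval loses at most 2 d,
    so the discarded measure is at most 2 d m = eps / 2 < eps.
    Intervals of length at most 2 d are dropped entirely.
    """
    m = len(open_intervals)
    if m == 0:
        return []
    d = Fraction(eps) / (4 * m)
    return [(a + d, b - d) for a, b in open_intervals if b - a > 2 * d]

# Toy input (an assumption for illustration only).
U = [(Fraction(0), Fraction(1, 2)), (Fraction(1, 4), Fraction(3, 4)), (Fraction(7, 8), Fraction(1))]
C = inner_closed_approximation(U, Fraction(1, 10))
```

In the general setting of Proposition 2.12 the shrinking is done with radii from the dense sequence $r_{j}$, and the measures are only computable reals rather than exact rationals.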

The following is an important property of the interaction of $\mathsf{KR}^{\nu}$ with $\nu$ -computable bases:

Proposition 2.13.

Each element $A$ of the algebra generated by a $\nu$ -computable basis is uniformly identical on $\mathsf{KR}^{\nu}$ to a c.e. open $U$ , which is effectively paired with an effectively closed superset $C$ of $U$ of the same $\nu$ -measure.

Note that since $C\setminus U$ is an effectively closed $\nu$ -null set, $C=U$ on $\mathsf{KR}^{\nu}$ .

Proof.

Suppose that $B_{0},B_{1},\ldots$ is a $\nu$-computable basis with corresponding effectively closed sets $C_{i}\supseteq B_{i}$ of the same $\nu$-measure. Again, since $C_{i}\setminus B_{i}$ is an effectively closed $\nu$-null set, we have that $C_{i}=B_{i}$ on $\mathsf{KR}^{\nu}$. Then $X\setminus C_{i}$ is c.e. open with effectively closed superset $X\setminus B_{i}$, which agrees with it on $\mathsf{KR}^{\nu}$.

Suppose that $A$ is an element of the algebra generated by the $\nu$ -computable basis $B_{0},B_{1},\ldots$ . Then $A$ can be written as the finite union of finite intersections of the $B_{0},B_{1},\ldots$ and their relative complements $X\setminus B_{0},X\setminus B_{1},\ldots$ . This is indexed by a finite list of pairs of strings $\sigma_{1},\tau_{1},\ldots,\sigma_{n},\tau_{n}$ such that

A=\bigcup_{i=1}^{n}\bigg{(}\bigcap_{j<\left|\sigma_{i}\right|}B_{\sigma_{i}(j)}\cap\bigcap_{j<\left|\tau_{i}\right|}X\setminus B_{\tau_{i}(j)}\bigg{)}

(2.1)

Then form c.e. open $V$ by replacing the effectively closed $X\setminus B_{\tau_{i}(j)}$ with the c.e. open $X\setminus C_{\tau_{i}(j)}$ , and similarly form effectively closed $D$ by replacing c.e. open $B_{\sigma_{i}(j)}$ with effectively closed $C_{\sigma_{i}(j)}$ , as follows:

V=\bigcup_{i=1}^{n}\bigg{(}\bigcap_{j<\left|\sigma_{i}\right|}B_{\sigma_{i}(j)}\cap\bigcap_{j<\left|\tau_{i}\right|}X\setminus C_{\tau_{i}(j)}\bigg{)},\hskip 8.53581ptD=\bigcup_{i=1}^{n}\bigg{(}\bigcap_{j<\left|\sigma_{i}\right|}C_{\sigma_{i}(j)}\cap\bigcap_{j<\left|\tau_{i}\right|}X\setminus B_{\tau_{i}(j)}\bigg{)}

(2.2)

Then $A,V,D$ are equal on $\mathsf{KR}^{\nu}$ and hence have the same $\nu$ -measure, and further $V$ is c.e. open and $D\supseteq V$ is effectively closed.

∎

The previous proposition places topological constraints on the sets in $\nu$ -computable bases, at least when the measure has full support (that is, there are no open $\nu$ -null sets):

Proposition 2.14.

Suppose that $\nu$ is a computable point of $\mathcal{P}(X)$ .

(1)

If $\nu$ has full support and $\mathsf{XR}^{\nu}$ is a $\nu$ -measure one set and the c.e. open $U$ is equal to effectively closed $C$ on $\mathsf{XR}^{\nu}$ , then $\nu(\overline{U})=\nu(U)$ .
(2)

If $\nu$ has full support then no element $U$ of a $\nu$ -computable basis can satisfy $\nu(\overline{U})>\nu(U)$ .

In this, we use $\overline{\;\cdot\;}$ for topological closure.

Proof.

For (1), the c.e. open $U\setminus C$ is a subset of the $\nu$-null $X\setminus\mathsf{XR}^{\nu}$. Since $\nu$ has full support, we must have that $U\setminus C$ is empty, so that $U\subseteq C$ and $\overline{U}\subseteq C$. Since $U,C$ have the same $\nu$-measure, the same must then be true of $U,\overline{U}$. Finally, (2) follows from (1) and the previous proposition. ∎

By contrast, Proposition 2.8(3) implies any c.e. open $U$ with $\nu(U)$ computable and $\overline{U}$ effectively closed and $\nu(\overline{U})=\nu(U)$ can be added to any $\nu$ -computable basis to form a larger $\nu$ -computable basis.

For a simple example of a c.e. open as in Proposition 2.14(2), one has the following (this example is a minor modification of an example from a proof in [9, p. 58]):

Example 2.15.

Consider Cantor space with the uniform measure. Let $0=c_{0}<c_{1}<c_{2}<\cdots$ be a computable sequence of natural numbers such that $\sum_{n}2^{-(c_{n+1}-c_{n})}<\infty$ (resp. is computable). Let $I$ be any computable set. For all $n\geq 0$ , consider the following clopen:

U_{n}=\{\omega:\forall\;i\in[c_{n},c_{n+1})\;\big{(}\omega(i)=1\leftrightarrow(n,i)\in I\big{)}\}

Since $U_{n}$ makes decisions on $c_{n+1}-c_{n}$ many bits, its measure is $2^{-(c_{n+1}-c_{n})}$. Then $U=\bigcup_{n}U_{n}$ is a c.e. open. And $0<\nu(U)=\sum_{n}\nu(U_{n}\setminus\bigcup_{m<n}U_{m})\leq\sum_{n}\nu(U_{n})<\infty$ (resp. is computable by the Comparison Test and the fact that $U_{n}\setminus\bigcup_{m<n}U_{m}$ is clopen, cf. Example 2.9 and Proposition 2.8(1)). Moreover, since the $U_{n}$ constrain disjoint blocks of bits, they are independent, and so $\nu(X\setminus U)=\prod_{n}(1-2^{-(c_{n+1}-c_{n})})>0$ by the summability assumption; hence $\nu(U)<1$. Further, the set $U$ is dense and so its closure is the entire space.

For a similar example on the unit interval with Lebesgue measure, one can use the complements of positive measure Cantor sets.

2.3. The space of integrable functions

For $\nu$ a computable point of $\mathcal{M}^{+}(X)$ and $p\geq 1$ computable, there is a natural Polish space structure on $L_{p}(\nu)$ (cf. Definition 1.2). For, one can take as the countable dense set the simple functions $\sum_{i=1}^{n}q_{i}\cdot I_{A_{i}}$ , where $A_{i}$ come from the algebra of sets generated by a $\nu$ -computable basis. If $f,g$ are two such functions, then so is $f-g$ , and hence it suffices to show that if $h=\sum_{i=1}^{n}q_{i}\cdot I_{A_{i}}$ is such a simple function then $\|h\|_{p}$ is computable. Since $A_{i}$ comes from an algebra, we can assume that the $A_{i}$ are pairwise disjoint, which implies $\left|\sum_{i=1}^{n}q_{i}\cdot I_{A_{i}}\right|^{p}=\sum_{i=1}^{n}\left|q_{i}\right|^{p}\cdot I_{A_{i}}$ everywhere. Then one has $\|h\|_{p}=\big{(}\sum_{i=1}^{n}\left|q_{i}\right|^{p}\nu(A_{i})\big{)}^{\frac{1}{p}}$ , which is computable by Proposition 2.8(1). Note that the countable dense set is in $\mathbb{L}_{p}(\nu)$ , that is, it is defined everywhere rather than merely $\nu$ -a.s. (cf. Definition 1.2). But when we pass to their equivalence classes, they become elements of $L_{p}(\nu)$ , and they are a countable dense set in $L_{p}(\nu)$ .
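As a small worked instance of the norm formula above, the following sketch computes $\|h\|_{p}$ for a simple function with pairwise disjoint $A_{i}$, taking the values $\nu(A_{i})$ as given inputs. The function name is hypothetical, and floating point stands in for the computable reals of the text.

```python
from fractions import Fraction

def p_norm_simple(coeffs, measures, p):
    """||sum_i q_i * I_{A_i}||_p for pairwise disjoint A_i with nu(A_i) given."""
    integral = sum(abs(q) ** p * m for q, m in zip(coeffs, measures))
    return float(integral) ** (1.0 / p)

# Toy data: h = 3 * I_{A_1} - 4 * I_{A_2} with nu(A_1) = nu(A_2) = 1 and p = 2.
norm = p_norm_simple([3, -4], [Fraction(1), Fraction(1)], 2)
```

Disjointness is what allows the $p$-th power to pass inside the sum; for overlapping $A_{i}$ one would first refine to the atoms of the generated finite algebra.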

We do not record the choice of the $\nu$-computable basis in the notation for the computable Polish space $L_{p}(\nu)$. This is for two reasons. First, the $\nu$-computable bases are closed under effective union (Proposition 2.8(4)). Hence one can typically just assume that one is working with the union of whichever of them are salient in a given context. Second, one can check that any two $\nu$-computable bases result in computably homeomorphic presentations of $L_{p}(\nu)$.

Many of the natural continuous functions on the computable Polish space $L_{p}(\nu)$ are computable continuous, such as: addition, subtraction, multiplication by computable scalar, absolute value, maximum, minimum, positive part, and negative part.

By considering the continuous computable function $\Phi(f)=\left|f\right|-f$ , one sees that $L_{p}^{+}(\nu)=\Phi^{-1}(\{0\})$ , and so $L_{p}^{+}(\nu)$ is an effectively closed subset of $L_{p}(\nu)$ (cf. Definition 1.2). Further, it is a computable Polish subspace, since the equivalence classes of the non-negative elements of the countable dense set of $L_{p}(\nu)$ are dense in $L_{p}^{+}(\nu)$ .

Since we are working with a finite computable measure $\nu$ from $\mathcal{M}^{+}(X)$ , if $p\leq q$ , then the identity map is a computable continuous map from $L_{q}(\nu)$ into $L_{p}(\nu)$ and satisfies $\|f\|_{p}\leq\|f\|_{q}$ for all $f$ from $L_{q}(\nu)$ . We refer to this as the computable embedding of $L_{q}(\nu)$ into $L_{p}(\nu)$ .

In working with $L_{p}(\nu)$ for $p>1$ it is useful to remember the following inequalities:

u,v\geq 0:\hskip 14.22636ptu^{p}+v^{p}\leq(u+v)^{p}\hskip 8.53581pt\mbox{ \; \& \; }\hskip 8.53581pt(u+v)^{\frac{1}{p}}\leq u^{\frac{1}{p}}+v^{\frac{1}{p}}

(2.3)

By letting $u=x-y$ and $v=y$ , one obtains the following inequalities:

0\leq y\leq x:\hskip 14.22636pt(x-y)^{p}\leq x^{p}-y^{p}\hskip 8.53581pt\mbox{ \; \& \; }\hskip 8.53581ptx^{\frac{1}{p}}-y^{\frac{1}{p}}\leq(x-y)^{\frac{1}{p}}

(2.4)
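A quick numerical sanity check of the inequalities in (2.4) can be written as follows. This is only a floating-point spot check on sample values, with a small tolerance `tol` as an assumption to absorb rounding; it is not a proof.

```python
def check_inequalities(x, y, p, tol=1e-12):
    """Check both inequalities in (2.4) for 0 <= y <= x and p >= 1."""
    assert 0 <= y <= x and p >= 1
    first = (x - y) ** p <= x ** p - y ** p + tol
    second = x ** (1 / p) - y ** (1 / p) <= (x - y) ** (1 / p) + tol
    return first and second

# Sample grid (an assumption for illustration only).
samples = [(x / 4, y / 4, p) for x in range(9) for y in range(x + 1) for p in (1, 1.5, 2, 3)]
all_hold = all(check_inequalities(x, y, p) for x, y, p in samples)
```

The first inequality is the one used below to pass from $\int f^{p}-f_{s}^{p}\;d\nu$ to $\int(f-f_{s})^{p}\;d\nu$.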

The following proposition gives a canonical approximation of lsc functions which are bounded from below, and indicates that for the non-negative ones, being a computable point of $L_{p}(\nu)$ is solely a matter of the computability of the norm. The first part is due to Miyabe for $p=1$ ([44, Lemma 4.6]). This kind of approximation is a mainstay of working with lsc functions, and different approximations tend to be appropriate for different purposes (see [39, Definition 1.7.4 p. 35]). We will need a variation on this approximation in Proposition 7.10.

Proposition 2.16.

From a rational $q$ and a lsc function $f:X\rightarrow[q,\infty]$ , one can compute an index for a computable sequence of functions $f_{s}:X\rightarrow[q,\infty)$ from the countable dense set of $L_{1}(\nu)$ such that $f_{s}\leq f_{s+1}$ everywhere and $f=\sup_{s}f_{s}$ everywhere.

Further, if $p\geq 1$ is computable, then a non-negative lsc function $f:X\rightarrow[0,\infty]$ in $L_{p}(\nu)$ is a $L_{p}(\nu)$ Schnorr test (cf. Definition 1.1(4)) iff it is a computable point of $L_{p}(\nu)$ , and in this case the witness is a computable subsequence of $f_{s}$ .

Finally, if $p\geq 1$ is computable, then any non-negative lsc function $f:X\rightarrow[0,\infty]$ in $L_{p}(\nu)$ is a $L_{p}(\nu)$ Martin-Löf test (cf. Definition 1.1(5)), and $f_{s}\rightarrow f$ in $L_{p}(\nu)$ .

Proof.

Let $B_{0},B_{1},\ldots$ be a $\nu$-computable basis. Enumerate $\mathbb{Q}\cap[q,\infty)$ as $q_{0},q_{1},\ldots$. For each $n\geq 0$, one has that $f^{-1}(q_{n},\infty]$ is uniformly c.e. open. Hence, there is a computable function $g$ such that $f^{-1}(q_{n},\infty]=\bigcup_{i\in W_{g(n)}}B_{i}$. Then define

f_{s}(x)=\max\{q,q_{n}:n\leq s,i\in W_{g(n),s},x\in B_{i}\}

(2.5)

This is an element of the countable dense set of $L_{1}(\nu)$ since we just enumerate $\bigcup_{n\leq s}W_{g(n),s}$ as $i_{0},\ldots,i_{k(s)}$ and for each subset $K$ of $\{i_{0},\ldots,i_{k(s)}\}$ we consider the element $B_{K}=\bigcap_{i_{j}\in K}B_{i_{j}}\cap\bigcap_{i_{j}\notin K}X\setminus B_{i_{j}}$ of the algebra generated by the $\nu$-computable basis, and we let $q_{K}=\max(\{q\}\cup\{q_{n}:n\leq s,i_{j}\in K,i_{j}\in W_{g(n),s}\})$, so that we have $f_{s}=\sum_{K\subseteq\{i_{0},\ldots,i_{k(s)}\}}q_{K}\cdot I_{B_{K}}$. Here the term for $K=\emptyset$ contributes the value $q$ on the points lying in none of the $B_{i_{j}}$. Further, at the initial stages $s$ (if any) where $\bigcup_{n\leq s}W_{g(n),s}$ is empty, we set $f_{s}=q\cdot I_{X}$.

Further from (2.5) one sees that $f_{s}\leq f_{s+1}$ since the set over which we are taking the maximum grows in $s$. Further, one has $f_{s}\leq f$ everywhere: if we had $f_{s}(x)>f(x)$, then, since $f\geq q$, we would have $f_{s}(x)=q_{n}$ for some $n\leq s$ with $i\in W_{g(n),s}$ and $x$ in $B_{i}$. But then $B_{i}\subseteq f^{-1}(q_{n},\infty]$, and so $f(x)>q_{n}=f_{s}(x)$, a contradiction. Finally, one has $\sup_{s}f_{s}=f$ everywhere. If not, we would have $\sup_{s}f_{s}(x)<q_{n}<f(x)$ for some $x$ and some $n$. Then $x$ would be in $f^{-1}(q_{n},\infty]=\bigcup_{i\in W_{g(n)}}B_{i}$, and so $x$ would be in $B_{i}$ for some $i$ in $W_{g(n)}$. Hence there would be $s\geq n$ such that $i$ is in $W_{g(n),s}$, and by the definition in (2.5) one would have $f_{s}(x)\geq q_{n}$, a contradiction.

Suppose $p\geq 1$ is computable and $f:X\rightarrow[0,\infty]$ is lsc and in $L_{p}(\nu)$. If $f$ is a computable point of $L_{p}(\nu)$, then since the norm is computable continuous (using Proposition 2.5), we have that $\|f\|_{p}$ is computable (using Proposition 2.4). Conversely, suppose that $f$ is an $L_{p}(\nu)$ Schnorr test, so that $\|f\|_{p}$ is computable. Then by taking $p$-th powers, we have that $\int f^{p}\;d\nu$ is computable. Since $\int f_{s}^{p}\;d\nu$ converges upwards to $\int f^{p}\;d\nu$ and we can compute both, we can compute an $s(n)$ such that $\int f^{p}-f_{s(n)}^{p}\;d\nu<2^{-np}$ for all $n\geq 0$. Then using the estimate $(f-f_{s(n)})^{p}\leq f^{p}-f_{s(n)}^{p}$ from (2.4), we have that $\int(f-f_{s(n)})^{p}\;d\nu<2^{-np}$, and so by taking $p$-th roots we have $\|f-f_{s(n)}\|_{p}<2^{-n}$.

Similarly, for the last point, since $\int f_{s}^{p}\;d\nu$ converges upwards to $\int f^{p}\;d\nu$ , we can use the estimate from (2.4) to argue that for all $\epsilon>0$ there is $s_{0}\geq 0$ such that for all $s\geq s_{0}$ one has $\int(f-f_{s})^{p}\;d\nu\leq\int f^{p}\;d\nu-\int f_{s}^{p}\;d\nu<\epsilon^{p}$ , and so $\|f-f_{s}\|_{p}<\epsilon$ . ∎
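The approximation in (2.5) can be sketched on a toy example. The following assumes $X=[0,1]$, basic open sets given as rational open intervals, and a single finite list of pairs standing in for the staged enumerations $W_{g(n),s}$; all names are hypothetical. Each pair $(r,(a,b))$ records that $f>r$ on the interval $(a,b)$, and $f_{s}$ takes the largest value certified at $x$ by the first $s$ pairs, defaulting to the lower bound $q$.

```python
from fractions import Fraction

def approximation(enumeration, q, s):
    """Return f_s as a function, built from the first s enumerated pairs.

    Each pair (r, (a, b)) asserts that f > r on the open interval (a, b).
    """
    seen = enumeration[:s]
    def f_s(x):
        return max([q] + [r for r, (a, b) in seen if a < x < b])
    return f_s

# Toy data: f is the indicator of the open interval (1/4, 3/4), and q = 0.
toy_enumeration = [
    (1 - Fraction(1, 2 ** k),
     (Fraction(1, 4) + Fraction(1, 2 ** (k + 2)),
      Fraction(3, 4) - Fraction(1, 2 ** (k + 2))))
    for k in range(1, 20)
]

def indicator(x):
    return 1 if Fraction(1, 4) < x < Fraction(3, 4) else 0
```

For instance, `approximation(toy_enumeration, Fraction(0), 5)` takes only finitely many rational values on finitely many intervals, as elements of the countable dense set should, and the values at a fixed point increase with $s$ while staying below the indicator.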

The following records the “universal test” for $\mathsf{MLR}^{\nu}$. For integral tests, it is due to Gács and Hoyrup-Rojas in the case $p=1$ (Gács [22, 102, Corollary 3.3], Hoyrup-Rojas [30, 845-6]). Further, the version stated here is simplified in that it is only stated for a single measure, whereas these authors state a version where the lsc functions have domain $\mathcal{P}(X)\times X$. Hoyrup-Rojas improve on Gács by removing any assumption about the computability of the Boolean algebra structure on the algebra generated by the canonical computable basis.

Proposition 2.17.

Suppose $\nu$ is a computable point of $\mathcal{P}(X)$ and $p\geq 1$ is computable.

Then there is an $L_{p}(\nu)$ Martin-Löf test $f$ with $\|f\|_{p}\leq 1$ such that for all $L_{p}(\nu)$ Martin-Löf tests $g$ with $\|g\|_{p}\leq 1$ there is a constant $c>0$ such that $g\leq c\cdot f$ everywhere.

Hence $\mathsf{MLR}^{\nu}=\bigcup_{n}f^{-1}[0,n]$, the union of an increasing sequence of effectively closed sets.

Proof.

(Sketch) Enumerate the $L_{p}(\nu)$ Martin-Löf tests with $p$-norm $\leq 1$ as $h_{0},h_{1},\ldots$. Do this by enumerating approximations to them (as in Proposition 2.16) which have $p$-norm $<1$. Then set $f=\sum_{e}2^{-(e+1)}\cdot h_{e}$, so that $\|f\|_{p}\leq\sum_{e}2^{-(e+1)}=1$ and $h_{e}\leq 2^{e+1}\cdot f$ everywhere. ∎

The previous proposition has the following useful consequence regarding computable domination, which, recall, features in Theorem 1.5(2):

Proposition 2.18.

Suppose that $X$ is computably compact and $\nu$ is a computable point of $\mathcal{P}(X)$ . Then there are points in $\mathsf{MLR}^{\nu}$ of computably dominated degree.

The main idea of the proof is to build a computably compact space of fast Cauchy sequences above $X$, and to apply there the Computably Dominated Basis Theorem ([9, Theorem 3.7 p. 54], [71, Theorem 9.5.1 p. 179]). One can of course thematize the space of fast Cauchy sequences more than we do in this short paper; in part, the proof below carries out the construction “by hand” in the computably compact case.

Proof.

Without loss of generality, we identify the countable dense set with the natural numbers. By the effective Baire Category Theorem, choose a strictly decreasing computable sequence of positive reals $\eta_{s}<2^{-(s+1)}$ such that the sets $\{\eta_{s}:s\geq 0\}$ and $\{\frac{1}{2}\cdot d(i,j):i,j\geq 0\}$ are disjoint. We define a non-decreasing computable sequence $n_{s}$ of natural numbers as follows. Suppose that $n_{t}$ has already been defined for all $t<s$. At stage $s$, we consider the open cover $B(0,\eta_{s}),B(1,\eta_{s}),\ldots$ and use computable compactness to compute an $n_{s}$, with $n_{s}\geq n_{s-1}$ when $s>0$, such that $B(0,\eta_{s}),\ldots,B(n_{s},\eta_{s})$ covers $X$. Define the following computable trees:

\begin{aligned}
T_{0}&=\{\sigma\in\mathbb{N}^{<\mathbb{N}}:\forall\;t<\left|\sigma\right|\;\exists\;i\leq n_{t}\;\sigma(t)=i\}\\
T&=\{\sigma\in T_{0}:\forall\;t<\left|\sigma\right|\;\forall\;r\in[t,\left|\sigma\right|)\;d(\sigma(t),\sigma(r))\leq 2\eta_{t}\}
\end{aligned}

The tree $T$ is computable since the sets $\{2\cdot\eta_{s}:s\geq 0\}$ and $\{d(i,j):i,j\geq 0\}$ are disjoint. Further, $T$ has no dead ends since we can just extend by repeating the last entry (since $n_{s}\geq n_{s-1}$). Let $C=[T]$, which is then a computable Polish space with countable dense set given by extending any node $\sigma$ in $T$ by means of repeating its last entry indefinitely. Since the function $t\mapsto n_{t}$ is computable, one has that $C$ is strongly computably compact.

The map $\pi:C\rightarrow X$ given by sending $\omega$ to $\lim_{i}\omega(i)$ in $X$ is well-defined. For, since $\eta_{s}<2^{-(s+1)}$ , every $\omega$ in $C$ is a Cauchy sequence.

By definition of $T$ , note that $d(\pi(\omega),\omega(t))\leq 2\eta_{t}$ for all $t\geq 0$ . For let $\epsilon>0$ . Since $d(\pi(\omega),\omega(r))\rightarrow 0$ , choose $r>t$ such that $d(\pi(\omega),\omega(r))<\epsilon$ . Then $d(\pi(\omega),\omega(t))\leq d(\pi(\omega),\omega(r))+d(\omega(t),\omega(r))\leq\epsilon+2\eta_{t}$ .

Note that any $\omega$ in $C$ is a sequence from the countable dense set of $X$ which converges fast to $\pi(\omega)$ . This is because $2\eta_{t}<2^{-t}$ .

Further, $\pi:C\rightarrow X$ is surjective: if $x$ in $X$ is given, then for each $j$ choose $\omega(j)\leq n_{j}$ such that $x$ is in $B(\omega(j),\eta_{j})$ . Then $\omega$ is in $C$ since for all $k\geq j$ one has $d(\omega(j),\omega(k))\leq d(\omega(j),x)+d(x,\omega(k))\leq\eta_{j}+\eta_{k}\leq 2\eta_{j}$ .

Then by Proposition 2.5, the map $\pi:C\rightarrow X$ is computable continuous since it has a computable modulus of uniform continuity. For, if rational $\epsilon>0$ is given, compute the least $\ell\geq 0$ such that $4\cdot\eta_{\ell}<\epsilon$. Suppose that $\omega,\omega^{\prime}$ are in $C$ and agree on all entries up to and including $\ell$. Then $d(\pi(\omega),\pi(\omega^{\prime}))\leq d(\pi(\omega),\omega(\ell))+d(\omega(\ell),\omega^{\prime}(\ell))+d(\omega^{\prime}(\ell),\pi(\omega^{\prime}))\leq 2\cdot\eta_{\ell}+0+2\cdot\eta_{\ell}<\epsilon$.

By the previous proposition, choose a non-empty effectively closed subset $D$ of $X$ which consists only of $\mathsf{MLR}^{\nu}$ ’s. Then $\pi^{-1}(D)\subseteq C$ is an effectively closed subset of $C$ , which is thus strongly computably compact since $C$ is. By the Computably Dominated Basis Theorem, there is an element $\omega$ of $\pi^{-1}(D)$ of computably dominated degree. ∎

The following example shows that one cannot in general assume that the $\mathsf{MLR}^{\nu}$ ’s of computably dominated degree in the previous proposition are non-atoms.

Example 2.19.

There is an uncountable computably compact computable Polish space $X$ and a computable point $\nu$ of $\mathcal{P}(X)$ such that the only elements of $\mathsf{MLR}^{\nu}(X)$ of computably dominated degree are among the atoms.

This follows from a construction of Ng et al. ([49, Lemma 2.1, Theorem 2.2]). Let $\mu$ be the uniform measure on Cantor space $Y=\{0,1\}^{\mathbb{N}}$ and let $Z=\{0,1,2\}^{\mathbb{N}}$. Ng et al. construct a computable continuous map $f:Y\rightarrow Z$ with image $X$ such that every $\omega$ in $\mathsf{MLR}^{\mu,\emptyset^{\prime}}(Y)$ is such that $f(\omega)$ is non-isolated in $X$, and vice-versa, and in this circumstance $f(\omega)$ and $\omega$ have the same Turing degree.

The image $X$ is a computable Polish space, since it is the computable continuous image of Cantor space (cf. [10, Theorem 2.4.8(3) pp. 73-74]). Further, pushforwards of computable probability measures under computable continuous maps are computable probability measures (by Proposition 2.7), and so $\nu:=f\#\mu$ is a computable point of $\mathcal{P}(X)$. Note that $\nu$ has full support since $\mu$ has full support.

Suppose that $\omega^{\prime}$ in $X$ is in $\mathsf{MLR}^{\nu}(X)$ and is of computably dominated degree. Then we claim that $\omega^{\prime}$ is an atom. For reductio, suppose not. Since any isolated point in a space with full support is an atom, one has that $\omega^{\prime}$ is not isolated. Since $f:Y\rightarrow X$ is a surjection, choose $\omega$ in $Y$ with $f(\omega)=\omega^{\prime}$. By the construction, $\omega$ is in $\mathsf{MLR}^{\mu,\emptyset^{\prime}}(Y)$. But these points are not of computably dominated degree (e.g. [15, Theorem 8.21.2 p. 382]). Since $\omega,\omega^{\prime}$ have the same Turing degree, $\omega^{\prime}$ is not of computably dominated degree, contrary to hypothesis.

It is not clear to us what happens in the general atomless non-compact case:

Question 2.20.

Suppose that $X$ is a computable Polish space which is not computably compact, and that $\nu$ in $\mathcal{P}(X)$ is computable and atomless. Is there an element in $\mathsf{MLR}^{\nu}(X)$ that is of computably dominated degree?

If $X$ is the reals, it can be written as an effective union of computably compact Polish subspaces, and so the answer is affirmative, by Proposition 2.18. If $X$ is Baire space, then the answer is again affirmative, by using effective tightness to describe the $\mathsf{MLR}^{\nu}$ ’s as a subset of a countable union of computably compact sets, and then applying the Computably Dominated Basis Theorem again. Hence to answer the question negatively one should be looking for spaces which are not “effectively $K_{\sigma}$ ” and spaces where effective tightness does not produce a union of computably compact sets containing the $\mathsf{MLR}^{\nu}$ ’s.

If $\nu$ is in $\mathcal{P}(X)$ and $f:X\rightarrow[-\infty,\infty]$ is in $L_{1}(\nu)$ , then it induces the push-forward probability measure $(f\#\nu)(A)=\nu(f^{-1}(A))$ in $\mathcal{P}(\mathbb{R})$ . The following proposition tells us that the map $f\mapsto f\#\nu$ is computable continuous. We use this proposition primarily in conjunction with Proposition 2.4 and Proposition 2.7, which together tell us that pushforwards of $L_{1}(\nu)$ -computable functions are themselves computable.

Proposition 2.21.

Let $X$ be a computable Polish space. Suppose that $\nu$ is a computable point of $\mathcal{P}(X)$ . Then the map from $L_{1}(\nu)$ to $\mathcal{P}(\mathbb{R})$ given by sending $f$ to $f\#\nu$ is continuous computable. Similarly, the map from $L_{1}^{+}(\nu)$ to $\mathcal{P}(\mathbb{R}^{\geq 0})$ given by sending $f$ to $f\#\nu$ is continuous computable.

Proof.

We apply Proposition 2.5.

Suppose that $f$ is an element of the countable dense set of $L_{1}(\nu)$ . Then $f=\sum_{i=1}^{m}q_{i}\cdot I_{A_{i}}$ , where $q_{i}$ is rational and the $A_{i}$ are elements of the algebra generated by a $\nu$ -computable basis. Then uniformly in rationals $p<q$ one has that $\nu(f^{-1}(p,q))=\nu(\cup\{A_{i}:1\leq i\leq m,q_{i}\in(p,q)\})$ , which is left-c.e. and indeed computable. Hence $f\#\nu$ is a computable point of $\mathcal{P}(\mathbb{R})$ by Proposition 2.7.

Any computable function $m:\mathbb{Q}^{>0}\rightarrow\mathbb{Q}^{>0}$ satisfying $m(\epsilon)<\epsilon$ is a computable modulus of uniform continuity. To see this, suppose that $\epsilon>0$ , and suppose that $h:\mathbb{R}\rightarrow\mathbb{R}$ is 1-Lipschitz, and that $\mathbb{E}_{\nu}\left|f-g\right|<m(\epsilon)$ . By change of variables, one has that $\left|\mathbb{E}_{f\#\nu}h-\mathbb{E}_{g\#\nu}h\right|=\left|\mathbb{E}_{\nu}(h\circ f)-\mathbb{E}_{\nu}(h\circ g)\right|\leq\mathbb{E}_{\nu}\left|h\circ f-h\circ g\right|\leq\mathbb{E}_{\nu}\left|f-g\right|<m(\epsilon)$ , where the second-to-last inequality uses that $h$ is 1-Lipschitz. By taking the supremum over all $1$ -Lipschitz $h:\mathbb{R}\rightarrow\mathbb{R}$ with $\|h\|_{\infty}\leq 1$ , one has $d_{KR}(f\#\nu,g\#\nu)\leq m(\epsilon)$ , which by construction is $<\epsilon$ .

Since $\{f\in L_{1}(\nu):f\geq 0\}$ is a computable Polish subspace of $L_{1}(\nu)$ , the restriction of $f\mapsto f\#\nu$ to it is also computable continuous. ∎

The above proposition has the following extremely useful consequence. (Footnote 71: Outside of density, the statement of this lemma is contained in Miyabe’s proof of his characterisation of $\mathsf{SR}^{\nu}$ in terms of $L_{p}(\nu)$ Schnorr tests. See e.g. the line “It follows that $\mu(\{x:t(x)>r_{n}\})$ is computable uniformly in $n$ ” ([45, p. 6]). Miyabe does not use pushforwards, but rather does it out by hand for $L_{p}(\nu)$ Schnorr tests.)

Lemma 2.22.

Let $X$ be a computable Polish space. Suppose that $\nu$ is a computable point of $\mathcal{P}(X)$ .

Suppose $f:X\rightarrow[0,\infty]$ is lsc with $f<\infty$ $\nu$ -a.s. Suppose that $f\#\nu$ is a computable point of $\mathcal{P}(\mathbb{R}^{\geq 0})$ . Then there is a computable sequence of reals $r_{i}>0$ dense in $[0,\infty)$ such that $f^{-1}(r_{i},\infty]$ is c.e. open with uniformly $\nu$ -computable measure.

In particular, this is true of any $L_{p}(\nu)$ Schnorr test.

Proof.

Let $\mu:=f\#\nu$ , which by hypothesis is a computable point of $\mathcal{P}(\mathbb{R}^{\geq 0})$ . By the Hoyrup-Rojas result discussed in §2.2, there is a $\mu$ -computable basis of the form $(q-r_{i},q+r_{i})\cap\mathbb{R}^{\geq 0}$ , where $q$ ranges over rationals and $r_{i}>0$ is a computable sequence dense in $[0,\infty)$ , and where further $(q-r_{i},q+r_{i})\cap\mathbb{R}^{\geq 0}$ has the same $\mu$ -measure as $[q-r_{i},q+r_{i}]\cap\mathbb{R}^{\geq 0}$ . Since $f<\infty$ $\nu$ -a.s., we have $\nu(f^{-1}(r_{i},\infty])=\nu(f^{-1}(r_{i},\infty))=(f\#\nu)(r_{i},\infty)=\mu(r_{i},\infty)=1-\mu[0,r_{i}]=1-\mu([-r_{i},r_{i}]\cap\mathbb{R}^{\geq 0})$ , which is computable.

The last point follows from the previous proposition. ∎

The following proposition is elementary but useful. (Recall usc was defined in Definition 1.1(2)).

Proposition 2.23.

For any element $f$ of the countable dense set of $L_{p}^{+}(\nu)$ , one can compute an index for a non-negative lsc function $g$ and a non-negative usc function $h$ such that $f=g=h$ on $\mathsf{KR}^{\nu}$ .

Likewise, for any element $f$ of the countable dense set of $L_{p}(\nu)$ , one can compute a rational $q$ and an index for a non-negative lsc function $g$ and a non-negative usc function $h$ such that $f-q=g=h$ on $\mathsf{KR}^{\nu}$ .

Proof.

Let $f$ be an element of the countable dense set of $L_{p}^{+}(\nu)$ . Then $f=\sum_{i=1}^{k}q_{i}\cdot I_{A_{i}}$ , where $q_{i}\geq 0$ is rational and $A_{i}$ is an element of the algebra generated by a $\nu$ -computable basis. By Proposition 2.13, suppose that $U_{i}$ is a c.e. open which is uniformly equal on $\mathsf{KR}^{\nu}$ to $A_{i}$ , and suppose that $C_{i}$ is an effectively closed superset of $U_{i}$ of the same $\nu$ -measure. Then $g:=\sum_{i=1}^{k}q_{i}\cdot I_{U_{i}}$ is non-negative lsc, and $h:=\sum_{i=1}^{k}q_{i}\cdot I_{C_{i}}$ is non-negative usc, and they agree with $f$ on $\mathsf{KR}^{\nu}$ .

Let $f$ be an element of the countable dense set of $L_{p}(\nu)$ . Then $f=\sum_{i=1}^{k}q_{i}\cdot I_{A_{i}}$ , where $q_{i}$ is rational and $A_{i}$ is an element of the algebra generated by a $\nu$ -computable basis. Let $q=\min_{i}q_{i}$ . Then $f-q$ is an element of the countable dense set of $L_{p}^{+}(\nu)$ . ∎

2.4. The space of measurable functions

The space of equivalence classes of Borel measurable functions that are finite $\nu$ -a.s. under $\nu$ -a.s. identity is denoted by $L_{0}(X,\nu)$ , where $\nu$ is in $\mathcal{M}^{+}(X)$ . We write $L_{0}(\nu)$ when $X$ is clear from context.

In keeping with the notational conventions in §1.2, we write $\mathbb{L}_{0}(\nu)$ for the pointwise-defined Borel measurable functions that are finite $\nu$ -a.s.

The topology on $L_{0}(\nu)$ is given by convergence in measure. To enhance readability, if $h$ is a measurable function, then we write $\nu(\left|h\right|>\epsilon)$ for the more cumbersome $\nu(\{x\in X:\left|h\right|(x)>\epsilon\})$ . Then recall $f_{n}\rightarrow f$ in measure iff for all $\epsilon>0$ one has that $\lim_{n}\nu(\left|f_{n}-f\right|>\epsilon)=0$ . Recall that a consequence of Egoroff’s Theorem is that $f_{n}\rightarrow f$ $\nu$ -a.s. implies $f_{n}\rightarrow f$ in $L_{0}(\nu)$ for $\nu$ in $\mathcal{M}^{+}(X)$ . (Footnote 72: [21, p. 62].)

A compatible complete metric is given by $d(f,g)=\|f-g\|_{0}$ where $\|h\|_{0}=\inf\{\epsilon>0:\nu(\left|h\right|>\epsilon)<\epsilon\}$ . Note that the set $\{\epsilon>0:\nu(\left|h\right|>\epsilon)<\epsilon\}$ is upwards closed, so that $\|h\|_{0}=\sup\{\epsilon>0:\nu(\left|h\right|>\epsilon)>\epsilon\}$ . When $\nu$ is in $\mathcal{P}(X)$ , this is called the Ky Fan metric. (Footnote 73: [16, 289].) While $\|\cdot\|_{0}$ satisfies the triangle inequality $\|f+g\|_{0}\leq\|f\|_{0}+\|g\|_{0}$ and satisfies $\|f\|_{0}=0$ iff $f=0$ $\nu$ -a.s., it does not in general satisfy $\|c\cdot h\|_{0}=\left|c\right|\cdot\|h\|_{0}$ . (Footnote 74: [14, 65-69], [16, 289-290]. Footnote 75: More generally, $L_{0}(\nu)$ is not a Banach space.) In working with the metric, it is useful to note that $\|h\|_{0}\leq\epsilon$ iff $\nu(\left|h\right|>\epsilon)\leq\epsilon$ . Finally, note that $\left|f\right|\leq\left|g\right|$ $\nu$ -a.s. implies $\|f\|_{0}\leq\|g\|_{0}$ in $L_{0}(\nu)$ .

The natural countable dense set for $L_{0}(\nu)$ is the set of rational-valued simple functions formed from the algebra generated by a $\nu$ -computable basis, that is, the same countable dense set as we used for $L_{p}(\nu)$ for $p\geq 1$ computable. Classically, this set is dense in $L_{0}(\nu)$ , so it remains to verify that the distance between any two of its points is uniformly computable:

Proposition 2.24.

If $h$ is a rational-valued simple function formed from the algebra generated by a $\nu$ -computable basis, then $\|h\|_{0}$ is computable, and uniformly so. If $f,g$ are two such simple functions, then $\|f-g\|_{0}$ is computable, and uniformly so.

Proof.

If $h$ is one of these functions, then so too is $\left|h\right|$ . Suppose that $\left|h\right|=\sum_{i=1}^{n}q_{i}\cdot I_{A_{i}}$ , where $q_{i}\geq 0$ is rational and the $A_{i}$ are pairwise disjoint events from the algebra generated by a $\nu$ -computable basis.

For $\epsilon$ in $\mathbb{Q}^{>0}$ , let $J_{\epsilon}=\{i\in[1,n]:q_{i}>\epsilon\}$ , which is a finite set whose index is computable uniformly from $\epsilon>0$ . Then $\nu(\left|h\right|>\epsilon)=\sum_{i\in J_{\epsilon}}\nu(A_{i})$ , which is a computable real, uniformly in $\epsilon>0$ (by Proposition 2.8(1)).

Then $\epsilon$ in $\mathbb{Q}^{>0}$ satisfies $\nu(\left|h\right|>\epsilon)<\epsilon$ iff $\sum_{i\in J_{\epsilon}}\nu(A_{i})<\epsilon$ , which is a c.e. condition. If we enumerate these rational $\epsilon$ and take mins as we go, we get a computable decreasing sequence of rationals which converges down to $\|h\|_{0}$ , so that $\|h\|_{0}$ is right-c.e.

Likewise, $\delta$ in $\mathbb{Q}^{>0}$ satisfies $\nu(\left|h\right|>\delta)>\delta$ iff $\sum_{i\in J_{\delta}}\nu(A_{i})>\delta$ , which is a c.e. condition. If we enumerate these rational $\delta$ and take maxes as we go, we get an increasing computable sequence of rationals which converges up to $\|h\|_{0}$ , so that $\|h\|_{0}$ is left-c.e.

Similarly if $f,g$ are from a countable dense set then $\|f-g\|_{0}$ is a computable real since $f-g$ is also an element of the countable dense set. ∎
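To illustrate the proof, the following minimal Python sketch computes $\|h\|_{0}$ exactly for a simple function presented as a list of pairs $(q_{i},\nu(A_{i}))$ with the $A_{i}$ pairwise disjoint. The name `ky_fan_norm` and the input representation are illustrative assumptions of ours. Rather than running the two enumerations from the proof, the sketch uses the fact that $\epsilon\mapsto\nu(\left|h\right|>\epsilon)$ is a step function which is constant between consecutive values of $\left|h\right|$ .

```python
from fractions import Fraction


def ky_fan_norm(terms):
    """Exact value of inf{eps > 0 : nu(|h| > eps) < eps}.

    `terms` is a list of pairs (q_i, nu(A_i)) of Fractions, where
    |h| = sum_i q_i * I_{A_i}, q_i >= 0, and the A_i are pairwise disjoint.
    """
    # Breakpoints: 0 together with the distinct positive values of |h|.
    cuts = [Fraction(0)] + sorted({q for q, _ in terms if q > 0})
    best = None
    for a in cuts:
        # On [a, next breakpoint) the tail measure nu(|h| > eps) is constant.
        tail = sum((w for q, w in terms if q > a), Fraction(0))
        # Within that interval, eps qualifies iff eps > tail, so the
        # candidate infimum contributed by the interval is max(a, tail).
        candidate = max(a, tail)
        best = candidate if best is None else min(best, candidate)
    return best


half = Fraction(1, 2)
norm_h = ky_fan_norm([(Fraction(1), half)])    # h = I_A with nu(A) = 1/2
norm_2h = ky_fan_norm([(Fraction(2), half)])   # 2h
```

In this toy example both `norm_h` and `norm_2h` come out as $\frac{1}{2}$ , which is a concrete instance of the failure of $\|c\cdot h\|_{0}=\left|c\right|\cdot\|h\|_{0}$ noted above.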

We call the following the computable embedding of $L_{p}(\nu)$ into $L_{0}(\nu)$ . The square root in the rate of convergence is, in our view, explanatory of the many $\sqrt{2}$ ’s that appear in Pathak et al. when dealing with computable points of $L_{1}(\nu)$ . (Footnote 76: See e.g. [52, p. 343].)

Proposition 2.25.

Suppose that $p\geq 1$ is computable. Then the inclusion map is a uniformly continuous computable map from $L_{p}(\nu)$ to $L_{0}(\nu)$ . Further, if $f_{n}\rightarrow f$ at geometric rate $b>1$ of convergence in $L_{p}(\nu)$ , then $f_{n}\rightarrow f$ at geometric rate $\sqrt{b}$ in $L_{0}(\nu)$ .

Proof.

Suppose that $p\geq 1$ . Since $L_{p}(\nu)$ and $L_{0}(\nu)$ have the same countable dense set, by Proposition 2.5, it suffices to show that there is a computable modulus $m:\mathbb{Q}^{>0}\rightarrow\mathbb{Q}^{>0}$ of uniform continuity. Given rational $\epsilon>0$ , compute rational $\delta<\epsilon$ and compute a rational $m(\epsilon)<\delta^{1+\frac{1}{p}}$ . Suppose $f,g$ are in $L_{p}(\nu)$ with $\|f-g\|_{p}<m(\epsilon)$ . Then $\nu(\left|f-g\right|>\delta)\leq\frac{1}{\delta^{p}}\|f-g\|_{p}^{p}\leq\frac{1}{\delta^{p}}m(\epsilon)^{p}<\delta$ , and so $\|f-g\|_{0}\leq\delta<\epsilon$ .

Suppose that $p\geq 1$ and suppose $f_{n}\rightarrow f$ at geometric rate $b>1$ of convergence in $L_{p}(\nu)$ . Then $\|f-f_{n}\|_{1}\leq\|f-f_{n}\|_{p}$ , and so $f_{n}\rightarrow f$ at geometric rate $b>1$ in $L_{1}(\nu)$ . Let $n\geq 0$ . Then $\nu(\left|f-f_{n}\right|>(\sqrt{b})^{-n})\leq(\sqrt{b})^{n}\cdot\|f-f_{n}\|_{1}\leq(\sqrt{b})^{-n}$ . Then $\|f-f_{n}\|_{0}\leq(\sqrt{b})^{-n}$ . ∎
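The choice of modulus in the first paragraph of the proof can be made concrete. The sketch below is one possible choice, under our own assumption that we take $\delta=\min(\epsilon,1)/2$ and $m(\epsilon)=\delta^{2}/2$ , which lies below $\delta^{1+\frac{1}{p}}$ for every $p\geq 1$ because $\delta<1$ . The helper names are hypothetical, and `markov_bound` is only evaluated for integer $p$ to keep the arithmetic exact.

```python
from fractions import Fraction


def modulus(eps):
    """Return (delta, m) with delta < eps and m < delta**(1 + 1/p) for all p >= 1."""
    delta = min(Fraction(eps), Fraction(1)) / 2  # rational, delta < eps, delta <= 1/2
    m = delta ** 2 / 2  # delta**2 <= delta**(1 + 1/p) since delta < 1 and p >= 1
    return delta, m


def markov_bound(delta, m, p):
    """The bound m**p / delta**p on nu(|f - g| > delta) when ||f - g||_p < m."""
    return m ** p / delta ** p


delta, m = modulus(Fraction(1, 3))
bound_p2 = markov_bound(delta, m, 2)
```

For each integer $p\geq 1$ the value `markov_bound(delta, m, p)` is strictly below `delta`, which is the inequality needed to conclude $\|f-g\|_{0}\leq\delta<\epsilon$ .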

The following proposition is the natural effectivization of the Bounded Convergence Theorem. (Footnote 77: [75, 130].)

Proposition 2.26.

(Effective Bounded Convergence Theorem). Suppose $\nu$ is a computable point of $\mathcal{P}(X)$ . Then:

(1)

Suppose that $f_{n}\rightarrow f$ in $L_{0}(\nu)$ at a geometric rate of convergence $b\geq\sqrt{2}$ . Suppose that $K\geq 0$ is such that $\left|f_{n}\right|\leq K$ $\nu$ -a.s. for all $n\geq 0$ . Then $f_{n+2}\rightarrow f$ at a geometric rate of convergence $b$ in $L_{1}(\nu)$ .
(2)

Suppose $f$ is a computable point of $L_{0}(\nu)$ and $K\geq 0$ is a rational such that $\left|f\right|\leq K$ $\nu$ -a.s. Then $f$ is a computable point of $L_{1}(\nu)$ .
(3)

Suppose $f_{n}$ is a uniformly computable point of $L_{0}(\nu)$ and $K_{n}\geq 0$ is a uniformly computable sequence of rationals such that $\left|f_{n}\right|\leq K_{n}$ $\nu$ -a.s. Then $f_{n}$ is uniformly a computable point of $L_{1}(\nu)$ .

Proof.

For (1), classically some subsequence of $f_{n}$ converges $\nu$ -a.s. to $f$ . Hence, $\left|f\right|\leq K$ $\nu$ -a.s. Further, we may suppose $K>1$ . Suppose that $f_{n}\rightarrow f$ at a geometric rate $b\geq\sqrt{2}$ of convergence in $L_{0}(\nu)$ , so that $\nu(\left|f_{n}-f\right|>b^{-n})\leq b^{-n}$ for all $n\geq 0$ . Choose $n_{0}\geq 2$ sufficiently large so that $b^{-n_{0}}<\frac{1}{2K}$ . Let $c=b^{n_{0}}$ , so that for all $n\geq 2$ we have $2K\cdot c^{-n}\leq 2K\cdot b^{-n_{0}}\cdot b^{-n}<b^{-n}$ . Then $\|f_{n+2}-f\|_{1}\leq\int_{\left|f_{n+2}-f\right|>c^{-(n+2)}}\left|f_{n+2}-f\right|\;d\nu+\int_{\left|f_{n+2}-f\right|\leq c^{-(n+2)}}\left|f_{n+2}-f\right|\;d\nu\leq 2K\cdot c^{-(n+2)}+c^{-(n+2)}\leq 2b^{-(n+2)}\leq b^{-n}$ , where the last inequality follows from $b\geq\sqrt{2}$ .

For (2), suppose $f_{n}\rightarrow f$ fast in $L_{0}(\nu)$ . For $0<\epsilon<1$ one has $\nu(\left|f_{n}\cdot I_{\left|f_{n}\right|\leq K+1}-f\right|>\epsilon)\leq\nu(\left|f_{n}-f\right|>\epsilon)$ , and hence we may assume that $\left|f_{n}\right|\leq K+1$ . Then we apply (1).

For (3), this is just the uniformisation of (2). ∎

Using Proposition 2.5, one has that many of the usual operations on $L_{0}(\nu)$ are computable continuous, such as addition and minimum and maximum. Indeed, each of these three has modulus $m(\epsilon)=\frac{\epsilon}{2}$ . We can use this observation to prove the following. It had been previously established by Rute, although our proof is different. (Footnote 78: [61, Proposition 3.26].)

Proposition 2.27.

If $f$ is a computable point of $L_{0}(\nu)$ (resp. $L_{0}^{+}(\nu)$ ), then $f\#\nu$ is a computable point of $\mathcal{P}(\mathbb{R})$ (resp. $\mathcal{P}(\mathbb{R}^{\geq 0})$ ).

Proof.

Let $f_{n}=\min(\max(f,-n),n)$ , so that $f_{n}=f$ on $f^{-1}(-n,n)$ , and $f_{n}$ is uniformly a computable point of $L_{0}(\nu)$ (resp. $L_{0}^{+}(\nu)$ ). By Proposition 2.26 (3) one has that $f_{n}$ is uniformly a computable point of $L_{1}(\nu)$ (resp. $L_{1}^{+}(\nu)$ ). By Proposition 2.21, one has that $f_{n}\#\nu$ is uniformly a computable point of $\mathcal{P}(\mathbb{R})$ (resp. $\mathcal{P}(\mathbb{R}^{\geq 0})$ ). For rational $p<q$ (resp. rational $0\leq p<q$ ), compute natural number $n>\max(\left|p\right|,\left|q\right|)$ , so that by Proposition 2.7 the real $(f\#\nu)(p,q)=(f_{n}\#\nu)(p,q)$ is uniformly left-c.e. Then by Proposition 2.7, the probability measure $f\#\nu$ is a computable point of $\mathcal{P}(\mathbb{R})$ (resp. $\mathcal{P}(\mathbb{R}^{\geq 0})$ ). ∎

We then define:

Definition 2.28.

An $L_{0}(\nu)$ Schnorr test is a lsc function $f:X\rightarrow[0,\infty]$ which is a computable point of $L_{0}(\nu)$ .

In parallel to Definition 1.1(7), we have the following new characterisation of $\mathsf{SR}^{\nu}$ :

Proposition 2.29.

A point $x$ is in $\mathsf{SR}^{\nu}$ iff $f(x)<\infty$ for all $L_{0}(\nu)$ Schnorr tests $f$ .

Proof.

If $f(x)<\infty$ for all $L_{0}(\nu)$ Schnorr tests $f$ , then by the computable embedding of $L_{1}(\nu)$ into $L_{0}(\nu)$ , we have $f(x)<\infty$ for all $L_{1}(\nu)$ Schnorr tests $f$ , and so $x$ is in $\mathsf{SR}^{\nu}$ . Conversely, suppose that $x$ is in $\mathsf{SR}^{\nu}$ . Let $f$ be an $L_{0}(\nu)$ Schnorr test. Since $f$ is in $L_{0}(\nu)$ , it is finite $\nu$ -a.s. By Lemma 2.22 and the previous proposition, there is a computable sequence of reals $\eta_{n}$ in the open interval $(2^{n},2^{n+1})$ such that $U_{n}:=f^{-1}(\eta_{n},\infty]$ is c.e. open with uniformly $\nu$ -computable measure. So we have $\nu(U_{n})\rightarrow 0$ , and since $\nu(U_{n})$ is computable, we can compute a subsequence with $\nu(U_{n_{i}})<2^{-i}$ . Hence $g=\sum_{i}I_{U_{n_{i}}}$ is an $L_{1}(\nu)$ Schnorr test, and so $g(x)<\infty$ and so $x$ is in only finitely many of the $U_{n_{i}}$ , and hence there is $i$ such that $f(x)\leq\eta_{n_{i}}$ . ∎

Miyabe has shown that $x$ being in every $\Sigma^{0}_{2}$ $\nu$ -measure one class is equivalent to $f(x)<\infty$ for every non-negative lsc $f$ in $L_{0}(\nu)$ . (Footnote 79: [47, Proposition 3.3]. This notion of randomness is also called weak 2-randomness.) In conjunction with the above proposition, it suggests that there is little room for a simple characterisation of $\mathsf{MLR}^{\nu}$ in terms of non-negative lsc functions in $L_{0}(\nu)$ .

Finally, we update our previous approximation theorem for $L_{p}(\nu)$ Schnorr tests to $L_{0}(\nu)$ Schnorr tests:

Proposition 2.30.

For all $L_{0}(\nu)$ Schnorr tests $f:X\rightarrow[0,\infty]$ , one can compute a subsequence $f_{s(n)}$ of the $f_{s}$ from Proposition 2.16 such that $f_{s(n)}\rightarrow f$ fast in $L_{0}(\nu)$ .

Proof.

The $f_{s}$ come from the countable dense set of $L_{0}(\nu)$ and hence are computable points of $L_{0}(\nu)$ . Since $f_{s}\rightarrow f$ everywhere, we have $f_{s}\rightarrow f$ in measure, and so $f_{s}\rightarrow f$ in $L_{0}(\nu)$ . Since $\|f_{s}-f\|_{0}$ is computable, we just search for a subsequence $f_{s(n)}$ with $\|f_{s(n)}-f\|_{0}<2^{-n}$ . ∎

3. Two Schnorr lemmas: Flipping an approximation and Self-location

In this section, we provide two lemmas on Schnorr tests, one involving turning an approximation from below into a non-increasing subsequence converging down to zero, and another based upon a distinctive self-location property of Schnorr randoms.

The first of these involves a partial subtraction operator, which requires some care in order to avoid situations with $\infty-\infty$ . These situations can potentially arise since lsc functions are allowed to take infinite values.

Proposition 3.1.

Suppose that $p\geq 1$ is computable or $p=0$ .

Suppose that $f:X\rightarrow(-\infty,\infty]$ is an lsc function in $L_{p}(\nu)$ (resp. an $L_{p}(\nu)$ Schnorr test). Suppose that $\mathsf{XR}^{\nu}$ is a $\nu$ -measure one set on which the function $f$ is finite.

Suppose that $g:X\rightarrow(-\infty,\infty]$ is an lsc function such that $g\leq f$ on $\mathsf{XR}^{\nu}$ . Suppose that $g$ is paired with a usc function $\breve{g}:X\rightarrow[-\infty,\infty)$ such that $g,\breve{g}$ are equal on $\mathsf{XR}^{\nu}$ .

Define $f\ominus g=\max(0,f-\breve{g})$ . Then

–

$g,\breve{g}$ are finite on $\mathsf{XR}^{\nu}$ .
–

$f\ominus g$ is non-negative lsc and in $L_{p}(\nu)$ (resp. an $L_{p}(\nu)$ Schnorr test) and is equal on $\mathsf{XR}^{\nu}$ to $f-g$ .

Proof.

For the first item, since the lsc function $g$ has codomain $(-\infty,\infty]$ and the usc function $\breve{g}$ has codomain $[-\infty,\infty)$ , then when the two agree they have finite value. And they agree on $\mathsf{XR}^{\nu}$ .

Since $\breve{g}$ is usc, $-\breve{g}$ is lsc. Since the lsc functions are closed under addition (cf. Proposition 2.6), one has that $f-\breve{g}$ is lsc. Since the lsc functions are preserved under max (cf. again Proposition 2.6), we have that $f\ominus g$ is non-negative lsc. Further, since $f,g$ are in $L_{p}(\nu)$ (resp. are $L_{p}(\nu)$ -computable) and this property is preserved under subtraction and maxes, we have that $f\ominus g$ is also in $L_{p}(\nu)$ (resp. $L_{p}(\nu)$ -computable). On $\mathsf{XR}^{\nu}$ , one has that $f-g$ is both equal to $f-\breve{g}$ and is non-negative, and hence equal to $f\ominus g$ . ∎

The partial subtraction operation $f\ominus g$ is not defined absolutely, but only relative to the hypotheses of the previous proposition. However, the situation of the following lemma is the one which tends to be operative in applications. We call it “flipping an approximation” since it takes a non-decreasing approximation $f_{s}\rightarrow f$ and turns it into a non-increasing approximation $f\ominus f_{s}\rightarrow 0$ . While classically trivial, it requires some organisation to handle within effective categories:

Lemma 3.2.

(Flipping an approximation) Suppose that $p\geq 1$ is computable (resp. $p=0$ ).

For each $L_{p}(\nu)$ Schnorr test $f$ , let $g_{s}$ be from the countable dense set of $L_{p}(\nu)$ as in Proposition 2.16 (resp. Proposition 2.30), so that $g_{s}\leq g_{s+1}$ everywhere and $f=\sup_{s}g_{s}$ everywhere and $g_{s}\rightarrow f$ fast in $L_{p}(\nu)$ . Using Proposition 2.23, let $f_{s},\breve{f}_{s}$ be non-negative lsc and usc respectively with $g_{s}=f_{s}=\breve{f}_{s}$ on $\mathsf{KR}^{\nu}$ . Then by the previous proposition, we have:

–

$f_{s},\breve{f}_{s}$ are finite on $\mathsf{KR}^{\nu}$ .
–

$f\ominus f_{s}$ is an $L_{p}(\nu)$ Schnorr test and is equal on $\mathsf{KR}^{\nu}$ to $f-g_{s}$ .
–

$f\ominus f_{s}$ is non-increasing and $f\ominus f_{s}\rightarrow 0$ on $\mathsf{KR}^{\nu}$ .
–

for $t>s$ , similarly $f_{t}\ominus f_{s}$ is an $L_{p}(\nu)$ Schnorr test and is equal on $\mathsf{KR}^{\nu}$ to $f_{t}-g_{s}$ .

Now we turn to self-location. The idea is that given a certain kind of computable “chart” of the computable Polish space, the Schnorr randoms can weakly compute their position on the chart. (For the notion of weak computation, see Definition 1.1(13).)

Lemma 3.3.

(Self-location lemma).

Suppose that $\nu$ is a computable point of $\mathcal{P}(X)$ .

Suppose that $V_{m}$ is a computable sequence of c.e. opens with uniformly computable $\nu$ -measure.

Suppose that $x$ is in $\mathsf{SR}^{\nu}$ . Then $x$ weakly computes the element $\{m:x\in V_{m}\}$ of Cantor space.

Proof.

Let $B_{0},B_{1},\ldots$ be a $\nu$ -computable basis, with associated sequence $C_{0},C_{1},\ldots$ of effectively closed supersets of the same measure. Let $B_{m,t}$ be a computable subsequence such that $V_{m}=\bigcup_{t}B_{m,t}$ . Since $V_{m}$ and the $B_{m,t}$ have uniformly computable $\nu$ -measure, there is a computable function $m\mapsto s(m)$ such that $\nu(V_{m}\setminus U_{m})<2^{-m}$ where $U_{m}=\bigcup_{t\leq s(m)}B_{m,t}$ . Let $D_{m}=\bigcup_{t\leq s(m)}C_{m,t}$ , which is an effectively closed set equal to $U_{m}$ on $\mathsf{KR}^{\nu}$ . Then $f=\sum_{m}I_{V_{m}\setminus D_{m}}$ is an $L_{1}(\nu)$ Schnorr test.

Let $x$ be in $\mathsf{SR}^{\nu}$ . Since $f(x)<\infty$ , there are only finitely many $m$ such that $x$ is in $V_{m}\setminus D_{m}$ . Hence, there are only finitely many $m$ such that $x$ is in $V_{m}\setminus U_{m}$ . Then the sets $\{m:x\in V_{m}\}$ and $\{m:x\in U_{m}\}$ differ by only finitely much and hence are Turing equivalent.

Since $U_{m}$ comes from the algebra generated by the $\nu$ -computable basis, using the $\nu$ -computable basis as in Proposition 2.13 we can compute indexes for c.e. opens $U_{m}^{\prime}$ such that $U_{m}^{\prime}=X\setminus U_{m}$ on $\mathsf{KR}^{\nu}$ . Since both $U_{m}$ and $U_{m}^{\prime}$ are uniformly c.e. open, choose computable sequences $p_{m,i},p_{m,i}^{\prime}$ from the countable dense set and $\epsilon_{m,i},\epsilon_{m,i}^{\prime}$ from $\mathbb{Q}^{>0}$ such that $U_{m}=\bigcup_{i}B(p_{m,i},\epsilon_{m,i})$ and $U_{m}^{\prime}=\bigcup_{i}B(p_{m,i}^{\prime},\epsilon_{m,i}^{\prime})$ , where $B(p,\epsilon)$ denotes again the open ball around $p$ of radius $\epsilon$ . We can enumerate these sets as $U_{m}=\bigcup_{s}U_{m,s}$ and $U_{m}^{\prime}=\bigcup_{s}U_{m,s}^{\prime}$ , where $U_{m,s}=\bigcup_{i\leq s}B(p_{m,i},\epsilon_{m,i})$ and $U_{m,s}^{\prime}=\bigcup_{i\leq s}B(p_{m,i}^{\prime},\epsilon_{m,i}^{\prime})$ .

Consider a sequence from the countable dense set which converges fast to our point $x$ in $\mathsf{SR}^{\nu}$ . Given $m$ , to compute from the sequence whether $x$ is in $U_{m}$ , we simply start enumerating both $U_{m}$ and $U_{m}^{\prime}$ : eventually $x$ gets in one of them (and $x$ only ever gets in one of them), and we use the sequence to determine when this happens, by Proposition 2.1(1). ∎
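The parallel search in the last paragraph of the proof can be sketched in Python as follows. This is only a toy illustration on the unit interval under our own assumptions: the names `fast_approx`, `certified_in_ball` and `decide_membership` are hypothetical, the point $x$ is given by a fast Cauchy name of dyadic rationals, and the guard `max_stage` is added because the search terminates only when $x$ lies in $U_{m}$ or in $U_{m}^{\prime}$ , as it does for $x$ in $\mathsf{KR}^{\nu}$ .

```python
from fractions import Fraction
from math import floor


def fast_approx(x):
    """Fast Cauchy name of a rational x: n -> a dyadic rational within 2**-n of x."""
    return lambda n: Fraction(floor(x * 2 ** n), 2 ** n)


def certified_in_ball(a, n, ball):
    """True if |x - a| <= 2**-n already forces x into the open ball (p, eps)."""
    p, eps = ball
    return abs(a - p) + Fraction(1, 2 ** n) < eps


def decide_membership(approx, balls_U, balls_Uc, max_stage=64):
    """Enumerate U and its complement-on-randoms U' side by side.

    Returns True once x is certified to be in some ball of U, and False once
    it is certified to be in some ball of U'.
    """
    for s in range(max_stage):
        a = approx(s)
        if any(certified_in_ball(a, s, b) for b in balls_U[: s + 1]):
            return True
        if any(certified_in_ball(a, s, b) for b in balls_Uc[: s + 1]):
            return False
    raise RuntimeError("no certificate found within max_stage")


# Toy chart on [0,1]: U = (0, 1/2) and U' = (1/2, 1), each a single ball.
U = [(Fraction(1, 4), Fraction(1, 4))]
Uc = [(Fraction(3, 4), Fraction(1, 4))]
in_U_third = decide_membership(fast_approx(Fraction(1, 3)), U, Uc)
in_U_two_thirds = decide_membership(fast_approx(Fraction(2, 3)), U, Uc)
```

The boundary point $\frac{1}{2}$ lies in neither ball, and on it the search would exhaust `max_stage`; in the lemma this case is excluded because such boundary points are not in $\mathsf{KR}^{\nu}$ .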

Here are some simple applications of self-location, which we use to obtain the information about weak computation in Theorem 1.5(1):

Proposition 3.4.

Suppose that $p\geq 1$ is computable (resp. $p=0$ ).

(1)

Suppose that $f_{m}$ is a sequence of $L_{p}(\nu)$ Schnorr tests such that $f_{m}$ is non-increasing on $\mathsf{SR}^{\nu}$ and such that $f_{m}\rightarrow 0$ on $\mathsf{SR}^{\nu}$ . Every $x$ in $\mathsf{SR}^{\nu}$ weakly computes a modulus of convergence for $f_{m}(x)\rightarrow 0$ .
(2)

Suppose that $f$ is an $L_{p}(\nu)$ Schnorr test. Suppose that $f_{s}$ is the approximation as in Proposition 2.16 (resp. Proposition 2.30). Every $x$ in $\mathsf{SR}^{\nu}$ weakly computes a modulus of convergence for $f_{s}(x)\rightarrow f(x)$ .

Proof.

For (1), using Lemma 2.22 (resp. in conjunction with Proposition 2.27), choose a computable sequence of reals $r_{i}$ decreasing to zero such that the c.e. open $V_{m,i}=f_{m}^{-1}(r_{i},\infty]$ has $\nu$ -computable measure, uniformly. Consider a sequence from the countable dense set which converges fast to $x$ . By the Self-location lemma, we can Turing compute from it the “chart” set $C=\{(m,i):x\in V_{m,i}\}$ . Let $\epsilon>0$ be rational. We show how to compute from $C$ a natural number $m(\epsilon)$ such that $f_{n}(x)<\epsilon$ for all $n\geq m(\epsilon)$ . By hypothesis, $f_{n}(x)$ decreases down to zero. Hence to compute $m(\epsilon)$ from $C$ we just search for $r_{i}<\epsilon$ and then search for $m$ with $x\notin V_{m,i}$ .

For (2), just use Lemma 3.2 to rewrite the convergence $f_{s}\rightarrow f$ as $(f\ominus g_{s})\rightarrow 0$ on $\mathsf{KR}^{\nu}$ , where $g_{s}$ is an $L_{p}(\nu)$ Schnorr test equal to $f_{s}$ on $\mathsf{KR}^{\nu}$ , and then use (1). ∎
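The search described in the proof of (1) can be written out as a short Python sketch. Everything named here is an illustrative assumption: `r` stands for the computable sequence $r_{i}$ , `in_chart(m, i)` stands for an oracle for the chart set $C$ , and the toy values $f_{m}(x)=\frac{1}{m+1}$ and $r_{i}=2^{-i}$ are chosen only so that the example runs.

```python
from fractions import Fraction


def modulus_from_chart(eps, r, in_chart, search_limit=10_000):
    """Find m such that f_n(x) < eps for all n >= m, given the chart oracle.

    First search for i with r(i) < eps, then for m with x not in V_{m,i},
    that is, with f_m(x) <= r(i). Monotonicity of f_n(x) in n does the rest.
    """
    i = next(i for i in range(search_limit) if r(i) < eps)
    return next(m for m in range(search_limit) if not in_chart(m, i))


# Toy data (hypothetical): values f_m(x) at one fixed point x, and thresholds r_i.
def value(m):
    return Fraction(1, m + 1)


def r(i):
    return Fraction(1, 2 ** i)


def in_chart(m, i):
    # (m, i) is in C iff x is in V_{m,i} = f_m^{-1}(r_i, infinity].
    return value(m) > r(i)


m_quarter = modulus_from_chart(Fraction(1, 4), r, in_chart)
```

Here the first threshold below $\frac{1}{4}$ is $r_{3}=\frac{1}{8}$ , and the first $m$ with $f_{m}(x)\leq\frac{1}{8}$ is $m=7$ , so `m_quarter` is $7$ .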

4. Recovering pointwise values on Schnorr randoms

In this section we prove some results about pointwise limits existing on the Schnorr randoms for various effective functions converging fast in $L_{p}(\nu)$ . By way of motivation for these kinds of results, consider $X=[0,1]$ and let $\nu$ be Lebesgue measure and recall the canonical example of $L_{1}(\nu)$ convergence with $\nu$ -a.s. lack of pointwise convergence:

$$f_{1}=I_{[0,\frac{1}{2})},\ f_{2}=I_{[\frac{1}{2},1]},\ f_{3}=I_{[0,\frac{1}{4})},\ f_{4}=I_{[\frac{1}{4},\frac{1}{2})},\ f_{5}=I_{[\frac{1}{2},\frac{3}{4})},\ f_{6}=I_{[\frac{3}{4},1]},\ \ldots$$

Proposition 4.1 below says that the slow $L_{1}(\nu)$ -convergence in this example is essential to the lack of pointwise limits on $\mathsf{SR}^{\nu}$ . By modifying the events in $f_{n}$ to be open, one similarly gets a sequence of $L_{1}(\nu)$ Schnorr tests $g_{n}$ which lacks pointwise limits on all $\mathsf{KR}^{\nu}$ . Proposition 4.3 likewise says that the slow $L_{1}(\nu)$ convergence of $g_{n}$ is essential to the $\nu$ -a.s. lack of pointwise limits on $\mathsf{SR}^{\nu}$ .
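The behaviour of this canonical example can be checked directly. The following Python sketch uses an indexing of our own devising (the helper `level_and_offset` is hypothetical): the $n$ -th function is the indicator of the $j$ -th dyadic interval of length $2^{-k}$ , where $k=\lfloor\log_{2}(n+1)\rfloor$ and $j=n-(2^{k}-1)$ .

```python
from fractions import Fraction


def level_and_offset(n):
    """Map n >= 1 to (k, j): f_n is the indicator of the j-th interval of length 2**-k."""
    k = (n + 1).bit_length() - 1
    j = n - (2 ** k - 1)
    return k, j


def f(n, x):
    """Value of f_n at x in [0,1]; the last interval of each level is closed on the right."""
    k, j = level_and_offset(n)
    lo, hi = Fraction(j, 2 ** k), Fraction(j + 1, 2 ** k)
    return 1 if (lo <= x < hi or (hi == 1 and x == 1)) else 0


def l1_norm(n):
    """Lebesgue measure of the interval defining f_n, which is its L_1 norm."""
    k, _ = level_and_offset(n)
    return Fraction(1, 2 ** k)


x = Fraction(1, 3)
hits_per_level = [sum(f(n, x) for n in range(2 ** k - 1, 2 ** (k + 1) - 1))
                  for k in range(1, 7)]
```

Each entry of `hits_per_level` is $1$ : on every level exactly one $f_{n}$ takes the value $1$ at $x$ and the others take the value $0$ , so $f_{n}(x)$ does not converge, even though `l1_norm(n)` is $2^{-k}$ and hence tends to zero. Since $2^{-k}>\frac{1}{2(n+1)}$ , the $L_{1}(\nu)$ convergence is not at a geometric rate in $n$ .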

In the setting of $p=1$ and $X=[0,1]^{k}$ and $\nu$ being the $k$ -fold product of Lebesgue measure on $[0,1]$ , the following result is due to Pathak, Rojas, and Simpson, who used sequential Schnorr tests. (Footnote 80: [52, Lemma 3.7]. See also [61, §3.3] and [76, Chapter 3] (cf. [70, p. 394]).)

Proposition 4.1.

Suppose that $p\geq 1$ is computable. Suppose that $f$ is a computable point of $L_{p}(\nu)$ . Suppose that $f_{n}$ is a computable sequence from the countable dense set of $L_{p}(\nu)$ such that $f_{n}\rightarrow f$ fast in $L_{p}(\nu)$ . Then $\lim_{n}f_{n}$ exists on $\mathsf{SR}^{\nu}$ and is a version of $f$ .

Moreover, on $\mathsf{SR}^{\nu}$ this limit does not depend on the choice of $f_{n}$ or the choice of the $\nu$ -computable basis.

Finally, if $f$ is in addition an $L_{p}(\nu)$ Schnorr test, then $\lim_{n}f_{n}(x)=f(x)$ for all $x$ in $\mathsf{SR}^{\nu}$ .

Proof.

(Sketch) Let $g=\sum_{i}\left|f_{i}-f_{i+1}\right|$ . Then using Proposition 2.23, it is equal on $\mathsf{KR}^{\nu}$ to an $L_{p}(\nu)$ Schnorr test. This shows that $f_{i}(x)$ is a Cauchy sequence for $x$ in $\mathsf{SR}^{\nu}$ .

If $f_{n}^{\prime}$ is another such witness to the $L_{p}(\nu)$ computability of $f$ , then let $h=\sum_{i}\left|f_{i}-f_{i}^{\prime}\right|$ , and it is similarly equal on $\mathsf{KR}^{\nu}$ to an $L_{p}(\nu)$ Schnorr test.

To see that the partially defined function $\lim_{n}f_{n}$ is a version of $f$ in $L_{p}(\nu)$ , simply note that classically some subsequence $f_{n}^{\prime}:=f_{m(n)}$ converges to $f$ $\nu$ -a.s. and so $\lim_{n}f_{n}^{\prime}$ is a version of $f$ . The sequence $f_{n}^{\prime}$ is computable in some oracle, and so by the previous paragraph we get that $\lim_{n}f_{n},\lim_{n}f_{n}^{\prime}$ agree on all the Schnorr randoms relative to that oracle, and so $\lim_{n}f_{n}$ is also a version of $f$ .

The final remark follows from the second paragraph by choosing $f_{n}^{\prime}$ to be the approximation to the lsc function $f$ from Proposition 2.16. ∎

The following is an analogue of the above proposition for $L_{0}(\nu)$ . This proposition is essentially the natural effectivization of the classical proof that Cauchy-in-measure sequences converge in measure. (Footnote 81: E.g. [21, Theorem 2.30 p. 61].)

Proposition 4.2.

Suppose that $f$ is a computable point of $L_{0}(\nu)$ . Suppose that $f_{n}$ is a computable sequence from the countable dense set of $L_{0}(\nu)$ such that $f_{n}\rightarrow f$ at a geometric rate of convergence in $L_{0}(\nu)$ . Then for all $x$ in $\mathsf{SR}^{\nu}$ one has that $\lim_{n}f_{n}(x)$ exists and is a version of $f$ .

This limit does not depend on the choice of $f_{n}$ or the choice of the $\nu$ -computable basis or the choice of the rate of geometric convergence.

Hence, if $f$ is in addition an $L_{0}(\nu)$ Schnorr test, then $\lim_{n}f_{n}(x)=f(x)$ for all $x$ in $\mathsf{SR}^{\nu}$ .

Note that by the computable embedding of $L_{p}(\nu)$ into $L_{0}(\nu)$ , the limit in this proposition agrees with the limit in the previous proposition on $\mathsf{SR}^{\nu}$ .

Proof.

We may suppose that the geometric rate of convergence $b>1$ is rational. Then we can compute whether $b^{-j}$ is rational or irrational, and hence uniformly in $j\geq 0$ we have that $b^{-j}$ has uniformly computable left and right Dedekind cuts. Since the $f_{j}$ are from the countable dense set, so is $\left|f_{j}-f_{j+1}\right|$ , and hence we can write it as $\sum_{k=1}^{n_{j}}q_{j,k}\cdot I_{A_{j,k}}$ , where $q_{j,k}\geq 0$ is rational and the events $\{A_{j,k}:1\leq k\leq n_{j}\}$ are pairwise disjoint and come from the algebra generated by a $\nu$ -computable basis. By Proposition 2.13, this is equal on $\mathsf{KR}^{\nu}$ to the finite sum $\sum_{k=1}^{n_{j}}q_{j,k}\cdot I_{U_{j,k}}$ , where $U_{j,k}$ is a c.e. open which is equal on $\mathsf{KR}^{\nu}$ to $A_{j,k}$ . Let $E_{j}=\{x\in X:\sum_{k=1}^{n_{j}}q_{j,k}\cdot I_{U_{j,k}}>2\cdot b^{-j}\}$ , which is a c.e. open since it is equal to $\bigcup_{K\in J_{j}}\bigcap_{k\in K}U_{j,k}$ , where $J_{j}=\{K\subseteq[1,n_{j}]:\sum_{k\in K}q_{j,k}>2\cdot b^{-j}\}$ , and $J_{j}$ is computable since $b^{-j}$ has uniformly computable right Dedekind cuts. Then $E_{j}$ is equal on $\mathsf{KR}^{\nu}$ to $\{x\in X:\left|f_{j}-f_{j+1}\right|>2\cdot b^{-j}\}$ and $E_{j}$ is a c.e. open with computable $\nu$ -measure, uniformly in $j\geq 0$ .

We then have $\nu(E_{j})\leq\nu(\left|f-f_{j}\right|>b^{-j})+\nu(\left|f-f_{j+1}\right|>b^{-(j+1)})\leq b^{-j}+b^{-(j+1)}\leq 2\cdot b^{-j}$ . Letting $F_{k}$ be the c.e. open $\bigcup_{j=k}^{\infty}E_{j}$ , we have $\nu(F_{k})\leq 2\cdot\frac{b}{b-1}\cdot b^{-k}$ . Further for $k^{\prime}>k$ we have $\bigcup_{j=k}^{k^{\prime}}E_{j}$ has computable $\nu$ -measure since it is a finite union of events with $\nu$ -computable measure coming from the algebra generated by a $\nu$ -computable basis. And then $\nu(F_{k})$ is computable since we can approximate it by $\nu(\bigcup_{j=k}^{k^{\prime}}E_{j})$ since $\nu(F_{k})-\nu(\bigcup_{j=k}^{k^{\prime}}E_{j})\leq\nu(F_{k^{\prime}})\leq 2\cdot\frac{b}{b-1}\cdot b^{-k^{\prime}}$ . Hence $\sum_{k}I_{F_{k}}$ is an $L_{1}(\nu)$ Schnorr test.
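The geometric tail bound used in this paragraph, and the way it yields a stage $k^{\prime}$ at which the finite unions approximate $\nu(F_{k})$ to a prescribed precision, can be checked in exact rational arithmetic. The sketch below is illustrative only; the function names are our own, and $b$ is assumed to be a rational greater than $1$ .

```python
from fractions import Fraction


def tail_bound(b, k):
    """The closed form 2 * b/(b-1) * b**-k for the sum over j >= k of 2 * b**-j."""
    return 2 * b / (b - 1) * b ** (-k)


def partial_sum(b, k, k_prime):
    """The finite sum of 2 * b**-j for k <= j <= k_prime."""
    return sum(2 * b ** (-j) for j in range(k, k_prime + 1))


def stage_for_precision(b, n):
    """Least k' with tail_bound(b, k') < 2**-n."""
    k_prime = 0
    while tail_bound(b, k_prime) >= Fraction(1, 2 ** n):
        k_prime += 1
    return k_prime


b = Fraction(3, 2)
gap = tail_bound(b, 2) - partial_sum(b, 2, 10)   # equals tail_bound(b, 11)
k_prime = stage_for_precision(b, 3)
```

With $b=\frac{3}{2}$ the difference between the closed form at $k=2$ and the partial sum up to $j=10$ is exactly the closed form at $k=11$ , mirroring the estimate of $\nu(F_{k})-\nu(\bigcup_{j=k}^{k^{\prime}}E_{j})$ by a tail term.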

If a point is in $\mathsf{SR}^{\nu}$ , then it is not in some $F_{k}$ , while it is in $\mathsf{KR}^{\nu}$ . Then we argue for the following six items about elements of $\mathsf{KR}^{\nu}\setminus F_{k}$ :

(1)

For all $k\geq 0$ and all $x$ in $\mathsf{KR}^{\nu}\setminus F_{k}$ , for all $j_{1}>j_{0}\geq k$ we have $\left|f_{j_{0}}(x)-f_{j_{1}}(x)\right|\leq\sum_{i=j_{0}}^{j_{1}-1}\left|f_{i}(x)-f_{i+1}(x)\right|\leq\sum_{i=j_{0}}^{\infty}2\cdot b^{-i}\leq 2\cdot\frac{b}{b-1}\cdot b^{-j_{0}}$ .
(2)

Hence for all $k\geq 0$ and all $x$ in $\mathsf{KR}^{\nu}\setminus F_{k}$ , we have that $f_{j}(x)$ for $j\geq k$ is a Cauchy sequence and thus $\lim_{j}f_{j}(x)$ exists.
(3)

For all $x$ in $\mathsf{KR}^{\nu}\setminus F_{k}$ and all $j\geq k\geq 0$ , we have $\left|f_{j}(x)-\lim_{j}f_{j}(x)\right|\leq 2\cdot\frac{b}{b-1}\cdot b^{-j}$ . For, let $\epsilon>0$ . Let $j_{0}=j$ and choose $j_{1}>j_{0}$ such that $\left|f_{j_{1}}(x)-\lim_{j}f_{j}(x)\right|<\epsilon$ . Then by (1) one has that $\left|f_{j}(x)-\lim_{j}f_{j}(x)\right|\leq\left|f_{j_{1}}(x)-\lim_{j}f_{j}(x)\right|+\left|f_{j_{1}}(x)-f_{j_{0}}(x)\right|<\epsilon+2\cdot\frac{b}{b-1}\cdot b^{-j_{0}}$ . Since this holds for all $\epsilon>0$ , we are done.
(4)

Since $\mathsf{SR}^{\nu}$ is a $\nu$ -measure one set, one has that $\lim_{j}f_{j}$ exists $\nu$ -a.s.
(5)

Further one has that $f_{j}\rightarrow\lim_{j}f_{j}$ in $L_{0}(\nu)$ . For let $\epsilon>0$ . Choose $k$ such that $2\cdot\frac{b}{b-1}\cdot b^{-k}<\epsilon$ . Let $j\geq k$ , so that $2\cdot\frac{b}{b-1}\cdot b^{-j}<\epsilon$ . Then by (3) we have $\nu(\left|f_{j}-\lim_{j}f_{j}\right|>\epsilon)\leq\nu(\left|f_{j}-\lim_{j}f_{j}\right|>2\cdot\frac{b}{b-1}\cdot b^{-j})\leq\nu(F_{k})\leq 2\cdot\frac{b}{b-1}\cdot b^{-k}<\epsilon$ .
(6)

Since both $f_{j}\rightarrow\lim_{j}f_{j}$ in $L_{0}(\nu)$ and $f_{j}\rightarrow f$ in $L_{0}(\nu)$ , we have that $\lim_{j}f_{j}=f$ $\nu$ -a.s.

Suppose that $h_{j}$ is another computable sequence from the countable dense set of $L_{0}(\nu)$ such that $h_{j}\rightarrow f$ at a geometric rate $c>1$ of convergence in $L_{0}(\nu)$ . Note that $\lim_{j}f_{j}$ and $\lim_{j}h_{j}$ are equal $\nu$ -a.s. since they are both equal $\nu$ -a.s. to $f$ . Let $G_{j}$ and $H_{k}$ be constructed from $h_{j}$ and $c$ just as we constructed $E_{j}$ and $F_{k}$ from $f_{j}$ and $b$ above. Let $d=\min(b,c)$ , a rational number $>1$ . Let $e=\max\{2\cdot\frac{b}{b-1},2\cdot\frac{c}{c-1}\}$ . Note that for all $j\geq 0$ , we have $e\cdot d^{-j}\geq 2\cdot\frac{b}{b-1}\cdot b^{-j}$ and $e\cdot d^{-j}\geq 2\cdot\frac{c}{c-1}\cdot c^{-j}$ . Let $D_{j}$ be a c.e. open which is equal on $\mathsf{KR}^{\nu}$ to $\{x\in X:\left|f_{j}(x)-h_{j}(x)\right|>3\cdot e\cdot d^{-j}\}$ , and note that $D_{j}$ has computable $\nu$ -measure, as in the argument of the first paragraph of the proof. Then one has that $\nu(D_{j})\leq\nu(\left|f_{j}-\lim_{j}f_{j}\right|>2\cdot\frac{b}{b-1}\cdot b^{-j})+\nu(\left|\lim_{j}f_{j}-\lim_{j}h_{j}\right|>e\cdot d^{-j})+\nu(\left|h_{j}-\lim_{j}h_{j}\right|>2\cdot\frac{c}{c-1}\cdot c^{-j})\leq\nu(F_{j})+0+\nu(H_{j})\leq 2\cdot\frac{b}{b-1}\cdot b^{-j}+2\cdot\frac{c}{c-1}\cdot c^{-j}$ , where the middle term is zero since $\lim_{j}f_{j}$ and $\lim_{j}h_{j}$ are equal $\nu$ -a.s. Hence $\sum_{j}I_{D_{j}}$ is an $L_{1}(\nu)$ integral test, and thus, for $x$ in $\mathsf{SR}^{\nu}$ one has that $\lim_{j}f_{j}(x)=\lim_{j}h_{j}(x)$ .
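
The first inequality in the bound on $\nu(D_{j})$ rests on a pointwise observation, sketched here for convenience. At any point where both limits exist, the triangle inequality gives

$\displaystyle\left|f_{j}-h_{j}\right|\leq\left|f_{j}-\lim_{j}f_{j}\right|+\left|\lim_{j}f_{j}-\lim_{j}h_{j}\right|+\left|\lim_{j}h_{j}-h_{j}\right|$

so if the left-hand side exceeds $3\cdot e\cdot d^{-j}$ then at least one of the three summands exceeds $e\cdot d^{-j}$ . Since $e\cdot d^{-j}$ dominates both $2\cdot\frac{b}{b-1}\cdot b^{-j}$ and $2\cdot\frac{c}{c-1}\cdot c^{-j}$ , the point then lies in one of the three events whose measures are summed.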

As in the previous proof, the limit does not depend on the choice of $\nu$ -computable basis since $\nu$ -computable bases are closed under effective unions (cf. Proposition 2.8).

The remarks about $\lim_{n}f_{n}$ being a version of $f$ , and the remark about $L_{0}(\nu)$ Schnorr tests, follow as in the proof of the previous proposition. ∎

There is a result similar to Proposition 4.1 when the $f_{n}$ are themselves $L_{p}(\nu)$ Schnorr tests:

Proposition 4.3.

Suppose that $p\geq 1$ is computable (resp. $p=0$ ). Suppose that $f_{n}$ are uniformly $L_{p}(\nu)$ Schnorr tests with $f_{n}\rightarrow f$ fast in $L_{p}(\nu)$ , so that $f$ is also a computable point of $L_{p}(\nu)$ . Then $\lim_{n}f_{n}(x)$ exists for all $x$ in $\mathsf{SR}^{\nu}$ . If $f$ is also an $L_{p}(\nu)$ Schnorr test, then $\lim_{n}f_{n}(x)=f(x)$ for all $x$ in $\mathsf{SR}^{\nu}$ .

Proof.

By Proposition 2.16 (resp. Proposition 2.30), choose a doubly-indexed computable sequence $f_{n,s}$ from the countable dense set of $L_{p}(\nu)$ such that for all $n\geq 0$ we have $0\leq f_{n,s}\leq f_{n,s+1}\leq f_{n}$ everywhere and $f_{n}=\sup_{s}f_{n,s}$ and $f_{n,s}\rightarrow f_{n}$ fast in $L_{p}(\nu)$ . Then $f_{n+1,n+1}\rightarrow f$ fast in $L_{p}(\nu)$ . Hence, by Proposition 4.1 (resp. Proposition 4.2), $\lim_{n}f_{n,n}$ exists on $\mathsf{SR}^{\nu}$ . Note that by these propositions, if $f$ is also an $L_{p}(\nu)$ Schnorr test then $\lim_{n}f_{n,n}=f$ on $\mathsf{SR}^{\nu}$ .

It suffices to show that $\lim_{n}(f_{n}-f_{n,n})=0$ on $\mathsf{SR}^{\nu}$ . Use Lemma 3.2 to rewrite what we are to show as $\lim_{n}(f_{n}\ominus g_{n,n})=0$ , where $g_{n,n}$ is an $L_{p}(\nu)$ Schnorr test equal to $f_{n,n}$ on $\mathsf{KR}^{\nu}$ , so that $f_{n}\ominus g_{n,n}$ is an $L_{p}(\nu)$ Schnorr test equal to $f_{n}-f_{n,n}$ on $\mathsf{KR}^{\nu}$ . By Lemma 2.22 in conjunction with Proposition 2.21 (resp. Proposition 2.27), choose a computable sequence $\eta_{n}$ in the interval $(2^{-(n+1)},2^{-n})$ such that $(f_{n}\ominus g_{n,n})^{-1}(\eta_{n},\infty]$ has computable $\nu$ -measure. Then $U_{n}=(f_{n}\ominus g_{n,n})^{-1}(\eta_{n},\infty]$ is c.e. open with $\nu$ -computable measure. Let $h_{n}=(f_{n}\ominus g_{n,n})\cdot I_{U_{n}}$ which is an $L_{p}(\nu)$ Schnorr test. Let $h=\sum_{n}h_{n}$ and let $h^{(m)}=\sum_{n\leq m}h_{n}$ denote the partial sums. Then $\|h-h^{(m)}\|_{p}\leq\sum_{n>m}\|h_{n}\|_{p}\leq\sum_{n>m}\|f_{n}-f_{n,n}\|_{p}\leq\sum_{n>m}2^{-n}\leq 2^{-m}$ . Then $h$ is an $L_{p}(\nu)$ Schnorr test: it is non-negative lsc as a sum of non-negative lsc functions, and the sequence $h^{(m)}$ is uniformly $L_{p}(\nu)$ -computable and we just showed that $h^{(m)}\rightarrow h$ fast in $L_{p}(\nu)$ . Now we verify $\lim_{n}(f_{n}-f_{n,n})=0$ on $\mathsf{SR}^{\nu}$ . Let $x$ in $\mathsf{SR}^{\nu}$ . Let $\epsilon>0$ . Since $x$ is in $\mathsf{SR}^{\nu}$ , choose $n_{0}\geq 0$ such that we have the estimate $\sum_{n\geq n_{0}}(f_{n}(x)-f_{n,n}(x))\cdot I_{U_{n}}(x)<\epsilon$ . Choose $n_{1}\geq n_{0}$ such that $\eta_{n}<\epsilon$ for all $n\geq n_{1}$ . Let $n\geq n_{1}$ . If $x$ is in $U_{n}$ , then by our estimate we have $f_{n}(x)-f_{n,n}(x)<\epsilon$ . If $x$ is not in $U_{n}$ , then by the definition of $U_{n}$ we have $f_{n}(x)-f_{n,n}(x)\leq\eta_{n}<\epsilon$ .

∎

5. Classical features of the maximal function

Suppose that $\nu$ is a point of $\mathcal{P}(X)$ and $\mathscr{F}_{n}$ is any increasing filtration of Borel subsets of $X$ . In this section, we recall some classical features of the maximal function $f^{\ast}=\sup_{n}\mathbb{E}_{\nu}[f\mid\mathscr{F}_{n}]$ of an integrable function $f$ .

First, we recall the following, which gives us information about the codomain of the maximal function:

Lemma 5.1.

(1)

If $p>1$ then $\|g^{\ast}\|_{p}\leq\frac{p}{p-1}\cdot\|g\|_{p}$ for $g$ in $L_{p}(\nu)$ , and the maximal function maps $L_{p}(\nu)$ to $L_{p}(\nu)$ .
(2)

If $p=1$ , then the maximal function maps $L_{p}(\nu)$ to $L_{0}(\nu)$ .

Proof.

For $p>1$ and $g$ in $L_{p}(\nu)$ , the sequence $\mathbb{E}_{\nu}[g\mid\mathscr{F}_{n}]$ is a non-negative martingale, and so by Doob’s Maximal Inequality (see [25, Theorem 9.4 pp. 505-506]) followed by conditional Jensen we have: $\|\sup_{n\leq m}\mathbb{E}_{\nu}[g\mid\mathscr{F}_{n}]\|_{p}\leq\frac{p}{p-1}\cdot\|\mathbb{E}_{\nu}[g\mid\mathscr{F}_{m}]\|_{p}\leq\frac{p}{p-1}\cdot\|g\|_{p}$ . Then by the Monotone Convergence Theorem we have $\|g^{\ast}\|_{p}\leq\frac{p}{p-1}\|g\|_{p}$ . For $p=1$ , let $\mathscr{F}_{\infty}$ be the $\sigma$ -algebra generated by all the $\mathscr{F}_{n}$ . By the classical Lévy Upward Theorem we have that $\mathbb{E}_{\nu}[g\mid\mathscr{F}_{n}]\rightarrow\mathbb{E}_{\nu}[g\mid\mathscr{F}_{\infty}]$ $\nu$ -a.s. which shows that $g^{\ast}$ is finite $\nu$ -a.s., and hence that $g^{\ast}$ is in $L_{0}(\nu)$ . ∎

The following proposition collects together all the other classical facts about the maximal function which we need:

Proposition 5.2.

(1)

For $p>1$ , the maximal function ${\cdot}^{\ast}:L_{p}(\nu)\rightarrow L_{p}(\nu)$ is uniformly continuous with modulus $m(\epsilon)=\frac{p-1}{p}\cdot\epsilon$ of uniform continuity.
(2)

For $p=1$ , the maximal function ${\cdot}^{\ast}:L_{p}(\nu)\rightarrow L_{0}(\nu)$ is uniformly continuous with a modulus $m(\epsilon)=\epsilon^{2}$ of uniform continuity.

Proof.

Let $p>1$ and $f$ in $L_{p}(\nu)$ . By Lemma 5.1 one has for $f,g$ in $L_{p}(\nu)$ with $\|f-g\|_{p}<\frac{p-1}{p}\cdot\epsilon$ that $\|f^{\ast}-g^{\ast}\|_{p}\leq\|\left|f-g\right|^{\ast}\|_{p}\leq\frac{p}{p-1}\cdot\|f-g\|_{p}<\epsilon$ .
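
The first inequality in this chain uses the sublinearity of the maximal function. As a sketch of the standard argument, assuming the conditional expectations involved are finite $\nu$ -a.s. (as they are for integrable $f,g$ ): since $f\leq g+\left|f-g\right|$ , monotonicity and linearity of conditional expectation give for each $n$ that $\nu$ -a.s.

$\displaystyle\mathbb{E}_{\nu}[f\mid\mathscr{F}_{n}]\leq\mathbb{E}_{\nu}[g\mid\mathscr{F}_{n}]+\mathbb{E}_{\nu}[\left|f-g\right|\mid\mathscr{F}_{n}]\leq g^{\ast}+\left|f-g\right|^{\ast}$

Taking the supremum over $n$ yields $f^{\ast}\leq g^{\ast}+\left|f-g\right|^{\ast}$ , and by symmetry $g^{\ast}\leq f^{\ast}+\left|f-g\right|^{\ast}$ . Hence $\left|f^{\ast}-g^{\ast}\right|\leq\left|f-g\right|^{\ast}$ $\nu$ -a.s., wherever $f^{\ast},g^{\ast}$ are finite. The same pointwise bound is what is used in the case $p=1$ .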

Suppose that $f,g$ are in $L_{1}(\nu)$ with $\|f-g\|_{1}<\epsilon^{2}$ . Then one has

$\displaystyle\nu(\left|f^{\ast}-g^{\ast}\right|>\epsilon)\leq\nu(\left|f-g\right|^{\ast}>\epsilon)=\lim_{m}\nu(\sup_{n\leq m}\mathbb{E}[\left|f-g\right|\mid\mathscr{F}_{n}]>\epsilon)$
$\displaystyle\leq\lim_{m}\epsilon^{-1}\cdot\mathbb{E}_{\nu}\mathbb{E}_{\nu}[\left|f-g\right|\mid\mathscr{F}_{m}]\leq\epsilon^{-1}\cdot\mathbb{E}_{\nu}\left|f-g\right|\leq\epsilon^{-1}\epsilon^{2}=\epsilon$

The first step of the second line follows from Doob’s Submartingale Inequality (see [75, 137-138]), where we apply it to the martingale $\mathbb{E}[\left|f-g\right|\mid\mathscr{F}_{n}]$ . ∎

6. An abstract version of Lévy’s Theorem for Schnorr randomness

Before we state our abstract version of Lévy’s Theorem, we need the following definition:

Definition 6.1.

Suppose that $\mathsf{XR}^{\nu}$ has $\nu$ -measure one.

Suppose that $\mathcal{C}$ is a class of $L_{p}(\nu)$ Schnorr tests.

Then the $L_{p}(\nu)$ Schnorr tests are approximated from below on $\mathsf{XR}^{\nu}$ by $\mathcal{C}$ if from an index for an $L_{p}(\nu)$ Schnorr test $f$ one can compute

–

an index for a sequence of $L_{p}(\nu)$ Schnorr tests $f_{s}$ in $\mathcal{C}$ such that $f_{s}\leq f_{s+1}$ on $\mathsf{XR}^{\nu}$ and $f=\sup_{s}f_{s}$ on $\mathsf{XR}^{\nu}$ and $f_{s}\rightarrow f$ fast in $L_{p}(\nu)$ ; and
–

an index for a sequence of non-negative usc functions $\breve{f}_{s}$ equal to $f_{s}$ on $\mathsf{XR}^{\nu}$ .

Recall from Proposition 3.1 that the $f_{s},\breve{f}_{s}$ are finite on $\mathsf{XR}^{\nu}$ , and we define $f\ominus f_{s}=\max(0,f-\breve{f}_{s})$ , and we have that $f\ominus f_{s}$ is an $L_{p}(\nu)$ Schnorr test equal on $\mathsf{XR}^{\nu}$ to $f-f_{s}$ . Similarly if $t\geq s$ , then we define $f_{t}\ominus f_{s}=\max(0,f_{t}-\breve{f}_{s})$ , and we have that $f_{t}\ominus f_{s}$ is an $L_{p}(\nu)$ Schnorr test equal on $\mathsf{XR}^{\nu}$ to $f_{t}-f_{s}$ .

The basic example of Definition 6.1 comes from Lemma 3.2.

The following is our abstract version of Lévy’s Theorem for Schnorr randomness. It is an abstract version in that we are not told more about the function $E[\cdot\mid n](\cdot)$ other than what is stated explicitly in the hypotheses (I)-(IV). In particular, we do not assume that $E[\cdot\mid n](\cdot)$ comes from an effective disintegration, although in the next section we will show that effective disintegrations satisfy the hypotheses of the theorem.

Theorem 6.2.

Suppose that $\nu$ is a computable point of $\mathcal{P}(X)$ . Suppose that $\mathscr{F}_{n}$ is an increasing filtration of Borel sets. Suppose that $p\geq 1$ is computable.

Suppose that $E[\cdot\mid n](\cdot):\mathbb{L}_{p}^{+}(\nu)\times X\rightarrow[0,\infty]$ is a function such that for every $f$ in $\mathbb{L}_{p}^{+}(\nu)$ , one has that $E[f\mid n]:X\rightarrow[0,\infty]$ is a version of the conditional expectation of $f$ with respect to $\mathscr{F}_{n}$ . Define the function $\cdot^{\flat}(\cdot):\mathbb{L}_{p}^{+}(\nu)\times X\rightarrow[0,\infty]$ by $f^{\flat}(x)=\sup_{n}E[f\mid n](x)$ .

Suppose that $\mathsf{XR}^{\nu}$ is a superset of $\mathsf{SR}^{\nu}$ .

Suppose that:

(I)

$E[\cdot\mid n]$ maps non-negative lsc functions to non-negative lsc functions.
(II)
$E[\cdot\mid n]$ satisfies the following properties on $\mathsf{XR}^{\nu}$ :
(a)

If $f\leq g$ on $\mathsf{XR}^{\nu}$ , then $E[f\mid n]\leq E[g\mid n]$ on $\mathsf{XR}^{\nu}$ ;
(b)

If $c$ in $\mathbb{R}^{\geq 0}$ then $c\cdot E[f\mid n]=E[c\cdot f\mid n]$ on $\mathsf{XR}^{\nu}$ ;
(c)

$E[f+g\mid n]=E[f\mid n]+E[g\mid n]$ on $\mathsf{XR}^{\nu}$ ;
(III)

Both $E[\cdot\mid n]$ and $\cdot^{\flat}$ send the countable dense set of $L_{p}^{+}(\nu)$ uniformly to computable points of $L_{p}^{+}(\nu)$ .
(IV)

The $L_{p}(\nu)$ Schnorr tests are approximated from below on $\mathsf{XR}^{\nu}$ by a class $\mathcal{C}$ of $L_{p}(\nu)$ Schnorr tests such that $\lim_{n}E[f\mid n]=f$ on $\mathsf{XR}^{\nu}$ for each $f$ in $\mathcal{C}$ .

Then the following three items are equivalent for $x$ in $X$ :

(1)

$x$ is in $\mathsf{SR}^{\nu}$ .
(2)

$x$ is in $\mathsf{XR}^{\nu}$ and $\lim_{n}E[f\mid n](x)=f(x)$ for every $L_{p}(\nu)$ Schnorr test $f$ .
(3)

$x$ is in $\mathsf{XR}^{\nu}$ and $\lim_{n}E[f\mid n](x)$ exists for every $L_{p}(\nu)$ Schnorr test $f$ and $\lim_{n}E[I_{U}\mid n](x)=I_{U}(x)$ for every c.e. open $U$ with $\nu$ -computable measure.

We also have:

(i)

Every Borel set $B$ is equal on $\mathsf{SR}^{\nu}$ to a Borel set $B^{\prime}$ in the $\sigma$ -algebra generated by the union of the $\mathscr{F}_{n}$ . (Indeed, if $n\geq 1$ and $B$ is in $\utilde{\Sigma}^{0}_{n}$ (resp. $\utilde{\Pi}^{0}_{n}$ ) then we can take $B^{\prime}$ to be $\utilde{\Sigma}^{0}_{n+5}$ (resp. $\utilde{\Pi}^{0}_{n+5}$ ), and if $\alpha\geq\omega$ and $B$ is $\utilde{\Sigma}^{0}_{\alpha}$ (resp. $\utilde{\Pi}^{0}_{\alpha}$ ) then $B^{\prime}$ may be taken to be $\utilde{\Sigma}^{0}_{\alpha}$ (resp. $\utilde{\Pi}^{0}_{\alpha}$ ). And the same for the lightface classes.)
(ii)

Suppose one adds to (IV) the condition that every $x$ in $\mathsf{SR}^{\nu}$ weakly computes a modulus of convergence for $E[f\mid n](x)\rightarrow f(x)$ , uniformly in $f$ from $\mathcal{C}$ . Then one can further conclude that for every $x$ in $\mathsf{SR}^{\nu}$ and every $L_{p}(\nu)$ Schnorr test $f$ , the point $x$ weakly computes a modulus of convergence for $E[f\mid n](x)\rightarrow f(x)$ in (2).

Regarding (i), note that this is saying that the hypotheses of the Theorem amount collectively to an assumption that the union of the filtration generates a $\sigma$ -algebra very close to the Borel $\sigma$ -algebra, from the perspective of $\nu$ .

Proof.

First we note three things about the maps $E[\cdot\mid n]$ and $\cdot^{\flat}$ for computable $p\geq 1$ .

For $p\geq 1$ , the map $E[\cdot\mid n]:\mathbb{L}_{p}^{+}(\nu)\rightarrow\mathbb{L}_{p}^{+}(\nu)$ maps $L_{p}(\nu)$ Schnorr tests uniformly to $L_{p}(\nu)$ Schnorr tests. For, by (I), it sends non-negative lsc functions to non-negative lsc functions. And by conditional Jensen and (III) and Propositions 2.4,2.5, it sends $L_{p}^{+}(\nu)$ computable points to $L_{p}^{+}(\nu)$ computable points.

For $p>1$ , the map $\cdot^{\flat}:\mathbb{L}_{p}^{+}(\nu)\rightarrow\mathbb{L}_{p}^{+}(\nu)$ maps $L_{p}(\nu)$ Schnorr tests uniformly to $L_{p}(\nu)$ Schnorr tests. For, by (I), it sends non-negative lsc functions to non-negative lsc functions. And by Proposition 5.2(1) and (III) and Propositions 2.4,2.5, it sends $L_{p}^{+}(\nu)$ computable points to $L_{p}^{+}(\nu)$ computable points.

For $p=1$ , the map $\cdot^{\flat}:\mathbb{L}_{p}^{+}(\nu)\rightarrow\mathbb{L}_{0}^{+}(\nu)$ sends $L_{p}(\nu)$ Schnorr tests uniformly to $L_{0}(\nu)$ Schnorr tests. For, by (I), it sends non-negative lsc functions to non-negative lsc functions. And by Proposition 5.2(2) and (III) and Propositions 2.4,2.5 it sends $L_{p}^{+}(\nu)$ computable points to $L_{0}^{+}(\nu)$ computable points.

Now we work on the equivalence of (1)-(3).

Suppose (1); we show (2). Suppose that $f$ is an $L_{p}(\nu)$ Schnorr test; we want to show that $f=\lim_{n}E[f\mid n]$ on $\mathsf{SR}^{\nu}$ . Choose $f_{s}$ from $\mathcal{C}$ as in (IV). By definition, one has that $f_{s}\rightarrow f$ pointwise on $\mathsf{XR}^{\nu}$ and fast in $L_{p}(\nu)$ and is non-decreasing on $\mathsf{XR}^{\nu}$ . Let $g_{s}$ be the $L_{p}(\nu)$ Schnorr test $f\ominus f_{s}$ . Then $g_{s}\rightarrow 0$ pointwise on $\mathsf{XR}^{\nu}$ and is non-increasing on $\mathsf{XR}^{\nu}$ and $g_{s}\rightarrow 0$ fast in $L_{p}(\nu)$ .

Suppose $p>1$ (resp. $p=1$ ). Since $f_{s}$ is finite on $\mathsf{XR}^{\nu}$ , we have that $f=f_{s}+f\ominus f_{s}$ on $\mathsf{XR}^{\nu}$ and hence by (IIc) we have $E[f\mid n]=E[f_{s}\mid n]+E[f\ominus f_{s}\mid n]\leq E[f_{s}\mid n]+g_{s}^{\flat}$ on $\mathsf{XR}^{\nu}$ . These are all $L_{p}(\nu)$ Schnorr tests (resp. except for $g_{s}^{\flat}$ which is an $L_{0}(\nu)$ Schnorr test), and so they are finite on $\mathsf{SR}^{\nu}$ and we hence have $E[f\mid n]-E[f_{s}\mid n]\leq g_{s}^{\flat}$ on $\mathsf{SR}^{\nu}$ . Since $f_{s}\leq f$ on $\mathsf{XR}^{\nu}$ , we have $E[f_{s}\mid n]\leq E[f\mid n]$ on $\mathsf{XR}^{\nu}$ by (IIa). Hence $\left|E[f\mid n]-E[f_{s}\mid n]\right|\leq g_{s}^{\flat}$ on $\mathsf{SR}^{\nu}$ . Since the maximal function maps into $L_{p}(\nu)$ (resp. $L_{0}(\nu)$ ) and has a computable modulus of uniform continuity (cf. Proposition 5.2), and since $g_{s}\rightarrow 0$ fast in $L_{p}(\nu)$ (resp. at a geometric rate in $L_{0}(\nu)$ ), one can compute a subsequence such that $g^{\flat}_{s(n)}\rightarrow 0$ fast in $L_{p}(\nu)$ (resp. in $L_{0}(\nu)$ ). By Proposition 4.3 one has that $g_{s(n)}^{\flat}\rightarrow 0$ pointwise on $\mathsf{SR}^{\nu}$ . Let $x$ in $\mathsf{SR}^{\nu}$ and let $\epsilon>0$ . Since $f,f_{s},E[f\mid n],E[f_{s}\mid n]$ are $L_{p}(\nu)$ Schnorr tests and since $g_{s}^{\flat}$ is an $L_{p}(\nu)$ Schnorr test (resp. $L_{0}(\nu)$ Schnorr test), these values are all finite on the $\mathsf{SR}^{\nu}$ point $x$ . Choose $n_{0}\geq 0$ such that for all $n\geq n_{0}$ one has $f(x)-f_{n}(x)<\frac{\epsilon}{3}$ . Choose $n_{1}\geq n_{0}$ such that for all $n\geq n_{1}$ one has $g_{s(n)}^{\flat}(x)<\frac{\epsilon}{3}$ . By the hypothesis on the $f_{s}$ coming from $\mathcal{C}$ , choose $n_{2}\geq n_{1}$ such that $\left|f_{s(n_{1})}(x)-E[f_{s(n_{1})}\mid n](x)\right|<\frac{\epsilon}{3}$ for all $n\geq n_{2}$ .
Hence for all $n\geq n_{2}$ one has that $\left|f(x)-E[f\mid n](x)\right|\leq\left|f(x)-f_{s(n_{1})}(x)\right|+\left|f_{s(n_{1})}(x)-E[f_{s(n_{1})}\mid n](x)\right|+\left|E[f_{s(n_{1})}\mid n](x)-E[f\mid n](x)\right|<\frac{\epsilon}{3}+\frac{\epsilon}{3}+g_{s(n_{1})}^{\flat}(x)<\epsilon$ .

Note that the previous paragraph yields (ii). For, Proposition 3.4(1) tells us that $x$ can weakly compute a modulus of convergence for $g_{s(n)}^{\flat}(x)\rightarrow 0$ . And Proposition 3.4(2) tells us that $x$ can weakly compute a modulus of convergence for $f_{n}(x)\rightarrow f(x)$ . And the extra hypothesis in (ii) says that $x$ can weakly compute a modulus of convergence for $E[f_{s(n_{1})}\mid n](x)\rightarrow f_{s(n_{1})}(x)$ .

The implication from (2) to (3) is trivial.

Suppose (3); we show (1). Suppose that $x$ is a point satisfying (3). We want to show that $x$ is in $\mathsf{SR}^{\nu}$ . Let $f$ be an $L_{p}(\nu)$ Schnorr test. We want to show that $f(x)<\infty$ . Suppose for reductio that $f(x)=\infty$ . By hypothesis, $\lim_{n}E[f\mid n](x)$ exists and is finite. Choose $n_{0}\geq 0$ and rationals $p<q$ such that $E[f\mid n](x)<p$ for all $n\geq n_{0}$ . Then $x$ is in the c.e. open $f^{-1}(q,\infty]$ . Using a $\nu$ -computable basis, choose c.e. open $U$ which is a subset of $f^{-1}(q,\infty]$ and which contains $x$ and which has computable $\nu$ -measure. Let $g=q\cdot I_{U}$ , which is an $L_{p}(\nu)$ Schnorr test. Since $g-q\leq 0$ everywhere, by (3), there is $n_{1}\geq n_{0}$ such that $q-p>\left|E[g\mid n](x)-q\right|=q-E[g\mid n](x)$ for all $n\geq n_{1}$ , and thus $E[g\mid n](x)>p$ for all such $n$ . Let $n\geq n_{1}$ . Since $g\leq f$ everywhere, by (IIa), we have $E[g\mid n](x)\leq E[f\mid n](x)<p$ , a contradiction.

For (i), by using a $\nu$ -computable basis it suffices to prove it for $U$ c.e. open with $\nu(U)$ computable. In this case, $f=I_{U}$ is an $L_{p}(\nu)$ Schnorr test. Then $f=\lim_{n}E[f\mid n]$ on $\mathsf{SR}^{\nu}$ . By (IIa), one has $0\leq E[f\mid n]\leq 1$ on $\mathsf{XR}^{\nu}$ . For rational $\epsilon$ in the interval $(0,\frac{1}{2})$ , let $V_{n,\epsilon}=E[f\mid n]^{-1}(1-\epsilon,\infty]$ . This set is in $\mathscr{F}_{n}$ since $E[f\mid n]$ is a version of the conditional expectation of $f$ relative to $\mathscr{F}_{n}$ . Further this set is c.e. open by the second paragraph of this proof. Then on $\mathsf{SR}^{\nu}$ one has that $U=\bigcap_{\epsilon\in\mathbb{Q}\cap(0,\frac{1}{2})}\bigcup_{n_{0}\geq 0}\bigcap_{n\geq n_{0}}V_{n,\epsilon}$ . (This event is further in $\utilde{\Pi}^{0}_{4}$ . When we decompose an arbitrary open as a union of c.e. opens with $\nu$ -computable measure, we will get an event in $\utilde{\Sigma}^{0}_{5}$ .) Hence, $U$ is equal on $\mathsf{SR}^{\nu}$ to an event $B^{\prime}$ in the $\sigma$ -algebra generated by the union of the $\mathscr{F}_{n}$ . ∎
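
The set identity in the proof of (i) can be unpacked as follows; this is only a sketch of the verification, with $f=I_{U}$ as in the proof. For $x$ in $\mathsf{SR}^{\nu}$ one has $\lim_{n}E[f\mid n](x)=I_{U}(x)$ , and the claim is that

$\displaystyle x\in U\;\Longleftrightarrow\;\forall\;\epsilon\in\mathbb{Q}\cap(0,\tfrac{1}{2})\;\exists\;n_{0}\geq 0\;\forall\;n\geq n_{0}\;\;E[f\mid n](x)>1-\epsilon$

If $x$ is in $U$ , then $E[f\mid n](x)\rightarrow 1$ , so the right-hand side holds. If $x$ is not in $U$ , then $E[f\mid n](x)\rightarrow 0$ , so for any fixed $\epsilon$ in $(0,\frac{1}{2})$ one has $E[f\mid n](x)<\frac{1}{2}<1-\epsilon$ for all sufficiently large $n$ , and the right-hand side fails.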

7. Fundamental properties of effective disintegrations

In this section, we develop the properties of effective disintegrations (cf. Definition 1.3, and for examples see Appendices A-B). In the next propositions, $\mathsf{XR}^{\nu}$ denotes a $\nu$ -measure one subset of $\mathsf{KR}^{\nu}$ , as in the definition of an effective disintegration. Further, throughout this section, the expression $\mathbb{E}_{\nu}[f\mid\mathscr{F}]$ is defined as in (1.1) of §1.2, namely the version of conditional expectation coming from the effective disintegration.

Proposition 7.1.

Suppose $\rho:X\rightarrow\mathcal{M}^{+}(X)$ is an $\mathsf{XR}^{\nu}$ disintegration of $\mathscr{F}$ . Then for lsc $f:X\rightarrow[0,\infty]$ , the map $\mathbb{E}_{\nu}[f\mid\mathscr{F}]:X\rightarrow[0,\infty]$ is lsc, uniformly in $f$ .

Proof.

By Definition 1.3(3) it suffices to show that for rational $r\geq 0$ we have

$\displaystyle\int f\;d\rho_{x}>r\mbox{ iff }\exists\;q\in\mathbb{Q}^{>0}\;\big(\rho_{x}(X)>\frac{r}{q}\;\&\;\rho_{x}(f^{-1}(q,\infty])>0\big)$

Suppose that $\int f\;d\rho_{x}>r$ . Since $r\geq 0$ , we have $\rho_{x}(X)>0$ . Choose rational $q>0$ in the interval $(\frac{r}{\rho_{x}(X)},\frac{\int f\;d\rho_{x}}{\rho_{x}(X)})$ . Then $\int f\;d\rho_{x}>\rho_{x}(X)\cdot q$ and $\rho_{x}(X)>\frac{r}{q}$ . Then $\rho_{x}(f^{-1}(q,\infty])>0$ , since otherwise $0\leq f\leq q$ $\rho_{x}$ -a.e. and hence $\int f\;d\rho_{x}\leq q\cdot\rho_{x}(X)$ .

Suppose that rational $q>0$ satisfies $\rho_{x}(X)>\frac{r}{q}$ and $\rho_{x}(f^{-1}(q,\infty])>0$ . If $f$ is not $\rho_{x}$ -integrable then trivially we have $\int f\;d\rho_{x}>r$ . Hence suppose $f$ is $\rho_{x}$ -integrable. Since $\rho_{x}(f^{-1}(q,\infty])>0$ , choose $\epsilon>0$ such that $\rho_{x}(f-q>\epsilon)>0$ . Then $0<\rho_{x}(f-q>\epsilon)\leq\frac{1}{\epsilon}\int f-q\;d\rho_{x}$ . Then $0<\int f-q\;d\rho_{x}$ . Then $q\cdot\rho_{x}(X)<\int f\;d\rho_{x}$ . Then $r<\int f\;d\rho_{x}$ . ∎

Proposition 7.2.

Suppose $\rho:X\rightarrow\mathcal{M}^{+}(X)$ is a $\mathsf{XR}^{\nu}$ disintegration of $\mathscr{F}$ . Then for $f,g$ in $\mathbb{L}_{1}^{+}(\nu)$ the conditional expectation satisfies the following monotone linearity properties:

(1)

If $f\leq g$ on $\mathsf{XR}^{\nu}$ , then $\mathbb{E}_{\nu}[f\mid\mathscr{F}]\leq\mathbb{E}_{\nu}[g\mid\mathscr{F}]$ on $\mathsf{XR}^{\nu}$ ;
(2)

If $c$ in $\mathbb{R}^{\geq 0}$ then $c\cdot\mathbb{E}_{\nu}[f\mid\mathscr{F}]=\mathbb{E}_{\nu}[c\cdot f\mid\mathscr{F}]$ everywhere.
(3)

$\mathbb{E}_{\nu}[f+g\mid\mathscr{F}]=\mathbb{E}_{\nu}[f\mid\mathscr{F}]+\mathbb{E}_{\nu}[g\mid\mathscr{F}]$ everywhere.

Proof.

For (1), suppose that $f\leq g$ on $\mathsf{XR}^{\nu}$ . Suppose that $x$ is in $\mathsf{XR}^{\nu}$ . By Definition 1.3(2), $\rho_{x}$ is in $\mathcal{P}(X)$ and $\rho_{x}([x]_{\mathscr{F}}\cap\mathsf{XR}^{\nu})=1$ . Then $f\leq g$ on a $\rho_{x}$ -measure one set, and hence $\int f(v)\;d\rho_{x}(v)\leq\int g(v)\;d\rho_{x}(v)$ . For (2)-(3), these just follow from the properties of the integral. ∎

The use of Definition 1.3(2) in the proof of the previous proposition is typical, and henceforth we do not explicitly reiterate it as we go along.

We stated the previous proposition for non-negative functions. For these functions, the conditional expectation $\mathbb{E}_{\nu}[f\mid\mathscr{F}](x)$ in (1.1) is automatically defined for all points $x$ in $X$ , even if it is infinite. However, $\mathbb{E}_{\nu}[f\mid\mathscr{F}](x)$ in (1.1) is automatically defined and finite when $f$ is a simple function, and so the previous proposition holds for these functions as well. More generally, the previous proposition holds for functions which take negative values, provided that $\mathbb{E}_{\nu}[f\mid\mathscr{F}](x)$ is defined and finite on all points $x$ of $\mathsf{XR}^{\nu}$ .

Proposition 7.3.

Suppose $\rho:X\rightarrow\mathcal{M}^{+}(X)$ is a $\mathsf{XR}^{\nu}$ disintegration of $\mathscr{F}$ .

If $f$ in $\mathbb{L}^{+}_{1}(\nu)$ is equal on $\mathsf{XR}^{\nu}$ to a function which is $\mathscr{F}$ -measurable, then one has $\mathbb{E}_{\nu}[f\mid\mathscr{F}]=f$ on $\mathsf{XR}^{\nu}$ .

Proof.

By Proposition 7.2(1), it suffices to consider functions in $\mathbb{L}^{+}_{1}(\nu)$ which are themselves $\mathscr{F}$ -measurable (as opposed to being merely equal on $\mathsf{XR}^{\nu}$ to such a function).

Suppose that $x$ is in $\mathsf{XR}^{\nu}$ . If $\mathbb{E}_{\nu}[f\mid\mathscr{F}](x)<f(x)$ , then for some rationals $p,q$ we have $\mathbb{E}_{\nu}[f\mid\mathscr{F}](x)<p<q<f(x)$ . Then $x$ is in the event $f^{-1}(q,\infty]$ in $\mathscr{F}$ . Then $[x]_{\mathscr{F}}\subseteq f^{-1}(q,\infty]$ . Then $f\geq q$ for $\rho_{x}$ -a.s. many values and hence $\int f\;d\rho_{x}\geq q$ , a contradiction. The case of $\mathbb{E}_{\nu}[f\mid\mathscr{F}](x)>f(x)$ is similar. ∎
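
For the reader's convenience, the omitted case can be sketched as follows, under the same assumptions as in the proof ( $f$ itself $\mathscr{F}$ -measurable and $x$ in $\mathsf{XR}^{\nu}$ ). If $\mathbb{E}_{\nu}[f\mid\mathscr{F}](x)>f(x)$ , choose rationals $p,q$ with $f(x)<q<p<\mathbb{E}_{\nu}[f\mid\mathscr{F}](x)$ . Then $x$ is in the event $f^{-1}[0,q)$ in $\mathscr{F}$ , so $[x]_{\mathscr{F}}\subseteq f^{-1}[0,q)$ and hence $f<q$ for $\rho_{x}$ -a.s. many values. Since $\rho_{x}$ is a probability measure, this gives

$\displaystyle\mathbb{E}_{\nu}[f\mid\mathscr{F}](x)=\int f\;d\rho_{x}\leq q<p<\mathbb{E}_{\nu}[f\mid\mathscr{F}](x)$

which is a contradiction.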

In the proposition below, we use the traditional names for the properties of conditional expectation (e.g. [75, 88]). Of course, the hypothesis of effective disintegrations in Definition 1.3(1) is that $\mathbb{E}_{\nu}[f\mid\mathscr{F}](x)=\int f\;d\rho_{x}$ is a version of conditional expectation. But in this proposition and several others in this section, what we are verifying is that they hold pointwise on specifiable $\nu$ -measure one subsets.

Proposition 7.4.

Suppose $\rho:X\rightarrow\mathcal{M}^{+}(X)$ is a $\mathsf{XR}^{\nu}$ disintegration of $\mathscr{F}$ .

(1)

(Conditional MCT). Suppose that $f_{n},f$ are in $\mathbb{L}_{1}^{+}(\nu)$ and $0\leq f_{n}\leq f_{n+1}$ on $\mathsf{XR}^{\nu}$ and $\lim_{n}f_{n}=f$ on $\mathsf{XR}^{\nu}$ . Then $\lim_{n}\mathbb{E}_{\nu}[f_{n}\mid\mathscr{F}]=\mathbb{E}_{\nu}[f\mid\mathscr{F}]$ on $\mathsf{XR}^{\nu}$ .
(2)
(Conditional DCT) Suppose that $f_{n},f,g$ are in $\mathbb{L}_{1}^{+}(\nu)$ and $\left|f_{n}\right|\leq g$ on $\mathsf{XR}^{\nu}$ and $\lim_{n}f_{n}=f$ on $\mathsf{XR}^{\nu}$ . Then:
(a)

If $x$ in $\mathsf{XR}^{\nu}$ and $\mathbb{E}_{\nu}[g\mid\mathscr{F}](x)<\infty$ then $\lim_{n}\mathbb{E}_{\nu}[f_{n}\mid\mathscr{F}](x)=\mathbb{E}_{\nu}[f\mid\mathscr{F}](x)$ .
(b)

If $\mathbb{E}_{\nu}[g\mid\mathscr{F}]<\infty$ on $\mathsf{XR}^{\nu}$ , then $\lim_{n}\mathbb{E}_{\nu}[f_{n}\mid\mathscr{F}]=\mathbb{E}_{\nu}[f\mid\mathscr{F}]$ on $\mathsf{XR}^{\nu}$ .
(3)

(‘Taking out what is known’). Suppose that $f,g$ in $\mathbb{L}_{1}^{+}(\nu)$ , and suppose that $g$ is equal on $\mathsf{XR}^{\nu}$ to a $\mathscr{F}$ -measurable function. Then $\mathbb{E}_{\nu}[f\cdot g\mid\mathscr{F}]=g\cdot\mathbb{E}_{\nu}[f\mid\mathscr{F}]$ on $\mathsf{XR}^{\nu}$ .

Proof.

For (1), let $x$ in $\mathsf{XR}^{\nu}$ . By hypothesis, $0\leq f_{n}\leq f_{n+1}$ for $\rho_{x}$ -a.s. many points, and likewise $\lim_{n}f_{n}=f$ for $\rho_{x}$ -a.s. many points. Then by MCT applied to $\rho_{x}$ we have $\lim_{n}\int f_{n}\;d\rho_{x}=\int f\;d\rho_{x}$ .

For (2), let $x$ in $\mathsf{XR}^{\nu}$ with $\mathbb{E}_{\nu}[g\mid\mathscr{F}](x)<\infty$ . This means that $\int g\;d\rho_{x}<\infty$ , and so $g$ is in $\mathbb{L}^{+}_{1}(\rho_{x})$ . By hypothesis, $\left|f_{n}\right|\leq g$ for $\rho_{x}$ -a.s. many points, and likewise $\lim_{n}f_{n}=f$ for $\rho_{x}$ -a.s. many points. Hence by the DCT applied to $\rho_{x}$ , we have that $\lim_{n}\int f_{n}\;d\rho_{x}=\int f\;d\rho_{x}$ .

For (3), by Proposition 7.2(1), it suffices to prove it for $g$ which is itself $\mathscr{F}$ -measurable. We show it by induction on complexity of $g$ .

Suppose $g=I_{A}$ where $A$ is $\mathscr{F}$ -measurable. If $x$ in $A$ then $[x]_{\mathscr{F}}\subseteq A$ and then $A$ is a $\rho_{x}$ -measure one event and then it reduces to the observation that $\int_{A}f(v)\;d\rho_{x}(v)=\int f(v)\;d\rho_{x}(v)$ . If $x$ is not in $A$ then $[x]_{\mathscr{F}}\subseteq X\setminus A$ and then $A$ is a $\rho_{x}$ -measure zero event and then it reduces to the observation that $\int_{A}f(v)\;d\rho_{x}(v)=0$ .

By Proposition 7.2(2)-(3), it extends to simple functions. By Conditional MCT it extends to all elements of $\mathbb{L}_{1}^{+}(\nu)$ . ∎

Unlike the previous propositions, this proposition concerns Kurtz disintegrations:

Proposition 7.5.

Suppose $\rho:X\rightarrow\mathcal{M}^{+}(X)$ is a Kurtz disintegration of $\mathscr{F}$ . If $p\geq 1$ is computable, then $\mathbb{E}_{\nu}[\cdot\mid\mathscr{F}]:L_{p}(\nu)\rightarrow L_{p}(\nu)$ is computable continuous.

Proof.

By conditional Jensen, the function $m(\epsilon)=\epsilon$ is a computable modulus of uniform continuity. Hence by Proposition 2.5, it suffices to show that if $\varphi=\sum_{i=1}^{n}q_{i}\cdot I_{A_{i}}$ is an element of the countable dense set of $L_{p}(\nu)$ , then $\mathbb{E}_{\nu}[\varphi\mid\mathscr{F}]$ is a computable point of $L_{p}(\nu)$ . Since we can effectively separate $\varphi$ into positive and negative parts, it suffices by the linearity of conditional expectation (Proposition 7.2) to consider the case where $q_{i}\geq 0$ . We may assume further that the $A_{i}$ are pairwise disjoint, which, as in the discussion at the beginning of §2.3, implies that $\varphi^{p}=\sum_{i=1}^{n}q_{i}^{p}\cdot I_{A_{i}}$ .
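
The modulus claim at the start of this proof can be unpacked as follows; this is the standard argument, recorded only as a sketch. For $f,g$ in $L_{p}(\nu)$ , conditional Jensen applied to the convex function $t\mapsto\left|t\right|^{p}$ gives $\left|\mathbb{E}_{\nu}[f-g\mid\mathscr{F}]\right|^{p}\leq\mathbb{E}_{\nu}[\left|f-g\right|^{p}\mid\mathscr{F}]$ $\nu$ -a.s. Taking expectations and then $p$ -th roots:

$\displaystyle\|\mathbb{E}_{\nu}[f\mid\mathscr{F}]-\mathbb{E}_{\nu}[g\mid\mathscr{F}]\|_{p}=\|\mathbb{E}_{\nu}[f-g\mid\mathscr{F}]\|_{p}\leq\big(\mathbb{E}_{\nu}\mathbb{E}_{\nu}[\left|f-g\right|^{p}\mid\mathscr{F}]\big)^{\frac{1}{p}}=\|f-g\|_{p}$

Hence $\|f-g\|_{p}<\epsilon$ implies $\|\mathbb{E}_{\nu}[f\mid\mathscr{F}]-\mathbb{E}_{\nu}[g\mid\mathscr{F}]\|_{p}<\epsilon$ .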

By Proposition 2.13, let $U_{i}$ be a c.e. open which is equal to $A_{i}$ on $\mathsf{KR}^{\nu}$ . Let $f=\sum_{i=1}^{n}q_{i}\cdot I_{U_{i}}$ , so that likewise $f^{p}=\sum_{i=1}^{n}q_{i}^{p}\cdot I_{U_{i}}$ on $\mathsf{KR}^{\nu}$ . Then $f=\varphi$ on $\mathsf{KR}^{\nu}$ and $f^{p}=\varphi^{p}$ on $\mathsf{KR}^{\nu}$ . By Proposition 7.2 we have for $x$ in $\mathsf{KR}^{\nu}$ :

$\displaystyle\mathbb{E}_{\nu}[f\mid\mathscr{F}](x)=\sum_{i=1}^{n}q_{i}\int I_{U_{i}}(v)\;d\rho_{x}(v)=\sum_{i=1}^{n}q_{i}\cdot\rho_{x}(U_{i}),$
$\displaystyle\mathbb{E}_{\nu}[f^{p}\mid\mathscr{F}](x)=\sum_{i=1}^{n}q_{i}^{p}\int I_{U_{i}}(v)\;d\rho_{x}(v)=\sum_{i=1}^{n}q_{i}^{p}\cdot\rho_{x}(U_{i})$

By Definition 1.3(3), the functions $x\mapsto\rho_{x}(U_{i})$ are lsc. Hence, by Proposition 2.16, choose functions $\upsilon_{i,s}$ from the countable dense set of $L_{p}(\nu)$ that converge upward to $\rho_{\cdot}(U_{i})$ , in that $0\leq\upsilon_{i,s}\leq\upsilon_{i,s+1}$ everywhere and $\rho_{\cdot}(U_{i})=\sup_{s}\upsilon_{i,s}$ everywhere. Let $g_{s}=\sum_{i=1}^{n}q_{i}\cdot\upsilon_{i,s}$ and $h_{s}=\sum_{i=1}^{n}q_{i}^{p}\cdot\upsilon_{i,s}$ , which likewise converge upward to $\mathbb{E}_{\nu}[f\mid\mathscr{F}]$ and $\mathbb{E}_{\nu}[f^{p}\mid\mathscr{F}]$ respectively on $\mathsf{KR}^{\nu}$ .

Suppose $x$ is in $\mathsf{KR}^{\nu}$ . Since we are working with a Kurtz disintegration, we then have that $\mathsf{KR}^{\nu}$ is a $\rho_{x}$ -measure one set, and so $\rho_{x}(U_{i})=\rho_{x}(A_{i})$ . Since the $A_{i}$ are pairwise disjoint, we then have $\sum_{i=1}^{n}\rho_{x}(U_{i})=\sum_{i=1}^{n}\rho_{x}(A_{i})\leq 1$ , and so $\sum_{i=1}^{n}\big(\rho_{x}(U_{i})-\upsilon_{i,s}(x)\big)\leq 1$ . Then for $x$ in $\mathsf{KR}^{\nu}$ , by the convexity of the $p$ -th power function applied with coefficients $\rho_{x}(U_{i})-\upsilon_{i,s}(x)$ and points $q_{i}$ , we have:

		$\displaystyle(\mathbb{E}_{\nu}[f\mid\mathscr{F}](x)-g_{s}(x))^{p}=\bigg{(}\sum_{i=1}^{n}q_{i}\cdot(\rho_{x}(U_{i})-\upsilon_{i,s}(x))\bigg{)}^{p}$
	$\displaystyle\leq$	$\displaystyle\sum_{i=1}^{n}q_{i}^{p}\cdot(\rho_{x}(U_{i})-\upsilon_{i,s}(x))=\mathbb{E}_{\nu}[f^{p}\mid\mathscr{F}](x)-h_{s}(x)$

Since this estimate holds on the $\nu$ -measure one set $\mathsf{KR}^{\nu}$ , by taking expectations and then $p$ -th roots, we have $\|\mathbb{E}_{\nu}[f\mid\mathscr{F}]-g_{s}\|_{p}\leq\big{(}\mathbb{E}_{\nu}f^{p}-\mathbb{E}_{\nu}h_{s}\big{)}^{\frac{1}{p}}$ . Since the right-hand side is a computable value which goes to zero as $s$ goes to infinity (by MCT), we can compute a subsequence of the $g_{s}$ which is a witness to the $L_{p}(\nu)$ computability of $\mathbb{E}_{\nu}[f\mid\mathscr{F}]$ . Since $f,\varphi$ are equal on $\mathsf{KR}^{\nu}$ , we have $\mathbb{E}_{\nu}[\varphi\mid\mathscr{F}]$ is also a computable point of $L_{p}(\nu)$ . ∎
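The convexity step in the preceding proof can be checked numerically. The following is a minimal sketch, not part of the proof: it assumes non-negative points $q_{i}$ and non-negative coefficients $c_{i}$ with $\sum_{i}c_{i}\leq 1$ (standing in for $\rho_{x}(U_{i})-\upsilon_{i,s}(x)$ ), and the names `convexity_gap`, `random_instance`, and `check` are hypothetical.

```python
import random

def convexity_gap(q, c, p):
    """Return sum_i q_i^p * c_i - (sum_i q_i * c_i)^p, which should be >= 0."""
    lhs = sum(qi * ci for qi, ci in zip(q, c)) ** p
    rhs = sum((qi ** p) * ci for qi, ci in zip(q, c))
    return rhs - lhs

def random_instance(n, rng):
    """Non-negative points q_i and non-negative coefficients c_i with sum(c_i) < 1."""
    q = [rng.uniform(0.0, 5.0) for _ in range(n)]
    raw = [rng.uniform(0.1, 1.0) for _ in range(n)]
    scale = rng.random() / sum(raw)   # total mass is a random number in [0, 1)
    c = [r * scale for r in raw]
    return q, c

def check(trials=1000, seed=0):
    """Test the inequality on random instances with exponent p in [1, 4]."""
    rng = random.Random(seed)
    for _ in range(trials):
        q, c = random_instance(rng.randint(1, 6), rng)
        p = rng.uniform(1.0, 4.0)
        if convexity_gap(q, c, p) < -1e-9:   # small tolerance for float rounding
            return False
    return True
```

The inequality holds because $t\mapsto t^{p}$ is convex and vanishes at $0$ , so the missing mass $1-\sum_{i}c_{i}$ can be placed on the point $0$ .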

The previous proposition has the following elementary consequence:

Proposition 7.6.

Suppose $\rho:X\rightarrow\mathcal{M}^{+}(X)$ is a Kurtz disintegration of $\mathscr{F}$ . Suppose $p\geq 1$ is computable.

(1)

If $f$ is an $L_{p}(\nu)$ Martin-Löf test, then $\mathbb{E}_{\nu}[f\mid\mathscr{F}]$ is an $L_{p}(\nu)$ Martin-Löf test and $\mathbb{E}_{\nu}[f^{p}\mid\mathscr{F}]$ is an $L_{1}(\nu)$ Martin-Löf test.
(2)

If $f$ is an $L_{p}(\nu)$ Schnorr test, then $\mathbb{E}_{\nu}[f\mid\mathscr{F}]$ is an $L_{p}(\nu)$ Schnorr test and $\mathbb{E}_{\nu}[f^{p}\mid\mathscr{F}]$ is an $L_{1}(\nu)$ Schnorr test.

Proof.

For (1), suppose that $f$ is an $L_{p}(\nu)$ Martin-Löf test. Then $f,f^{p}$ are non-negative lsc, and so by Proposition 7.1 one has that $\mathbb{E}_{\nu}[f\mid\mathscr{F}]$ and $\mathbb{E}_{\nu}[f^{p}\mid\mathscr{F}]$ are non-negative lsc. By conditional Jensen, $\|\mathbb{E}_{\nu}[f\mid\mathscr{F}]\|_{p}\leq\|f\|_{p}<\infty$ , and likewise $\|\mathbb{E}_{\nu}[f^{p}\mid\mathscr{F}]\|_{1}\leq\|f^{p}\|_{1}=\|f\|^{p}_{p}<\infty$ .

For (2), suppose that $f$ is an $L_{p}(\nu)$ Schnorr test. Since $\|f^{p}\|_{1}=\|f\|^{p}_{p}$ , one has that $f^{p}$ is an $L_{1}(\nu)$ Schnorr test. By Proposition 2.16, $f$ is a computable point of $L_{p}(\nu)$ , and $f^{p}$ is a computable point of $L_{1}(\nu)$ . By the previous proposition $\mathbb{E}_{\nu}[f\mid\mathscr{F}]$ is a computable point of $L_{p}(\nu)$ , and $\mathbb{E}_{\nu}[f^{p}\mid\mathscr{F}]$ is a computable point of $L_{1}(\nu)$ . ∎

This next proposition seems specific to Schnorr tests and $\mathsf{SR}^{\nu}$ :

Proposition 7.7.

(Tower) Suppose that $\mathscr{H}\subseteq\mathscr{G}$ are two effective $\sigma$ -algebras, each of which has a Kurtz disintegration. Then for every $L_{1}(\nu)$ Schnorr test $f$ , one has that $\mathbb{E}_{\nu}[\mathbb{E}_{\nu}[f\mid\mathscr{G}]\mid\mathscr{H}]=\mathbb{E}_{\nu}[f\mid\mathscr{H}]$ on $\mathsf{SR}^{\nu}$ .

Proof.

By the previous proposition, one has that the two functions $g:=\mathbb{E}_{\nu}[\mathbb{E}_{\nu}[f\mid\mathscr{G}]\mid\mathscr{H}]$ and $h:=\mathbb{E}_{\nu}[f\mid\mathscr{H}]$ are $L_{1}(\nu)$ Schnorr tests. Suppose that they are not equal on $x$ in $\mathsf{SR}^{\nu}$ . Then, without loss of generality, there are rationals $a,b,c$ with $g(x)<a<b<c<h(x)$ . By Lemma 2.22, there is a computable real $\epsilon$ in the interval $(b,c)$ with $(h\#\nu)(\epsilon,\infty]$ computable. Since $h$ is lsc, the set $U:=h^{-1}(\epsilon,\infty]$ is c.e. open, and it has computable measure. By the same lemma, there is a computable real $\delta$ in the interval $(a,b)$ with $(g\#\nu)(\delta,\infty]$ computable, so that $(g\#\nu)[0,\delta]$ is likewise computable. Since $g$ is lsc, the set $C:=g^{-1}[0,\delta]$ is effectively closed. Since it has computable $\nu$ -measure, by Proposition 2.12, there is a decreasing sequence of c.e. opens $V_{n}\supseteq C$ with $\nu(V_{n})$ uniformly computable and $\nu(V_{n}\setminus C)<2^{-n}$ . We then claim that $\nu(U\cap C)>0$ . For, suppose not. Then $0=\nu(U\cap C)=\lim_{i}\nu(U\cap V_{i})$ . Since $\nu(U\cap V_{i})$ is computable by Proposition 2.8(2), we can then compute a subsequence $U\cap V_{n(i)}$ with $\nu(U\cap V_{n(i)})\leq 2^{-i}$ , so that $\sum_{i}I_{U\cap V_{n(i)}}$ is an $L_{1}(\nu)$ Schnorr test. But $x$ is in $\mathsf{SR}^{\nu}$ , and by construction $x$ is in $U\cap C\subseteq U\cap V_{n(i)}$ for all $i$ , so that this test is infinite at $x$ , a contradiction. Hence indeed $\nu(U\cap C)>0$ . Since $g,h$ are by definition $\mathscr{H}$ -measurable, we have that $U\cap C$ is also $\mathscr{H}$ -measurable and hence $\mathscr{G}$ -measurable. Then one has the following identities by the definition of conditional expectation (these identities being the classical proof of the tower property):

\int_{U\cap C}\mathbb{E}_{\nu}[f\mid\mathscr{H}]\;d\nu=\int_{U\cap C}f\;d\nu=\int_{U\cap C}\mathbb{E}_{\nu}[f\mid\mathscr{G}]\;d\nu=\int_{U\cap C}\mathbb{E}_{\nu}[\mathbb{E}_{\nu}[f\mid\mathscr{G}]\mid\mathscr{H}]\;d\nu

But these identities give us the below identity, where the remaining inequalities follow from the definitions of $g,h,U,C$ :

\epsilon\cdot\nu(U\cap C)\leq\int_{U\cap C}h=\int_{U\cap C}g\leq\delta\cdot\nu(U\cap C)

But since $\nu(U\cap C)>0$ , we then have that $\epsilon\leq\delta$ , contrary to construction.

∎
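The classical identity behind the tower property can be illustrated on a finite probability space, where conditional expectation given a partition is a block-wise weighted average. This is only a sketch under that finite assumption: the measure, the function, the two partitions, and the helper `cond_exp` are hypothetical choices, and exact rational arithmetic is used so that equality can be tested directly.

```python
from fractions import Fraction

def cond_exp(f, weights, partition):
    """Conditional expectation of f given the sigma-algebra generated by a finite partition.

    f and weights are dicts from points to Fractions; partition is a list of
    disjoint blocks of positive measure. On each block the value is the
    weighted average of f over that block.
    """
    out = {}
    for block in partition:
        mass = sum(weights[x] for x in block)
        avg = sum(weights[x] * f[x] for x in block) / mass
        for x in block:
            out[x] = avg
    return out

points = range(8)
weights = {x: Fraction(x + 1, 36) for x in points}   # a non-uniform probability measure
f = {x: Fraction(x * x) for x in points}
fine = [[0, 1], [2, 3], [4, 5], [6, 7]]              # generates the larger sigma-algebra G
coarse = [[0, 1, 2, 3], [4, 5, 6, 7]]                # generates H, a sub-sigma-algebra of G

lhs = cond_exp(cond_exp(f, weights, fine), weights, coarse)   # E[E[f | G] | H]
rhs = cond_exp(f, weights, coarse)                            # E[f | H]
```

In the finite setting the two sides agree at every point; the content of Proposition 7.7 is that, for the chosen versions, agreement holds on all of $\mathsf{SR}^{\nu}$ rather than merely $\nu$ -almost everywhere.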

Proposition 7.8.

(The rôle of independence) Suppose $\rho:X\rightarrow\mathcal{M}^{+}(X)$ is a Kurtz disintegration of $\mathscr{F}$ .

If $f$ in $\mathbb{L}_{1}^{+}(\nu)$ is independent of $\mathscr{F}$ , then $\mathbb{E}_{\nu}[f\mid\mathscr{F}]\leq\mathbb{E}_{\nu}[f]$ everywhere.

If $f$ is an $L_{1}(\nu)$ Martin-Löf test independent of $\mathscr{F}$ , then $\mathbb{E}_{\nu}[f\mid\mathscr{F}]=\mathbb{E}_{\nu}[f]$ on $\mathsf{KR}^{\nu}$ .

Proof.

Let us abbreviate $g=\mathbb{E}_{\nu}[f\mid\mathscr{F}]$ .

First suppose that $f$ in $\mathbb{L}_{1}^{+}(\nu)$ is independent of $\mathscr{F}$ . Let $x$ in $X$ be arbitrary. Suppose that $g(x)>\mathbb{E}_{\nu}f$ . Choose rational $a$ with $g(x)>a>\mathbb{E}_{\nu}f$ . Let $B=g^{-1}(a,\infty]$ , which is in $\mathscr{F}$ . Then we have the following, where the first step is independence: $\mathbb{E}_{\nu}[I_{B}\cdot f]=\nu(B)\cdot\mathbb{E}_{\nu}[f]<a\cdot\nu(B)\leq\int_{B}g\;d\nu=\mathbb{E}_{\nu}[I_{B}\cdot f]$ .

Second suppose that $f$ is an $L_{1}(\nu)$ Martin-Löf test, $f$ independent of $\mathscr{F}$ . Then $g$ is non-negative lsc by Proposition 7.1. Suppose $x$ is in $\mathsf{KR}^{\nu}$ . Suppose $g(x)<\mathbb{E}_{\nu}f$ . Choose rational $a>0$ with $g(x)<a<\mathbb{E}_{\nu}f$ . Let $C=g^{-1}[0,a]$ , which is effectively closed and in $\mathscr{F}$ . Since it contains the $\mathsf{KR}^{\nu}$ point $x$ , we have that $\nu(C)>0$ . Then we have the following, where the first step is from independence: $\nu(C)\cdot\mathbb{E}_{\nu}f=\mathbb{E}_{\nu}[I_{C}\cdot f]=\mathbb{E}_{\nu}[I_{C}\cdot g]\leq a\cdot\nu(C)$ . Since $\nu(C)>0$ we have $\mathbb{E}_{\nu}f\leq a$ , a contradiction. ∎

Now we turn towards effective properties of the maximal function (cf. §5 for the classical properties). Recall that almost-full was defined in Definition 1.1(12).

Proposition 7.9.

Let $\mathscr{F}_{n}$ be an almost-full effective filtration equipped with Kurtz disintegrations. Let $f^{\flat}(x)=\sup_{n}\mathbb{E}_{\nu}[f\mid\mathscr{F}_{n}](x)$ be the associated version of the maximal function.

For $p>1$ , the maximal function is computable continuous from $L_{p}(\nu)$ to $L_{p}(\nu)$ .

For $p=1$ , the maximal function is computable continuous from $L_{p}(\nu)$ to $L_{0}(\nu)$ .

For $p\geq 1$ , the maximal function sends the countable dense set of $L_{p}^{+}(\nu)$ (resp. $L_{p}(\nu)$ ) uniformly to computable points of $L_{p}^{+}(\nu)$ (resp. $L_{p}(\nu)$ ).

Proof.

First suppose $p>1$ . Let $f=\sum_{i=1}^{k}q_{i}\cdot I_{A_{i}}$ , where $q_{i}$ is rational and $A_{i}$ comes from the algebra generated by a $\nu$ -computable basis. Without loss of generality, $q_{i}\neq 0$ and the $A_{i}$ are pairwise disjoint. By almost-fullness of the filtration and Proposition 2.13, for each $1\leq i\leq k$ , let $A_{i}=\bigcup_{s}U_{i,s}$ on $\mathsf{KR}^{\nu}$ , where $U_{i,s}$ is a c.e. open equal on $\mathsf{KR}^{\nu}$ to an element from the sequence which generates $\mathscr{F}_{g(i,s)}$ , where $g$ is a computable function. By replacing $U_{i,s}$ by $\bigcup_{t\leq s}U_{i,t}$ , we may assume that $U_{i,s}$ and $g(i,s)$ are non-decreasing in $s$ , for each $1\leq i\leq k$ . Let $f_{s}=\sum_{i=1}^{k}q_{i}\cdot I_{U_{i,s}}$ . The $A_{i}$ and $U_{i,s}$ come from the algebra generated by a $\nu$ -computable basis, and so for each $1\leq i\leq k$ , we can compute a subsequence $U_{i,s(n)}$ such that $\nu(A_{i}\setminus U_{i,s(n)})<\big{(}\frac{1}{\max_{1\leq j\leq k}\left|q_{j}\right|}\cdot\frac{1}{k}\cdot\frac{p-1}{p}\cdot 2^{-n}\big{)}^{p}$ . By Doob’s Maximal Inequality (Lemma 5.1), we have

\|f^{\flat}-f_{s(n)}^{\flat}\|_{p}\leq\|(f-f_{s(n)})^{\flat}\|_{p}\leq\frac{p}{p-1}\cdot\|f-f_{s(n)}\|_{p}\leq\sum_{i=1}^{k}\left|q_{i}\right|\cdot\frac{p}{p-1}\cdot\nu(A_{i}\setminus U_{i,s(n)})^{\frac{1}{p}}

which is $<2^{-n}$ . Hence it remains to show that $f_{s(n)}^{\flat}$ is uniformly a computable point of $L_{p}(\nu)$ . Since for all $t\geq\max_{1\leq i\leq k}g(i,s(n))$ , the c.e. open $U_{i,s(n)}$ is equal on $\mathsf{KR}^{\nu}$ to an element of $\mathscr{F}_{t}$ , one has that $f_{s(n)}=\mathbb{E}_{\nu}[f_{s(n)}\mid\mathscr{F}_{t}]$ on $\mathsf{KR}^{\nu}$ by Proposition 7.3. Hence, $f_{s(n)}^{\flat}=\sup_{t\leq\max_{1\leq i\leq k}g(i,s(n))}\mathbb{E}[f_{s(n)}\mid\mathscr{F}_{t}]$ on $\mathsf{KR}^{\nu}$ . And the latter is a computable element of $L_{p}(\nu)$ since it is a finite max of conditional expectations which are computable elements of $L_{p}(\nu)$ by Proposition 7.5.

For $p=1$ , one just appeals to the fact that $L_{1}(\nu)$ and $L_{q}(\nu)$ for computable $q>1$ share a common dense set and that $L_{q}(\nu)$ computably embeds in $L_{1}(\nu)$ .

Then computable continuity follows from Proposition 2.5 and Proposition 5.2. ∎
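Doob's Maximal Inequality, used in the proof above, can be illustrated in a finite toy model. The sketch below assumes the uniform measure on bit strings of a fixed length `depth`, the filtration generated by initial segments, and non-negative $f$ ; the function names are hypothetical, and the random test functions are illustrative data only.

```python
import itertools
import random

def dyadic_maximal(f, depth):
    """Maximal function sup_{n <= depth} E[f | F_n] for uniform measure on {0,1}^depth.

    f maps bit-tuples of length depth to non-negative floats; F_n is generated by
    the first n bits, so E[f | F_n](w) is the average of f over the cylinder of w[:n].
    """
    points = list(itertools.product((0, 1), repeat=depth))
    best = {w: 0.0 for w in points}
    for n in range(depth + 1):
        sums = {}
        for w in points:
            sums[w[:n]] = sums.get(w[:n], 0.0) + f[w]
        block = 2 ** (depth - n)               # number of points in each cylinder
        for w in points:
            best[w] = max(best[w], sums[w[:n]] / block)
    return best

def lp_norm(g, p):
    """L_p norm with respect to the uniform probability measure on the keys of g."""
    return (sum(abs(v) ** p for v in g.values()) / len(g)) ** (1.0 / p)

def doob_holds(depth=6, p=2.0, trials=200, seed=1):
    """Check ||f^flat||_p <= p/(p-1) * ||f||_p on random non-negative f."""
    rng = random.Random(seed)
    points = list(itertools.product((0, 1), repeat=depth))
    for _ in range(trials):
        f = {w: rng.expovariate(1.0) for w in points}
        if lp_norm(dyadic_maximal(f, depth), p) > p / (p - 1) * lp_norm(f, p) + 1e-9:
            return False
    return True
```

Since every such $f$ is measurable with respect to the last level of the filtration, the supremum over all $n$ coincides with the finite maximum computed here.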

Proposition 7.10.

Let $\mathscr{F}_{n}$ be an almost-full effective filtration equipped with Kurtz disintegrations.

For every $L_{p}(\nu)$ Schnorr test $f$ , we can compute an index for a sequence of $L_{p}(\nu)$ Schnorr tests $g_{s}$ such that $g_{s}\leq g_{s+1}$ on $\mathsf{KR}^{\nu}$ and $f=\sup_{s}g_{s}$ on $\mathsf{KR}^{\nu}$ and $g_{s}\rightarrow f$ fast in $L_{p}(\nu)$ and $g_{s}=\lim_{n}\mathbb{E}_{\nu}[g_{s}\mid\mathscr{F}_{n}]$ on $\mathsf{KR}^{\nu}$ . Indeed, there is a computable function $n(\cdot)$ such that

g_{s}=\mathbb{E}_{\nu}[g_{s}\mid\mathscr{F}_{m}]\mbox{ on }\mathsf{KR}^{\nu}\mbox{ for all }m\geq n(s)

(7.1)

Further, we can compute an index for a non-negative usc function $h_{s}$ such that $g_{s},h_{s}$ are equal on $\mathsf{KR}^{\nu}$ .

Finally, we can compute from $s$ an index for a non-negative rational which bounds $\left|g_{s}\right|$ on $\mathsf{KR}^{\nu}$ .

Proof.

Enumerate $\mathbb{Q}\cap[0,\infty)$ as $q_{0},q_{1},\ldots$ . For each $n\geq 0$ , one has that $f^{-1}(q_{n},\infty]$ is uniformly c.e. open. Since the filtration is almost-full, by using Proposition 2.13 there is a computable sequence $U_{n,j}$ of c.e. opens such that $f^{-1}(q_{n},\infty]=\bigcup_{j}U_{n,j}$ on $\mathsf{KR}^{\nu}$ , where the $U_{n,j}$ are equal on $\mathsf{KR}^{\nu}$ to events from the sequence which generates $\mathscr{F}_{m}$ . Then define

f_{s}(x)=\max\{0,q_{n}:n,j\leq s,x\in U_{n,j}\}

(7.2)

As in the proof of Proposition 2.16, this is a rational-valued step function, whose events are equal on $\mathsf{KR}^{\nu}$ to events from the sequence which generates $\mathscr{F}_{n(s)}$ , where $n(\cdot)$ is a computable function. Then by Proposition 7.3 one has that

f_{s}=\mathbb{E}_{\nu}[f_{s}\mid\mathscr{F}_{m}]\mbox{ on }\mathsf{KR}^{\nu}\mbox{ for all }m\geq n(s)

(7.3)

Further from (7.2) one sees that $f_{s}\leq f_{s+1}$ everywhere since the set over which we are taking the maximum grows in $s$ . Further, one has $f_{s}\leq f$ on $\mathsf{KR}^{\nu}$ since if we had $f_{s}(x)>f(x)$ for $x$ in $\mathsf{KR}^{\nu}$ , then $f_{s}(x)=q_{n}$ for some $n,j\leq s$ with $x$ in $U_{n,j}$ . But since $U_{n,j}\subseteq f^{-1}(q_{n},\infty]$ on $\mathsf{KR}^{\nu}$ , we then have $f(x)>q_{n}=f_{s}(x)$ , a contradiction. Finally, one has $\sup_{s}f_{s}=f$ on $\mathsf{KR}^{\nu}$ , since if not we would have $\sup_{s}f_{s}(x)<q_{n}<f(x)$ for some $x$ in $\mathsf{KR}^{\nu}$ and some $n$ and hence $x$ would be in $f^{-1}(q_{n},\infty]=\bigcup_{j}U_{n,j}$ and so $x$ would be in $U_{n,j}$ for some $j$ and hence for $s\geq\max(n,j)$ one would have that $f_{s}(x)\geq q_{n}$ by definition in (7.2), a contradiction.

We can pass to a subsequence of $f_{s}$ which goes to $f$ fast in $L_{p}(\nu)$ as in the proof of Proposition 2.16.

Since $f_{s}$ is formed from events which are equal on $\mathsf{KR}^{\nu}$ to events coming from a $\nu$ -computable basis, by Proposition 2.23 there are a non-negative lsc $g_{s}$ and a non-negative usc $h_{s}$ such that $f_{s},g_{s},h_{s}$ are equal on $\mathsf{KR}^{\nu}$ . Since $f_{s},g_{s}$ are equal on $\mathsf{KR}^{\nu}$ , we can use Proposition 7.2(1) to infer (7.1) from (7.3). ∎

8. Proof of Theorems 1.5-1.6

First we prove Theorem 1.5:

Proof.

The results of the previous section show that the conditions of Theorem 6.2 are satisfied:

–

Condition (I) is Proposition 7.1.
–

Condition (II) is Proposition 7.2.
–

Condition (III) is Propositions 7.5, 7.9.
–

Condition (IV) is Proposition 7.10.

Finally, we argue from Theorem 1.5(4) to Theorem 1.5(1). Suppose Theorem 1.5(4) is satisfied. We want to show that $x$ is in $\mathsf{SR}^{\nu}$ . Let $f$ be an $L_{p}(\nu)$ Schnorr test. We want to show that $f(x)<\infty$ . Suppose not. Since by hypothesis $\lim_{n}\mathbb{E}_{\nu}[f\mid\mathscr{F}_{n}](x)$ exists, there are rationals $b,a$ and there is $n_{0}\geq 0$ such that $f(x)>b>a>\mathbb{E}_{\nu}[f\mid\mathscr{F}_{n}](x)$ for all $n\geq n_{0}$ . Then $x$ is in the c.e. open $f^{-1}(b,\infty]$ . Since the filtration is almost-full and $x$ is in $\mathsf{KR}^{\nu}$ , there is $n_{1}\geq n_{0}$ and an event $A$ from $\mathscr{F}_{n_{1}}$ such that $x$ is in $A$ and $A\cap\mathsf{KR}^{\nu}\subseteq f^{-1}(b,\infty]$ . Then $[x]_{\mathscr{F}_{n_{1}}}\subseteq A$ , and hence $[x]_{\mathscr{F}_{n_{1}}}\cap\mathsf{KR}^{\nu}\subseteq A\cap\mathsf{KR}^{\nu}\subseteq f^{-1}(b,\infty]$ . Hence $f^{-1}(b,\infty]$ is a $\rho_{x}^{(n_{1})}$ -measure one event, and hence $\mathbb{E}_{\nu}[f\mid\mathscr{F}_{n_{1}}](x)=\int f\;d\rho_{x}^{(n_{1})}\geq b>a$ , a contradiction.

∎

Now we turn to Theorem 1.6:

Proof.

For Theorem 1.6(1), one appeals to Theorem 6.2(ii), along with Proposition 7.10.

For Theorem 1.6(2) suppose that $x$ in $\mathsf{SR}^{\nu}$ is of computably dominated degree. By the previous paragraph, we have that $x$ weakly computes a modulus $m:\mathbb{Q}^{>0}\rightarrow\mathbb{N}$ for the convergence $\mathbb{E}[f\mid\mathscr{F}_{n}](x)\rightarrow f(x)$ . Likewise $m^{\prime}:\mathbb{N}\rightarrow\mathbb{N}$ defined by $m^{\prime}(i)=m(2^{-i})$ is computable from $m$ . Since $x$ is of computably dominated degree, we have that at least one of these $m^{\prime}$ is dominated by some computable function, call it $m^{{\dagger}}$ , so that past some point, call it $i_{0}$ , we have $m^{{\dagger}}\geq m^{\prime}$ . Let $\epsilon>0$ be rational. Compute $j\geq i_{0}$ such that $2^{-j}<\epsilon$ . Let $n\geq m^{{\dagger}}(j)$ . Then $n\geq m^{{\dagger}}(j)\geq m^{\prime}(j)=m(2^{-j})$ . Then $\left|\mathbb{E}[f\mid\mathscr{F}_{n}](x)-f(x)\right|<2^{-j}<\epsilon$ . ∎
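The last step of this argument, which turns a computable dominating function $m^{{\dagger}}$ and a threshold $i_{0}$ into a rate of convergence, can be sketched as follows. The sequence `a`, the modulus `m_prime`, and the bound `m_dagger` are hypothetical toy data chosen only so that the domination hypothesis is easy to verify by hand; they are not taken from the paper.

```python
def rate_from_domination(m_dagger, i0):
    """Turn a computable bound m_dagger >= m' on [i0, infinity) into a rate of convergence.

    Returns a function sending eps > 0 to m_dagger(j), where j >= i0 is least
    with 2**-j < eps.
    """
    def rate(eps):
        j = i0
        while 2.0 ** (-j) >= eps:
            j += 1
        return m_dagger(j)
    return rate

# Toy data (hypothetical): the sequence a_n = 1/(n+1) converges to 0.
def a(n):
    return 1.0 / (n + 1)

def m_prime(i):
    return 2 ** i          # for n >= 2**i one has a_n < 2**-i

def m_dagger(i):
    return 3 ** i + 5      # dominates m_prime everywhere, so i0 = 0 works

rate = rate_from_domination(m_dagger, 0)
```

For any `eps`, every index at or beyond `rate(eps)` then lies within `eps` of the limit, mirroring the chain $n\geq m^{{\dagger}}(j)\geq m^{\prime}(j)=m(2^{-j})$ .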

9. Proof of Theorems 1.8-1.9

We begin with an elementary proposition.

Proposition 9.1.

Suppose $\mu_{n},\mu$ are in $\mathcal{P}(X)$ and are such that for every c.e. open $U$ one has $\lim_{n}\mu_{n}(U)=\mu(U)$ .

(1)

For every event $A$ in the algebra generated by the c.e. opens, one has $\lim_{n}\mu_{n}(A)=\mu(A)$ .
(2)

For every simple function $f$ generated from events in this algebra, one has $\lim_{n}\int f\;d\mu_{n}=\int f\;d\mu$ .
(3)

For every lsc $f:X\rightarrow[0,\infty]$ which is in each of $\mathbb{L}^{+}_{1}(\mu_{n}),\mathbb{L}^{+}_{1}(\mu)$ , one has $\int f\;d\mu\leq\liminf_{n}\int f\;d\mu_{n}$ .

This proposition too illustrates that convergence to the truth is a strengthening of weak convergence, since in (3) there is no boundedness constraint on the lsc function.

Proof.

For (1), every event in this algebra can be written as a finite disjoint union of sets of the form $U_{1}\cap\cdots\cap U_{m}\cap V_{1}^{c}\cap\cdots\cap V_{n}^{c}$ , where $U_{i},V_{i}$ are c.e. opens. Let $U=U_{1}\cap\cdots\cap U_{m}$ and $V=V_{1}\cup\cdots\cup V_{n}$ , so that the set has the form $U\setminus V$ . Since $\mu_{n},\mu$ are in $\mathcal{P}(X)$ and since $U\cap V$ is c.e. open as well, we have that

\lim_{n}\mu_{n}(U\setminus V)=\lim_{n}\mu_{n}(U)-\lim_{n}\mu_{n}(U\cap V)=\mu(U)-\mu(U\cap V)=\mu(U\setminus V)

For (2), one just applies the previous item and the properties of the integral.

For (3), suppose not. Choose rational $q$ such that $\int f\;d\mu>q>\liminf_{n}\int f\;d\mu_{n}$ . Let $f_{s}$ be the approximation to $f$ as in Proposition 2.16. Then by the MCT applied in $\mathbb{L}^{+}_{1}(\mu)$ , there is $s\geq 0$ such that $\int f_{s}\;d\mu>q$ . By (2), there is $n_{0}\geq 0$ such that for all $n\geq n_{0}$ one has that $\int f_{s}\;d\mu_{n}>q$ . But this contradicts that $q>\liminf_{n}\int f\;d\mu_{n}$ . ∎

Now we prove Theorem 1.8:

Proof.

Suppose Theorem 1.8(1); we show Theorem 1.8(2). Since $x$ is in $\mathsf{MLR}^{\nu}$ , one has that $\rho_{x}^{(n)}$ is in $\mathcal{P}(X)$ . Suppose that $f$ is an $L_{p}(\nu)$ Martin-Löf test. Since $x$ is in $\mathsf{MLR}^{\nu}$ , one has $f(x)<\infty$ . By Proposition 7.6(1), the functions $\mathbb{E}_{\nu}[f\mid\mathscr{F}_{n}]$ are $L_{p}(\nu)$ Martin-Löf tests as well, and hence $\mathbb{E}_{\nu}[f\mid\mathscr{F}_{n}](x)<\infty$ , which is just to say that $\int f\;d\rho^{(n)}_{x}<\infty$ . By Proposition 9.1(3) applied to $\rho^{(n)}_{x},\delta_{x}$ one has that $\int f\;d\delta_{x}\leq\liminf_{n}\int f\;d\rho^{(n)}_{x}$ , which is just to say that $f(x)\leq\liminf_{n}\mathbb{E}_{\nu}[f\mid\mathscr{F}_{n}](x)$ . Hence, it remains to show that $\limsup_{n}\mathbb{E}_{\nu}[f\mid\mathscr{F}_{n}](x)\leq f(x)$ . For reductio, suppose not, so that there are rationals $a,b$ with $f(x)<a<b<\limsup_{n}\mathbb{E}_{\nu}[f\mid\mathscr{F}_{n}](x)$ .

Let $U=f^{-1}(a,\infty]$ and $C=f^{-1}[0,a]$ , so that $U$ is c.e. open and $C$ is effectively closed. Since $x$ is in $\mathsf{DR}_{\rho}^{\nu}$ and $x$ is not in $U$ , we have $\rho^{(n)}_{x}(U)\rightarrow 0$ .

By definition of $C$ , we have for all $n\geq 0$ that $\int_{C}f\;d\rho^{(n)}_{x}\leq a$ . For any $n\geq 0$ such that $\mathbb{E}_{\nu}[f\mid\mathscr{F}_{n}](x)>b$ , we then have $\int_{C}f\;d\rho^{(n)}_{x}\leq a<b<\int fd\rho^{(n)}_{x}$ , so that $\int_{U}f\;d\rho^{(n)}_{x}>b-a$ . Hence, our reductio hypothesis gives that there are infinitely many $n\geq 0$ with $\int_{U}f\;d\rho^{(n)}_{x}>b-a$ .

By Proposition 7.6(1), we have that $\mathbb{E}_{\nu}[f^{p}\mid\mathscr{F}_{n}]$ are $L_{1}(\nu)$ Martin-Löf tests. Hence its maximal function $\sup_{n}\mathbb{E}_{\nu}[f^{p}\mid\mathscr{F}_{n}]$ is also non-negative lsc. Let $U_{k}=\{y\in X:\sup_{n}\mathbb{E}_{\nu}[f^{p}\mid\mathscr{F}_{n}](y)>2^{k}\}$ , which is c.e. open. By Doob’s Submartingale Inequality, one has that $\nu(U_{k})$ is $\leq$ the following:

\lim_{m}\nu(\{y\in X:\sup_{n\leq m}\mathbb{E}_{\nu}[f^{p}\mid\mathscr{F}_{n}](y)>2^{k}\})\leq\lim_{m}2^{-k}\int\mathbb{E}_{\nu}[f^{p}\mid\mathscr{F}_{m}]\;d\nu=2^{-k}\|f\|_{p}^{p}

Hence $g=\sum_{k}I_{U_{k}}$ is an $L_{1}(\nu)$ Martin-Löf test, and hence there is a constant $K>0$ such that $\sup_{n}\mathbb{E}_{\nu}[f^{p}\mid\mathscr{F}_{n}](x)<K$ .

Then for all $n\geq 0$ we have $\int f^{p}\;d\rho^{(n)}_{x}<K$ , so that $f$ is in $L_{p}(\rho^{(n)}_{x})$ with $\|f\|_{L_{p}(\rho^{(n)}_{x})}<K^{\frac{1}{p}}$ . Let $q$ be the conjugate exponent to $p$ . Then, for each $n\geq 0$ , we have by Hölder with respect to $\rho^{(n)}_{x}$ that:

\int_{U}f\;d\rho^{(n)}_{x}=\|f\cdot I_{U}\|_{L_{1}(\rho^{(n)}_{x})}\leq\|f\|_{L_{p}(\rho^{(n)}_{x})}\cdot\|I_{U}\|_{L_{q}(\rho^{(n)}_{x})}\leq K^{\frac{1}{p}}\cdot(\rho^{(n)}_{x}(U))^{\frac{1}{q}}

(9.1)

Since $\rho^{(n)}_{x}(U)\rightarrow 0$ , we have that $\int_{U}f\;d\rho^{(n)}_{x}\rightarrow 0$ , contradicting the previous conclusion from the reductio hypothesis.

The implication from (2) to (3) is trivial.

The argument from (3) to (1) is nearly identical to the proof in §8 from Theorem 1.5(4) to Theorem 1.5(1): one just replaces $\mathsf{SR}^{\nu}$ with $\mathsf{MLR}^{\nu}$ and replaces $L_{p}(\nu)$ Schnorr tests with $L_{p}(\nu)$ Martin-Löf tests. ∎
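The Hölder estimate (9.1) from the first part of this proof can be illustrated on a finite measure space. The sketch below assumes $p>1$ , so that the conjugate exponent $q=p/(p-1)$ is finite; the helper `holder_sides` and the three-point measure are hypothetical.

```python
def holder_sides(f, weights, U, p):
    """Return (integral of f over U, ||f||_p * rho(U)**(1/q)) for a finite measure rho.

    weights maps points to their rho-mass, f maps points to non-negative values,
    U is a set of points, and p > 1 with conjugate exponent q = p / (p - 1).
    """
    q = p / (p - 1.0)
    lhs = sum(weights[x] * f[x] for x in U)
    norm_p = sum(weights[x] * f[x] ** p for x in weights) ** (1.0 / p)
    mass_U = sum(weights[x] for x in U)
    return lhs, norm_p * mass_U ** (1.0 / q)

# Toy data (hypothetical): three points, with f largest on the small set U.
weights = {0: 0.5, 1: 0.25, 2: 0.25}
f = {0: 1.0, 1: 2.0, 2: 4.0}
lhs, rhs = holder_sides(f, weights, {2}, 2.0)
```

The right-hand side tends to zero as the mass of $U$ does, provided the $L_{p}$ norm stays bounded, which is how the bound $K$ is used in the proof.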

Now we prove Theorem 1.9:

Proof.

We work in Cantor space with the uniform measure $\nu$ , the effective full filtration $\mathscr{F}_{n}$ of the algebra of events generated by the length $n$ strings, and with the effective disintegration $\rho^{(n)}_{\omega}=\nu(\cdot\mid[\omega\upharpoonright n])$ . Then $\mathbb{E}[f\mid\mathscr{F}_{n}](\omega)=\frac{1}{\nu([\omega\upharpoonright n])}\int_{[\omega\upharpoonright n]}f\;d\nu$ , and likewise $\mathbb{E}[I_{A}\mid\mathscr{F}_{n}](\omega)=\nu(A\mid[\omega\upharpoonright n])$ . (This is just Example B.1 for Cantor space and uniform measure).

We show that for $\omega$ in $\mathsf{DR}_{\rho}^{\nu}$ there is a c.e. open $U$ with $0<\nu(U)<1$ such that $\omega$ is not in $U$ and the convergence $\nu(U\mid[\omega\upharpoonright n])\rightarrow 0$ does not have a computable rate.

(Since the example involves an indicator function, we have that $I_{U}$ is in $L_{p}(\nu)$ for all computable $p\geq 1$ ).

Let $k\geq 0$ . Let $K$ be the halting set $\{e:\varphi_{e}(e){\downarrow}\}$ . Enumerate it as $e_{0},e_{1},\ldots$ , where the map $n\mapsto e_{n}$ is injective.

Define $c_{0}=0$ and $c_{n+1}=\max(\varphi_{e_{n}}(e_{n}),c_{n})+1$ .

For $k,n\geq 0$ , define clopen $U_{k,n}=\{\omega:\forall\;i\in[c_{n+1},c_{n+1}+e_{n}+k+1)\;\omega(i)=0\}$ and define the c.e. open $U_{k}=\bigcup_{n}U_{k,n}$ . Then $\nu(U_{k,n})=2^{-(e_{n}+k+1)}$ and $0<\nu(U_{k})\leq\sum_{n}\nu(U_{k,n})<2^{-k}$ . Since $U_{k,n}$ just makes decisions on bits $\geq c_{n+1}$ , it is independent of all bits $<c_{n+1}$ (since $\nu$ is uniform measure). We then have that $\nu(U_{k,n}\mid[\omega\upharpoonright c_{n+1}])=2^{-(e_{n}+k+1)}$ for any $\omega$ , and hence $\nu(U_{k}\mid[\omega\upharpoonright c_{n+1}])\geq 2^{-(e_{n}+k+1)}$ for any $\omega$ .

Let $\omega$ be in $\mathsf{DR}_{\rho}^{\nu}$ . Since $\sum_{k}I_{U_{k}}$ is an $L_{1}(\nu)$ Martin-Löf test, one has that there is $k$ such that $\omega$ is not in $U_{k}$ . Hence $\nu(U_{k}\mid[\omega\upharpoonright n])\rightarrow 0$ . Suppose that $\nu(U_{k}\mid[\omega\upharpoonright n])\rightarrow 0$ with computable rate $m$ . Let $e_{n}$ be such that $\varphi_{e_{n}}(i)=m(2^{-(i+k+1)})$ for all $i\geq 0$ . Then $\varphi_{e_{n}}(e_{n})=m(2^{-(e_{n}+k+1)})$ . Since $c_{n+1}\geq\varphi_{e_{n}}(e_{n})=m(2^{-(e_{n}+k+1)})$ , one has that $\nu(U_{k}\mid[\omega\upharpoonright c_{n+1}])<2^{-(e_{n}+k+1)}$ , a contradiction to the previous paragraph. ∎

Note that the c.e. sets $U_{k}$ constructed above are dense, and their definition interleaves Example 2.15 with the halting set. This seems natural, since as noted in Proposition 2.14, if $\overline{U_{k}}$ were effectively closed with the same $\nu$ -measure as $U_{k}$ , then we could include it in a $\nu$ -computable basis.
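The combinatorics of the sets $U_{k,n}$ can be sketched on a finite stand-in for the enumeration of the halting set. Since the halting set is not computable, the list `TOY_ENUMERATION` of pairs $(e_{n},\varphi_{e_{n}}(e_{n}))$ below is purely hypothetical, as are the helper names; the sketch only checks the independence claim $\nu(U_{k,n}\mid[\omega\upharpoonright c_{n+1}])=2^{-(e_{n}+k+1)}$ by brute force.

```python
from fractions import Fraction
from itertools import product

# Hypothetical stand-in for an enumeration of the halting set: pairs (e_n, phi_{e_n}(e_n)).
TOY_ENUMERATION = [(1, 2), (0, 1), (2, 3)]

def thresholds(enumeration):
    """c_0 = 0 and c_{n+1} = max(phi_{e_n}(e_n), c_n) + 1."""
    c = [0]
    for _, value in enumeration:
        c.append(max(value, c[-1]) + 1)
    return c

def block(k, n, enumeration):
    """Half-open interval of bit positions that U_{k,n} forces to be 0."""
    start = thresholds(enumeration)[n + 1]
    return range(start, start + enumeration[n][0] + k + 1)

def conditional_measure(k, n, prefix, enumeration):
    """nu(U_{k,n} | [prefix]) for the uniform measure, by brute-force enumeration."""
    zeros = block(k, n, enumeration)
    length = max(len(prefix), zeros.stop)
    free = length - len(prefix)            # number of undetermined bits
    hits = 0
    for tail in product((0, 1), repeat=free):
        word = tuple(prefix) + tail
        if all(word[i] == 0 for i in zeros):
            hits += 1
    return Fraction(hits, 2 ** free)
```

Because the forced positions all lie at or beyond $c_{n+1}$ , the conditional measure does not depend on the prefix of length $c_{n+1}$ , which is the independence used in the proof.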

10. Proof of Theorem 1.11

First we note a fact mentioned in the introduction, namely that Maximal Doob Randomness is in between Martin-Löf and Schnorr randomness:

Proposition 10.1.

For all computable $p\geq 1$ one has $\mathsf{MLR}^{\nu}\subseteq\mathsf{MDR}^{\nu,p}\subseteq\mathsf{SR}^{\nu}$ .

Proof.

Since any $L_{p}(\nu)$ maximal Doob test is an $L_{p}(\nu)$ Martin-Löf test, we have $\mathsf{MLR}^{\nu}\subseteq\mathsf{MDR}^{\nu,p}$ . To show that $\mathsf{MDR}^{\nu,p}\subseteq\mathsf{SR}^{\nu}$ , it suffices to show that any $L_{p}(\nu)$ Schnorr test $f$ is an $L_{p}(\nu)$ maximal Doob test. By Proposition 2.16, let $f_{s}$ be from the countable dense set of $L_{p}(\nu)$ so that $0\leq f_{s}\leq f_{s+1}$ on $\mathsf{KR}^{\nu}$ and $f=\sup_{s}f_{s}$ on $\mathsf{KR}^{\nu}$ and $f_{s}\rightarrow f$ fast in $L_{p}(\nu)$ . Then we can compute a subsequence $s(n)$ such that $\|f-f_{s(n)}\|_{p}<e^{-n}$ . Then for all $k\geq 0$ one has that $\sum_{n}\|f-f_{s(n)}\|_{p}\cdot(n+1)^{k}\leq\sum_{n}e^{-n}(n+1)^{k}<\infty$ . Then we are done by Lemma 3.2. ∎
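The summability of $\sum_{n}e^{-n}(n+1)^{k}$ used at the end of this proof can be seen numerically. This is a small sketch with a hypothetical helper `weighted_tail`; it illustrates the decay for a few values of $k$ and is of course not a substitute for the ratio test.

```python
import math

def weighted_tail(k, start, stop):
    """Sum of exp(-n) * (n + 1)**k over start <= n < stop."""
    return sum(math.exp(-n) * (n + 1) ** k for n in range(start, stop))

# Partial sums for k = 0, 1, 2, 3 over the first 200 terms.
partial_sums = [weighted_tail(k, 0, 200) for k in range(4)]
```

The exponential factor eventually beats any fixed polynomial weight, so the blocks of later terms contribute a negligible amount for each fixed $k$ .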

The following closure condition on $L_{p}(\nu)$ maximal Doob tests is the difficult component of the proof of Theorem 1.11:

Proposition 10.2.

If $p>1$ is computable and $f$ is an $L_{p}(\nu)$ maximal Doob test with witness $f_{s}$ , then $g=\sum_{s}\sup_{n}\mathbb{E}_{\nu}[f-f_{s}\mid\mathscr{F}_{n}]$ is equal on $\mathsf{KR}^{\nu}$ to an $L_{p}(\nu)$ maximal Doob test with witness equal on $\mathsf{KR}^{\nu}$ to $g_{t}=\sum_{s<t}\sup_{n}\mathbb{E}_{\nu}[f_{t}-f_{s}\mid\mathscr{F}_{n}]$ .

Proof.

The function $g$ is equal on $\mathsf{KR}^{\nu}$ to a non-negative lsc since it is equal on $\mathsf{KR}^{\nu}$ to a supremum of non-negative lsc functions (cf. Proposition 7.1, Proposition 7.2(1)). Since $p>1$ , by Proposition 7.9, we have that $g_{t}$ is equal on $\mathsf{KR}^{\nu}$ to an $L_{p}(\nu)$ Schnorr test. For $k\geq 0$ , the quantity $\sum_{t}\|g-g_{t}\|_{p}\cdot(t+1)^{k}$ is $\leq$ :

		$\displaystyle\sum_{t}\Big{\|}\sum_{s\geq t}\sup_{n}\mathbb{E}_{\nu}[f-f_{s}\mid\mathscr{F}_{n}]\Big{\|}_{p}\cdot(t+1)^{k}$		(10.1)
	$\displaystyle+$	$\displaystyle\sum_{t}\Big{\|}\sum_{s<t}\big{(}\sup_{n}\mathbb{E}_{\nu}[f-f_{s}\mid\mathscr{F}_{n}]-\sup_{n}\mathbb{E}_{\nu}[f_{t}-f_{s}\mid\mathscr{F}_{n}]\big{)}\Big{\|}_{p}\cdot(t+1)^{k}$		(10.2)

To estimate (10.1), let us first define a computable sequence of non-negative left-c.e. reals:

c_{s,t}=\begin{cases}0&\text{if $t>s$},\\ \|\sup_{n}\mathbb{E}_{\nu}[f-f_{s}\mid\mathscr{F}_{n}]\|_{p}\cdot(s+1)^{k}&\text{if $t\leq s$}.\end{cases}

For fixed $s\geq 0$ we have $\sum_{t}c_{s,t}=(s+1)\cdot c_{s,s}=\|\sup_{n}\mathbb{E}_{\nu}[f-f_{s}\mid\mathscr{F}_{n}]\|_{p}\cdot(s+1)^{k+1}$ . To estimate (10.1), we have the following, where the last line follows from Doob’s Maximal Inequality (Lemma 5.1(1)):

		$\displaystyle\sum_{t}\Big{\|}\sum_{s\geq t}\sup_{n}\mathbb{E}_{\nu}[f-f_{s}\mid\mathscr{F}_{n}]\Big{\|}_{p}\cdot(t+1)^{k}\leq\sum_{t}\sum_{s\geq t}\|\sup_{n}\mathbb{E}_{\nu}[f-f_{s}\mid\mathscr{F}_{n}]\|_{p}\cdot(t+1)^{k}$
	$\displaystyle\leq$	$\displaystyle\sum_{t}\sum_{s\geq t}\|\sup_{n}\mathbb{E}_{\nu}[f-f_{s}\mid\mathscr{F}_{n}]\|_{p}\cdot(s+1)^{k}=\sum_{t}\sum_{s\geq t}c_{s,t}$
	$\displaystyle=$	$\displaystyle\sum_{t}\sum_{s}c_{s,t}=\sum_{s}\sum_{t}c_{s,t}=\sum_{s}\|\sup_{n}\mathbb{E}_{\nu}[f-f_{s}\mid\mathscr{F}_{n}]\|_{p}\cdot(s+1)^{k+1}$
	$\displaystyle\leq$	$\displaystyle\sum_{s}\frac{p}{p-1}\cdot\|f-f_{s}\|_{p}\cdot(s+1)^{k+1}<\infty$

For (10.2), we have the following, where we use Doob’s Maximal Inequality (Lemma 5.1) again at the end:

		$\displaystyle\sum_{t}\Big{\|}\sum_{s<t}\big{(}\sup_{n}\mathbb{E}_{\nu}[f-f_{s}\mid\mathscr{F}_{n}]-\sup_{n}\mathbb{E}_{\nu}[f_{t}-f_{s}\mid\mathscr{F}_{n}]\big{)}\Big{\|}_{p}\cdot(t+1)^{k}$
	$\displaystyle\leq$	$\displaystyle\sum_{t}\sum_{s<t}\|\sup_{n}\mathbb{E}_{\nu}[f-f_{t}\mid\mathscr{F}_{n}]\|_{p}\cdot(t+1)^{k}$
	$\displaystyle=$	$\displaystyle\sum_{t}t\cdot\|\sup_{n}\mathbb{E}_{\nu}[f-f_{t}\mid\mathscr{F}_{n}]\|_{p}\cdot(t+1)^{k}$
	$\displaystyle\leq$	$\displaystyle\sum_{t}\|\sup_{n}\mathbb{E}_{\nu}[f-f_{t}\mid\mathscr{F}_{n}]\|_{p}\cdot(t+1)^{k+1}$
	$\displaystyle\leq$	$\displaystyle\sum_{t}\frac{p}{p-1}\cdot\|f-f_{t}\|_{p}\cdot(t+1)^{k+1}<\infty$

∎
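The exchange of summation used to estimate (10.1) rests on the fact that each index $s$ is counted once for every $t\leq s$ , that is, $s+1$ times. The sketch below checks this identity exactly on a finite sequence; the list `a` is a hypothetical stand-in for the norms $\|\sup_{n}\mathbb{E}_{\nu}[f-f_{s}\mid\mathscr{F}_{n}]\|_{p}$ , and the function names are likewise hypothetical.

```python
from fractions import Fraction

def triangular_sum(a, k):
    """Sum over t of the sum over s >= t of a[s] * (s + 1)**k, for a finite list a."""
    n = len(a)
    return sum(a[s] * (s + 1) ** k for t in range(n) for s in range(t, n))

def collapsed_sum(a, k):
    """Sum over s of a[s] * (s + 1)**(k + 1)."""
    return sum(a_s * (s + 1) ** (k + 1) for s, a_s in enumerate(a))

a = [Fraction(1, 2 ** s) for s in range(12)]   # hypothetical stand-in for the norms
```

This is why the witness condition must be available with exponent $k+1$ in order to conclude it for $g$ with exponent $k$ .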

Here is the proof of Theorem 1.11:

Proof.

Suppose (1); we prove (2). One has that $x$ is in $\mathsf{KR}^{\nu}$ and indeed $x$ is in $\mathsf{SR}^{\nu}$ by Proposition 10.1. Suppose now that $f$ is an $L_{p}(\nu)$ maximal Doob test with witness $f_{s}$ . By the previous proposition and (1), we have $\lim_{s}\sup_{n}\mathbb{E}_{\nu}[f-f_{s}\mid\mathscr{F}_{n}](x)=0$ . Let $\epsilon>0$ . Choose $s_{0}\geq 0$ such that for all $s\geq s_{0}$ we have $\sup_{n}\mathbb{E}_{\nu}[f-f_{s}\mid\mathscr{F}_{n}](x)<\frac{\epsilon}{3}$ . Choose $s_{1}\geq s_{0}$ such that for all $s\geq s_{1}$ we have $f(x)-f_{s}(x)<\frac{\epsilon}{3}$ . By Theorem 1.5 applied to $f_{s_{1}}$ , we have that $f_{s_{1}}(x)=\lim_{n}\mathbb{E}_{\nu}[f_{s_{1}}\mid\mathscr{F}_{n}](x)$ . Choose $n_{0}\geq 0$ such that for all $n\geq n_{0}$ we have $\left|f_{s_{1}}(x)-\mathbb{E}_{\nu}[f_{s_{1}}\mid\mathscr{F}_{n}](x)\right|<\frac{\epsilon}{3}$ . Then putting this all together, we have for all $n\geq n_{0}$ that $\left|f(x)-\mathbb{E}_{\nu}[f\mid\mathscr{F}_{n}](x)\right|$ is $\leq$ the following:

\left|f(x)-f_{s_{1}}(x)\right|+\left|f_{s_{1}}(x)-\mathbb{E}_{\nu}[f_{s_{1}}\mid\mathscr{F}_{n}](x)\right|+\left|\mathbb{E}_{\nu}[f_{s_{1}}\mid\mathscr{F}_{n}](x)-\mathbb{E}_{\nu}[f\mid\mathscr{F}_{n}](x)\right|<\epsilon

The step from (2) to (3) is trivial.

The step from (3) to (1) is exactly as in the corresponding step of the proof of Theorem 1.5 (in §8), but with the class of $L_{p}(\nu)$ Schnorr tests replaced by the class of $L_{p}(\nu)$ maximal Doob tests. ∎

11. Back and forth between tests and computable points

In this section we indicate how to state a version of Theorem 1.5 in terms of $L_{p}(\nu)$ -computable points. This essentially follows by a translation method of Miyabe.

We begin with how to select a version for each computable point of $L_{p}(\nu)$ . Pathak, Rojas, and Simpson and Rute have shown how to do this via Proposition 4.1. We use the following slight variant of their selection method:

Definition 11.1.

Suppose that $f$ is a computable point of $L_{1}(\nu)$ with witness $f_{n}$ . Then we define a version $f_{\infty}$ in $\mathbb{L}_{1}(\nu)$ by:

f_{\infty}(x)=\begin{cases}\lim_{n}f_{n}(x)&\text{if $\lim_{n}f_{n}(x)$ exists},\\ 0&\text{otherwise}.\end{cases}

Hence, Proposition 4.1 tells us that the definition of $f_{\infty}$ goes through the first case break on all points of $\mathsf{SR}^{\nu}$ , and on these points it is independent of the choice of the witness $f_{n}$ . However, on $X\setminus\mathsf{SR}^{\nu}$ we have that it is dependent on the witness $f_{n}$ . If one were working more extensively with $f_{\infty}$ , one would want to develop some notation which better marks its dependence on the witness $f_{n}$ . But this dependence has the following advantage: if $f$ is an $L_{p}(\nu)$ Schnorr test and $f_{n}$ is a witness to its being $L_{p}(\nu)$ -computable such that $f_{n}\rightarrow f$ everywhere (as in Proposition 2.16), then $f=f_{\infty}$ everywhere.⁸⁷

⁸⁷ Pathak, Rojas, and Simpson [52, Definition 3.8 p. 314] work with a variant of our $f_{\infty}$ that organises the case break depending on whether $x$ is in $\mathsf{SR}^{\nu}$ . Their approach has the advantage of making $f_{\infty}$ , which they denote as $\widehat{f}$ , entirely independent of the witness $f_{n}$ . Rute [61, Definition 3.17 p. 16] organises the case break the same as we do but sets it undefined when the limit does not exist, which prevents it from being an $L_{p}(\nu)$ Schnorr test.

Miyabe proved the following transfer result for going back and forth between $L_{p}(\nu)$ -computable functions and differences of $L_{p}(\nu)$ Schnorr tests:⁸⁸

⁸⁸ [45, Theorem 4.3 p. 7]. He proved it for $p=1$ , but the proof is the same for $p\geq 1$ .

Proposition 11.2.

Suppose that $\nu$ is a computable point of $\mathcal{P}(X)$ and $p\geq 1$ is computable.

(1)

Suppose $f$ is a computable point of $L_{p}(\nu)$ with witness $f_{n}$ . Then there are $L_{p}(\nu)$ Schnorr tests $g,h$ such that $f_{\infty}=g-h$ on $\mathsf{SR}^{\nu}$ .
(2)

Suppose that $g,h$ are $L_{p}(\nu)$ Schnorr tests. Then there is $L_{p}(\nu)$ -computable $f$ with witness $f_{n}$ such that $f_{\infty}=g-h$ on $\mathsf{SR}^{\nu}$ .

Proof.

(Sketch) For (1), using Proposition 2.23, one shows that $g=\sum_{n}(f_{n+1}-f_{n})^{+}$ and $h=\sum_{n}(f_{n+1}-f_{n})^{-}$ are equal on $\mathsf{KR}^{\nu}$ to $L_{p}(\nu)$ Schnorr tests, where $\cdot^{+}$ and $\cdot^{-}$ denote positive and negative parts. For (2), one uses Proposition 2.16. ∎

These observations allow us to restate Theorem 1.5 in terms of $L_{p}(\nu)$ -computable points, provided one assumes Schnorr disintegrations:

Corollary 11.3.

Suppose that $X$ is a computable Polish space and $\nu$ is a computable probability measure. Suppose that $\mathscr{F}_{n}$ is an almost-full effective filtration, equipped with Schnorr disintegrations.

If $p\geq 1$ is computable, then the following three items are equivalent for $x$ in $X$ :

(1)

$x$ is in $\mathsf{SR}^{\nu}(X)$ .
(2)

$x$ is in $\mathsf{KR}^{\nu}$ and $\lim_{n}\mathbb{E}_{\nu}[f_{\infty}\mid\mathscr{F}_{n}](x)=f_{\infty}(x)$ for every $L_{p}(\nu)$ computable $f$ with witness $f_{m}$ .
(3)

$x$ is in $\mathsf{KR}^{\nu}$ and $\lim_{n}\mathbb{E}_{\nu}[f_{\infty}\mid\mathscr{F}_{n}](x)$ exists for every $L_{p}(\nu)$ computable $f$ with witness $f_{m}$ .

Proof.

Suppose (2); we show Theorem 1.5(2). For this, it suffices to note that any $L_{p}(\nu)$ Schnorr test $f$ has a witness $f_{m}$ from Proposition 2.16 with $f=f_{\infty}$ everywhere.

Suppose Theorem 1.5(2); we show (2). Let $f$ be $L_{p}(\nu)$ computable with witness $f_{m}$ . By the previous proposition, there are two $L_{p}(\nu)$ Schnorr tests $g,h$ such that $f_{\infty}=g-h$ on $\mathsf{SR}^{\nu}$ . Since we are working with Schnorr disintegrations, by Proposition 7.2(1), one has that $\mathbb{E}_{\nu}[f_{\infty}\mid\mathscr{F}_{n}]=\mathbb{E}_{\nu}[g-h\mid\mathscr{F}_{n}]$ on $\mathsf{SR}^{\nu}$ for all $n\geq 0$ . Then by Theorem 1.5(2) and Proposition 7.2(3), we have on $\mathsf{SR}^{\nu}$ that $f_{\infty}=g-h=\lim_{n}\big{(}\mathbb{E}_{\nu}[g\mid\mathscr{F}_{n}]-\mathbb{E}_{\nu}[h\mid\mathscr{F}_{n}]\big{)}=\lim_{n}\mathbb{E}_{\nu}[f_{\infty}\mid\mathscr{F}_{n}]$ .

Finally, (2) trivially implies (3). And (3) implies Theorem 1.5(4) since again any $L_{p}(\nu)$ Schnorr test $f$ has a witness $f_{m}$ from Proposition 2.16 with $f=f_{\infty}$ everywhere. ∎

12. Martingale convergence in $L_{2}(\nu)$

Our topic in this paper is convergence of the conditional expectations of random variables. But of course these are instances of martingales. In this brief section, we prove a result mentioned in §1.4, namely that one can characterise $\mathsf{SR}^{\nu}$ in terms of convergence of certain $L_{2}(\nu)$ martingales. As mentioned there, this generalises a result of Rute from Cantor space to the more general setting.

If $p\geq 1$ and if $\mathscr{F}_{n}$ is a filtration, then a classical martingale in $L_{p}(\nu)$ adapted to $\mathscr{F}_{n}$ is a sequence $M_{n}$ of $\mathscr{F}_{n}$ measurable functions in $L_{p}(\nu)$ such that $M_{n}=\mathbb{E}_{\nu}[M_{n+1}\mid\mathscr{F}_{n}]$ $\nu$ -a.s. When $\mathscr{F}_{n}$ is clear from context, we just say classical martingale in $L_{p}(\nu)$ .

If $p\geq 1$ is computable and $\mathscr{F}_{n}$ is an effective filtration equipped with Schnorr disintegrations $\rho^{(n)}$ , then a martingale of $L_{p}(\nu)$ Schnorr tests adapted to $\mathscr{F}_{n}$ and $\rho^{(n)}$ is a uniformly computable sequence $M_{n}$ of $\mathscr{F}_{n}$ -measurable $L_{p}(\nu)$ Schnorr tests such that $M_{n}=\mathbb{E}_{\nu}[M_{n+1}\mid\mathscr{F}_{n}]$ on $\mathsf{SR}^{\nu}$ , where the version of the conditional expectation is that from the disintegration. When $\mathscr{F}_{n}$ and $\rho^{(n)}$ are clear from context, we just say martingale of $L_{p}(\nu)$ Schnorr tests.

Here is an example:

Example 12.1.

(Products of mean one independent variables).

Suppose $p\geq 1$ is computable. Suppose that $f_{n}:X\rightarrow[0,\infty]$ is a sequence of independent $L_{p}(\nu)$ Schnorr tests with $\mathbb{E}_{\nu}f_{n}=1$ for all $n\geq 1$ . Suppose that $\mathscr{F}_{n}=\sigma(f_{1},\ldots,f_{n})$ is an effective filtration equipped with Schnorr disintegrations. Then $M_{n}=\prod_{i=1}^{n}f_{i}$ is a martingale of $L_{p}(\nu)$ Schnorr tests.

To see this, note that the $M_{n}$ are $L_{p}(\nu)$ Schnorr tests: they are non-negative lsc by Proposition 2.6, and by independence we have $\|M_{n}\|_{p}=\prod_{i=1}^{n}\|f_{i}\|_{p}$ , which is computable. On $\mathsf{SR}^{\nu}$ one has

\mathbb{E}_{\nu}[M_{n+1}\mid\mathscr{F}_{n}]=\mathbb{E}_{\nu}[f_{n+1}\cdot M_{n}\mid\mathscr{F}_{n}]=M_{n}\cdot\mathbb{E}_{\nu}[f_{n+1}\mid\mathscr{F}_{n}]=M_{n}\cdot\mathbb{E}_{\nu}f_{n+1}=M_{n}

The second identity is by taking out what is known (Proposition 7.4(3)) and the third identity is by the rôle of independence (Proposition 7.8).
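The martingale computation in Example 12.1 can be checked exactly on a finite toy model. The following is only a sketch under stated assumptions: the sample space $\{0,1\}^{N}$ with the uniform measure, the factors $f_{i}(x)=2x_{i}$, and the helper names `f`, `M`, `cond_exp`, and `martingale_holds` are all hypothetical choices made for illustration, and nothing here models lower semi-computability or Schnorr tests.

```python
from fractions import Fraction
from itertools import product

N = 4  # number of coordinates in the toy sample space {0,1}^N (uniform measure)
points = list(product((0, 1), repeat=N))

def f(i, x):
    """Hypothetical mean-one factor f_i(x) = 2 * x_i (i is 1-indexed), so E f_i = 1."""
    return Fraction(2 * x[i - 1])

def M(n, x):
    """Product martingale M_n = f_1 * ... * f_n, with M_0 = 1."""
    value = Fraction(1)
    for i in range(1, n + 1):
        value *= f(i, x)
    return value

def cond_exp(g, n, x):
    """E[g | F_n](x): average g over the points sharing the first n coordinates of x."""
    cell = [y for y in points if y[:n] == x[:n]]
    return sum(g(y) for y in cell) / len(cell)

def martingale_holds():
    """Check E[M_{n+1} | F_n] = M_n at every point and every stage n < N."""
    return all(
        cond_exp(lambda y: M(n + 1, y), n, x) == M(n, x)
        for n in range(N)
        for x in points
    )
```

Because $f_{i}$ is injective in the $i$-th coordinate, the cells used in `cond_exp` are exactly the atoms of $\sigma(f_{1},\ldots,f_{n})$ in this toy model, and exact rational arithmetic avoids any floating-point tolerance in the comparison.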

Our goal is to prove the following:

Theorem 12.2.

Suppose that $\nu$ is a computable point of $\mathcal{P}(X)$ . Let $\mathscr{F}_{n}$ be an almost-full effective filtration equipped with Schnorr disintegrations.

The following are equivalent for $x$ in $X$ :

(1)

$x$ is in $\mathsf{SR}^{\nu}$ .
(2)

$x$ is in $\mathsf{KR}^{\nu}$ and $\lim_{n}M_{n}(x)$ exists for every martingale $M_{n}$ of $L_{2}(\nu)$ Schnorr tests such that both $\sup_{n}\|M_{n}\|_{2}$ is computable and the maximal function $\sup_{n}M_{n}$ is an $L_{2}(\nu)$ Schnorr test.

This theorem generalises a result of Rute [61, Corollary 6.8 and Theorem 12.6]. But there are two important differences between our result and Rute’s. First, Rute’s analogue of the direction from (2) to (1) of Theorem 12.2 only works for Cantor space and the uniform measure. Second, Rute’s results do not require that the maximal function $\sup_{n}M_{n}$ be an $L_{2}(\nu)$ Schnorr test.

We do not know the answer to the following:

Question 12.3.

Does Theorem 12.2 also hold for all computable $p>1$ ?

It is clear from the proofs below that (2) to (1) holds for computable $p>1$ . Hence it is a question of (1) to (2).

Throughout the remainder of this section, $X$ is a computable Polish space, $\nu$ is a computable point of $\mathcal{P}(X)$ , and $\mathscr{F}_{n}$ is an effective filtration equipped with Schnorr disintegrations. We only assume that $\mathscr{F}_{n}$ is almost-full in the last proposition, and flag this assumption when it comes up.

We begin by noting the following two elementary results:

Proposition 12.4.

If $M_{n}$ is a martingale of $L_{p}(\nu)$ Schnorr tests, then $M_{n}=\mathbb{E}_{\nu}[M_{m}\mid\mathscr{F}_{n}]$ on $\mathsf{SR}^{\nu}$ for all $m>n$ .

Proof.

This is by induction on $m>n$ . The base case $m=n+1$ is the martingale condition itself. Suppose it holds for $m>n$ . Then $M_{n}=\mathbb{E}_{\nu}[M_{m}\mid\mathscr{F}_{n}]$ on $\mathsf{SR}^{\nu}$ . Since $M_{m}=\mathbb{E}_{\nu}[M_{m+1}\mid\mathscr{F}_{m}]$ on $\mathsf{SR}^{\nu}$ and since we are working with a Schnorr disintegration, we have by Proposition 7.2(1) that $\mathbb{E}_{\nu}[M_{m}\mid\mathscr{F}_{n}]=\mathbb{E}_{\nu}[\mathbb{E}_{\nu}[M_{m+1}\mid\mathscr{F}_{m}]\mid\mathscr{F}_{n}]$ on $\mathsf{SR}^{\nu}$ . By the tower property (Proposition 7.7), this latter is equal to $\mathbb{E}_{\nu}[M_{m+1}\mid\mathscr{F}_{n}]$ on $\mathsf{SR}^{\nu}$ . ∎

Proposition 12.5.

If $M_{n}$ is a classical martingale in $L_{p}(\nu)$ , then $\|M_{m}\|_{p}\geq\|M_{n}\|_{p}$ for all $m>n$ .

Proof.

The function $x\mapsto\left|x\right|^{p}$ is a convex function, and hence $\left|M_{n}\right|^{p}$ is a submartingale [75, p. 138]. And the expectation of a submartingale is always non-decreasing. ∎

The following gives a canonical example of a martingale of $L_{p}(\nu)$ Schnorr tests, and in conjunction with Theorem 1.5 gives the (2) to (1) direction of Theorem 12.2.

Proposition 12.6.

Suppose that $p\geq 1$ is computable.

If $f$ is an $L_{p}(\nu)$ Schnorr test, then $M_{n}:=\mathbb{E}_{\nu}[f\mid\mathscr{F}_{n}]$ is a martingale of $L_{p}(\nu)$ Schnorr tests.

If $p>1$ and the filtration is almost-full then the maximal function $\sup_{n}M_{n}$ is also an $L_{p}(\nu)$ Schnorr test and $\sup_{n}\|M_{n}\|_{p}$ is computable.

Proof.

The function $M_{n}$ is non-negative lsc by Proposition 7.1, and it is $L_{p}(\nu)$ -computable by Proposition 7.5. To see that it satisfies the martingale condition, from $M_{n+1}=\mathbb{E}_{\nu}[f\mid\mathscr{F}_{n+1}]$ everywhere we have $\mathbb{E}_{\nu}[M_{n+1}\mid\mathscr{F}_{n}]=\mathbb{E}_{\nu}[\mathbb{E}_{\nu}[f\mid\mathscr{F}_{n+1}]\mid\mathscr{F}_{n}]$ everywhere. And by the tower property (Proposition 7.7), the latter is equal to $\mathbb{E}_{\nu}[f\mid\mathscr{F}_{n}]$ on $\mathsf{SR}^{\nu}$ , which is by definition $M_{n}$ .

Suppose now that the filtration is almost-full and $p>1$ . By almost-fullness and Theorem 1.5, we have that $f=\lim_{n}M_{n}$ on $\mathsf{SR}^{\nu}$ . Since $p>1$ we have $\sup_{n}M_{n}$ is in $L_{p}(\nu)$ by Lemma 5.1(1). Then we can dominate $M_{n}^{p}$ by $(\sup_{n}M_{n})^{p}$ and argue by DCT as follows, where the first identity comes from Proposition 12.5:

\sup_{n}\|M_{n}\|_{p}^{p}=\lim_{n}\|M_{n}\|_{p}^{p}=\lim_{n}\int M_{n}^{p}\;d\nu=\int\lim_{n}M_{n}^{p}\;d\nu=\int f^{p}\;d\nu

(12.1)

Since $f$ is by hypothesis a computable point of $L_{p}(\nu)$ , we have that $\sup_{n}\|M_{n}\|_{p}^{p}$ is computable and hence likewise $\sup_{n}\|M_{n}\|_{p}$ is computable. ∎

In conjunction with Corollary 11.3, the following proposition then gives the (1) to (2) direction of Theorem 12.2. As mentioned in §1.4, the proof largely follows the outline of Rute’s own Hilbert space proof.

Proposition 12.7.

Suppose that the filtration is almost-full.

Suppose $M_{n}$ is a martingale of $L_{2}(\nu)$ Schnorr tests such that both $\sup_{n}\|M_{n}\|_{2}$ is computable and $\sup_{n}M_{n}$ is an $L_{2}(\nu)$ Schnorr test.

Then there is an $L_{2}(\nu)$ -computable function $f$ such that $M_{n}=\mathbb{E}_{\nu}[f_{\infty}\mid\mathscr{F}_{n}]$ on $\mathsf{SR}^{\nu}$ for each $n\geq 0$ .

Further, the $L_{2}(\nu)$ -computable function $f$ can be taken to be a pointwise limit of a computable subsequence of the $M_{n}$ , which limit exists at least on $\mathsf{SR}^{\nu}$ .

Proof.

Recall that for $n>k$ we have by Hilbert space methods in $L_{2}(\nu)$ that $\mathbb{E}_{\nu}M_{k}M_{n}=\mathbb{E}_{\nu}M_{k}^{2}$ [25, 488]. This implies that for $n>k$ we have:

\|M_{n}-M_{k}\|_{2}^{2}=\mathbb{E}_{\nu}(M_{n}-M_{k})^{2}=\mathbb{E}_{\nu}M_{n}^{2}-\mathbb{E}_{\nu}M_{k}^{2}=\|M_{n}\|_{2}^{2}-\|M_{k}\|_{2}^{2}

Let $f=\lim_{n}M_{n}$ , which classically is in $L_{2}(\nu)$ . Since $\sup_{n}M_{n}$ is in $L_{2}(\nu)$ , we can dominate $(M_{n}-M_{k})^{2}$ by $2\cdot(\sup_{n}M_{n})^{2}$ and argue by DCT and the previous equation that:

\|f-M_{k}\|_{2}^{2}=\lim_{n}\|M_{n}-M_{k}\|_{2}^{2}=\lim_{n}\|M_{n}\|_{2}^{2}-\|M_{k}\|_{2}^{2}=(\sup_{n}\|M_{n}\|_{2})^{2}-\|M_{k}\|_{2}^{2}

Since the latter is a computable real which goes to zero, we can compute a subsequence $M_{k(n)}$ which converges to $f$ fast in $L_{2}(\nu)$ . By Proposition 4.3 and Definition 11.1, we have that $M_{k(n)}\rightarrow f_{\infty}$ on $\mathsf{SR}^{\nu}$ .

For each $m\geq 0$ , let $g_{m}=\mathbb{E}_{\nu}[\sup_{n}M_{n}\mid\mathscr{F}_{m}]$ , so that $g_{m}$ is an $L_{2}(\nu)$ Schnorr test by Proposition 7.6(2), and so $g_{m}$ is finite on $\mathsf{SR}^{\nu}$ . Then by Conditional DCT (Proposition 7.4(2)), for each $m\geq 0$ we have that $\mathbb{E}_{\nu}[M_{k(n)}\mid\mathscr{F}_{m}]\rightarrow\mathbb{E}_{\nu}[f_{\infty}\mid\mathscr{F}_{m}]$ on $\mathsf{SR}^{\nu}$ . But by Proposition 12.4 for $k(n)>m$ we have that the former is equal to $M_{m}$ on $\mathsf{SR}^{\nu}$ . Hence we have that $M_{m}=\mathbb{E}_{\nu}[f_{\infty}\mid\mathscr{F}_{m}]$ on $\mathsf{SR}^{\nu}$ for each $m\geq 0$ . ∎
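The orthogonality identity $\|M_{n}-M_{k}\|_{2}^{2}=\|M_{n}\|_{2}^{2}-\|M_{k}\|_{2}^{2}$ used in the proof above, together with the monotonicity of norms from Proposition 12.5 for $p=2$, can be verified exactly on a small finite model. This is a sketch only: the space $\{0,1\}^{3}$ with the uniform measure, the terminal variable `f`, and the helpers `expect`, `M`, `sq_norm`, `sq_dist`, and `identity_holds` are hypothetical and chosen for illustration.

```python
from fractions import Fraction
from itertools import product

N = 3
points = list(product((0, 1), repeat=N))  # uniform measure on {0,1}^N

def f(x):
    """Hypothetical non-negative terminal variable on {0,1}^N."""
    return Fraction(1 + x[0] + 2 * x[1] * x[2])

def expect(g):
    """Expectation of g under the uniform measure."""
    return sum(g(x) for x in points) / len(points)

def M(n, x):
    """M_n = E[f | F_n]: average of f over the points agreeing with x on the first n bits."""
    cell = [y for y in points if y[:n] == x[:n]]
    return sum(f(y) for y in cell) / len(cell)

def sq_norm(n):
    """Squared L_2 norm of M_n."""
    return expect(lambda x: M(n, x) ** 2)

def sq_dist(n, k):
    """Squared L_2 distance between M_n and M_k."""
    return expect(lambda x: (M(n, x) - M(k, x)) ** 2)

def identity_holds():
    """Check ||M_n - M_k||^2 = ||M_n||^2 - ||M_k||^2 for all 0 <= k <= n <= N."""
    return all(
        sq_dist(n, k) == sq_norm(n) - sq_norm(k)
        for k in range(N + 1)
        for n in range(k, N + 1)
    )
```

In this toy model the squared norms increase from $(\mathbb{E}f)^{2}$ at stage $0$ to $\mathbb{E}f^{2}$ at stage $N$, so the right-hand side of the identity is the quantity whose convergence to zero drives the choice of the fast subsequence in the proof.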

13. Conclusion

The main results of this paper (Theorems 1.5, 1.6, 1.8, 1.9, 1.11) characterise the points under which Lévy’s Upward Theorem holds in terms of notions from algorithmic randomness and the rates of convergence in terms of concepts from the classical theory of computation. As discussed in §1.4 this builds on work by previous authors. What is new are the results on rates of convergence in Theorems 1.6, 1.9, the articulation of the general framework of effective disintegrations (see Definition 1.3, §7 for fundamental properties, and Appendix B for examples), a conceptually new proof of the characterisation of density randomness in the more general framework of effective disintegrations for $p>1$ (Theorem 1.9), and the articulation of the new concept of Maximal Doob Randomness (cf. Definition 1.10, Theorem 1.11, and Question 1.12). As far as Schnorr randomness goes, we noted in §1.4 that Theorem 1.5 can be derived from Rute’s work, modulo the verification of certain properties of effective disintegrations and the Miyabe translation method in §11. We have extended Rute on Schnorr randomness in the generalisation of the $L_{2}(\nu)$ martingale result in §12. We have also sought to present very accessible proofs, based almost entirely on the concept of $L_{p}(\nu)$ Schnorr test.

Our results also contribute to understanding the significance of convergence to the truth results for Bayesian inference. As was pointed out by philosophers of science, the probability one qualification in theorems like Lévy’s Upward Theorem raises the spectre of arbitrariness: a Bayesian with credences represented by a probability measure $\nu$ believes in convergence to the truth with certainty, but might do so only by arbitrarily packaging into a set of probability zero those points at which convergence fails [2], [18, pp. 144 ff], [28, pp. 28-29]. We have shown that for certain classes of effective random variables the packaging is anything but arbitrary. The probability one set on which convergence to the truth is successful coincides with standard classes of points which are algorithmically random by the lights of the computable probability measure. Thus, the effective typicality expressed by convergence to the truth is extensionally equivalent with a principled effective typicality of the underlying probability measure.

Appendix A Examples of classical disintegrations

In this appendix, we review two classical examples of disintegrations. This also affords us the opportunity to illustrate natural circumstances in which Lévy’s Upward Theorem need not hold for all points. Another reason to dwell on these two examples is that one of the main theorems of Rohlin is that, up to Borel isomorphism, “blendings” of these two examples are the only examples of disintegrations of countably generated $\sigma$ -algebras [59, §4 pp. 40-41], [12, Theorem 1.12 p. 12]. Our diagrams in this appendix are inspired by the few diagrams in Einsiedler-Ward [19, pp. 122-123], although they only work with a single $\sigma$ -algebra rather than a filtration.

Most concrete examples of disintegrations involve products. We write $\lambda\otimes\mu$ for the product measure on $Y\times Z$ formed from finite measure $\lambda$ on $Y$ and finite measure $\mu$ on $Z$ .

Example A.1.

(Refined partitions of the unit square). Let $X=[0,1]\times[0,1]$ with measure $\nu=m\otimes m$ being the product of Lebesgue measure $m$ on $[0,1]$ with itself. Let $\mathscr{D}_{n}$ be the dyadic partition of $X$ into $4^{n-1}$ many squares, and let $\mathscr{F}_{n}$ be the $\sigma$ -algebra it generates. We can visualise the elements of $\mathscr{F}_{n}$ as any shape one can form from the squares in the below diagrams, so that like in pixelations more detailed shapes become available as $n$ gets larger:

In this diagram, we use the familiar diagrammatic conventions from point-set topology to indicate which components of the partition contain the edges: for instance, for $n\geq 2$ , the southwest square and the northwest square have two edges, the southeast square has three edges, and the northeast square has four edges. Then the following map is a disintegration of $\mathscr{F}_{n}$ , where $\mathcal{M}^{+}(X)$ is again the set of finite Borel measures on $X$ :

\rho^{(n)}:X\rightarrow\mathcal{M}^{+}(X)\hskip 8.53581pt\mbox{ by }\hskip 8.53581pt\rho^{(n)}_{w}=\sum_{Q\in\mathscr{D}_{n}}\nu(\cdot\mid Q)\cdot I_{Q}(w)

(A.1)

Further, using equation (1.1) from §1.2, one has the following associated formula for the version of conditional expectation of $f$ with respect to $\mathscr{F}_{n}$ :

\mathbb{E}_{\nu}[f\mid\mathscr{F}_{n}](w)=\sum_{Q\in\mathscr{D}_{n}}\bigg{(}\frac{1}{\nu(Q)}\int_{Q}f(x)\;d\nu(x)\bigg{)}\cdot I_{Q}(w)

(A.2)

Suppose that at stage $n$ , the agent’s world $w$ is located in the square $Q_{n}(w)$ from $\mathscr{D}_{n}$ . Intuitively this means that the agent’s evidence at this stage of inquiry is $Q_{n}(w)$ . Then (A.2) says that the agent’s best estimate as to the value of a random variable $f$ at this stage is obtained by averaging $f$ over the event $Q_{n}(w)$ according to the prior probability measure $\nu$ , and then making it higher to the extent that the prior probability $\nu(Q_{n}(w))$ is lower. In the case where the random variable $f$ is the indicator function $I_{C}$ of a Borel event $C$ , this best estimate is just the usual conditional probability $\nu(C\mid Q_{n}(w))$ . For instance, if $C$ is the closed polygon displayed below, then the conditional probability of the agent at stage $n$ is higher than if she were at another world $w^{\prime}$ iff there is more overlap between $Q_{n}(w),C$ than between $Q_{n}(w^{\prime}),C$ :

This example also vividly illustrates how $\lim_{n}\mathbb{E}_{\nu}[f\mid\mathscr{F}_{n}](w)=f(w)$ can fail. For instance, take the vertex $w=(.75,.75)$ , which is in the polygon since it is closed. For all $n\geq 3$ , this point is the southwest vertex of a dyadic square in $\mathscr{F}_{n}$ , and such a square overlaps the closed polygon only at this vertex. Hence for the usc function $f=I_{C}$ , one has both $f(w)=1$ and $\mathbb{E}_{\nu}[f\mid\mathscr{F}_{n}](w)=0$ for all $n\geq 3$ .

In simple examples like this one, geometric intuition can guide us as to the points at which $\lim_{n}\mathbb{E}_{\nu}[f\mid\mathscr{F}_{n}]=f$ holds. The further assurance the classical version of Lévy’s Upward Theorem provides is that $\lim_{n}\mathbb{E}_{\nu}[f\mid\mathscr{F}_{n}]=f$ holds on a set of $\nu$ -probability one, even when geometric intuition is unavailable. The additional assurance that Theorems 1.5, 1.8, 1.11 provide is that $\lim_{n}\mathbb{E}_{\nu}[f\mid\mathscr{F}_{n}]=f$ holds on the random points relative to $\nu$ , for a large class of effective random variables $f$ . From this perspective, the problem with our vertex $w=(.75,.75)$ is that it is not sufficiently random, which enabled us to construct a random variable which failed to converge to the truth at this point.
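The failure at the vertex $w=(.75,.75)$ can be reproduced by evaluating (A.2) for an indicator function. Since the polygon of Example A.1 is only given by a diagram, the sketch below substitutes a hypothetical closed set, the square $C=[.5,.75]\times[.5,.75]$, which has $w$ as a vertex. The function names and the convention that partition intervals are half-open $[a,b)$ except for the last one are likewise assumptions made for illustration.

```python
from fractions import Fraction

def dyadic_interval(t, n):
    """Endpoints of the stage-n piece of [0,1] (2^(n-1) pieces) that contains t.

    Assumed convention: pieces are half-open [a, b), except the last, which contains 1.
    """
    pieces = 2 ** (n - 1)
    index = min(int(t * pieces), pieces - 1)
    return Fraction(index, pieces), Fraction(index + 1, pieces)

def overlap(a, b, c, d):
    """Length of the intersection of the intervals [a, b] and [c, d]."""
    return max(Fraction(0), min(b, d) - max(a, c))

def cond_prob(w, n, rect):
    """nu(C | Q_n(w)) for a closed rectangle C = [x0, x1] x [y0, y1], following (A.2)."""
    (x0, x1), (y0, y1) = rect
    ax, bx = dyadic_interval(w[0], n)
    ay, by = dyadic_interval(w[1], n)
    area_q = (bx - ax) * (by - ay)
    return overlap(ax, bx, x0, x1) * overlap(ay, by, y0, y1) / area_q

# Hypothetical closed set C and the vertex w = (.75, .75) from the text.
C = ((Fraction(1, 2), Fraction(3, 4)), (Fraction(1, 2), Fraction(3, 4)))
W = (Fraction(3, 4), Fraction(3, 4))
```

With these choices `cond_prob(W, n, C)` vanishes for every `n >= 3`, although $I_{C}(w)=1$, while at a point in the interior of $C$ the value reaches $1$ once the dyadic square around the point fits inside $C$.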

Example A.2.

(Refined lines in the unit square) Let $X=[0,1]\times[0,1]$ with Lebesgue measure $\nu=m\otimes m$ being the product of Lebesgue measure $m$ on $[0,1]$ with itself. Let $\mathscr{G}_{n}$ be the $\sigma$ -algebra on $[0,1]\times[0,1]$ generated by events of the form $B\times Q$ , where $B\subseteq[0,1]$ is Borel and where $Q$ is from a dyadic partition $\mathscr{D}_{n}$ of $[0,1]$ into $2^{n-1}$ (half)-closed intervals of equal length. Intuitively, $\mathscr{G}_{n}$ is the $\sigma$ -algebra of evidence where the agent knows everything there is to know about the $x$ -component at the outset, but is progressively learning more about the $y$ -component. Since events of the form $\{x\}\times[0,1]$ are in $\mathscr{G}_{1}$ , we can depict the $\sigma$ -algebra $\mathscr{G}_{1}$ as the decomposition of $X$ into vertical lines. Likewise, we can depict $\mathscr{G}_{2}$ as the decomposition of $X$ into half vertical lines, etc. While we draw only ten such vertical lines in the below diagram, the idea is that $X$ is being decomposed into continuum-many such vertical lines at each stage:

Then the following map is a disintegration of $\mathscr{G}_{n}$ , where $\delta_{u}$ is the Dirac measure centred on $u$ :

\rho^{(n)}:X\rightarrow\mathcal{M}^{+}(X)\hskip 8.53581pt\mbox{ by }\hskip 8.53581pt\rho^{(n)}_{(u,v)}=\delta_{u}\otimes\sum_{Q\in\mathscr{D}_{n}}m(\cdot\mid Q)\cdot I_{Q}(v)

(A.3)

Further, using equation (1.1) from §1.2, one has the following associated formula for the version of conditional expectation of $g$ with respect to $\mathscr{G}_{n}$ :

\mathbb{E}_{\nu}[g\mid\mathscr{G}_{n}](u,v)=\sum_{Q\in\mathscr{D}_{n}}\bigg{(}\frac{1}{m(Q)}\int_{Q}g(u,t)\;dm(t)\bigg{)}\cdot I_{Q}(v)

(A.4)

Suppose that at stage $n$ , the agent’s world $(u,v)$ is such that its second coordinate $v$ is located in the interval $Q_{n}(v)$ from $\mathscr{D}_{n}$ . Intuitively this means that the agent’s evidence at this stage of inquiry is the line $\{u\}\times Q_{n}(v)$ . Then (A.4) says that the agent’s best estimate as to the value of a random variable $g$ at this world and stage is obtained by defining the one-place random variable $f(v)=g(u,v)$ and then doing a one-dimensional analogue of the update in Example A.1. For instance, if $C$ is the displayed closed triangle and we consider the usc function $g=I_{C}$ , then $\mathbb{E}_{\nu}[g\mid\mathscr{G}_{n}](u,v)$ is obtained by calculating the length of the line $C\cap(\{u\}\times Q_{n}(v))$ , and then by multiplying by a factor of $2^{n-1}$ which is responsive to smaller partitions of the $y$ -axis involving less likely events. We illustrate this with respect to the marked point $(u,v)=(\frac{1}{3},\frac{2}{3})$ in the below diagram, where the line $C\cap(\{u\}\times Q_{n}(v))$ is indicated with a heavier dark line:

In contrast to the previous Example A.1, in this example many of the events in $\mathscr{G}_{n}$ have measure zero according to the prior probability measure $\nu$ . Like in the previous Example A.1, we have natural pointwise failures of $\lim_{n}\mathbb{E}_{\nu}[f\mid\mathscr{F}_{n}]=f$ here as well: the rightmost vertex of the triangle displays the same kind of failure as in the previous example. And like in that case, the interpretation suggested by Theorems 1.5, 1.8, 1.11 is that the vertex is insufficiently random.
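Formula (A.4) can be evaluated in the same exact fashion. The triangle of Example A.2 is only given by a diagram, so the sketch below assumes a hypothetical closed triangle $C=\{(x,y):0\leq y\leq 1-x,\ 0\leq x\leq 1\}$, whose rightmost vertex is $(1,0)$. The function names and the half-open interval convention are again illustrative assumptions.

```python
from fractions import Fraction

def dyadic_interval(t, n):
    """Endpoints of the stage-n piece of [0,1] (2^(n-1) pieces) that contains t.

    Assumed convention: pieces are half-open [a, b), except the last, which contains 1.
    """
    pieces = 2 ** (n - 1)
    index = min(int(t * pieces), pieces - 1)
    return Fraction(index, pieces), Fraction(index + 1, pieces)

def cond_exp_triangle(u, v, n):
    """E[I_C | G_n](u, v) via (A.4) for the hypothetical triangle C = {0 <= y <= 1 - x}."""
    a, b = dyadic_interval(v, n)
    top = 1 - u  # the vertical slice of C at first coordinate u is the segment [0, 1 - u]
    length = max(Fraction(0), min(b, top) - a)  # length of the slice inside Q_n(v) = [a, b]
    return length / (b - a)  # dividing by m(Q_n(v)) is the factor 2^(n-1)

RIGHT_VERTEX = (Fraction(1), Fraction(0))
```

At the rightmost vertex the slice of $C$ is the single point $\{0\}$, so `cond_exp_triangle(*RIGHT_VERTEX, n)` is $0$ at every stage even though the vertex belongs to $C$, which mirrors the kind of failure described in the text.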

Appendix B Examples of effective disintegrations

In this section, we describe several examples of effective disintegrations (cf. Definition 1.3). We focus for the most part on effectivizing the two paradigmatic Examples A.1-A.2 from the previous appendix, but we also include a countable product (Example B.13). In a sequel to this paper, we look also at Bayesian parameter spaces and sample spaces.

One example like Example A.1 is already widely-used in algorithmic randomness, although it is not usually thematized as such:

Example B.1.

(The canonical concrete refined partition disintegrations).

Suppose that $T\subseteq\mathbb{N}^{<\mathbb{N}}$ is a computable tree with no dead ends. Let $X=[T]$ , the paths through $T$ , which is a computable Polish space, and suppose $\nu$ in $\mathcal{P}(X)$ is computable with full support.

Suppose that $\mathscr{F}_{n}$ is the effective refined partition generated by the length $n$ strings in $T$ . That is, $\mathscr{F}_{n}$ is generated by the sets $[\sigma]$ of paths in $T$ through the length $n$ strings $\sigma$ .

Let $\rho^{(n)}:X\rightarrow\mathcal{P}(X)$ by $\rho^{(n)}_{\omega}=\nu(\cdot\mid[\omega\upharpoonright n])$ .

Then $\rho^{(n)}$ is a Martin-Löf disintegration of $\mathscr{F}_{n}$ with respect to $\nu$ and one has the following expression for the version of conditional expectation, which is defined for all $f$ in $\mathbb{L}_{1}(\nu)$ and all $\omega$ in $X$ :

\mathbb{E}_{\nu}[f\mid\mathscr{F}_{n}](\omega)=\frac{1}{\nu([\omega\upharpoonright n])}\int_{[\omega\upharpoonright n]}f\;d\nu

(B.1)

Since we want to generalise this in what follows, we defer the verification that it is a Martin-Löf disintegration.
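Formula (B.1) is easy to evaluate exactly when $f$ depends only on finitely many bits. The following sketch assumes Cantor space with a Bernoulli product measure of hypothetical bias `P`, and a hypothetical integrand `f` that reads only the first `DEPTH` bits. None of these choices comes from the text, and the code makes no attempt to represent general elements of $\mathbb{L}_{1}(\nu)$.

```python
from fractions import Fraction
from itertools import product

P = Fraction(1, 3)  # hypothetical bias: probability that a bit equals 1
DEPTH = 4           # the integrand f below only reads the first DEPTH bits

def nu(sigma):
    """Measure of the cylinder [sigma] under the Bernoulli(P) product measure."""
    mass = Fraction(1)
    for bit in sigma:
        mass *= P if bit == 1 else 1 - P
    return mass

def f(omega):
    """Hypothetical integrand: the number of ones among the first DEPTH bits."""
    return Fraction(sum(omega[:DEPTH]))

def integral_over(sigma):
    """Integral of f over [sigma] for len(sigma) <= DEPTH, summing over length-DEPTH extensions."""
    total = Fraction(0)
    for tail in product((0, 1), repeat=DEPTH - len(sigma)):
        tau = tuple(sigma) + tail
        total += f(tau) * nu(tau)
    return total

def cond_exp(omega, n):
    """Right-hand side of (B.1): the nu-average of f over the cylinder of omega's first n bits."""
    sigma = tuple(omega[:n])
    return integral_over(sigma) / nu(sigma)
```

Since `P` lies strictly between $0$ and $1$, every cylinder has positive measure, which corresponds to the full support assumption and keeps the division in `cond_exp` well defined.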

The following is an effective disintegration like Example A.2:

Example B.2.

(The canonical concrete refined lines disintegrations).

Suppose that $S,T\subseteq\mathbb{N}^{<\mathbb{N}}$ are computable trees with no dead ends. Let $Y=[S]$ and $Z=[T]$ , the paths through $S,T$ respectively, which are computable Polish spaces, and suppose $\lambda$ in $\mathcal{P}(Y)$ and $\mu$ in $\mathcal{P}(Z)$ are computable with full support.

Let $\mathscr{F}_{n}$ be the $\sigma$ -algebra on $Y\times Z$ generated by sets of the form $U\times[\tau]$ , where $U$ ranges over c.e. opens from a $\lambda$ -computable basis on $Y$ , and where $\tau$ ranges over length $n$ strings in $T$ .

Then the map $\rho^{(n)}:Y\times Z\rightarrow\mathcal{M}^{+}(Y\times Z)$ given by $\rho^{(n)}_{(\omega,\omega^{\prime})}=\delta_{\omega}\otimes\mu(\cdot\mid[\omega^{\prime}\upharpoonright n])$ is a Martin-Löf disintegration of $\mathscr{F}_{n}$ with respect to $\lambda\otimes\mu$ , and one has the following expression for the version of conditional expectation, which for each $f$ in $\mathbb{L}_{1}(\lambda\otimes\mu)$ is defined for $(\lambda\otimes\mu)$ -a.s. many $(\omega,\omega^{\prime})$ in $Y\times Z$ :

\mathbb{E}_{\lambda\otimes\mu}[f\mid\mathscr{F}_{n}](\omega,\omega^{\prime})=\frac{1}{\mu([\omega^{\prime}\upharpoonright n])}\int_{[\omega^{\prime}\upharpoonright n]}f(\omega,\theta)\;d\mu(\theta)

(B.2)

Note that $\omega$ is free under the integral sign. Since there are no continuity assumptions on $f$ (it is merely an element of $\mathbb{L}_{1}(\lambda\otimes\mu)$ ) the value of the conditional expectation in (B.2) can a priori change drastically with small changes of $\omega$ . By contrast, in (B.1), $\omega$ ’s contribution is restricted to its first $n$ bits.

In what follows, we want to generalise these two examples to a broader class of computable Polish spaces and verify that they are indeed Martin-Löf disintegrations. We begin by generalising the way in which the previous examples involve partitions. We define a special case of Definition 1.1(11):

Definition B.3.

A sub- $\sigma$ -algebra $\mathscr{F}$ of the Borel sets on $X$ is a $\nu$ -effective partition if it is generated by a computable sequence of events $\{A_{i}:i\in I\}$ from the algebra $\mathscr{A}$ generated by a $\nu$ -computable basis $\mathscr{B}$ such that the events $\{A_{i}:i\in I\}$ are a partition of $X$ .

Given such a partition with its computable index set $I$ , we define the c.e. set $I^{+}=\{i\in I:\nu(A_{i})>0\}$ .

Further, a $\nu$ -effective softening of $\mathscr{F}$ is a pairwise disjoint computable sequence of c.e. opens $\{U_{i}:i\in I\}$ such that $U_{i}=A_{i}$ on $\mathsf{KR}^{\nu}$ .

Proposition B.4.

Every effective partition has an effective softening.

Proof.

Since the computable sequence $A_{i}$ comes from the algebra generated by a $\nu$ -computable basis, by Proposition 2.13, there is a computable sequence of c.e. opens $V_{i}$ and effectively closed $C_{i}\supseteq V_{i}$ with $\nu(C_{i})=\nu(A_{i})$ and $V_{i}=A_{i}$ on $\mathsf{KR}^{\nu}$ . Then define recursively the sequence of c.e. opens by $U_{0}=V_{0}$ and $U_{n+1}=V_{n+1}\setminus\bigcup_{m\leq n}C_{m}$ . ∎

Softenings of full partitions are a canonical way to obtain almost-full effective filtrations (cf. Definition 1.1(12)):

Proposition B.5.

Suppose that $\mathscr{F}_{n}$ is a full $\nu$ -effective partition equipped with effective softenings which generate $\sigma$ -algebras $\mathscr{G}_{n}$ . Then $\mathscr{G}_{n}$ is an almost-full $\nu$ -effective partition.

Proof.

Uniformly from an index for a c.e. open $U$ , we can compute an index for a sequence $A_{m_{i}}$ from the sequences which generate the filtration $\mathscr{F}_{0},\mathscr{F}_{1},\ldots$ such that $U=\bigcup_{i}A_{m_{i}}$ . Each $A_{m_{i}}$ is equal on $\mathsf{KR}^{\nu}$ to $U_{m_{i}}$ , where the latter comes from the softening. Then $U$ is equal on $\mathsf{KR}^{\nu}$ to $\bigcup_{i}U_{m_{i}}$ . ∎

Here is how to organise a suitably generalised version of a single stage of the filtration of Example A.1:

Proposition B.6.

Let $\nu$ be a computable point of $\mathcal{P}(X)$ . Let $\mathscr{F}$ be an effective partition $\{A_{i}:i\in I\}$ of $X$ with effective softening $\{U_{i}:i\in I\}$ .

Let $\rho:X\rightarrow\mathcal{M}^{+}(X)$ by $\rho_{x}=\sum_{i\in I^{+}}\nu(\cdot\mid U_{i})\cdot I_{U_{i}}(x)$ .

Then $\rho$ is a Martin-Löf disintegration of $\mathscr{F}$ with respect to $\nu$ and one has the following expression for the version of conditional expectation, which is defined for all $f$ in $\mathbb{L}_{1}(\nu)$ and all $x$ in $X$ :

\mathbb{E}_{\nu}[f\mid\mathscr{F}](x)=\sum_{i\in I^{+}}\bigg{(}\frac{1}{\nu(U_{i})}\int_{U_{i}}f\;d\nu\bigg{)}\cdot I_{U_{i}}(x)

(B.3)

Proof.

By the definition of effective softening, the sets $U_{i},U_{j}$ for distinct $i,j$ in $I^{+}$ have empty intersection. Hence, the map $\rho$ has codomain $\mathcal{M}^{+}(X)$ . In particular, if $x$ is in $U_{i}$ for $i$ in $I^{+}$ , then $\rho_{x}=\nu(\cdot\mid U_{i})$ , which is in $\mathcal{P}(X)$ and hence in $\mathcal{M}^{+}(X)$ . But if $x$ is not in any $U_{i}$ for $i$ in $I^{+}$ , then $\rho_{x}=0$ , which is a point of $\mathcal{M}^{+}(X)$ . (Footnote 95: If one does not introduce softenings, then this part of the argument breaks down and $\rho_{x}$ need not be a finite measure.)

By the definition in (1.1) and since $\frac{d\nu(\cdot\mid U_{i})}{d\nu}=\frac{1}{\nu(U_{i})}\cdot I_{U_{i}}$ for $i$ in $I^{+}$ , one has the following for all $x$ in $X$ :

\mathbb{E}_{\nu}[f\mid\mathscr{F}](x)=\sum_{i\in I^{+}}\big{(}\int_{X}f(v)\;d\nu(\cdot\mid U_{i})(v)\big{)}\cdot I_{U_{i}}(x)=\sum_{i\in I^{+}}\bigg{(}\frac{1}{\nu(U_{i})}\int_{U_{i}}f(v)\;d\nu(v)\bigg{)}\cdot I_{U_{i}}(x)

If $j$ is in $I^{+}$ , then when we integrate over $x$ in $A_{j}$ with respect to $\nu$ we get

\int_{A_{j}}\mathbb{E}_{\nu}[f\mid\mathscr{F}](x)\;d\nu(x)=\int_{A_{j}}\frac{1}{\nu(U_{j})}\int_{U_{j}}f(v)\;d\nu(v)\;d\nu(x)=\int_{A_{j}}f(v)\;d\nu(v)

If $j$ is not in $I^{+}$ then the event $A_{j}$ is $\nu$ -null and so trivially we get:

\int_{A_{j}}\mathbb{E}_{\nu}[f\mid\mathscr{F}](x)\;d\nu(x)=\int_{A_{j}}f(v)\;d\nu(v)

Since elements $A_{j}$ generate $\mathscr{F}$ , this shows that (B.3) is a version of the conditional expectation of $f$ with respect to $\mathscr{F}$ . Further, it is totally defined for all $f$ in $\mathbb{L}_{1}(\nu)$ and all $x$ in $X$ .

Since $\mathscr{F}$ is an effective partition, one has that $[x]_{\mathscr{F}}=A_{i}$ for $x$ in $A_{i}$ . Further, for $x$ in $\mathsf{KR}^{\nu}\cap A_{i}$ , we have that $i$ is in $I^{+}$ . Hence for $x$ in $\mathsf{KR}^{\nu}\cap A_{i}$ we have $\rho_{x}([x]_{\mathscr{F}}\cap\mathsf{KR}^{\nu})=\rho_{x}(A_{i}\cap\mathsf{KR}^{\nu})=\nu(A_{i}\cap\mathsf{KR}^{\nu}\mid U_{i})=\nu(A_{i}\mid U_{i})=1$ . The same argument works for $\mathsf{SR}^{\nu}$ and $\mathsf{MLR}^{\nu}$ .

Suppose that $U$ is c.e. open and $q\geq 0$ is rational. Since the $U_{i}$ are pairwise disjoint, one has that $\rho_{x}(U)>q$ iff there is $i$ in $I^{+}$ such that $x$ is in $U_{i}$ and $\nu(U\mid U_{i})>q$ , which is a c.e. open condition in variable $x$ . ∎

Here is how to organise a suitably generalised version of the initial step of the filtration of Example A.2, that is, where the partition on the second component consists just of a single set (like $\mathscr{G}_{1}$ in Example A.2).

Proposition B.7.

Suppose that $Y,Z$ are computable Polish spaces. Suppose that $\lambda$ is a computable point of $\mathcal{P}(Y)$ and $\mu$ is a computable point of $\mathcal{P}(Z)$ . Let $\mathscr{F}$ be the $\lambda\otimes\mu$ -effective $\sigma$ -algebra on $Y\times Z$ generated by sets of the form $U\times Z$ , where $U$ ranges over c.e. opens from a $\lambda$ -computable basis on $Y$ . Then $\rho:Y\times Z\rightarrow\mathcal{P}(Y\times Z)$ given by $\rho_{(u,v)}=\delta_{u}\otimes\mu$ is a Martin-Löf disintegration of $\mathscr{F}$ with respect to $\lambda\otimes\mu$ , and one has the following expression for the version of conditional expectation, which for each $f$ in $\mathbb{L}_{1}(\lambda\otimes\mu)$ is defined for $(\lambda\otimes\mu)$ -a.s. many $(u,v)$ in $Y\times Z$ :

\mathbb{E}_{\lambda\otimes\mu}[f\mid\mathscr{F}](u,v)=\int_{Z}f(u,t)\;d\mu(t)

(B.4)

Proof.

By the definition in (1.1) and by Fubini-Tonelli, one has the following for $f$ in $\mathbb{L}_{1}(\lambda\otimes\mu)$ , and by the same theorem it is defined for $\lambda$ -a.s. many $u$ in $Y$ , and hence for $(\lambda\otimes\mu)$ -a.s. many $(u,v)$ in $Y\times Z$ :

\mathbb{E}_{\lambda\otimes\mu}[f\mid\mathscr{F}](u,v)=\int_{Y\times Z}f(s,t)\;d\rho_{(u,v)}(s,t)=\int_{Z}\int_{Y}f(s,t)\;d\delta_{u}(s)\;d\mu(t)=\int_{Z}f(u,t)\;d\mu(t)

Since $v$ from $Z$ does not appear free in this last term, when we integrate with respect to $v$ in $Z$ we get:

\int_{Z}\mathbb{E}_{\lambda\otimes\mu}[f\mid\mathscr{F}](u,v)\;d\mu(v)=\int_{Z}f(u,t)\;d\mu(t)

Hence for Borel subsets $B$ of $Y$ we have by Fubini-Tonelli that:

\begin{aligned}
\int_{B\times Z}\mathbb{E}_{\lambda\otimes\mu}[f\mid\mathscr{F}](u,v)\;d(\lambda\otimes\mu)(u,v)&=\int_{B}\int_{Z}f(u,t)\;d\mu(t)\;d\lambda(u)\\
&=\int_{B\times Z}f(u,t)\;d(\lambda\otimes\mu)(u,t)
\end{aligned}

This shows that (B.4) is a version of the conditional expectation of $f$ with respect to $\mathscr{F}$.

Note that $[(u,v)]_{\mathscr{F}}=\{u\}\times Z$, for any $(u,v)\in Y\times Z$. Further, recall that $\mathsf{KR}^{\lambda\otimes\mu}(Y\times Z)=\mathsf{KR}^{\lambda}(Y)\times\mathsf{KR}^{\mu}(Z)$.⁹⁶ Hence for $(u,v)$ in $\mathsf{KR}^{\lambda\otimes\mu}(Y\times Z)$ one has the identity:

⁹⁶ One can easily check this by hand. It also follows from the fact that Kurtz randomness is preserved both ways under computable continuous open maps.

[(u,v)]_{\mathscr{F}}\cap\mathsf{KR}^{\lambda\otimes\mu}(Y\times Z)=(\{u\}\cap\mathsf{KR}^{\lambda}(Y))\times(Z\cap\mathsf{KR}^{\mu}(Z))=\{u\}\times\mathsf{KR}^{\mu}(Z)

From this we get $\rho_{(u,v)}([(u,v)]_{\mathscr{F}}\cap\mathsf{KR}^{\lambda\otimes\mu}(Y\times Z))=\delta_{u}(\{u\})\cdot\mu(\mathsf{KR}^{\mu}(Z))=1$ .

In this next paragraph, we use some notation familiar from Fubini-Tonelli, namely if $A\subseteq Y\times Z$ and $s$ in $Y$ , then $A_{s}$ is defined to be $\{t\in Z:(s,t)\in A\}$ .

For Schnorr disintegrations, suppose that $(u,v)$ is in $\mathsf{SR}^{\lambda\otimes\mu}(Y\times Z)$, so that $u$ is in $\mathsf{SR}^{\lambda}(Y)$. Since $[(u,v)]_{\mathscr{F}}=\{u\}\times Z$, we want to show that $(\delta_{u}\otimes\mu)(A)=1$, where $A$ is the event $(\{u\}\times Z)\cap\mathsf{SR}^{\lambda\otimes\mu}(Y\times Z)$. We have $A_{u}=\{t\in Z:(u,t)\in\mathsf{SR}^{\lambda\otimes\mu}(Y\times Z)\}$. By choosing a Turing degree ${\bf a}$ which computes a fast Cauchy sequence for $u$, we have by van Lambalgen’s Theorem that $A_{u}\supseteq\mathsf{SR}^{\mu,{\bf a}}(Z)$ and so $\mu(A_{u})=1$.⁹⁷ Then by Fubini-Tonelli $\rho_{(u,v)}(A)=(\delta_{u}\otimes\mu)(A)=\int_{Y}\mu(A_{s})\;d\delta_{u}(s)=\mu(A_{u})=1$, which is what we wanted to show. The argument for Martin-Löf disintegrations is similar.

⁹⁷ This “hard” direction of van Lambalgen’s Theorem works in arbitrary computable Polish spaces with computable measures, basically because it is a Fubini-Tonelli type argument. It similarly works for $\mathsf{MLR}^{\nu}$. For the setting of Cantor space with uniform measure, see the discussion in [15, pp. 257-258, 357].

Suppose that $W\subseteq Y\times Z$ is c.e. open. Then we can write $W=\bigcup_{i}U_{i}\times V_{i}$ where $U_{i}\subseteq Y,V_{i}\subseteq Z$ are computable sequences of c.e. opens with $U_{i}\subseteq U_{i+1}$ and $V_{i}\subseteq V_{i+1}$. Then for rational $q\geq 0$, we have $\rho_{(u,v)}(W)>q$ iff there is $i\geq 0$ with $\rho_{(u,v)}(U_{i}\times V_{i})>q$, which happens iff there is $i\geq 0$ with $\delta_{u}(U_{i})\cdot\mu(V_{i})>q$, which happens iff there is $i\geq 0$ with $u$ in $U_{i}$ and $\mu(V_{i})>q$. This is a c.e. open condition in variables $(u,v)$. ∎
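The defining property verified in the proof above can be checked by hand on a toy instance. The following is a minimal sketch, assuming finite sets in place of the computable Polish spaces $Y$ and $Z$; the particular spaces, the weights `lam` and `mu`, and the function `f` are hypothetical choices made only for illustration, and exact rational arithmetic is used so that the equalities are literal.

```python
from fractions import Fraction
from itertools import product

# Hypothetical finite stand-ins for the spaces Y and Z.
Y = ["a", "b", "c"]
Z = [0, 1]
lam = {"a": Fraction(1, 2), "b": Fraction(1, 3), "c": Fraction(1, 6)}  # lambda on Y
mu = {0: Fraction(1, 4), 1: Fraction(3, 4)}                            # mu on Z

def f(u, t):
    # An arbitrary function on Y x Z, chosen only for illustration.
    return Fraction(len(Y) - Y.index(u)) * (2 * t + 1)

def cond_exp(u, v):
    # Formula (B.4): integrate f(u, .) against mu; the argument v is not used.
    return sum(f(u, t) * mu[t] for t in Z)

def integral_over(B, g):
    # Integral of g over B x Z with respect to the product of lam and mu.
    return sum(g(u, v) * lam[u] * mu[v] for u, v in product(B, Z))

# Defining property of conditional expectation, tested on sets of the form B x Z.
subsets = [[], ["a"], ["b", "c"], ["a", "c"], Y]
checks = [integral_over(B, cond_exp) == integral_over(B, f) for B in subsets]
```

In this sketch `cond_exp` is constant along each line $\{u\}\times Z$, which mirrors the observation in the proof that $v$ does not appear free in the right-hand side of (B.4).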

Finally, we can combine partitions and lines as follows, which gives a suitably generalised version of an individual step in the filtration from Example A.2 (like $\mathscr{G}_{2}$ or $\mathscr{G}_{3}$ in that example).

Proposition B.8.

Suppose that $Y,Z$ are computable Polish spaces. Suppose that $\lambda$ is a computable point of $\mathcal{P}(Y)$ and $\mu$ is a computable point of $\mathcal{P}(Z)$ .

Suppose $\{C_{i}:i\in I\}$ is an effective partition of $Z$ with effective softening $\{V_{i}:i\in I\}$ .

Let $\mathscr{F}$ be the $\lambda\otimes\mu$ -effective $\sigma$ -algebra on $Y\times Z$ generated by sets of the form $U\times C_{i}$ , where $U$ ranges over c.e. opens from a $\lambda$ -computable basis on $Y$ . Then the map $\rho:Y\times Z\rightarrow\mathcal{M}^{+}(Y\times Z)$ given by $\rho_{(u,v)}=\sum_{i\in I^{+}}\big{(}\delta_{u}\otimes\mu(\cdot\mid V_{i})\big{)}\cdot I_{V_{i}}(v)$ is a Martin-Löf disintegration of $\mathscr{F}$ with respect to $\lambda\otimes\mu$ , and one has the following expression for the version of conditional expectation:

\mathbb{E}_{\lambda\otimes\mu}[f\mid\mathscr{F}](u,v)=\sum_{i\in I^{+}}\bigg{(}\frac{1}{\mu(V_{i})}\int_{V_{i}}f(u,t)\;d\mu(t)\bigg{)}\cdot I_{V_{i}}(v)

(B.5)

Proof.

This proof is just a combination of the proofs of Proposition B.6 and Proposition B.7. ∎
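Formula (B.5) can likewise be tested on a toy instance. The following is a minimal sketch, assuming finite $Y$ and $Z$ and assuming that each softening $V_{i}$ coincides with its cell $C_{i}$, which is a simplification available in a finite discrete space; the cells, weights, and function `f` are hypothetical. One cell is given $\mu$-measure zero to illustrate why the sum ranges only over $I^{+}$.

```python
from fractions import Fraction

# Hypothetical finite stand-ins for Y and Z, with product measure lam (x) mu.
Y = ["a", "b"]
lam = {"a": Fraction(2, 5), "b": Fraction(3, 5)}
Z = [0, 1, 2, 3, 4]
mu = {0: Fraction(1, 8), 1: Fraction(3, 8), 2: Fraction(1, 4),
      3: Fraction(1, 4), 4: Fraction(0)}
cells = {0: [0, 1], 1: [2, 3], 2: [4]}  # partition of Z; cell 2 is mu-null

def mass(i):
    # mu-measure of the i-th cell.
    return sum(mu[t] for t in cells[i])

# Indices of the cells with positive measure (playing the role of I^+).
I_plus = [i for i in cells if mass(i) > 0]

def f(u, t):
    # An arbitrary function on Y x Z, chosen only for illustration.
    return Fraction(t * t + (1 if u == "a" else 5))

def cond_exp(u, v):
    # Formula (B.5): average f(u, .) over the positive-measure cell containing v.
    total = Fraction(0)
    for i in I_plus:
        if v in cells[i]:
            total += sum(f(u, t) * mu[t] for t in cells[i]) / mass(i)
    return total

def integral_over(U, i, g):
    # Integral of g over U x C_i with respect to lam (x) mu.
    return sum(g(u, v) * lam[u] * mu[v] for u in U for v in cells[i])

# Defining property of conditional expectation on the generators U x C_i.
checks = [integral_over(U, i, cond_exp) == integral_over(U, i, f)
          for U in (["a"], ["b"], Y) for i in cells]
```

On the null cell the sketch returns the value zero, which corresponds to $\rho_{(u,v)}$ being the zero measure there; this is compatible with the codomain $\mathcal{M}^{+}(Y\times Z)$ in the statement.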

Another variant on Example A.2 is the following:

Proposition B.9.

Let $X=\prod X_{i}$, where $X_{i}$ is a computable sequence of computable Polish spaces. Let a computable point $\nu$ of $\mathcal{P}(X)$ be given by $\nu=\bigotimes_{i}\nu_{i}$, where $\nu_{i}$ is a computable sequence in $\mathcal{P}(X_{i})$. Let $n\geq 1$. Let $\mathscr{F}_{n}$ be the $\sigma$-algebra on $X$ generated by sets of the form $\prod_{i\leq n}V_{i}\times\prod_{i>n}X_{i}$, where $V_{i}$ for $i\leq n$ ranges over c.e. opens from a $\nu_{i}$-computable basis on $X_{i}$. For $x$ in $X$, write its coordinates as $x=(x_{1},x_{2},\ldots)$. Then $\rho^{(n)}:X\rightarrow\mathcal{P}(X)$ given by $\rho_{x}^{(n)}=(\otimes_{i\leq n}\delta_{x_{i}})\otimes(\otimes_{i>n}\nu_{i})$ is a Martin-Löf disintegration of $\mathscr{F}_{n}$ with respect to $\nu$, and one has the following expression for the version of conditional expectation, which for each $f$ in $\mathbb{L}_{1}(\nu)$ is defined for $\nu$-a.s. many $x$ from $X$, and where $x=(x_{1},x_{2},\ldots)$ and $\overline{t}=(t_{n+1},t_{n+2},\ldots)$:

\mathbb{E}_{\nu}[f\mid\mathscr{F}_{n}](x)=\int_{\prod_{i>n}X_{i}}f(x_{1},\ldots,x_{n},\overline{t})\;d(\otimes_{i>n}\nu_{i})(\overline{t})

Proof.

Simply apply Proposition B.7 to $Y\times Z$ , where $Y=\prod_{i\leq n}X_{i}$ and $Z=\prod_{i>n}X_{i}$ and $\lambda=\otimes_{i\leq n}\nu_{i}$ and $\mu=\otimes_{i>n}\nu_{i}$ . ∎
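To see the formula of Proposition B.9 at work, the following is a minimal sketch, assuming each $X_{i}=\{0,1\}$ and an $f$ that depends only on the first `N` coordinates, so that the integral over the tail reduces to a finite sum. The weights `p` and the function `f` are hypothetical, and coordinates are indexed from 0 in the code rather than from 1 as in the text.

```python
from fractions import Fraction
from itertools import product

N = 4  # assumption of this sketch: f depends only on the first N coordinates
p = [Fraction(1, 2), Fraction(1, 3), Fraction(1, 4), Fraction(1, 5)]  # nu_i({1})

def weight(bits, start):
    # Mass of the block `bits` under the product of nu_i for i = start, start+1, ...
    w = Fraction(1)
    for offset, b in enumerate(bits):
        q = p[start + offset]
        w *= q if b == 1 else 1 - q
    return w

def f(x):
    # Illustrative function: the number of ones among the first N coordinates.
    return Fraction(sum(x[:N]))

def cond_exp(n, x):
    # Fix the first n coordinates of x and integrate out the remaining ones.
    head = tuple(x[:n])
    return sum(f(head + tail) * weight(tail, n)
               for tail in product((0, 1), repeat=N - n))

x = (1, 0, 1, 1)
values = [cond_exp(n, x) for n in range(N + 1)]
```

Along the point `x`, the list `values` starts at the unconditional expectation of `f` and ends at `f(x)` once all `N` relevant coordinates have been revealed, which is the finite shadow of the convergence to the truth studied in the paper.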

The simplest kind of effective filtration, which occurs in both Examples B.1 and B.2, is the following:

Definition B.10.

Suppose that $X$ is a computable Polish space. Suppose that $T\subseteq\mathbb{N}^{<\mathbb{N}}$ is a computable tree with no dead ends. Let $I_{n}=\{\sigma\in T:\left|\sigma\right|=n\}$. Suppose that $\mathscr{F}_{n}$ is an effective partition $\{A_{\sigma}:\sigma\in I_{n}\}$, uniformly in $n\geq 0$. If the partitions refine one another, in that $A_{\sigma}=\bigcup_{\sigma^{\frown}(j)\in T}A_{\sigma^{\frown}(j)}$ for all $\sigma$ in $T$, then $\mathscr{F}_{n}$ is an effective filtration, which we call an effective refined partition.
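The purely combinatorial part of this definition, namely that each level is a partition and that each cell is the union of the cells of its children, can be illustrated as follows. This is a minimal sketch, assuming the full binary tree truncated at a hypothetical depth `DEPTH` and a finite stand-in for $X$; it does not model the effectivity requirements (such as softenings) that the definition also imposes.

```python
from itertools import product

DEPTH = 3  # truncation depth; the definition itself concerns an infinite tree
X = list(range(2 ** DEPTH))  # finite stand-in for the space

def level(n):
    # I_n: the nodes of the full binary tree of length n.
    return list(product((0, 1), repeat=n))

def cell(sigma):
    # A_sigma: points whose DEPTH-bit binary expansion starts with sigma.
    return {x for x in X
            if tuple(int(b) for b in format(x, f"0{DEPTH}b"))[:len(sigma)] == sigma}

def is_partition(n):
    # The level-n cells cover X and are pairwise disjoint.
    cells = [cell(s) for s in level(n)]
    return (set().union(*cells) == set(X)
            and sum(len(c) for c in cells) == len(X))

def refines(sigma):
    # Refinement condition: A_sigma is the union of the cells of its children.
    children = [sigma + (j,) for j in (0, 1)]
    return cell(sigma) == set().union(*(cell(c) for c in children))

partition_ok = all(is_partition(n) for n in range(DEPTH + 1))
refinement_ok = all(refines(s) for n in range(DEPTH) for s in level(n))
```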

The following isolates the natural sufficient condition for an effective refined partition to be full, and this condition is obviously met in Example B.1:

Proposition B.11.

Suppose that the effective refined partition $\mathscr{F}_{n}$ satisfies the following properties:

– Effectively shrinking: There is a computable function $\ell:\mathbb{Q}^{>0}\rightarrow\mathbb{N}$ such that for all rational $\epsilon>0$ one has that $\mathrm{diam}(A_{\sigma})<\epsilon$ for all $\sigma$ in $T$ with $\left|\sigma\right|\geq\ell(\epsilon)$.

– Effectively non-empty: There is a uniformly computable sequence of points $x_{\sigma}$ in $A_{\sigma}$.

Then the effective refined partition $\mathscr{F}_{n}$ is full.

Proof.

(Sketch) The two conditions imply that there is a well-defined computable continuous surjection $\pi:[T]\rightarrow X$ given by $\pi(\omega)=x$ iff $\{x\}=\bigcap_{n}A_{\omega\upharpoonright n}$. For fullness, suppose that $U\subseteq X$ is c.e. open. Then $\pi^{-1}(U)$ is c.e. open. Then there is a c.e. set $S\subseteq T$ such that $\pi^{-1}(U)=\bigcup_{\sigma\in S}[T]\cap[\sigma]$. Then one can check that $U=\bigcup_{\sigma\in S}A_{\sigma}$. ∎

We can also obtain full effective filtrations by combining lines with full effective partitions, as in Example B.2:

Example B.12.

Suppose that $Y,Z$ are computable Polish spaces. Suppose that $\lambda$ is a computable point of $\mathcal{P}(Y)$ and suppose that $\mu$ is a computable point of $\mathcal{P}(Z)$ .

Suppose that $\mathscr{F}_{n}$ is a full effective partition of $Z$ (resp. almost-full effective partition of $Z$ ).

Let $\mathscr{G}_{n}$ be the effective $\sigma$ -algebra $\{U\times A:U\subseteq Y\mbox{ c.e. open }\;\&\;A\in\mathscr{F}_{n}\}$ .

Then $\mathscr{G}_{n}$ is a full effective filtration of $Y\times Z$ (resp. almost-full effective filtration of $Y\times Z$ ).

Another example of a full effective filtration is related to countable products:

Example B.13.

The effective $\sigma$-algebras $\mathscr{F}_{n}$ from the countable products Proposition B.9 form a full effective filtration. This is because every c.e. open $W\subseteq X=\prod_{i}X_{i}$ can be uniformly written as $\bigcup_{\sigma\in J}W_{\sigma}$, for some c.e. index set $J$, where $W_{\sigma}=\prod_{i<\left|\sigma\right|}V_{\sigma(i)}\times\prod_{i\geq\left|\sigma\right|}X_{i}$, where $V_{\sigma(i)}$ is uniformly c.e. open in $X_{i}$ for $i<\left|\sigma\right|$.

Of course, this example is the same as that of full effective partitions when the $X_{i}$ are uniformly countable. However, this example goes beyond that of full effective partitions when the $X_{i}$ are uncountable.

References

[1] Nathanael L Ackerman, Cameron E Freer, and Daniel M Roy, On computability and disintegration, Mathematical Structures in Computer Science 27 (2017), no. 8, 1287–1314.
[2] Gordon Belot, Bayesian orgulity, Philosophy of Science 80 (2013), no. 4, 483–503.
[3] Laurent Bienvenu, Rupert Hölzl, Joseph S Miller, and André Nies, Denjoy, Demuth and density, Journal of Mathematical Logic 14 (2014), no. 01, 1450004.
[4] Patrick Billingsley, Convergence of probability measures, Wiley, 2013.
[5] David Blackwell and Lester Dubins, Merging of opinions with increasing information, Annals of Mathematical Statistics 33 (1962), no. 3, 882–886.
[6] Vladimir I Bogachev, Weak convergence of measures, American Mathematical Society, 2018.
[7] Vasco Brattka, Joseph S Miller, and André Nies, Randomness and differentiability, Transactions of the American Mathematical Society 368 (2015), no. 1, 581–605.
[8] Gemma Carotenuto and André Nies, Lightface ${\Pi}^{0}_{3}$ -completeness of density sets under effective Wadge reducibility, Pursuit of the Universal, Springer, 2016, pp. 234–239.
[9] Douglas Cenzer, $\Pi^{0}_{1}$ -Classes in computability theory, Handbook of Computability Theory (Edward R Griffor, ed.), Studies in Logic and the Foundations of Mathematics, vol. 140, North-Holland, Amsterdam, 1999, pp. 37–85.
[10] Douglas Cenzer and Jeffrey B Remmel, Effectively closed sets, May 2017.
[11] J T Chang and D Pollard, Conditioning as disintegration, Statistica Neerlandica 51 (1997), no. 3, 287–317.
[12] Vaughn Climenhaga and Anatole Katok, Measure theory through dynamical eyes, (2012), https://arxiv.org/abs/1208.4550.
[13] Natasha L Dobrinen and Stephen G Simpson, Almost everywhere domination, Journal of Symbolic Logic 69 (2004), no. 3, 914–922.
[14] J L Doob, Measure theory, Graduate Texts in Mathematics, vol. 143, Springer, 1994.
[15] Rod Downey and Dennis Hirschfeldt, Algorithmic randomness and complexity, Springer, Berlin, 2010.
[16] R M Dudley, Real analysis and probability, Cambridge University Press, 2002.
[17] Rick Durrett, Probability: Theory and examples, fourth ed., Cambridge University Press, 2010.
[18] John Earman, Bayes or bust? A critical examination of Bayesian confirmation theory, MIT Press, Cambridge, 1992.
[19] Manfred Einsiedler and Thomas Ward, Ergodic theory: with a view towards number theory, Graduate Texts in Mathematics, vol. 259, Springer, 2010.
[20] Jean-Pierre Florens, Velayoudom Marimoutou, and Anne Peguin-Feissolle, Econometric modeling and inference, Cambridge University Press, 2007.
[21] Gerald B. Folland, Real analysis, second ed., Pure and Applied Mathematics, Wiley, 1999.
[22] Peter Gács, Uniform test of algorithmic randomness over a general space, Theoretical Computer Science 341 (2005), no. 1, 91–137.
[23] Su Gao, Invariant descriptive set theory, CRC Press, 2008.
[24] Vassilios Gregoriades, Tamás Kispéter, and Arno Pauly, A comparison of concepts from computable analysis and effective descriptive set theory, Mathematical Structures in Computer Science 27 (2017), no. 8, 1414–1436.
[25] Allan Gut, Probability: A graduate course, Springer, 2013.
[26] Matthew Harrison-Trainor, Alexander Melnikov, and Keng Meng Ng, Computability of Polish spaces up to homeomorphism, Journal of Symbolic Logic 85 (2020), no. 4, 1664–1686.
[27] Peter G Hinman, A survey of Mučnik and Medvedev degrees, Bulletin of Symbolic Logic 18 (2012), no. 2, 161–229.
[28] Colin Howson and Peter Urbach, Scientific reasoning: The Bayesian approach, third ed., Open Court, Chicago, 2006.
[29] Mathieu Hoyrup, Calculabilité, aléatoire et théorie ergodique sur les espaces métriques, Ph.D. thesis, Université Paris-Diderot - Paris VII, 2008.
[30] Mathieu Hoyrup and Cristóbal Rojas, Computability of probability measures and Martin-Löf randomness over metric spaces, Information and Computation 207 (2009), no. 7, 830–847.
[31] Mathieu Hoyrup, Cristóbal Rojas, and Klaus Weihrauch, Computability of the Radon-Nikodym derivative, Models of Computation in Context (Benedikt Löwe, Dag Normann, Ivan Soskov, and Alexandra Soskova, eds.), Lecture Notes in Computer Science, vol. 6735, Springer, Berlin, Heidelberg, 2011, pp. 132–141.
[32] Mathieu Hoyrup and Jason Rute, Computable measure theory and algorithmic randomness, Handbook of Computability and Complexity in Analysis (Vasco Brattka and Peter Hertling, eds.), Springer, 2021, pp. 227–270.
[33] Olav Kallenberg, Foundations of modern probability, Springer, New York, NY, 2002.
[34] Alexander S. Kechris, Classical descriptive set theory, Graduate Texts in Mathematics, vol. 156, Springer, 1995.
[35] Mushfeq Khan, Lebesgue density and ${\Pi}_{1}^{0}$ classes, Journal of Symbolic Logic 81 (2016), no. 1, 80–95.
[36] Stuart Alan Kurtz, Randomness and genericity in the degrees of unsolvability, Ph.D. thesis, University of Illinois at Urbana-Champaign, 1981.
[37] Leonid Anatolievich Levin, Uniform tests of randomness, Soviet Mathematics Doklady 17 (1976), no. 2, 337–340.
[38] P Lévy, Théorie de l’addition des variables aléatoires, Gauthier-Villars, Paris (1938).
[39] Ming Li and Paul Vitányi, An introduction to Kolmogorov complexity and its applications, second ed., Graduate Texts in Computer Science, Springer, 1997.
[40] Donald Martin, Measure, category, and degrees of unsolvability, Unpublished (1967).
[41] Per Martin-Löf, The definition of random sequences, Information and Control 9 (1966), 602–619.
[42] Per Martin-Löf, On the notion of randomness, Intuitionism and Proof Theory (Akiko Kino, John Myhill, and Richard E Vesley, eds.), North-Holland, Amsterdam, 1970, pp. 73–78.
[43] Joseph S Miller, Degrees of unsolvability of continuous functions, Journal of Symbolic Logic 69 (2004), no. 2, 555–584.
[44] Kenshi Miyabe, Characterization of Kurtz randomness by a differentiation theorem, Theory of Computing Systems 52 (2013), no. 1, 113–132.
[45] Kenshi Miyabe, ${L}^{1}$ -Computability, layerwise computability, and Solovay reducibility, Computability 2 (2013), 15–29.
[46] Kenshi Miyabe, Algorithmic randomness over general spaces, Mathematical Logic Quarterly (2014).
[47] Kenshi Miyabe, André Nies, and Jing Zhang, Using almost-everywhere theorems from analysis to study randomness, Bulletin of Symbolic Logic 22 (2016), no. 3, 305–331.
[48] Yiannis N Moschovakis, Descriptive set theory, second ed., Mathematical Surveys and Monographs, vol. 155, American Mathematical Society, Providence, RI, 2009.
[49] Keng Meng Ng, Frank Stephan, Yue Yang, and Liang Yu, Computational aspects of the hyperimmune-free degrees, Proceedings of the 12th Asian Logic Conference, World Scientific, 2012, pp. 271–284.
[50] André Nies, Computability and randomness, Oxford University Press, 2008.
[51] K R Parthasarathy, Probability measures on metric spaces, Academic Press, 1967.
[52] Noopur Pathak, Cristóbal Rojas, and Stephen G. Simpson, Schnorr randomness and the Lebesgue differentiation theorem, Proceedings of the American Mathematical Society 142 (2014), no. 1, 335–349.
[53] David Pollard, A user’s guide to measure theoretic probability, Cambridge University Press, 2002.
[54] Christopher P Porter, On analogues of the Church–Turing Thesis in algorithmic randomness, Review of Symbolic Logic 9 (2016), no. 3, 456–479.
[55] Malempati M Rao, Stochastic processes - inference theory, second ed., Springer, 2014.
[56] Jan Reimann, Randomness—beyond Lebesgue measure, Logic Colloquium 2006 (S Barry Cooper, Herman Geuvers, Anand Pillay, and Jouko Väänänen, eds.), Cambridge University Press, Cambridge, 2010, pp. 247–279.
[57] Robert Rettinger and Xizhong Zheng, Computability of real numbers, Handbook of Computability and Complexity in Analysis (Vasco Brattka and Peter Hertling, eds.), Springer, 2021, pp. 3–28.
[58] V A Rohlin, On the fundamental ideas of measure theory, Matematicheskii Sbornik (1949), Translated in [59].
[59] V A Rohlin, On the fundamental ideas of measure theory, Functional analysis and measure theory, American Mathematical Society, 1962, pp. 1–54.
[60] V A Rokhlin, Lectures on the entropy of measure-preserving transformations, Russian Mathematical Surveys 22 (1967), no. 5.
[61] Jason Rute, Algorithmic randomness, martingales, and differentiability I, preprint (2012), This is an expanded version of [62, Chapter 2].
[62] Jason Rute, Topics in algorithmic randomness and computable analysis, Ph.D. thesis, Carnegie Mellon University, 2013.
[63] Filippo Santambrogio, Optimal transport for applied mathematicians: Calculus of variations, PDEs, and modeling, Birkhäuser, 2015.
[64] Claus-Peter Schnorr, Zufälligkeit und Wahrscheinlichkeit. Eine algorithmische Begründung der Wahrscheinlichkeitstheorie, Lecture Notes in Mathematics, vol. 218, Springer, 1971.
[65] Claus-Peter Schnorr, A survey of the theory of random sequences, Basic Problems in Methodology and Linguistics: Part Three of the Proceedings of the Fifth International Congress of Logic, Methodology and Philosophy of Science, London, Ontario, Canada-1975, Reidel, Dordrecht, 1977, pp. 193–211.
[66] Glenn Shafer and Vladimir Vovk, Game-Theoretic foundations for probability and finance, Wiley, 2019.
[67] Glenn Shafer, Vladimir Vovk, and Akimichi Takemura, Lévy’s zero-one law in game-theoretic probability, (2009), https://arxiv.org/abs/0905.0254.
[68] A Shen, V A Uspensky, and N Vereshchagin, Kolmogorov complexity and algorithmic randomness, American Mathematical Society, 2017.
[69] Stephen G Simpson, Mass problems and randomness, Bulletin of Symbolic Logic 11 (2005), no. 1, 1–27.
[70] Stephen G. Simpson, Subsystems of second order arithmetic, second ed., Cambridge University Press, Cambridge, 2009.
[71] Robert I Soare, Turing computability: Theory and applications, Springer, 2016.
[72] Alan M Turing, On computable numbers, with an application to the Entscheidungsproblem, Proceedings of the London Mathematical Society 2 (1937), no. 1, 230–265.
[73] Jan Von Plato, Creating modern probability: Its mathematics, physics and philosophy in historical perspective, Cambridge University Press, 1994.
[74] Hermann Weyl, Über die Gleichverteilung von Zahlen Mod. Eins, Mathematische Annalen 77 (1916), 313–352.
[75] David Williams, Probability with martingales, Cambridge Mathematical Textbooks, Cambridge University Press, Cambridge, 1991.
[76] Xiaokang Yu, Measure theory in weak subsystems of second-order arithmetic, Ph.D. thesis, The Pennsylvania State University, 1987.
[77] Xiaokang Yu, Radon-Nikodým theorem is equivalent to arithmetical comprehension, Logic and computation (Pittsburgh, PA, 1987), Contemporary Mathematics, vol. 106, American Mathematical Society, 1990, pp. 289–297.

\begin{aligned}
\sum_{t}\|\sum_{s\geq t}\sup_{n}\mathbb{E}_{\nu}[f-f_{s}\mid\mathscr{F}_{n}]\|_{p}\cdot(t+1)^{k}&\leq\sum_{t}\sum_{s\geq t}\|\sup_{n}\mathbb{E}_{\nu}[f-f_{s}\mid\mathscr{F}_{n}]\|_{p}\cdot(t+1)^{k}\\
&\leq\sum_{t}\sum_{s\geq t}\|\sup_{n}\mathbb{E}_{\nu}[f-f_{s}\mid\mathscr{F}_{n}]\|_{p}\cdot(s+1)^{k}=\sum_{t}\sum_{s\geq t}c_{s,t}\\
&=\sum_{t}\sum_{s}c_{s,t}=\sum_{s}\sum_{t}c_{s,t}=\sum_{s}\|\sup_{n}\mathbb{E}_{\nu}[f-f_{s}\mid\mathscr{F}_{n}]\|_{p}\cdot(s+1)^{k+1}\\
&\leq\sum_{s}\frac{p}{p-1}\cdot\|f-f_{s}\|_{p}\cdot(s+1)^{k+1}<\infty
\end{aligned}

\begin{aligned}
&\sum_{t}\|\sum_{s<t}\sup_{n}\mathbb{E}_{\nu}[f-f_{s}\mid\mathscr{F}_{n}]-\sup_{n}\mathbb{E}_{\nu}[f_{t}-f_{s}\mid\mathscr{F}_{n}]\|_{p}\cdot(t+1)^{k}\\
&\leq\sum_{t}\sum_{s<t}\|\sup_{n}\mathbb{E}_{\nu}[f-f_{t}\mid\mathscr{F}_{n}]\|_{p}\cdot(t+1)^{k}\\
&=\sum_{t}t\cdot\|\sup_{n}\mathbb{E}_{\nu}[f-f_{t}\mid\mathscr{F}_{n}]\|_{p}\cdot(t+1)^{k}\\
&\leq\sum_{t}\|\sup_{n}\mathbb{E}_{\nu}[f-f_{t}\mid\mathscr{F}_{n}]\|_{p}\cdot(t+1)^{k+1}\\
&\leq\sum_{t}\frac{p}{p-1}\cdot\|f-f_{t}\|_{p}\cdot(t+1)^{k+1}<\infty
\end{aligned}