
Networks of Classical Conditioning Gates and Their Learning

Shun-ichi Azuma [1] (sazuma@i.kyoto-u.ac.jp), Dai Takakura (ultradait@gmail.com), Ryo Ariizumi [3] (ryoariizumi@go.tuat.ac.jp), Toru Asai [4] (asai@nuem.nagoya-u.ac.jp)

[1] Graduate School of Informatics, Kyoto University, Yoshida-honmachi, Sakyo-ku, Kyoto 606-8501, Japan
[3] Faculty of Engineering, Tokyo University of Agriculture and Technology, 2-24-16 Naka-cho, Koganei 184-8588, Japan
[4] Graduate School of Engineering, Nagoya University, Furo-cho, Chikusa-ku, Nagoya 464-8603, Japan
Abstract

Chemical AI is chemically synthesized artificial intelligence that has the ability to learn in addition to processing information. A research project on chemical AI, called the Molecular Cybernetics Project, was launched in Japan in 2021 with the goal of creating a molecular machine that can learn a type of conditioned reflex through the process called classical conditioning. If the project succeeds in developing such a molecular machine, the next step will be to configure a network of such machines to realize more complex functions. With this motivation, this paper develops a method for learning a desired function in a network of nodes each of which can implement classical conditioning. First, we present a model of classical conditioning, which we call a classical conditioning gate. We then propose a learning algorithm for networks of classical conditioning gates.

keywords:
network, learning, classical conditioning, Pavlov’s dog, logic gate.

1 Introduction

The development of DNA nanotechnology has raised expectations for chemical AI. Chemical AI is chemically synthesized artificial intelligence that has the ability to learn in addition to processing information. A research project on chemical AI, called the Molecular Cybernetics Project, was launched in Japan in 2021 [1], with the goal of establishing an academic field called molecular cybernetics.

One of the milestones of the project is to create a molecular machine that can learn a type of conditioned reflex through the process called classical conditioning [2]. Classical conditioning is the process of acquiring a conditioned reflex by presenting a conditioned stimulus together with an unconditioned stimulus. It was discovered through the well-known psychological experiment on “Pavlov’s dog”:

  • if one repeatedly feeds a dog while ringing a bell, then the dog will eventually begin to salivate at the sound of the bell;

  • if one just rings the bell repeatedly without feeding the dog that already salivates at the sound of the bell, then the dog will eventually stop salivating at the sound of the bell,

which are called acquisition and extinction, respectively. The project attempts to create liposomes with different functions and to combine them so as to artificially realize a function similar to classical conditioning.

If the milestone is achieved, the next step will be to configure a network of such machines to realize more complex functions, which calls for a method of training such a network. However, no such method exists, because learning must be carried out through the interaction of classical conditioning processes on the network, which is fundamentally different from existing learning frameworks. In fact, neural networks are a well-known learning model, but their learning is performed by adjusting the weights between nodes [3], not by applying external inputs as in classical conditioning. Boolean networks [4], on the other hand, are known as a model of networks of logic gates and their learning has been studied, e.g., in [5], but it also differs from learning by applying external inputs.

In this paper, we develop a method for learning a desired function in a network of nodes each of which can implement classical conditioning. First, classical conditioning is modeled as a time-varying logic gate with two inputs and a single output. The two inputs correspond to the feeding and the bell ringing in Pavlov’s dog experiment, and the gate operates as either a YES gate or an OR gate at each time. The gate state, which is either YES or OR, is determined by how the inputs are applied, in a manner similar to classical conditioning. We call this model a classical conditioning gate. Based on this model, the network of classical conditioning gates and its learning problem are formulated. We then derive a key principle for solving the problem, called the flipping principle, which states that the gate state of any node in the network can be flipped while preserving the states of certain other nodes. Based on the flipping principle, we present a learning algorithm that realizes a desired function on the network.

Finally, we note the terminology and notation used in this paper. We consider two types of logic gates, logical YES and logical OR; Table 1 shows their truth tables. We use $x_{1}\vee x_{2}$ to denote the logical OR of the binary variables $x_{1}$ and $x_{2}$. Moreover, $\bigvee_{i\in{\mathbf{I}}}x_{i}$ denotes the logical OR of $x_{i}$ over all indices in a finite set ${\mathbf{I}}$.

Table 1: Truth tables of logical YES and OR.
Input 1 Input 2 Output (YES) Output (OR)
0 0 0 0
0 1 0 1
1 0 1 1
1 1 1 1

2 System Modeling

2.1 Classical Conditioning Gates

We model classical conditioning as shown in Fig. 1. It is a two-state machine that switches between the states “YES” and “OR” depending on two binary input signals. When the state is “YES”, the gate operates as a logical YES gate, whose output is equal to the first input, as shown in Table 1. When the state is “OR”, the gate operates as a logical OR gate, whose output is equal to 1 if and only if at least one of the two inputs is equal to 1. The state changes when one of two types of training inputs is applied. When the state is “YES”, the state is changed to “OR” by applying the value $(1,1)$ several times in a row. When the state is “OR”, the state is changed to “YES” by applying the value $(0,1)$ several times in a row.

This model can be interpreted in terms of Pavlov’s dog experiment as follows. The state “YES” corresponds to responding only when the dog is being fed, while the state “OR” corresponds to responding when the dog is being fed or hears the bell. Then the input value $(1,1)$ is interpreted as the stimulus for acquisition, i.e., feeding with bell ringing, and $(0,1)$ as the stimulus for extinction, i.e., bell ringing without feeding.

The model of the above classical conditioning gate is expressed as

$$
\left\{\begin{array}{l}
x(t+1)=\left\{\begin{array}{ll}
{\rm OR} & {\rm if}~x(t-s+1)={\rm YES},~(v(\tau),w(\tau))=(1,1)~(\tau=t,t-1,\ldots,t-s+1),\\
{\rm YES} & {\rm if}~x(t-s+1)={\rm OR},~(v(\tau),w(\tau))=(0,1)~(\tau=t,t-1,\ldots,t-s+1),\\
x(t) & {\rm otherwise},
\end{array}\right.\\[2mm]
y(t)=\left\{\begin{array}{ll}
v(t) & {\rm if}~x(t)={\rm YES},\\
v(t)\vee w(t) & {\rm if}~x(t)={\rm OR},
\end{array}\right.
\end{array}\right. \qquad (10)
$$

where $x(t)\in\{{\rm YES},{\rm OR}\}$ is the state, $v(t)\in\{0,1\}$ and $w(t)\in\{0,1\}$ are the inputs, $y(t)\in\{0,1\}$ is the output, and $s\in\{1,2,\ldots\}$ is a period called the unit training time. The state equation represents classical conditioning, while the output equation represents the resulting input-output relation at time $t$.
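To make the gate model concrete, the following is a minimal simulation sketch of (10) in Python; the class name ConditioningGate and its interface are our own illustrative assumptions, not part of the model.

```python
# A minimal sketch of the classical conditioning gate (10); the class name and
# interface are illustrative assumptions, not part of the paper's formalism.
from collections import deque

class ConditioningGate:
    """Two-state gate that switches between 'YES' and 'OR' as in (10)."""

    def __init__(self, state="YES", s=3):
        self.state = state                   # x(t) in {'YES', 'OR'}
        self.s = s                           # unit training time s
        self.history = deque(maxlen=s)       # last s input pairs (v, w)

    def step(self, v, w):
        """Apply input (v, w) at time t; return y(t) and update the state."""
        y = v if self.state == "YES" else (v | w)   # output equation of (10)
        self.history.append((v, w))
        if len(self.history) == self.s:
            pairs = list(self.history)
            if self.state == "YES" and all(p == (1, 1) for p in pairs):
                self.state = "OR"            # acquisition: s inputs (1,1) in a row
            elif self.state == "OR" and all(p == (0, 1) for p in pairs):
                self.state = "YES"           # extinction: s inputs (0,1) in a row
        return y

# Example: three consecutive (1,1) inputs switch the gate from YES to OR.
g = ConditioningGate(state="YES", s=3)
for _ in range(3):
    g.step(1, 1)
print(g.state)                               # -> OR
```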

The following result presents a basic property of (10), which will be utilized for training a network of classical conditioning gates.

Lemma 1.

Consider the classical conditioning gate in (10) with $x(t)=\bar{x}$, where $t\in\{0,1,\ldots\}$ and $\bar{x}\in\{{\rm YES},{\rm OR}\}$ are arbitrarily given. Then the following statements hold.
(i) If $(v(t),w(t))=(0,0)$, then $y(t)=0$ and $x(t+1)=x(t)$.
(ii) If $(v(t),w(t))=(1,0)$, then $y(t)=1$ and $x(t+1)=x(t)$.

Proof.

Trivial from (10). ∎

This shows that there exists an input value that sets the output to an arbitrary value while preserving the state.

Figure 1: Classical conditioning gate.

2.2 Network of Classical Conditioning Gates

Now, we introduce a network of classical conditioning gates, as shown in Fig. 2. The network has a binary tree structure, where each gate, except for the leftmost gates, is connected to two other gates on the input side. The network has $m$ layers, indexed by $1,2,\ldots,m$ from the input side. The $i$-th layer has $2^{m-i}$ gates, and thus the network has $\sum_{i=1}^{m}2^{m-i}$ gates in total. We use $n_{i}$ and $n$ to denote these numbers, i.e., $n_{i}=2^{m-i}$ and $n=\sum_{i=1}^{m}2^{m-i}=\sum_{i=1}^{m}n_{i}$.

We introduce the following notation for the network. The network with $m$ layers is denoted by $\Sigma(m)$. The $j$-th gate from the top in the $i$-th layer is called node $(i,j)$, and we let $x_{ij}(t)\in\{{\rm YES},{\rm OR}\}$, $v_{ij}(t)\in\{0,1\}$, $w_{ij}(t)\in\{0,1\}$, and $y_{ij}(t)\in\{0,1\}$ denote the state, first input, second input, and output of node $(i,j)$, respectively. The pair of $v_{ij}(t)$ and $w_{ij}(t)$ is often denoted by $u_{ij}(t)\in\{0,1\}^{2}$. We use $\bar{x}_{ij}(t)$ to represent the flipped value of $x_{ij}(t)$: $\bar{x}_{ij}(t)={\rm YES}$ for $x_{ij}(t)={\rm OR}$, while $\bar{x}_{ij}(t)={\rm OR}$ for $x_{ij}(t)={\rm YES}$.

Next, let us consider collections of signals. We use ${\mathbf{N}}:=\{(1,1),(1,2),\ldots,(m,1)\}$, which has $n$ elements, to represent the set of node indices, and ${\mathbf{N}}_{i}\subset{\mathbf{N}}$ to represent the set of node indices in layer $i$. Let $X_{i}(t)\in\{{\rm YES},{\rm OR}\}^{n_{i}}$, $U_{i}(t)\in\prod_{j=1}^{n_{i}}(\{0,1\}\times\{0,1\})$, and $Y_{i}(t)\in\{0,1\}^{n_{i}}$ be the collective state, input, and output of the $i$-th layer. Let also $X(t):=(X_{1}(t),X_{2}(t),\ldots,X_{m}(t))\in\{{\rm YES},{\rm OR}\}^{n_{1}}\times\{{\rm YES},{\rm OR}\}^{n_{2}}\times\cdots\times\{{\rm YES},{\rm OR}\}^{n_{m}}=\{{\rm YES},{\rm OR}\}^{n}$. With this notation, the state, input, and output of the network $\Sigma(m)$ are $X(t)$, $U_{1}(t)$, and $Y_{m}(t)$, respectively, where $Y_{m}(t)=y_{m1}(t)$. In the sequel, the network input $U_{1}(t)$ and output $y_{m1}(t)$ are also written simply as $U(t)$ and $y_{m}(t)$.

Fig. 3 shows an example for $m=3$. In this case, we have ${\mathbf{N}}=\{(1,1),(1,2),(1,3),(1,4),(2,1),(2,2),(3,1)\}$, $X_{1}(t)=(x_{11}(t),x_{12}(t),x_{13}(t),x_{14}(t))$, $X_{2}(t)=(x_{21}(t),x_{22}(t))$, $X_{3}(t)=x_{31}(t)$, $U_{1}(t)=(u_{11}(t),u_{12}(t),u_{13}(t),u_{14}(t))$, $U_{2}(t)=(u_{21}(t),u_{22}(t))$, $U_{3}(t)=u_{31}(t)$, $Y_{1}(t)=(y_{11}(t),y_{12}(t),y_{13}(t),y_{14}(t))$, $Y_{2}(t)=(y_{21}(t),y_{22}(t))$, and $Y_{3}(t)=y_{31}(t)$.
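For illustration, the following sketch assembles such a network in Python, building on the hypothetical ConditioningGate class sketched in Section 2.1; the class name ConditioningNetwork and its interface are our own assumptions.

```python
# A sketch of the binary-tree network Sigma(m); it reuses the hypothetical
# ConditioningGate class from the previous sketch.

class ConditioningNetwork:
    """Network Sigma(m); the gate of node (i, j) is stored as gates[(i, j)]."""

    def __init__(self, m, initial_states, s=3):
        # initial_states: dict mapping (i, j) -> 'YES' or 'OR'
        self.m = m
        self.gates = {(i, j): ConditioningGate(initial_states[(i, j)], s)
                      for i in range(1, m + 1)
                      for j in range(1, 2 ** (m - i) + 1)}

    def step(self, U):
        """Apply one network input U (a 0/1 sequence of length 2^m = 2*n_1)
        and return the network output y_m(t)."""
        assert len(U) == 2 ** self.m
        y = {}                                        # outputs y_ij(t)
        for i in range(1, self.m + 1):
            for j in range(1, 2 ** (self.m - i) + 1):
                if i == 1:                            # layer 1 is driven by U
                    v, w = U[2 * j - 2], U[2 * j - 1]
                else:                                 # fed by the two upstream outputs
                    v, w = y[(i - 1, 2 * j - 1)], y[(i - 1, 2 * j)]
                y[(i, j)] = self.gates[(i, j)].step(v, w)
        return y[(self.m, 1)]

    def state(self):
        return {node: gate.state for node, gate in self.gates.items()}

# The network of Fig. 3: X(0) = (YES, YES, OR, YES, OR, OR, YES).
X0 = {(1, 1): "YES", (1, 2): "YES", (1, 3): "OR", (1, 4): "YES",
      (2, 1): "OR", (2, 2): "OR", (3, 1): "YES"}
net = ConditioningNetwork(m=3, initial_states=X0, s=3)
print(net.step([1, 0, 0, 0, 0, 0, 0, 0]))             # y_3(0) = v11 OR v12 -> 1
```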

Figure 2: Network of classical conditioning gates.
Figure 3: Network $\Sigma(3)$ with $X(0)=({\rm YES},{\rm YES},{\rm OR},{\rm YES},{\rm OR},{\rm OR},{\rm YES})$.

Since a classical conditioning gate operates as either logical YES or OR, the possible input-output relation of $\Sigma(m)$ is limited to the logical OR of some of the inputs of $\Sigma(m)$. Moreover, the output $y_{m}(t)$ always depends on the input $v_{11}(t)$ because $v_{11}(t)$ propagates through the nodes $(i,1)$ ($i=1,2,\ldots,m$), each operating as logical YES or OR. For example, $y_{3}(0)=v_{11}(0)\vee v_{12}(0)$ for the network $\Sigma(3)$ in Fig. 3. This fact is formalized as follows.

Lemma 2.

Consider the network $\Sigma(m)$ with $X(t)=\bar{X}$, where $t\in\{0,1,\ldots\}$ and $\bar{X}\in\{{\rm YES},{\rm OR}\}^{n}$ are arbitrarily given. The following statements hold.
(i) The input-output relation at time $t$ is given by

$$y_{m}(t)=\left(\bigvee_{j\in{\mathbf{J}}_{1}}v_{1j}(t)\right)\vee\left(\bigvee_{j\in{\mathbf{J}}_{2}}w_{1j}(t)\right) \qquad (11)$$

for some ${\mathbf{J}}_{1}\subseteq\{1,2,\ldots,n_{1}\}$ and ${\mathbf{J}}_{2}\subseteq\{1,2,\ldots,n_{1}\}$.
(ii) If (11) holds, then $1\in{\mathbf{J}}_{1}$. Moreover, if $v_{11}(t)=1$, then $y_{m}(t)=1$. $\Box$

The following result shows a structural property that indicates which inputs affect node (i,j)(i,j).

Lemma 3.

Consider the network $\Sigma(m)$ with $X(t)=\bar{X}$ and a node $(i,j)\in\{2,3,\ldots,m\}\times\{1,2,\ldots,n_{i}\}$, where $t\in\{0,1,\ldots\}$ and $\bar{X}\in\{{\rm YES},{\rm OR}\}^{n}$ are arbitrarily given. Suppose that the input $U(t)$ is divided into $2n_{i}$ blocks of the same size, and let $[U(t)]_{k}\in\{0,1\}^{2^{i-1}}$ be the $k$-th block. Then

$$v_{ij}(t)=f_{1}([U(t)]_{2j-1}),\quad w_{ij}(t)=f_{2}([U(t)]_{2j}) \qquad (12)$$

holds for some functions $f_{1}:\{0,1\}^{2^{i-1}}\to\{0,1\}$ and $f_{2}:\{0,1\}^{2^{i-1}}\to\{0,1\}$.

Proof.

It is trivial from the definition of Σ(m)\Sigma(m). See Fig. 2. ∎

Lemma 3 is illustrated as follows. Consider the network $\Sigma(3)$ in Fig. 3 and node $(2,1)$. Then $U(t)$ is divided into $4$ blocks (since $2n_{2}=2^{2}$): $[U(t)]_{1}=(v_{11}(t),w_{11}(t))$, $[U(t)]_{2}=(v_{12}(t),w_{12}(t))$, $[U(t)]_{3}=(v_{13}(t),w_{13}(t))$, and $[U(t)]_{4}=(v_{14}(t),w_{14}(t))$. From Lemma 3, we have $v_{21}(t)=f_{1}([U(t)]_{1})$ and $w_{21}(t)=f_{2}([U(t)]_{2})$ for some $f_{1}:\{0,1\}^{2}\to\{0,1\}$ and $f_{2}:\{0,1\}^{2}\to\{0,1\}$. This is consistent with the interdependence between the signals in Fig. 3.
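As a small illustration of the block decomposition in Lemma 3, the following sketch splits a network input into the blocks $[U(t)]_{k}$; the function name is our own.

```python
# Block decomposition of Lemma 3 (illustrative helper, not from the paper).

def blocks(U, i):
    """Split a network input U (length 2^m) into 2*n_i blocks of size 2^(i-1)."""
    size = 2 ** (i - 1)
    return [U[k:k + size] for k in range(0, len(U), size)]

# Sigma(3), node (2,1): v21 depends only on block 1, and w21 only on block 2.
U = [0, 0, 1, 0, 0, 0, 0, 0]        # (v11, w11, v12, w12, v13, w13, v14, w14)
print(blocks(U, 2))                 # -> [[0, 0], [1, 0], [0, 0], [0, 0]]
```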

3 Problem Formulation

For the network Σ(m)\Sigma(m), we address the following learning problem.

Problem 1.

Consider the network $\Sigma(m)$ with $X(0)=\bar{X}$, where $\bar{X}\in\{{\rm YES},{\rm OR}\}^{n}$ is arbitrarily given. Suppose that ${\mathbf{J}}_{1}\subseteq\{1,2,\ldots,n_{1}\}$ and ${\mathbf{J}}_{2}\subseteq\{1,2,\ldots,n_{1}\}$ are given. Find a time $T\in\{1,2,\ldots\}$ and an input sequence $(U(0),U(1),\ldots,U(T-1))$ such that (11) holds for $t=T$. $\Box$

Two remarks are given.

First, the problem is not always feasible because (11) cannot always be realized by $\Sigma(m)$. For example, as is seen from the output equation in (10) (and Lemma 2 (ii)), the output always depends on $v_{11}(T)$, so $y_{m}(T)=v_{12}(T)\vee v_{22}(T)$ is not possible for any $T\in\{1,2,\ldots\}$.

Second, if the problem is feasible, there exists a vector $X^{*}\in\{{\rm YES},{\rm OR}\}^{n}$ such that $X(T)=X^{*}$ implies (11). For example, consider the network $\Sigma(3)$ in Fig. 3 and the case where ${\mathbf{J}}_{1}=\{1,3,4\}$ and ${\mathbf{J}}_{2}=\{3\}$ in (11). Then we have $X^{*}=({\rm YES},{\rm YES},{\rm OR},{\rm YES},{\rm YES},{\rm OR},{\rm YES})$, for which $X(T)=X^{*}$ implies $y_{m}(T)=v_{11}(T)\vee v_{13}(T)\vee w_{13}(T)\vee v_{14}(T)$. Thus, the problem reduces to finding an input sequence that steers the state to $X^{*}$.
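For small networks, such a target state can be found by exhaustive search. The following sketch computes, for a given state $X$, the index sets $({\mathbf{J}}_{1},{\mathbf{J}}_{2})$ realized in (11), and searches over all states for one matching a desired pair; the function names are our own, and the search is meant only as an illustration of the remark above.

```python
# Illustrative brute-force search for a target state X* realizing (11);
# the helper names are our own assumptions.
from itertools import product

def realized_sets(m, X):
    """Return the sets (J1, J2) of (11) realized by a state X
    (dict mapping (i, j) -> 'YES' or 'OR')."""
    reach = {}                                   # node -> set of ('v'|'w', j)
    for j in range(1, 2 ** (m - 1) + 1):         # layer 1
        reach[(1, j)] = ({("v", j)} if X[(1, j)] == "YES"
                         else {("v", j), ("w", j)})
    for i in range(2, m + 1):                    # layers 2, ..., m
        for j in range(1, 2 ** (m - i) + 1):
            left = reach[(i - 1, 2 * j - 1)]     # inputs reaching v_ij
            right = reach[(i - 1, 2 * j)]        # inputs reaching w_ij
            reach[(i, j)] = left if X[(i, j)] == "YES" else left | right
    top = reach[(m, 1)]
    return ({j for (kind, j) in top if kind == "v"},
            {j for (kind, j) in top if kind == "w"})

def find_target_state(m, J1, J2):
    """Return a state X* whose relation (11) has index sets (J1, J2),
    or None if Problem 1 is infeasible for these sets."""
    nodes = [(i, j) for i in range(1, m + 1)
             for j in range(1, 2 ** (m - i) + 1)]
    for values in product(["YES", "OR"], repeat=len(nodes)):
        X = dict(zip(nodes, values))
        if realized_sets(m, X) == (set(J1), set(J2)):
            return X
    return None
```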

4 Learning

Now, we present a solution to Problem 1.

4.1 Flipping Principle

Let us provide a key principle, called the flipping principle, for solving Problem 1.

The following is a preliminary result to derive the flipping principle.

Lemma 4.

Consider the network $\Sigma(m)$ with $X(t)=\bar{X}$, where $t\in\{0,1,\ldots\}$ and $\bar{X}\in\{{\rm YES},{\rm OR}\}^{n}$ are arbitrarily given. Then the following statements hold.
(i) If $U(t)=(0,0,\ldots,0)\in\{0,1\}^{2n_{1}}$, then $y_{m}(t)=0$ and $X(t+1)=X(t)$.
(ii) If $U(t)=(1,0,\ldots,0)\in\{0,1\}^{2n_{1}}$, then $y_{m}(t)=1$ and $X(t+1)=X(t)$.

Proof.

In (i) and (ii), the relation $X(t+1)=X(t)$ follows from the network structure of $\Sigma(m)$ and Lemma 1, which states that, in (10), $x(t+1)=x(t)$ holds under $(v(t),w(t))=(0,0)$ or $(v(t),w(t))=(1,0)$. Next, Lemma 2 (i) (in particular, (11)) implies that $y_{m}(t)=0$ for $U(t)=(0,0,\ldots,0)$, which proves (i). On the other hand, it follows from Lemma 2 (ii) that $y_{m}(t)=1$ for $U(t)=(1,0,\ldots,0)$. This proves (ii). ∎

Lemma 4 implies that there exists an input value for $\Sigma(m)$ that sets the output of $\Sigma(m)$ to an arbitrary value while preserving the state. In particular, the state of $\Sigma(m)$ does not change under any input sequence that takes the value $(0,0,\ldots,0)$ or $(1,0,\ldots,0)$ at each time.
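As a quick sanity check, the following snippet applies the two inputs of Lemma 4 to the network of Fig. 3, assuming the hypothetical ConditioningNetwork class sketched in Section 2.2.

```python
# Checking Lemma 4 on the Sigma(3) instance of Fig. 3 (illustrative only;
# ConditioningNetwork is the hypothetical class sketched in Section 2.2).
X0 = {(1, 1): "YES", (1, 2): "YES", (1, 3): "OR", (1, 4): "YES",
      (2, 1): "OR", (2, 2): "OR", (3, 1): "YES"}
net = ConditioningNetwork(m=3, initial_states=X0, s=3)
before = net.state()
print(net.step([0, 0, 0, 0, 0, 0, 0, 0]))    # -> 0   (Lemma 4 (i))
print(net.step([1, 0, 0, 0, 0, 0, 0, 0]))    # -> 1   (Lemma 4 (ii))
assert net.state() == before                 # the state is preserved
```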

From Lemma 4, we obtain the flipping principle for learning of Σ(m)\Sigma(m).

Theorem 1.

Consider the network $\Sigma(m)$ with $X(t)=\bar{X}$ and node $(p,q)$, where $t\in\{0,1,\ldots\}$ and $\bar{X}\in\{{\rm YES},{\rm OR}\}^{n}$ are arbitrarily given. Then the following statements hold.
(i) There exists an input sequence $(U_{t},U_{t+1},\ldots,U_{t+s-1})\in\{0,1\}^{2n_{1}}\times\{0,1\}^{2n_{1}}\times\cdots\times\{0,1\}^{2n_{1}}$ (the Cartesian product of $s$ sets) such that

$$x_{ij}(t+s)=\left\{\begin{array}{ll}\bar{x}_{ij}(t) & {\rm if}~(i,j)=(p,q),\\ x_{ij}(t) & {\rm if}~(i,j)\in{\mathbf{N}}_{p}\setminus\{(p,q)\},\end{array}\right. \qquad (15)$$

under $U(t)=U_{t}$, $U(t+1)=U_{t+1}$, $\ldots$, $U(t+s-1)=U_{t+s-1}$, where $s\in\{1,2,\ldots\}$ is the unit training time defined for (10).
(ii) An input sequence satisfying (15) is given by $(\hat{U}_{(p,q)},\hat{U}_{(p,q)},\ldots,\hat{U}_{(p,q)})$ (constant on the time interval $\{t,t+1,\ldots,t+s-1\}$), where $\hat{U}_{(p,q)}\in\{0,1\}^{2n_{1}}$ is the input value that is divided into $2^{m+1-p}$ blocks of the same size and whose blocks are given as follows:

$$\begin{array}{rl}\mbox{$(2q-1)$-th block:} & \left\{\begin{array}{ll}(0,0,0,\ldots,0) & {\rm if}~x_{pq}(t)={\rm OR},\\ (1,0,0,\ldots,0) & {\rm if}~x_{pq}(t)={\rm YES},\end{array}\right.\\ \mbox{$2q$-th block:} & (1,0,0,\ldots,0),\\ \mbox{Other blocks:} & (0,0,0,\ldots,0).\end{array} \qquad (21)$$
Proof.

Statements (i) and (ii) are proven by showing that (15) holds for the input sequence $(\hat{U}_{(p,q)},\hat{U}_{(p,q)},\ldots,\hat{U}_{(p,q)})$ specified in (ii).

By definition, the network $\Sigma(m)$ can be represented as the cascade connection of $m$ layers, as shown in Fig. 4. As can be seen by comparing Figs. 2 and 4, the entire left side of the $i$-th layer is the parallel system of $2n_{i}$ subsystems, denoted by $S_{1},S_{2},\ldots,S_{2n_{i}}$, as shown in Fig. 5. Each subsystem is equivalent to the network with $i-1$ layers, i.e., $\Sigma(i-1)$. This allows us to apply Lemma 4 to each subsystem, because Lemma 4 holds for any $m\in\{1,2,\ldots\}$.

Now, consider node $(p,q)$. Suppose that $\Sigma(m)$ is represented as in Fig. 5 for $i=p$, and let $Z_{k}(t)\in\{{\rm YES},{\rm OR}\}^{\nu_{p}}$ be the state of the subsystem $S_{k}$, where $\nu_{p}=\sum_{i=1}^{p-1}2^{i-1}$. Note here that the following statements are equivalent:

  • $Z_{k}(t+s)=Z_{k}(t)$ for every $k\in\{1,2,\ldots,2n_{p}\}$.

  • $x_{ij}(t+s)=x_{ij}(t)$ for every $(i,j)\in{\mathbf{N}}_{1}\cup{\mathbf{N}}_{2}\cup\cdots\cup{\mathbf{N}}_{p-1}$.

We divide the proof into two cases: $x_{pq}(t)={\rm OR}$ and $x_{pq}(t)={\rm YES}$. First, we address the case $x_{pq}(t)={\rm OR}$. If $x_{pq}(t)={\rm OR}$ and $U(t)=\hat{U}_{(p,q)}$, it follows from Lemmas 3 and 4 (Lemma 4 being applied to $\Sigma(p-1)$) that $(v_{pq}(t),w_{pq}(t))=(0,1)$, $(v_{pj}(t),w_{pj}(t))=(0,0)$ for $j\in\{1,2,\ldots,n_{p}\}\setminus\{q\}$, and $Z_{k}(t+1)=Z_{k}(t)$ for $k\in\{1,2,\ldots,2n_{p}\}$. Thus, if $U(t)=\hat{U}_{(p,q)}$, $U(t+1)=\hat{U}_{(p,q)}$, $\ldots$, $U(t+s-1)=\hat{U}_{(p,q)}$, then

  • $x_{pq}(t+s)={\rm YES}=\bar{x}_{pq}(t)$,

  • $x_{pj}(t+s)=x_{pj}(t)$ for $j\in\{1,2,\ldots,n_{p}\}\setminus\{q\}$,

  • $Z_{k}(t+s)=Z_{k}(t)$ for $k\in\{1,2,\ldots,2n_{p}\}$.

This implies (15). The other case $x_{pq}(t)={\rm YES}$ can be proven in the same manner; the only difference is that $(v_{pq}(t),w_{pq}(t))=(1,1)$ holds when $x_{pq}(t)={\rm YES}$ and $U(t)=\hat{U}_{(p,q)}$. ∎

Figure 4: Layer-based representation of the network $\Sigma(m)$.
Figure 5: Another layer-based representation of the network $\Sigma(m)$.

Theorem 1 implies that we can flip the state of any node while preserving the states of the other nodes in the layer to which the flipped node belongs and in its upstream layers.

Example 1.

Consider the network $\Sigma(3)$ in Fig. 3, where $X(0)=({\rm YES},{\rm YES},{\rm OR},{\rm YES},{\rm OR},{\rm OR},{\rm YES})$. By the input sequence $(\hat{U}_{(2,1)},\hat{U}_{(2,1)},\ldots,\hat{U}_{(2,1)})$ with $\hat{U}_{(2,1)}=((0,0),(1,0),(0,0),(0,0))$ (written blockwise as in (21)), the state of node $(2,1)$ is flipped while preserving the states of nodes $(1,1)$, $(1,2)$, $(1,3)$, $(1,4)$, and $(2,2)$. Fig. 6 shows $\Sigma(3)$ with the resulting state $X(s)$. $\Box$
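The training input $\hat{U}_{(p,q)}$ of Theorem 1 (ii) can be generated mechanically from (21). Below is a small sketch in Python; the function name training_input is our own, and the printed value is the $\hat{U}_{(2,1)}$ used in Example 1 above.

```python
# Constructing the training input U_hat_(p,q) of Theorem 1 (ii) from (21);
# the function name is an illustrative assumption.

def training_input(m, p, q, x_pq):
    """Return U_hat_(p,q) as a flat 0/1 list of length 2^m, assembled from
    the 2^(m+1-p) blocks of size 2^(p-1) specified in (21)."""
    block_size = 2 ** (p - 1)
    num_blocks = 2 ** (m + 1 - p)
    one_block = [1] + [0] * (block_size - 1)      # (1, 0, ..., 0)
    zero_block = [0] * block_size                 # (0, 0, ..., 0)
    U = []
    for k in range(1, num_blocks + 1):
        if k == 2 * q - 1:                        # (2q-1)-th block
            U += one_block if x_pq == "YES" else zero_block
        elif k == 2 * q:                          # 2q-th block
            U += one_block
        else:                                     # other blocks
            U += zero_block
    return U

print(training_input(3, 2, 1, "OR"))              # -> [0, 0, 1, 0, 0, 0, 0, 0]
```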

4.2 Learning Algorithm

Theorem 1 implies that we can steer the state of the network $\Sigma(m)$ from any value to any other value by flipping the states of the nodes one by one from the upstream side. Based on this idea, we obtain the following algorithm to solve Problem 1.

Algorithm 1

(Step 1)

Let $X^{*}\in\{{\rm YES},{\rm OR}\}^{n}$ be a state associated with the desired input-output relation in (11) and let $x^{*}_{ij}\in\{{\rm YES},{\rm OR}\}$ be its element corresponding to node $(i,j)$. Let also $k:=0$ and $\hat{\mathbf{N}}:={\mathbf{N}}$.

(Step 2)

Pick the minimum pair $(i,j)$ from $\hat{\mathbf{N}}$ in lexicographic order.

(Step 3)

If $x_{ij}(ks)\neq x^{*}_{ij}$, apply the input sequence $(\hat{U}_{(i,j)},\hat{U}_{(i,j)},\ldots,\hat{U}_{(i,j)})$ to the network $\Sigma(m)$, i.e., $U(ks)=\hat{U}_{(i,j)}$, $U(ks+1)=\hat{U}_{(i,j)}$, $\ldots$, $U(ks+s-1)=\hat{U}_{(i,j)}$, and let $k:=k+1$.

(Step 4)

Let $\hat{\mathbf{N}}:=\hat{\mathbf{N}}\setminus\{(i,j)\}$. If $\hat{\mathbf{N}}\neq\emptyset$, go to Step 2; otherwise, halt.

In the algorithm, $k$ counts the number of nodes whose state has been flipped, and $\hat{\mathbf{N}}$ is the list of nodes for which the algorithm has not yet checked whether the state needs to be flipped. In Step 1, $X^{*}$ is determined from (11), and $k$ and $\hat{\mathbf{N}}$ are initialized. Step 2 picks a node $(i,j)$ from the list $\hat{\mathbf{N}}$. Step 3 checks whether the state of node $(i,j)$ has to be flipped; if so, the state is flipped by applying the training input sequence specified in Theorem 1. Note that $s$ steps of actual time elapse while this input sequence is applied to $\Sigma(m)$. In Step 4, node $(i,j)$ is removed from the list $\hat{\mathbf{N}}$ and the termination condition is checked: if $\hat{\mathbf{N}}$ is empty, the algorithm terminates; otherwise, the above procedure is repeated for a remaining node in the list $\hat{\mathbf{N}}$.
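The following sketch implements Algorithm 1, assuming the hypothetical ConditioningNetwork class (Section 2.2 sketch) and the training_input helper (Section 4.1 sketch).

```python
# A sketch of Algorithm 1; ConditioningNetwork and training_input are the
# hypothetical helpers from the earlier sketches, not from the paper.

def learn(net, X_star, s):
    """Steer the state of net to X_star by flipping nodes one by one in
    lexicographic order; return the list of generated training inputs."""
    applied = []
    for (i, j) in sorted(net.gates.keys()):          # Step 2: lexicographic order
        x_ij = net.gates[(i, j)].state               # current state x_ij(ks)
        if x_ij != X_star[(i, j)]:                   # Step 3: flip if needed
            U_hat = training_input(net.m, i, j, x_ij)
            for _ in range(s):                       # apply U_hat for s steps
                net.step(U_hat)
            applied.append(U_hat)
    return applied                                   # k* = len(applied), T = k*·s
```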

For the algorithm, we obtain the following result.

Theorem 2.

Consider Problem 1. Assume that there exists a vector $X^{*}\in\{{\rm YES},{\rm OR}\}^{n}$ such that (11) is equivalent to $X(T)=X^{*}$. Let $k^{*}\in\{0,1,\ldots\}$ be the value of $k$ when Algorithm 1 terminates, and let ${\mathbb{U}}(k)$ ($k=0,1,\ldots,k^{*}-1$) be the input sequences generated in Step 3 of Algorithm 1. Then a solution to the problem is given by $T=k^{*}s$ and $({\mathbb{U}}(0),{\mathbb{U}}(1),\ldots,{\mathbb{U}}(k^{*}-1))$. $\Box$

5 Example

Consider the network $\Sigma(3)$ in Fig. 3, where $X(0)=({\rm YES},{\rm YES},{\rm OR},{\rm YES},{\rm OR},{\rm OR},{\rm OR})$. Assume that $s=3$. Let us show how Algorithm 1 solves Problem 1 for ${\mathbf{J}}_{1}=\{1,3,4\}$ and ${\mathbf{J}}_{2}=\{3\}$.

In Step 1, we have $X^{*}=({\rm YES},{\rm YES},{\rm OR},{\rm YES},{\rm YES},{\rm OR},{\rm YES})$. The algorithm then generates the input sequence ${\mathbb{U}}(0)=(\hat{U}_{(2,1)},\hat{U}_{(2,1)},\hat{U}_{(2,1)})$ in Step 3 at $k=0$, which flips the state of node $(2,1)$. The result is shown in Fig. 6. Next, the algorithm generates the input sequence ${\mathbb{U}}(1)=(\hat{U}_{(3,1)},\hat{U}_{(3,1)},\hat{U}_{(3,1)})$ in Step 3 at $k=1$, which flips the state of node $(3,1)$. As a result, we obtain the system in Fig. 7 with the output $y_{m}(2s)=v_{11}(2s)\vee v_{13}(2s)\vee w_{13}(2s)\vee v_{14}(2s)$.
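Under the assumptions of the earlier sketches (the hypothetical ConditioningNetwork, find_target_state, realized_sets, training_input, and learn helpers), this instance can be run end to end as follows.

```python
# Running the learning procedure on the Section 5 instance (illustrative only).
X0 = {(1, 1): "YES", (1, 2): "YES", (1, 3): "OR", (1, 4): "YES",
      (2, 1): "OR", (2, 2): "OR", (3, 1): "OR"}        # X(0) of Section 5
net = ConditioningNetwork(m=3, initial_states=X0, s=3)
X_star = find_target_state(3, J1={1, 3, 4}, J2={3})    # a target state for (11)
sequences = learn(net, X_star, s=3)
print(realized_sets(3, net.state()))                   # -> ({1, 3, 4}, {3})
print(len(sequences) * 3)                              # total training time T = k*·s
```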

Figure 6: Network $\Sigma(3)$ at the state $X(s)$, resulting from the input sequence ${\mathbb{U}}(0)=(\hat{U}_{(2,1)},\hat{U}_{(2,1)},\hat{U}_{(2,1)})$ that flips the state of node $(2,1)$.
Figure 7: Network $\Sigma(3)$ at the state $X(2s)$, resulting from the input sequence ${\mathbb{U}}(1)=(\hat{U}_{(3,1)},\hat{U}_{(3,1)},\hat{U}_{(3,1)})$ that flips the state of node $(3,1)$.

6 Conclusion

We have presented a learning method for a network of nodes each of which can implement classical conditioning. Based on the principle that the state of any node can be flipped while preserving the state of some other nodes, an algorithm has been derived.

The proposed algorithm applies only to networks with a tree structure. In the future, we plan to generalize our framework to handle a more general class of networks. It would also be interesting to address other types of gates, for example, gates whose states correspond to logical operators other than YES and OR.

Acknowledgment

This work was supported by Grant-in-Aid for Transformative Research Areas (A) 20H05969 from the Ministry of Education, Culture, Sports, Science and Technology of Japan.

Declarations

Conflict of interest: The authors declare that they have no conflict of interest.

References

  • [1] S. Murata, T. Toyota, S.I.M. Nomura, T. Nakakuki, and A. Kuzuya, “Molecular cybernetics: challenges toward cellular chemical AI,” Advanced Functional Materials, vol. 32, no. 37, pp. 2201866, 2022.
  • [2] A. Hart-Davis, Pavlov’s Dog: Groundbreaking Experiments in Psychology, Elwin Street Limited, 2015.
  • [3] C.C. Aggarwal, Neural Networks and Deep Learning: A Textbook, Springer, 2018.
  • [4] S. Kauffman, “Homeostasis and differentiation in random genetic control networks,” Nature, vol. 224, no. 5215, pp. 177–178, 1969.
  • [5] J. Sun, A.A.R. AlMomani, and E. Bollt, “Data-driven learning of Boolean networks and functions by optimal causation entropy principle,” Patterns, vol. 3, no. 11, pp. 100631, 2022.