Building a quantum superposition of conscious states with integrated information theory
Abstract
Could there be a quantum superposition of consciousness, as in the Wigner’s friend thought experiment? The integrated information theory (IIT) of consciousness has turned this into a well-defined question. According to IIT, consciousness is a measurable physical quantity given by integrated information (), such that the amount of consciousness in a system corresponds to its amount of . We use the most recent IIT formalism (IIT4.0) to analyze the simplest non-zero system known as a feedback dyad. We then propose a circuit that puts the dyad into a superposition of states which, according to IIT, would correspond to a superposition of conscious states. We refer to this as “Schrödinger’s dyad”. We therefore show that either IIT is false or the simple dyad is conscious and can easily be put into a superposition of conscious states. We then identify the simplest possible consciousness-collapse model, which predicts that this superposition is unstable and collapses at a rate determined by a measure of difference between the superposed conscious states. Our analysis will enable us to make a number of key observations about the general structure of integrated information theory (IIT2.0, IIT3.0, IIT4.0, and QIIT) and the general structure of consciousness-collapse models.
1 Introduction
Could there be a quantum superposition of consciousness? This question was raised by Eugene Wigner in the thought experiment that is now known as “Wigner’s Friend”. Wigner imagined his friend, in a nearby sealed lab, making a quantum measurement. Wigner, who is uncertain of his friend’s result, wonders whether he should consider his friend to have entered a quantum superposition of experiencing different results. Wigner argued that this is “absurd because it implies that my friend was in a state of suspended animation”. He then concluded that “consciousness must have a different role in quantum mechanics than the inanimate measuring device” ([51, p.180]).
There has since been much speculation by physicists and philosophers over whether states of consciousness could be superposed and what that would even mean. For example, there have been many attempts to extend the Wigner’s friend scenario and the associated epistemological and metaphysical implications ([21], [14], [19], [13], [52]). There have also been many attempts to make sense of superpositions of conscious states in many worlds and many minds interpretations of quantum mechanics ([20], [44], [50], [33], [16], [5], [8], [31], [32]). However, without any well-defined criteria for determining which physical states are conscious (and to what degree), the question of whether there could be such a superposition, and what it would be like to be in one, is difficult to evaluate.
Recent neuroscience, on the other hand, has seen the rise of mathematical theories of consciousness, notably, the integrated information theory, or IIT for short ([46], [47], [40], [48], [4])). IIT associates systems with both quantitative amounts of consciousness (roughly, the amount of integrated information in the system, denoted by the symbol ) and qualitative states of consciousness (roughly, the “shape” of the system’s integrated information, or its “Q-shape”). More recently, IIT has been extended into the quantum domain in a framework known as QIIT ([53], [27], [3]). Inspired by these results, Wigner’s suggestion that consciousness may be responsible for the collapse of the wave function has been resurrected in models that use integrated information as a criterion for collapse ([28], [17]). In comparison to standard collapse models [11], it has been claimed that IIT-based consciousness-collapse models may be much easier to experimentally test, since they can be tested by the right sorts of quantum computers, if only we could design the right sort of circuit [17].
In this paper, we propose such a circuit which, if implemented, would put a simple quantum computer into a superposition of states of conscious experience according to the IIT definition of consciousness. Following [17], we consider the simplest non-zero system, a feedback dyad. Classically, the dyad has four possible states: (0,0), (1,1), (0,1), and (1,0). Each state is predicted to have a tiny amount of consciousness. This prediction is robust across successive IIT formalisms. Each dyad state has in IIT2.0 and in IIT3.0, as shown in [37]. Here, we show that each dyad state has in IIT4.0 (section 3) and in QIIT (appendix B). Although these states have the same amount of consciousness, they yield different states of consciousness, because they are associated with different Q-shapes, as we show in section 4. The dyad in a superposition of two of its four possible states is therefore the simplest consciousness superposition predicted by IIT. We refer to this as “Schrödinger’s dyad” and we propose a simple quantum circuit that allows Schrödinger’s dyad to be built.
We would like to stress that we are not endorsing IIT, and so we remain agnostic on whether the dyad is conscious in any meaningful sense. IIT has been shown to be consistent with a number of important experimental results in neuroscience ([34], [15], [22], [2], [30], [29], [38]). However, many criticisms of IIT have also been proposed, and we are sympathetic with some of them ([23], [1], [12], [7], [18], [41]). Either way, what we show is that unless one drastically revises IIT (e.g. [36], [35]), then either IIT is false or the dyad is conscious and can easily be put into a superposition of conscious states. We leave it to the reader to decide between these options.
In section 5 we identify the simplest possible consciousness-collapse model, which predicts that Schrödinger’s dyad is unstable and collapses at a rate determined by a measure of difference between the superposed conscious states. We take the Q-shapes defined in section 4, and use them to define the simplest possible collapse operators. This toy model makes a number of important properties of such models transparent. We then compare our toy-model to the more general consciousness-collapse model proposed in [17].
Finally, in section 6 we propose a physical implementation of Schrödinger’s dyad, in which two photons enter into a feedback loop inside an optical cable. On the one hand, the implementation may potentially falsify the simplest versions of the IIT-based consciousness collapse models. On the other hand, the example raises a difficulty with IIT when it comes to physical implementation: IIT assumes that there is always an objective fact of the matter about what the basic causal units in a physical system are.
In addition to identifying this prediction of IIT, our analysis helps to reveal much about the structure of IIT. For example, we resolve a crucial ambiguity in IIT in which logic gates are treated as having binary states (section 2). We also identify a subtle inconsistency between the IIT4.0 description of the dyad and the axioms of IIT4.0 (appendix A).
The paper is organized as follows. Section 2 describes the classical feedback dyad. Section 3 shows how to calculate the classical dyad’s using IIT4.0. Section 4 provides a simple way of describing the classical dyad’s Q-shape. Section 5 explains the simple consciousness-collapse model. Section 6 proposes a physical implementation of the dyad, which may test the model, but which also raises questions about how to understand causality in IIT.
2 The feedback dyad
The classical dyad is a simple system consisting of two elements or channels, A and B, that simply swap their states from one time step to the next. That is, if at some time, , A is in state 1 and B is in state 0, then at the next time step, , A is in state 0 and B is in state 1. The action on these channels is equivalent to a logical SWAP gate which is given a simple diagrammatic representation in Figure 1.
The figure makes it clear that there are three distinct levels of description to the dyad: channels, channel values, and channel relationships. A and B are the channels that are related via the logical SWAP gate in such a way as to exchange their values. In the language of quantum information, the channels are systems, the channel values are states, and the channel relationships are transformations. This is a crucial point. The SWAP gate is a transformation of the states of systems A and B. The gate itself is never “in a state” on its own.
This is an important distinction because gates are frequently described as being in a state that possesses a value, especially in IIT3.0 [40]. In particular, the elements or nodes in the IIT3.0 diagrams have binary states but are also treated as being logic gates. If the nodes are understood as neurons, then they are considered as being in an “active” or an “inactive” state [9], much like a channel. Yet the neurons are also said to act like gates by only activating in response to the right combination of connections to other neurons that are themselves either active or inactive. But this is really a notational relic from the early days of Boolean networks [24, 25] that ignores what is happening at a more granular level. In the neuronal case, an “active” or “inactive” neuron really refers to whether it sends a signal via some channel, i.e. it represents an action. The difference is typically unimportant at the granularity in which it is usually considered. But when considering quantum models of these networks, this treatment breaks down. As such, it is the states of the channels that can be in superposition, not the gates themselves. In a neuronal sense, it is thus conscious states that are in superposition, not the physical neurons themselves.
Figure 1 also highlights a fundamental causal dependence in the dyad. The output of channel A causally depends on the input to channel B and vice-versa. In order to emphasize this point, we use capital letters to identify the channels or systems themselves and lowercase letters to identify the values the channels can attain, i.e. their states. One could think of the SWAP gate as a black box with the channels simply identifying the locations of the inputs and outputs of the box. Values are fed into the inputs and then produced by the outputs.
To develop a feedback system with this SWAP gate we simply feed the outputs directly back into the inputs. For simplicity we can represent this system over a series of time steps in the manner shown in Figure 3.
The output at a given time step is given as in Figure 1. For example, if the system at a given time step is given by where the first element of the set is the state of channel A and the second is the state of channel B, if the inputs were the evolution of the system state over time is just .
Creating Schrödinger’s dyad then requires that we treat the channels as quantum and represent their states as such. That is, a classical state is equivalent to a pure quantum state in the so-called computational basis . A superposition of the and states can be achieved by feeding the superposition state
(1) |
into channel at . The input state to the dyad as a whole at is
(2) |
which then evolves into the following state at :
(3) |
This is not a superposition of values, since all four possible classical states of the dyad have the same value. It is, however, a superposition of distinct Q-shapes according to IIT3.0 and IIT4.0, as we show in section 4. And so according to IIT, the state of equation 3 represents a superposition of qualitatively distinct states of consciousness. We begin by calculating the dyad’s .
3 Calculating the amount of consciousness () in the feedback dyad
The general procedure for calculating and Q-shape takes many steps. Fortunately, the simplicity of our dyad allows us to skip several steps and to emphasize the most important ones. We explain the more general case in appendix A.
The dyad consists of two parts, A and B. We begin by calculating the integrated cause information and the integrated effect information of each part. Integrated information concerns how much information is lost by partitioning the system, which means replacing a causal relationship with noise where the noise is represented as an equiprobable distribution over all possible states.
To illustrate, let us calculate how much integrated effect information A has, given its present state, about the next state of each of the system’s parts, A and B. The maximum of these defines A’s integrated effect information.
Given that our dyad is a SWAP gate, it is trivially true that A’s present state has zero integrated effect information about A’s next state since A’s next state is entirely determined by B’s present state. Put another way, A’s possible next states are all equally probable given its current state. So introducing a partition that induces noise between A at and A at makes no difference. This makes sense given that there is no causal connection between them in the first place: A affects B but not itself in the next time step. A’s present state is not causally connected to A’s future state and so there is no integrated effect information.
However, A’s present state does fully determine B’s future state and so if, for example, our system’s present state is (1,0), equation 39 in [4] tells us that the integrated effect information of A’s state at time given that it is in state 1 at that instant, is
(4) |
Here, is the probability that B will be in state at time given that A is currently in state 1. It is trivially true that this equals 1. Likewise represents the probability that B will be in state 1 at time given the partition which sets the value of channel A to an equiprobable distribution of the two possible states. In other words, the partition replaces the effect that A had on B with noise, which means that B’s future state is randomly determined. Since there are only two possible states, that means that . As such, we have
(5) |
The same basic equation tells us that the integrated effect information of B’s state at time , , also equals 1.
The integrated cause information for A is calculated in a slightly different manner and illustrates a time asymmetry in the equations of IIT. As in the effect case, the past state of A contains no information about the present state of A, and likewise for B. We only consider the information B’s past state has on A’s current state and the information A’s past state has on B’s current state. Specifically, given a current state of (1,0), equation 42 in [4] gives
(6) |
where is calculated according to Bayes’ rule as follows:
(7) |
where and are unconstrained probabilities (see equations 6-8 in [4]) and are both equal to 0.5 since, at any given time step and with no knowledge of past or future states, the probability that we will find either channel in a given state is 0.5 because there are only two states. Here we also have that is the probability that A’s current state is 1 if B’s past state is 1 and is the probability that B’s past state was 1 given that A’s state is currently 1. As before, noises the system and is equal to 0.5. Since , Bayes’ rule given by equation (7) tells us that . As before, then, we find that . Likewise, the same process tells us that also equals 1.
Equation 45 in [4] then tells us that the integrated information of a part is the minimum of its integrated effect and integrated cause information, i.e.
(8) | ||||
(9) |
respectively, which are both trivially 1.
The amount of consciousness () in the state of the whole system is then simply a sum of the integrated information of the smaller subsystems as calculated above. The state of the dyad at the time therefore has
(10) | ||||
units of consciousness.
No matter which of its four possible states the dyad is in, all of the above reasoning applies, and we find that it always has two units of consciousness. It is therefore not possible to put the dyad into a superposition of -values. What we can do, however, is put the system into a superposition of different states of consciousness.
To understand this distinction intuitively, compare experiencing a green screen with experiencing a blue screen. It might be that these two experiences do not correspond to any difference in (why would changing only the color change the amount of consciousness?). Now imagine that we put a subject into a superposition of experiencing a blue screen and experiencing a green screen. By assumption this is not a superposition, but it is clearly a superposition of distinct conscious experiences. One might doubt that distinct human states of consciousness could ever have identical [26], but IIT allows for this in AI, and IIT3.0 and IIT4.0 predict that this is indeed the case for our simple dyad, as we now explain.
4 Calculating the state of consciousness (Q-shape) of the feedback dyad
If two qualitatively distinct states of consciousness are quantitatively identical (i.e. they have identical ), then their distinctness must come down to the different ways in which each state generates that -value. This difference is what is captured in a Q-shape.111This has come under various labels in the literature. In [40] it is primarily referred to as a “maximally irreducible conceptual structure (MICS)”. But it is also referred to as a “shape in qualia space”, and so we adopt the simpler terminology, “Q-shape”. In [4] it is referred to as a “-structure”. In this section we define Q-shapes for all four states of the dyad. We show that these Q-shapes are distinct. It follows that IIT (as presently formulated) must treat these states as corresponding to qualitatively distinct states of consciousness. Finally, we discuss some differences in how Q-shapes are understood in IIT3.0 versus IIT4.0, which will be relevant to the collapse model proposed in the next section.
The dyad states and each have , but for different reasons. This can be seen by partitioning the dyad, replacing some of the parts by noise, as defined above, and then noting that and induce different forward and backward probability distributions. These different distributions lead to different Q-shapes. In the general case of more complex systems, we also have to weigh the parts according to their individual values of . The simple structure of the dyad allows us to bypass this (since ), but we will return to the more general case in the next section.
We begin with part A, when the dyad is in state . The prescription of partitioning means that we replace the complement of A (that is, B) by noise, i.e. an equiprobable distribution of and , while keeping in state . Evolving this forward in time, we obtain a probability distribution , where we have labelled the four states in lexicographical order: . Evolving it backwards in time, i.e. retrodicting the dyad’s state at one time step earlier, we obtain exactly the same probability distribution. This gives us the first two rows in the Q-shape matrix
(11) |
The third and fourth row are the forward (effect) and backward (cause) probability distributions that we obtain if we consider the subsystem B instead, keeping it in state and replacing A by noise as above. Thus, the Q-shape of a given state (such as ) is a collection of four probability distributions over the four dyad states, represented by the four rows in our representation matrix.
Performing the calculation for the other dyad states (which each have due to the two parts always having =1), we obtain
(12) |
Our Q-shapes are not really “shapes”; they are just matrices of probability distributions. But we can turn them into shapes by following the IIT3.0 prescription described in [40] (see especially Figures 10-12). To obtain such visualizations for our dyad states, we simply interpret two probability distributions over the dyad’s state space (which have four real entries each) as an element of the eight-dimensional vector space . This is the phase space of the dyad. Let us use this to build the “shape” corresponding to from equation 11 above. Consider the first two rows of that matrix. They determine the location of part A. The last two rows determine the location of part B. This gives us two points in the eight-dimensional space. Since in all our cases, it does not help us to distinguish Q-shapes, so we have ignored it.
IIT3.0 therefore predicts that the dyad is (minimally) conscious, and can be in one of four qualitatively distinct conscious states. It is natural therefore to wonder what it is like to be the dyad, and what these qualitative differences actually consist of. This is a question that IIT actually aims to answer. That is, IIT wants to be able to say something about what the experience of any given conscious system is like, especially when the system is incapable of verbal reports. The general idea is to extrapolate from features of our own Q-shapes.
For example, consider what it is like to be an echolocating bat. In [39] it was famously argued that this question is intractable. However, more recently in [49] it was argued that IIT makes it tractable. The idea is to consider the general properties of human visual experience Q-shapes and human auditory experience Q-shapes. Then, we compare them with bat experience Q-shapes. If bat experience Q-shapes are “more similar” to, say, human auditory experience Q-shapes, then we can say something about what it is like to be a bat (it is more like human auditory experience than human visual experience). Of course for both human and bat experience, deriving exact Q-shapes is far too complicated. Consequently, there is also no straightforward way to compare the dyad Q-shapes with (aspects of) our Q-shapes.
Nonetheless, there is a curious discussion about this in [40] (see Figure 19), that considers a system that is only slightly more complex than our dyad, which they call a “photodiode”. It also involves two parts, labelled ‘D’ and ‘P’, that specify each other’s states at each time step. (The main difference is that D receives two external inputs and has a threshold 2. All connections have weight 1. Meanwhile P serves as a memory for the previous state of D and its feedback to D serves as a predictor of the next external input by effectively decreasing the threshold of D.) Despite these differences, its Q-shapes are very similar to our dyad’s Q-shapes. They also involve two points in an 8D space. About its experience, they say the following:
“It is instructive to consider the quality of experience specified by such a minimally conscious photodiode. […] D says something about P’s past and future, and P about D’s, and that is all. Accordingly, the shape in qualia space is a constellation having just two [points], and is thus minimally specific. […] Moreover, the symmetry of the [Q-shape] implies that the quality of the experience would be the same regardless of the system’s state: the photodiode in state DP=00, 01, or 10, receiving one external input, generates exactly the same [Q-shape] as DP=11. In all the above cases, the experience might be described roughly as “it is like this rather than not like this”, with no further qualifications. The photodiode’s experience is thus both quantitatively and qualitatively minimal.”
If all four states of the photodiode have the same Q-shape, then they must all correspond to the same probability distributions. For as they say (in the IIT3.0 jargon), the probability distributions (or “cause effect repertoires”) for each part (or each “concept”) specifies what each part “contributes to the quality of the experience”. (Meanwhile, the of each part is said to be “how much” the part is present in experience.) But as we have seen, our feedback dyad does not yield this result: the four possible states correspond to distinct Q-shapes. It is therefore not possible to simply describe each of the four possible conscious states of the dyad as “it is like this rather than not like this”. What could the differences in our four dyad Q-shapes possibly translate to in experience? These are difficult questions for IIT.
We have mostly followed the IIT3.0 rather than the IIT4.0 prescription for building Q-shapes. In IIT4.0, they are somewhat simpler, in that they replace probability distributions with states (see equation (56) in [4]). In particular, the IIT4.0 Q-shape of any dyad state is given by the -values of A and B as well as the states that these -values were maximized over. So in the case of , A and B both have ; for A this was maximized over B being in state 1 (i.e. ), while for B this was maximized over A being in state 0 (i.e. ). For , A and B both have ; for A this was maximized over B being in state 0 (i.e. ), while for B this was maximized over A being in state 1 (i.e. ). The four states of the dyad therefore correspond to distinct Q-shapes in IIT4.0, consistently with IIT3.0.
The choice of how to represent Q-shapes here seems somewhat arbitrary, as both options satisfy the constraint of identifying differences in how the parts contributed to an overall -value for the system. However, as we explain in the next section, the IIT3.0 choice is much better suited for a certain application of IIT: defining a fully general consciousness-collapse model.
5 The simplest consciousness-collapse model
In [17] a dynamical collapse model is proposed in which Q-shape superpositions are unstable and tend to collapse. The following general form for continuous collapse models ([10, p.27]) is used:
(13) |
The first term on the right-hand side of the equation represents Schrödinger evolution, while the remaining two terms represent the collapse evolution. Here, is the Hamiltonian of the system, is a real-valued parameter governing the collapse rate, is a collapse operator whose eigenstates the system collapses towards, is its expected value at time , and is a noise process which ensures that collapse happens stochastically at a rate determined by a measure of difference between the superposed eigenstates.
The pure state therefore evolves stochastically. All statistical predictions that we can extract from this state are linear in due to the Born rule. Hence, given a single realization of the process (13), all statistical predictions (say, about outcomes of any measurement that we might decide to perform at some point while the process unfolds) can be computed from [10]. As a consequence of (13), this resulting state evolves according to the Lindblad equation
(14) |
(for the derivation see e.g. [10]). Hence, the system can evolve via Schrödinger dynamics, via collapse, or via some combination of the two. To understand the collapse term we can ignore the Schrödinger dynamics term by setting its Hamiltonian to zero, . The collapse term only has an effect when the system is in a superposition of eigenstates of . In this situation, the double commutator will be non-zero and the state will evolve. The “speed” at which it evolves is a function of the eigenvalues of . This is because the th matrix entry of the double commutator in ’s eigenbasis is
(15) |
The dampening of the off-diagonal elements of occurs at a rate that grows with where if the system is in a superposition. We see that the eigenbasis of determines the collapse basis, i.e. the basis in which the state becomes “classical”, while its eigenvalues tell us which superpositions of pairs of such states are removed more quickly (namely, those with large ).
Let us now use this prescription to construct the simplest possible consciousness collapse model for the dyad. Subsequently, we will compare this with the more general, but more involved approach in [17]. For the moment, let us only mention that our simple model contains only a single collapse operator, whereas the one in [17] involves several such operators, generalizing Eq. (13). We will say more about the similarities and differences below.
The four states of the dyad are mutually distinct states of consciousness, spanning the total Hilbert space. Therefore, we expect a consciousness-collapse model to lead to a state for large times that is diagonal in that basis. Therefore, our collapse operator will have the form
(16) |
with four eigenvalues . Any consciousness-collapse model should arguably imply the following principle for the choice of those eigenvalues:
If two states of the dyad (say, and ) are qualitatively very different states of consciousness, then superpositions of these states should vanish very quickly, i.e. should be very large.
That is, it is natural to allow superpositions of “qualitatively similar” states to persist for longer, while qualitatively different states must decohere quickly.
For a quantitative application of this prescription, we need a way to compare states of consciousness, i.e. a distance measure on Q-shapes. Since Q-shapes are collections of probability distributions, it is natural to define their distance in terms of distance measures on probability distributions, which is a classical and well-studied topic in information theory. The preferred distance measures on probability distributions in IIT have changed in almost every successive version. IIT2.0 used the well-known Kullback-Leibler divergence. IIT3.0 used Earth Mover’s distance [42]. IIT4.0 uses the intrinsic difference measure from section 3. IIT3.0’s measure was explicitly turned into a generalized distance measure for Q-shapes. IIT4.0’s measure is not so well suited for this task, a point we will return to at the end of the section.
A natural choice is to define the distance of two Q-shapes (i.e. with rows ) and as
(17) |
where is some choice of distance measure on the set of probability distributions. That is, the distance of two Q-shapes is the sum of the distances of their probability distributions. (This is precisely the form of IIT3.0’s extended Earth mover’s distance measure.)
Now we have a large choice of possible distance measures at our disposal. However, note that the four Q-shapes of the dyad (Eqs. (11) and (12)) consist of a small variety of very simple probability distributions only: all entries are or , and any two rows are either equal, or they differ in all entries. Two identical rows must have distance zero. Furthermore, it is natural to demand that every two probability distributions arising as rows in these Q-shapes that differ in all four places all have the same distance, which we can set to unity by a choice of scaling factor. For example,
We can then determine the distances between all pairs of Q-shapes of the dyad and obtain the following values, writing as the -entry of a table:
Q(0,0) | Q(0,1) | Q(1,0) | Q(1,1) | |
---|---|---|---|---|
Q(0,0) | ||||
Q(0,1) | ||||
Q(1,0) | ||||
Q(1,1) |
Let us now return to our consciousness-collapse principle. Formulating it in terms of this distance measure, it reads: If the distance between two Q-shapes and is large, then the distance between the eigenvalues and of the collapse operator must also be large.
This desideratum could always be satisfied by the arbitrary prescription to make all eigenvalues extremely large and distant from each other. However, this would typically induce almost-instantaneous collapse, a behavior that we do not expect for simple systems such as the dyad. Thus, we are searching for a choice of eigenvalues that is as tame as possible while still satisfying the above postulate.
This leads us to define the eigenvalues in terms of an optimization problem:
Minimize
subject to , .
This prescription keeps the collapse behavior “tame” by demanding that the eigenvalues are not arbitrarily large, but only as large as they need to be (in their total sum) to satisfy our principle for all pairs of Q-shapes. Note that the total time scale of the collapse is not determined by and its eigenvalues, which do not have any physical units. Instead, it is determined by the noise term of (13), i.e. the parameter in (14). This will remain a parameter of the collapse model that needs to be determined experimentally. The above considerations tell us only the relative speed at which superpositions between distinct Q-shapes are suppressed, whereas the total speed would depend on and hence on further considerations as to which states of consciousness are implausible to remain in superposition for significant amounts of time because of, say, human experience.
As we show in appendix C, this optimization problem has twelve solutions: one of them is
and the other solutions are permutations of this one (), such as — indeed, all permutations of these four numbers such that . This degeneracy can be understood as a consequence of the symmetry of the problem: for example, the table of pairwise distances does not change if we exchange and . Indeed, these solutions do not only minimize the sum of the , but they also minimize the expression
i.e. the total sum of the pairwise collapse rates, under the assumption (that we can always make) that one of the is zero.
We can simply pick one of the six solutions and use it to define our collapse operator. For the sake of the argument, let us pick the above, but the choice does not matter for the following discussion.
Let us interpret the result by looking at some example collapse rates. We have which is small, and is also small (and, indeed, identical). Superpositions of the two dyad states and can thus remain stable for a relatively long time. On the other hand, is large, and so is . Hence, superpositions between the dyad states and will be killed off more quickly.
However, consider the two dyad states and . Their distance is small, , and our principle demands that the corresponding difference of eigenvalues (i.e. the associated collapse rate) is at least as large as that. However, it is actually , which is much larger than required. Thus, any superposition of these two dyad states would fall off much faster than what would be expected by considering the difference between their Q-shapes alone.
We can understand this behavior by noting that the dyad states lead to distance values (the table above), from which eigenvalues of the collapse operator have to be determined. Thus, every value of must depend on more than just the number . If our principle is satisfied, then a large value of the latter implies a large value of the former, but the converse is not in general true. The quantum limitation of only having eigenvalues introduces additional constraints.
It seems that this must be a general phenomenon: if we have distinct Q-shapes, but collapse operators, then the eigenvalues are smaller in number than the distance values. We must hence have pairs of Q-shapes whose superposition must collapse more quickly than what their mere qualitative distance as states of consciousness would suggest. Superposition resistance hence cannot only resemble the structure of conscious experience, but is also additionally constrained by the general structure of quantum mechanics.
We can now see how the elements of the construction above are realized in greater generality in [17]. Here, there is not a single Q-shape operator whose eigenstates are all the classical Q-shapes. Rather, a Q-shape is associated with an ensemble of orthogonal self-adjoint collapse operators. The eigenvalue of each operator does not pick out a Q-shape, but an element of a Q-shape, which is either an entry in a probability matrix or a -value. This solves the above problem and allows for superposition resistance to resemble the structure of conscious experience more closely. However it does so at the cost of having a very complex model.
To illustrate this complexity, consider how many collapse operators we need to capture all the details of a Q-shape of a classical system. A system of n elements will have subsystems. So if we have two elements, as with our dyad, we have three subsystems (A, B, and AB). We were able to ignore AB in our simple case but we cannot do that in general. Every element has d possible states giving a total of possible states for the system. Each subsystem is associated with two probability distributions, as we saw in the previous section, and one -value. To capture all of this, the number of collapse operators we need is . So for our classical dyad, where , we need 27 collapse operators. It would be extraordinary if Nature were to operate at such a high level of complexity for such a simple system.
But this still is not sufficient when dealing with quantum systems, since qubit elements do not have possible states, but have infinitely many possible pure states. It is for this reason that the model in [17] formulates everything in terms of the QIIT found in ([53],[27]). Here each subsystem is associated with two appropriate density matrices instead of two appropriate classical probability distributions. The density matrices for the quantum dyad have more entries than the classical probability distributions associated with the classical dyad, so we need more collapse operators. In particular, we now need collapse operators in general, and so 99 collapse operators for our dyad.
The use of QIIT raises a further complication. In QIIT, every quantum system is assigned a well-defined Q-shape (which in many cases may be the null Q-shape) whether or not the system is in a superposition of classical Q-shapes. That is, for QIIT, distinct states of consciousness do not always correspond to mutually orthogonal quantum states, and this makes it in general impossible to have physical processes whose observable behavior depends on all the properties of those states of consciousness (because non-orthogonal states cannot be perfectly distinguished). In particular, this excludes collapse models where the rate of collapse is proportional to the “size” of the superposition, e.g. to the qualitative difference of the superposed conscious states. The ensemble of collapse operators defined in equation (2) of [17] therefore adds an additional constraint that restricts these operators to just those states that are associated with classical Q-shapes.
Thus, in an attempt to be completely general, the collapse model in [17] became very complex. But as has been demonstrated here, if one just wants a collapse model for some simple system whose physical properties are known, then much of those complexities can be bypassed, as we can instead define a single collapse operator as we have done here.
Finally, we note that specific predictions of one’s collapse model may vary with the use of IIT formalism, which is constantly being updated. The choice of distance measure in particular has undergone significant revision.
IIT2.0 used the Kullback-Leibler divergence to measure the distance between probability distributions. But that was rejected in part because it is not symmetric.
IIT3.0 adopted Earth Mover’s distance (EMD). This is symmetric and, as shown in [40], yields different results than the IIT2.0 measure, even for the dyad. The EMD can easily be generalized to become a measure of distance between Q-shapes, as in equation 17. This Q-shape distance measure was essential to IIT3.0 because it was used to calculate (by measuring the distance between the Q-shape of a system’s state and the Q-shape of that state partitioned). But the EMD has recently been abandoned, in part because of problems raised in [7]. Alternative measures can be found in [45].
IIT4.0 adopts the intrinsic difference measure, described in IIT4.0 and in section 3 above, as its preferred distance measure. The intrinsic difference is infinite if the denominator is zero. This is avoided when the denominator involves some partition and therefore white noise. But measuring the distance between two Q-shapes doesn’t involve any partitioning. So, the intrinsic difference is not suitable for measuring Q-shape distance. Consequently, IIT4.0 does not supply such a distance measure. This led to a simpler calculation of in IIT4.0: the system is a sum of subsystem -values. QIIT in its most recent version is consistent with IIT4.0 B.
6 Physically implementing the dyad
Consider the following simple implementation of the dyad as depicted in Figure 3: channels A and B are optical cables and the dyad does nothing more than cross those cables, without contact. The outputs are then fed back into the inputs, creating a kind of feedback cycle. We have two photons in the cables, and each of them can carry one of two perfectly distinguishable “classical” states, corresponding to horizontal or vertical polarization . What horizontal or vertical means is determined by an external reference frame; for what follows, the exact choice of reference is unimportant, except that the physical situation must tell us what we mean by both photons carrying identical or orthogonal polarization directions (e.g. in the first case, and in the second). It is clear that we are not restricted to preparing the photons in the classical basis states, but we can prepare them in arbitrary superpositions, such as that of Equation 3. This is a necessary condition to implement “Schrödinger’s dyad” as introduced in the previous sections.
However, if we identify our basic units as the photons that traverse the cables, then it may not seem like IIT applies here, since IIT requires causal relationships. In particular, to be a basic unit of IIT, something should have the power to “take a difference” (be affected by something) and “make a difference” (produce effects on something) [4]. The concern with this implementation is that the photons do not take or make a difference, because they never change their polarization states. Indeed, under this interpretation of the physical setup, we would not even have implemented the dyad, but another system (two bits and an identity gate).
On the other hand, this depends on counting the photons as our basic causal units. If we instead identify our basic units as the polarization qubits at the physical locations and in space, we get a different result. In particular, we may say that the photon polarization state at At0 causes the state at Bt+1 and was caused by the state at Bt-1. Under this way of identifying our basic units, the system has non-zero and is the simplest conscious system according to IIT. This may even fit well with interpretations of modern physics in which the basic causal objects are spacetime points [43].
IIT does not want the or Q-shape of a system to depend on some arbitrary choice: these are meant to be objective properties. So at least one of the above two causal interpretations of the system must be ruled out. IIT does not give clear criteria for what to do in such a situation. However, one option is suggested by the IIT4.0 principle of maximal existence, which states that “what exists is what exists the most”. This ontological principle is used to motivate the exclusion principle, which effectively states that if two overlapping sets of units have non-zero , then only the system with maximal is conscious. Thus, if there are multiple interpretations of what the causal units are in the first place, we might similarly only consider the interpretation that yields the greater . In that case, we have found a very simple implementation of the dyad by identifying the qubits with locations in space.
7 Conclusion
In this paper we have described some simple predictions of IIT that have enabled us to make a number of observations.
First, we showed that either IIT is false, or a simple system like the feedback dyad is conscious and can easily be put into a quantum superposition of conscious states (i.e. a superposition of Q-shapes). This result was shown to be robust across successive IIT formalisms.
Second, we identified the simplest consciousness-collapse (or Q-shape-collapse) model. It involves a single Q-shape collapse operator, whose eigenstates are the four possible states of the dyad. For the model to do what is needed (make the collapse rate proportional to a measure of difference between the superposed Q-shapes), we found that the four eigenvalues must depend on six distance values.
In such models, the rate of collapse of a superposition of two states of consciousness must therefore depend on more than the relation between the two states. More complex models may avoid this by defining an ensemble of orthogonal Q-shape collapse operators. However, this can get very complicated, so for practical purposes (like testing Q-shape-collapse models), the prescription that we have provided here may be more useful.
Finally, we have made several observations about the general structure of IIT. For example, we argued that while treating gates as having states is permissible if gates are neurons, this does not work in general, and especially not for computers, where gates operate on systems that possess states. This is especially clear for quantum computers, where qubits, and not gates, are superposed. We have also noted that to apply IIT to a physical system, we need a specification of the basic causal units in the system. Insofar as physics does not specify such things, IIT is not fully applicable to physical systems.
In further research, it would be interesting to investigate whether any existing quantum computers (or other quantum systems) can maintain states like the state of our quantum dyad, and for how long. Such systems may place bounds on the fundamental parameters of IIT-based consciousness-collapse models.
Acknowledgments
We are grateful to Thomas D. Galley for discussions, and to Larissa Albantakis for helpful feedback on an earlier draft. This research was supported by grant number FQXi-RFP-CPW-2015 from the Foundational Questions Institute and Fetzer Franklin Fund, a donor advised fund of Silicon Valley Community Foundation. Moreover, this research was supported in part by Perimeter Institute for Theoretical Physics. Research at Perimeter Institute is supported by the Government of Canada through the Department of Innovation, Science, and Economic Development, and by the Province of Ontario through the Ministry of Colleges and Universities.
References
- [1] S. Aaronson “Why I Am Not An Integrated Information Theorist (or, the Unconscious Expander)”, 2014 URL: http://www.scottaaronson.com/blog/?p=1799
- [2] Mohsen Afrasiabi et al. “More than just front or back: Parietal-striatal-thalamic circuits predict consciousness level” In bioRxiv, 2020 DOI: 10.1101/2020.04.07.030429
- [3] Larissa Albantakis, Robert Prentner and Ian Durham “Computing the Integrated Information of a Quantum Mechanism” In Entropy 25.3 MDPI, 2023, pp. 449
- [4] Larissa Albantakis et al. “Integrated information theory (IIT) 4.0: Formulating the properties of phenomenal existence in physical terms” In arXiv preprint arXiv:2212.14787, 2022
- [5] D.. Albert “Quantum Mechanics and Experience” Cambridge University Press, 1992
- [6] Leonardo S. Barbosa et al. “A measure for intrinsic information” In Scientific Reports 10, 2020 DOI: 10.1038/s41598-020-75943-4
- [7] A Barrett and Pedro AM Mediano “The Phi measure of integrated information is not well-defined for general physical systems” In Journal of Consciousness 26, 2019, pp. 11–20
- [8] Jeffrey A Barrett “The quantum mechanics of minds and worlds” OUP Oxford, 1999
- [9] Gary Bartlett “Does integrated information theory make testable predictions about the role of silent neurons in consciousness?” In Neuroscience of Consciousness 2022.1 Oxford University Press UK, 2022, pp. niac015
- [10] A. Bassi “Gravitational decoherence” In Class. Quantum Grav. 34, 2017
- [11] A. Bassi et al. “Models of wave-function collapse, underlying theories, and experimental tests” In Rev. Mod. Phys. 85, 2013, pp. 471–527
- [12] T. Bayne “On the axiomatic foundations of the integrated information theory of consciousness” In Neuroscience of Consciousness 2018.1, 2018
- [13] Kok-Wei Bong et al. “A strong no-go theorem on the Wigner’s friend paradox” In Nature Physics 16.12 Nature Publishing Group UK London, 2020, pp. 1199–1205
- [14] Časlav Brukner “A no-go theorem for observer-independent facts” In Entropy 20.5 MDPI, 2018, pp. 350
- [15] S. Casarotto et al. “Stratification of unresponsive patients by an independently validated index of brain complexity” In Annals of Neurology 80.5, 2016, pp. 718–729
- [16] D.J. Chalmers “The Conscious Mind” Oxford University Press., 1996
- [17] D.J. Chalmers and K.J. McQueen “Consciousness and the collapse of the wave function” In Consciousness and Quantum Mechanics Oxford University Press, 2022 URL: http://arxiv.org/abs/2105.02314
- [18] A. Doerig, A. Schurger, K. Hess and M.H. Herzog “The unfolding argument: Why IIT and other causal structure theories cannot explain consciousness” In Consciousness and Cognition 72, 2019, pp. 49–59
- [19] Ian T Durham “Observer-independence in the presence of a horizon” arXiv, 2019 DOI: 10.48550/arXiv.1902.09028
- [20] H. Everett ““Relative State” Formulation of Quantum Mechanics” In Rev. Mod. Phys. 29, 1957, pp. 454–462
- [21] Daniela Frauchiger and Renato Renner “Quantum theory cannot consistently describe the use of itself” In Nature communications 9.1 Nature Publishing Group UK London, 2018, pp. 3711
- [22] Andrew M. Haun et al. “Conscious Perception as Integrated Information Patterns in Human Electrocorticography” In eNeuro 4.5, 2017 DOI: 10.1523/ENEURO.0085-17.2017
- [23] Amber R Hopkins and Kelvin J McQueen “Filled/non-filled pairs: an empirical challenge to the integrated information theory of consciousness” In Consciousness and Cognition 97 Elsevier, 2022, pp. 103245
- [24] Stuart Kauffman “Metabolic stability and epigenesis in randomly constructed genetic nets” In Journal of Theoretical Biology 22, 1969, pp. 437–467
- [25] Stuart Kauffman “Origins of Order: Self Organization in Evolution” Oxford: Oxford University Press, 1990
- [26] Adrian Kent “Collapse and Measures of Consciousness” In Foundations of Physics 51.3 Springer, 2021, pp. 62
- [27] J. Kleiner and S. Tull “The Mathematical Structure of Integrated Information Theory”, 2020 arXiv:2002.07655 [q-bio.NC]
- [28] K. Kremnizer and A. Ranchin “Integrated Information-Induced Quantum Collapse” In Found Phys 45, 2015, pp. 889–899
- [29] Angus Leung and Naotsugu Tsuchiya “Separating weak integrated information theory (IIT) into IIT-inspired and aspirational-IIT approaches” PsyArXiv, 2023 DOI: 10.31234/osf.io/kxywt
- [30] Angus Leung, Dror Cohen, Bruno van Swinderen and Naotsugu Tsuchiya “Integrated information structure collapses with anesthetic loss of conscious arousal in Drosophila melanogaster” In bioRxiv, 2020 DOI: 10.1101/2020.05.17.090001
- [31] David Lewis “How many lives has Schrödinger’s cat?” In Australasian Journal of Philosophy 82.1 Taylor & Francis, 2004, pp. 3–22
- [32] Peter J Lewis “What is it like to be Schrödinger’s cat?” In Analysis JSTOR, 2000, pp. 22–29
- [33] Michael Lockwood “’Many Minds’. Interpretations of Quantum Mechanics” In The British journal for the philosophy of science 47.2 JSTOR, 1996, pp. 159–188
- [34] M Massimini et al. “Breakdown of cortical effective connectivity during sleep” In Science (New York) 309.5744, 2005, pp. 2228–2232
- [35] K.J. McQueen “Illusionist Integrated Information Theory” In Journal of Consciousness Studies 26, 2019, pp. 141–169
- [36] K.J. McQueen “Interpretation-Neutral Integrated Information Theory” In Journal of Consciousness Studies 26, 2019, pp. 76–106
- [37] K.J. McQueen and Naotsugu Tsuchiya “When do parts form wholes? Integrated information as the restriction on mereological composition” In Neuroscience of Consciousness 2023.1 Oxford University Press UK, 2023, pp. niad013
- [38] Lucia Melloni et al. “Adversarial Collaboration to test GNW and IIT.” OSF, 2023 URL: osf.io/mbcfy
- [39] Thomas Nagel “What is it like to be a bat?” In The Language and Thought Series Harvard University Press, 1980, pp. 159–168
- [40] Masafumi Oizumi, L. Albantakis and Giulio Tononi “From the Phenomenology to the Mechanisms of Consciousness: Integrated Information Theory 3.0” In PLOS Computational Biology 10, 2014, pp. 1–25
- [41] Adam Pautz “What Is the Integrated Information Theory of Consciousness?” In Journal of Consciousness Studies 26.1-2, 2019, pp. 1–2
- [42] Ofir Pele and Michael Werman “Fast and robust earth mover’s distances” In 2009 IEEE 12th international conference on computer vision, 2009, pp. 460–467 IEEE
- [43] Oliver Pooley “Substantivalist and Relationalist Approaches to Spacetime” In The Oxford Handbook of Philosophy of Physics Oxford University Press, 2013
- [44] Euan J Squires “One mind or many—A note on the everett interpretation of quantum theory” In Synthese 89 Springer, 1991, pp. 283–286
- [45] M. Tegmark “Improved measures of integrated information” In PLoS Computational Biology 12.11, 2016, pp. 195–199
- [46] Giulio Tononi “An information integration theory of consciousness” In BMC Neurosci 5.42, 2004
- [47] Giulio Tononi “Consciousness as integrated information: a provisional manifesto.” In Biol Bull 215.3, 2008, pp. 216–242
- [48] Giulio Tononi, M. Boly, M. Massimini and C. Koch “Integrated information theory: from consciousness to its physical substrate.” In Nat Rev Neurosci 17, 2016, pp. 450–461
- [49] Naotsugu Tsuchiya ““What is it like to be a bat?”—a pathway to the answer from the integrated information theory” In Philosophy Compass 12.3, 2017, pp. e12407
- [50] Lev Vaidman “On schizophrenic experiences of the neutron or why we should believe in the many-worlds interpretation of quantum theory” In International studies in the Philosophy of science 12.3 Taylor & Francis, 1998, pp. 245–261
- [51] E.. Wigner “Remarks on the mind-body question” In The Scientist Speculates Heineman, 1961
- [52] Howard M Wiseman, Eric G Cavalcanti and Eleanor G Rieffel “A” thoughtful” Local Friendliness no-go theorem: a prospective experiment with new assumptions to suit” In arXiv preprint arXiv:2209.08491, 2022
- [53] P. Zanardi, M. Tomka and L.C. Venuti “Towards Quantum Integrated Information Theory”, 2018 arXiv:1806.01421 [quant-ph]
Appendix A The general IIT4.0 formalism
Our simple dyad allowed us to steer clear of many complications that arise when calculating and Q-shape for more complex systems. Here we explain IIT4.0 more generally, and identify which steps we avoided in section 3. We also identify a subtle inconsistency with respect to an axiom of IIT4.0.
Standard presentations of IIT do not begin with its formalism. Instead, they begin with a small set of axioms, from which a small set of postulates are derived. The formalism is then built from these postulates.
In IIT4.0, there are six axioms, which state that conscious experience exists and is intrinsic, specific, unitary, definite, and structured. Drawing upon “introspection and reason” these axioms are deemed to be “axiomatic or essential properties, properties that are immediate and irrefutably true of every conceivable experience.” [4] However, there is a problem with this last axiom, known as the axiom of composition.
The axiom of composition states that “experience is structured: it is composed of distinctions and the relations that bind them together, yielding a phenomenal structure [i.e. yielding a Q-shape]”. However, the dyad is actually a counterexample to this axiom since the dyad has no “relations” between its distinctions. That is, it yields a phenomenal structure without relations.
IIT4.0 calculates the integrated information of the relations between the distinctions, . That is, relations exist when distinct parts maximize their over the same or overlapping parts. However, our dyad avoids this complication because A has about B only and B has about A only, so there is no overlap in what these parts contain integrated information about. As such, for the dyad. This is why we ignored the relation/distinction contrast in the main text. The dyad states are conscious, because they have nonzero but are without structure, as they have no relations.
Note that if our dyad did have relations, then terms would have to enter the sum in equation 10 for the of the system. They would also need to be included in the dyad’s Q-shape. At any rate, IIT4.0 either needs to weaken their composition axiom so that relations are not necessary or strengthen their postulates to render the relationless dyad unconscious.
This points to a further complication that our dyad avoided, which arises when a part has multiple connections to other parts so that multiple partitions become available e.g. if A is connected to B and C and we are considering the integrated information A has about BC. In such cases we take the minimum information partition into account i.e. the partition that makes the least difference to the information that A has about BC. But this is trivial in the dyad since there is only a single partition.
The remaining complications that our dyad avoided concern the beginning stages of the calculation. The first step in the IIT4.0 analysis of any system that is presently in some state, is to identify which future state is most probable given the present state, and which past state is most probable given the present state. In general, this is determined by calculating something called the intrinsic information of a system (see equation 12 in [4]). But for our system in Figure 3, this is trivial: if the present state is (1,0), for example, then both the most probable future state and the most probable past state are (0,1) since our dyad is a completely deterministic SWAP gate.
Next we divide the system up into subsystems or parts such that there are two or more parts and each part (1) contains at least one unit or element; (2) does not overlap any other part; (3) causally interacts with other parts of the system (see equation 13 in [4]). For the dyad, this is once again trivial, as there are clearly only two causally interacting parts: A and B.
We then calculate a quantity called the “small phi” of the system as a whole (), to ensure it is nonzero. The purpose of this is to ensure that our system really is an integrated whole, and not just two (or more) separate systems that are simply causally interacting with themselves, (see equations 20-22 in [4]). Again, it is clear that A and B are causally interacting, so we skip this step.
Finally, we might have a case where we have many overlapping systems that have nonzero . IIT’s exclusion postulate tells us that the overlapping system that maximizes wins, and is the truly conscious system. Once again, this can be skipped, since our system is closed and does not overlap with any other systems. Now our calculation proceeds as in section 3.
Appendix B The quantum feedback dyad in QIIT
There have been several attempts to extend IIT to the quantum realm. Kleiner and Tull [27], in their efforts to broadly classify theories of consciousness in a mathematically rigorous way, provided an abstract framework within which any quantum extension of IIT should fit.
Zanardi, Tomka, and Venuti developed a quantum theory that was analogous to classical IIT but that was not unique, did not converge to the classical formalism in the case of essentially purely classical inputs, and did not strictly adhere to the axioms of classical IIT [53]. Most recently Albantakis, Prentner, and Durham [3] developed a direct quantum translation of IIT4.0 that is faithful to its axioms and that reduces to the classical formalism, as described here in Section 3, in the case of purely classical inputs.
Once again consider a feedback SWAP function as described in Figure 3. Whereas channels A and B in the classical case could only be represented as classical bits (thus limiting their values to be exclusively on the Boolean domain (i.e. ), we will now allow these channels to be represented as qubits which extends the range of possible values that they may take.
Specifically, consider a discrete quantum system described by some density operator where the are pure states, each prepared with a probability . As such, can be either a single pure state or an ensemble of pure states, i.e. a mixed state. We consider unitary time evolution in which the state evolves according to
(18) |
where is a unitary operator defined by which ensures that is reversible.
In quantum IIT, rather than comparing probability distributions we are comparing density operators. Given two density operators and , the quantum relative entropy between them is defined as
(19) | ||||
where . The quantum intrinsic difference (QID) is thus defined as
(20) |
In the case in which , equation 20 reduces to equation 4. Similarly, if is pure, as in the classical case for fully selective distributions. Otherwise is subadditive as required [6].
The procedure for calculating the integrated cause and effect information in the quantum case proceeds in much the same manner as in the classical case. In fact, the case in which the input state is corresponds exactly to the classical case with input which was analyzed in section 3. Hence, restricting ourselves to the so-called computational basis, , the formalism developed in [3] exactly reproduces the the classical results for the same circuit and is essentially analyzed in precisely the same manner.
To initiate a quantum system we can represent the state of the dyad at as the superposition state where . We then partition the system as usual and calculate the integrated effect information by evaluating the QID over the eigenstate for a particular partition such that
(21) |
where is now the partitioned effect repertoire.
We can then begin by identifying three subsystems as before, A, B, and AB (we will initially consider AB but will show that, as in the classical case, it is irrelevant). At the density matrices for these subsystems are , , and respectively. As described above, in the case of purely classical inputs, the QID given by equation 20 simply reduces to the ID given by equation 4.
As with classical IIT, the first step is to identify which future and past states are the most probable given the present state. As in the classical case, this is trivial since all the dyad does is swap the states. That is, if the present state is , then both the past and future states are .
We now consider each of the subsystems separately as before. As in the classical case, A’s present state contains no integrated effect information about A’s future state (and likewise for B). The same is also true for the future state of the whole system AB since, as before, there is a possible partition (A to A) that does nothing. That leaves just a single partition to consider and, since the current state of A is purely classical, the calculation is identical to equation 4. In other words, as before, .
Calculating the effect information of B might seem as if it would be different since the input is not classical, but due to the simplicity of the SWAP operation, the calculation is nearly identical to the classical case. The partition, as before, simply replaces the effect that B’s current state has on A’s future state with noise, but the noise is now treated differently. In the quantum case, noise is given by the maximally mixed state
(22) |
In order to fully work through a calculation of equation 21 it will help to calculate the individual pieces separately. Since we’re calculating the effect, the in equation 21 represents the future state of A while the represents the present state of B. Recall that the definition of a density operator defines the in terms of a spectral decomposition. By this definition, then, for the state. This means that the term in equation 21 is zero. Since and from equation 22, the second term in equation 21 becomes
(23) | ||||
where we note that . Equation 21 then reads in full
(24) | ||||
The calculation for is identical except the are for the present state and the are for the past state. For the case in which A’s present state is we thus find that and, likewise for the case in which B’s present state is , we find that .
Once again, the integrated information of each subsystem is then just the minimum of each of these values for each subsystem which gives
(25) |
As before the amount of consciousness in the entire system is simply the sum of these values:
(26) |
Appendix C Solution of the optimization problem of section 5
First note that replacing by , where , will also satisfy the constraints, but lead to unless . Thus, to be a minimizer, at least one of the must be zero.
Consider, for example, the case that . Then , and so . Similarly, , and so either (which is impossible) or . Finally, , and so either or . The “either” case gives us a possible minimizer , where . Consider this sum for the “or” case. It is
For this to define a minimizer, we need , which means that . But , and so . In summary, we must have equality, , and then , and so . This yields the other minimizer, .
All other minimizers can be determined similarly, but considering all the other three cases of zero .