This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

Context-dependent communication under environmental constraints

Krzysztof Główka
Human Interactivity and Language Lab
Faculty of Psychology
University of Warsaw
Poland
[email protected] &Julian Zubek
Human Interactivity and Language Lab
Faculty of Psychology
University of Warsaw
Poland
[email protected] &Joanna Rączaszek-Leonardi
Human Interactivity and Language Lab
Faculty of Psychology
University of Warsaw
Poland
[email protected]
The corresponding author
Abstract

There is significant evidence that real-world communication cannot be reduced to sending signals with context-independent meaning. In this work, based on a variant of the classical Lewis, (1969) signalling model, we explore the conditions for the emergence of context-dependent communication in an agent-based situated model. In particular, we demonstrate that pressure to minimise the vocabulary size is sufficient for such emergence. At the same time, we study the environmental conditions and cognitive capabilities that enable contextual disambiguation of symbol meanings. We show that a) regularities in the context are not necessary for context-dependent communication and that b) environmental constraints on the receiver’s referent choice can be unilaterally exploited by the sender, without disambiguation capabilities on the receiver’s end. Consistent with common assumptions, the sender’s awareness of the context appears to be required for contextual communication. We suggest that context-dependent communication is a situated multilayered phenomenon, crucially influenced by environment properties such as distribution of objects in the context. The computational model developed in this work is a demonstration of how signals may be ambiguous out of context, but still allow for near-perfect communication accuracy.

Keywords referential game, language emergence, pragmatics, context, context-dependent communication, ambiguity, agent-based modelling

1 Introduction

In language sciences and philosophy, there is a marked tension between the perceived invariance of the meaning of natural language and its inevitable interaction with the actual context of usage. In this vein, it is common to distinguish between semantics, concerned with the context-independent meaning, and pragmatics, investigating the contribution of the context (for an overview, see Korta and Perry, (2020)). In much of linguistic work, the focus is on semantics, while pragmatics is peripheral (Steffensen and Harvey,, 2018).

Although such a relatively narrow framing of natural language may have been useful for a number of research goals, there is ample evidence that, in real-world communication, context-independent meaning is an exception rather than a rule. Human interlocutors seem to readily produce messages that are ambiguous (Wasow et al.,, 2005), relying on the context to complete their meaning (Barwise and Perry,, 1983). The mode and extent of this dependence vary widely: the context guides our understanding of homonyms such as bat (are we playing baseball or watching the night sky?); it allows us to interpret approximate quantities such as high price (how much do I usually spend on this?); and deictic expressions like this are perhaps the most vivid examples, as they cannot function without a situational or linguistic context. Yet humans understand language in context so effortlessly that potential misunderstandings are often only noticed by the linguist (Ferreira,, 2008).

In this work, we focus on two questions. First, what pressure is sufficient for the emergence of context-dependent communication (why it may be preferred over unambiguous communication)? Second, what environmental conditions and cognitive capabilities enable such communication? To address these issues, we turn to agent-based models as tools to examine various factors shaping language evolution in simplified and idealised simulations (Cangelosi,, 2001; Steels,, 2011; Cangelosi and Parisi,, 2012). We adopt a variant of the Lewis, (1969) signalling game – a cooperative scenario in which the sender’s task is to communicate the referent that the receiver must identify in a limited set. Within this model, we introduce the communication context as a set of possible referents, selected at random in each communication episode.

Even in the simple scenario of the signalling game, context may play a crucial role in at least two aspects: (a) it offers additional information that can be exploited by the sender and the receiver, and (b) context directly constrains the receiver’s action choice, which may impact the agents’ communicative strategy regardless of the receiver’s information-processing capabilities. We can illustrate this distinction by building upon an example given by Searle, (1980): when someone hears the utterance “we need to cut it” while standing on a lawn, they are likely to think about a mower. The same sentence uttered during a birthday party can be interpreted as a request to fetch a knife to cut a cake. However, when the sender knows that the right cutting tool, a mower or a knife, is already present at the receiver’s hand, they can be sure that the receiver will understand the utterance correctly even without knowing the context, as the possible course of action is already narrowed down.

While previous research on context-dependent communication has usually assumed agents’ nontrivial internal information-processing capabilities, we focus on the enabling constraints external to the agents and carry out experiments with the receiver’s minimal skills. We observe that the following conditions are sufficient for the development of context-dependent communication in this setting: (a) information on the receiver’s context is provided to the sender and (b) pressure is placed on the agents to minimise their vocabulary. Under these conditions, agents develop a language that is partly ambiguous and rely on the context for disambiguation rather than invent a fully precise language.

2 Why does linguistic ambiguity arise?

Linguists’ focus on context-independent language is reflected and arguably sustained by the commonly adopted metaphors of communication as a transmission problem and the resulting treatment of language as a code (Reddy,, 1979). The accompanying pervasive assumption of stable correspondence between language utterances and their meaning has roots in the long history of focus on language generativity (Katz and Fodor,, 1963). More pragmatically and ecologically-oriented works argue for a different picture, where natural language utterances are commonly ambiguous out of context – but they are successfully understood by humans based on the shared situational context (Sperber and Wilson,, 1986). As Barwise and Perry, (1983) indicated, while it is important that we can express an infinite number of propositions with a finite number of elements, due to the possibility of combination and recursion, it is equally amazing that we can express an infinite number of meanings with one expression due to its dependence on context (bringing to mind examples of entirely context-dependent expressions such as this).

Arguably, the conventional meaning of expressions can be extended to novel contexts due to their ambiguity (Brochhagen,, 2015). Recent works propose that expression meaning is negotiated in interaction (Chater and Christiansen,, 2022) and that language, rather than transmitting clear and bounded meanings, has the role of steering or controlling interactive events (Rączaszek-Leonardi,, 2012). These directions in the development of the theory of language, from the transmission model to viewing language as a prevalently ambiguous out of context, creative interaction control, situated in a particular environment, make understanding of the nature of context and its contribution to linguistic meaning and structure an urgent task 111Paradoxically, from such a standpoint it seems more challenging to account for the partial meaning context-independence than to acknowledge the effects of situational context; for example, how it is possible that we have a sense that utterances refer to something seemingly unrelated to the current circumstances, possibly distant in time and space (Rączaszek-Leonardi et al.,, 2018).. In this work, we investigate the role of the environment and, consequently, focus on the extra-linguistic context, understood as the current or past composition of the environment.

While the perspective adopted in this study foregrounds the positive and neutral aspects of linguistic ambiguity, we note that previous studies often associated ambiguity with miscommunication. As we see no compelling argument to make a clear distinction between these two cases, we consistently use the term ambiguity to refer to all cases of meaning uncertainty.

Piantadosi et al., (2012) propose that ambiguity arises under pressure for communicative efficiency. They consider the scenario of a sender and receiver in a shared context. If the context is at least slightly informative about the possible meanings of the sender’s message, the sender does not need to convey full information for the receiver to disambiguate the message, that is, infer its meaning from the context. Consequently, if symbol production is costly and the receiver’s inference is relatively cheap, it is suboptimal on the sender’s part to ensure that their utterance can only be interpreted in exactly one way. A shorter, partially ambiguous utterance will be preferred and will still be informative enough for the receiver. Additionally, if there are differences in the production cost of certain symbols, the sender may choose more economic, but more ambiguous ones.

While the bottleneck indicated by Piantadosi and colleagues concerned language production, Kirby et al., (2015) argue for an informational bottleneck in cultural transmission. They demonstrate that natural limitations in the cognitive abilities of language learners impose pressure for simplicity on a language system. Thus, despite the pressure for informativeness 222We follow the use of terms in Winters et al., (2018); the original terms were compressibility and expressivity (Kirby et al.,, 2015)., arising from the use of language, languages will never become perfectly informative about the intended meaning. Rather, they will gravitate towards a point of optimal trade-off between the competing pressures, where a degree of context dependence is to be expected. Under such a trade-off, language ambiguities can make communication less effective but still functional in general (Santana,, 2014; O’Connor,, 2015).

Intuitively, if language meaning relies on the context, then the composition of contexts, in which the interlocutors find themselves, should also play a role for ambiguity. Imagine a simple interaction between a shop assistant and a customer asking for a specific product visible on the display. Here, successful communication requires discrimination of the intended referent from other possible referents (e.g. Olson, (1970)). Winters et al., (2018) demonstrated that in this kind of situated setting, more predictable contexts allow for minimising information transmission through linguistic means. They manipulated the sender’s access to context and contextual object distribution in artificial language experiments, where pairs of participants were tasked to communicate about the target referent. When the possible referents in the context were random, the emergent signalling system was highly context-independent. However, when objects within each context were always of the same colour and the sender could see the receiver’s context, the emergent languages became ambiguous, not encoding colour information, and thereby relying on the stable object distribution in the context.

In summary, several factors seem to crucially contribute to the emergence of ambiguous language. One of them is the situational context shared between interlocutors, which is a source of additional information independent of what is conveyed by language. The second is an additional pressure that makes the language not only informative but also reasonably simple. The last is the regularity of the environment, which determines the relevant distinctions to be made by language.

3 Context in signalling games

The models relevant for this work follow the general schema of the influential Lewis, (1969) signalling game, where the sender and receiver learn to communicate about objects in their environment through rewards for communication success. In the original formulation of the game, as well as in its many extensions, there is no shared context between the agents. Under such circumstances, the only protocols that guarantee communication success are those that unequivocally map objects to signals, and they predictably emerge in simulations of evolutionary dynamics with realistic perturbations (Skyrms,, 2010; Santana,, 2014).

In some more recent models of visually grounded language emergence, neural agents faced an image discrimination task with shared context that directly constrained the referent choice (e.g. Lazaridou et al., (2017, 2016, 2018); Havrylov and Titov, (2017); Bouchacourt and Baroni, (2018)). The size of the context ranged from just two objects (e.g. Lazaridou et al., (2016)) to 128128 (Havrylov and Titov,, 2017). In these works, due to their engineering motivation, the only optimisation goal for agent policies was communication success. The results of Guo et al., (2021) suggest that the constrained choice of referents can itself lead to the emergence of context-dependent languages. Nevertheless, context dependence was not controlled for nor reported in these works.

Some works in evolutionary game theory investigated the origins of language ambiguity in signalling games where additional costs were included in the reward function. This included an additional cost imposed on the sender for using a larger vocabulary (Santana,, 2014; Mühlenbernd,, 2021) and / or the cost related to disambiguation imposed on the receiver (Mühlenbernd,, 2021). The communicating agents were provided with shared contextual information. The results show that partial ambiguity is evolutionarily the most likely strategy when the sender’s signalling cost is higher or comparable to the receiver’s disambiguation cost. In a similar vein, Brochhagen, (2020) showed that the context can be exploited by the sender to use its preferred but more ambiguous forms. In these game-theoretic studies, the receiver had the ability to infer the intended meaning of the sender, in congruence with Piantadosi et al., (2012). However, the direct impact of the environment on context-dependent communication was not accounted for because of the unlimited referent choice.

While the receiver’s inference is undoubtedly important for context-dependent communication (Frank and Goodman,, 2012), direct environmental constraint on the choice of possible referents substantially modifies the task (Winters et al.,, 2018; Guo et al.,, 2021). In a situated communicative setting with a constrained referent choice, the speaker’s signal needs to be only as precise as to discriminate the referent from other present objects, and the receiver’s task is arguably less challenging, as the agent’s inference is partly “offloaded” to the environment. Therefore, to investigate the role of the environment for context dependence, we use a model in which agents have minimal skills and in which the pressure for vocabulary size and sender’s access to context will be manipulated.

Importantly, we use an entirely random distribution for objects in the context. This is for two reasons. Firstly, it is to be expected that regularities in the encountered contexts can yield more ambiguous but still effective signals (Guo et al.,, 2021) but the possible outcomes for an entirely random object distribution under vocabulary size pressure are less clear. Secondly, we aimed for the most general conditions for the ambiguous communication to emerge and thus made as few assumptions as possible.

4 Contextual Lewis signalling game

Let us define a contextual variant of the probabilistic Lewis signalling model. There are two agents: sender and receiver. The sender perceives one of the alternative states of the world sSs\in S, and emits one of the available signals σΣ\sigma\in\Sigma with probability f(σ|s)f(\sigma|s). The receiver chooses a response rRr\in R, based on the received signal, with probability g(r|σ)g(r|\sigma). Upon receiving a correct response rr for a given state ss both agents are rewarded. Agent policies are represented through functions ff and gg and can be modelled, for example, using neural networks. Through adaptive learning, agents can reach an equilibrium in which signals are used for functional communication. Signals are conventional in the sense that ssσ\sigma and σ\sigmarr mappings are arbitrary but must be consistent. We can say that the signalling protocol established in a Lewis signalling game is unambiguous if there is a unique mapping from signals σ\sigma to states ss. Such a setting and its many varieties have been a kind of general testing ground for investigating various aspects of language emergence.

Let us call a non-contextual variant of this game where the receiver’s output belongs to the same space as the sender’s input (namely, the response belongs to the set of possible world states, rSr\in S), a reconstruction game, following recent usage of the term by Guo et al., (2021). In the reconstruction game, the receiver has to learn to reconstruct the target solely from the signal emitted by the sender, without any contextual cues (e.g. Chaabouni et al., (2020); Kharitonov and Baroni, (2020)).

In contrast, in another popular scenario, which we call discrimination game 333Guo et al., (2021) have called this scenario simply referential game. However, this name has been used across a wide variety of contexts and fields. To avoid confusion and distinguish this setting from reconstruction game more accurately, we choose the term discrimination. the receiver’s task is to identify the target from among the provided set of objects which constitutes the context. We will use the word context in this narrow technical sense. The world state is multi-object, that is, if OO is the set of all possible objects, the world state consists of the context COC\subset O and the target tCt\in C – what is communicated is only a part of the world state.

Within a discrimination game, we can consider a receiver that is not “aware” of the context, but is still constrained by it, since only some objects can be selected. For such a case, we need to define an auxiliary selection function:

ιC(𝐳)i={ezirjCezjif riC0if riC\iota_{C}(\mathbf{z})_{i}=\begin{cases}\dfrac{e^{z_{i}}}{\sum_{r_{j}\in C}e^{z_{j}}}&\text{if }r_{i}\in C\\ 0&\text{if }r_{i}\not\in C\end{cases}

The function ιC\iota_{C} operates on a vector of probabilities 𝐳\mathbf{z} corresponding to different responses rOr\in O. It selects only valid responses that belong to a given context and normalizes the probabilities.

On a spectrum of perceptual availability of the context, we can distinguish the following variants of the sender–receiver pairs playing a discrimination game, from the least to the most contextual:

  1. 1.

    Neither sender nor receiver have access to the context. Sender’s signal is generated with probability f(σ|t)f(\sigma|t), receiver’s response is generated with probability ιC(g(r|σ))\iota_{C}(g(r|\sigma)).

  2. 2.

    Sender has access to the context, but the receiver does not. Sender’s signal is generated with probability f(σ|t,C)f(\sigma|t,C), receiver’s response is generated with probability ιC(g(r|σ))\iota_{C}(g(r|\sigma)).

  3. 3.

    Both sender and receiver have access to the context. Sender’s signal is generated with probability f(σ|t,C)f(\sigma|t,C), receiver’s response is generated with probability g(r|σ,C)g(r|\sigma,C).

Under variant 1) agents cannot use contextual information at all. The optimal solution is to use unambiguous signalling, where every world state is represented by a unique signal. Under variant 3) sender and receiver share context, and may use it as a common ground. This, in principle, opens a possibility for ambiguous, yet efficient, communication protocols, such that the sender emits the signal conditional on the context and the receiver disambiguates the signal in a process of inference within the particular context (Piantadosi et al.,, 2012).

Variant 2) is an interesting middle ground between these two possibilities. Since sender knows which objects are available for the receiver to choose from, it may safely use an ambiguous signal when other probable referents of the signal are not present in the current context. Signalling protocol that is highly ambiguous in the entire object space may still be optimal for the vast majority of contexts. Thus, accurate context-dependent communication may emerge even without inferential capabilities on the receiver’s side. This is the variant that we implement in our simulations.

5 Model architecture

Agents are modelled as multi-layer neural networks. We consider two agent pairs, corresponding to variant 1 and 2 from section 4, including the same type of receiver but differing with respect to the sender’s access to the context beyond the target object. Importantly, the receiver is designed to be unable to infer signal meaning in context.

Target-only (T) sender. A simple feed-forward network of fully-connected layers, which receives only the target as the input and maps it to probabilities of activation of each symbol (Fig. 1). Since this sender cannot access the context, it shares no common information with the receiver, and thus it is not expected to be able to communicate in a context-dependent manner in any setting.

(a) Refer to caption (b)  Refer to caption
Figure 1: The target-only (T) sender (a) and example vector encodings of object properties (b).

Target-in-context (TC) sender . The input of this network is all objects in the context, including the target. It receives input sequentially (Fig. 2). As non-target objects are input to the network in subsequent steps, the target is passed along each of them. Such an input structure enables direct comparisons between the target and each of the objects. This network has an LSTM memory cell, which constitutes the first layer of the network. After the last input is provided, LSTM output is passed through fully-connected layers to produce activation probabilities for each symbol. Since this sender architecture is informed by the receiver’s context, it is expected to be able to communicate in a context-dependent manner.

Refer to caption
Figure 2: Target-in-context (TC) sender

Receiver The receiver is a simple network that generates the decision on the choice of the object within a single forward pass (Fig. 3). It is inspired by and very similar to the receiver architecture used by Lazaridou et al., (2017) in that each object may only be compared with the received signal independently from other objects. Each object is processed by the same sequence of network layers along with the signal emitted by the sender, which produces what may be interpreted as an alignment score between the input object and the received signal. All such scores are combined with a softmax operation into a probability distribution over objects, from which the final selection is sampled. In simple terms, this receiver most often selects the object that best matches the received signal.

Refer to caption
Figure 3: The receiver

5.1 Optimization method

Each iteration in a simulation run consists of nn episodes. The fitness of the agents is based on their accuracy and vocabulary size.

Accuracy of a sender-receiver pair characterised by a parameter vector θ\theta is defined as Acc(θ)=1ni=1nGiAcc(\theta)=\frac{1}{n}\sum_{i=1}^{n}G_{i}, where GiG_{i} is the reward for the ii-th episode. Gi=1G_{i}=1 if and only if the receiver selects the target object in the ii-th episode; otherwise Gi=0G_{i}=0.

Vocabulary size is defined in a continuous way based on the usage of different signals across all episodes in an iteration. If σj\sigma^{j} is the jj-th symbol (or bit) of the signal vector, 1jm1\leq j\leq m, then:

Voc(σ)=1mj=1mH(σj)=1mj=1m(P(σj=0)log2P(σj=0)+P(σj=1)log2P(σj=1))\textit{Voc}(\sigma)=\frac{1}{m}\sum_{j=1}^{m}H(\sigma^{j})\\ =-\frac{1}{m}\sum_{j=1}^{m}(P(\sigma^{j}=0)\log_{2}P(\sigma^{j}=0)\\ +P(\sigma^{j}=1)\log_{2}P(\sigma^{j}=1))

Intuitively, it means that a language with 10 words that uses only 5 of them in most episodes has a smaller vocabulary than one with 10 words that are equally frequent words. The measure is made continuous for the purpose of convenient gradient optimisation.

Fitness The fitness function calculated for the agent parameters θ\theta takes the form:

F(θ)=1ni=1nGipvocVoc(σ)F(\theta)=\frac{1}{n}\sum_{i=1}^{n}G_{i}-p_{voc}\textit{Voc}(\sigma)

where pvocp_{\textit{voc}} is the penalty coefficient (can be equal to zero).

The model parameters θ\theta are iteratively optimised to maximise the fitness of the agents. We chose one of the black-box optimisation methods – the evolution strategies (Salimans et al.,, 2017). We define the model parameters θ\theta as a concatenation of the sender and receiver parameters, the weights and biases of their corresponding networks. At first, the model parameters are initialised randomly. In every iteration, a mini-batch of new parameters θ\theta is generated by sampling noise ϵ\epsilon from the normal distribution and adding it to the parameter vector. We also use mirror sampling (Brockhoff et al.,, 2010), that is, for every sampled perturbation ϵ\epsilon, an inverse perturbation ϵ-\epsilon is generated. The perturbed parameters are then evaluated to approximate the gradient of the fitness function FF:

θ𝔼ϵ𝒩(0,I)F(θ+σϵ)=1σ𝔼ϵ𝒩(0,I)[ϵF(θ+σϵ)]\nabla_{\theta}\mathbb{E}_{\epsilon\sim\mathcal{N}(0,I)}F(\theta+\sigma\epsilon)=\frac{1}{\sigma}\mathbb{E}_{\epsilon\sim\mathcal{N}(0,I)}\left[\epsilon F(\theta+\sigma\epsilon)\right]

The gradient approximated in this way is used for optimisation using gradient ascent.

5.2 Measures of communication context-dependence

To measure how contextual the emerging language is, for each iteration, we calculate several complementary metrics.

Target certainty. This measure is the reverse of the notion of ambiguity, that is, it represents how much each symbol corresponds to only a single target.

At the same time, target certainty is the out-of-context baseline to be compared with actual accuracy. It is the highest accuracy possible in a counterfactual situation such that the sender emits the same signals but the receiver’s choice is neither constrained nor informed by context. Therefore, the best strategy is to select the most probable target given the given signal. We expect that the more the language becomes context-dependent, the more “imperfect” it will be in the eyes of an observer that considers only the received signal.

1ni=1nmaxoOP(T=o|σi)\frac{1}{n}\sum_{i=1}^{n}\max_{\textit{o}\in\textit{O}}P(T=\textit{o}|\sigma_{i})

Signal certainty. This is an analogue of target certainty on the sender’s end – a measure of how much each target perceived by the sender maps to a single signal.

1ni=1nmaxσΣP(σ|ti)\frac{1}{n}\sum_{i=1}^{n}\max_{\sigma\in\Sigma}P(\sigma|t_{i})

Maximal contextless interpretation accuracy. This measure is devised to detect nontrivial context processing by the receiver. It corresponds to accuracy in a counterfactual situation where the receiver is neither constrained nor informed by the context and still “does its best”. In this scenario, the sender’s signal emission is unchanged, but the receiver selects the most probable target in the current context given the received signal alone. If the real accuracy exceeds this hypothetical quantity, it means that the receiver conditions its response on the context in a more complex way than choosing the most frequent signal referent considering each object oo present in the context.

1ni=1n1 iff ti=argmaxociP(T=o|σi) else 0\frac{1}{n}\sum_{i=1}^{n}1\textit{ iff }t_{i}=\arg\max_{\textit{o}\in c_{i}}P(T=\textit{o}|\sigma_{i})\textit{ else }0

For a full picture of the emergent communication, we additionally calculate information-theoretic measures based on Shannon’s mutual information (Cover and Thomas,, 2012), here denoted by II. They quantify context’s participation in the signal generation on the part of the sender and in the interpretation of its meaning on the part of the receiver – which here amounts to the selection of the object from the context. For brevity, exact values of these metrics are reported in Appendix A.

Sender context information gain.

This is the amount of information gained about the signal when the whole context is considered beyond the target. When CC, TT and σ\sigma correspond to random variables denoting context, target, and signal emitted, we define the metric as follows: I(σ;C)I(σ;T)I(\sigma;C)-I(\sigma;T)

Receiver context information gain.

This is the amount of information gained about the receiver response when the context is considered beyond the received signal. When RR is the receiver’s response random variable, this metric is defined as follows: I(R;C,σ)I(R;σ)I(R;C,\sigma)-I(R;\sigma)

6 Simulation results

We ran our model in the world where all objects have 22 categorical properties, both with 33 possible values. For illustration purposes, we can consider them as shape and colour. The context size was set to 33. The context was entirely random, that is, each object appeared in the context with equal probability and, consequently, so did every possible object combination. The target, too, was selected from the context at random.

We tested our models in two main conditions: with unlimited vocabulary and with a vocabulary size penalty. The experiments were carried out with both T and TC senders (see section 5), paired with the same receiver type. We ran each simulation 1010 times for 50k50k iterations. The exact model hyperparameters used in both experiments and final metric values are specified in Appendix A.

(a)  Refer to caption (b)  Refer to caption
(c)  Refer to caption (d)  Refer to caption
Refer to caption
Figure 4: Metric plots for Experiment 1: T and TC sender with no vocabulary penalty. The dark area around curves marks the standard error.
(a)  Refer to caption (b)  Refer to caption
(c)  Refer to caption (d)  Refer to caption
Refer to caption
Figure 5: Metric plots for Experiment 2: T and TC sender combined with 2 different levels of vocabulary penalty. The dark area around curves marks the standard error.

6.1 Experiment 1: Unlimited vocabulary

The goal of the first experiment was to check whether agents could develop contextual communication without additional pressure beyond the pressure for communication accuracy. The results can tell us whether the agent architectures used in this study (a) have any inherent tendency to develop contextual languages and (b) can reach maximum accuracy in a favourable setting. In this experiment, the vocabulary penalty coefficient pvoc=0p_{\textit{voc}}=0.

Regardless of the sender type, whether T or TC, all agent pairs converge to top accuracy (Fig. 4a). The developed languages are fully unambiguous, as shown by the maximum value of target certainty – given the signal, the receiver selects the correct object unequivocally regardless of the remaining context. Interestingly, TC sender, unlike T sender, does not always send the same signal given the target (Fig. 4c). However, this does not amount to context-dependent communication, since at the receiver’s end the signal always unequivocally indicates the target, which is shown by the maximum value of target certainty (Fig. 4d). Let us note that, nevertheless, receiver context information gain is non-zero, nearing 1.21.2 for both receiver types: the context is informative about the receiver’s response as it constrains it directly.

6.2 Experiment 2: Penalised vocabulary size

In this experiment, the fitness of the agents is reduced proportionally to the size of their vocabulary. According to previous research (Santana,, 2014; Mühlenbernd,, 2021), this should introduce pressure for contextual communication. We expected that TC sender’s access to context should allow it to “get away” with ambiguity, which would lead to languages with a smaller vocabulary. T sender, since it does not perceive the receiver’s context, should not be able to communicate in a context-dependent manner despite the pressure that favours context-dependence. We tested two different levels of the vocabulary penalty coefficient pvocp_{\textit{voc}}: 0.050.05 and 0.10.1.

All emergent signal systems were influenced by pressure to minimise their vocabulary size, which now decreased from values slightly above 44 in Experiment 1 to values ranging from 1.41.4 to 2.62.6 (Fig. 5b). Significant signal ambiguity, indicated by reduced target certainty (Fig. 5d), appeared in all conditions except T + 0.05pVoc.

For all agents with ambiguous signals, the ambiguity came at the price of slightly reduced accuracy (Fig. 5a). Nevertheless, the accuracy was still relatively high (above 0.880.88) compared to the target certainty (0.490.49 to 0.780.78) (Fig. 5d). This means that the signal alone without contextual information and constrained object choice would only allow for much lower accuracy. This effect could be observed to some degree for both sender types. The achieved accuracy did not exceed the maximum contextless interpretation accuracy for either type of sender (see Appendix A for exact values), confirming that the receiver architecture did not allow for any sort of contextual inference.

There are differences between T and TC senders. At near-identical fitness levels, TC sender managed to reduce its vocabulary in comparison to T sender. While T sender could only produce symbols based on the target, TC sender’s signal certainty values are below maximum (Fig. 5c) as the emitted signal can be conditional on the context. The target certainty was much lower for TC than for T sender for both penalty levels. In other words, TC sender managed to “reuse” the same signals to a greater degree than T sender, which resulted in greater ambiguity of the signals actually used.

We examined in more detail languages used in the final iteration of TC + 0.1pVoc condition. In 99 out of 1010 runs, 44 most frequent signals amounted to more than 99%99\% of all emitted signals. In the remaining one, the same percentage was comprised by 77 most frequent signals. For all runs, the sender’s signal use formed diverse patterns across all 99 communicated targets (Fig. 6): including signals systematically used for one category (e.g. triangles of all colours), signals used for disjoint meanings (e.g. red circle and yellow square), as well as combinations of both. While some targets were always communicated using the same signal, for others the sender’s signal choice depended on other items in the context (Fig. 7). The receiver’s choice, too, was context-dependent for some signals (Fig. 8): deterministic and correct when only one likely target was present but unreliable under meaning conflict.

Refer to caption
Refer to caption
Figure 6: Signal emission frequencies for an exemplary run of TC + 0.1pVoc condition, mapped for all 99 target objects. Signal names in the legend are arbitrary. bb refers to all triangles, so it is the most systematic of all 44 signals. On the other end of this spectrum, signal cc is used to refer to many unrelated meanings: red circles, yellow squares, but also sometimes to green circles and green squares.
Refer to caption
Figure 7: Signal emission probabilities for selected target and contexts in an exemplary run of TC + 0.1pVoc condition. Given yellow triangle target, the sender always sends one of two signals. Our inspection of the resulting protocol shows that depending on the context, the sender’s signal follows one of two distinct distributions, exemplified here. 1) For most contexts, signal aa is sent with maximum probability, but 2) if green triangle also appears in the context, one of the two signals, either aa or bb is sent with near-equal probability.
Refer to caption
Figure 8: Meaning probabilities for selected signal and contexts in an exemplary run of TC + 0.1pVoc condition. This signal, across all contexts, always refers to one of three objects. The bottom two contexts show two different cases of how potentially conflicting meanings are resolved. When both green triangle and green circle are present, the signal is dysfunctionally ambiguous as it may refer to either of them. When both green circle and yellow triangle are present, there is one conventionalized referent: yellow triangle.

7 Discussion

We investigated a situated communicative scenario in which the environment directly constrains the receiver’s response to the objects in the current context and thus constrains its interpretation of the signal. In this setting, we examined the consequences of pressure to minimise vocabulary. Our results are consistent with previous modelling work in that the penalty on vocabulary size induced a degree of ambiguity (Santana,, 2014; O’Connor,, 2015; Mühlenbernd,, 2021). Furthermore, they show the ambiguity that arises as a trade-off between accuracy and minimisation of vocabulary size, in accordance with O’Connor, (2015) predictions that a degree of inaccuracy is optimal for a certain ratio between the cost of vocabulary size and the size of the meaning space to be expressed. In Experiment 2, with a large enough vocabulary size penalty, the accuracy dropped below the maximum value for both sender types.

In our model, unlike in the above works and others (Piantadosi et al.,, 2012), effective disambiguation of signal meaning depends on the environmental constraints on the referent choice, and not on the receiver’s inferential capabilities. The effectiveness of disambiguation can be seen in that the accuracy reached levels higher than our out-of-context baseline – target certainty – for both types of senders. While the target-in-context sender (TC) did not outperform the target-only sender (T) in terms of raw accuracy, it developed a more ambiguous language with a smaller vocabulary that was still reasonably effective. This was the case even though the receiver was always context-agnostic. The context-aware sender was able to capitalise on the situated context constraining receiver’s actions. This kind of context-dependent signalling constitutes a variant of audience design that does not rely on simulating inferential processes at the receiver’s end, as in some previous works (Batali,, 1998; Brochhagen,, 2020). The need for context awareness on the sender’s side is consistent with empirical experiments by Winters et al., (2018), where the most sophisticated languages that encoded only the contextually relevant semantic distinctions emerged only when the sender was able to perceive the context.

Disambiguation mechanisms similar to those present in our model arguably play a major role in shaping languages; natural languages evolve under pressures that favour ambiguity, such as minimising speaker effort (Piantadosi et al.,, 2012) or language learnability (Kirby et al.,, 2015), and these mechanisms may allow ambiguity to enter the language without sacrificing communication success. Although constrained referent choice in the absence of the sender’s context awareness does not offer a systematic disambiguation mechanism, it may still greatly reduce communication errors and thus pressure for meaning precision. As the two meanings of the word bat – the baseball bat and the flying mammal – are unlikely to be relevant in the same context, the word itself may safely remain ambiguous. When the sender becomes aware of the context, different or additional linguistic means can be used to specify the meaning. This was illustrated by the context-aware sender’s behaviour in our model and reflected in an experiment by Ferreira et al., (2005), where participants were tasked to describe a scene. Perceiving both the flying mammal and a baseball bat in one scene, the participants often used an additional specifier for one of the objects, producing descriptions such as “bat and baseball bat”. Clearly, complex human inferential capabilities that involve both sender and receiver, as considered in pragmatics (e.g. Korta and Perry, (2020)), will likely make the ambiguous language even more effective.

While context-dependent communication has usually been portrayed as relying on processes internal to the communicating agents, we demonstrate that a nontrivial part of the job of disambiguation is performed by the environment itself (Brooks,, 1990; Wilson and Golonka,, 2013). Consequently, it seems that the composition of contexts in which we find ourselves plays a key role. Guo et al., (2021) show in a computational model that languages developed by neural agents in signalling games with medium context size encode more information than those emerging within a smaller or larger context. We can speculate that another source of environmentally induced ambiguity may be simply the frequency of referents. Huttegger, (2007) shows that for an uneven distribution of referent types, the population is more likely to stay in a suboptimal ambiguous signalling strategy. As some types need to be communicated less frequently, the potential payoff of assigning them a unique symbol is reduced. In our simulations, we used a very simple environment with uniform distribution over the communicated targets and uncorrelated object occurrence within the context. More complex environments with predictable regularities open new possibilities. If two particular objects never co-occur (the baseball bat and the flying mammal), the same signal can be potentially used for both of them without causing communication errors. Winters et al., (2018) show that predictable composition of the context is, in fact, exploited by human participants who, consequently, arrive at simpler languages than those emerging for entirely random contexts.

There may be more mechanisms responsible for handling ambiguity in language than is often assumed. The phenomena discussed so far concerned one-directional communication. In contrast, human interlocutors engage in dialogues where they explicitly ask for disambiguation (Healey et al.,, 2018) or even co-construct meaning together (Kempson et al.,, 2019). Looking from this angle, we discover that not only sharing context, but also aggregating information from individually available contexts is important for dialogically co-constructed messages (Rączaszek-Leonardi et al.,, 2014). These multiple mechanisms can remedy meaning uncertainty when precision is vital; the very same uncertainty can allow for language flexibility, which is indispensable on other occasions (see Section 2).

There are a couple of directions where the current work can be developed. First, future research could examine the effects of various context properties, such as the distribution of objects in the environment. In our work, we used only a uniform distribution; moving away from it would affect relative benefits of distinguishing certain object properties and also introduce regularities that could be exploited by agents in their disambiguation strategies. Second, the analysis could be extended to answer questions concerning emerging language structure, an issue that we have left aside in our work. As in natural languages a neighbouring word often sufficiently disambiguates an otherwise ambiguous expression (Ferreira,, 2008), we could expect that complexifying signals would render their individual components even more ambiguous. One possible amendment to the model would be allowing agents to emit compound signals of variable length. Finally, the current model could be altered to allow bi-directional exchange between agents (in the vein of Kottur et al., (2017); Bancerek and Zubek, (2023)). This would allow us to investigate dialogical mechanisms of disambiguation and their interactions with situational context in constructing flexible language.

8 Conclusion

In this work, we developed a model in which the pressure that is conducive to the emergence of context-dependent communication was combined with the environmentally constrained interpretation of the signal by the receiver. In doing this, we complemented previous modelling work, which focused on how communicating agents can exploit shared information by means of inference. To the best of our knowledge, this is also the first work where environmentally constrained communication was modelled in conjunction with a vocabulary size cost. Our results point to the fact that context-dependent communication is likely a multilayered phenomenon, which should be analysed not only in terms of individual inferential capabilities, but also in terms of the agents-environment and agent-agent relations. In line with other authors of recent research on linguistic context dependence, we urge that out-of-context signal ambiguity should not be readily equated with proneness to error but rather understood as a pervasive property of real-life situated communication, resulting in communicative efficiency and flexibility.

Acknowledgements

This work was funded by the National Science Centre, Poland (OPUS 15 grant, 2018/29/B/HS1/00884). The experiments were partly run on the supercomputers of the Interdisciplinary Centre for Mathematical and Computational Modelling at the University of Warsaw (computational grant G86-1039). The metrics in the experimental results were tracked via Weights & Biases (Biewald,, 2020).

Declarations of interest

The authors declare that they have no competing interests.

References

  • Bancerek and Zubek, (2023) Bancerek, M. and Zubek, J. (2023). The emergence of coordinative dialogue – pragmatic context in multi-agent communication. In Proceedings of the Annual Meeting of the Cognitive Science Society (Accepted).
  • Barwise and Perry, (1983) Barwise, J. and Perry, J. (1983). Situations and Attitudes. MIT Press.
  • Batali, (1998) Batali, J. (1998). Computational simulations of the emergence of grammar. Approaches to the Evolution of Language-Social and Cognitive Bases-.
  • Biewald, (2020) Biewald, L. (2020). Experiment tracking with weights and biases.
  • Bouchacourt and Baroni, (2018) Bouchacourt, D. and Baroni, M. (2018). How agents see things: On visual representations in an emergent language game. arXiv preprint arXiv:1808.10696.
  • Brochhagen, (2015) Brochhagen, T. (2015). Improving coordination on novel meaning through context and semantic structure. In Proceedings of the Sixth Workshop on Cognitive Aspects of Computational Language Learning, pages 74–82.
  • Brochhagen, (2020) Brochhagen, T. (2020). Signalling under uncertainty: Interpretative alignment without a common prior. The British Journal for the Philosophy of Science.
  • Brockhoff et al., (2010) Brockhoff, D., Auger, A., Hansen, N., Arnold, D. V., and Hohm, T. (2010). Mirrored sampling and sequential selection for evolution strategies. In Parallel Problem Solving from Nature, PPSN XI: 11th International Conference, Kraków, Poland, September 11-15, 2010, Proceedings, Part I 11, pages 11–21. Springer.
  • Brooks, (1990) Brooks, R. A. (1990). Elephants don’t play chess. Robotics and autonomous systems, 6(1-2):3–15.
  • Cangelosi, (2001) Cangelosi, A. (2001). Evolution of communication and language using signals, symbols, and words. IEEE Transactions on Evolutionary Computation, 5(2):93–101.
  • Cangelosi and Parisi, (2012) Cangelosi, A. and Parisi, D. (2012). Simulating the Evolution of Language. Springer Science & Business Media.
  • Chaabouni et al., (2020) Chaabouni, R., Kharitonov, E., Bouchacourt, D., Dupoux, E., and Baroni, M. (2020). Compositionality and Generalization In Emergent Languages. In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, pages 4427–4442, Online. Association for Computational Linguistics.
  • Chater and Christiansen, (2022) Chater, N. and Christiansen, M. H. (2022). Grammar through spontaneous order. In Algebraic Structures in Natural Language, pages 61–76. CRC Press.
  • Cover and Thomas, (2012) Cover, T. M. and Thomas, J. A. (2012). Elements of Information Theory. John Wiley & Sons.
  • Ferreira, (2008) Ferreira, V. S. (2008). Ambiguity, accessibility, and a division of labor for communicative success. Psychology of Learning and motivation, 49:209–246.
  • Ferreira et al., (2005) Ferreira, V. S., Slevc, L. R., and Rogers, E. S. (2005). How do speakers avoid ambiguous linguistic expressions? Cognition, 96(3):263–284.
  • Frank and Goodman, (2012) Frank, M. and Goodman, N. (2012). Predicting Pragmatic Reasoning in Language Games. Science (New York, N.Y.), 336:998.
  • Guo et al., (2021) Guo, S., Ren, Y., Mathewson, K., Kirby, S., Albrecht, S. V., and Smith, K. (2021). Expressivity of emergent language is a trade-off between contextual complexity and unpredictability. arXiv preprint arXiv:2106.03982.
  • Havrylov and Titov, (2017) Havrylov, S. and Titov, I. (2017). Emergence of language with multi-agent games: Learning to communicate with sequences of symbols. Advances in neural information processing systems, 30.
  • Healey et al., (2018) Healey, P. G. T., Mills, G. J., Eshghi, A., and Howes, C. (2018). Running Repairs: Coordinating Meaning in Dialogue. Topics in Cognitive Science, 10(2):367–388.
  • Huttegger, (2007) Huttegger, S. M. (2007). Evolution and the explanation of meaning. Philosophy of science, 74(1):1–27.
  • Katz and Fodor, (1963) Katz, J. J. and Fodor, J. A. (1963). The structure of a semantic theory. language, 39(2):170–210.
  • Kempson et al., (2019) Kempson, R., Gregoromichelaki, E., and Howes, C. (2019). Language as Mechanisms for Interaction: Towards an Evolutionary Tale. In Silva, A., Staton, S., Sutton, P., and Umbach, C., editors, Language, Logic, and Computation, Lecture Notes in Computer Science, pages 209–227, Berlin, Heidelberg. Springer.
  • Kharitonov and Baroni, (2020) Kharitonov, E. and Baroni, M. (2020). Emergent language generalization and acquisition speed are not tied to compositionality. arXiv preprint arXiv:2004.03420.
  • Kirby et al., (2015) Kirby, S., Tamariz, M., Cornish, H., and Smith, K. (2015). Compression and communication in the cultural evolution of linguistic structure. Cognition, 141:87–102.
  • Korta and Perry, (2020) Korta, K. and Perry, J. (2020). Pragmatics. In Zalta, E. N., editor, The Stanford Encyclopedia of Philosophy. Metaphysics Research Lab, Stanford University, spring 2020 edition.
  • Kottur et al., (2017) Kottur, S., Moura, J., Lee, S., and Batra, D. (2017). Natural Language Does Not Emerge ‘Naturally’ in Multi-Agent Dialog. In Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, pages 2962–2967, Copenhagen, Denmark. Association for Computational Linguistics.
  • Lazaridou et al., (2018) Lazaridou, A., Hermann, K. M., Tuyls, K., and Clark, S. (2018). Emergence of Linguistic Communication from Referential Games with Symbolic and Pixel Input. In International Conference on Learning Representations.
  • Lazaridou et al., (2017) Lazaridou, A., Peysakhovich, A., and Baroni, M. (2017). Multi-agent cooperation and the emergence of (natural) language. arXiv preprint arXiv:1612.07182.
  • Lazaridou et al., (2016) Lazaridou, A., Pham, N. T., and Baroni, M. (2016). Towards multi-agent communication-based language learning. arXiv preprint arXiv:1605.07133.
  • Lewis, (1969) Lewis, D. K. (1969). Convention: A Philosophical Study,. Harvard University Press, Cambridge.
  • Mühlenbernd, (2021) Mühlenbernd, R. (2021). Evolutionary stability of ambiguity in context signaling games. Synthese. An International Journal for Epistemology, Methodology and Philosophy of Science, 198(12):11725–11753.
  • O’Connor, (2015) O’Connor, C. (2015). Ambiguity is kinda good sometimes. Philosophy of Science, 82(1):110–121.
  • Olson, (1970) Olson, D. R. (1970). Language and thought: Aspects of a cognitive theory of semantics. Psychological review, 77(4):257.
  • Piantadosi et al., (2012) Piantadosi, S. T., Tily, H., and Gibson, E. (2012). The communicative function of ambiguity in language. Cognition, 122(3):280–291.
  • Reddy, (1979) Reddy, M. (1979). The conduit metaphor. Metaphor and thought, 2:285–324.
  • Rączaszek-Leonardi, (2012) Rączaszek-Leonardi, J. (2012). Language as a system of replicable constraints. In Laws, Language and Life, pages 295–333. Springer.
  • Rączaszek-Leonardi et al., (2014) Rączaszek-Leonardi, J., Dębska, A., and Sochanowicz, A. (2014). Pooling the ground: Understanding and coordination in collective sense making. Frontiers in psychology, 5:1233.
  • Rączaszek-Leonardi et al., (2018) Rączaszek-Leonardi, J., Nomikou, I., Rohlfing, K. J., and Deacon, T. W. (2018). Language development from an ecological perspective: Ecologically valid ways to abstract symbols. Ecological Psychology, 30(1):39–73.
  • Salimans et al., (2017) Salimans, T., Ho, J., Chen, X., Sidor, S., and Sutskever, I. (2017). Evolution Strategies as a Scalable Alternative to Reinforcement Learning. arXiv:1703.03864 [cs, stat].
  • Santana, (2014) Santana, C. (2014). Ambiguity in cooperative signaling. Philosophy of Science, 81(3):398–422.
  • Searle, (1980) Searle, J. R. (1980). The background of meaning. In Searle, J. R., Kiefer, F., and Bierwisch, M., editors, Speech Act Theory and Pragmatics, pages 221–232. Dordrecht: D. Reidel.
  • Skyrms, (2010) Skyrms, B. (2010). Signals: Evolution, Learning, and Information. Oxford University Press.
  • Sperber and Wilson, (1986) Sperber, D. and Wilson, D. (1986). Relevance: Communication and Cognition, volume 142. Citeseer.
  • Steels, (2011) Steels, L. (2011). Modeling the cultural evolution of language. Physics of life reviews, 8(4):339–356.
  • Steffensen and Harvey, (2018) Steffensen, S. V. and Harvey, M. I. (2018). Ecological meaning, linguistic meaning, and interactivity. Cognitive Semiotics, 11(1).
  • Wasow et al., (2005) Wasow, T., Perfors, A., and Beaver, D. (2005). The puzzle of ambiguity. Morphology and the web of grammar: Essays in memory of Steven G. Lapointe, pages 265–282.
  • Wilson and Golonka, (2013) Wilson, A. D. and Golonka, S. (2013). Embodied cognition is not what you think it is. Frontiers in psychology, 4:58.
  • Winters et al., (2018) Winters, J., Kirby, S., and Smith, K. (2018). Contextual predictability shapes signal autonomy. Cognition, 176:15–30.

Appendix A Appendix

A.1 Model hyperparameters

This section lists the details of network architecture and hyperparameters used for both experiments in this work. All non-LTSM network layers are interleaved with ReLU activations. TC sender consists of a single LSTM layer and 22 linear layers. T sender and the receiver network are both made of 33 linear layers. The hyperparameter values are manually selected to allow smooth optimisation. Hyperparameters below describe all networks in this work:

Network structure:

  • The size of all dense layers: 5050

  • The size of the hidden state and the cell state for LSTM cells: 5050

Optimisation with evolution strategies

  • mini-batch size: 5050

  • episodes per iteration: 400400

  • sample noise standard deviation σ\sigma: 0.10.1

  • gradient optimiser: AdamW with weight decay 0.010.01 and learning rate dependent on the sender type:

    • 0.020.02 for the receiver

    • 0.050.05 for TC sender and 0.020.02 for T sender

The results of Lazaridou et al., (2018) suggest that a too low signal length may make the model more prone to getting stuck in the local minima; we set a relatively large signal length 1010, to ensure that such a problem does not occur.

A.2 Experiment results

Table 1: Final metric values for Experiment 1
run metric mean std min max
T fitness 1.000 0.001 0.998 1.000
TC fitness 0.999 0.001 0.996 1.000
T accuracy 1.000 0.001 0.998 1.000
TC accuracy 0.999 0.001 0.996 1.000
T signal vocabulary size 4.250 0.961 3.181 5.605
TC signal vocabulary size 4.378 0.648 3.372 5.048
T receiver context information gain 1.220 0.005 1.211 1.226
TC receiver context information gain 1.218 0.005 1.212 1.228
T max contextless interpretation accuracy 1.000 0.000 1.000 1.000
TC max contextless interpretation accuracy 1.000 0.000 0.999 1.000
T sender context information gain 0.035 0.027 0.002 0.081
TC sender context information gain 0.780 0.274 0.250 1.128
T signal entropy 3.217 0.048 3.158 3.294
TC signal entropy 4.231 0.368 3.504 4.732
T signal certainty 0.982 0.014 0.962 0.999
TC signal certainty 0.682 0.104 0.563 0.896
T target certainty 1.000 0.000 0.999 1.000
TC target certainty 0.999 0.001 0.996 1.000
Table 2: Final metric values for Experiment 2
run metric mean std min max
T + 0.05pVoc fitness 0.872 0.005 0.865 0.878
T + 0.1pVoc fitness 0.743 0.007 0.729 0.748
TC + 0.05pVoc fitness 0.869 0.007 0.859 0.880
TC + 0.1pVoc fitness 0.744 0.010 0.730 0.765
T + 0.05pVoc accuracy 0.999 0.001 0.998 1.000
T + 0.1pVoc accuracy 0.937 0.012 0.915 0.944
TC + 0.05pVoc accuracy 0.973 0.010 0.960 0.983
TC + 0.1pVoc accuracy 0.889 0.027 0.863 0.963
T + 0.05pVoc signal vocabulary size 2.552 0.095 2.406 2.685
T + 0.1pVoc signal vocabulary size 1.937 0.044 1.853 1.967
TC + 0.05pVoc signal vocabulary size 2.089 0.113 1.990 2.297
TC + 0.1pVoc signal vocabulary size 1.444 0.191 1.331 1.982
T + 0.05pVoc receiver context information gain 1.218 0.006 1.207 1.227
T + 0.1pVoc receiver context information gain 1.129 0.018 1.092 1.146
TC + 0.05pVoc receiver context information gain 1.196 0.010 1.178 1.209
TC + 0.1pVoc receiver context information gain 1.108 0.026 1.084 1.179
T + 0.05pVoc max contextless interpretation accuracy 1.000 0.000 1.000 1.000
T + 0.1pVoc max contextless interpretation accuracy 0.944 0.010 0.922 0.950
TC + 0.05pVoc max contextless interpretation accuracy 0.975 0.010 0.962 0.984
T + 0.05pVoc sender context information gain 0.002 0.001 0.000 0.004
T + 0.1pVoc sender context information gain 0.003 0.003 0.001 0.011
TC + 0.05pVoc sender context information gain 0.123 0.060 0.052 0.208
TC + 0.1pVoc sender context information gain 0.259 0.052 0.126 0.305
T + 0.05pVoc signal entropy 3.157 0.003 3.154 3.162
T + 0.1pVoc signal entropy 2.673 0.093 2.494 2.727
TC + 0.05pVoc signal entropy 2.913 0.083 2.787 3.036
TC + 0.1pVoc signal entropy 2.065 0.247 1.908 2.764
T + 0.05pVoc signal certainty 0.999 0.001 0.998 1.000
T + 0.1pVoc signal certainty 0.999 0.002 0.994 1.000
TC + 0.05pVoc signal certainty 0.946 0.032 0.899 0.983
TC + 0.1pVoc signal certainty 0.896 0.026 0.855 0.955
T + 0.05pVoc target certainty 1.000 0.001 0.999 1.000
T + 0.1pVoc target certainty 0.776 0.042 0.696 0.800
TC + 0.05pVoc target certainty 0.843 0.060 0.769 0.892
TC + 0.1pVoc target certainty 0.490 0.101 0.450 0.778