
A Lock-free Binary Trie

Jeremy Ko, University of Toronto, [email protected]
Abstract

A binary trie is a sequential data structure for a dynamic set on the universe $\{0,\dots,u-1\}$ supporting Search with $O(1)$ worst-case step complexity, and Insert, Delete, and Predecessor operations with $O(\log u)$ worst-case step complexity.

We give a wait-free implementation of a relaxed binary trie, using read, write, CAS, and ($\log u$)-bit AND operations. It supports all operations with the same worst-case step complexity as the sequential binary trie. However, Predecessor operations may not return a key when there are concurrent update operations. We use this as a component of a lock-free, linearizable implementation of a binary trie. It supports Search with $O(1)$ worst-case step complexity and Insert, Delete and Predecessor with $O(c^2+\log u)$ amortized step complexity, where $c$ is a measure of the contention.

A lock-free binary trie is challenging to implement as compared to many other lock-free data structures because Insert and Delete operations perform a non-constant number of modifications to the binary trie in the worst-case to ensure the correctness of Predecessor operations.

1 Introduction

Finding the predecessor of a key in a dynamic set is a fundamental problem with wide-ranging applications in sorting, approximate matching and nearest neighbour algorithms. Data structures supporting Predecessor can be used to design efficient priority queues and mergeable heaps [42], and have applications in IP routing [15] and bioinformatics [3, 31].

A binary trie is a simple sequential data structure that maintains a dynamic set of keys $S$ from the universe $U=\{0,\dots,u-1\}$. Predecessor($y$) returns the largest key in $S$ less than key $y$, or $-1$ if there is no key smaller than $y$ in $S$. It supports Search with $O(1)$ worst-case step complexity and Insert, Delete, and Predecessor with $O(\log u)$ worst-case step complexity. It has $\Theta(u)$ space complexity.

The idea of a binary trie is to represent the prefixes of keys in $U$ in a sequence of $b+1$ arrays, $D_i$, for $0\leq i\leq b$, where $b=\lceil\log_2 u\rceil$. Each array $D_i$ has length $2^i$ and is indexed by the bit strings $\{0,1\}^i$. The array entry $D_i[x]$ stores the bit 1 if $x$ is the prefix of length $i$ of some key in $S$, and 0 otherwise. The sequence of arrays implicitly forms a perfect binary tree. The array entry $D_i[x]$ represents the node at depth $i$ with length-$i$ prefix $x$. Its left child is the node represented by $D_{i+1}[x\cdot 0]$ and its right child is the node represented by $D_{i+1}[x\cdot 1]$. Note that $D_b$ (which represents the leaves of the binary trie) is a direct access table describing the set $S\subseteq U$. An example of a binary trie is shown in Figure 1.

A Search($x$) operation reads $D_b[x]$ and returns True if $D_b[x]$ has value 1, and False otherwise. An Insert($x$) operation sets the bits of the nodes on the path from the leaf $D_b[x]$ to the root to 1. A Delete($x$) operation begins by setting $D_b[x]$ to 0. It then traverses up the trie starting at $D_b[x]$, setting the value of the parent of the current node to 0 if both its children have value 0. A Predecessor($y$) operation $pOp$ traverses up the trie starting from the leaf $D_b[y]$ to the root. If the left child of each node on this path either has value 0 or is also on this path, then $pOp$ returns $-1$. Otherwise, consider the first node on this path whose left child $t$ has value 1 and is not on this path. Then starting from $t$, $pOp$ traverses down the right-most path of nodes with value 1 until it reaches a leaf $D_b[w]$, and returns $w$.
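To make the sequential algorithms concrete, here is a minimal single-threaded sketch in Java; the class and method names are ours, and it mirrors the array-of-levels representation described above rather than the concurrent implementation given later.

// Minimal single-threaded sketch of the sequential binary trie described above.
// Names (SequentialBinaryTrie, D, b) are ours; this is not the paper's concurrent algorithm.
class SequentialBinaryTrie {
    private final int b;          // b = ceil(log2 u)
    private final boolean[][] D;  // D[i] has length 2^i; D[b] is the leaf level

    SequentialBinaryTrie(int logUniverse) {
        b = logUniverse;
        D = new boolean[b + 1][];
        for (int i = 0; i <= b; i++) D[i] = new boolean[1 << i];
    }

    boolean search(int x) {           // O(1): read the leaf
        return D[b][x];
    }

    void insert(int x) {              // set the bits on the leaf-to-root path to 1
        for (int i = b, p = x; i >= 0; i--, p >>= 1) D[i][p] = true;
    }

    void delete(int x) {              // clear the leaf, then clear ancestors whose children are both 0
        D[b][x] = false;
        for (int i = b - 1, p = x >> 1; i >= 0; i--, p >>= 1) {
            if (D[i + 1][2 * p] || D[i + 1][2 * p + 1]) return;
            D[i][p] = false;
        }
    }

    int predecessor(int y) {          // largest key < y, or -1 if none
        int i = b, p = y;
        // walk up while the current node is a left child or its left sibling has value 0
        while (i > 0 && (p % 2 == 0 || !D[i][p - 1])) { i--; p >>= 1; }
        if (i == 0) return -1;
        p = p - 1;                    // step to the left sibling with value 1
        while (i < b) {               // follow the right-most path of 1s down to a leaf
            i++; p <<= 1;
            if (D[i][p + 1]) p = p + 1;
        }
        return p;
    }
}

For example, with $u=4$ and $S=\{0,2\}$ as in Figure 1, predecessor(3) returns 2 and predecessor(1) returns 0.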

Figure 1: A sequential binary trie for the set $S=\{0,2\}$ from a universe $U=\{0,1,2,3\}$.

More complicated variants of sequential binary tries exist, such as van Emde Boas tries [41], x-fast tries and y-fast tries [44]. Compared to binary tries, they improve the worst-case complexity of predecessor operations. Both x-fast tries and y-fast tries use hashing to improve the space complexity, and hence are not deterministic. Furthermore, none of these variants support constant time search. One motivation for studying lock-free binary tries is as a step towards efficient lock-free implementations of these data structures.

Universal constructions provide a framework to give (often inefficient) implementations of concurrent data structures from sequential specifications. A recent universal construction by Fatourou, Kallimanis, and Kanellou [19] can be used to implement a wait-free binary trie supporting operations with $O(P+\bar{c}(op)\cdot\log u)$ worst-case step complexity, where $P$ is the number of processes in the system and $\bar{c}(op)$ is the interval contention of $op$, i.e., the number of Insert, Delete, and Predecessor operations concurrent with the operation $op$. Prior to this work, there have been no lock-free implementations of a binary trie or any of its variants without using universal constructions.

There are many lock-free data structures that directly implement a dynamic set, including variations of linked lists, balanced binary search trees, skip lists [36], and hash tables. There is also a randomized, lock-free implementation of a skip trie [33]. We discuss these data structures in more detail in Section 3.

Our contribution: We give a lock-free implementation of a binary trie using registers, compare-and-swap (CAS) objects, and ($b+1$)-bounded min-registers. A min-write on a ($b+1$)-bit memory location can be easily implemented using a single ($b+1$)-bit AND operation, so all these shared objects are supported in hardware. The amortized step complexity of our lock-free implementation of the binary trie is expressed using two other measures of contention. For an operation $op$, the point contention of $op$, denoted $\dot{c}(op)$, is the maximum number of concurrent Insert, Delete, and Predecessor operations at some point during $op$. The overlapping-interval contention [33] of $op$, denoted $\tilde{c}(op)$, is the maximum interval contention of all update operations concurrent with $op$. Our implementation supports Search with $O(1)$ worst-case step complexity, Insert with $O(\dot{c}(op)^2+\log u)$ amortized step complexity, and Delete and Predecessor operations with $O(\dot{c}(op)^2+\tilde{c}(op)+\log u)$ amortized step complexity. In a configuration $C$ where there are $\dot{c}(C)$ concurrent Insert, Delete, and Predecessor operations, the implementation uses $O(u+\dot{c}(C)^2)$ space. Our data structure consists of a relaxed binary trie, as well as auxiliary lock-free linked lists. Our goal was to maintain the $O(1)$ worst-case step complexity of Search, while avoiding the $O(\bar{c}(op)\cdot\log u)$ terms in the amortized step complexity of the other operations seen in universal constructions of a binary trie. Our algorithms to update the bits of the relaxed binary trie and traverse the relaxed binary trie finish in $O(\log u)$ steps in the worst case, and hence are wait-free. The other terms in the amortized step complexity come from updating and traversing the auxiliary lock-free linked lists.

Techniques: A linearizable implementation of a concurrent data structure requires that all operations on the data structure appear to happen atomically. In our relaxed binary trie, predecessor operations are not linearizable. We relax the properties maintained by the binary trie, so that the bit stored in each internal binary trie node does not always have to be accurate. At a high-level, we ensure that the bit at a node is accurate when there are no active update operations whose input key is a leaf of the subtrie rooted at the node. This allows us to design an efficient, wait-free algorithm for modifying the bits along a path in the relaxed binary trie.

Other lock-free data structures use the idea of relaxing properties maintained by the data structure. For example, in lock-free balanced binary search trees [11], the balance conditions are often relaxed. This allows the tree to be temporarily unbalanced, provided there are active update operations. A node can be inserted into a tree by updating a single pointer. Following this, tree rotations may be performed, but they are only used to improve the efficiency of search operations. Lock-free skip lists [21] relax the properties about the heights of towers of nodes. A new node is inserted into the bottom level of a skip list using a single pointer update. Modifications that add nodes into the linked lists at higher levels only improve the efficiency of search operations.

Relaxing the properties of the binary trie is more complicated than these two examples because it affects correctness, not just efficiency. For a predecessor operation to traverse a binary trie using the sequential algorithm, the bit of each node must be the logical OR of its children. This is not necessarily the case in our relaxed binary trie. For example, a node in the relaxed binary trie with value 1 may have two children which each have value 0.

The second way our algorithm differs from other data structures is in how operations help each other complete. Typical lock-free data structures, including those based on universal constructions, use helping when concurrent update operations need to modify the same part of a data structure: a process may help a different operation complete by making modifications to the data structure on the other operation's behalf. This technique is efficient when a small, constant number of modifications to the data structure need to be done atomically. For example, many lock-free implementations of linked lists and binary search trees can insert a new node by repeatedly attempting CAS to modify a single pointer. For a binary trie, update operations require updating $O(\log u)$ bits on the path from a leaf to the root. An operation that helps all these updates complete would have $O(\dot{c}(op)\cdot\log u)$ amortized step complexity.
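For contrast, the single-pointer CAS retry idiom mentioned above is tiny. The following sketch (ours, not taken from any of the cited implementations) inserts a node at the head of a list with one CAS; this is exactly the kind of constant-size update that a binary trie update operation cannot be reduced to.

import java.util.concurrent.atomic.AtomicReference;

// Tiny illustration of inserting a node by repeatedly attempting a single-pointer CAS.
final class LockFreeHeadList<T> {
    static final class Node<T> {
        final T value;
        final Node<T> next;
        Node(T value, Node<T> next) { this.value = value; this.next = next; }
    }

    private final AtomicReference<Node<T>> head = new AtomicReference<>(null);

    void pushFront(T value) {
        while (true) {
            Node<T> oldHead = head.get();
            Node<T> newNode = new Node<>(value, oldHead);
            if (head.compareAndSet(oldHead, newNode)) return;
            // CAS failed: another process changed head concurrently, so retry
        }
    }
}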

Predecessor operations that cannot traverse a path through the relaxed binary trie do not help concurrent update operations complete. For our linearizable, lock-free binary trie, our approach is to have update operations and predecessor operations announce themselves in an update announcement linked list and a predecessor announcement linked list, respectively. We guarantee that a predecessor operation will either learn about concurrent update operations by traversing the update announcement linked list, or it will be notified by concurrent update operations via the predecessor announcement linked list. A predecessor operation uses this information to determine a correct return value, especially when it cannot complete its traversal of the relaxed binary trie. This is in contrast to other lock-free data structures that typically announce operations so that they can be completed by concurrent operations in case the invoking processes crash.

Section 2 describes the asynchronous shared memory model. In Section 3, we describe related lock-free data structures and compare them to our lock-free binary trie. In Section 4, we give the specification of a relaxed binary trie, give a high-level description of our wait-free implementation, present the pseudocode of the algorithm, and prove it correct. In Section 5, we give a high-level description of our implementation of a lock-free binary trie, present the pseudocode of the algorithm and a more detailed explanation, prove it is linearizable, and analyze its amortized step complexity. We conclude in Section 6.

2 Model

Throughout this paper, we use an asynchronous shared memory model. Shared memory consists of a collection of shared objects accessible by all processes in a system. The primitives supported by these objects are performed atomically. A register is an object supporting Write($w$), which stores $w$ into the object, and Read(), which returns the value stored in the object. CAS($r$, $old$, $new$) compares the value stored in object $r$ with the value $old$. If the two values are the same, the value stored in $r$ is replaced with the value $new$ and True is returned; otherwise False is returned. A min-register is an object that stores a value, and supports Read(), which returns the value of the object, and MinWrite($w$), which changes the value of the object to $w$ if $w$ is smaller than its previous value.
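As a concrete illustration of the min-register interface, here is a minimal sketch (ours, not the paper's code) that keeps the value in unary, so that a MinWrite reduces to a bitwise AND, in the spirit of the abstract's observation that a min-write on a ($b+1$)-bit word can be implemented with a single ($b+1$)-bit AND. The Java version below falls back to a CAS loop via accumulateAndGet; a hardware fetch-and-AND would do it in one step.

import java.util.concurrent.atomic.AtomicInteger;

// Sketch of a (b+1)-bounded min-register. A value v in {0,...,b+1} is stored in unary
// as the v low-order one-bits, so unary(v) & unary(w) = unary(min(v, w)).
final class MinRegister {
    private final AtomicInteger bits;

    MinRegister(int initialValue) {              // e.g., b+1 in the trie's usage
        bits = new AtomicInteger((1 << initialValue) - 1);
    }

    int read() {                                 // number of one-bits = current value
        return Integer.bitCount(bits.get());
    }

    void minWrite(int w) {
        int mask = (1 << w) - 1;
        // accumulateAndGet retries with CAS; a single atomic AND instruction suffices in hardware
        bits.accumulateAndGet(mask, (cur, m) -> cur & m);
    }
}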

A configuration of a system consists of the values of all shared objects and the states of all processes. A step by a process either accesses or modifies a shared object, and can also change the state of the process. An execution is an alternating sequence of configurations and steps, starting with a configuration.

An abstract data type is a collection of objects and types of operations that satisfies certain properties. A concurrent data structure for the abstract data type provides representations of the objects in shared memory and algorithms for the processes to perform operations of these types. An operation on a data structure by a process becomes active when the process performs the first step of its algorithm. The operation becomes inactive after the last step of the algorithm is performed by the process. This last step may include a response to the operation. The execution interval of the operation consists of all steps between its first and last step (which may include steps from processes performing other operations). In the initial configuration, the data structure is empty and there are no active operations.

We consider concurrent data structures that are linearizable [28], which means that their operations appear to occur atomically. One way to show that a concurrent data structure is linearizable is by defining a linearization function, which, for all executions of the data structure, maps all completed operations and a subset of the uncompleted operations in the execution to a step or configuration, called its linearization point. The linearization points of these operations must satisfy two properties. First, each linearized operation is mapped to a step or configuration within its execution interval. Second, the return value of the linearized operations must be the same as in the execution in which all the linearized operations are performed atomically in the order of their linearization points, no matter how operations with the same linearization point are ordered.

A concurrent data structure is strongly linearizable if its linearization function satisfies an additional prefix-preserving property: for all its executions $\alpha$ and for all prefixes $\alpha'$ of $\alpha$, if an operation is assigned a linearization point for $\alpha'$, then it is assigned the same linearization point for $\alpha$, and if it is assigned a linearization point for $\alpha$ that occurs during $\alpha'$, then it is assigned the same linearization point for $\alpha'$. This means that the linearization point of each operation is determined as steps are taken in the execution and cannot depend on steps taken later in the execution. This definition, phrased somewhat differently, was introduced by Golab, Higham, and Woelfel [23]. A concurrent data structure is strongly linearizable with respect to a set of operation types $\mathcal{O}$ if it has a linearization function that is only defined on operations whose types belong to $\mathcal{O}$.

A lock-free implementation of a concurrent data structure guarantees that whenever there are active operations, one operation will eventually complete in a finite number of steps. However, the execution interval of any particular operation in an execution may be unbounded, provided other operations are completed. A wait-free implementation of a concurrent data structure guarantees that every operation completes within a finite number of steps by the process that invoked the operation.

The worst-case step complexity of an operation is the maximum number of steps taken by a process to perform any instance of this operation in any execution. The amortized step complexity of a data structure is the maximum number of steps in any execution consisting of operations on the data structure, divided by the number of operations invoked in the execution. One can determine an upper bound on the amortized step complexity by assigning an amortized cost to each operation, such that for all possible executions $\alpha$ on the data structure, the total number of steps taken in $\alpha$ is at most the sum of the amortized costs of the operations in $\alpha$.
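In symbols (notation ours): if $t(op)$ denotes the number of steps $op$ takes in $\alpha$ and $\hat{t}(op)$ its assigned amortized cost, the requirement is

$$\sum_{op\in\alpha} t(op) \;\leq\; \sum_{op\in\alpha} \hat{t}(op) \qquad \text{for every execution } \alpha.$$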

3 Related Work

In this section, we discuss related lock-free data structures and the techniques used to implement them. We first describe simple lock-free linked lists, which are a component of our binary trie. We next describe search tree implementations supporting Predecessor, and discuss the general techniques used. Next we describe implementations of a Patricia trie and a skip trie. Finally, we discuss some universal constructions and general techniques for augmenting existing data structures.

There are many existing implementations of lock-free linked lists [24, 38, 40]. The implementation with the best amortized step complexity is by Fomitchev and Ruppert [21]. It supports Insert, Delete, Predecessor, and Search operations $op$ with $O(n(op)+\dot{c}(op))$ amortized step complexity, where $n(op)$ is the number of nodes in the linked list at the start of $op$. When $n(op)$ is in $O(\dot{c}(op))$, as is the case in our binary trie implementation, operations have $O(\dot{c}(op))$ amortized step complexity. This implementation uses flagging: a pointer that is flagged indicates that an update operation wishes to modify it. Concurrent update operations that also need to modify this pointer must help the operation that flagged the pointer to complete before attempting to perform their own operation. After an update operation completes helping, it backtracks (using back pointers set in deleted nodes) to a suitable node in the linked list before restarting its own operation.

Ellen, Fatourou, Ruppert, and van Breugel [18] give the first provably correct lock-free implementation of an unbalanced binary search tree using CAS. Update operations are done by flagging a constant number of nodes, followed by a single pointer update. A flagged node contains a pointer to an operation record, which includes enough information about the update operation so that other processes can help it complete. Ellen, Fatourou, Helga, and Ruppert [17] improve the efficiency of this implementation so each operation $op$ has $O(h(op)+\dot{c}(op))$ amortized step complexity, where $h(op)$ is the height of the binary search tree when $op$ is invoked. This is done by allowing an update operation to backtrack along its search path by keeping track of the nodes it visited in a stack. There are many other implementations of lock-free unbalanced binary search trees [9, 14, 29, 32].

There are also many implementations of lock-free balanced binary search trees [5, 6, 8, 16]. Brown, Ellen, and Ruppert designed a lock-free balanced binary search tree [11] by implementing more powerful primitives, called LLX and SCX, from CAS [10]. These primitives are generalizations of LL and SC. SCX allows a single field to be updated by a process provided a specified set of nodes have not been modified since that process last performed LLX on them. Ko [30] shows that a version of their lock-free balanced binary search tree has good amortized step complexity.

Although a binary trie represents a perfect binary tree, the techniques we use to implement a binary trie are quite different than those that have been used to implement lock-free binary search trees. The update operations of a binary trie may require updating the bits of all nodes on the path from a leaf to the root. LLX and SCX do not facilitate this because SCX only updates a single field.

Brown, Prokopec, and Alistarh [12] give an implementation of an interpolation search tree supporting Search with $O(P+\log n(op))$ amortized step complexity, and Insert and Delete with $O(\bar{c}_{avg}(P+\log n(op)))$ amortized step complexity, where $\bar{c}_{avg}$ is the average interval contention of the execution. Their data structure is a balanced search tree where nodes can have large degree. A node containing $n$ keys in its subtree is ideally balanced when it has $\sqrt{n}$ children, each of which contains $\sqrt{n}$ nodes in its subtree. Update operations help replace subtrees that become too unbalanced with ideally balanced subtrees. When the input distribution of keys is well-behaved, Search can be performed with $O(P+\log\log n(op))$ expected amortized step complexity and update operations can be performed with $O(\bar{c}_{avg}(P+\log\log n(op)))$ expected amortized step complexity. Their implementation relies on the use of double-compare single-swap (DCSS). DCSS is not a primitive typically supported in hardware, although there exist implementations of DCSS from CAS [1, 22, 25].

Shafiei [37] gives an implementation of a Patricia trie. The data structure is similar to a binary trie, except that only internal nodes whose children both have value 1 are stored. In addition to Search, Insert and Delete, it supports Replace, which removes a key and adds another key in a possibly different part of the trie. Her implementation uses a variant of the flagging technique described in [18], except that it can flag two different nodes with the same operation record.

Oshman and Shavit [33] introduce a randomized data structure called a skip trie. It combines an x-fast trie with a truncated skip list whose maximum height is $\log_2\log_2 u$ (i.e., it is a y-fast trie whose balanced binary search trees are replaced with a truncated skip list). Only keys that are in the top level of the skip list are in the x-fast trie. They give a lock-free implementation of a skip trie supporting Search, Insert, Delete, and Predecessor operations with $O(\tilde{c}(op)+\log\log u)$ expected amortized step complexity from registers, CAS, and DCSS. Their x-fast trie uses lock-free hash tables [39]. Their x-fast trie implementation supports update operations with $O(\dot{c}(op)\cdot\log u)$ expected amortized step complexity. Their skip list implementation is similar to Fomitchev and Ruppert's skip list implementation [21]. In the worst case (for example, when the height of the skip list is 0), a skip trie performs the same as a linked list, so Search and Predecessor take $\Theta(n)$ steps, even when there are no concurrent updates. Our lock-free binary trie implementation is deterministic, does not rely on hashing, uses primitives supported in hardware, and always performs well when there are no concurrent update operations. Furthermore, Search operations in our binary trie complete in a constant number of reads in the worst case.

The first universal constructions were by Herlihy [26, 27]. To achieve wait-freedom, he introduced an announcement array where operations announce themselves. Processes help perform these announced operations in a round-robin order. Barnes [2] gives a universal construction for obtaining lock-free data structures. He introduces the idea of using operation records to facilitate helping.

Lock-free data structures can be augmented to support iterators, snapshots, and range queries [13, 20, 34, 35]. Wei et al. [43] give a simple technique to take snapshots of concurrent data structures in constant time. This is done by implementing a versioned CAS object that allows old values of the object to be read. The number of steps needed to read the value of a versioned CAS object at the time of a snapshot is equal to the number of times its value changed since the snapshot was taken. Provided update operations only perform a constant amortized number of successful versioned CAS operations, balanced binary search trees can be augmented to support Predecessor with $O(\bar{c}(op)+\log n(op))=O(\dot{c}(op)+\log n(op))$ amortized step complexity.

4 Relaxed Binary Trie

In this section, we describe our relaxed binary trie, which is used as a component of our lock-free binary trie. We begin by giving the formal specification of the relaxed binary trie in Section 4.1. In Section 4.2, we describe how our implementation is represented in memory. In Section 4.3, we give a high-level description of our algorithms for each operation. In Section 4.4, we give a detailed description of our algorithms for each operation and its pseudocode. Finally, in Section 4.5, we show that our implementation satisfies the specification.

4.1 Specification

A relaxed binary trie is a concurrent data structure maintaining a dynamic set $S$ from the universe $U=\{0,\dots,u-1\}$ that supports the following strongly linearizable operations:

  • TrieInsert($x$), which adds key $x$ to $S$ if it is not already in $S$,

  • TrieDelete($x$), which removes key $x$ from $S$ if it is in $S$, and

  • TrieSearch($x$), which returns True if key $x\in S$, and False otherwise.

It additionally supports the (non-linearizable) RelaxedPredecessor($y$) operation. Its concurrent specification relies on a few definitions.

Because the relaxed binary trie is strongly linearizable with respect to all of its update operations, it is possible to determine the value of the set $S$ represented by the data structure in every configuration of every execution from the sequence of linearization points of the update operations prior to this configuration. For any execution of the relaxed binary trie and any key $x\in U$, consider the sequence $\sigma$ of TrieInsert($x$) and TrieDelete($x$) operations in the order of their linearization points. A TrieInsert($x$) operation is $S$-modifying if it is the first TrieInsert($x$) operation in $\sigma$, or if it is the first TrieInsert($x$) operation that follows a TrieDelete($x$) operation. In other words, a TrieInsert($x$) operation is $S$-modifying if it successfully adds the key $x$ to $S$. Likewise, a TrieDelete($x$) operation is $S$-modifying if it is the first TrieDelete($x$) operation that follows a TrieInsert($x$) operation. A key $x$ is completely present throughout a RelaxedPredecessor operation, $pOp$, if there is an $S$-modifying TrieInsert($x$) operation, $iOp$, that completes before the invocation of $pOp$ and there is no $S$-modifying TrieDelete($x$) operation that is linearized after $iOp$ but before the end of $pOp$.

Specification of RelaxedPredecessor: Let $pOp$ be a completed RelaxedPredecessor($y$) operation in some execution. Let $k$ be the largest key less than $y$ that is completely present throughout $pOp$, or $-1$ if no such key exists. Then $pOp$ returns a value in $\{\bot\}\cup\{k,\dots,y-1\}$ such that:

  • If $pOp$ returns $\bot$, then there exists a key $x$, where $k<x<y$, such that the last $S$-modifying update operation with key $x$ linearized prior to the end of $pOp$ is concurrent with $pOp$.

  • If $pOp$ returns a key $x>k$, then $x\in S$ at some time during $pOp$.

These properties imply that if, for all $k<x<y$, the $S$-modifying update operation with key $x$ that was last linearized prior to the end of $pOp$ is not concurrent with $pOp$, then $pOp$ returns $k$. In this case, if the last $S$-modifying update operation with key $x$ linearized prior to the end of $pOp$ is a TrieInsert($x$) operation, then it was completed before the start of $pOp$. But then $x$ is completely present throughout $pOp$, contradicting the definition of $k$. Therefore, $k$ is the predecessor of $y$ throughout $pOp$.

4.2 Our Relaxed Binary Trie Implementation

Our wait-free implementation of a relaxed binary trie supports TrieSearch with $O(1)$ worst-case step complexity, and TrieInsert, TrieDelete, and RelaxedPredecessor with $O(\log u)$ worst-case step complexity. We first describe the major components of the relaxed binary trie and how it is stored in memory.

Like the sequential binary trie, the relaxed binary trie consists of a collection of arrays, $D_i$ for $0\leq i\leq b=\lceil\log_2 u\rceil$. Each array $D_i$, for $0\leq i\leq b$, has length $2^i$ and represents the nodes at depth $i$ of the relaxed binary trie.

An update node is created by a TrieInsert or TrieDelete operation. It is an INS node if it is created by a TrieInsert operation, or a DEL node if it is created by a TrieDelete operation. It includes the input key of the operation that created it.

There is an array latest indexed by the keys in $U$, where latest[x] contains a pointer to an update node with key $x\in U$. The update node pointed to by latest[x] belongs to the last $S$-modifying TrieInsert($x$) or TrieDelete($x$) operation that has been linearized. So the update node pointed to by latest[x] is an INS node if and only if $x\in S$. In the initial configuration, when $S=\emptyset$, latest[x] points to a dummy DEL node.

Recall that, in the sequential binary trie, each binary trie node $t$ contains the bit 1 if there is a leaf in its subtrie whose key is in $S$, and 0 otherwise. These bits need to be accurately set for the correctness of predecessor operations. In our relaxed binary trie, each binary trie node has an associated value, called its interpreted bit. The interpreted bit of a leaf with key $x$ is 1 if and only if $x\in S$.

For each internal binary trie node $t$, let $U_t$ be the set of keys of the leaves contained in the subtrie rooted at $t$. When there are no active update operations with keys in $U_t$, the interpreted bit of $t$ is the logical OR of the interpreted bits of the leaves of the subtrie rooted at $t$. More generally, our relaxed binary trie maintains the following two properties concerning its interpreted bits. For all binary trie nodes $t$ and configurations $C$:

  1. IB0

    If $U_t\cap S=\emptyset$ and, for all $x\in U_t$, either there has been no $S$-modifying TrieDelete($x$) operation or the last $S$-modifying TrieDelete($x$) operation linearized prior to $C$ is no longer active, then the interpreted bit of $t$ is 0 in $C$.

  2. IB1

    If there exists $x\in U_t\cap S$ such that the last $S$-modifying TrieInsert($x$) operation linearized prior to $C$ is no longer active, then the interpreted bit of $t$ is 1 in $C$.

When there are active update operations with a key in $U_t$, the interpreted bit of the binary trie node $t$ may be different from the bit stored in $t$ in the sequential binary trie representing the same set.

The interpreted bit of $t$ is not physically stored in $t$, but is, instead, computed from the update node pointed to by latest[x], for some key $x\in U_t$. Each internal binary trie node $t$ stores this key $x$. The interpreted bit of $t$ depends on the update node, uNode, pointed to by latest[x]. If uNode is an INS node, the interpreted bit of $t$ is 1. When uNode is a DEL node, the interpreted bit of $t$ is determined by two thresholds, uNode.upper0Boundary and uNode.lower1Boundary. In this case, the interpreted bit of $t$ is

  • 1 if t.height ≥ uNode.lower1Boundary,

  • 0 if t.height < uNode.lower1Boundary and t.height ≤ uNode.upper0Boundary, and

  • 1 otherwise.

Only TrieInsert operations modify lower1Boundary and only TrieDelete operations modify upper0Boundary. We discuss these thresholds in more detail when describing TrieInsert and TrieDelete in the following section.
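The following Java sketch (ours; field and class names are illustrative, and the pseudocode in Section 4.4 is the authoritative version) shows the shared records this rule refers to and how the interpreted bit of an internal node is computed from them.

import java.util.concurrent.atomic.AtomicInteger;
import java.util.concurrent.atomic.AtomicReference;
import java.util.concurrent.atomic.AtomicReferenceArray;

// Sketch of the shared records behind the interpreted-bit rule above.
final class RelaxedTrieSketch {
    enum Type { INS, DEL }

    static final class UpdateNode {
        final int key;
        final Type type;
        final AtomicInteger lower1Boundary;   // min-register; initially b+1, lowered by Inserts
        volatile int upper0Boundary = 0;      // written only by the Delete that created this node
        UpdateNode(int key, Type type, int b) {
            this.key = key;
            this.type = type;
            this.lower1Boundary = new AtomicInteger(b + 1);
        }
    }

    static final class TrieNode {
        final int height;
        final AtomicReference<UpdateNode> dNodePtr;  // update node this internal node depends on
        TrieNode(int height, UpdateNode initial) {
            this.height = height;
            this.dNodePtr = new AtomicReference<>(initial);
        }
    }

    // latest[x] holds the update node of the last S-modifying operation with key x
    final AtomicReferenceArray<UpdateNode> latest;

    RelaxedTrieSketch(int u, int b) {
        latest = new AtomicReferenceArray<>(u);
        for (int x = 0; x < u; x++) latest.set(x, new UpdateNode(x, Type.DEL, b)); // dummy DEL nodes
    }

    // The three cases of the interpreted-bit definition above, for an internal node t.
    int interpretedBit(TrieNode t) {
        UpdateNode up = latest.get(t.dNodePtr.get().key);
        if (up.type == Type.INS) return 1;
        if (t.height >= up.lower1Boundary.get()) return 1;
        if (t.height <= up.upper0Boundary) return 0;
        return 1;
    }
}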

4.3 High-Level Algorithm Description

A TrieSearch($x$) operation reads the update node pointed to by latest[x], returns True if it is an INS node, and returns False if it is a DEL node.

A TrieInsert($x$) or TrieDelete($x$) operation, $op$, begins by finding the first activated update node in latest[x]. If it has the same type as $op$, then $op$ can return because $S$ does not need to be modified. Otherwise, $op$ creates a new inactive update node uNode with key $x$. It then attempts to add uNode to the beginning of latest[x] by changing latest[x] to point to uNode using CAS. If successful, the operation is linearized at this step. Any other update nodes in latest[x] are then removed by setting the next pointer of uNode to $\bot$. If multiple update operations with key $x$ concurrently attempt to add an update node to the beginning of latest[x], exactly one will succeed. Update operations that are unsuccessful instead help the update operation that succeeded by setting the status of its update node to active.

In the case that $op$ successfully changes latest[x] to point to uNode, $op$ must then update the interpreted bits of the relaxed binary trie. Both TrieInsert and TrieDelete operations update the interpreted bits of the relaxed binary trie in a manner similar to the sequential data structure. A TrieInsert($x$) operation traverses from the leaf with key $x$ to the root and sets the interpreted bits along this path to 1 if they are not already 1. This is described in more detail in Section 4.3.1. A TrieDelete($x$) operation traverses the binary trie starting from the leaf with key $x$ and proceeds towards the root. It changes the interpreted bit of a binary trie node on this path to 0 if both its children have interpreted bit 0, and returns otherwise. This is described in more detail in Section 4.3.2.

4.3.1 TrieInsert

Consider a latest TrieInsert($x$) operation, $iOp$, and let iNode be the INS node it created, so iNode is the first activated update node in latest[x]. Let $t$ be a binary trie node that $iOp$ encounters as it is updating the interpreted bits of the binary trie. If $t$ already has interpreted bit 1, then it does not need to be updated. This can happen when the interpreted bit of $t$ depends on an INS node (for example, when $t$ stores key $x$ and hence depends on iNode). So suppose the interpreted bit of $t$ is 0. In this case, $t$ stores a key $x'\neq x$ and its interpreted bit depends on a DEL node, dNode. Only a Delete operation can change the key stored in $t$, and it can only change the key to its own key. We do not allow Insert operations to change this key, to prevent concurrent Delete operations from repeatedly interfering with an Insert operation. Instead, Insert operations can modify dNode.lower1Boundary to change the interpreted bit of $t$ from 0 to 1. This is a min-register whose value is initially $b+1$, which is greater than the height of any binary trie node. All binary trie nodes that depend on dNode and whose height is at least the value of dNode.lower1Boundary have interpreted bit 1. Therefore, to change the interpreted bit of $t$ from 0 to 1, $iOp$ can perform MinWrite(t.height) on dNode.lower1Boundary. This also changes the interpreted bit of all ancestors of $t$ that depend on dNode to 1. A min-register is used so that modifying dNode.lower1Boundary never changes the interpreted bit of any binary trie node from 1 to 0.

An example execution of an Insert operation updating the interpreted bits of the relaxed binary trie is shown in Figure 2. Blue rectangles represent active INS nodes, while red rectangles represent active DEL nodes. The number in each binary trie node is its interpreted bit. The dashed arrow from an internal binary trie node points to the update node it depends on. Note that the dashed arrow is not a physical pointer stored in the binary trie node. Under each update node are the values of its lower1Boundary, abbreviated l1b, and upper0Boundary, abbreviated u0b. Figure 2(a) shows a possible state of the data structure where $S=\emptyset$. In Figure 2(b), an Insert(0) operation, $iOp$, activates its newly added INS node in latest[0]. This simultaneously changes the interpreted bit of the leaf with key 0 and its parent from 0 to 1 in a single step. In Figure 2(c), $iOp$ changes the interpreted bit of the root from 0 to 1. This is done using a MinWrite, which changes the lower1Boundary of the DEL node in latest[3] (i.e., the update node that the root depends on) from 3 to the height of the root.

Figure 2: An example of an Insert(0) operation setting the interpreted bits of the binary trie nodes from its leaf with key 0 to the root to 1.

4.3.2 TrieDelete

Consider a latest Delete($x$) operation, $dOp$, and let dNode be the DEL node it created, so dNode is the first activated update node in latest[x]. Furthermore, this means that the leaf with key $x$ has interpreted bit 0.

Let $t$ be an internal binary trie node on the path from the leaf with key $x$ to the root. Suppose $dOp$ successfully changed the interpreted bit of one of $t$'s children to 0. If the interpreted bit of the other child of $t$ is 0, then $dOp$ attempts to change the interpreted bit of $t$ to 0. First, $dOp$ tries to change the update node that $t$ depends on to dNode by changing the key stored in $t$ to $x$. After a constant number of reads and at most 2 CAS operations, our algorithm guarantees that if $dOp$ does not successfully change $t$ to depend on dNode, then, for some $y\in U_t$, a latest Delete($y$) operation, $dOp'$, changed $t$ to depend on the DEL node that $dOp'$ created. In this case, $dOp'$ will change the interpreted bit of $t$ to 0 on $dOp$'s behalf, so $dOp$ can stop updating the interpreted bits of the binary trie. Suppose $dOp$ does successfully change $t$ to depend on dNode. To change the interpreted bit of $t$ to 0, $dOp$ writes t.height into dNode.upper0Boundary, which is a register with initial value 0. This indicates that all binary trie nodes of height at most t.height that depend on dNode have interpreted bit 0. Only $dOp$, the creator of dNode, writes to dNode.upper0Boundary. Since $dOp$ changes the interpreted bits of binary trie nodes in order from the leaf with key $x$ up to the root, dNode.upper0Boundary is only ever incremented by 1, starting from 0.

An example execution of Delete operations updating the interpreted bits of the relaxed binary trie is shown in Figure 3. Figure 3(a) shows a possible state of the data structure where $S=\{0,1\}$. In Figure 3(b), a Delete(0) operation, $dOp$, and a Delete(1) operation, $dOp'$, activate their newly added DEL nodes. This removes the keys 0 and 1 from $S$. The leaves with keys 0 and 1 have interpreted bit 0. In Figure 3(c), $dOp'$ sees that its sibling leaf has interpreted bit 0. Then $dOp'$ successfully changes the left child of the root to depend on its DEL node, while $dOp$ is unsuccessful and returns. In Figure 3(d), $dOp'$ increments the upper0Boundary of its DEL node, so it is now equal to the height of the left child of the root. This changes the interpreted bit of the left child of the root to 0. In Figure 3(e), $dOp'$ sees that the right child of the root has interpreted bit 0, so the interpreted bit of the root needs to be updated. So $dOp'$ changes the root to depend on its DEL node. In Figure 3(f), $dOp'$ increments the upper0Boundary of its DEL node, so it is now equal to the height of the root. This changes the interpreted bit of the root to 0.

Figure 3: An example of a Delete(0) and a Delete(1) updating the interpreted bits of the binary trie.

4.3.3 RelaxedPredecessor

A RelaxedPredecessor($y$) operation, $pOp$, traverses the relaxed binary trie in a manner similar to the sequential algorithm, except that it uses the interpreted bits of binary trie nodes to direct the traversal. If $pOp$ completes its traversal of the relaxed binary trie following the sequential algorithm, then it either returns the key of the leaf it reaches, or $-1$ if the traversal ended at the root.

It is possible that $pOp$ is unable to complete a traversal of the relaxed binary trie due to inaccurate interpreted bits. During the downward part of its traversal, it may encounter an internal binary trie node that has interpreted bit 1 but both of whose children have interpreted bit 0. When this occurs, $pOp$ terminates and returns $\bot$; there is a concurrent update operation that needs to update this part of the relaxed binary trie.

4.4 Detailed Algorithm Description and Pseudocode

In this section we give a detailed description of the algorithm for each relaxed binary trie operation, and present its pseudocode.

4.4.1 TrieSearch and Basic Helper Functions

The TrieSearch($x$) algorithm finds the update node pointed to by latest[x]. It returns True if this update node has type INS, and False if this update node has type DEL.

We use the helper function FindLatest($x$) to return the update node pointed to by latest[x]. The helper function FirstActivated($v$) takes a pointer $v$ to an update node and checks whether $v$ is the update node pointed to by latest[v.key]. The implementations of these helper functions are simple in the case of the relaxed binary trie, but will be replaced with a different implementation when we consider the lock-free binary trie.

The helper function InterpretedBit($t$) receives a binary trie node $t$ and returns its interpreted bit. Its implementation follows from the definition of the interpreted bit.

1:Algorithm FindLatest(x)
2:     return latest[x]
3:Algorithm TrieSearch(x)
4:     uNode ← FindLatest(x)
5:     if uNode.type = INS then return True
6:     else return False
7:Algorithm FirstActivated(v)
8:     ℓ ← latest[v.key]
9:     return v = ℓ
10:Algorithm InterpretedBit(t)
11:     uNode ← FindLatest(t.dNodePtr.key)
12:     if uNode.type = INS then return 1
13:     if t.height ≥ uNode.lower1Boundary then return 1
14:     if t.height ≤ uNode.upper0Boundary then return 0
15:     return 1

4.4.2 TrieInsert

A TrieInsert($x$) operation $iOp$ begins by reading the update node, dNode, pointed to by latest[x]. If dNode is not a DEL node, then $x$ is already in $S$, so TrieInsert($x$) returns. Otherwise, $iOp$ creates a new INS node, denoted iNode, with key $x$. It then attempts to change latest[x] to point to iNode using CAS (on line 22). A TrieInsert($x$) operation that successfully performs this CAS adds $x$ to $S$, and is linearized at this successful CAS. If multiple TrieInsert($x$) operations concurrently attempt to change latest[x], exactly one will succeed. Any TrieInsert($x$) operations that are unsuccessful can return, because some other TrieInsert($x$) operation successfully added $x$ to $S$. Note that by updating latest[x] to point to iNode, the interpreted bit of the leaf with key $x$ becomes 1.

16:Algorithm TrieInsert(x)
17:     dNode ← FindLatest(x)
18:     if dNode.type ≠ DEL then return     ▷ x is already in S
19:     Let iNode be a pointer to a new update node:
20:      iNode.key ← x
21:      iNode.type ← INS
22:     if CAS(latest[x], dNode, iNode) = False then     ▷ Insert operation is linearized
23:         return
24:     InsertBinaryTrie(iNode)
25:     return

The algorithm to update the binary trie is described in InsertBinaryTrie(iNode), where iNode is the INS node created by $iOp$. The purpose of InsertBinaryTrie is to set the interpreted bit of each binary trie node $t$ on the path from the parent of the leaf with key $x$ to the root to 1. The algorithm first determines the current interpreted bit of $t$ on lines 28 to 30 by reading fields of the first activated update node, uNode, in latest[t.dNodePtr.key]. If the interpreted bit of $t$ is 1, $iOp$ proceeds to the parent of $t$. If the interpreted bit of $t$ is 0, then uNode.lower1Boundary is updated to the value t.height using a MinWrite on line 33. Updating uNode.lower1Boundary serves the purpose of changing the interpreted bit of $t$ to 1, as well as informing the Delete operation that created uNode to stop updating the binary trie. It would be problematic if $iOp$ crashed while poised to perform this MinWrite. So $iOp$ sets iNode.target to point to uNode beforehand, indicating that $iOp$ wishes to perform a MinWrite to uNode. The field iNode.target is read by TrieDelete($x$) operations to help $iOp$ in case it crashes.

26:Algorithm InsertBinaryTrie(iNode)
27:     for each binary trie node t on the path from the parent of the leaf with key iNode.key to the root do
28:         uNode ← FindLatest(t.dNodePtr.key)
29:         if uNode.type = DEL then
30:              if t.height < uNode.lower1Boundary and t.height ≤ uNode.upper0Boundary then
31:                  iNode.target ← uNode
32:                  if FirstActivated(iNode) = False then return
33:                  MinWrite(uNode.lower1Boundary, t.height)

4.4.3 TrieDelete

A TrieDelete($x$) operation $dOp$ checks whether the update node pointed to by latest[x] is an INS node. If not, it returns because $x$ is not in $S$. Otherwise, it creates a new DEL node, dNode. It then updates latest[x] to point to dNode using CAS, in the same way as TrieInsert($x$). A TrieDelete($x$) operation that successfully performs this CAS removes $x$ from $S$, and is linearized at this successful CAS.

34:Algorithm TrieDelete(x)
35:     iNode ← FindLatest(x)
36:     if iNode.type ≠ INS then return     ▷ x is not in S
37:     Let dNode be a pointer to a new update node:
38:      dNode.key ← x
39:      dNode.type ← DEL
40:     if CAS(latest[x], iNode, dNode) = False then     ▷ Delete operation is linearized
41:         return
42:     iNode.target.stop ← True
43:     DeleteBinaryTrie(dNode)
44:     return

The operation $dOp$ then calls DeleteBinaryTrie(dNode) to update the interpreted bits of the relaxed binary trie nodes from the parent of the leaf with key $x$ to the root. Let $t$ be an internal binary trie node on the path from the leaf with key $x$ to the root. Suppose $dOp$ successfully changed the interpreted bit of one of $t$'s children to 0. If the interpreted bit of the other child of $t$ is 0, then $dOp$ attempts to change the interpreted bit of $t$ to 0. Recall that $t$ depends on the first activated update node in latest[t.dNodePtr.key]. To change $t$ to depend on dNode, $dOp$ performs CAS to attempt to change t.dNodePtr to point to dNode. Note that $dOp$ performs two attempts of this CAS, each time checking that dNode.stop is not set to True (indicating that a concurrent Insert($x$) wants to set the interpreted bit of $t$ to 1) and that dNode is still the first activated update node in latest[x]. Two CAS attempts are performed to prevent out-dated Delete operations that were poised to perform CAS from conflicting with latest Delete operations. If $dOp$ is unsuccessful in both of its CAS attempts, it can stop updating the binary trie because some concurrent Delete($x'$) operation, with key $x'\in U_t$, successfully changed t.dNodePtr to point to its own DEL node. Otherwise, $dOp$ is successful in changing $t$ to depend on dNode. Immediately after $dOp$'s successful CAS, the interpreted bit of $t$ is still 1 (because dNode.upper0Boundary has not yet been incremented to t.height). Once again, $dOp$ verifies that both children of $t$ have interpreted bit 0, and returns otherwise. To change the interpreted bit of $t$ to 0, $dOp$ writes t.height into dNode.upper0Boundary, which increments its value. This indicates that all binary trie nodes of height at most t.height that depend on dNode have interpreted bit 0. Only $dOp$, the creator of dNode, writes to dNode.upper0Boundary. Since $dOp$ changes the interpreted bits of binary trie nodes in order from the leaf with key $x$ to the root, dNode.upper0Boundary is only ever incremented by 1, starting from 0.

45:Algorithm DeleteBinaryTrie(dNode)
46:     t ← leaf of binary trie with key dNode.key
47:     while t is not the root of the binary trie do
48:         if InterpretedBit(t.sibling) = 1 or InterpretedBit(t) = 1 then return
49:         t ← t.parent
50:         d ← t.dNodePtr
51:         if FirstActivated(dNode) = False then return
52:         if dNode.stop = True or dNode.lower1Boundary ≠ b+1 then return
53:         if CAS(t.dNodePtr, d, dNode) = False then
54:              d ← t.dNodePtr
55:              if FirstActivated(dNode) = False then return
56:              if dNode.stop = True or dNode.lower1Boundary ≠ b+1 then return
57:              if CAS(t.dNodePtr, d, dNode) = False then return
58:         if InterpretedBit(t.left) = 1 or InterpretedBit(t.right) = 1 then return
59:         dNode.upper0Boundary ← t.height

4.4.4 RelaxedPredecessor

A RelaxedPredecessor($y$) operation, $pOp$, begins by traversing up the relaxed binary trie starting from the leaf with key $y$ towards the root (during the while-loop on line 62). If the left child of each node on this path either has interpreted bit 0 or is also on this path, then $pOp$ returns $-1$ (on line 65). Otherwise, consider the first node on this path whose left child $t$ (set on line 67) has interpreted bit 1 and is not on this path. Starting from $t$, $pOp$ traverses the right-most path of binary trie nodes with interpreted bit 1 (during the while-loop on line 68). If a binary trie node $t$ is encountered where both of its children have interpreted bit 0, then $\bot$ is returned (on line 75). Otherwise, $pOp$ reaches a leaf and returns its key (on line 77).

60:Algorithm RelaxedPredecessor(y)
61:     t ← the binary trie node represented by D_b[y]
62:     while t is the left child of t.parent or InterpretedBit(t.sibling) = 0 do
63:         t ← t.parent
64:         if t is the root then
65:              return -1
66:     ▷ Traverse right-most path of nodes with interpreted bit 1 from t.parent.left
67:     t ← t.parent.left
68:     while t.height > 0 do
69:         if InterpretedBit(t.right) = 1 then
70:              t ← t.right
71:         else if InterpretedBit(t.left) = 1 then
72:              t ← t.left
73:         else
74:              ▷ both children of t have interpreted bit 0
75:              return ⊥
76:     ▷ t is a leaf node with key t.key
77:     return t.key

4.5 Proof of Correctness

In this section, we prove that our relaxed binary trie implementation is linearizable. The proof is organized as follows. In Section 4.5.1, we give the linearization points of TrieSearch, TrieInsert, and TrieDelete, and prove that the implementation is strongly linearizable with respect to these operation types. In Section 4.5.2, we prove the properties satisfied by the interpreted bits of the binary trie. In Section 4.5.3, we prove that RelaxedPredecessor operations follow the specification of the relaxed binary trie.

4.5.1 Strong Linearizability of TrieInsert, TrieDelete, and TrieSearch

A TrieSearch($x$) operation is linearized immediately after it reads latest[x]. We argue that this operation returns True if and only if $x\in S$ in this configuration.

Lemma 4.1.

Let $op$ be a TrieSearch($x$) operation. Then $op$ returns True if and only if, in the configuration $C$ immediately after $op$ reads latest[x], $x\in S$.

Proof.

Let uNode be the update node pointed to by latest[x] that is read by $op$, and let $C$ be the configuration immediately after this read by $op$. If $op$ returned True, it read that uNode.type = INS. The type of an update node is immutable, so latest[x] points to an INS node in $C$. So it follows by definition that $x\in S$ in $C$. If $op$ returned False, it read that uNode.type = DEL. It follows by definition that $x\notin S$ in $C$. ∎

The TrieInsert($x$) and TrieDelete($x$) operations that are $S$-modifying successfully change latest[x] to point to their own update nodes using CAS, and are linearized at these CAS steps. A TrieInsert($x$) operation that is not $S$-modifying does not update latest[x] to point to its own update node. This happens when it reads that latest[x] points to an INS node, or when it performs an unsuccessful CAS. In the following two lemmas, we prove that, in each of these two cases, there is a configuration during the TrieInsert($x$) operation in which $x\in S$, and hence it does not need to add $x$ to $S$. The case for TrieDelete($x$) is symmetric.

Lemma 4.2.

If uOp is a TrieInsert(x) operation that returns on line 18, then in the configuration C immediately after uOp reads latest[x], x ∈ S. If uOp is a TrieDelete(x) operation that returns on line 36, then in the configuration C immediately after uOp reads latest[x], x ∉ S.

Proof.

Suppose uOp is a TrieInsert(x) operation. Let uNode be the update node pointed to by latest[x] that is read by uOp. Since uOp returned on line 18 (or on line 36 for TrieDelete), it saw that uNode.type = INS (or uNode.type = DEL for TrieDelete). By definition, x ∈ S (or x ∉ S) in the configuration C immediately after this read. ∎

Lemma 4.3.

If uOpuOp is a TrieInsert(x)(x) operation that returns on line 23, then there is a configuration during uOpuOp in which xSx\in S. If uOpuOp is a TrieDelete(x)(x) operation that returns on line 41, then there is a configuration during uOpuOp in which xSx\notin S.

Proof.

We prove the case when uOp is a TrieInsert(x) operation. The case when uOp is a TrieDelete(x) operation follows similarly.

Since uOp does not return on line 18, it read that latest[x] points to a DEL node. From the code, latest[x] can only change from pointing to this DEL node to pointing to an INS node by a successful CAS of some TrieInsert(x) operation. Since uOp performs an unsuccessful CAS on latest[x], some other TrieInsert(x) operation changed latest[x] to point to an INS node using a successful CAS sometime between uOp's read of latest[x] and uOp's unsuccessful CAS on latest[x]. In the configuration immediately after this successful CAS, x ∈ S. ∎

Lemma 4.4.

The implementation of the relaxed binary trie is strongly linearizable with respect to TrieInsert, TrieDelete, and TrieSearch operations.

Proof.

Consider an execution α\alpha of the relaxed binary trie. Let α\alpha^{\prime} be any prefix of α\alpha. Let opop be a TrieSearch, TrieInsert, or TrieDelete operation in α\alpha.

Suppose opop is a TrieSearch(x)(x) operation. This operation is linearized in the configuration CC immediately after opop reads latest[x]\textit{latest}[x]. By Lemma 4.1, opop returns True if and only if xSx\in S in CC. If α\alpha^{\prime} contains CC, then opop is linearized at CC for both α\alpha and α\alpha^{\prime}.

Suppose op is a TrieInsert(x) operation. Suppose op reads that latest[x] points to an INS node (on line 17) during α′ and, hence, returns without changing latest[x]. By Lemma 4.2, x ∈ S in the configuration C immediately after this read. Therefore, if α′ contains C, then op is linearized at C for both α and α′.

So opop reads that latest[x]\textit{latest}[x] points to a DEL node. If opop successfully changes latest[x]\textit{latest}[x] to point to its own update node using CAS, then opop is linearized at this CAS step. If α\alpha^{\prime} contains this CAS step, then opop is linearized at this CAS step for both α\alpha and α\alpha^{\prime}.

If opop does not successfully change latest[x]\textit{latest}[x] to point to its own update node using CAS, then by Lemma 4.3, there is a configuration sometime between opop’s read of latest[x]\textit{latest}[x] and its unsuccessful CAS in which xSx\in S. Consider the earliest such configuration CC, so CC immediately follows the successful CAS of some TrieInsert(x)(x) operation. If α\alpha^{\prime} contains this successful CAS (and hence CC), then opop performs an unsuccessful CAS in any continuation of α\alpha^{\prime}. So opop is linearized at CC for both α\alpha and α\alpha^{\prime}.

The case when op is a TrieDelete(x) operation follows similarly. ∎

4.5.2 Properties of the Interpreted Bits

In this section, we prove that properties IB0 and IB1 of the interpreted bits are satisfied by our implementation. We say that the interpreted bit of a binary trie node t is accurate if it is the OR of the interpreted bits of the leaves in the subtrie rooted at t. In all configurations C and for all binary trie nodes t, either the interpreted bit of t is accurate or there is an active update operation in C that may change the interpreted bit of t to be accurate.
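
The arguments below repeatedly unfold how the interpreted bit of an internal binary trie node is determined by the DEL node recorded in its dNodePtr field and the corresponding latest list. For reference, here is a hedged C++ sketch of that rule; the struct layouts and the findLatest helper (corresponding to FindLatest in Section 5.2.1) are illustrative assumptions rather than the paper's pseudocode.

#include <cstdint>

enum class Type { INS, DEL };

struct UpdateNode {
    std::int64_t key;
    Type type;
    int upper0Boundary;   // written by DeleteBinaryTrie, initially 0
    int lower1Boundary;   // min-register written by inserts, initially b+1
};

struct BinaryTrieNode {
    int height;            // >= 1 for internal nodes, 0 for leaves
    UpdateNode *dNodePtr;   // DEL node most recently CASed into this node
};

// Assumed helper: returns the first activated update node in latest[key].
UpdateNode *findLatest(std::int64_t key);

// Interpreted bit of an internal node t: it is 0 exactly when the first
// activated update node for the key stored in t.dNodePtr is a DEL node
// dNode with t.height < dNode.lower1Boundary and
// t.height <= dNode.upper0Boundary; otherwise it is 1.  (For a leaf, the
// bit is 1 iff the first activated update node for its key is an INS node.)
bool interpretedBit(const BinaryTrieNode *t) {
    UpdateNode *dNode = findLatest(t->dNodePtr->key);
    if (dNode->type == Type::INS) return true;
    bool zero = t->height < dNode->lower1Boundary &&
                t->height <= dNode->upper0Boundary;
    return !zero;
}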

We begin with basic observations and lemmas about how the fields of DEL nodes change.

Observation 4.5.

Only TrieDelete operations change 𝑑𝑁𝑜𝑑𝑒𝑃𝑡𝑟\mathit{dNodePtr} of a binary trie node and only change it to point to their own DEL node (on line 53 or 57).

Observation 4.6.

For any binary trie node t, suppose t.dNodePtr is changed from pointing to a DEL node dNode to pointing to a different DEL node. Then in any future configuration, t.dNodePtr does not point to dNode.

Observation 4.7.

Let 𝑑𝑁𝑜𝑑𝑒\mathit{dNode} be the DEL node created by a TrieDelete(x)(x) operation opop. Only opop writes to 𝑑𝑁𝑜𝑑𝑒.upper0Boundary\mathit{dNode}.\mathit{upper0Boundary} and it only does so on line 59.

Lemma 4.8.

Let dNode be a DEL node created by a TrieDelete(x) operation op. Suppose op changes t.dNodePtr of some binary trie node t to point to dNode due to a successful CAS s (performed on line 53 or 57). Before s, op has performed a sequence of successful CASs updating each binary trie node from the parent of the leaf with key x up to a child of t to point to dNode. Immediately after s, dNode.upper0Boundary = t.height − 1.

Proof.

Let C be the configuration immediately after the CAS that changes t.dNodePtr to point to dNode. Let h = t.height. Let t_0, …, t_h be the sequence of nodes on the path from the leaf t_0 with key x to t_h = t.

Suppose that t_h is the parent of the leaf with key x, so t.height − 1 = 0. The initial value of dNode.upper0Boundary is 0. In the first iteration of DeleteBinaryTrie, op does not change dNode.upper0Boundary prior to performing this CAS. Therefore, in C, dNode.upper0Boundary = 0.

So suppose that t_h is not the parent of the leaf with key x. From the code of DeleteBinaryTrie, op does not proceed to its next iteration unless it performs at least one successful CAS on either line 53 or 57. Hence, prior to C, op has performed a sequence of successful CASs, updating t_i.dNodePtr to point to dNode, for 1 ≤ i ≤ h − 1.

In the iteration prior to the iteration in which op performs a successful CAS on t_h, op updates dNode.upper0Boundary to t_{h−1}.height = t.height − 1 on line 59. By Observation 4.7, no operation besides op changes dNode.upper0Boundary between this write and C. Hence, in C, dNode.upper0Boundary = t.height − 1. ∎

Lemma 4.9.

Suppose 𝑑𝑁𝑜𝑑𝑒\mathit{dNode} and 𝑑𝑁𝑜𝑑𝑒\mathit{dNode}^{\prime} were the DEL nodes created by TrieDelete(x)(x) operations, opop and opop^{\prime}, respectively. Suppose that 𝑑𝑁𝑜𝑑𝑒\mathit{dNode} is activated before 𝑑𝑁𝑜𝑑𝑒\mathit{dNode}^{\prime}. Once opop^{\prime} performs a successful CAS on the 𝑑𝑁𝑜𝑑𝑒𝑃𝑡𝑟\mathit{dNodePtr} of some binary trie node, then opop does not perform a successful CAS on the 𝑑𝑁𝑜𝑑𝑒𝑃𝑡𝑟\mathit{dNodePtr} of that same binary trie node.

Proof.

Let CC^{\prime} be the configuration immediately after opop^{\prime}’s successful CAS on t.𝑑𝑁𝑜𝑑𝑒𝑃𝑡𝑟t.\mathit{dNodePtr} of some binary trie node tt. This only occurs after 𝑑𝑁𝑜𝑑𝑒\mathit{dNode}^{\prime} is activated by opop^{\prime}. Then in CC^{\prime}, 𝑑𝑁𝑜𝑑𝑒\mathit{dNode} is not the first activated node in latest[x]\textit{latest}[x].

Suppose op performs a CAS on t.dNodePtr in some configuration after C′. Before this CAS, op read that dNode is the first activated node in latest[x] (on line 51 or 55).

This read must occur before C′, since dNode is not the first activated node in latest[x] in C′. Prior to this read, op reads t.dNodePtr (on line 50 or 54). Hence, op's last read of t.dNodePtr also occurred before C′. Then op′ changes the value of t.dNodePtr at C′ to a value different from the one last read by op. So the CAS that op performs on t.dNodePtr will be unsuccessful. ∎

Lemma 4.10.

Let 𝑑𝑁𝑜𝑑𝑒\mathit{dNode} be the DEL node created by a TrieDelete(x)(x) operation opop. Suppose opop changes t.𝑑𝑁𝑜𝑑𝑒𝑃𝑡𝑟t.\mathit{dNodePtr} of some binary trie node tt to point to 𝑑𝑁𝑜𝑑𝑒\mathit{dNode} (due to a successful CAS on line 53 or 57). In the configuration immediately after this CAS, the interpreted bit of tt is 1.

Proof.

Let cas be the successful CAS by which op updates t.dNodePtr to point to dNode. Let C be the configuration immediately after this CAS.

Suppose, for contradiction, that the interpreted bit of tt is 0 in CC. The interpreted bit of tt depends on the first activated update node in latest[x]\textit{latest}[x]. The first activated update node must be a DEL node, otherwise the interpreted bit of tt is 1.

Let dNode′ be the first activated update node in latest[x] in C. The interpreted bit of t is 0 when t.height < dNode′.lower1Boundary and t.height ≤ dNode′.upper0Boundary. By Lemma 4.8, dNode.upper0Boundary = t.height − 1, so dNode′ ≠ dNode. So dNode′ is a DEL node created by a TrieDelete(x) operation op′ ≠ op. Since dNode′ is the first activated update node in latest[x] in C, dNode′ was activated and inserted into latest[x] after dNode.

Since dNode′.upper0Boundary ≥ t.height, op′ performed a successful CAS on t.dNodePtr, updating it to point to dNode′, sometime before C. Lemma 4.9 then implies that cas is unsuccessful, a contradiction. ∎

For a binary trie node tt, recall that UtU_{t} is the set of keys of the leaves of the subtrie rooted at tt. Let the latest update operation with key xx in configuration CC be the update operation that created the first activated update node in latest[x]\textit{latest}[x] in CC. If the first activated update node in latest[x]\textit{latest}[x] is a dummy node, we assume its latest update operation is a completed, dummy TrieDelete(x)(x) operation.

Let op be the latest TrieInsert operation with key x ∈ U_t in configuration C. We say that op has completed iteration t of InsertBinaryTrie if op performed a MinWrite with value t.height on line 33, read that uNode.type = INS on line 29 during iteration t of InsertBinaryTrie, or read that t.height ≥ uNode.lower1Boundary or t.height > uNode.upper0Boundary on line 30 during iteration t of InsertBinaryTrie. Note that if op returns while performing iteration t of InsertBinaryTrie, it does not complete iteration t. We say that op has a potential update to t if op has not yet invoked InsertBinaryTrie, or it has invoked InsertBinaryTrie but has not returned and has not completed iteration t.

Lemma 4.11.

Suppose op is a latest TrieDelete operation that created a DEL node v, and suppose op is in iteration t of DeleteBinaryTrie in a configuration C. Then it is not possible for op to complete iteration t of DeleteBinaryTrie from C if and only if either v.stop = True and there is a possible check to v.stop, v.lower1Boundary ≠ b + 1 and there is a possible check to v.lower1Boundary, or t.dNodePtr has changed since op's last read of it on line 54.

Proof.

Suppose v.stop = True and there is a possible check to v.stop, v.lower1Boundary ≠ b + 1 and there is a possible check to v.lower1Boundary, or t.dNodePtr has changed since op's last read of it on line 54. Then in any continuation from C, op returns no later than its next read of v.stop or v.lower1Boundary or its next CAS on t.dNodePtr.

Suppose v.stop = False or there is no possible check to v.stop, v.lower1Boundary = b + 1 or there is no possible check to v.lower1Boundary, and t.dNodePtr has not changed since op's last read of it on line 54. In op's solo continuation from C, op will complete iteration t. ∎

Observation 4.12.

Suppose an update operation opop with key xx completes iteration tt (of InsertBinaryTrie for TrieInsert operations or DeleteBinaryTrie for TrieDelete operations). Then for each node tt^{\prime} on the path from the parent of the leaf with key xx to tt, opop has completed iteration tt^{\prime}.

Proof.

From the code of InsertBinaryTrie and DeleteBinaryTrie, if op does not complete an iteration, it returns from InsertBinaryTrie or DeleteBinaryTrie. Since op completes iterations in order up the binary trie, starting from the parent of the leaf with key x, op has previously completed an iteration for every binary trie node on the path from the parent of the leaf with key x to t. ∎

Lemma 4.13.

Let dNode be the DEL node created by a TrieDelete operation op with key x. Suppose dNode.upper0Boundary > 0. Then for each binary trie node t such that x ∈ U_t and t.height ≤ dNode.upper0Boundary, op has completed iteration t of DeleteBinaryTrie.

Proof.

By Observation 4.7, only the TrieDelete operation that created dNode writes to dNode.upper0Boundary. From the code of DeleteBinaryTrie, each completed iteration of DeleteBinaryTrie increases dNode.upper0Boundary by 1. By definition, op completes iteration t of DeleteBinaryTrie immediately after it writes t.height to dNode.upper0Boundary. Hence, op has completed iteration t of DeleteBinaryTrie for each node t where x ∈ U_t and t.height ≤ dNode.upper0Boundary. ∎

Lemma 4.14.

Suppose a TrieInsert operation is poised to perform a MinWrite of t.height to dNode.lower1Boundary for some DEL node, dNode, during iteration t of InsertBinaryTrie. Then the TrieDelete operation that created dNode has previously completed iteration t of DeleteBinaryTrie.

Proof.

Let op be the TrieInsert operation poised to perform the MinWrite. Since op is poised to perform the MinWrite, it read that t.height ≤ dNode.upper0Boundary on line 30 of InsertBinaryTrie. By Lemma 4.13, the TrieDelete operation that created dNode has completed iteration t of DeleteBinaryTrie. ∎

Lemma 4.15.

Suppose an internal binary trie node t has interpreted bit 0 in a configuration C, and suppose t.dNodePtr points to a DEL node with key x. Then the latest update operation with key x has completed iteration t of DeleteBinaryTrie.

Proof.

Let op be the latest update operation with key x. Since the interpreted bit of t is 0, the first activated update node in latest[x] is a DEL node, dNode, which was created by op. Furthermore, t.height ≤ dNode.upper0Boundary. By Observation 4.7, only op writes to dNode.upper0Boundary. Since dNode.upper0Boundary ≥ t.height and op completes iteration t of DeleteBinaryTrie immediately after it writes t.height to dNode.upper0Boundary, it follows that op has completed iteration t of DeleteBinaryTrie. ∎

Observation 4.16.

Let iNode be the INS node created by a TrieInsert operation op. Then only op writes to iNode.target.

The following lemma implies that, in all configurations CC and all binary trie nodes tt, either the interpreted bit of tt is accurate or there exists a key xx in UtU_{t} whose latest update operation has a potential update to tt. It implies that property IB1 of the interpreted bits is satisfied.

Lemma 4.17.

If there exists a key in U_t whose latest update operation, iOp, is a TrieInsert operation, and iOp has completed iteration t of InsertBinaryTrie, then t has interpreted bit 1.

Proof.

We prove by induction on the configurations of the execution. In the initial configuration, the latest update operation of every key in UU is a dummy TrieDelete operation. So the lemma is vacuously true.

Suppose the lemma is true in a configuration CC^{\prime} immediately before a step ss by an operation opop, and we show that it is true in the following configuration CC.

  • Suppose ss is a step that activates an INS node in latest[x]\textit{latest}[x]. So opop is now the latest update operation with key xx.

    Let tt be any node in the binary trie where xUtx\in U_{t}. The interpreted bit of tt does not change as a result of ss unless t.𝑑𝑁𝑜𝑑𝑒𝑃𝑡𝑟.key=xt.\mathit{dNodePtr}.key=x. In this case, the first activated update node in latest[x]\textit{latest}[x] is an INS node, so by definition tt has interpreted bit 1 in CC.

  • Suppose s is a step that activates a DEL node, dNode, in latest[x]. So op is now the latest update operation with key x.

    Let t be any internal node in the binary trie where x ∈ U_t. The interpreted bit of t does not change as a result of s unless t.dNodePtr.key = x. Since dNode is a newly activated DEL node, dNode.upper0Boundary = 0. Since t.height ≥ 1, it follows by definition that t has interpreted bit 1 in C.

  • Suppose s is a successful CAS that changes t.dNodePtr from a DEL node with key x′ to a DEL node with key x.

    Only the interpreted bit of tt may change as a result of ss. By Lemma 4.10, the interpreted bit of tt is 1 in CC.

  • Suppose s is a step in which op completes iteration t of InsertBinaryTrie (as a result of a read on line 29 or line 30, or a MinWrite of t.height to dNode.lower1Boundary).

    Let uNode be the update node returned by FindLatest on line 28 during iteration t of op's InsertBinaryTrie prior to s. Let x = uNode.key. By Lemma 5.3, there is a configuration C″ during FindLatest in which uNode is the first activated update node in latest[x]. If t.dNodePtr.key = uNode.key and uNode is still the first activated update node in C, then the interpreted bit of t is 1 after s, and op completes iteration t of InsertBinaryTrie.

    So suppose in CC, the first activated update node in latest[t.𝑑𝑁𝑜𝑑𝑒𝑃𝑡𝑟.key]\textit{latest}[t.\mathit{dNodePtr}.key] is an update node other than 𝑢𝑁𝑜𝑑𝑒\mathit{uNode}. Then a successful CAS has changed t.𝑑𝑁𝑜𝑑𝑒𝑃𝑡𝑟.keyt.\mathit{dNodePtr}.key sometime between C′′C^{\prime\prime} and CC. These steps change the interpreted bit of tt to 1.

  • Suppose s is a write of t.height to dNode.upper0Boundary on line 59 of DeleteBinaryTrie.

    Suppose the lemma holds in CC^{\prime} because some TrieInsert operation opop^{\prime} has completed iteration tt in a previous configuration C′′C^{\prime\prime}. Let C′′′C^{\prime\prime\prime} be the configuration before C′′C^{\prime\prime} in which opop^{\prime} completed iteration tt^{\prime}, where tt^{\prime} is a child of tt. By the induction hypothesis, the interpreted bit of tt is 1 in all configurations from C′′C^{\prime\prime} to CC^{\prime}, and the interpreted bit of tt^{\prime} is 1 in all configurations from C′′′C^{\prime\prime\prime} to CC^{\prime}.

    Since op is poised to perform a write of t.height to dNode.upper0Boundary in C′, op has previously read that t′ has interpreted bit 0 on line 58 of DeleteBinaryTrie, prior to C‴. Furthermore, op has previously performed a successful CAS updating t.dNodePtr to point to dNode. The step s only affects the interpreted bit of t if t.dNodePtr still points to dNode and dNode is the first activated update node in its latest list. So when op′ completed iteration t, it either saw t.height ≥ dNode.lower1Boundary or it performed a MinWrite of t.height to dNode.lower1Boundary. In either case, t.height ≥ dNode.lower1Boundary in C, and hence the interpreted bit of t is 1 in C. ∎

We now focus on showing that property IB0 of the interpreted bits is satisfied. Let op be the latest TrieDelete operation with key x ∈ U_t in configuration C. Let v be the DEL node created by op. We say that op has completed iteration t of DeleteBinaryTrie if op performed the write with value t.height to v.upper0Boundary on line 59 of DeleteBinaryTrie.

We say that op is flagged if v.stop = True or v.lower1Boundary ≠ b + 1. We say that op has a potential update to t in configuration C if op is not flagged in C and there exists an execution from C in which op writes the value t.height to v.upper0Boundary on line 59 of DeleteBinaryTrie while t.dNodePtr points to v. By definition, a dummy TrieDelete operation does not have a potential update to any binary trie node in any configuration.

Consider a configuration C and a binary trie node t in which all the latest update operations with keys in U_t are TrieDelete operations. Let D(t,C) denote the earliest configuration before C in which all of these operations have been activated. Let OP(t,C) be the set of latest TrieDelete operations with keys in U_t that have a potential update to t in D(t,C). Note that in the step immediately before D(t,C), a TrieDelete operation with key in U_t activates its DEL node, becoming a latest TrieDelete operation. This operation has a potential update to t in D(t,C), so OP(t,C) is non-empty.

Lemma 4.18.

Consider a configuration CC and a binary trie node tt such that the latest update operations for all keys in UtU_{t} are TrieDelete operations. Operations in OP(t,C)OP(t,C) do not become flagged in any step between D(t,C)D(t,C) and CC by an operation with a key in UtU_{t}.

Proof.

Suppose, for contradiction, that an operation op ∈ OP(t,C) is flagged by a step s of an operation op′ with key x′ ∈ U_t, where s is between D(t,C) and C. Let v and v′ be the update nodes created by op and op′, respectively. Let x be the key of op.

Suppose op is flagged because op′ performs a MinWrite to v.lower1Boundary, so op′ is a TrieInsert(x′) operation. Prior to this MinWrite, op′ reads that its update node v′ is the first activated update node in latest[x′] on line 32, which is before D(t,C). It also sets v′.target = v on line 31. Therefore, before the latest TrieDelete(x′) operation invokes DeleteBinaryTrie, it reads v′.target = v and sets v.stop = True. Since op is flagged before D(t,C), op does not have a potential update to t in D(t,C). So op ∉ OP(t,C), a contradiction.

Suppose op is flagged because op′ writes v.stop = True. Then op′ read iNode.target = v, where iNode is the INS node of some TrieInsert(x′) operation with x′ ∈ U_t. Then op′ is a TrieDelete operation invoked after op, so op′ ∈ OP(t,C). But op′ writes v.stop = True before it invokes DeleteBinaryTrie, which is before D(t,C). Since op is flagged before D(t,C), op does not have a potential update to t in D(t,C). So op ∉ OP(t,C), a contradiction. ∎

The next lemma implies that property IB0 is satisfied by the interpreted bits.

Lemma 4.19.

Consider any configuration CC and binary trie node tt. If the latest update operation for all keys in UtU_{t} in CC are TrieDelete operations and none of these operations have a potential update to tt in CC, then tt has interpreted bit 0 in CC and t.𝑑𝑁𝑜𝑑𝑒𝑃𝑡𝑟t.\mathit{dNodePtr} points to a DEL node created by an operation in OP(t,C)OP(t,C).

Proof.

The proof is by induction on the configurations of the execution and the height of binary trie nodes.

In the initial configuration, the latest update operation of every key in UU is a dummy TrieDelete operation and each latest list points to a dummy update node. The fields of the dummy update nodes are initialized so that the interpreted bits of all binary trie nodes are 0. Since the dummy operations do not have a potential update to any binary trie node, the claim is true in the initial configuration.

Consider any other reachable configuration CC. Let CC^{\prime} be the configuration immediately before CC in some execution and let ss be the step performed in CC^{\prime} that results in CC. We assume that the claim holds for all binary trie nodes in CC^{\prime}. Suppose ss is performed by operation opop with key xUtx\in U_{t}.

Consider a leaf t of the binary trie with key x. No operation has a potential update to t in any configuration. If the latest update operation with key x is a TrieDelete operation in C, then the first activated update node in latest[x] is a DEL node. By definition, the interpreted bit of t is 0.

Now suppose t is the parent of a leaf. Let op_ℓ and op_r be the latest TrieDelete operations with keys x and x + 1 in C. Since op_ℓ and op_r have not yet completed any iterations of DeleteBinaryTrie, they are not flagged. Non-latest update operations with keys in {x, x+1} in C can only cause operations in OP(t,C) to perform one unsuccessful CAS. Since operations in OP(t,C) perform at least two CASs during iteration t of DeleteBinaryTrie, at least one operation in OP(t,C) will perform a successful CAS.

Now suppose tt is an internal binary trie node that is not the parent of a leaf. We assume the claim is true in CC for all binary trie nodes that are proper descendants of tt.

By definition, step ss can change the interpreted bit of tt if and only if it is a successful CAS that changes t.𝑑𝑁𝑜𝑑𝑒𝑃𝑡𝑟t.\mathit{dNodePtr}, a write to v.lower1Boundaryv.\mathit{lower1Boundary} or v.upper0Boundaryv.\mathit{upper0Boundary}, where vv is the first activated DEL node in latest[t.𝑑𝑁𝑜𝑑𝑒𝑃𝑡𝑟.𝑘𝑒𝑦]\textit{latest}[t.\mathit{dNodePtr}.\mathit{key}], or a successful CAS that activates a new update node in latest[x]\textit{latest}[x], where x=t.𝑑𝑁𝑜𝑑𝑒𝑃𝑡𝑟.𝑘𝑒𝑦x=t.\mathit{dNodePtr}.\mathit{key}.

Step s can flag an operation if and only if it is a write to v.stop or v.lower1Boundary for some DEL node v. Such a step may also result in the operation that created v no longer having a potential update to t, since that operation is guaranteed to return the next time it reads v.stop or v.lower1Boundary.

Step s can cause op to return from DeleteBinaryTrie if s reads v.stop = True or v.lower1Boundary ≠ b + 1, if s results in FirstActivated(v) returning False, or if s is an unsuccessful CAS on t.dNodePtr.

When s is a step where op returns from DeleteBinaryTrie because InterpretedBit on a child t′ of t returns 1, op may no longer have a potential update to t in C.

  • Suppose s is a step that activates an INS node in latest[x], where x = t.dNodePtr.key. In this case, op is a TrieInsert operation, so op is now the latest update operation with key x. Since the latest update operations for all keys in U_t are no longer all TrieDelete operations, the claim is vacuously true for t in C.

  • Suppose ss is a step that activates a DEL node vv in latest[x]\textit{latest}[x], where x=t.𝑑𝑁𝑜𝑑𝑒𝑃𝑡𝑟.keyx=t.\mathit{dNodePtr}.key. Since vv is newly activated, v.lower1Boundary=b+1v.\mathit{lower1Boundary}=b+1 and v.stop=Falsev.\textit{stop}=\textsc{False}. Then the latest update operation with key xx is non-flagged and has a potential update to tt. So the claim is vacuously true for tt in CC.

  • Suppose ss is a step where opop returns because it reads that its update node vv is not the first activated update node on line 51 or on line 55 during iteration tt of DeleteBinaryTrie.

    Then prior to s, a TrieInsert(x) operation has inserted an activated INS node into latest[x] sometime after v was inserted into latest[x]. So op is not the latest update operation with key x in C′.

    By induction hypothesis, since the claim is true for tt in CC^{\prime}, it is also true for tt in CC.

  • Suppose s is a step where op returns because it reads v.stop = True or v.lower1Boundary ≠ b + 1 on line 52 or line 56 during iteration t of DeleteBinaryTrie.

    By definition, opop is flagged in CC^{\prime} and CC and does not have a potential update to tt in CC^{\prime} and CC. Hence, the claim is vacuously true for tt in CC^{\prime} and CC.

  • Suppose ss is the second unsuccessful CAS performed by opop during iteration tt of DeleteBinaryTrie.

    Then t.𝑑𝑁𝑜𝑑𝑒𝑃𝑡𝑟t.\mathit{dNodePtr} has changed since opop’s last read of t.𝑑𝑁𝑜𝑑𝑒𝑃𝑡𝑟t.\mathit{dNodePtr}. By definition, opop does not have a potential update to tt in CC^{\prime} and CC. Hence, the claim is vacuously true for tt in CC^{\prime} and CC.

  • Suppose ss is a successful CAS that changes t.𝑑𝑁𝑜𝑑𝑒𝑃𝑡𝑟t.\mathit{dNodePtr}.

    If there are still latest update operations with keys in UtU_{t} that have a potential update to tt in CC, the claim is vacuously true for tt in CC. So in CC, suppose no latest update operations have a potential update to tt. Note that if there is a latest update operation with a key in UtU_{t} with a potential update to tt in CC^{\prime} that is in iteration tt^{\prime} of DeleteBinaryTrie, where tt^{\prime} is a descendant of tt, then this operation will still have a potential update to tt in CC. So all latest update operations with keys in UtU_{t} are in iteration tt of DeleteBinaryTrie. Furthermore, these operations either already performed a successful CAS during iteration tt of DeleteBinaryTrie or have read t.𝑑𝑁𝑜𝑑𝑒𝑃𝑡𝑟t.\mathit{dNodePtr} on line 54 but not yet attempted their second CAS during iteration tt of DeleteBinaryTrie.

    Suppose that in CC^{\prime}, opop is the latest update operation with key xx and opop is not flagged. By the induction hypothesis, the children of tt in CC^{\prime} have interpreted bit 0. So in opop’s solo continuation in CC, opop completes iteration tt. Then opop has a potential update to tt in CC. So the claim is vacuously true for tt in CC.

    Suppose that op is not the latest update operation with key x in C′. Prior to performing s, op reads t.dNodePtr and reads that FirstActivated(v) = True on line 51 or line 55. So there is a latest TrieDelete(x) operation op′ that activates a DEL node in latest[x] sometime after op reads that FirstActivated(v) = True. So D(t,C) occurs after op reads t.dNodePtr.

    Let t_ℓ and t_r be the left and right children of t, respectively. Note that both D(t_ℓ,C) and D(t_r,C) occur at or before D(t,C). By the induction hypothesis, t_ℓ and t_r have interpreted bit 0 in C′. Without loss of generality, suppose t_ℓ is the last child of t to have its dNodePtr changed before C′, and suppose t_ℓ.dNodePtr points to a DEL node v_ℓ created by op_ℓ. Since t_ℓ has interpreted bit 0, v_ℓ.upper0Boundary ≥ t_ℓ.height. So op_ℓ completes iteration t_ℓ sometime after D(t,C). By Lemma 4.18, op_ℓ is not flagged in C′. It follows that op_ℓ may only return from iteration t of DeleteBinaryTrie due to an unsuccessful CAS. But there are no changes to t.dNodePtr from D(t,C) to C′. Furthermore, op_ℓ does not perform a successful CAS on t.dNodePtr, otherwise s would be unsuccessful. It follows that op_ℓ has a potential update to t in C, a contradiction.

    Suppose that in CC^{\prime}, opop is the latest update operation with key xx and opop is flagged in CC^{\prime}. Prior to opop performing ss, opop reads t.𝑑𝑁𝑜𝑑𝑒𝑃𝑡𝑟t.\mathit{dNodePtr} and reads that it is not flagged on line 52 or line 56. Lemma 4.18 implies opop is flagged sometime before D(t,C)D(t,C).

  • Suppose s is a write on line 183 of TrieDelete that sets v.stop to True for some DEL node v with key x. Let op′ be the creator of v.

    Suppose opop^{\prime} is already flagged in CC^{\prime}. Then since the claim is true for tt in CC^{\prime}, it is also true for tt in CC.

    Suppose opop^{\prime} is not already flagged in CC^{\prime}, so opop^{\prime} becomes flagged in CC as a result of ss. The step ss performed by opop occurs before opop invokes DeleteBinaryTrie. So opop has a potential update to all binary trie nodes tt where xUtx\in U_{t}. By Observation, opop is not a flagged operation. So the claim is vacuously true for tt in CC.

  • Suppose s is a MinWrite of t.height to dNode.lower1Boundary on line 33 of InsertBinaryTrie. Let x′ = dNode.key.

    Suppose the TrieDelete operation that created dNode is already flagged in C′. Then since the claim is true for t in C′, it is also true for t in C.

    Suppose the TrieDelete operation that created dNode is not already flagged in C′, so it becomes flagged in C as a result of s. Prior to s, op must set v.target to dNode, where v is the INS node created by op. If op is the latest update operation with key x′, then there is a latest update operation for a key in U_t which is a TrieInsert operation, and hence the claim is vacuously true for t in C.

    Otherwise, some TrieDelete(x′) operation op′ becomes the latest update operation with key x′ sometime after op reads that v is the first activated node on line 32, and hence after op sets v.target to dNode. Since the TrieDelete operation that created dNode is not flagged in C′, op′ has not yet set dNode.stop = True, and hence has not yet invoked DeleteBinaryTrie. So by definition, op′ has a potential update to t, where x′ ∈ U_t. So the claim is vacuously true for t in C.

  • Suppose ss is the step where opop returns from DeleteBinaryTrie because InterpretedBit(t.𝑙𝑒𝑓𝑡)(t.\mathit{left}) or InterpretedBit(t.𝑟𝑖𝑔ℎ𝑡)(t.\mathit{right}) returns 1.

    The induction hypothesis for the children of t implies that either there is a latest TrieInsert operation with a key in U_t, or there is a TrieDelete operation with a potential update to a child of t sometime during op's execution of iteration t of DeleteBinaryTrie.

  • Suppose s is the step in which op completes iteration t of DeleteBinaryTrie by writing t.height to v.upper0Boundary on line 59 of DeleteBinaryTrie.

    If opop is not the only non-flagged, latest update operation with a potential update to tt in CC^{\prime}, then the claim is vacuously true for tt in CC^{\prime} and CC.

    So op is the only non-flagged, latest update operation with a potential update to t in C′. Since there are no non-flagged, latest update operations with a potential update to t in C, we need to show that the interpreted bit of t is 0 in C. Since op is non-flagged, v.stop = False and v.lower1Boundary = b + 1. After op performs s, v.upper0Boundary = t.height. Furthermore, t.dNodePtr points to v. By definition, the interpreted bit of t is 0 in C. Since op had a potential update to t in all configurations from when it was invoked to C, it follows that op ∈ OP(t,C).

In all cases, the claim is true for tt in CC. Therefore, it is true for all binary trie nodes in CC. ∎

4.5.3 Correctness of RelaxedPredecessor

In this section, we prove that the output of RelaxedPredecessor satisfies the specification outlined in Section 4.1. Let pOp be a completed instance of RelaxedPredecessor(y). Let k be the largest key less than y that is completely present throughout pOp, or −1 if no such key exists.

Lemma 4.20.

In all configurations during pOppOp, for each binary trie node tt on the path from the leaf with key kk to the root, tt has interpreted bit 1.

Proof.

Since kk is completely present throughout pOppOp, for each binary trie node tt on the path from the leaf with key kk to the root, kUtSk\in U_{t}\cap S in all configurations during pOppOp. Moreover, the last SS-modifying TrieInsert(k)(k) operation linearized prior to the end of pOppOp is not concurrent with pOppOp. Hence, in all configurations during pOppOp, the last SS-modifying TrieInsert(k)(k) operation is not active. So, by Property IB1, tt has interpreted bit 1 in all configurations during pOppOp. ∎

The next two lemmas prove that the specification of RelaxedPredecessor is satisfied.

Lemma 4.21.

Suppose pOppOp returns a key xUx\in U. Then kx<yk\leq x<y and xSx\in S in some configuration during pOppOp.

Proof.

Since pOp returns x, it read that the leaf ℓ with key x has interpreted bit 1. In particular, pOp performs an instance of InterpretedBit(ℓ) that returns 1. This means that the first activated update node in latest[x] returned by FindLatest(x) on line 11 is an INS node. By Lemma 5.3, there is a configuration during this instance of FindLatest(x) in which x ∈ S. Hence x ∈ S in some configuration during pOp.

Since pOppOp begins its upward traversal starting at the leaf with key yy and then performs a downward traversal starting from the left child of a binary trie node tt on this path and ending at the leaf with key xx, it follows that x<yx<y.

Consider the path of binary trie nodes from the leaf with key yy to t.𝑟𝑖𝑔ℎ𝑡t.\mathit{right}. Any node that is not on this path and is a left child of a node on this path has interpreted bit 0 when encountered by pOppOp during its upward traversal. By Lemma 4.20, each node on the path from the leaf with key kk to tt has interpreted bit 1. If kk is in the left subtrie of a proper ancestor of tt, then k<xk<x. Otherwise kk is in the left subtrie of tt. Since pOppOp traverses the right-most path of binary trie nodes with interpreted bit 1 starting from t.𝑙𝑒𝑓𝑡t.\mathit{left}, pOppOp reaches a leaf with key at least kk. Therefore, kxk\leq x. ∎

Lemma 4.22.

Suppose that, for all k<x<yk<x<y, there is no SS-modifying update operation with key xx that is linearized during pOppOp. Then pOppOp returns kk.

Proof.

Assume that, for all k<x<yk<x<y, the SS-modifying update operation with key xx that was last linearized prior to the end of pOppOp is not concurrent with pOppOp. We will prove that pOppOp returns kk\neq\bot.

By definition of kk, there are no keys greater than kk and less than yy that are completely present throughout pOppOp. By assumption, it follows that, throughout pOppOp, there are no keys in SS that are greater than kk and smaller than yy.

First suppose k=1k=-1. Recall that pOppOp begins by traversing up the relaxed binary trie starting from the leaf with key yy. Consider any node on this path whose left child tt is not on this path. Every key in UtU_{t} is less than yy, so UtS=U_{t}\cap S=\emptyset. By Property IB0, tt has interpreted bit 0 in all configurations during pOppOp. It follows that the while-loop on line 62 always evaluates to True. So pOppOp eventually reaches the root, and returns 1-1 on line 65.

Now suppose k ∈ U. Consider any binary trie node t such that k < min U_t ≤ max U_t < y. Since U_t ∩ S = ∅ throughout pOp, it follows from Property IB0 that t has interpreted bit 0 in all configurations during pOp.

Let t be the lowest common ancestor of the leaf with key k and the leaf with key y. Then the leaf with key k is in the subtree rooted at t.left and the leaf with key y is in the subtree rooted at t.right. Consider any node on the path from the leaf with key y to t.right whose left child t′ is not on this path. Note that U_{t′} ∩ S = ∅ throughout pOp, since k < min U_{t′} ≤ max U_{t′} < y. By Property IB0, t′ has interpreted bit 0 in all configurations during pOp. It follows from Lemma 4.20 that pOp reaches t and traverses down the right-most path of binary trie nodes with interpreted bit 1 starting from t.left.

Consider any node on the path from the leaf with key k to t.left whose right child t′ is not on this path. Note that U_{t′} ∩ S = ∅ throughout pOp, since k < min U_{t′} ≤ max U_{t′} < y. By Property IB0, t′ has interpreted bit 0 in all configurations during pOp. By Lemma 4.20, each binary trie node on the path from the leaf with key k to t has interpreted bit 1 throughout pOp, so pOp reaches the leaf with key k and returns k. ∎

5 Lock-free Binary Trie

In this section, we give the full implementation of the lock-free binary trie, which uses the relaxed binary trie as one of its components. This implementation supports a linearizable Predecessor operation, unlike the RelaxedPredecessor operation of the relaxed binary trie.

In Section 5.1, we give the high-level description of our algorithms. We then describe the algorithms in detail and present the pseudocode in Section 5.2. We then prove that the implementation is linearizable in Section 5.3.

5.1 High-level Algorithm Description

One component of our lock-free binary trie is a relaxed binary trie described in Section 4. The other components are linked lists, which enable Insert, Delete, and Predecessor operations to help one another make progress.

To ensure update operations are announced at the same time as they are linearized, each update node has a status. It is initially inactive and can later change to active when the operation that created it is linearized. The array entry latest[x]\textit{latest}[x] is modified to point to a linked list of update nodes of length at most 2. When an update node is added to this list, it is inactive and it is only added to the beginning of the list. Every latest list contains at least 1 activated update node and only its first update node can be inactive. The sequence of update nodes pointed to by latest[x]\textit{latest}[x] in an execution is the history of SS-modifying TrieInsert(x)(x) and TrieDelete(x)(x) operations performed. So the types of the update nodes added to latest[x]\textit{latest}[x] alternate between INS and DEL. The first activated update node in latest[x]\textit{latest}[x] is an INS node if and only if xSx\in S.
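
As a concrete illustration of how membership in S is read off a latest list, the following is a minimal C++ sketch. It mirrors the FindLatest helper described in Section 5.2.1; the struct layouts themselves are illustrative assumptions.

#include <atomic>
#include <cstdint>

enum class Type { INS, DEL };
enum class Status { Inactive, Active, Stale };

struct UpdateNode {
    std::int64_t key;
    Type type;
    std::atomic<Status> status{Status::Inactive};
    std::atomic<UpdateNode *> latestNext{nullptr};  // next node in latest[key]
};

// latest[x]: head of a list of length at most 2; only the first node may be
// inactive, and the list always contains at least one activated node
// (initially a dummy DEL node).
struct LatestList {
    std::atomic<UpdateNode *> head;
};

// First activated update node in latest[x]: if the head is inactive, the node
// behind it must be activated; if the head has no successor, the head itself
// was activated in the meantime.
UpdateNode *firstActivated(LatestList &latestX) {
    UpdateNode *l = latestX.head.load();
    if (l->status.load() == Status::Inactive) {
        UpdateNode *m = l->latestNext.load();
        if (m != nullptr) return m;
    }
    return l;
}

// Search(x): x is in S iff the first activated update node is an INS node.
bool search(LatestList &latestX) {
    return firstActivated(latestX)->type == Type::INS;
}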

The update announcement linked list, called the U-ALL, is a lock-free linked list of update nodes sorted by key. An update operation can add an inactive update node to the U-ALL; the node is added after every update node with the same key. The operation can then announce itself by activating its update node. Just before the update operation completes, it removes its update node from the U-ALL. Whenever update nodes are added to or removed from the U-ALL, we additionally modify a lock-free linked list called the reverse update announcement linked list, or RU-ALL. It contains a copy of all update nodes in the U-ALL, except that it is sorted by keys in descending order and then by the order in which they were added. For simplicity, we assume that both the U-ALL and RU-ALL contain two sentinel nodes with keys ∞ and −∞. So U-ALL.head always points to the sentinel node with key ∞ and RU-ALL.head always points to the sentinel node with key −∞.

The predecessor announcement linked list, or the P-ALL, is an unsorted lock-free linked list of predecessor nodes. Each predecessor node contains a key and an insert-only linked list, called its 𝑛𝑜𝑡𝑖𝑓𝑦𝐿𝑖𝑠𝑡\mathit{notifyList}. An update operation can notify a predecessor operation by adding a notify node to the beginning of the 𝑛𝑜𝑡𝑖𝑓𝑦𝐿𝑖𝑠𝑡\mathit{notifyList} of the predecessor operation’s predecessor node. Each Predecessor operation begins by creating a predecessor node and then announces itself by adding this predecessor node to the beginning of the P-ALL. Just before a Predecessor operation completes, it removes its predecessor node from the P-ALL.

Figure 4 shows an example of the data structure for U={0,1,2,3}U=\{0,1,2,3\}. White circles represent nodes of the relaxed binary trie. Blue rectangles represent activated INS nodes, red rectangles represent activated DEL nodes, and light red rectangles represent inactive DEL nodes. Yellow diamonds represent predecessor nodes. This example depicts 5 concurrent operations, Insert(0)(0), Insert(1)(1), Delete(3)(3), and two Predecessor operations. The data structure represents the set S={0,1,3}S=\{0,1,3\} because the first activated update node in each of latest[0]\textit{latest}[0], latest[1]\textit{latest}[1], and latest[3]\textit{latest}[3] is an INS node.

Figure 4: An example of the lock-free binary trie representing S={0,1,3}S=\{0,1,3\}.

5.1.1 Search Operations

A Search(x)(x) operation finds the first activated update node in latest[x]\textit{latest}[x], returns True if it is an INS node, and returns False if it is a DEL node.

5.1.2 Insert and Delete Operations

An Insert(x)(x) or Delete(x)(x) operation, uOpuOp, is similar to an update operation of the relaxed binary trie, with a few modifications.

Rather than changing latest[x]\textit{latest}[x] to point to its own inactive update node 𝑢𝑁𝑜𝑑𝑒\mathit{uNode}, uOpuOp instead attempts to add 𝑢𝑁𝑜𝑑𝑒\mathit{uNode} to the beginning of latest[x]\textit{latest}[x]. If successful, 𝑢𝑁𝑜𝑑𝑒\mathit{uNode} is then added to the U-ALL and RU-ALL. Next, uOpuOp changes the status of 𝑢𝑁𝑜𝑑𝑒\mathit{uNode} from inactive to active, which announces uOpuOp. Additionally, uOpuOp is linearized at this step. Any other update nodes in latest[x]\textit{latest}[x] are then removed to keep the length of latest[x]\textit{latest}[x] at most 2. If multiple update operations with key xx concurrently attempt to add an update node to the beginning of latest[x]\textit{latest}[x], exactly one will succeed. Update operations that are unsuccessful instead help the update operation that succeeded until it is linearized. Inserting into the U-ALL and RU-ALL has an amortized cost of O(c˙(op))O(\dot{c}(op)) because their lengths are at most c˙(op)\dot{c}(op).
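
The following is a hedged C++ sketch of just the first of these steps: CASing an inactive update node onto the front of latest[x] and then activating it. The insertion into the U-ALL and RU-ALL, the helping performed by operations whose CAS fails, and the trimming of latest[x] back to length at most 2 are all elided, and every name below is an illustrative assumption.

#include <atomic>
#include <cstdint>

enum class Type { INS, DEL };
enum class Status { Inactive, Active, Stale };

struct UpdateNode {
    std::int64_t key;
    Type type;
    std::atomic<Status> status{Status::Inactive};
    std::atomic<UpdateNode *> latestNext{nullptr};
};

struct LatestList {
    std::atomic<UpdateNode *> head;
};

// The S-modifying step of an Insert(x) or Delete(x).
bool tryAddToLatest(LatestList &latestX, UpdateNode *uNode) {
    UpdateNode *old = latestX.head.load();
    uNode->latestNext.store(old);                    // link behind the old head
    if (!latestX.head.compare_exchange_strong(old, uNode))
        return false;                                // another update won; help it
    // ... add uNode to the U-ALL and RU-ALL here ...
    uNode->status.store(Status::Active);             // announce and linearize
    // ... then remove older nodes so latest[x] has length at most 2 ...
    return true;
}

Activating uNode only after it is present in the U-ALL and RU-ALL is what allows the operation to be announced at the same step at which it is linearized.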

Another modification is to notify predecessor operations after the relaxed binary trie is updated. For each predecessor node 𝑝𝑁𝑜𝑑𝑒\mathit{pNode} in the P-ALL, uOpuOp creates a notify node containing information about its update node and adds it to the beginning of 𝑝𝑁𝑜𝑑𝑒\mathit{pNode}’s 𝑛𝑜𝑡𝑖𝑓𝑦𝐿𝑖𝑠𝑡\mathit{notifyList}, provided the update node created by uOpuOp is still the first activated update node in latest[x]\textit{latest}[x]. After notifying the predecessor operations announced in the P-ALL, uOpuOp removes its update node from the U-ALL and RU-ALL before returning.
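
The notification step admits a similar sketch: for each predecessor node encountered in the P-ALL, the update operation prepends a notify node to that predecessor node's notifyList with a CAS loop, checking between attempts that its update node is still the first activated one for its key. As before, the types and the firstActivated helper (corresponding to FirstActivated in Section 5.2.1) are assumptions for illustration.

#include <atomic>
#include <cstdint>

struct UpdateNode;   // update-node type, as in the earlier sketches

struct NotifyNode {
    std::int64_t key;
    UpdateNode *updateNode;
    NotifyNode *next;        // the notify list is prepend-only
};

struct PredecessorNode {
    std::int64_t key;
    std::atomic<NotifyNode *> notifyList{nullptr};
};

// Assumed helper: is uNode still the first activated update node for its key?
bool firstActivated(UpdateNode *uNode);

// Prepend a notify node describing uNode to pNode's notifyList, giving up
// if uNode is no longer the first activated update node for its key.
void notifyOne(PredecessorNode *pNode, UpdateNode *uNode, std::int64_t key) {
    NotifyNode *n = new NotifyNode{key, uNode, nullptr};
    NotifyNode *head = pNode->notifyList.load();
    do {
        if (!firstActivated(uNode)) { delete n; return; }  // update is stale
        n->next = head;
    } while (!pNode->notifyList.compare_exchange_weak(head, n));
}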

There are at most c˙(uOp)\dot{c}(uOp) predecessor nodes in the P-ALL when uOpuOp is invoked. Adding a notify node to the beginning of a 𝑛𝑜𝑡𝑖𝑓𝑦𝐿𝑖𝑠𝑡\mathit{notifyList} has an amortized cost of O(c˙(uOp))O(\dot{c}(uOp)). So the total amortized cost charged to uOpuOp for notifying these predecessor nodes is O(c˙(uOp)2)O(\dot{c}(uOp)^{2}). If a predecessor node 𝑝𝑁𝑜𝑑𝑒\mathit{pNode} is added to the P-ALL after the start of uOpuOp, the predecessor operation pOppOp that created 𝑝𝑁𝑜𝑑𝑒\mathit{pNode} pays the amortized cost of notifying 𝑝𝑁𝑜𝑑𝑒\mathit{pNode} on uOpuOp’s behalf. Since there are O(c˙(pOp))O(\dot{c}(pOp)) update operations concurrent with pOppOp when it is invoked, the total amortized cost charged to pOppOp is O(c˙(pOp)2)O(\dot{c}(pOp)^{2}).

When uOp is a Delete(x) operation, it also performs two embedded Predecessor(x) operations, one just before uOp is announced and one just before uOp begins to update the relaxed binary trie. The announcements of these embedded Predecessor operations remain in the P-ALL until just before uOp returns. Pointers to the predecessor nodes of these embedded Predecessor(x) operations and their return values are stored in uNode. This information is used by other Predecessor operations.

5.1.3 Predecessor Operations

A Predecessor(y) operation, pOp, begins by adding a predecessor node to the beginning of the P-ALL so that it can be notified by update operations. It then traverses the P-ALL to determine the embedded Predecessor operations belonging to Delete operations that have not yet completed. It then traverses the RU-ALL to identify DEL nodes corresponding to Delete operations that may have been linearized before the start of pOp.

Following this, pOppOp determines a number of candidate return values, which are keys in SS sometime during pOppOp, and returns the largest of these. The exact properties satisfied by the candidate return values are stated in Section 5.3.3. Some candidate return values are determined from a traversal of the relaxed binary trie, a traversal of the U-ALL, and a traversal of its notify list. The linearization point of pOppOp depends on when pOppOp encountered the value it eventually returns. The amortized cost for pOppOp to perform these traversals is O(c¯(pOp))=O(c˙(pOp))O(\bar{c}(pOp))=O(\dot{c}(pOp)).

When pOppOp encounters an update node with key x<yx<y during its traversal of the U-ALL or its notify list, it verifies that the node is the first activated update node in latest[x]\textit{latest}[x]. If a verified update node is an INS node, then xSx\in S sometime during pOppOp and xx is a candidate return value. If it is a DEL node and the Delete(x)(x) operation that created it is linearized during pOppOp, then xSx\in S immediately before this linearization point and xx is a candidate return value. The traversal of the RU-ALL is used to identify Delete operations that may have been linearized before the start of pOppOp. The keys of these operations are not added to the set of candidate return values during pOppOp’s traversal of the U-ALL and its notify list.

If the result returned by traversing the relaxed binary trie using RelaxedPredecessor(y)(y) is not \bot, then it is a candidate return value. Now suppose RelaxedPredecessor(y)(y) returns \bot. Let kk be the largest key less than yy that is completely present throughout pOppOp’s traversal of the relaxed binary trie, or 1-1 if no such key exists. By Lemma 4.22, there is an SS-modifying update operation uOpuOp with key xx, where k<x<yk<x<y, whose update to the relaxed binary trie is concurrent with pOppOp’s traversal of the relaxed binary trie. The update node created by uOpuOp is encountered by pOppOp either in the U-ALL or in its notify list. This is because either pOppOp will traverse the U-ALL before uOpuOp can remove its update node from the U-ALL, or uOpuOp will notify pOppOp before pOppOp removes its predecessor node from the P-ALL. Unless uOpuOp is a Delete(x)(x) operation that may have been linearized before the start of pOppOp, xx is a candidate return value.

Now suppose uOpuOp is a Delete(x)(x) operation linearized before the start of pOppOp. For simplicity, suppose uOpuOp is the only update operation concurrent with pOppOp. Since uOpuOp is concurrent with pOppOp’s traversal of the relaxed binary trie, its DEL node is the only update node that pOppOp encounters during its traversal of the RU-ALL. Let pOppOp^{\prime} be the first embedded Predecessor(x)(x) of uOpuOp, which was completed before uOpuOp was announced in the RU-ALL. The result returned by pOppOp^{\prime} may be added as a candidate return value for pOppOp. In addition, pOppOp traverses the notify list of pOppOp^{\prime} to possibly obtain other candidate return values. Note that kk is the predecessor of yy throughout pOppOp, so pOppOp must return kk. Let iOpiOp be the completed Insert(k)(k) operation that last added kk to SS prior to the start of pOppOp. First, suppose iOpiOp is linearized after pOppOp^{\prime} was announced. Then iOpiOp will notify pOppOp^{\prime} because iOpiOp is completed before the start of pOppOp. When pOppOp traverses the notify list of pOppOp^{\prime}, it adds kk to its set of candidate return values.

Now suppose iOpiOp is linearized before pOppOp^{\prime} was announced. Then kSk\in S throughout pOppOp^{\prime}, and pOppOp^{\prime} returns a value kk^{\prime} where kk<xk\leq k^{\prime}<x. If k=kk^{\prime}=k, then kk is added to pOppOp’s candidate return values. If kkk^{\prime}\neq k, then kk^{\prime} is removed from SS by a Delete(k)(k^{\prime}) operation prior to the start of pOppOp because kk is the predecessor of yy at the start of pOppOp. More generally, consider any Delete operation, dOpdOp, with key strictly between kk and yy that is linearized after pOppOp^{\prime} is announced and is completed before the start of pOppOp. In particular, dOpdOp’s second embedded Predecessor is completed before the start of pOppOp and returns a value which is at least kk. Before dOpdOp completed, it notified pOppOp^{\prime} of this result. We prove that there is a Delete operation that notifies pOppOp^{\prime} and whose second embedded Predecessor returns a value exactly kk. By traversing the notify list of pOppOp^{\prime}, pOppOp can determine the largest key less than yy that is in SS at the start of pOppOp. So kk is added to pOppOp’s set of candidate return values. The cost for pOppOp to traverse the notify list of pOppOp^{\prime} is O(c~(pOp))O(\tilde{c}(pOp)).

5.2 Detailed Algorithm and Pseudocode

In this section, we present a detailed description of Insert, Delete and Predecessor, as well as present their pseudocode.

Figure 5 gives a summary of the fields used by each type of node, and classifies each field as immutable, update-once, or mutable. An immutable field is set when the node is initialized and is never changed. A mutable field may change its value an arbitrary number of times. The possible transitions of the remaining fields are specified in the figure.

78:Update Node
79:     𝑘𝑒𝑦\mathit{key} (Immutable) \triangleright A key in UU
80:     𝑡𝑦𝑝𝑒\mathit{type} (Immutable) \triangleright Either INS or DEL
81:     𝑠𝑡𝑎𝑡𝑢𝑠\mathit{status} (From Inactive to Active to Stale) \triangleright One of {Inactive,Active,Stale}\{\textsc{Inactive},\textsc{Active},\textsc{Stale}\}
82:     𝑙𝑎𝑡𝑒𝑠𝑡𝑁𝑒𝑥𝑡\mathit{latestNext} (Initialized to a pointer to an update node. Changes once to \bot)
83:     target (Mutable, initially \bot) \triangleright pointer to update node
84:     stop (From 0 to 1) \triangleright (Boolean value)
85:     completed (From 0 to 1) \triangleright (Boolean value)
86:     \triangleright Additional fields when 𝑡𝑦𝑝𝑒=DEL\mathit{type}=\text{DEL}
87:     upper0Boundary\mathit{upper0Boundary} (Mutable, initially 0) \triangleright An integer in {0,,b}\{0,\dots,b\}
88:     lower1Boundary\mathit{lower1Boundary} (Mutable min-register, initially b+1b+1) \triangleright An integer in {0,,b+1}\{0,\dots,b+1\}
89:     𝑑𝑒𝑙𝑃𝑟𝑒𝑑𝑁𝑜𝑑𝑒\mathit{delPredNode} (Immutable) \triangleright A pointer to a predecessor node
90:     𝑑𝑒𝑙𝑃𝑟𝑒𝑑\mathit{delPred} (Immutable) \triangleright A key in UU
91:     delPred2\mathit{delPred2} (From \bot to a key in UU) \triangleright A key in UU
92:Predecessor Node of P-ALL
93:     𝑘𝑒𝑦\mathit{key} (Immutable) \triangleright A key in UU
94:     𝑛𝑜𝑡𝑖𝑓𝑦𝐿𝑖𝑠𝑡\mathit{notifyList} (Mutable, initially an empty linked list) \triangleright A linked list of notify nodes
95:     𝑅𝑢𝑎𝑙𝑙𝑃𝑜𝑠𝑖𝑡𝑖𝑜𝑛\mathit{RuallPosition} (Mutable, initially a pointer to sentinel update node with key \infty)
96:Notify Node of 𝑝𝑁𝑜𝑑𝑒.𝑛𝑜𝑡𝑖𝑓𝑦𝐿𝑖𝑠𝑡\mathit{pNode}.\mathit{notifyList} for each 𝑝𝑁𝑜𝑑𝑒P-ALL\mathit{pNode}\in\textit{P-ALL}
97:     𝑘𝑒𝑦\mathit{key} (Immutable) \triangleright A key in UU
98:     𝑢𝑝𝑑𝑎𝑡𝑒𝑁𝑜𝑑𝑒\mathit{updateNode} (Immutable) \triangleright A pointer to an update node
99:     𝑢𝑝𝑑𝑎𝑡𝑒𝑁𝑜𝑑𝑒𝑀𝑎𝑥\mathit{updateNodeMax} (Immutable) \triangleright A pointer to an update node
100:     𝑛𝑜𝑡𝑖𝑓𝑦𝑇ℎ𝑟𝑒𝑠ℎ𝑜𝑙𝑑\mathit{notifyThreshold} (Immutable) \triangleright A key in UU
101:Binary Trie Node
102:     𝑑𝑁𝑜𝑑𝑒𝑃𝑡𝑟\mathit{dNodePtr} (Mutable, initially points to a dummy DEL node) \triangleright A pointer to a DEL node
Figure 5: Summary of the fields and initial values of each linked list node used by the data structure.
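As a concrete illustration of Figure 5, the nodes could be declared as follows in Java; the field names mirror the figure, while the concrete types (for example, wrapping CAS-modified fields in AtomicReference, and using a large integer as a stand-in initial value for the min-register) and the omission of the U-ALL, RU-ALL, and P-ALL linkage pointers are assumptions of this sketch, not part of the algorithm's specification.

    import java.util.concurrent.atomic.AtomicInteger;
    import java.util.concurrent.atomic.AtomicReference;

    // Sketch of the node types of Figure 5. Fields modified with CAS are wrapped in
    // Atomic* objects; fields marked immutable in the figure are written only when
    // the node is created. The linkage pointers of the U-ALL, RU-ALL and P-ALL are
    // omitted, since they belong to the list implementations.
    class UpdateNode {
        static final int INS = 0, DEL = 1;                 // values of the type field
        static final int INACTIVE = 0, ACTIVE = 1, STALE = 2;

        long key;                                          // immutable: a key in U
        int type;                                          // immutable: INS or DEL
        final AtomicInteger status = new AtomicInteger(INACTIVE);               // Inactive -> Active -> Stale
        final AtomicReference<UpdateNode> latestNext = new AtomicReference<>(); // set at creation, changes once to null
        final AtomicReference<UpdateNode> target = new AtomicReference<>();     // mutable, initially null
        volatile boolean stop;                             // changes once from false to true
        volatile boolean completed;                        // changes once from false to true

        // Additional fields used only when type == DEL.
        volatile int upper0Boundary;                       // in {0,...,b}, written only by the node's creator
        final AtomicInteger lower1Boundary = new AtomicInteger(Integer.MAX_VALUE); // min-register, initially b+1 in the algorithm
        PredecessorNode delPredNode;                       // immutable
        long delPred;                                      // immutable
        volatile long delPred2;                            // written once
    }

    class NotifyNode {
        long key;                                          // immutable
        UpdateNode updateNode;                             // immutable
        UpdateNode updateNodeMax;                          // immutable
        long notifyThreshold;                              // immutable
        volatile NotifyNode next;                          // successor in the notify list
    }

    class PredecessorNode {
        long key;                                          // immutable
        final AtomicReference<NotifyNode> notifyList = new AtomicReference<>();    // head of the notify list
        final AtomicReference<UpdateNode> ruallPosition = new AtomicReference<>(); // current position in the RU-ALL
        volatile PredecessorNode next;                     // successor in the P-ALL
    }

    class TrieNode {
        final AtomicReference<UpdateNode> dNodePtr = new AtomicReference<>();      // initially a dummy DEL node
    }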

5.2.1 Search Operations

The Search(x)(x) algorithm finds the first activated update node in latest[x]\textit{latest}[x] by calling FindLatest(x)(x). It returns True if this update node has type INS, and False if this update node has type DEL.

The helper function FindLatest(x) first reads the update node ℓ pointed to by latest[x].head. If ℓ is inactive, then the update node, m, pointed to by its latestNext pointer is read. If m is ⊥, then ℓ was activated sometime between when ℓ was read to be inactive and when ℓ.latestNext was read. If m is an update node, then m was the first activated update node in latest[x] when ℓ was read to be inactive. If FindLatest(x) returns an update node uNode, we prove that there is a configuration during the instance of FindLatest in which uNode is the first activated update node in latest[x].

The helper function FirstActivated(v) takes a pointer v to an activated update node and checks if v is the first activated update node in latest[v.key]. It first reads the update node ℓ pointed to by latest[v.key].head. If ℓ=v, then the algorithm returns True because v is the first activated update node in latest[v.key]. If ℓ is inactive and its latestNext pointer points to v, then v was the first activated update node in latest[v.key] when ℓ was read to be inactive.

103:Algorithm FindLatest(x)(x)
104:     latest[x].head\ell\leftarrow\textit{latest}[x].head
105:     if (.𝑠𝑡𝑎𝑡𝑢𝑠=Inactive\ell.\mathit{status}=\textsc{Inactive}then
106:         m.𝑙𝑎𝑡𝑒𝑠𝑡𝑁𝑒𝑥𝑡m\leftarrow\ell.\mathit{latestNext}
107:         if mm\neq\bot then return mm
         return \ell
108:Algorithm TrieSearch(x)(x)
109:     𝑢𝑁𝑜𝑑𝑒FindLatest(x)\mathit{uNode}\leftarrow\textsc{FindLatest}(x)
110:     if 𝑢𝑁𝑜𝑑𝑒.𝑡𝑦𝑝𝑒=INS\mathit{uNode}.\mathit{type}=\textsc{INS} then return True
111:     else
112:         return False      
113:Algorithm FirstActivated(v)(v)
114:     latest[v.𝑘𝑒𝑦].head\ell\leftarrow\textit{latest}[v.\mathit{key}].head
115:     return v=v=\ell OR (.status=Inactive\ell.status=\textsc{Inactive} AND v=.𝑙𝑎𝑡𝑒𝑠𝑡𝑁𝑒𝑥𝑡v=\ell.\mathit{latestNext})
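Under the same assumed Java representation, FindLatest, FirstActivated, and TrieSearch translate almost directly; the array latestHead, standing in for the latest[x].head pointers, is an assumption of this sketch.

    // Sketch of FindLatest, FirstActivated, and TrieSearch under the representation above.
    class LatestLists {
        final AtomicReference<UpdateNode>[] latestHead;    // latestHead[x] plays the role of latest[x].head

        @SuppressWarnings("unchecked")
        LatestLists(int u) {
            latestHead = new AtomicReference[u];
            for (int x = 0; x < u; x++) {
                UpdateNode dummy = new UpdateNode();       // dummy DEL node, treated as already activated
                dummy.key = x;
                dummy.type = UpdateNode.DEL;
                dummy.status.set(UpdateNode.ACTIVE);
                latestHead[x] = new AtomicReference<>(dummy);
            }
        }

        // Returns a node that was the first activated update node of latest[x]
        // at some configuration during the call (Lemma 5.3).
        UpdateNode findLatest(int x) {
            UpdateNode l = latestHead[x].get();
            if (l.status.get() == UpdateNode.INACTIVE) {
                UpdateNode m = l.latestNext.get();
                if (m != null) return m;                   // m was the first activated node when l was inactive
            }
            return l;                                      // l is activated (or became activated meanwhile)
        }

        // True only if v was the first activated update node of latest[v.key]
        // at some configuration during the call (Lemma 5.4).
        boolean firstActivated(UpdateNode v) {
            UpdateNode l = latestHead[(int) v.key].get();
            return v == l
                || (l.status.get() == UpdateNode.INACTIVE && v == l.latestNext.get());
        }

        boolean search(int x) {                            // TrieSearch(x)
            return findLatest(x).type == UpdateNode.INS;
        }
    }

Both helpers perform only a constant number of shared reads, which is consistent with the O(1) worst-case step complexity of Search.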

5.2.2 Insert Operations

We next describe the algorithm for an Insert(x) operation iOp. It is roughly divided into the following main parts: inserting a new INS node, iNode, into latest[x], adding iNode to the U-ALL and RU-ALL, updating the relaxed binary trie, notifying predecessor operations, and removing iNode from the U-ALL and RU-ALL.

It begins by finding the first activated update node in latest[x]. If this is an INS node, then iOp returns, because x is already in S. Otherwise, latest[x] begins with a DEL node, dNode. A new update node iNode containing information about iOp is created, and CAS is used to try to change latest[x].head to point from dNode to iNode. If the CAS is unsuccessful, then some other Insert(x) operation, iOp′, successfully updated latest[x].head to point to its own update node iNode′. In this case, iOp invokes HelpActivate(iNode′) to help activate iNode′, and hence linearize iOp′. First, iOp helps iOp′ add iNode′ to the U-ALL and RU-ALL, and changes the status of iNode′ from Inactive to Active. Then iOp checks if iNode′.completed is set to True (on line 121). This indicates that iOp′ has completed updating the relaxed binary trie, and iNode′ no longer needs to be in the U-ALL or RU-ALL, so iOp removes iNode′ from both lists; it is possible that iOp′ has already removed iNode′, in which case this removal has no effect. Then iOp returns.

If the CAS is successful, then iOp inserts iNode into the U-ALL and RU-ALL. The status of iNode is then changed from Inactive to Active, which announces the operation. The operation iOp is linearized immediately after the first write that changes iNode.status from Inactive to Active, which may be performed by iOp or by an Insert(x) operation helping iOp.

116:Algorithm HelpActivate(𝑢𝑁𝑜𝑑𝑒)(\mathit{uNode})
117:     if 𝑢𝑁𝑜𝑑𝑒.status=Inactive\mathit{uNode}.status=\textsc{Inactive} then
118:         Insert 𝑢𝑁𝑜𝑑𝑒\mathit{uNode} into U-ALL and RU-ALL
119:         𝑢𝑁𝑜𝑑𝑒.𝑠𝑡𝑎𝑡𝑢𝑠Active\mathit{uNode}.\mathit{status}\leftarrow\textsc{Active}
120:         𝑢𝑁𝑜𝑑𝑒.𝑙𝑎𝑡𝑒𝑠𝑡𝑁𝑒𝑥𝑡\mathit{uNode}.\mathit{latestNext}\leftarrow\bot
121:         if 𝑢𝑁𝑜𝑑𝑒.completed=True\mathit{uNode}.\textit{completed}=\textsc{True} then \triangleright 𝑢𝑁𝑜𝑑𝑒\mathit{uNode} no longer needed in U-ALL or RU-ALL
122:              Remove 𝑢𝑁𝑜𝑑𝑒\mathit{uNode} from U-ALL and RU-ALL               
123:Algorithm TraverseUall(x)(x)
124:     Initialize local variables II\leftarrow\emptyset and DD\leftarrow\emptyset
125:     𝑢𝑁𝑜𝑑𝑒U-ALL.ℎ𝑒𝑎𝑑\mathit{uNode}\leftarrow\textit{U-ALL}.\mathit{head}
126:     while 𝑢𝑁𝑜𝑑𝑒\mathit{uNode}\neq\bot and 𝑢𝑁𝑜𝑑𝑒.𝑘𝑒𝑦<x\mathit{uNode}.\mathit{key}<x do
127:         if (𝑢𝑁𝑜𝑑𝑒.𝑠𝑡𝑎𝑡𝑢𝑠Inactive\mathit{uNode}.\mathit{status}\neq\textsc{Inactive} and FirstActivated(𝑢𝑁𝑜𝑑𝑒)\textsc{FirstActivated}(\mathit{uNode})then
128:              if 𝑢𝑁𝑜𝑑𝑒.type=INS\mathit{uNode}.type=\textsc{INS} then II{𝑢𝑁𝑜𝑑𝑒}I\leftarrow I\cup\{\mathit{uNode}\}
129:              else DD{𝑢𝑁𝑜𝑑𝑒}D\leftarrow D\cup\{\mathit{uNode}\}                        
130:         𝑢𝑁𝑜𝑑𝑒𝑢𝑁𝑜𝑑𝑒.next\mathit{uNode}\leftarrow\mathit{uNode}.next      
131:     return I,DI,D
132:Algorithm NotifyPredOps(𝑢𝑁𝑜𝑑𝑒)(\mathit{uNode})
133:     I,DTraverseUall()I,D\leftarrow\textsc{TraverseUall}(\infty)
134:     for each node 𝑝𝑁𝑜𝑑𝑒P-ALL\mathit{pNode}\in\textit{P-ALL} do
135:         if FirstActivated(𝑢𝑁𝑜𝑑𝑒)\textsc{FirstActivated}(\mathit{uNode}) then
136:              Create a notify node 𝑛𝑁𝑜𝑑𝑒\mathit{nNode}:
137:               𝑛𝑁𝑜𝑑𝑒.𝑘𝑒𝑦𝑢𝑁𝑜𝑑𝑒.𝑘𝑒𝑦\mathit{nNode}.\mathit{key}\leftarrow\mathit{uNode}.\mathit{key}
138:               𝑛𝑁𝑜𝑑𝑒.𝑢𝑝𝑑𝑎𝑡𝑒𝑁𝑜𝑑𝑒𝑢𝑁𝑜𝑑𝑒\mathit{nNode}.\mathit{updateNode}\leftarrow\mathit{uNode}
139:               𝑛𝑁𝑜𝑑𝑒.𝑢𝑝𝑑𝑎𝑡𝑒𝑁𝑜𝑑𝑒𝑀𝑎𝑥\mathit{nNode}.\mathit{updateNodeMax}\leftarrow INS node in II with largest key less than 𝑝𝑁𝑜𝑑𝑒.𝑘𝑒𝑦\mathit{pNode}.\mathit{key}
140:               𝑛𝑁𝑜𝑑𝑒.𝑛𝑜𝑡𝑖𝑓𝑦𝑇ℎ𝑟𝑒𝑠ℎ𝑜𝑙𝑑𝑝𝑁𝑜𝑑𝑒.𝑅𝑢𝑎𝑙𝑙𝑃𝑜𝑠𝑖𝑡𝑖𝑜𝑛.𝑘𝑒𝑦\mathit{nNode}.\mathit{notifyThreshold}\leftarrow\mathit{pNode}.\mathit{RuallPosition}.\mathit{key}
141:              SendNotification(𝑛𝑁𝑜𝑑𝑒,𝑝𝑁𝑜𝑑𝑒)\textsc{SendNotification}(\mathit{nNode},\mathit{pNode})               
142:Algorithm SendNotification(𝑛𝑁𝑜𝑑𝑒𝑁𝑒𝑤,𝑝𝑁𝑜𝑑𝑒)(\mathit{nNodeNew},\mathit{pNode})
143:     while True do
144:         𝑛𝑁𝑜𝑑𝑒𝑝𝑁𝑜𝑑𝑒.𝑛𝑜𝑡𝑖𝑓𝑦𝐿𝑖𝑠𝑡.ℎ𝑒𝑎𝑑\mathit{nNode}\leftarrow\mathit{pNode}.\mathit{notifyList}.\mathit{head}
145:         𝑛𝑁𝑜𝑑𝑒𝑁𝑒𝑤.𝑛𝑒𝑥𝑡𝑛𝑁𝑜𝑑𝑒\mathit{nNodeNew}.\mathit{next}\leftarrow\mathit{nNode}
146:         if FirstActivated(𝑛𝑁𝑜𝑑𝑒𝑁𝑒𝑤.𝑢𝑝𝑑𝑎𝑡𝑒𝑁𝑜𝑑𝑒)=False(\mathit{nNodeNew}.\mathit{updateNode})=\textsc{False} then return          
147:         if CAS(𝑝𝑁𝑜𝑑𝑒.𝑛𝑜𝑡𝑖𝑓𝑦𝐿𝑖𝑠𝑡.ℎ𝑒𝑎𝑑,𝑛𝑁𝑜𝑑𝑒,𝑛𝑁𝑜𝑑𝑒𝑁𝑒𝑤)=True(\mathit{pNode}.\mathit{notifyList}.\mathit{head},\mathit{nNode},\mathit{nNodeNew})=\textsc{True} then return               
148:Algorithm Insert(x)(x)
149:     𝑑𝑁𝑜𝑑𝑒FindLatest(x)\mathit{dNode}\leftarrow\textsc{FindLatest}(x)
150:     if 𝑑𝑁𝑜𝑑𝑒.𝑡𝑦𝑝𝑒DEL\mathit{dNode}.\mathit{type}\neq\text{DEL} then return \triangleright xx is already in SS      
151:     Let 𝑖𝑁𝑜𝑑𝑒\mathit{iNode} be a pointer to a new update node:
152:      𝑖𝑁𝑜𝑑𝑒.𝑘𝑒𝑦x\mathit{iNode}.\mathit{key}\leftarrow x, 𝑖𝑁𝑜𝑑𝑒.𝑡𝑦𝑝𝑒INS\mathit{iNode}.\mathit{type}\leftarrow\text{INS}
153:      𝑖𝑁𝑜𝑑𝑒.𝑙𝑎𝑡𝑒𝑠𝑡𝑁𝑒𝑥𝑡𝑑𝑁𝑜𝑑𝑒\mathit{iNode}.\mathit{latestNext}\leftarrow\mathit{dNode}
154:     𝑑𝑁𝑜𝑑𝑒.𝑙𝑎𝑡𝑒𝑠𝑡𝑁𝑒𝑥𝑡\mathit{dNode}.\mathit{latestNext}\leftarrow\bot
155:     if CAS(latest[x].head,𝑑𝑁𝑜𝑑𝑒,𝑖𝑁𝑜𝑑𝑒)=False(\textit{latest}[x].head,\mathit{dNode},\mathit{iNode})=\textsc{False} then
156:         HelpActivate(latest[x].head)(\textit{latest}[x].head)
157:         return      
158:     Insert 𝑖𝑁𝑜𝑑𝑒\mathit{iNode} into U-ALL and RU-ALL.
159:     𝑖𝑁𝑜𝑑𝑒.𝑠𝑡𝑎𝑡𝑢𝑠Active\mathit{iNode}.\mathit{status}\leftarrow\textsc{Active}
160:     𝑖𝑁𝑜𝑑𝑒.𝑙𝑎𝑡𝑒𝑠𝑡𝑁𝑒𝑥𝑡\mathit{iNode}.\mathit{latestNext}\leftarrow\bot
161:     InsertBinaryTrie(𝑖𝑁𝑜𝑑𝑒)(\mathit{iNode})
162:     NotifyPredOps(𝑖𝑁𝑜𝑑𝑒)\textsc{NotifyPredOps}(\mathit{iNode})
163:     𝑖𝑁𝑜𝑑𝑒.completedTrue\mathit{iNode}.\textit{completed}\leftarrow\textsc{True}
164:     Remove 𝑖𝑁𝑜𝑑𝑒\mathit{iNode} from U-ALL and RU-ALL.
165:     return

The helper function NotifyPredOps(iNode) sends a notification containing information about iNode to all predecessor nodes in the P-ALL as follows. First, iOp invokes TraverseUall(∞) (on line 133), which finds the update nodes that are the first activated update node in their respective latest lists. INS nodes are put into iOp's local set I and DEL nodes into iOp's local set D. It simply traverses the U-ALL, checking whether each update node uNode visited is the first activated update node in latest[uNode.key]. If so, uNode is added to I if it is an INS node, or to D if it is a DEL node. We prove that the keys of the nodes in I are in the set S sometime during the traversal, while the keys in D are not in the set S sometime during the traversal.

Then iOpiOp notifies each predecessor node 𝑝𝑁𝑜𝑑𝑒\mathit{pNode} in P-ALL with 𝑝𝑁𝑜𝑑𝑒.𝑘𝑒𝑦>x\mathit{pNode}.\mathit{key}>x. It creates a new notify node, 𝑛𝑁𝑜𝑑𝑒\mathit{nNode}, containing a pointer to iOpiOp’s update node and a pointer to the INS node in II with the largest key less than 𝑝𝑁𝑜𝑑𝑒.𝑘𝑒𝑦\mathit{pNode}.\mathit{key}. It also reads the value stored in 𝑝𝑁𝑜𝑑𝑒.𝑅𝑢𝑎𝑙𝑙𝑃𝑜𝑠𝑖𝑡𝑖𝑜𝑛\mathit{pNode}.\mathit{RuallPosition}, which is a pointer to an update node. The key of this update node is written into 𝑛𝑁𝑜𝑑𝑒.𝑛𝑜𝑡𝑖𝑓𝑦𝑇ℎ𝑟𝑒𝑠ℎ𝑜𝑙𝑑\mathit{nNode}.\mathit{notifyThreshold} (on line 140). This information is used by the predecessor operation to determine if the notification sent by iOpiOp should be used to determine a candidate return value.

Before returning, the update node 𝑖𝑁𝑜𝑑𝑒\mathit{iNode} is removed from the U-ALL and RU-ALL.

5.2.3 Delete Operations

166:Algorithm Delete(x)(x)
167:     𝑖𝑁𝑜𝑑𝑒FindLatest(x)\mathit{iNode}\leftarrow\textsc{FindLatest}(x)
168:     if 𝑖𝑁𝑜𝑑𝑒.𝑡𝑦𝑝𝑒INS\mathit{iNode}.\mathit{type}\neq\text{INS} then return \triangleright xx is not in SS      
169:     𝑑𝑒𝑙𝑃𝑟𝑒𝑑,𝑝𝑁𝑜𝑑𝑒1PredHelper(x)\mathit{delPred},\mathit{pNode}1\leftarrow\textsc{PredHelper}(x)
170:     Let 𝑑𝑁𝑜𝑑𝑒\mathit{dNode} be a pointer to a new update node:
171:      𝑑𝑁𝑜𝑑𝑒.𝑘𝑒𝑦x\mathit{dNode}.\mathit{key}\leftarrow x, 𝑑𝑁𝑜𝑑𝑒.𝑡𝑦𝑝𝑒DEL\mathit{dNode}.\mathit{type}\leftarrow\text{DEL}
172:      𝑑𝑁𝑜𝑑𝑒.𝑙𝑎𝑡𝑒𝑠𝑡𝑁𝑒𝑥𝑡𝑖𝑁𝑜𝑑𝑒\mathit{dNode}.\mathit{latestNext}\leftarrow\mathit{iNode}
173:      𝑑𝑁𝑜𝑑𝑒.𝑑𝑒𝑙𝑃𝑟𝑒𝑑𝑑𝑒𝑙𝑃𝑟𝑒𝑑\mathit{dNode}.\mathit{delPred}\leftarrow\mathit{delPred}
174:      𝑑𝑁𝑜𝑑𝑒.𝑑𝑒𝑙𝑃𝑟𝑒𝑑𝑁𝑜𝑑𝑒𝑝𝑁𝑜𝑑𝑒1\mathit{dNode}.\mathit{delPredNode}\leftarrow\mathit{pNode}1
175:     𝑖𝑁𝑜𝑑𝑒.𝑙𝑎𝑡𝑒𝑠𝑡𝑁𝑒𝑥𝑡\mathit{iNode}.\mathit{latestNext}\leftarrow\bot
176:     NotifyPredOps(𝑖𝑁𝑜𝑑𝑒)(\mathit{iNode})\triangleright Help previous Insert send notifications
177:     if CAS(latest[x].head,𝑖𝑁𝑜𝑑𝑒,𝑑𝑁𝑜𝑑𝑒)=False(\textit{latest}[x].head,\mathit{iNode},\mathit{dNode})=\textsc{False} then
178:         HelpActivate(latest[x].head)(\textit{latest}[x].head)
179:         Delete 𝑝𝑁𝑜𝑑𝑒1\mathit{pNode}1 from P-ALL.
180:         return      
181:     Insert 𝑑𝑁𝑜𝑑𝑒\mathit{dNode} into U-ALL and RU-ALL
182:     𝑑𝑁𝑜𝑑𝑒.𝑠𝑡𝑎𝑡𝑢𝑠Active\mathit{dNode}.\mathit{status}\leftarrow\textsc{Active}
183:     𝑖𝑁𝑜𝑑𝑒.target.stopTrue\mathit{iNode}.\textit{target}.\textit{stop}\leftarrow\textsc{True}
184:     𝑑𝑁𝑜𝑑𝑒.𝑙𝑎𝑡𝑒𝑠𝑡𝑁𝑒𝑥𝑡\mathit{dNode}.\mathit{latestNext}\leftarrow\bot
185:     delPred2,𝑝𝑁𝑜𝑑𝑒2PredHelper(x)\mathit{delPred2},\mathit{pNode}2\leftarrow\textsc{PredHelper}(x)
186:     𝑑𝑁𝑜𝑑𝑒.delPred2delPred2\mathit{dNode}.\mathit{delPred2}\leftarrow\mathit{delPred2}
187:     DeleteBinaryTrie(𝑑𝑁𝑜𝑑𝑒)(\mathit{dNode})
188:     NotifyPredOps(𝑑𝑁𝑜𝑑𝑒)(\mathit{dNode})
189:     𝑑𝑁𝑜𝑑𝑒.completedTrue\mathit{dNode}.\textit{completed}\leftarrow\textsc{True}
190:     Delete 𝑝𝑁𝑜𝑑𝑒1\mathit{pNode}1 and 𝑝𝑁𝑜𝑑𝑒2\mathit{pNode}2 from P-ALL.
191:     Delete 𝑑𝑁𝑜𝑑𝑒\mathit{dNode} from U-ALL and RU-ALL

We next describe the algorithm for a Delete(x) operation, dOp. The algorithm is similar to that of an Insert(x) operation, but with a few more parts. Most importantly, dOp may perform embedded predecessor operations, which we describe later. The main parts of a Delete(x) operation are performing an embedded predecessor operation, notifying Predecessor operations about the previous S-modifying Insert(x) operation, inserting a new DEL node into latest[x] and then into the U-ALL and RU-ALL, performing a second embedded predecessor operation, updating the relaxed binary trie, and then notifying Predecessor operations about its own operation.

The Delete(x) operation, dOp, begins by finding the first activated update node in latest[x]. It immediately returns if this is a DEL node, since x∉S. Otherwise, latest[x] begins with an INS node, iNode. Next, dOp performs an embedded predecessor operation with key x. The result of the embedded predecessor is saved in a new DEL node dNode, along with other information. Recall that this embedded predecessor will be used by concurrent Predecessor operations in the case that dOp prevents them from traversing the relaxed binary trie.

Additionally, dOp performs NotifyPredOps(iNode) to help the Insert operation that created iNode notify predecessor operations. Since an Insert(x) operation does not send notifications once its update node is no longer the first activated update node in latest[x], this ensures that at least one update operation with key x notifies all predecessor nodes about iNode before a Delete(x) operation is linearized.

Then dOpdOp attempts to add 𝑑𝑁𝑜𝑑𝑒\mathit{dNode} to the head of latest[x]\textit{latest}[x], by using CAS to change latest[x].ℎ𝑒𝑎𝑑\textit{latest}[x].\mathit{head} to point from 𝑖𝑁𝑜𝑑𝑒\mathit{iNode} to 𝑑𝑁𝑜𝑑𝑒\mathit{dNode}. If the CAS is unsuccessful, a concurrent Delete(x)(x) operation, dOpdOp^{\prime}, successfully updated latest[x].ℎ𝑒𝑎𝑑\textit{latest}[x].\mathit{head} to point to some update node 𝑑𝑁𝑜𝑑𝑒\mathit{dNode}^{\prime}. So dOpdOp performs HelpActivate(𝑑𝑁𝑜𝑑𝑒)(\mathit{dNode}^{\prime}) to help dOpdOp^{\prime} linearize and then dOpdOp returns. If the CAS is successful, dOpdOp inserts 𝑑𝑁𝑜𝑑𝑒\mathit{dNode} into the U-ALL and RU-ALL and changes 𝑑𝑁𝑜𝑑𝑒.𝑠𝑡𝑎𝑡𝑢𝑠\mathit{dNode}.\mathit{status} from Inactive to Active using a write. The linearization point of dOpdOp is immediately after the status of 𝑑𝑁𝑜𝑑𝑒\mathit{dNode} first changes from Inactive to Active. Next, dOpdOp performs a second embedded predecessor operation and records the result in 𝑑𝑁𝑜𝑑𝑒\mathit{dNode}.

The binary trie is updated in DeleteBinaryTrie(dNode). It proceeds similarly to the sequential binary trie delete algorithm, traversing a path through the binary trie starting from the leaf with key x. Let t be an internal binary trie node on the path from the leaf with key x to the root. Suppose dOp successfully changed the interpreted bit of one of t's children to 0. If the interpreted bit of the other child of t is 0, then dOp attempts to change the interpreted bit of t to 0. Recall that t depends on the first activated update node in latest[t.dNodePtr.key]. To change t to depend on dNode, dOp performs CAS to attempt to change t.dNodePtr to point to dNode. Note that dOp performs two attempts of this CAS, each time checking that dNode.stop is not set to True (indicating a concurrent Insert(x) wants to set the interpreted bit of t to 1) and that dNode is still the first activated update node in latest[x]. Two CAS attempts are performed to prevent out-dated Delete operations that were poised to perform CAS from conflicting with the latest Delete operations. If dOp is unsuccessful in both its CAS attempts, it can stop updating the binary trie because some concurrent Delete(x′) operation, with key x′∈U_t, successfully changed t.dNodePtr to point to its own DEL node. Otherwise dOp is successful in changing t to depend on dNode. Immediately after dOp's successful CAS, the interpreted bit of t is still 1 (because dNode.upper0Boundary has not yet been incremented to the height of t). Once again, dOp verifies that both children of t have interpreted bit 0, and otherwise it returns. To change the interpreted bit of t to 0, dOp writes the height of t into dNode.upper0Boundary, which increments its value. This indicates that all binary trie nodes at the height of t and below that depend on dNode have interpreted bit 0. Only dOp, the creator of dNode, writes to dNode.upper0Boundary. Since dOp changes the interpreted bits of binary trie nodes in order from the leaf with key x to the root, dNode.upper0Boundary is only ever incremented by 1 starting from 0.
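Continuing the assumed Java sketches above, the control flow of DeleteBinaryTrie described in this paragraph can be outlined as follows; leaf, root, parent, sibling, leftChild, rightChild, height, and interpretedBit are stand-ins for the relaxed binary trie machinery of Section 4 and the implicit tree structure, so this is a sketch of the traversal logic rather than the algorithm's actual pseudocode.

    // Stand-ins for the relaxed binary trie of Section 4; these signatures are
    // assumptions of the sketch, not part of the data structure's interface.
    interface RelaxedTrie {
        TrieNode leaf(long key);
        TrieNode root();
        TrieNode parent(TrieNode t);
        TrieNode sibling(TrieNode t);
        TrieNode leftChild(TrieNode t);
        TrieNode rightChild(TrieNode t);
        int interpretedBit(TrieNode t);                    // 0 or 1
        int height(TrieNode t);
    }

    class TrieDelete {
        final LatestLists latest;
        TrieDelete(LatestLists latest) { this.latest = latest; }

        // Control-flow sketch of DeleteBinaryTrie(dNode), as described in the text.
        void deleteBinaryTrie(UpdateNode dNode, RelaxedTrie trie) {
            TrieNode t = trie.leaf(dNode.key);             // the leaf's interpreted bit is already 0 via latest[x]
            while (t != trie.root()) {
                // Only move up if the sibling's interpreted bit is also 0.
                if (trie.interpretedBit(trie.sibling(t)) != 0) return;
                t = trie.parent(t);

                // Two CAS attempts to make t depend on dNode, each guarded by the checks above.
                boolean swapped = false;
                for (int attempt = 0; attempt < 2 && !swapped; attempt++) {
                    UpdateNode expected = t.dNodePtr.get();
                    if (dNode.stop || !latest.firstActivated(dNode)) return;
                    swapped = t.dNodePtr.compareAndSet(expected, dNode);
                }
                if (!swapped) return;                      // a concurrent Delete with a key in U_t changed t.dNodePtr itself

                // Re-verify both children before letting t's interpreted bit become 0.
                if (trie.interpretedBit(trie.leftChild(t)) != 0
                        || trie.interpretedBit(trie.rightChild(t)) != 0) return;
                dNode.upper0Boundary = trie.height(t);     // only dNode's creator writes this field
            }
        }
    }

As in the description above, the sketch stops as soon as both CAS attempts fail, since some concurrent Delete with a key in U_t has changed t.dNodePtr itself.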

Once the relaxed binary trie has been updated, dOpdOp notifies predecessor operations using NotifyPredOps(𝑑𝑁𝑜𝑑𝑒)(\mathit{dNode}) described previously. Finally, dOpdOp deletes the predecessor nodes it created for its embedded predecessor operations from P-ALL and removes its update node 𝑑𝑁𝑜𝑑𝑒\mathit{dNode} from the U-ALL and RU-ALL.

5.2.4 Predecessor Operations

A Predecessor(y) operation begins by calling an instance, pOp, of PredHelper(y), which does all the steps of the operation except for removing the announcement from the P-ALL. This helper function is also used by Delete(y) operations to perform their embedded predecessor operations. Recall that these embedded predecessor operations do not remove their announcements from the P-ALL until the end of their Delete(y) operation. The helper function PredHelper(y) is complicated, so its description is divided into six main parts: announcing the operation in the P-ALL, traversing the RU-ALL, traversing the relaxed binary trie, traversing the U-ALL, collecting notifications, and handling the case when the traversal of the relaxed binary trie returns ⊥. These parts are performed one after the other in this order.

192:Algorithm PredHelper(y)(y)
193:     Create predecessor node 𝑝𝑁𝑜𝑑𝑒\mathit{pNode} with key yy
194:     Insert 𝑝𝑁𝑜𝑑𝑒\mathit{pNode} to the head of P-ALL
195:     𝑝𝑁𝑜𝑑𝑒𝑝𝑁𝑜𝑑𝑒.𝑛𝑒𝑥𝑡\mathit{pNode}^{\prime}\leftarrow\mathit{pNode}.\mathit{next}
196:     Q()Q\leftarrow() \triangleright Initialize local sequence
197:     while 𝑝𝑁𝑜𝑑𝑒\mathit{pNode}^{\prime}\neq\bot do
198:         prepend 𝑝𝑁𝑜𝑑𝑒\mathit{pNode}^{\prime} to QQ
199:         𝑝𝑁𝑜𝑑𝑒𝑝𝑁𝑜𝑑𝑒.𝑛𝑒𝑥𝑡\mathit{pNode}^{\prime}\leftarrow\mathit{pNode}^{\prime}.\mathit{next}      
200:     \triangleright Determine active Delete operations at the start of Pred operation
201:     (I𝑟𝑢𝑎𝑙𝑙,D𝑟𝑢𝑎𝑙𝑙)TraverseRUall(𝑝𝑁𝑜𝑑𝑒)(I_{\mathit{ruall}},D_{\mathit{ruall}})\leftarrow\textsc{TraverseRUall}(\mathit{pNode})
202:     p0RelaxedPredecessor(y)p_{0}\leftarrow\textsc{RelaxedPredecessor}(y)
203:     (I𝑢𝑎𝑙𝑙,D𝑢𝑎𝑙𝑙)TraverseUall(y)(I_{\mathit{uall}},D_{\mathit{uall}})\leftarrow\textsc{TraverseUall}(y)
204:     (I𝑛𝑜𝑡𝑖𝑓𝑦,D𝑛𝑜𝑡𝑖𝑓𝑦)(,)(I_{\mathit{notify}},D_{\mathit{notify}})\leftarrow(\emptyset,\emptyset)
205:     for each notify node 𝑛𝑁𝑜𝑑𝑒\mathit{nNode} in 𝑝𝑁𝑜𝑑𝑒.𝑛𝑜𝑡𝑖𝑓𝑦𝐿𝑖𝑠𝑡\mathit{pNode}.\mathit{notifyList} with key less than yy do
206:         if 𝑛𝑁𝑜𝑑𝑒.𝑢𝑝𝑑𝑎𝑡𝑒𝑁𝑜𝑑𝑒.𝑡𝑦𝑝𝑒=INS\mathit{nNode}.\mathit{updateNode}.\mathit{type}=\text{INS} then
207:              if 𝑛𝑁𝑜𝑑𝑒.𝑛𝑜𝑡𝑖𝑓𝑦𝑇ℎ𝑟𝑒𝑠ℎ𝑜𝑙𝑑𝑛𝑁𝑜𝑑𝑒.𝑘𝑒𝑦\mathit{nNode}.\mathit{notifyThreshold}\leq\mathit{nNode}.\mathit{key} then
208:                  I𝑛𝑜𝑡𝑖𝑓𝑦I𝑛𝑜𝑡𝑖𝑓𝑦{𝑛𝑁𝑜𝑑𝑒.𝑢𝑝𝑑𝑎𝑡𝑒𝑁𝑜𝑑𝑒}I_{\mathit{notify}}\leftarrow I_{\mathit{notify}}\cup\{\mathit{nNode}.\mathit{updateNode}\}               
209:         else
210:              if 𝑛𝑁𝑜𝑑𝑒.𝑛𝑜𝑡𝑖𝑓𝑦𝑇ℎ𝑟𝑒𝑠ℎ𝑜𝑙𝑑<𝑛𝑁𝑜𝑑𝑒.𝑘𝑒𝑦\mathit{nNode}.\mathit{notifyThreshold}<\mathit{nNode}.\mathit{key} then
211:                  D𝑛𝑜𝑡𝑖𝑓𝑦D𝑛𝑜𝑡𝑖𝑓𝑦{𝑛𝑁𝑜𝑑𝑒.𝑢𝑝𝑑𝑎𝑡𝑒𝑁𝑜𝑑𝑒}D_{\mathit{notify}}\leftarrow D_{\mathit{notify}}\cup\{\mathit{nNode}.\mathit{updateNode}\}                        
212:         if 𝑛𝑁𝑜𝑑𝑒.𝑛𝑜𝑡𝑖𝑓𝑦𝑇ℎ𝑟𝑒𝑠ℎ𝑜𝑙𝑑=\mathit{nNode}.\mathit{notifyThreshold}=-\infty and 𝑛𝑁𝑜𝑑𝑒.𝑢𝑝𝑑𝑎𝑡𝑒𝑁𝑜𝑑𝑒(I𝑟𝑢𝑎𝑙𝑙D𝑟𝑢𝑎𝑙𝑙)\mathit{nNode}.\mathit{updateNode}\notin(I_{\mathit{ruall}}\cup D_{\mathit{ruall}}) then
213:              I𝑛𝑜𝑡𝑖𝑓𝑦I𝑛𝑜𝑡𝑖𝑓𝑦{𝑛𝑁𝑜𝑑𝑒.𝑢𝑝𝑑𝑎𝑡𝑒𝑁𝑜𝑑𝑒𝑀𝑎𝑥}I_{\mathit{notify}}\leftarrow I_{\mathit{notify}}\cup\{\mathit{nNode}.\mathit{updateNodeMax}\}               
214:     Kkeys of the update nodes in I𝑢𝑎𝑙𝑙I𝑛𝑜𝑡𝑖𝑓𝑦(D𝑢𝑎𝑙𝑙D𝑟𝑢𝑎𝑙𝑙)(D𝑛𝑜𝑡𝑖𝑓𝑦D𝑟𝑢𝑎𝑙𝑙)K\leftarrow\text{keys of the update nodes in }I_{\mathit{uall}}\cup I_{\mathit{notify}}\cup(D_{\mathit{uall}}-D_{\mathit{ruall}})\cup(D_{\mathit{notify}}-D_{\mathit{ruall}})
215:     p1max{K,1}p_{1}\leftarrow\max\{K,-1\}
216:     \triangleright Unsuccessful traversal of relaxed binary trie
217:     if p0=p_{0}=\bot and D𝑟𝑢𝑎𝑙𝑙D_{\mathit{ruall}}\neq\emptyset then
218:         L1()L_{1}\leftarrow() \triangleright Initialize empty sequence
219:         𝑝𝑟𝑒𝑑𝑁𝑜𝑑𝑒𝑠{𝑝𝑁𝑜𝑑𝑒𝑑𝑁𝑜𝑑𝑒D𝑟𝑢𝑎𝑙𝑙 where 𝑑𝑁𝑜𝑑𝑒.𝑑𝑒𝑙𝑃𝑟𝑒𝑑𝑁𝑜𝑑𝑒=𝑝𝑁𝑜𝑑𝑒}\mathit{predNodes}\leftarrow\{\mathit{pNode}^{\prime}\mid\exists\mathit{dNode}\in D_{\mathit{ruall}}\text{ where }\mathit{dNode}.\mathit{delPredNode}=\mathit{pNode}^{\prime}\}
220:         if QQ contains a predecessor node in 𝑝𝑟𝑒𝑑𝑁𝑜𝑑𝑒𝑠\mathit{predNodes} then
221:              𝑝𝑁𝑜𝑑𝑒\mathit{pNode}^{\prime}\leftarrow predecessor node in 𝑝𝑟𝑒𝑑𝑁𝑜𝑑𝑒𝑠\mathit{predNodes} that occurs earliest in QQ
222:              for each notify node 𝑛𝑁𝑜𝑑𝑒\mathit{nNode} in 𝑝𝑁𝑜𝑑𝑒.𝑛𝑜𝑡𝑖𝑓𝑦𝐿𝑖𝑠𝑡\mathit{pNode^{\prime}}.\mathit{notifyList} with key less than yy do
223:                  prepend 𝑛𝑁𝑜𝑑𝑒.𝑢𝑝𝑑𝑎𝑡𝑒𝑁𝑜𝑑𝑒\mathit{nNode}.\mathit{updateNode} to L1L_{1}                        
224:         L{}L^{\prime}\leftarrow\{\}, L2()L_{2}\leftarrow() \triangleright Initialize empty set and sequence
225:         for each notify node 𝑛𝑁𝑜𝑑𝑒\mathit{nNode} in 𝑝𝑁𝑜𝑑𝑒.𝑛𝑜𝑡𝑖𝑓𝑦𝐿𝑖𝑠𝑡\mathit{pNode}.\mathit{notifyList} with key less than yy do
226:              if 𝑛𝑁𝑜𝑑𝑒.𝑛𝑜𝑡𝑖𝑓𝑦𝑇ℎ𝑟𝑒𝑠ℎ𝑜𝑙𝑑𝑛𝑁𝑜𝑑𝑒.𝑘𝑒𝑦\mathit{nNode}.\mathit{notifyThreshold}\geq\mathit{nNode}.\mathit{key} then
227:                  prepend 𝑛𝑁𝑜𝑑𝑒.𝑢𝑝𝑑𝑎𝑡𝑒𝑁𝑜𝑑𝑒\mathit{nNode}.\mathit{updateNode} to L2L_{2}
228:              else
229:                  LL{𝑛𝑁𝑜𝑑𝑒.𝑢𝑝𝑑𝑎𝑡𝑒𝑁𝑜𝑑𝑒}L^{\prime}\leftarrow L^{\prime}\cup\{\mathit{nNode}.\mathit{updateNode}\}                        
230:         LL\leftarrow sequence of update nodes in L1L_{1} followed by L2L_{2} that are not in LL^{\prime}
231:         R{w𝑑𝑁𝑜𝑑𝑒D𝑟𝑢𝑎𝑙𝑙 where 𝑑𝑁𝑜𝑑𝑒.𝑑𝑒𝑙𝑃𝑟𝑒𝑑=w}R\leftarrow\{w\mid\exists\mathit{dNode}\in D_{\mathit{ruall}}\text{ where }\mathit{dNode}.\mathit{delPred}=w\}
232:         for each update node 𝑢𝑁𝑜𝑑𝑒\mathit{uNode} in LL in order do
233:              if 𝑢𝑁𝑜𝑑𝑒.𝑡𝑦𝑝𝑒=INS\mathit{uNode}.\mathit{type}=\text{INS} then RR{𝑢𝑁𝑜𝑑𝑒.𝑘𝑒𝑦}R\leftarrow R\cup\{\mathit{uNode}.\mathit{key}\}
234:              else
235:                  if  𝑢𝑁𝑜𝑑𝑒.𝑘𝑒𝑦R\mathit{uNode}.\mathit{key}\in R  then RR{𝑢𝑁𝑜𝑑𝑒.𝑘𝑒𝑦}{𝑢𝑁𝑜𝑑𝑒.delPred2}R\leftarrow R-\{\mathit{uNode}.\mathit{key}\}\cup\{\mathit{uNode}.\mathit{delPred2}\}                                          
236:         RR{wR𝑑𝑁𝑜𝑑𝑒D𝑟𝑢𝑎𝑙𝑙 where 𝑑𝑁𝑜𝑑𝑒.𝑘𝑒𝑦=w}R\leftarrow R-\{w\in R\mid\exists\mathit{dNode}\in D_{\mathit{ruall}}\text{ where }\mathit{dNode}.\mathit{key}=w\}
237:         p0max{R}p_{0}\leftarrow\max\{R\}      
238:     return max{p0,p1}\max\{p_{0},p_{1}\}, 𝑝𝑁𝑜𝑑𝑒\mathit{pNode}
239:Algorithm Predecessor(y)(y)
240:     𝑝𝑟𝑒𝑑,𝑝𝑁𝑜𝑑𝑒PredHelper(y)\mathit{pred},\mathit{pNode}\leftarrow\textsc{PredHelper}(y)
241:     Remove 𝑝𝑁𝑜𝑑𝑒\mathit{pNode} from P-ALL
242:     return 𝑝𝑟𝑒𝑑\mathit{pred}

Announcing the operation in the P-ALL: An instance, pOp, of PredHelper(y) announces itself by creating a new predecessor node, pNode, with key y and inserting it at the head of the P-ALL (on line 194). Then it traverses the P-ALL starting from pNode (on lines 195 to 199), locally storing the sequence of predecessor nodes it encounters in a sequence Q.


Traversing the RU-ALL: The traversal of the RU-ALL is done by a call to TraverseRUall(pNode). During this traversal, pOp identifies each update node with key less than y that is the first activated update node in its latest list. Those with type INS are put into pOp's local set I_ruall, while those with type DEL are put into pOp's local set D_ruall. The sets I_ruall and D_ruall include the update nodes of all S-modifying update operations that are linearized before the start of pOp and are still active at the start of pOp's traversal of the relaxed binary trie. They may additionally contain update nodes of update operations linearized shortly after the start of pOp, because it is difficult to distinguish them from those that were linearized before the start of pOp. Since the Delete operations of DEL nodes in D_ruall may be linearized before the start of pOp, they are not used to determine candidate return values. Instead, they are used to eliminate announcements or notifications later seen by pOp when it traverses the U-ALL or pNode.notifyList.

While pOppOp is traversing the RU-ALL, it makes available the key of the update node it is currently visiting in the RU-ALL. This is done by maintaining 𝑝𝑁𝑜𝑑𝑒.𝑅𝑢𝑎𝑙𝑙𝑃𝑜𝑠𝑖𝑡𝑖𝑜𝑛\mathit{pNode}.\mathit{RuallPosition}, which contains a pointer to the update node in the RU-ALL that pOppOp is currently visiting. Recall that the key field of an update node is immutable, so a pointer to this node is sufficient to obtain its key. Initially, 𝑝𝑁𝑜𝑑𝑒.𝑅𝑢𝑎𝑙𝑙𝑃𝑜𝑠𝑖𝑡𝑖𝑜𝑛\mathit{pNode}.\mathit{RuallPosition} points to the sentinel at the head of the RU-ALL, which has key \infty. Only pOppOp modifies 𝑝𝑁𝑜𝑑𝑒.𝑅𝑢𝑎𝑙𝑙𝑃𝑜𝑠𝑖𝑡𝑖𝑜𝑛\mathit{pNode}.\mathit{RuallPosition}. Each time pOppOp reads a pointer to the next node in the RU-ALL, pOppOp atomically copies this pointer into 𝑝𝑁𝑜𝑑𝑒.𝑅𝑢𝑎𝑙𝑙𝑃𝑜𝑠𝑖𝑡𝑖𝑜𝑛\mathit{pNode}.\mathit{RuallPosition}. Single-writer atomic copy can be implemented from CAS with O(1)O(1) worst-case step complexity [4].

The pseudocode for TraverseRUall(𝑝𝑁𝑜𝑑𝑒)(\mathit{pNode}) is as follows. The local variable 𝑢𝑁𝑜𝑑𝑒\mathit{uNode} is initialized to point to the sentinel node with key \infty at the head of the RU-ALL (on line 246). Then pOppOp traverses the RU-ALL one update node at a time, atomically copying 𝑢𝑁𝑜𝑑𝑒.𝑛𝑒𝑥𝑡\mathit{uNode}.\mathit{next} into 𝑝𝑁𝑜𝑑𝑒.𝑅𝑢𝑎𝑙𝑙𝑃𝑜𝑠𝑖𝑡𝑖𝑜𝑛\mathit{pNode}.\mathit{RuallPosition} (on line 248) before progressing to the next node. It traverses the list until it first reaches an update node with key less than yy. From this point on, it checks whether the update node it is pointing to is the first activated update node in latest[𝑢𝑁𝑜𝑑𝑒.𝑘𝑒𝑦]\textit{latest}[\mathit{uNode}.\mathit{key}] (on line 251). If so, the update node is added to I𝑟𝑢𝑎𝑙𝑙I_{\mathit{ruall}} or D𝑟𝑢𝑎𝑙𝑙D_{\mathit{ruall}}, depending on its type. When 𝑢𝑁𝑜𝑑𝑒.𝑘𝑒𝑦=\mathit{uNode}.\mathit{key}=-\infty, 𝑝𝑁𝑜𝑑𝑒.𝑅𝑢𝑎𝑙𝑙𝑃𝑜𝑠𝑖𝑡𝑖𝑜𝑛\mathit{pNode}.\mathit{RuallPosition} points to the sentinel node at the end of the RU-ALL.

243:Algorithm TraverseRUall(𝑝𝑁𝑜𝑑𝑒)(\mathit{pNode})
244:     Initialize local variables II\leftarrow\emptyset and DD\leftarrow\emptyset
245:     y𝑝𝑁𝑜𝑑𝑒.𝑘𝑒𝑦y\leftarrow\mathit{pNode}.\mathit{key}
246:     𝑢𝑁𝑜𝑑𝑒𝑝𝑁𝑜𝑑𝑒.𝑅𝑢𝑎𝑙𝑙𝑃𝑜𝑠𝑖𝑡𝑖𝑜𝑛\mathit{uNode}\leftarrow\mathit{pNode}.\mathit{RuallPosition}
247:     do
248:         atomic copy 𝑢𝑁𝑜𝑑𝑒.𝑛𝑒𝑥𝑡\mathit{uNode}.\mathit{next} to 𝑝𝑁𝑜𝑑𝑒.𝑅𝑢𝑎𝑙𝑙𝑃𝑜𝑠𝑖𝑡𝑖𝑜𝑛\mathit{pNode}.\mathit{RuallPosition} \triangleright Atomic read and write
249:         𝑢𝑁𝑜𝑑𝑒𝑝𝑁𝑜𝑑𝑒.𝑅𝑢𝑎𝑙𝑙𝑃𝑜𝑠𝑖𝑡𝑖𝑜𝑛\mathit{uNode}\leftarrow\mathit{pNode}.\mathit{RuallPosition}
250:         if 𝑢𝑁𝑜𝑑𝑒.𝑘𝑒𝑦<y\mathit{uNode}.\mathit{key}<y then
251:              if (𝑢𝑁𝑜𝑑𝑒.𝑠𝑡𝑎𝑡𝑢𝑠Inactive\mathit{uNode}.\mathit{status}\neq\textsc{Inactive} and FirstActivated(𝑢𝑁𝑜𝑑𝑒)\textsc{FirstActivated}(\mathit{uNode})then
252:                  if 𝑢𝑁𝑜𝑑𝑒.type=INS\mathit{uNode}.type=\textsc{INS} then II{𝑢𝑁𝑜𝑑𝑒}I\leftarrow I\cup\{\mathit{uNode}\}
253:                  else DD{𝑢𝑁𝑜𝑑𝑒}D\leftarrow D\cup\{\mathit{uNode}\}                                          
254:     while (𝑢𝑁𝑜𝑑𝑒.𝑘𝑒𝑦)(\mathit{uNode}.\mathit{key}\neq-\infty)
255:     return I,DI,D

We now explain the purpose of having available the key of the update node pOp is currently visiting in the RU-ALL. Recall that when an update operation creates a notify node, nNode, to add to pNode.notifyList, it reads pNode.RuallPosition.key and writes it into nNode.notifyThreshold. This is used by pOp to determine if nNode.key should be used as a candidate return value. For example, consider a Delete(w) operation, for some key w<y, linearized before the start of pOp, that notifies pOp. This Delete(w) operation's DEL node should not be used to determine a candidate return value, because w was removed from S before the start of pOp. If pOp sees this DEL node when it traverses the RU-ALL, then it is added to D_ruall and hence will not be used. Otherwise, this DEL node was removed from the RU-ALL before pOp could see it during its traversal of the RU-ALL. The notification from a Delete operation should only be used to determine a candidate return value if pOp can guarantee that the Delete operation was linearized sometime during pOp. In particular, when pOp does not add a DEL node into D_ruall and pOp is currently at an update node with key strictly less than w, the Delete(w) operation must have added its DEL node into the RU-ALL before this update node. So only after pOp has encountered an update node with key less than w during its traversal of the RU-ALL does pOp begin accepting the notifications of Delete(w) operations. Note that pOp cannot accept notifications from Insert(w) operations until pOp also accepts notifications from Delete operations with key larger than w. Otherwise pOp may miss a candidate return value larger than w. So only after pOp has encountered an update node with key less than or equal to w during its traversal of the RU-ALL does pOp begin accepting the notifications of Insert(w) operations.

It is important that the RU-ALL is sorted by decreasing key: as pOp traverses the RU-ALL, it begins accepting notifications from update operations with progressively smaller keys. This ensures that if pOp accepts the notification from an update operation, it does not miss notifications of update operations with larger keys. For example, consider the execution depicted in Figure 6.

Refer to caption
Figure 6: An example execution of a Delete(x)(x), Delete(w)(w), and Predecessor(y)(y) operation, where w<x<yw<x<y.

There are three concurrent operations: a Predecessor(y) operation, a Delete(x) operation, and a Delete(w) operation, where w<x<y. The Delete(x) operation is linearized after the Delete(w) operation, and both are linearized after the start of the Predecessor(y) operation, pOp. Notice that x∈S in all configurations during pOp in which w∈S. So if w is a candidate return value of pOp, pOp must also determine a candidate return value which is at least x. Hence, if pOp accepts notifications from Delete(w) operations, it must also accept notifications from update operations with keys larger than w, and in particular the notification with key x.

Atomic copy is used to make sure that no update operations modify the next pointer of an update node in the RU-ALL between when pOp reads this next pointer and when pOp writes a copy of it into pNode.RuallPosition. Otherwise pOp may miss an update operation whose key should be used as a candidate return value. To see why, we consider the following execution where pOp does not use atomic copy, which is depicted in Figure 7. The RU-ALL contains an update node, uNode_20, with key 20, and the two sentinel nodes with keys ∞ and −∞. A Predecessor(40) operation, pOp, reads a pointer to uNode_20 during the first step of its traversal of the RU-ALL, but it does not yet write it into its predecessor node, pNode. An S-modifying Delete(25) operation, dOp_25, is linearized, and then an S-modifying Delete(29) operation, dOp_29, is linearized. The DEL nodes of these Delete operations are not seen by pOp because they are added to the RU-ALL before pOp's current location in the RU-ALL. Then dOp_29 attempts to notify pOp. It reads that pNode.RuallPosition points to the sentinel node with key ∞, so dOp_29 writes ∞ to the notify threshold of its notification. Hence, the notification is rejected by pOp. Now pOp writes the pointer to uNode_20 to pNode.RuallPosition. So pOp begins accepting the notifications of Delete operations with key greater than 20. When dOp_25 attempts to notify pOp, it reads that pNode.RuallPosition points to uNode_20, so it writes 20 to the notify threshold of its notification. Hence, the notification is accepted by pOp, and 25 is a candidate return value of pOp. In all configurations in which 25 is in S, the key 29 is also in S. So pOp should not return 25. By using atomic copy, either both 25 and 29 will be added as candidate return values, or neither will be.

Refer to caption
Figure 7: Example execution of a Predecessor(40)(40), Delete(25)(25), and Delete(29)(29). The vertical dashed lines indicate when a Delete operation notifies the Predecessor(40)(40) operation.

Traversing the relaxed binary trie: Following pOppOp’s traversal of the RU-ALL, pOppOp traverses the relaxed binary trie using RelaxedPredecessor(y)(y) (on line 202). This has been described in Section 4. If it returns a value other than \bot, the value is a candidate return value of pOppOp.


Traversing the U-ALL: The traversal of the U-ALL is done in TraverseUall(y)(y) (on line 203). Recall that it returns two sets of update nodes I𝑢𝑎𝑙𝑙I_{\mathit{uall}} and D𝑢𝑎𝑙𝑙D_{\mathit{uall}}. The keys of INS nodes in I𝑢𝑎𝑙𝑙I_{\mathit{uall}} are candidate return values. The keys of DEL nodes in D𝑢𝑎𝑙𝑙D_{\mathit{uall}} not seen during pOppOp’s traversal of RU-ALL are candidate return values. This is because the Delete operations that created these DEL nodes are linearized sometime during pOppOp.


Collecting notifications: The collection of pOp's notifications is done on lines 205 to 213. Consider a notify node, nNode, created by an update operation, uOp, with key less than y that pOp encounters in its notifyList at the beginning of the for-loop on line 205. If nNode was created by an Insert operation, then nNode is accepted if nNode.notifyThreshold is less than or equal to nNode.key. In this case, the update node created by the Insert operation is put into pOp's local set I_notify. If nNode was created by a Delete operation, then nNode is accepted if nNode.notifyThreshold is less than nNode.key. In this case, the update node created by the Delete operation is put into pOp's local set D_notify. Note that, if nNode is not accepted, it may still be used in the sixth part of the algorithm.

Recall that nNode.updateNodeMax is the INS node with largest key less than y that uOp identified during its traversal of the U-ALL. This traversal is performed before uOp notifies any Predecessor operations. Operation pOp also determines if the key of this INS node should be used as a candidate return value: if nNode.notifyThreshold=−∞ (indicating that pOp had completed its traversal of the RU-ALL when pOp was notified) and nNode.updateNode is not an update node in I_ruall or D_ruall (checked on line 212), then nNode.updateNodeMax is also added to I_notify. We need to ensure that pOp does not miss relevant Insert operations linearized after pOp completed its traversal of the U-ALL and before uOp is linearized. These Insert operations might not notify pOp, and their announcements are not seen by pOp when it traverses the U-ALL. We guarantee that nNode.updateNodeMax is the INS node with largest key less than y that falls into this category. For example, consider the execution shown in Figure 8. Let w, x, and y be three keys where w<x<y. An Insert(x) operation, iOp_x, is linearized before an Insert(w) operation, iOp_w, and both are linearized after a Predecessor(y) operation, pOp, has completed its traversal of the U-ALL. Suppose iOp_w notifies pOp, but iOp_x does not. Then w is a candidate return value of pOp. Note that pOp does not see the announcement of iOp_x when it traverses the U-ALL. In this execution, x∈S whenever w∈S. Since pOp returns its largest candidate return value and w is a candidate return value, pOp must determine a candidate return value that is at least x. The INS node of iOp_x is in the U-ALL throughout iOp_w's traversal of the U-ALL and, hence, is seen by iOp_w. So, when iOp_w notifies pOp, iOp_w will set updateNodeMax to point to this INS node. Hence, x is a candidate return value of pOp.

Refer to caption
Figure 8: Example execution of a Predecessor(y)(y), Insert(w)(w), and Insert(x)(x), where w<x<yw<x<y.

When the traversal of the relaxed binary trie returns \bot: Let kk be the largest key less than yy that is completely present throughout pOppOp’s traversal of the relaxed binary trie, or 1-1 if no such key exists. If pOppOp’s traversal returns \bot, then by the specification of the relaxed binary trie, there is an SS-modifying update operation uOpuOp with key xx, where k<x<yk<x<y, whose update to the relaxed binary trie is concurrent with pOppOp’s traversal of the relaxed binary trie. The update node created by uOpuOp is encountered by pOppOp either in the U-ALL or in its notify list. This is because either pOppOp will traverse the U-ALL before uOpuOp can remove its update node from the U-ALL, or uOpuOp will notify pOppOp before pOppOp removes its predecessor node from the P-ALL. Unless uOpuOp is a Delete(x)(x) operation whose DEL node is in D𝑟𝑢𝑎𝑙𝑙D_{\mathit{ruall}}, xx is a candidate return value. This gives the following observation: If p1<kp_{1}<k (on line 215), then there is a DEL node in D𝑟𝑢𝑎𝑙𝑙D_{\mathit{ruall}} with key xx such that k<x<yk<x<y.

When the traversal of the relaxed binary trie returns \bot and D𝑟𝑢𝑎𝑙𝑙D_{\mathit{ruall}} is non-empty, pOppOp takes additional steps to guarantee it has a candidate return value at least kk (by executing lines 217 to 237). This is done by using the keys and results of embedded predecessor operations of update operations linearized before the start of pOppOp’s traversal of the relaxed binary trie, and possibly before the start of pOppOp. First, pOppOp determines the predecessor nodes created by the first embedded predecessor operations of DEL nodes in D𝑟𝑢𝑎𝑙𝑙D_{\mathit{ruall}}. If pOppOp encounters one of these predecessor nodes when it traversed the P-ALL, pOppOp sets 𝑝𝑁𝑜𝑑𝑒\mathit{pNode}^{\prime} to be the one it encountered the latest in the P-ALL (on line 221). Note that 𝑝𝑁𝑜𝑑𝑒\mathit{pNode}^{\prime} was announced the earliest among these predecessor nodes and also announced earlier than 𝑝𝑁𝑜𝑑𝑒\mathit{pNode}. Then pOppOp traverses 𝑝𝑁𝑜𝑑𝑒.𝑛𝑜𝑡𝑖𝑓𝑦𝐿𝑖𝑠𝑡\mathit{pNode}^{\prime}.\mathit{notifyList} to determine the update nodes of update operations with key less than yy that notified 𝑝𝑁𝑜𝑑𝑒\mathit{pNode}^{\prime} (on lines 222 to 223). These update nodes are stored in a local sequence, L1L_{1}, and appear in the order in which their notifications were added to 𝑝𝑁𝑜𝑑𝑒.𝑛𝑜𝑡𝑖𝑓𝑦𝐿𝑖𝑠𝑡\mathit{pNode}^{\prime}.\mathit{notifyList}.

Next, pOp traverses pNode.notifyList to determine the update nodes of update operations with key less than y that notified pNode. Those belonging to notifications whose notifyThreshold is greater than or equal to the key of the notification are stored in a local sequence, L2 (on line 227), while the others are stored in a local set, L′ (on line 229). The update nodes in L2 appear in the order in which their notifications were added to pNode.notifyList, and were added to pNode.notifyList before pOp completed its traversal of the RU-ALL. L2 includes the update nodes of update operations whose notifications were rejected by pOp, and may include some INS nodes of Insert operations whose notifications were accepted by pOp. The local sequence L is L1 followed by L2, excluding the update nodes in L′ (computed on line 230). The update nodes in L′ are excluded from L so that L only contains update nodes belonging to update operations linearized before the start of pOp, or update operations that notified pOp before pOp started its traversal of the relaxed binary trie.

A candidate return value is then computed from L and D_ruall (on lines 231 to 237). If pNode′ was set, let C be the configuration immediately after pNode′ was announced; otherwise let C be the configuration immediately after pNode was announced. Each key w∈R is in S sometime between C and the start of pOp's traversal of the relaxed binary trie. However, it may be deleted from S before pOp begins its traversal of the relaxed binary trie. For example, if the last update node in L with key w is a DEL node, then w was deleted from S before pOp begins its traversal of the relaxed binary trie. Such a key is removed from R and replaced with the value of the second embedded predecessor stored in this DEL node (on line 235). For the same reason, the keys of DEL nodes in D_ruall are removed from R (on line 236). The largest key remaining in R (and we can guarantee that R is non-empty at this point) is a candidate return value of pOp.

We next explain in more detail why pOp determines a candidate return value that is at least k. Suppose that immediately after pOp has completed collecting notifications, it has not determined a candidate return value that is at least k. In other words, p1<k (on line 215). Consider the Insert(k) operation, iOp, that last added k to S prior to the start of pOp's traversal of the relaxed binary trie. The INS node of iOp was not seen when pOp traversed the U-ALL and pOp did not accept a notification from iOp, so iOp must have completed sometime before the start of pOp's traversal of the relaxed binary trie. We show that, on line 237, R contains a value which is at least k and, hence, pOp returns a value which is at least k (on line 238).

Suppose iOp is linearized after C. Since iNode∉I_uall∪I_notify, if iOp notifies pNode, the notification must be rejected, and hence iOp notified pOp when pNode.RuallPosition pointed to an update node with key greater than k. It follows that iNode is added to L2 on line 227. Otherwise iOp does not notify pNode, so it notifies pNode′, and iNode is added to L1. In either case, iNode is included in L. So k is the key of an INS node in L, and hence k is added to R on line 233. By assumption, there are no Delete(k) operations linearized after iOp and before the end of pOp's traversal of the relaxed binary trie. Since L only contains the update nodes of update operations linearized before the start of pOp's traversal of the relaxed binary trie, the last update node with key k in L is iOp's INS node. So k is not removed from R on line 235. Furthermore, D_ruall only contains the DEL nodes of Delete operations linearized before the start of pOp's traversal of the relaxed binary trie. For contradiction, suppose there is a DEL node in D_ruall with key k. Then pOp encountered this DEL node in the RU-ALL and simultaneously set pNode.RuallPosition to point to an update node with key k. From this point on, pOp accepts all notifications from Insert(k) operations. When pOp put this DEL node into D_ruall, it was the first activated update node in latest[k], so its Delete(k) operation was the most recent S-modifying update operation with key k at that point. Therefore, iOp was linearized after this point. Hence, either iOp notified pOp or pOp encountered iOp's INS node when it traversed the U-ALL. This contradicts the fact that iNode∉I_uall∪I_notify. Thus, k is not removed from R on line 236, and R contains a value which is at least k.

Now suppose iOp was linearized before C. So k∈S in all configurations between C and the end of pOp's traversal of the relaxed binary trie. Note that C occurs before the start of the first embedded predecessor operation of any DEL node in D_ruall. Recall the observation that when p1<k (on line 215), there is a DEL node dNode∈D_ruall with key x such that k<x<y. The first embedded predecessor of the Delete(x) operation, dOp_x, that created dNode begins after C. From the code, this embedded predecessor operation completes before dNode is added to the RU-ALL. Since pOp added dNode to D_ruall while it traversed the RU-ALL, dNode was added to the RU-ALL before pOp began its traversal of the relaxed binary trie. The first embedded predecessor of dOp_x returns a value k′ such that k≤k′<x, because k∈S throughout its execution interval. This value will be added to R on line 231. So R contains at least one value that is at least k at this point.

We will prove (in Lemma 5.27) the following claim: if a key that is at least k is removed from R on line 235 during some iteration of pOp's for-loop on line 232, a smaller key that is at least k will be added to R in the same or a later iteration of the for-loop. Since R contains at least one value that is at least k before the for-loop on line 232, this claim implies that R contains a value that is at least k after the for-loop. Let k″ be the smallest value k″≥k that is in R immediately before line 236 (i.e. immediately after pOp completes its local traversal of L during the for-loop on line 232). Suppose, for contradiction, that k″ is removed from R on line 236. Then there exists a DEL node, dNode′∈D_ruall, such that dNode′.key=k″. By the definition of C, the first embedded predecessor of the Delete operation that created dNode′ begins after C. So this first embedded predecessor returns a key k‴ where k≤k‴<k″. The claim implies that, immediately before line 236, R contains a key that is at least k and at most k‴. Since k‴<k″, this contradicts the definition of k″. Therefore, k″ remains in R, and p0 is set to a value at least k″≥k on line 237.

5.3 Linearizability

This section shows that our implementation of the lock-free binary trie is linearizable. We first prove basic properties about the latest lists in Section 5.3.1. We then show that our implementation is linearizable with respect to Search, Insert, and Delete operations in Section 5.3.2.

Recall that, in a configuration C, the predecessor of y is the key w such that w∈S and there is no key x∈S with w<x<y, or −1 if there is no key in S smaller than y. We show that if a Predecessor(y) operation returns w, then there is a configuration C during its execution interval in which w is the predecessor of y. Thus, to show that our implementation is linearizable, each completed Predecessor(y) operation can be linearized at any such configuration.
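Written as a formula, where S(C) denotes the set S in configuration C (notation used only for this restatement):

    \mathrm{pred}_C(y) \;=\;
    \begin{cases}
      \max\{\,x \in S(C) : x < y\,\} & \text{if some } x \in S(C) \text{ satisfies } x < y,\\
      -1 & \text{otherwise.}
    \end{cases}

The requirement is then that a Predecessor(y) operation returning w can be linearized at some configuration C in its execution interval with w = pred_C(y).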

Recall that our implementation of Predecessor, described in Section 5.2.4, determines a number of candidate return values, and returns the largest of them. In Section 5.3.3, we first define three properties, denoted Properties 1, 2, and 3, that these candidate return values will satisfy. Additionally, we prove that any implementation of Predecessor whose candidate return values satisfy these properties, together with our implementations of Search, Insert, and Delete, results in a linearizable implementation of a lock-free binary trie. In Section 5.3.4, we show that the candidate return values determined by our implementation satisfy Property 1. We prove that Properties 2 and 3 are satisfied in Sections 5.3.5 and 5.3.6.

5.3.1 Properties of the Latest Lists

In this section, we prove basic facts about the latest[x]\textit{latest}[x] lists, for each xUx\in U. They are used to show the linearizability of TrieInsert, TrieDelete, and TrieSearch.

Lemma 5.1.

Let uNode be an update node in latest[x]. If uNode.status=Inactive, then uNode is not the last node in latest[x] (i.e. uNode.latestNext≠⊥).

Proof.

When uNode is initialized, its status is Inactive and its latestNext field is also initialized to an update node. By inspection of the algorithms for Insert and Delete, the latestNext field of an update node is only set to ⊥ after the status of that update node has been changed from Inactive to Active. ∎

Lemma 5.2.

For each xUx\in U, the length of latest[x]\textit{latest}[x] is either 1 or 2.

Proof.

Initially, latest[x].ℎ𝑒𝑎𝑑\textit{latest}[x].\mathit{head} points to a dummy DEL node whose 𝑙𝑎𝑡𝑒𝑠𝑡𝑁𝑒𝑥𝑡\mathit{latestNext} is set to \bot. Let 𝑢𝑁𝑜𝑑𝑒\mathit{uNode} be the update node pointed to by latest[x].ℎ𝑒𝑎𝑑\textit{latest}[x].\mathit{head}. New update nodes are only added to latest[x]\textit{latest}[x] by updating latest[x].ℎ𝑒𝑎𝑑\textit{latest}[x].\mathit{head} to point to a different update node 𝑢𝑁𝑜𝑑𝑒\mathit{uNode}^{\prime} using CAS(latest[x].ℎ𝑒𝑎𝑑,𝑢𝑁𝑜𝑑𝑒,𝑢𝑁𝑜𝑑𝑒)(\textit{latest}[x].\mathit{head},\mathit{uNode},\mathit{uNode}^{\prime}) (on line  155 or 177), where 𝑢𝑁𝑜𝑑𝑒.𝑙𝑎𝑡𝑒𝑠𝑡𝑁𝑒𝑥𝑡=𝑢𝑁𝑜𝑑𝑒\mathit{uNode}^{\prime}.\mathit{latestNext}=\mathit{uNode}. Immediately before this CAS, 𝑢𝑁𝑜𝑑𝑒.𝑙𝑎𝑡𝑒𝑠𝑡𝑁𝑒𝑥𝑡\mathit{uNode}.\mathit{latestNext} is set to \bot. ∎

Lemma 5.3.

Let τ\tau be a completed instance of FindLatest(x)(x) that returns an update node \ell. Then there is a configuration during τ\tau in which \ell is the first activated node in latest[x]\textit{latest}[x].

Proof.

Suppose that the operation opop reads that .𝑠𝑡𝑎𝑡𝑢𝑠Inactive\ell.\mathit{status}\neq\textsc{Inactive} on line 105. Then \ell is the first activated node in latest[x]\textit{latest}[x] some time between the read of the pointer to \ell and the read that .𝑠𝑡𝑎𝑡𝑢𝑠Inactive\ell.\mathit{status}\neq\textsc{Inactive}. Since opop returns \ell, the lemma holds.

Now suppose op reads that ℓ.status=Inactive. Suppose op reads that ℓ.latestNext=⊥ on line 107. Lemma 5.1 implies that the status of ℓ was changed to Active sometime between op's read that ℓ.status=Inactive and its read that ℓ.latestNext=⊥. Since op returns ℓ, the lemma holds.

So opop reads that .𝑙𝑎𝑡𝑒𝑠𝑡𝑁𝑒𝑥𝑡=m\ell.\mathit{latestNext}=m\neq\bot on line 107. Then mm is an activated update node. Once .𝑙𝑎𝑡𝑒𝑠𝑡𝑁𝑒𝑥𝑡\ell.\mathit{latestNext} is initialized to point to mm, it does not change to any other value except for \bot. So mm is the first activated update node in latest[x]\textit{latest}[x] immediately after opop reads .𝑠𝑡𝑎𝑡𝑢𝑠=Inactive\ell.\mathit{status}=\textsc{Inactive}. Since opop returns mm, the lemma holds. ∎

Lemma 5.4.

Let τ\tau be a completed instance of FirstActivated(v)(v), where vv is a pointer to an activated update node. If τ\tau returns True, there is a configuration during τ\tau in which vv is the first activated node in latest[v.𝑘𝑒𝑦]\textit{latest}[v.\mathit{key}].

Proof.

Let \ell be the pointer to the update node pointed to by latest[v.𝑘𝑒𝑦].ℎ𝑒𝑎𝑑\textit{latest}[v.\mathit{key}].\mathit{head} read on line 114. Suppose τ\tau returns True because =v\ell=v. Then vv is the first activated node in latest[v.𝑘𝑒𝑦]\textit{latest}[v.\mathit{key}] immediately after the read of \ell.

So suppose τ\tau returns True because .status=Inactive\ell.status=\textsc{Inactive} and v=.𝑙𝑎𝑡𝑒𝑠𝑡𝑁𝑒𝑥𝑡v=\ell.\mathit{latestNext}. Then vv is the first activated update node in latest[v.𝑘𝑒𝑦]\textit{latest}[v.\mathit{key}] in the configuration immediately after the read of .status=Inactive\ell.status=\textsc{Inactive}. ∎

Lemma 5.5.

Let τ\tau be a completed instance of FirstActivated(v)(v), where vv is a pointer to an activated update node. If τ\tau returns False, there is a configuration during τ\tau in which vv is not the first activated node in latest[v.𝑘𝑒𝑦]\textit{latest}[v.\mathit{key}].

Proof.

Let \ell be the pointer to the update node pointed to by latest[v.𝑘𝑒𝑦].ℎ𝑒𝑎𝑑\textit{latest}[v.\mathit{key}].\mathit{head} read on line 114. Suppose τ\tau returns False because v\ell\neq v and .𝑠𝑡𝑎𝑡𝑢𝑠=Active\ell.\mathit{status}=\textsc{Active}. Then in the configuration immediately after the read of .𝑠𝑡𝑎𝑡𝑢𝑠\ell.\mathit{status}, vv is not the first activated update node in latest[v.𝑘𝑒𝑦]\textit{latest}[v.\mathit{key}].

So suppose τ\tau returns False because v\ell\neq v and v.𝑙𝑎𝑡𝑒𝑠𝑡𝑁𝑒𝑥𝑡v\neq\ell.\mathit{latestNext}. By Lemma 5.2, latest[v.𝑘𝑒𝑦]\textit{latest}[v.\mathit{key}] has length at most 2, and update nodes removed from latest[v.𝑘𝑒𝑦]\textit{latest}[v.\mathit{key}] are never added back. Then in the configuration immediately after the read of .𝑙𝑎𝑡𝑒𝑠𝑡𝑁𝑒𝑥𝑡\ell.\mathit{latestNext}, vv is not in latest[v.𝑘𝑒𝑦]\textit{latest}[v.\mathit{key}], and hence not the first activated update node in latest[v.𝑘𝑒𝑦]\textit{latest}[v.\mathit{key}]. ∎
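Likewise, Lemmas 5.4 and 5.5 correspond to the following sketch of FirstActivated(v) under the same illustrative types; the read of latest[v.key].head is the one referred to as line 114.

  // Sketch of FirstActivated(v): v is first activated if it is at the head of latest[v.key],
  // or if the head is still inactive and v sits directly behind it.
  bool firstActivated(LatestList& latest_key_v, UpdateNode* v) {
      UpdateNode* l = latest_key_v.head.load();                // line 114
      if (l == v) return true;
      return l->status.load() == UpdateNode::Status::Inactive &&
             l->latestNext.load() == v;
  }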

5.3.2 Linearizability of Insert, Delete, and Search

A Search(x)(x) operation that returns True is linearized in any configuration during its execution interval in which xSx\in S. The next lemma proves such a configuration exists.

Lemma 5.6.

Suppose opop is a Search(x)(x) operation that returns True. Then there exists a configuration during opop in which xSx\in S.

Proof.

Search(x)(x) begins by calling FindLatest(x)(x) on line 109, which returns an update node 𝑢𝑁𝑜𝑑𝑒\mathit{uNode}. By Lemma 5.3, there is a configuration CC during this instance of FindLatest(x)(x) in which 𝑢𝑁𝑜𝑑𝑒\mathit{uNode} is the first activated node in latest[x]\textit{latest}[x]. Since opop returned True, it read that 𝑢𝑁𝑜𝑑𝑒.𝑡𝑦𝑝𝑒=INS\mathit{uNode}.\mathit{type}=\textsc{INS}. By definition, xSx\in S in CC. ∎

A Search(x)(x) operation that returns False is linearized in any configuration during its execution interval in which xSx\notin S.

Lemma 5.7.

Suppose opop is a Search(x)(x) operation that returns False. Then there exists a configuration during opop in which xSx\notin S.

Proof.

Search(x)(x) begins by calling FindLatest(x)(x) on line 109, which returns an update node 𝑢𝑁𝑜𝑑𝑒\mathit{uNode}. By Lemma 5.3, there is a configuration CC during this instance of FindLatest(x)(x) in which 𝑢𝑁𝑜𝑑𝑒\mathit{uNode} is the first activated node in latest[x]\textit{latest}[x]. Since opop returned False, it read that 𝑢𝑁𝑜𝑑𝑒.𝑡𝑦𝑝𝑒=DEL\mathit{uNode}.\mathit{type}=\textsc{DEL}. By definition, xSx\notin S in CC. ∎
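Both lemmas reflect how little Search(x) does: it only inspects the first activated update node of latest[x]. A minimal sketch follows, reusing findLatest from the earlier sketch (the name searchSketch is ours).

  // Sketch of Search(x): x is in S exactly when the first activated update node in
  // latest[x] is an INS node (Lemmas 5.6 and 5.7).
  bool searchSketch(LatestList& latest_x) {
      UpdateNode* u = findLatest(latest_x);                    // line 109
      return u->type == UpdateNode::Type::INS;
  }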

Recall that the linearization point of an SS-modifying Insert or Delete operation is immediately after the status of the update node it created is changed from Inactive to Active.

An Insert(x)(x) operation that is not SS-modifying does not update latest[x]\textit{latest}[x] to point to its own update node. This happens when it reads that the first activated update node in latest[x]\textit{latest}[x] is an INS node, or when it performs an unsuccessful CAS. In the following two lemmas, we prove that for each of these two cases, there is a configuration during the Insert(x)(x) operation in which xSx\in S, and hence it does not need to add xx to SS. Likewise, a Delete(x)(x) operation may return early before activating its update node because there is a configuration in which xSx\notin S, and hence it does not need to remove xx from SS.

Lemma 5.8.

Suppose uOpuOp is an Insert(x)(x) operation (or a Delete(x)(x) operation) that returns on line 150 (or on line 168). Then there is a configuration during uOpuOp in which xSx\in S (or xSx\notin S).

Proof.

Insert(x)(x) begins by calling FindLatest(x)(x) on line 28 (or line 167 of Delete), which returns an update node 𝑢𝑁𝑜𝑑𝑒\mathit{uNode}. By Lemma 5.3, there is a configuration CC during this instance of FindLatest(x)(x) in which 𝑢𝑁𝑜𝑑𝑒\mathit{uNode} is the first activated node in latest[x]\textit{latest}[x]. Since uOpuOp returned on line 150 (or on line 168 for Delete), it saw that 𝑢𝑁𝑜𝑑𝑒.𝑡𝑦𝑝𝑒=INS\mathit{uNode}.\mathit{type}=\text{INS} (or 𝑢𝑁𝑜𝑑𝑒.𝑡𝑦𝑝𝑒=DEL\mathit{uNode}.\mathit{type}=\text{DEL} for Delete). By definition, xSx\in S (or xSx\notin S) in CC. ∎

Lemma 5.9.

Suppose uOpuOp is an Insert(x)(x) operation (or a Delete(x)(x) operation) that returns on line 157 (or on line 180). Then there is a configuration during uOpuOp in which xSx\in S (or xSx\notin S).

Proof.

We prove the case when uOpuOp is an Insert(x)(x) operation. The Delete(x)(x) case follows similarly.

Since uOpuOp performs an unsuccessful CAS on line 155, some other Insert(x)(x) operation added its INS node 𝑖𝑁𝑜𝑑𝑒\mathit{iNode} to the head of latest[x]\textit{latest}[x] since uOpuOp last read that latest[x].head\textit{latest}[x].head pointed to a 𝑑𝑁𝑜𝑑𝑒\mathit{dNode}. So uOpuOp calls HelpActivate on the update node pointed to by latest[x].head\textit{latest}[x].head. If this update node is still 𝑖𝑁𝑜𝑑𝑒\mathit{iNode} and it is inactive, then 𝑖𝑁𝑜𝑑𝑒\mathit{iNode} will be activated by uOpuOp on line 119. If the update node pointed to by latest[x].head\textit{latest}[x].head is not 𝑖𝑁𝑜𝑑𝑒\mathit{iNode}, then 𝑖𝑁𝑜𝑑𝑒\mathit{iNode} was already activated by some other operation before it was replaced at the head of latest[x]\textit{latest}[x]. In either case, in the configuration immediately after 𝑖𝑁𝑜𝑑𝑒\mathit{iNode} is activated, xSx\in S. ∎
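The control flow used in Lemmas 5.8 and 5.9 can be summarized by the following sketch of the Insert(x) decision structure. It is only an illustration under the types introduced earlier: makeNode is our helper, activate stands in for the activation CAS of HelpActivate (line 119), and the work a real Insert performs on the U-ALL, RU-ALL, notifications, and the relaxed binary trie is omitted.

  // Allocate a fresh, inactive update node (illustrative helper).
  UpdateNode* makeNode(int key, UpdateNode::Type t) {
      auto* u = new UpdateNode;
      u->key = key;
      u->type = t;
      return u;
  }

  // Sketch of the Insert(x) decision structure discussed in Lemmas 5.8 and 5.9.
  void insertSketch(LatestList& latest_x, int x) {
      UpdateNode* dNode = findLatest(latest_x);
      if (dNode->type == UpdateNode::Type::INS)
          return;                                    // x already in S: not S-modifying (line 150)
      UpdateNode* iNode = makeNode(x, UpdateNode::Type::INS);
      if (!installAtHead(latest_x, dNode, iNode)) {  // head CAS of line 155
          activate(latest_x.head.load());            // HelpActivate the Insert(x) that won the CAS
          return;                                    // return of line 157
      }
      activate(iNode);                               // activation: the operation's linearization point
  }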

5.3.3 Properties of Candidate Return Values

In this section, we state properties of the candidate return values of a Predecessor(y)(y) operation, and prove that the value returned by this operation is correct, assuming these properties hold.

Each candidate return value of a Predecessor operation is a value in U{1}U\cup\{-1\}. Recall that Predecessor operations announce themselves at the start of their operation, and remove the announcement at the end of their operation. Before this announcement is removed, they notify Predecessor operations that have announced themselves. Intuitively, the candidate return values of a Predecessor operation are a subset of the values read from its traversal of the relaxed binary trie, the announcements, or its notifications.

In the following properties, we let pOppOp be a completed Predecessor(y)(y) operation. Let CTC_{T} be the configuration immediately before pOppOp begins its traversal of the relaxed binary trie.

Property 1.

All candidate return values of pOppOp are less than yy, and pOppOp returns its largest candidate return value.

The next property states that, for each of pOppOp’s candidate return values w1w\neq-1, there is a configuration CC during pOppOp in which wSw\in S. Furthermore, it states that the keys of certain update operations that are concurrent with pOppOp are also candidate return values of pOppOp. This property (together with the next property) is used to argue that all keys between ww and yy that are in SS in CC are candidate return values of pOppOp.

Property 2.

Suppose w1w\neq-1 is a candidate return value of pOppOp. Then there is a configuration CC during pOppOp such that

  1. (a)

    wSw\in S,

  2. (b)

    if CC occurs before CTC_{T} and there exists an SS-modifying Delete(x)(x) operation linearized between CC and CTC_{T} with w<x<yw<x<y, then pOppOp has a candidate return value which is at least xx, and

  3. (c)

    if CC occurs after CTC_{T} and there exists an SS-modifying Insert(x)(x) operation linearized between CTC_{T} and CC with w<x<yw<x<y, then pOppOp has a candidate return value which is at least xx.

The next property states that pOppOp should learn about keys xx in SS that have been added to SS before the start of pOppOp’s traversal of the binary trie. If pOppOp does not learn about xx, perhaps because there are no concurrent update operations with key xx, then pOppOp’s traversal of the relaxed binary trie returns a key that is at least xx.

Property 3.

Suppose an SS-modifying Insert(x)(x) operation iOpiOp is linearized before CTC_{T}, x<yx<y, and there are no SS-modifying Delete(x)(x) operations linearized after iOpiOp and before CTC_{T}. Then pOppOp has a candidate return value which is at least xx.

In a configuration CC, the predecessor of yy is 1-1 if there is no key in SS smaller than yy; otherwise it is the key ww such that wSw\in S and there is no key xSx\in S where w<x<yw<x<y. The next theorem states that the value returned by pOppOp is the predecessor of yy in some configuration during pOppOp. This configuration is the linearization point of pOppOp. So any predecessor algorithm that satisfies Property 1, Property 2, and Property 3 results in a linearizable implementation.
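For reference, the sequential notion of predecessor used in Theorem 5.10 below can be restated as a few lines of code. The snippet is only the definition above, with std::set standing in for SS; it is not part of the concurrent algorithm.

  #include <iterator>
  #include <set>

  // Predecessor of y with respect to a key set S: the largest key in S smaller than y,
  // or -1 if no such key exists.
  int predecessorSpec(const std::set<int>& S, int y) {
      auto it = S.lower_bound(y);          // first key >= y
      if (it == S.begin()) return -1;      // no key smaller than y
      return *std::prev(it);               // largest key < y
  }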

Theorem 5.10.

If pOppOp returns wU{1}w\in U\cup\{-1\}, then there exists a configuration during pOppOp in which ww is the predecessor of yy.

Proof.

Suppose pOppOp returns 1-1. We show that there is no key xSx\in S in CTC_{T}, where x<yx<y. Suppose, for contradiction, that there is a key xSx\in S in CTC_{T}, where x<yx<y. Let iOpiOp be the Insert(x)(x) operation that last added xx to SS before CTC_{T}. So there are no Delete(x)(x) operations linearized after iOpiOp but before CTC_{T}. By Property 3, there is a key xx^{\prime} where xx<yx\leq x^{\prime}<y that is a candidate return value of pOppOp. This contradicts Property 1.

So suppose pOppOp returns wUw\in U. By Property 1, ww is a candidate return value of pOppOp. Let CC be the configuration during pOppOp defined in Property 2. By Property 2(a), wSw\in S in CC. To show that ww is the predecessor of yy in CC, it remains to show that there is no key xSx\in S in CC, where w<x<yw<x<y.

Suppose, for contradiction, that there is a key xSx\in S in CC, where w<x<yw<x<y. Let iOpiOp be the Insert(x)(x) operation that last added xx to SS before CC. First, suppose iOpiOp is linearized after CTC_{T}. Since iOpiOp is linearized between CTC_{T} and CC, it follows from Property 2(c) that pOppOp has a candidate return value that is at least xx. This contradicts Property 1. Now suppose iOpiOp is linearized before CTC_{T}. If xSx\in S in all configurations from the linearization point of iOpiOp to CTC_{T}, then Property 3 states there is a key xx^{\prime} that is a candidate return value of pOppOp, where w<xx<yw<x\leq x^{\prime}<y. This contradicts Property 1. So xSx\notin S in some configuration between the linearization point of iOpiOp and CTC_{T}. Since xSx\in S in all configurations from the linearization point of iOpiOp to CC, it follows that CC occurs before CTC_{T} and xx is removed from SS by a Delete(x)(x) operation dOpdOp linearized sometime between CC and CTC_{T}. By Property 2(b), pOppOp has a candidate return value that is at least xx. This contradicts Property 1. Therefore, in any case, ww is the predecessor of yy in CC. ∎

5.3.4 Our Implementation Satisfies Property 1

Recall that our implementation of Predecessor(y)(y) performs a single instance of PredHelper(y)(y), and then returns the result of this instance. The candidate return values of a Predecessor(y)(y) operation are equal to the candidate return values determined by its instance of PredHelper(y)(y). We prove that the candidate return values determined by an instance of PredHelper(y)(y) satisfy the properties, and hence are also satisfied by the Predecessor(y)(y) operation that invoked it. We prove properties about PredHelper(y)(y) because it is also used by Delete(y)(y) operations when performing embedded predecessor operations.

For the remainder of Section 5.3, we let α\alpha be an arbitrary execution of our implementation, and let pOppOp be an arbitrary completed instance of PredHelper(y)(y) in α\alpha. Our proof is by induction on the order in which instances of PredHelper are completed in α\alpha. In particular, we assume that all instances of PredHelper that are completed before pOppOp in α\alpha satisfy Property 1, Property 2, and Property 3.

Let 𝑝𝑁𝑜𝑑𝑒\mathit{pNode} be the predecessor node created by pOppOp. We let I𝑟𝑢𝑎𝑙𝑙I_{\mathit{ruall}}, I𝑢𝑎𝑙𝑙I_{\mathit{uall}}, I𝑛𝑜𝑡𝑖𝑓𝑦I_{\mathit{notify}}, D𝑟𝑢𝑎𝑙𝑙D_{\mathit{ruall}}, D𝑢𝑎𝑙𝑙D_{\mathit{uall}}, and D𝑛𝑜𝑡𝑖𝑓𝑦D_{\mathit{notify}} be the sets of update nodes corresponding to pOppOp’s local variables with the same name. Recall that I𝑟𝑢𝑎𝑙𝑙I_{\mathit{ruall}} and D𝑟𝑢𝑎𝑙𝑙D_{\mathit{ruall}} are update nodes obtained from pOppOp’s traversal of the RU-ALL, I𝑢𝑎𝑙𝑙I_{\mathit{uall}} and D𝑢𝑎𝑙𝑙D_{\mathit{uall}} are update nodes obtained from pOppOp’s traversal of the U-ALL, and I𝑛𝑜𝑡𝑖𝑓𝑦I_{\mathit{notify}} and D𝑛𝑜𝑡𝑖𝑓𝑦D_{\mathit{notify}} are update nodes obtained from pOppOp’s traversal of its notify list (i.e. 𝑝𝑁𝑜𝑑𝑒.𝑛𝑜𝑡𝑖𝑓𝑦𝐿𝑖𝑠𝑡\mathit{pNode}.\mathit{notifyList}). Recall that the keys of update nodes in I𝑢𝑎𝑙𝑙I𝑛𝑜𝑡𝑖𝑓𝑦(D𝑢𝑎𝑙𝑙D𝑟𝑢𝑎𝑙𝑙)(D𝑛𝑜𝑡𝑖𝑓𝑦D𝑟𝑢𝑎𝑙𝑙)I_{\mathit{uall}}\cup I_{\mathit{notify}}\cup(D_{\mathit{uall}}-D_{\mathit{ruall}})\cup(D_{\mathit{notify}}-D_{\mathit{ruall}}) are candidate return values of pOppOp.

There may be one additional candidate return value from pOppOp’s traversal of the relaxed binary trie. When pOppOp’s traversal of the relaxed binary trie returns a value p0p_{0}\neq\bot, p0p_{0} is a candidate return value. If the traversal of the relaxed binary trie returns \bot and D𝑟𝑢𝑎𝑙𝑙D_{\mathit{ruall}}\neq\emptyset, then the value pOppOp computes for p0p_{0} from lines 217 to 237 is a candidate return value.
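The final choice made on line 238 is then simply the maximum over these candidates. The sketch below shows only that last step, representing the candidate sets as vectors of keys (the implementation stores update nodes) and taking p0 as already computed; passing p0 = -1 models the case where the traversal contributes no candidate, since -1 is also the smallest possible return value.

  #include <algorithm>
  #include <vector>

  // Sketch of the final step of PredHelper(y) (line 238): return the largest candidate.
  int chooseReturnValue(const std::vector<int>& I_uall,
                        const std::vector<int>& I_notify,
                        const std::vector<int>& D_uall_minus_ruall,
                        const std::vector<int>& D_notify_minus_ruall,
                        int p0) {
      int best = p0;
      for (const std::vector<int>* keys :
           {&I_uall, &I_notify, &D_uall_minus_ruall, &D_notify_minus_ruall})
          for (int k : *keys)
              best = std::max(best, k);
      return best;
  }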

It is easy to show that our algorithm satisfies Property 1.

Lemma 5.11.

All candidate return values of pOppOp are less than yy, and pOppOp returns its largest candidate return value.

Proof.

The maximum candidate return value of pOppOp is returned on line 238. It is either a key of an update node in I𝑢𝑎𝑙𝑙I𝑛𝑜𝑡𝑖𝑓𝑦(D𝑢𝑎𝑙𝑙D𝑟𝑢𝑎𝑙𝑙)(D𝑛𝑜𝑡𝑖𝑓𝑦D𝑟𝑢𝑎𝑙𝑙)I_{\mathit{uall}}\cup I_{\mathit{notify}}\cup(D_{\mathit{uall}}-D_{\mathit{ruall}})\cup(D_{\mathit{notify}}-D_{\mathit{ruall}}), or pOppOp’s local variable p0p_{0}.

The update nodes in I𝑢𝑎𝑙𝑙I_{\mathit{uall}} and D𝑢𝑎𝑙𝑙D_{\mathit{uall}} are those returned by TraverseUAll(y)(y) on line 203. By the check on line 126, these update nodes have key less than yy. Update nodes in I𝑛𝑜𝑡𝑖𝑓𝑦I_{\mathit{notify}} and D𝑛𝑜𝑡𝑖𝑓𝑦D_{\mathit{notify}} have keys less than yy by the check in the while loop on line 205. By the specification of RelaxedPredecessor(y)(y), the value p0p_{0} returned by RelaxedPredecessor(y)(y) is either less than yy, or \bot.

When RelaxedPredecessor(y)(y) returns \bot, p0p_{0} is calculated from the return values of the embedded predecessors of DEL nodes in D𝑟𝑢𝑎𝑙𝑙D_{\mathit{ruall}}, or from the keys of update nodes in a list LL. DEL nodes in D𝑟𝑢𝑎𝑙𝑙D_{\mathit{ruall}} have keys less than yy according to line 250. The embedded predecessors of nodes in D𝑟𝑢𝑎𝑙𝑙D_{\mathit{ruall}} are the return values of completed instances of PredHelper(x)(x), for some key x<yx<y. By the assumption that all completed instances of PredHelper satisfy Property 1, PredHelper(x)(x) returns a value less than xx, which is also less than yy. Only notifications of these embedded predecessors with keys less than yy are considered (on lines 222 and 225). So p0p_{0} is a value less than yy. It follows from the code on lines 222 and 225 that the update nodes added to LL have keys less than yy. So all candidate return values of pOppOp are less than yy. ∎

5.3.5 Our Implementation Satisfies Property 2

We next define several configurations during pOppOp that are used for the remainder of Section 5.3. Recall that during TraverseRUAll, pOppOp traverses the RU-ALL by atomically reading the next update node in the list and writing that pointer into 𝑝𝑁𝑜𝑑𝑒.𝑅𝑢𝑎𝑙𝑙𝑃𝑜𝑠𝑖𝑡𝑖𝑜𝑛\mathit{pNode}.\mathit{RuallPosition}. For any key xUx\in U, we let C<xC_{<x} be the configuration immediately after pOppOp first atomically reads a pointer to an update node with key less than xx and writes it into 𝑝𝑁𝑜𝑑𝑒.𝑅𝑢𝑎𝑙𝑙𝑃𝑜𝑠𝑖𝑡𝑖𝑜𝑛\mathit{pNode}.\mathit{RuallPosition} during TraverseRUAll. Let CxC_{\leq x} be the configuration immediately after pOppOp first atomically reads a pointer to an update node with key less than or equal to xx and writes it into 𝑝𝑁𝑜𝑑𝑒.𝑅𝑢𝑎𝑙𝑙𝑃𝑜𝑠𝑖𝑡𝑖𝑜𝑛\mathit{pNode}.\mathit{RuallPosition} during TraverseRUAll. Let CTC_{T} be the configuration immediately before pOppOp starts its traversal of the relaxed binary trie. We let C𝑛𝑜𝑡𝑖𝑓𝑦C_{\mathit{notify}} be the configuration immediately after pOppOp reads the head pointer to the first notify node in 𝑝𝑁𝑜𝑑𝑒.𝑛𝑜𝑡𝑖𝑓𝑦𝐿𝑖𝑠𝑡\mathit{pNode}.\mathit{notifyList} (on line 205). A notify node is seen by pOppOp when it traverses 𝑝𝑁𝑜𝑑𝑒.𝑛𝑜𝑡𝑖𝑓𝑦𝐿𝑖𝑠𝑡\mathit{pNode}.\mathit{notifyList} (on line 205) if and only if it is added into 𝑝𝑁𝑜𝑑𝑒.𝑛𝑜𝑡𝑖𝑓𝑦𝐿𝑖𝑠𝑡\mathit{pNode}.\mathit{notifyList} before C𝑛𝑜𝑡𝑖𝑓𝑦C_{\mathit{notify}}.
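The RU-ALL traversal that defines these configurations can be sketched as follows, reusing the illustrative types from the earlier sketches. The RU-ALL is shown as a plain linked list, the atomic read-and-write into pNode.RuallPosition is simplified to a read followed by a store, and the collection of I_ruall is omitted, so this is only meant to fix intuition rather than reproduce the algorithm.

  #include <vector>

  struct NotifyNode;                                  // defined in a later sketch

  // The announcement record of a Predecessor operation (illustrative fields only).
  struct PredNode {
      std::atomic<UpdateNode*> RuallPosition{nullptr};
      std::atomic<NotifyNode*> notifyListHead{nullptr};
  };

  struct RuallNode { UpdateNode* u; RuallNode* next; };

  // Sketch of TraverseRUAll(y): walk the RU-ALL (keys in non-increasing order), recording
  // the position reached in pNode.RuallPosition, and collect DEL nodes with key < y that
  // are still first activated (the checks of lines 250 and 251).
  void traverseRuallSketch(RuallNode* ruallHead, int y, PredNode* pNode,
                           LatestList* latest /* indexed by key */,
                           std::vector<UpdateNode*>& D_ruall) {
      for (RuallNode* n = ruallHead; n != nullptr; n = n->next) {
          pNode->RuallPosition.store(n->u);           // the writes defining C_<x and C_<=x
          if (n->u->key < y && n->u->type == UpdateNode::Type::DEL &&
              firstActivated(latest[n->u->key], n->u))
              D_ruall.push_back(n->u);
      }
  }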

Because RU-ALL is a linked list of update nodes whose keys are in non-increasing order, C<xC_{<x} occurs at or before CwC_{\leq w} for any two keys w<xw<x. Likewise, CwC_{\leq w} occurs at or before C<wC_{<w}. Since pOppOp performs TraverseRUAll before the start of its traversal of the relaxed binary trie, it follows that for any key xx, C<xC_{<x} occurs before CTC_{T}. Since pOppOp performs its traversal of the relaxed binary trie before it reads the head pointer to the first notify node in 𝑝𝑁𝑜𝑑𝑒.𝑛𝑜𝑡𝑖𝑓𝑦𝐿𝑖𝑠𝑡\mathit{pNode}.\mathit{notifyList}, CTC_{T} occurs before C𝑛𝑜𝑡𝑖𝑓𝑦C_{\mathit{notify}}. This is summarized in the following observation.

Observation 5.12.

Let ww and xx be any two keys in UU where w<xw<x. Then the following statements hold:

  1. (a)

    C<xC_{<x} occurs at or before CwC_{\leq w}.

  2. (b)

    CwC_{\leq w} occurs at or before C<wC_{<w}.

  3. (c)

    C<wC_{<w} occurs before CTC_{T}.

  4. (d)

    CTC_{T} occurs before C𝑛𝑜𝑡𝑖𝑓𝑦C_{\mathit{notify}}.

We next prove several lemmas that say pOppOp’s candidate return values are in SS sometime during its execution interval. This is Property 2(a). We consider several cases, depending on how pOppOp learns about its candidate return value.

Recall that pOppOp traverses the U-ALL during TraverseUall, which returns two sets of update nodes, I𝑢𝑎𝑙𝑙I_{\mathit{uall}} and D𝑢𝑎𝑙𝑙D_{\mathit{uall}}. The next lemma states that the INS nodes in I𝑢𝑎𝑙𝑙I_{\mathit{uall}} returned by TraverseUall have keys in SS sometime during the traversal, while the DEL nodes in D𝑢𝑎𝑙𝑙D_{\mathit{uall}} have keys not in SS sometime during the traversal. This is because an update node 𝑢𝑁𝑜𝑑𝑒\mathit{uNode} is only added to I𝑢𝑎𝑙𝑙I_{\mathit{uall}} or D𝑢𝑎𝑙𝑙D_{\mathit{uall}} after pOppOp checks that 𝑢𝑁𝑜𝑑𝑒\mathit{uNode} is the first activated update node in latest[𝑢𝑁𝑜𝑑𝑒.𝑘𝑒𝑦]\textit{latest}[\mathit{uNode}.\mathit{key}], which determines whether or not 𝑢𝑁𝑜𝑑𝑒.𝑘𝑒𝑦\mathit{uNode}.\mathit{key} is in SS.
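In the same illustrative style, the U-ALL traversal that the next lemmas reason about can be sketched as follows; only the key bound of line 126 and the FirstActivated check of line 127 are shown, and the lock-free list manipulation of the real U-ALL is omitted.

  struct UallNode { UpdateNode* u; UallNode* next; };

  // Sketch of TraverseUall(y): collect the update nodes with key < y that are still the
  // first activated node for their key, split into INS nodes (I_uall) and DEL nodes (D_uall).
  void traverseUallSketch(UallNode* uallHead, int y,
                          LatestList* latest /* indexed by key */,
                          std::vector<UpdateNode*>& I_uall,
                          std::vector<UpdateNode*>& D_uall) {
      for (UallNode* n = uallHead; n != nullptr && n->u->key < y; n = n->next) {  // line 126
          if (firstActivated(latest[n->u->key], n->u))                            // line 127
              (n->u->type == UpdateNode::Type::INS ? I_uall : D_uall).push_back(n->u);
      }
  }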

Lemma 5.13.

For each 𝑢𝑁𝑜𝑑𝑒I𝑢𝑎𝑙𝑙D𝑢𝑎𝑙𝑙\mathit{uNode}\in I_{\mathit{uall}}\cup D_{\mathit{uall}}, there is a configuration CC during pOppOp’s traversal of the U-ALL (in its instance of TraverseUall) in which 𝑢𝑁𝑜𝑑𝑒\mathit{uNode} is the first activated update node in latest[𝑢𝑁𝑜𝑑𝑒.𝑘𝑒𝑦]\textit{latest}[\mathit{uNode}.\mathit{key}]. Furthermore, CC occurs before pOppOp encounters any update nodes with key greater than 𝑢𝑁𝑜𝑑𝑒.𝑘𝑒𝑦\mathit{uNode}.\mathit{key} during this traversal of U-ALL.

Proof.

Consider an update node 𝑢𝑁𝑜𝑑𝑒I𝑢𝑎𝑙𝑙D𝑢𝑎𝑙𝑙\mathit{uNode}\in I_{\mathit{uall}}\cup D_{\mathit{uall}}. It follows from the code that there is an iteration during TraverseUall where FirstActivated(𝑢𝑁𝑜𝑑𝑒)=True\textsc{FirstActivated}(\mathit{uNode})=\textsc{True} on line 127. By Lemma 5.4, there is a configuration CC during this instance of FirstActivated in which 𝑢𝑁𝑜𝑑𝑒\mathit{uNode} is the first activated update node in latest[𝑢𝑁𝑜𝑑𝑒.key]\textit{latest}[\mathit{uNode}.key]. Since U-ALL is sorted by increasing key, CC occurs before pOppOp encounters any update nodes with key greater than 𝑢𝑁𝑜𝑑𝑒.𝑘𝑒𝑦\mathit{uNode}.\mathit{key} during this instance of TraverseUall.

An update operation that notifies pOppOp about 𝑖𝑁𝑜𝑑𝑒I𝑛𝑜𝑡𝑖𝑓𝑦\mathit{iNode}\in I_{\mathit{notify}} with key xx may be from the Insert(x)(x) operation that created 𝑖𝑁𝑜𝑑𝑒\mathit{iNode}. Alternatively, it may be from an Insert(w)(w) or Delete(w)(w) operation, uOpuOp, for some w<xw<x, that included 𝑖𝑁𝑜𝑑𝑒\mathit{iNode} when it notified pOppOp. This happens because 𝑖𝑁𝑜𝑑𝑒\mathit{iNode} has the largest key less than yy among the INS nodes returned by uOpuOp’s instance of TraverseUall on line 133.

We first handle the case where the Insert(x)(x) operation that created 𝑖𝑁𝑜𝑑𝑒\mathit{iNode} notifies pOppOp. We show that xSx\in S some time after pOppOp begins accepting notifications with key xx, but before pOppOp begins collecting its notifications. Intuitively, xSx\in S because an update operation verifies that its update node is still the first activated node in latest[x]\textit{latest}[x] prior to notifying pOppOp, as sketched below.
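This verification step is the heart of the notification mechanism and is sketched below, reusing PredNode and firstActivated from the earlier sketches. The NotifyNode fields follow the names used in this section, but the encoding of notifyThreshold and the single CAS attempt shown here are simplifications of ours rather than the paper's code.

  #include <climits>

  struct NotifyNode {
      UpdateNode* updateNode;        // the notifier's own update node
      UpdateNode* updateNodeMax;     // INS node with the largest key < y seen by the notifier
      int notifyThreshold;           // key pNode.RuallPosition pointed to (assumed encoding)
      NotifyNode* next;
  };

  // Sketch of a single notification attempt (lines 134-147): read the predecessor
  // operation's RU-ALL position, re-check that uNode is still the first activated node
  // for its key, and then try to CAS a notify node onto the notify list.
  bool notifySketch(PredNode* pNode, UpdateNode* uNode, UpdateNode* insMax,
                    LatestList* latest /* indexed by key */) {
      UpdateNode* pos = pNode->RuallPosition.load();                  // read of line 140
      if (!firstActivated(latest[uNode->key], uNode)) return false;   // check of line 146
      auto* n = new NotifyNode{uNode, insMax,
                               pos != nullptr ? pos->key : INT_MIN,   // -infinity once the RU-ALL traversal is done
                               pNode->notifyListHead.load()};
      return pNode->notifyListHead.compare_exchange_strong(n->next, n);  // CAS of line 147
  }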

Lemma 5.14.

Consider an INS node 𝑖𝑁𝑜𝑑𝑒I𝑛𝑜𝑡𝑖𝑓𝑦\mathit{iNode}\in I_{\mathit{notify}} with key xx. Suppose the Insert(x)(x) operation that created 𝑖𝑁𝑜𝑑𝑒\mathit{iNode} notified pOppOp. Then there is a configuration between C<xC_{<x} and C𝑛𝑜𝑡𝑖𝑓𝑦C_{\mathit{notify}} in which xSx\in S.

Proof.

Let iOpiOp be the Insert(x)(x) operation that created 𝑖𝑁𝑜𝑑𝑒\mathit{iNode}. The INS node 𝑖𝑁𝑜𝑑𝑒\mathit{iNode} can be added to I𝑛𝑜𝑡𝑖𝑓𝑦I_{\mathit{notify}} on line 208 because iOpiOp notifies pOppOp about its own operation, or on line 213 if some other update operation includes 𝑖𝑁𝑜𝑑𝑒\mathit{iNode} in its notify node because 𝑖𝑁𝑜𝑑𝑒\mathit{iNode} has the largest key less than yy among the INS nodes returned by its instance of TraverseUall on line 133.

Suppose 𝑖𝑁𝑜𝑑𝑒\mathit{iNode} is added to I𝑛𝑜𝑡𝑖𝑓𝑦I_{\mathit{notify}} on line 208. So iOpiOp or a Delete(x)(x) operation helping iOpiOp notified pOppOp by adding a notify node 𝑛𝑁𝑜𝑑𝑒\mathit{nNode} into 𝑝𝑁𝑜𝑑𝑒.𝑛𝑜𝑡𝑖𝑓𝑦𝐿𝑖𝑠𝑡\mathit{pNode}.\mathit{notifyList} where 𝑛𝑁𝑜𝑑𝑒.𝑢𝑝𝑑𝑎𝑡𝑒𝑁𝑜𝑑𝑒=𝑖𝑁𝑜𝑑𝑒\mathit{nNode}.\mathit{updateNode}=\mathit{iNode}. Let uOpuOp be the update operation that successfully added 𝑛𝑁𝑜𝑑𝑒\mathit{nNode} into 𝑝𝑁𝑜𝑑𝑒.𝑛𝑜𝑡𝑖𝑓𝑦𝐿𝑖𝑠𝑡\mathit{pNode}.\mathit{notifyList} using CAS on line 147. In the line of code prior, uOpuOp successfully checks that 𝑖𝑁𝑜𝑑𝑒\mathit{iNode} is the first activated update node in latest[x]\textit{latest}[x] during FirstActivated(𝑖𝑁𝑜𝑑𝑒)(\mathit{iNode}) on line 146. By Lemma 5.4, there is a configuration CC during FirstActivated(𝑖𝑁𝑜𝑑𝑒)(\mathit{iNode}) in which xSx\in S. Furthermore, xSx\in S in all configurations from when 𝑖𝑁𝑜𝑑𝑒\mathit{iNode} was activated (that is, from when iOpiOp was linearized) to CC. In particular, xSx\in S in the configuration in which uOpuOp reads that 𝑝𝑁𝑜𝑑𝑒\mathit{pNode} is in P-ALL on line 134. Since pOppOp is active when it has a predecessor node in P-ALL, xSx\in S sometime during pOppOp and while uOpuOp is traversing the P-ALL.

Finally, since 𝑖𝑁𝑜𝑑𝑒I𝑛𝑜𝑡𝑖𝑓𝑦\mathit{iNode}\in I_{\mathit{notify}}, when uOpuOp reads 𝑝𝑁𝑜𝑑𝑒.𝑅𝑢𝑎𝑙𝑙𝑃𝑜𝑠𝑖𝑡𝑖𝑜𝑛\mathit{pNode}.\mathit{RuallPosition} on line 140, 𝑝𝑁𝑜𝑑𝑒.𝑅𝑢𝑎𝑙𝑙𝑃𝑜𝑠𝑖𝑡𝑖𝑜𝑛\mathit{pNode}.\mathit{RuallPosition} is a pointer to an update node with key less than xx, so this read occurs after C<xC_{<x}. This read occurs before uOpuOp’s instance of FirstActivated(𝑖𝑁𝑜𝑑𝑒)(\mathit{iNode}), and hence before CC. So CC occurs sometime after C<xC_{<x}. Moreover, 𝑛𝑁𝑜𝑑𝑒\mathit{nNode} is collected by pOppOp when it traverses 𝑝𝑁𝑜𝑑𝑒.𝑛𝑜𝑡𝑖𝑓𝑦𝐿𝑖𝑠𝑡\mathit{pNode}.\mathit{notifyList}, so 𝑛𝑁𝑜𝑑𝑒\mathit{nNode} was added before C𝑛𝑜𝑡𝑖𝑓𝑦C_{\mathit{notify}}; since CC occurs before this addition, CC occurs before C𝑛𝑜𝑡𝑖𝑓𝑦C_{\mathit{notify}}. ∎

We next handle the case where an Insert(w)(w) or Delete(w)(w) operation notifies pOppOp about an INS node it did not create.

Lemma 5.15.

Consider an INS node 𝑖𝑁𝑜𝑑𝑒I𝑛𝑜𝑡𝑖𝑓𝑦\mathit{iNode}\in I_{\mathit{notify}} with key xx. Suppose an update operation with key ww notified pOppOp about 𝑖𝑁𝑜𝑑𝑒\mathit{iNode}, where w<x<yw<x<y. Then there is a configuration between C<xC_{<x} and C𝑛𝑜𝑡𝑖𝑓𝑦C_{\mathit{notify}} in which xSx\in S.

Proof.

From the code, 𝑖𝑁𝑜𝑑𝑒\mathit{iNode} is added to I𝑛𝑜𝑡𝑖𝑓𝑦I_{\mathit{notify}} on line 213. So there is a notify node 𝑛𝑁𝑜𝑑𝑒\mathit{nNode} in 𝑝𝑁𝑜𝑑𝑒.𝑛𝑜𝑡𝑖𝑓𝑦𝐿𝑖𝑠𝑡\mathit{pNode}.\mathit{notifyList} where 𝑖𝑁𝑜𝑑𝑒=𝑛𝑁𝑜𝑑𝑒.𝑢𝑝𝑑𝑎𝑡𝑒𝑁𝑜𝑑𝑒𝑀𝑎𝑥\mathit{iNode}=\mathit{nNode}.\mathit{updateNodeMax}. Let uOpuOp be the update operation that created 𝑛𝑁𝑜𝑑𝑒\mathit{nNode}, and hence the update operation that notified pOppOp about 𝑖𝑁𝑜𝑑𝑒\mathit{iNode}. Let 𝑢𝑁𝑜𝑑𝑒\mathit{uNode} be the update node created by uOpuOp.

By the code on line 213, 𝑢𝑁𝑜𝑑𝑒I𝑟𝑢𝑎𝑙𝑙D𝑟𝑢𝑎𝑙𝑙\mathit{uNode}\notin I_{\mathit{ruall}}\cup D_{\mathit{ruall}}. If uOpuOp is a Delete operation, it follows by Lemma 5.16 that uOpuOp is linearized sometime after CwC_{\leq w}, and hence after CxC_{\leq x}. If uOpuOp is an Insert operation, then 𝑢𝑁𝑜𝑑𝑒\mathit{uNode} is the first activated update node in latest[w]\textit{latest}[w] from when it was linearized to when it notified pOppOp, which is after pOppOp completes its traversal of the RU-ALL because 𝑛𝑁𝑜𝑑𝑒.𝑛𝑜𝑡𝑖𝑓𝑦𝑇ℎ𝑟𝑒𝑠ℎ𝑜𝑙𝑑=1\mathit{nNode}.\mathit{notifyThreshold}=-1. So it is linearized sometime after CxC_{\leq x}, otherwise 𝑢𝑁𝑜𝑑𝑒I𝑟𝑢𝑎𝑙𝑙\mathit{uNode}\in I_{\mathit{ruall}}. In either case, uOpuOp is linearized sometime after CxC_{\leq x}.

By definition, 𝑖𝑁𝑜𝑑𝑒=𝑛𝑁𝑜𝑑𝑒.𝑢𝑝𝑑𝑎𝑡𝑒𝑁𝑜𝑑𝑒𝑀𝑎𝑥\mathit{iNode}=\mathit{nNode}.\mathit{updateNodeMax} is the update node with largest key less than yy returned by uOpuOp’s instance of TraverseUall on line 133, which occurs sometime between CxC_{\leq x} and C𝑛𝑜𝑡𝑖𝑓𝑦C_{\mathit{notify}}. By Lemma 5.13, there is a configuration CC during this instance of TraverseUall in which 𝑖𝑁𝑜𝑑𝑒.𝑘𝑒𝑦S\mathit{iNode}.\mathit{key}\in S. So CC is between C<xC_{<x} and C𝑛𝑜𝑡𝑖𝑓𝑦C_{\mathit{notify}}. ∎

Recall that prior to traversing the relaxed binary trie, an instance pOppOp of PredHelper first traverses the RU-ALL to find DEL nodes of Delete operations that may have been linearized before the start of pOppOp.

Suppose 𝑑𝑁𝑜𝑑𝑒\mathit{dNode} is the DEL node of a Delete operation with key less than yy that is linearized before pOppOp. If pOppOp does not encounter 𝑑𝑁𝑜𝑑𝑒\mathit{dNode} when it traverses the RU-ALL, then 𝑑𝑁𝑜𝑑𝑒\mathit{dNode} was removed from RU-ALL before pOppOp could encounter it. In this case, pOppOp will also not accept any notifications about 𝑑𝑁𝑜𝑑𝑒\mathit{dNode} and pOppOp will not encounter 𝑑𝑁𝑜𝑑𝑒\mathit{dNode} in the U-ALL. This is formalized in the next lemma.

Lemma 5.16.

Let dOpdOp be an SS-modifying Delete(x)(x) operation for some key x<yx<y, and let 𝑑𝑁𝑜𝑑𝑒\mathit{dNode} be the DEL node created by dOpdOp. If dOpdOp is linearized before CxC_{\leq x}, then either 𝑑𝑁𝑜𝑑𝑒D𝑟𝑢𝑎𝑙𝑙\mathit{dNode}\in D_{\mathit{ruall}} or 𝑑𝑁𝑜𝑑𝑒D𝑢𝑎𝑙𝑙D𝑛𝑜𝑡𝑖𝑓𝑦\mathit{dNode}\notin D_{\mathit{uall}}\cup D_{\mathit{notify}}.

Proof.

Since dOpdOp is linearized before CxC_{\leq x}, 𝑑𝑁𝑜𝑑𝑒\mathit{dNode} is in RU-ALL before CxC_{\leq x}. Suppose pOppOp does not encounter 𝑑𝑁𝑜𝑑𝑒\mathit{dNode} when it traverses the RU-ALL. So dOpdOp removed its 𝑑𝑁𝑜𝑑𝑒\mathit{dNode} from the RU-ALL before pOppOp sets 𝑝𝑁𝑜𝑑𝑒.𝑅𝑢𝑎𝑙𝑙𝑃𝑜𝑠𝑖𝑡𝑖𝑜𝑛\mathit{pNode}.\mathit{RuallPosition} to an update node with key less than xx, and hence before C<xC_{<x}. Hence, dOpdOp adds a notify node 𝑛𝑁𝑜𝑑𝑒\mathit{nNode} to 𝑝𝑁𝑜𝑑𝑒.𝑛𝑜𝑡𝑖𝑓𝑦𝐿𝑖𝑠𝑡\mathit{pNode}.\mathit{notifyList} with 𝑛𝑁𝑜𝑑𝑒.𝑛𝑜𝑡𝑖𝑓𝑦𝑇ℎ𝑟𝑒𝑠ℎ𝑜𝑙𝑑x\mathit{nNode}.\mathit{notifyThreshold}\geq x. By line 210, 𝑑𝑁𝑜𝑑𝑒D𝑟𝑢𝑎𝑙𝑙\mathit{dNode}\in D_{\mathit{ruall}}.

Suppose pOppOp encounters 𝑑𝑁𝑜𝑑𝑒\mathit{dNode} when it traverses the RU-ALL and pOppOp’s instance of FirstActivated(𝑑𝑁𝑜𝑑𝑒)(\mathit{dNode}) on line 251 returns True. Then 𝑑𝑁𝑜𝑑𝑒D𝑟𝑢𝑎𝑙𝑙\mathit{dNode}\in D_{\mathit{ruall}}.

So pOppOp’s instance of FirstActivated(𝑑𝑁𝑜𝑑𝑒)(\mathit{dNode}) on line 251 returns False. By Lemma 5.5, there is a configuration CC during this instance of FirstActivated(𝑑𝑁𝑜𝑑𝑒)(\mathit{dNode}) in which 𝑑𝑁𝑜𝑑𝑒\mathit{dNode} is not the first activated update node in latest[x]\textit{latest}[x]. So there is an Insert(x)(x) operation linearized sometime between when dOpdOp is linearized and CC, and hence before pOppOp completes TraverseRUAll. Since 𝑑𝑁𝑜𝑑𝑒\mathit{dNode} is no longer the first activated update node in latest[x]\textit{latest}[x], 𝑑𝑁𝑜𝑑𝑒D𝑢𝑎𝑙𝑙\mathit{dNode}\notin D_{\mathit{uall}}. Furthermore, this Insert(x)(x) operation is linearized before pOppOp sets 𝑝𝑁𝑜𝑑𝑒.𝑅𝑢𝑎𝑙𝑙𝑃𝑜𝑠𝑖𝑡𝑖𝑜𝑛\mathit{pNode}.\mathit{RuallPosition} to an update node with key less than xx, so 𝑑𝑁𝑜𝑑𝑒D𝑛𝑜𝑡𝑖𝑓𝑦\mathit{dNode}\notin D_{\mathit{notify}}. ∎

The next lemma states that the key of each DEL node 𝑑𝑁𝑜𝑑𝑒D𝑢𝑎𝑙𝑙D𝑟𝑢𝑎𝑙𝑙\mathit{dNode}\in D_{\mathit{uall}}-D_{\mathit{ruall}} is in SS sometime during pOppOp. Likewise, the lemma following it states that the key of each DEL node 𝑑𝑁𝑜𝑑𝑒D𝑛𝑜𝑡𝑖𝑓𝑦D𝑟𝑢𝑎𝑙𝑙\mathit{dNode}\in D_{\mathit{notify}}-D_{\mathit{ruall}} is in SS sometime during pOppOp. Both of these results use Lemma 5.16 to argue that the Delete(x)(x) operation that created 𝑑𝑁𝑜𝑑𝑒\mathit{dNode} was linearized sometime after CxC_{\leq x}. In the configuration immediately before this operation was linearized, xSx\in S.

Lemma 5.17.

Consider a DEL node 𝑑𝑁𝑜𝑑𝑒D𝑢𝑎𝑙𝑙D𝑟𝑢𝑎𝑙𝑙\mathit{dNode}\in D_{\mathit{uall}}-D_{\mathit{ruall}} with key xx. There is a configuration CC after CxC_{\leq x} in which xSx\in S. Furthermore, CC occurs before pOppOp encounters any update nodes with key greater than xx during its traversal of the U-ALL.

Proof.

Let dOpdOp be the Delete(x)(x) operation that created 𝑑𝑁𝑜𝑑𝑒\mathit{dNode}. By Lemma 5.13, there is a configuration CC during pOppOp’s instance of TraverseUall in which 𝑑𝑁𝑜𝑑𝑒\mathit{dNode} is the first activated update node in latest[x]\textit{latest}[x], and hence xSx\notin S in CC. Suppose, for contradiction, that xSx\notin S in all configurations from CxC_{\leq x} to CC. This implies that dOpdOp was linearized before CxC_{\leq x}. By Lemma 5.16, 𝑑𝑁𝑜𝑑𝑒D𝑟𝑢𝑎𝑙𝑙\mathit{dNode}\in D_{\mathit{ruall}} or 𝑑𝑁𝑜𝑑𝑒D𝑢𝑎𝑙𝑙D𝑛𝑜𝑡𝑖𝑓𝑦\mathit{dNode}\notin D_{\mathit{uall}}\cup D_{\mathit{notify}}. This contradicts the fact that 𝑑𝑁𝑜𝑑𝑒D𝑢𝑎𝑙𝑙D𝑟𝑢𝑎𝑙𝑙\mathit{dNode}\in D_{\mathit{uall}}-D_{\mathit{ruall}}. So there exists a configuration after CxC_{\leq x} in which xSx\in S. Since this configuration occurs at or before CC, by Lemma 5.13 it occurs before pOppOp encounters any update nodes with key greater than xx during its traversal of the U-ALL. ∎

Lemma 5.18.

Consider a DEL node 𝑑𝑁𝑜𝑑𝑒D𝑛𝑜𝑡𝑖𝑓𝑦D𝑟𝑢𝑎𝑙𝑙\mathit{dNode}\in D_{\mathit{notify}}-D_{\mathit{ruall}} with key xx. There is a configuration between CxC_{\leq x} and C𝑛𝑜𝑡𝑖𝑓𝑦C_{\mathit{notify}} in which xSx\in S.

Proof.

Let dOpdOp be the Delete(x)(x) operation that created 𝑑𝑁𝑜𝑑𝑒\mathit{dNode}. Since 𝑑𝑁𝑜𝑑𝑒D𝑛𝑜𝑡𝑖𝑓𝑦\mathit{dNode}\in D_{\mathit{notify}}, dOpdOp successfully added a notify node 𝑛𝑁𝑜𝑑𝑒\mathit{nNode} to 𝑝𝑁𝑜𝑑𝑒.𝑛𝑜𝑡𝑖𝑓𝑦𝐿𝑖𝑠𝑡\mathit{pNode}.\mathit{notifyList}, where 𝑛𝑁𝑜𝑑𝑒.𝑛𝑜𝑡𝑖𝑓𝑦𝑇ℎ𝑟𝑒𝑠ℎ𝑜𝑙𝑑<x\mathit{nNode}.\mathit{notifyThreshold}<x. This means that when dOpdOp read 𝑝𝑁𝑜𝑑𝑒.𝑅𝑢𝑎𝑙𝑙𝑃𝑜𝑠𝑖𝑡𝑖𝑜𝑛\mathit{pNode}.\mathit{RuallPosition} on line 140 it pointed to an update node with key less than xx, and FirstActivated(𝑑𝑁𝑜𝑑𝑒)(\mathit{dNode}) on line 135 returned True. By Lemma 5.4, there is a configuration CC during FirstActivated in which 𝑑𝑁𝑜𝑑𝑒\mathit{dNode} is the first activated update node in latest[x]\textit{latest}[x]. Since 𝑑𝑁𝑜𝑑𝑒D𝑟𝑢𝑎𝑙𝑙\mathit{dNode}\notin D_{\mathit{ruall}} and 𝑑𝑁𝑜𝑑𝑒\mathit{dNode} is the first activated update node in latest[x]\textit{latest}[x] in CC, it follows by Lemma 5.16 that dOpdOp is linearized sometime after CxC_{\leq x}. In the configuration immediately before dOpdOp is linearized, xSx\in S. Since dOpdOp added 𝑛𝑁𝑜𝑑𝑒\mathit{nNode} to 𝑝𝑁𝑜𝑑𝑒.𝑛𝑜𝑡𝑖𝑓𝑦𝐿𝑖𝑠𝑡\mathit{pNode}.\mathit{notifyList} after it was linearized and before C𝑛𝑜𝑡𝑖𝑓𝑦C_{\mathit{notify}}, this configuration occurs before C𝑛𝑜𝑡𝑖𝑓𝑦C_{\mathit{notify}}. ∎

We next focus on proving lemmas about keys of update operations that become candidate return values of pOppOp. They are used to show Property 2(b) and 2(c). At the end of this subsection, we prove that our implementation satisfies Property 2.

The following two technical lemmas state that after an update operation uOpuOp with key xx is linearized, there is an update node with key xx that is in the U-ALL and is the first activated update node in latest[x]\textit{latest}[x], until some update operation with key xx completes a traversal of the P-ALL, attempting to notify each predecessor node it encounters about uOpuOp. Either uOpuOp performs these notifications itself, or some update operation with key xx helps uOpuOp perform these notifications before linearizing its own operation. The next two lemmas prove this for Insert and Delete operations, respectively.
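The traversal referred to in these lemmas, NotifyPredOps, can be sketched by iterating the notification attempt above over an illustrative P-ALL; the removal of the operation's own update node from the U-ALL, which happens only after this traversal completes, is not shown.

  struct PallNode { PredNode* p; PallNode* next; };

  // Sketch of NotifyPredOps(uNode): try to notify every Predecessor operation currently
  // announced in the P-ALL (see the notification sketch above).
  void notifyPredOpsSketch(PallNode* pallHead, UpdateNode* uNode, UpdateNode* insMax,
                           LatestList* latest /* indexed by key */) {
      for (PallNode* n = pallHead; n != nullptr; n = n->next)
          notifySketch(n->p, uNode, insMax, latest);
  }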

Lemma 5.19.

Let iOpiOp be an SS-modifying Insert(x)(x) operation for some key w<x<yw<x<y, and let 𝑖𝑁𝑜𝑑𝑒\mathit{iNode} be the INS node created by iOpiOp. For all configurations from when iOpiOp is linearized until some update operation completes an instance of NotifyPredOps(𝑖𝑁𝑜𝑑𝑒)(\mathit{iNode}), 𝑖𝑁𝑜𝑑𝑒\mathit{iNode} is in the U-ALL and is the first activated update node in latest[x]\textit{latest}[x].

Proof.

By definition, iOpiOp is linearized when the status of 𝑖𝑁𝑜𝑑𝑒\mathit{iNode} changes from inactive to active by the CAS on line 119. Immediately after this CAS, 𝑖𝑁𝑜𝑑𝑒\mathit{iNode} is the first activated update node in latest[x]\textit{latest}[x] and is an activated update node in U-ALL. From the code iOpiOp does not remove 𝑖𝑁𝑜𝑑𝑒\mathit{iNode} from U-ALL until after it completes NotifyPredOps(𝑖𝑁𝑜𝑑𝑒)(\mathit{iNode}).

Before any Delete(x)(x) operation dOpdOp is linearized after iOpiOp is linearized, dOpdOp helps perform NotifyPredOps(𝑖𝑁𝑜𝑑𝑒)(\mathit{iNode}) on line 176. Furthermore, iOpiOp does not remove 𝑖𝑁𝑜𝑑𝑒\mathit{iNode} from U-ALL until sometime after it completes its instance of NotifyPredOps(𝑖𝑁𝑜𝑑𝑒)(\mathit{iNode}). It follows that 𝑖𝑁𝑜𝑑𝑒\mathit{iNode} is the first activated update node in latest[x]\textit{latest}[x] and is an activated update node in U-ALL for all configurations starting when iOpiOp is linearized, and ending immediately after some update operation invokes and completes NotifyPredOps(𝑖𝑁𝑜𝑑𝑒)(\mathit{iNode}). ∎

Lemma 5.20.

Let dOpdOp be an SS-modifying Delete(x)(x) operation for some key w<x<yw<x<y, and let 𝑑𝑁𝑜𝑑𝑒\mathit{dNode} be the DEL node created by dOpdOp. Then 𝑑𝑁𝑜𝑑𝑒\mathit{dNode} is in the U-ALL and is the first activated update node in latest[x]\textit{latest}[x] in all configurations from when dOpdOp is linearized until either

  • dOpdOp completes an instance of NotifyPredOps(𝑑𝑁𝑜𝑑𝑒)(\mathit{dNode}) on line 188, or

  • an SS-modifying Insert(x)(x) operation is linearized after dOpdOp.

Proof.

By definition, dOpdOp is linearized when the status of 𝑑𝑁𝑜𝑑𝑒\mathit{dNode} changes from inactive to active by the CAS on line 119. Immediately after this CAS, 𝑑𝑁𝑜𝑑𝑒\mathit{dNode} is the first activated update node in latest[x]\textit{latest}[x] and is an activated update node in U-ALL.

Suppose an SS-modifying Insert(x)(x) operation, iOpiOp, is linearized after dOpdOp and before dOpdOp invokes and completes NotifyPredOps(𝑑𝑁𝑜𝑑𝑒)(\mathit{dNode}). Only after iOpiOp is linearized is 𝑑𝑁𝑜𝑑𝑒\mathit{dNode} no longer the first activated update node in latest[x]\textit{latest}[x]. Furthermore, dOpdOp does not remove 𝑑𝑁𝑜𝑑𝑒\mathit{dNode} from U-ALL until after it invokes and completes NotifyPredOps(𝑑𝑁𝑜𝑑𝑒)(\mathit{dNode}). Then 𝑑𝑁𝑜𝑑𝑒\mathit{dNode} is the first activated update node in latest[x]\textit{latest}[x] and is an activated update node in U-ALL for all configurations starting when dOpdOp is linearized and ending when iOpiOp is linearized.

So suppose dOpdOp invokes and completes NotifyPredOps(𝑑𝑁𝑜𝑑𝑒)(\mathit{dNode}) before an SS-modifying Insert(x)(x) operation is linearized after dOpdOp. So 𝑑𝑁𝑜𝑑𝑒\mathit{dNode} remains the first activated update node in latest[x]\textit{latest}[x] until after dOpdOp invokes and completes NotifyPredOps(𝑑𝑁𝑜𝑑𝑒)(\mathit{dNode}). Furthermore, dOpdOp does not remove 𝑑𝑁𝑜𝑑𝑒\mathit{dNode} from U-ALL until after it invokes and completes NotifyPredOps(𝑑𝑁𝑜𝑑𝑒)(\mathit{dNode}). Then 𝑑𝑁𝑜𝑑𝑒\mathit{dNode} is the first activated update node in latest[x]\textit{latest}[x] and is an activated update node in U-ALL for all configurations starting when dOpdOp is linearized and ending when dOpdOp completes NotifyPredOps(𝑑𝑁𝑜𝑑𝑒)(\mathit{dNode}). ∎

Next we prove two lemmas describing scenarios in which, if an update operation uOpuOp with key xx is linearized during pOppOp, then xx is a candidate return value of pOppOp. This is done using the previous two lemmas, which guarantee that an update node with key xx will either be seen when pOppOp traverses the U-ALL, or some update operation with key xx notifies pOppOp sometime before pOppOp completes its traversal of the U-ALL.

Lemma 5.21.

Let dOpdOp be an SS-modifying Delete(x)(x) operation, for some key w<x<yw<x<y, that is linearized sometime between CwC_{\leq w} and CTC_{T}. Then xx is a candidate return value of pOppOp.

Proof.

Suppose some latest update operation uOpuOp^{\prime} with key xx notifies all predecessor nodes in P-ALL before pOppOp completes its traversal of the U-ALL. Note that 𝑝𝑁𝑜𝑑𝑒\mathit{pNode} is inserted into P-ALL before the start of pOppOp, and is not removed from P-ALL until sometime after pOppOp completes its traversal of the U-ALL. So 𝑝𝑁𝑜𝑑𝑒\mathit{pNode} is in P-ALL throughout uOpuOp^{\prime}’s traversal of the P-ALL on line 162. Since uOpuOp^{\prime} is linearized after CwC_{\leq w}, it follows that 𝑝𝑁𝑜𝑑𝑒.𝑅𝑢𝑎𝑙𝑙𝑃𝑜𝑠𝑖𝑡𝑖𝑜𝑛\mathit{pNode}.\mathit{RuallPosition} points to an update node with key less than or equal to ww throughout uOpuOp^{\prime}’s traversal of the P-ALL. So when pOppOp finishes traversing its notify list on line 205, the update node created by uOpuOp^{\prime} is in I𝑛𝑜𝑡𝑖𝑓𝑦D𝑛𝑜𝑡𝑖𝑓𝑦I_{\mathit{notify}}\cup D_{\mathit{notify}}. Hence, xx is a candidate return value of pOppOp.

So suppose pOppOp completes its traversal of the U-ALL before some latest update operation uOpuOp^{\prime} with key xx notifies all predecessor nodes in P-ALL. Since no latest update operation with key xx notifies pOppOp, no latest update node with key xx is removed from U-ALL before pOppOp completes its traversal of the U-ALL. It follows by Lemma 5.20 and Lemma 5.19 that pOppOp encounters an activated update node 𝑢𝑁𝑜𝑑𝑒\mathit{uNode}^{\prime} with key xx during its traversal of the U-ALL. Since FirstActivated(𝑢𝑁𝑜𝑑𝑒)(\mathit{uNode}^{\prime}) returns True on line 127, 𝑢𝑁𝑜𝑑𝑒I𝑢𝑎𝑙𝑙D𝑢𝑎𝑙𝑙\mathit{uNode}^{\prime}\in I_{\mathit{uall}}\cup D_{\mathit{uall}} when TraverseUall returns on line 203. Hence, xx is a candidate return value of pOppOp. ∎

Lemma 5.22.

Let iOpiOp be an SS-modifying Insert(x)(x) operation, for some key x<yx<y that is linearized sometime after CTC_{T}, but before pOppOp encounters any update nodes with key greater than or equal to xx during its instance of TraverseUall(y)\textsc{TraverseUall}(y). Then xx is a candidate return value of pOppOp.

Proof.

Let 𝑖𝑁𝑜𝑑𝑒\mathit{iNode} be the INS node created by iOpiOp. Suppose some latest update operation uOpuOp^{\prime} notifies all predecessor nodes in P-ALL about 𝑖𝑁𝑜𝑑𝑒\mathit{iNode} before pOppOp completes its traversal of the U-ALL. Note that 𝑝𝑁𝑜𝑑𝑒\mathit{pNode} is inserted into P-ALL before the start of pOppOp, and is not removed from P-ALL until sometime after pOppOp completes its traversal of the U-ALL. So 𝑝𝑁𝑜𝑑𝑒\mathit{pNode} is in P-ALL throughout uOpuOp^{\prime}’s traversal of the P-ALL on line 162, and uOpuOp^{\prime} notifies pOppOp about 𝑖𝑁𝑜𝑑𝑒\mathit{iNode}. Since this notification occurs after iOpiOp is linearized, and hence after CTC_{T}, pOppOp has already completed its traversal of the RU-ALL, so 𝑝𝑁𝑜𝑑𝑒.𝑅𝑢𝑎𝑙𝑙𝑃𝑜𝑠𝑖𝑡𝑖𝑜𝑛\mathit{pNode}.\mathit{RuallPosition} points to an update node with key less than xx throughout uOpuOp^{\prime}’s traversal of the P-ALL. So when pOppOp finishes traversing its notify list on line 205, 𝑖𝑁𝑜𝑑𝑒I𝑛𝑜𝑡𝑖𝑓𝑦\mathit{iNode}\in I_{\mathit{notify}}. Hence, xx is a candidate return value of pOppOp.

So suppose pOppOp completes its traversal of the U-ALL before some latest update operation with key xx notifies all predecessor nodes in P-ALL about 𝑖𝑁𝑜𝑑𝑒\mathit{iNode}. Since no latest update operation with key xx notifies pOppOp, 𝑖𝑁𝑜𝑑𝑒\mathit{iNode} is not removed from U-ALL before pOppOp completes its traversal of the U-ALL. It follows by Lemma 5.19 that pOppOp encounters the activated update node 𝑖𝑁𝑜𝑑𝑒\mathit{iNode} during its traversal of the U-ALL. Since FirstActivated(𝑖𝑁𝑜𝑑𝑒)(\mathit{iNode}) returns True on line 127, 𝑖𝑁𝑜𝑑𝑒I𝑢𝑎𝑙𝑙\mathit{iNode}\in I_{\mathit{uall}} when TraverseUall returns on line 203. Hence, xx is a candidate return value of pOppOp. ∎

The following lemma describes the scenario in which an Insert(w)(w) or Delete(w)(w) operation, uOpuOp, includes the INS node of an Insert(x)(x) operation, iOpiOp, for w<x<yw<x<y, when uOpuOp notifies pOppOp. Either iOpiOp will notify pOppOp about its INS node before C𝑛𝑜𝑡𝑖𝑓𝑦C_{\mathit{notify}}, or uOpuOp will see this INS node when it traverses the U-ALL, and hence include an INS node with key at least xx when it notifies pOppOp.

Lemma 5.23.

Let iOpiOp be an SS-modifying Insert(x)(x) operation, for some key x<yx<y. Let uOpuOp be an Insert(w)(w) or Delete(w)(w) operation that notifies pOppOp before C𝑛𝑜𝑡𝑖𝑓𝑦C_{\mathit{notify}}, for some key w<x<yw<x<y. Suppose iOpiOp is linearized after CTC_{T}, but before uOpuOp encounters any update nodes with key greater than or equal to xx during its instance of TraverseUall. Then pOppOp has a candidate return value xx^{\prime}, where w<xx<yw<x\leq x^{\prime}<y.

Proof.

Let 𝑖𝑁𝑜𝑑𝑒\mathit{iNode} be the INS node created by iOpiOp. By Lemma 5.19, 𝑖𝑁𝑜𝑑𝑒\mathit{iNode} is the first activated update node in latest[x]\textit{latest}[x] in all configurations from when iOpiOp is linearized to when it first completes NotifyPredOps(𝑖𝑁𝑜𝑑𝑒)(\mathit{iNode}). It follows by Lemma 5.5 that all instances of FirstActivated(𝑖𝑁𝑜𝑑𝑒)(\mathit{iNode}) during this instance of NotifyPredOps(𝑖𝑁𝑜𝑑𝑒)(\mathit{iNode}) return True. Therefore, if iOpiOp attempts to notify pOppOp before C𝑛𝑜𝑡𝑖𝑓𝑦C_{\mathit{notify}}, iOpiOp successfully notifies pOppOp. Furthermore, iOpiOp notifies pOppOp after CTC_{T}, and hence after C<xC_{<x}. Then 𝑖𝑁𝑜𝑑𝑒I𝑛𝑜𝑡𝑖𝑓𝑦\mathit{iNode}\in I_{\mathit{notify}}. Then pOppOp has a candidate return value xx.

So suppose iOpiOp does not attempt to notify pOppOp before C𝑛𝑜𝑡𝑖𝑓𝑦C_{\mathit{notify}}. Then Lemma 5.19 implies that 𝑖𝑁𝑜𝑑𝑒\mathit{iNode} is the first activated update node in latest[x]\textit{latest}[x] in all configurations from when iOpiOp is linearized to C𝑛𝑜𝑡𝑖𝑓𝑦C_{\mathit{notify}}. Since iOpiOp is linearized before uOpuOp encounters any update nodes with key greater than or equal to xx during its instance of TraverseUall, it follows that uOpuOp encounters 𝑖𝑁𝑜𝑑𝑒\mathit{iNode} in U-ALL. Furthermore, by Lemma 5.5, uOpuOp’s instance of FirstActivated(𝑖𝑁𝑜𝑑𝑒)(\mathit{iNode}) during TraverseUall returns True. This implies that when uOpuOp notifies pOppOp by adding a notify node 𝑛𝑁𝑜𝑑𝑒\mathit{nNode} into 𝑝𝑁𝑜𝑑𝑒.𝑛𝑜𝑡𝑖𝑓𝑦𝐿𝑖𝑠𝑡\mathit{pNode}.\mathit{notifyList}, 𝑛𝑁𝑜𝑑𝑒.𝑢𝑝𝑑𝑎𝑡𝑒𝑁𝑜𝑑𝑒𝑀𝑎𝑥\mathit{nNode}.\mathit{updateNodeMax} contains a pointer to an INS node with key xx^{\prime}, where w<xx<yw<x\leq x^{\prime}<y. Then pOppOp has a candidate return value xx^{\prime}. ∎

We now show that Property 2 is satisfied for the keys of update nodes in I𝑢𝑎𝑙𝑙I𝑛𝑜𝑡𝑖𝑓𝑦(D𝑢𝑎𝑙𝑙D𝑟𝑢𝑎𝑙𝑙)(D𝑛𝑜𝑡𝑖𝑓𝑦D𝑟𝑢𝑎𝑙𝑙)I_{\mathit{uall}}\cup I_{\mathit{notify}}\cup(D_{\mathit{uall}}-D_{\mathit{ruall}})\cup(D_{\mathit{notify}}-D_{\mathit{ruall}}).

Lemma 5.24.

Let ww be the key of an update node in I𝑢𝑎𝑙𝑙I𝑛𝑜𝑡𝑖𝑓𝑦(D𝑢𝑎𝑙𝑙D𝑟𝑢𝑎𝑙𝑙)(D𝑛𝑜𝑡𝑖𝑓𝑦D𝑟𝑢𝑎𝑙𝑙)I_{\mathit{uall}}\cup I_{\mathit{notify}}\cup(D_{\mathit{uall}}-D_{\mathit{ruall}})\cup(D_{\mathit{notify}}-D_{\mathit{ruall}}), or is the key returned by pOppOp’s instance of RelaxedPredecessor(y)(y). Then there is a configuration CC during pOppOp such that

  1. (a)

    wSw\in S,

  2. (b)

    if CC occurs before CTC_{T} and there exists a Delete(x)(x) operation linearized between CC and CTC_{T} with w<x<yw<x<y, then pOppOp has a candidate return value which is at least xx, and

  3. (c)

    if CC occurs at or after CTC_{T} and there exists an Insert(x)(x) operation linearized between CTC_{T} and CC with w<x<yw<x<y, then pOppOp has a candidate return value which is at least xx.

Proof.

First suppose ww is the key returned by pOppOp’s instance of RelaxedPredecessor(y)(y). By Lemma 4.21, wSw\in S in a configuration CC during RelaxedPredecessor(y)(y). So CC occurs after CTC_{T}. Suppose there is an Insert(x)(x) operation, iOpxiOp_{x}, linearized between CTC_{T} and CC, where w<x<yw<x<y. By Lemma 5.22, xx is a candidate return value of pOppOp, so condition (c) holds.

Now suppose ww is the key of an INS node 𝑖𝑁𝑜𝑑𝑒I𝑢𝑎𝑙𝑙\mathit{iNode}\in I_{\mathit{uall}}. Let iOpwiOp_{w} be the Insert(w)(w) operation that created 𝑖𝑁𝑜𝑑𝑒\mathit{iNode}. By Lemma 5.13, there is a configuration CC during pOppOp’s traversal of U-ALL in which wSw\in S. So CC occurs after CTC_{T}. Suppose there is an Insert(x)(x) operation, iOpxiOp_{x}, linearized between CTC_{T} and CC, where w<x<yw<x<y. By Lemma 5.22, xx is a candidate return value of pOppOp, so condition (c) holds.

Consider an update node 𝑢𝑁𝑜𝑑𝑒I𝑛𝑜𝑡𝑖𝑓𝑦(D𝑢𝑎𝑙𝑙D𝑟𝑢𝑎𝑙𝑙)(D𝑛𝑜𝑡𝑖𝑓𝑦D𝑟𝑢𝑎𝑙𝑙)\mathit{uNode}\in I_{\mathit{notify}}\cup(D_{\mathit{uall}}-D_{\mathit{ruall}})\cup(D_{\mathit{notify}}-D_{\mathit{ruall}}), where 𝑢𝑁𝑜𝑑𝑒.𝑘𝑒𝑦=w<y\mathit{uNode}.\mathit{key}=w<y. Suppose there is a configuration CC between CwC_{\leq w} and CTC_{T} in which wSw\in S. Suppose there is a Delete(x)(x) operation, dOpxdOp_{x}, linearized between CC and CTC_{T}, where w<x<yw<x<y. By Lemma 5.21, xx is a candidate return value of pOppOp, so condition (b) holds.

So suppose there is no configuration between CwC_{\leq w} and CTC_{T} in which wSw\in S. By Lemma 5.14, Lemma 5.15, Lemma 5.17, and Lemma 5.18, there is a configuration during pOppOp after CwC_{\leq w} in which wSw\in S. This configuration occurs after CTC_{T}.

  • Suppose 𝑢𝑁𝑜𝑑𝑒I𝑛𝑜𝑡𝑖𝑓𝑦\mathit{uNode}\in I_{\mathit{notify}}. Let iOpwiOp_{w} be the Insert(w)(w) operation that created 𝑢𝑁𝑜𝑑𝑒\mathit{uNode}. Let CC be the configuration immediately after iOpwiOp_{w} is linearized, which occurs after CTC_{T}.

    First, suppose that 𝑢𝑁𝑜𝑑𝑒I𝑛𝑜𝑡𝑖𝑓𝑦\mathit{uNode}\in I_{\mathit{notify}} because iOpwiOp_{w} notified pOppOp. Suppose there is an Insert(x)(x) operation, iOpxiOp_{x}, linearized between CTC_{T} and CC, where w<x<yw<x<y. So iOpxiOp_{x} is linearized before iOpwiOp_{w}. Since iOpwiOp_{w} notifies pOppOp, it follows from Lemma 5.23 that pOppOp has a candidate return value which is at least xx.

    Next, suppose 𝑢𝑁𝑜𝑑𝑒I𝑛𝑜𝑡𝑖𝑓𝑦\mathit{uNode}\in I_{\mathit{notify}} because some Insert(w)(w^{\prime}) or Delete(w)(w^{\prime}) operation, uOpuOp, included 𝑢𝑁𝑜𝑑𝑒\mathit{uNode} in its notification to pOppOp, where w<w<yw^{\prime}<w<y. Suppose there is an Insert(x)(x) operation, iOpxiOp_{x}, linearized between CTC_{T} and CC, where w<x<yw<x<y. Since uOpuOp includes 𝑢𝑁𝑜𝑑𝑒\mathit{uNode} in its notification to pOppOp, FirstActivated(𝑢𝑁𝑜𝑑𝑒)(\mathit{uNode}) returned True during uOpuOp’s instance of TraverseUall. By Lemma 5.4, there is a configuration CC^{\prime} during this instance of FirstActivated(𝑢𝑁𝑜𝑑𝑒)(\mathit{uNode}) in which 𝑢𝑁𝑜𝑑𝑒\mathit{uNode} is the first activated update node in latest[w]\textit{latest}[w], which is after CC. Since U-ALL is sorted by increasing key, CC^{\prime} occurs before uOpuOp encounters any update node with key greater than or equal to xx during its instance of TraverseUall. Since iOpxiOp_{x} is linearized between CTC_{T} and CC, and hence before CC^{\prime}, it follows by Lemma 5.23 that pOppOp has a candidate return value x,x^{\prime}, where w<xx<yw<x\leq x^{\prime}<y.

  • Suppose 𝑢𝑁𝑜𝑑𝑒(D𝑢𝑎𝑙𝑙D𝑟𝑢𝑎𝑙𝑙)\mathit{uNode}\in(D_{\mathit{uall}}-D_{\mathit{ruall}}). Let dOpwdOp_{w} be the Delete(w)(w) operation that created 𝑢𝑁𝑜𝑑𝑒\mathit{uNode}. Let CC be the configuration immediately before dOpwdOp_{w} is linearized.

    Suppose there is an Insert(x)(x) operation, iOpxiOp_{x}, linearized between CTC_{T} and CC, where w<x<yw<x<y. Since 𝑢𝑁𝑜𝑑𝑒(D𝑢𝑎𝑙𝑙D𝑟𝑢𝑎𝑙𝑙)\mathit{uNode}\in(D_{\mathit{uall}}-D_{\mathit{ruall}}) and pOppOp encounters 𝑢𝑁𝑜𝑑𝑒\mathit{uNode} when it traverses the U-ALL, CC occurs before pOppOp encounters any update nodes with key greater than ww. It follows by Lemma 5.22 that xx is a candidate return value of pOppOp.

  • Suppose 𝑢𝑁𝑜𝑑𝑒(D𝑛𝑜𝑡𝑖𝑓𝑦D𝑟𝑢𝑎𝑙𝑙)\mathit{uNode}\in(D_{\mathit{notify}}-D_{\mathit{ruall}}). Let dOpwdOp_{w} be the Delete(w)(w) operation that created 𝑢𝑁𝑜𝑑𝑒\mathit{uNode}. Let CC be the configuration immediately before dOpwdOp_{w} is linearized.

    Suppose there is an Insert(x)(x) operation, iOpxiOp_{x}, linearized between CTC_{T} and CC, where w<x<yw<x<y. So iOpxiOp_{x} is linearized before dOpwdOp_{w}. Since dOpwdOp_{w} notifies pOppOp, it follows from Lemma 5.23 that pOppOp has a candidate return value which is at least xx. ∎

Recall that pOppOp may have one additional candidate return value p0p_{0}. We will show in the next section that this candidate return value is in SS in CTC_{T}, and hence vacuously satisfies Property 2.

5.3.6 Our Implementation Satisfies Property 3

In this section, we show that Property 3 is satisfied by our implementation. It is easy to show this property is satisfied when pOppOp’s instance of RelaxedPredecessor(y)(y) returns a value p0p_{0}\neq\bot. When \bot is returned, we show that after pOppOp completes the pseudocode from lines 217 to 237, if a key xx is completely present throughout pOppOp, then pOppOp sets p0p_{0} to a value at least xx, and hence pOppOp has a candidate return value at least xx.

The next lemma states that if there is a linearized update operation uOpuOp with key xx that has not completed updating the relaxed binary trie and that is concurrent with pOppOp’s traversal of the relaxed binary trie, then pOppOp will encounter an update node with key xx when pOppOp traverses the U-ALL or when pOppOp traverses its own notify list. Intuitively, pOppOp will either traverse the U-ALL before uOpuOp can remove its update node from the U-ALL, or uOpuOp will notify pOppOp before pOppOp removes its predecessor node from the P-ALL.

Lemma 5.25.

Let uOpuOp be a linearized, SS-modifying update operation with key xx, where x<yx<y. Suppose that, at any point during pOppOp’s traversal of the relaxed binary trie, uOpuOp is the latest update operation with key xx and uOpuOp has not yet completed updating the relaxed binary trie. Then I𝑢𝑎𝑙𝑙I𝑛𝑜𝑡𝑖𝑓𝑦D𝑢𝑎𝑙𝑙D𝑛𝑜𝑡𝑖𝑓𝑦I_{\mathit{uall}}\cup I_{\mathit{notify}}\cup D_{\mathit{uall}}\cup D_{\mathit{notify}} contains an update node with key at least xx.

Proof.

Since uOpuOp is an SS-modifying update operation, the update node 𝑢𝑁𝑜𝑑𝑒\mathit{uNode} it created is the first activated update node in latest[x]\textit{latest}[x] when it was linearized. By Lemma 5.19 and Lemma 5.20, before a latest update node with key xx is removed from U-ALL, some latest update operation with key xx notifies the predecessor operations whose predecessor nodes are in the P-ALL. We consider two cases, depending on whether pOppOp completes its traversal of the U-ALL on line 203 first, or some latest update operation with key xx completes notifying pOppOp on line 162 first.

Suppose pOppOp completes its traversal of the U-ALL before some latest update operation with key xx notifies pOppOp. The danger interval of uOpuOp starts before the end of pOppOp’s binary trie traversal, so U-ALL contains a latest update node with key xx before the start of pOppOp’s traversal of the U-ALL. Since no latest update operation with key xx notifies pOppOp, no latest update node with key xx is removed from U-ALL before pOppOp completes its traversal of the U-ALL. So pOppOp encounters a latest update node 𝑢𝑁𝑜𝑑𝑒\mathit{uNode}^{\prime} with key xx during its traversal of the U-ALL. Since FirstActivated(𝑢𝑁𝑜𝑑𝑒)(\mathit{uNode}^{\prime}) returns True on line 127, 𝑢𝑁𝑜𝑑𝑒I𝑢𝑎𝑙𝑙D𝑢𝑎𝑙𝑙\mathit{uNode}^{\prime}\in I_{\mathit{uall}}\cup D_{\mathit{uall}} when TraverseUall returns on line 203.

Suppose some latest update operation uOpuOp^{\prime} with key xx notifies all predecessor nodes in the P-ALL before pOppOp completes its traversal of the U-ALL, and hence before pOppOp removes 𝑝𝑁𝑜𝑑𝑒\mathit{pNode} from P-ALL. Note that since pOppOp starts its binary trie traversal before the end of uOpuOp’s danger interval, 𝑝𝑁𝑜𝑑𝑒\mathit{pNode} is inserted into P-ALL before the end of uOpuOp’s danger interval, and hence before uOpuOp^{\prime} starts its traversal of P-ALL on line 162 (or line 133). Since 𝑝𝑁𝑜𝑑𝑒\mathit{pNode} is in P-ALL throughout uOpuOp^{\prime}’s traversal of the P-ALL on line 162, uOpuOp^{\prime} notifies pOppOp. Since uOpuOp^{\prime} does not notify pOppOp until after the end of its danger interval and pOppOp completes its traversal of RU-ALL before the start of its traversal of the relaxed binary trie, it follows that 𝑝𝑁𝑜𝑑𝑒.𝑅𝑢𝑎𝑙𝑙𝑃𝑜𝑠𝑖𝑡𝑖𝑜𝑛\mathit{pNode}.\mathit{RuallPosition} points to the sentinel node in the RU-ALL with key -\infty throughout uOpuOp^{\prime}’s traversal of the P-ALL. So when pOppOp finishes traversing its notify list on line 205, an update node with key xx is in I𝑛𝑜𝑡𝑖𝑓𝑦D𝑛𝑜𝑡𝑖𝑓𝑦I_{\mathit{notify}}\cup D_{\mathit{notify}}. ∎

Recall from the specification of RelaxedPredecessor(y)(y) that if it returns \bot, there exists an SS-modifying update operation with key less than yy concurrent with the instance of RelaxedPredecessor(y)(y). The next lemma states that pOppOp will be notified by this update operation or pOppOp will encounter the update node it created when it traverses the U-ALL.

Lemma 5.26.

Let kk be the largest key less than yy that is completely present throughout pOppOp’s traversal of the relaxed binary trie. Suppose pOppOp’s instance of RelaxedPredecessor(y)(y) returns \bot. If I𝑢𝑎𝑙𝑙I𝑛𝑜𝑡𝑖𝑓𝑦(D𝑢𝑎𝑙𝑙D𝑟𝑢𝑎𝑙𝑙)(D𝑛𝑜𝑡𝑖𝑓𝑦D𝑟𝑢𝑎𝑙𝑙)I_{\mathit{uall}}\cup I_{\mathit{notify}}\cup(D_{\mathit{uall}}-D_{\mathit{ruall}})\cup(D_{\mathit{notify}}-D_{\mathit{ruall}}) does not contain an update node with key at least kk, then D𝑟𝑢𝑎𝑙𝑙D_{\mathit{ruall}} contains an update node with key at least kk.

Proof.

By assumption, pOppOp’s instance of RelaxedPredecessor(y)(y) returns \bot. So by Lemma 4.22, there exists an SS-modifying update operation uOpuOp with key xx, where k<x<yk<x<y, such that, at some point during pOppOp’s traversal of the relaxed binary trie, uOpuOp is the latest update operation with key xx and uOpuOp has not yet completed updating the relaxed binary trie. By Lemma 5.25, I𝑢𝑎𝑙𝑙I𝑛𝑜𝑡𝑖𝑓𝑦D𝑢𝑎𝑙𝑙D𝑛𝑜𝑡𝑖𝑓𝑦I_{\mathit{uall}}\cup I_{\mathit{notify}}\cup D_{\mathit{uall}}\cup D_{\mathit{notify}} contains an update node with key at least xx. Therefore, if I𝑢𝑎𝑙𝑙I𝑛𝑜𝑡𝑖𝑓𝑦(D𝑢𝑎𝑙𝑙D𝑟𝑢𝑎𝑙𝑙)(D𝑛𝑜𝑡𝑖𝑓𝑦D𝑟𝑢𝑎𝑙𝑙)I_{\mathit{uall}}\cup I_{\mathit{notify}}\cup(D_{\mathit{uall}}-D_{\mathit{ruall}})\cup(D_{\mathit{notify}}-D_{\mathit{ruall}}) does not contain an update node with key at least kk, then D𝑟𝑢𝑎𝑙𝑙D_{\mathit{ruall}} contains an update node with key at least xx, and hence with key at least kk. ∎

The remaining lemmas relate to the value computed for p0p_{0} from lines 217 to 237. We use the following definitions for all these lemmas. Let kk be the largest key less than yy that is completely present throughout pOppOp. If pOppOp determines a predecessor node 𝑝𝑁𝑜𝑑𝑒\mathit{pNode}^{\prime} on line 221, let CC be the configuration immediately after 𝑝𝑁𝑜𝑑𝑒\mathit{pNode}^{\prime} was announced; otherwise let CC be the configuration immediately after 𝑝𝑁𝑜𝑑𝑒\mathit{pNode} was announced. We let RR and LL refer to pOppOp’s local variables with the same name, and consider their values at various points in the algorithm.

Lemma 5.27.

Suppose I𝑢𝑎𝑙𝑙I𝑛𝑜𝑡𝑖𝑓𝑦(D𝑢𝑎𝑙𝑙D𝑟𝑢𝑎𝑙𝑙)(D𝑛𝑜𝑡𝑖𝑓𝑦D𝑟𝑢𝑎𝑙𝑙)I_{\mathit{uall}}\cup I_{\mathit{notify}}\cup(D_{\mathit{uall}}-D_{\mathit{ruall}})\cup(D_{\mathit{notify}}-D_{\mathit{ruall}}) does not contain an update node with key at least kk. Let kk^{\prime} be any key such that k>kk^{\prime}>k, kRk^{\prime}\in R immediately after line 231, and kSk^{\prime}\in S in some configuration CC^{\prime} after CC. If kk^{\prime} is removed from RR on line 235 in some iteration of the for-loop on line 232, then there exists a key ww, where kw<kk\leq w<k^{\prime}, that is added to RR on line 235 in the same or a later iteration of the for-loop, and wSw\in S in some configuration after CC.

Proof.

So suppose kk^{\prime} is removed from RR on line 235 because there is a DEL node, 𝑑𝑁𝑜𝑑𝑒\mathit{dNode}, with 𝑑𝑁𝑜𝑑𝑒.𝑘𝑒𝑦=k\mathit{dNode}.\mathit{key}=k^{\prime}. Let dOpdOp be the Delete(k)(k^{\prime}) operation that created 𝑑𝑁𝑜𝑑𝑒\mathit{dNode}.

Suppose dOpdOp is linearized before CC^{\prime}. Since kSk^{\prime}\in S in CC^{\prime}, there is an SS-modifying Insert(k)(k^{\prime}) operation, iOpiOp, linearized after dOpdOp but before CC^{\prime}. Let 𝑖𝑁𝑜𝑑𝑒\mathit{iNode} be the INS node created by iOpiOp. If iOpiOp does not notify 𝑝𝑁𝑜𝑑𝑒\mathit{pNode}^{\prime} or 𝑝𝑁𝑜𝑑𝑒\mathit{pNode} by the time pOppOp completes its traversal of the U-ALL, then it follows from Lemma 5.19 that 𝑖𝑁𝑜𝑑𝑒I𝑢𝑎𝑙𝑙\mathit{iNode}\in I_{\mathit{uall}}, contradicting the hypothesis of the lemma. So iOpiOp must notify 𝑝𝑁𝑜𝑑𝑒\mathit{pNode}^{\prime} or 𝑝𝑁𝑜𝑑𝑒\mathit{pNode} before pOppOp completes its traversal of the U-ALL. Since 𝑖𝑁𝑜𝑑𝑒I𝑢𝑎𝑙𝑙I𝑛𝑜𝑡𝑖𝑓𝑦\mathit{iNode}\notin I_{\mathit{uall}}\cup I_{\mathit{notify}}, if iOpiOp notifies 𝑝𝑁𝑜𝑑𝑒\mathit{pNode}, the notification must be rejected, and hence iOpiOp notified pOppOp when 𝑝𝑁𝑜𝑑𝑒.𝑅𝑢𝑎𝑙𝑙𝑃𝑜𝑠𝑖𝑡𝑖𝑜𝑛\mathit{pNode}.\mathit{RuallPosition} pointed to an update node with key greater than kk. It follows that 𝑖𝑁𝑜𝑑𝑒\mathit{iNode} is added to L2L_{2} on line 227. Otherwise iOpiOp does not notify 𝑝𝑁𝑜𝑑𝑒\mathit{pNode}, so it notifies 𝑝𝑁𝑜𝑑𝑒\mathit{pNode}^{\prime}. Then 𝑖𝑁𝑜𝑑𝑒\mathit{iNode} is added to L1L_{1}. Since iOpiOp is linearized after dOpdOp, 𝑖𝑁𝑜𝑑𝑒\mathit{iNode} appears after 𝑑𝑁𝑜𝑑𝑒\mathit{dNode} in LL. Then kk^{\prime} is added to RR on line 233 once pOppOp encounters 𝑖𝑁𝑜𝑑𝑒\mathit{iNode} in LL. So kSk^{\prime}\in S sometime after CC. ∎

So dOpdOp is linearized after CC^{\prime}. Then the second embedded predecessor of 𝑑𝑁𝑜𝑑𝑒\mathit{dNode} returns a value ww where kw<kk\leq w<k^{\prime}, and wSw\in S sometime during the embedded predecessor. So wSw\in S after CC^{\prime}. Then when kk^{\prime} is removed from RR on line 235, ww is added to RR on line 235 in the same iteration. ∎

The following lemma is the main lemma that proves Property 3 is satisfied by pOppOp. It ensures that, for each key inserted into the relaxed binary trie that is not deleted again before the start of pOppOp’s traversal of the relaxed binary trie, pOppOp has a candidate return value at least that key. If the key is not returned by pOppOp’s traversal of the relaxed binary trie and is not the key of an update node that pOppOp collects from the U-ALL or its notify list, then the traversal of the relaxed binary trie returned \bot due to Delete operations that may have been linearized before the start of pOppOp (i.e. Delete operations whose DEL nodes are in D𝑟𝑢𝑎𝑙𝑙D_{\mathit{ruall}}). In this case, p0p_{0} is set to a value at least that key on line 237.

Lemma 5.28.

Suppose an SS-modifying Insert(w)(w) operation iOpiOp is linearized before CTC_{T}, w<yw<y, and there are no SS-modifying Delete(w)(w) operations linearized after iOpiOp and before CTC_{T}. Then either

  • pOppOp’s instance of RelaxedPredecessor(y)(y) returns a value at least ww,

  • I𝑢𝑎𝑙𝑙I𝑛𝑜𝑡𝑖𝑓𝑦(D𝑢𝑎𝑙𝑙D𝑟𝑢𝑎𝑙𝑙)(D𝑛𝑜𝑡𝑖𝑓𝑦D𝑟𝑢𝑎𝑙𝑙)I_{\mathit{uall}}\cup I_{\mathit{notify}}\cup(D_{\mathit{uall}}-D_{\mathit{ruall}})\cup(D_{\mathit{notify}}-D_{\mathit{ruall}}) contains an update node with key at least ww, or

  • p0p_{0} is set to a value at least ww on line 237.

Proof.

If I𝑢𝑎𝑙𝑙I𝑛𝑜𝑡𝑖𝑓𝑦(D𝑢𝑎𝑙𝑙D𝑟𝑢𝑎𝑙𝑙)(D𝑛𝑜𝑡𝑖𝑓𝑦D𝑟𝑢𝑎𝑙𝑙)I_{\mathit{uall}}\cup I_{\mathit{notify}}\cup(D_{\mathit{uall}}-D_{\mathit{ruall}})\cup(D_{\mathit{notify}}-D_{\mathit{ruall}}) contains an update node with key at least ww, the lemma holds. So suppose it does not contain an update node with key at least ww. By Lemma 5.25, iOpiOp's update of the relaxed binary trie does not overlap with pOppOp's traversal of the relaxed binary trie. So wSw\in S throughout pOppOp's traversal of the relaxed binary trie, and ww is completely present throughout this traversal.

We prove that the lemma is true for kk, and hence it is true for wkw\leq k. By the specification of the relaxed binary trie (Lemma 4.22 and Lemma 4.21), if RelaxedPredecessor(y)(y) returns a value other than \bot, it returns a value at least kk.

So suppose RelaxedPredecessor(y)(y) returns \bot. By Lemma 4.22, there is an SS-modifying update operation, uOpuOp, with key xx, where k<x<yk<x<y, whose update to the relaxed binary trie overlaps with pOppOp's traversal of the relaxed binary trie. Since I𝑢𝑎𝑙𝑙I𝑛𝑜𝑡𝑖𝑓𝑦(D𝑢𝑎𝑙𝑙D𝑟𝑢𝑎𝑙𝑙)(D𝑛𝑜𝑡𝑖𝑓𝑦D𝑟𝑢𝑎𝑙𝑙)I_{\mathit{uall}}\cup I_{\mathit{notify}}\cup(D_{\mathit{uall}}-D_{\mathit{ruall}})\cup(D_{\mathit{notify}}-D_{\mathit{ruall}}) does not contain an update node with key at least kk, it follows from Lemma 5.25 that D𝑟𝑢𝑎𝑙𝑙D_{\mathit{ruall}} contains a DEL node with key xx.

So the if-statement of line 217 of PredHelper(y)(y) evaluates to True. Let 𝑖𝑁𝑜𝑑𝑒\mathit{iNode} be the INS node created by iOpiOp. If pOppOp determines a predecessor node 𝑝𝑁𝑜𝑑𝑒\mathit{pNode}^{\prime} on line 221, let CC be the configuration immediately after 𝑝𝑁𝑜𝑑𝑒\mathit{pNode}^{\prime} was announced; otherwise let CC be the configuration immediately after 𝑝𝑁𝑜𝑑𝑒\mathit{pNode} was announced.

Suppose iOpiOp is linearized after CC. If iOpiOp does not notify 𝑝𝑁𝑜𝑑𝑒\mathit{pNode}^{\prime} or 𝑝𝑁𝑜𝑑𝑒\mathit{pNode} by the time pOppOp completes its traversal of the U-ALL, then it follows from Lemma 5.19 that 𝑖𝑁𝑜𝑑𝑒I𝑢𝑎𝑙𝑙\mathit{iNode}\in I_{\mathit{uall}}. Since 𝑖𝑁𝑜𝑑𝑒I𝑢𝑎𝑙𝑙I𝑛𝑜𝑡𝑖𝑓𝑦\mathit{iNode}\notin I_{\mathit{uall}}\cup I_{\mathit{notify}}, iOpiOp must notify 𝑝𝑁𝑜𝑑𝑒\mathit{pNode}^{\prime} or 𝑝𝑁𝑜𝑑𝑒\mathit{pNode} before pOppOp completes its traversal of the U-ALL. Furthermore, if iOpiOp notifies 𝑝𝑁𝑜𝑑𝑒\mathit{pNode}, the notification must be rejected, and hence iOpiOp notified pOppOp while 𝑝𝑁𝑜𝑑𝑒.𝑅𝑢𝑎𝑙𝑙𝑃𝑜𝑠𝑖𝑡𝑖𝑜𝑛\mathit{pNode}.\mathit{RuallPosition} pointed to an update node with key greater than kk. It follows that 𝑖𝑁𝑜𝑑𝑒\mathit{iNode} is added to L2L_{2} on line 227. Otherwise iOpiOp does not notify 𝑝𝑁𝑜𝑑𝑒\mathit{pNode}, so it notifies 𝑝𝑁𝑜𝑑𝑒\mathit{pNode}^{\prime}, and 𝑖𝑁𝑜𝑑𝑒\mathit{iNode} is added to L1L_{1}. In either case, 𝑖𝑁𝑜𝑑𝑒\mathit{iNode} is in LL. So kk is the key of an INS node in LL, and hence kk is added to RR on line 233. By assumption, there are no Delete(k)(k) operations linearized after iOpiOp and before the end of pOppOp's traversal of the relaxed binary trie. Since LL only contains the update nodes of update operations linearized before the start of pOppOp's traversal of the relaxed binary trie, the last update node with key kk in LL is iOpiOp's INS node. So kk is not removed from RR on line 235.

Furthermore, D𝑟𝑢𝑎𝑙𝑙D_{\mathit{ruall}} only contains the update nodes of update operations linearized before the start of pOppOp's traversal of the relaxed binary trie. For contradiction, suppose there is a DEL node in D𝑟𝑢𝑎𝑙𝑙D_{\mathit{ruall}} with key kk. Then pOppOp encountered this DEL node in the RU-ALL and simultaneously set 𝑝𝑁𝑜𝑑𝑒.𝑅𝑢𝑎𝑙𝑙𝑃𝑜𝑠𝑖𝑡𝑖𝑜𝑛\mathit{pNode}.\mathit{RuallPosition} to point to an update node with key kk. From this point on, pOppOp accepts all notifications from Insert operations with key kk. When pOppOp added this DEL node to D𝑟𝑢𝑎𝑙𝑙D_{\mathit{ruall}}, the Delete operation that created it was the latest update operation with key kk. Therefore, iOpiOp was linearized after this point. Hence, either iOpiOp notified pOppOp or pOppOp encountered iOpiOp's INS node when it traversed the U-ALL. This contradicts the fact that 𝑖𝑁𝑜𝑑𝑒I𝑢𝑎𝑙𝑙I𝑛𝑜𝑡𝑖𝑓𝑦\mathit{iNode}\notin I_{\mathit{uall}}\cup I_{\mathit{notify}}. Thus, kk is not removed from RR on line 236.

Now suppose iOpiOp is linearized before CC. So kSk\in S in all configurations between CC and the end of pOppOp's traversal of the relaxed binary trie. Note that CC occurs before the start of the first embedded predecessor operation of any DEL node in D𝑟𝑢𝑎𝑙𝑙D_{\mathit{ruall}}. Recall that there is a DEL node 𝑑𝑁𝑜𝑑𝑒D𝑟𝑢𝑎𝑙𝑙\mathit{dNode}\in D_{\mathit{ruall}} with key xx such that k<x<yk<x<y. The first embedded predecessor of the Delete(x)(x) operation, dOpxdOp_{x}, that created 𝑑𝑁𝑜𝑑𝑒\mathit{dNode} begins after CC. From the code, this embedded predecessor operation completes before 𝑑𝑁𝑜𝑑𝑒\mathit{dNode} is added to the RU-ALL. Since pOppOp added 𝑑𝑁𝑜𝑑𝑒\mathit{dNode} to D𝑟𝑢𝑎𝑙𝑙D_{\mathit{ruall}} while it traversed the RU-ALL, 𝑑𝑁𝑜𝑑𝑒\mathit{dNode} was added to the RU-ALL before pOppOp began its traversal of the relaxed binary trie. The first embedded predecessor of dOpxdOp_{x} returns a value kk^{\prime} such that kk<xk\leq k^{\prime}<x, because kSk\in S throughout its execution interval. This value is added to RR on line 231. So RR contains at least one value that is at least kk at this point.

Since RR contains at least one value that is at least kk before the for-loop on line 232, Lemma 5.27 implies that RR contains a value at least kk after the for-loop. Let k′′k^{\prime\prime} be the smallest value k′′kk^{\prime\prime}\geq k that is in RR immediately before line 236 (i.e. immediately after pOppOp completes its local traversal of LL during the for-loop on line 232). Suppose, for contradiction, that k′′k^{\prime\prime} is removed from RR on line 236. Then there exists a DEL node, 𝑑𝑁𝑜𝑑𝑒D𝑟𝑢𝑎𝑙𝑙\mathit{dNode}^{\prime}\in D_{\mathit{ruall}}, such that 𝑑𝑁𝑜𝑑𝑒.𝑘𝑒𝑦=k′′\mathit{dNode}^{\prime}.\mathit{key}=k^{\prime\prime}. By the definition of CC, the first embedded predecessor of the Delete operation that created 𝑑𝑁𝑜𝑑𝑒\mathit{dNode}^{\prime} occurs after CC. So this embedded predecessor returns a key k′′′k^{\prime\prime\prime} where kk′′′<k′′k\leq k^{\prime\prime\prime}<k^{\prime\prime}. Lemma 5.27 implies that, immediately before line 236, RR contains a key that is at least kk and at most k′′′k^{\prime\prime\prime}. Since k′′′<k′′k^{\prime\prime\prime}<k^{\prime\prime}, this contradicts the definition of k′′k^{\prime\prime}.

Therefore, p0p_{0} is set to a value at least kk on line 237. ∎

It remains to prove that, when p0p_{0} is set to a value kk on line 237 and kk is the largest candidate return value, Property 2 is satisfied. We do this by proving that kSk\in S in CTC_{T}.

Lemma 5.29.

Suppose p0p_{0} is set to a value kk on line 237 and I𝑢𝑎𝑙𝑙I𝑛𝑜𝑡𝑖𝑓𝑦(D𝑢𝑎𝑙𝑙D𝑟𝑢𝑎𝑙𝑙)(D𝑛𝑜𝑡𝑖𝑓𝑦D𝑟𝑢𝑎𝑙𝑙)I_{\mathit{uall}}\cup I_{\mathit{notify}}\cup(D_{\mathit{uall}}-D_{\mathit{ruall}})\cup(D_{\mathit{notify}}-D_{\mathit{ruall}}) does not contain an update node with key at least kk. Then kSk\in S in CTC_{T}.

Proof.

Suppose, for contradiction, that kSk\notin S in CTC_{T}. Let dOpdOp be the SS-modifying Delete(k)(k) operation that last removed kk from SS prior to CTC_{T}. Let 𝑑𝑁𝑜𝑑𝑒\mathit{dNode} be the DEL node created by dOpdOp. Note that no SS-modifying Insert(k)(k) operation is linearized after dOpdOp but before CTC_{T}. If pOppOp determines a predecessor node 𝑝𝑁𝑜𝑑𝑒\mathit{pNode}^{\prime} on line 221, let CC be the configuration immediately after 𝑝𝑁𝑜𝑑𝑒\mathit{pNode}^{\prime} was announced; otherwise let CC be the configuration immediately after 𝑝𝑁𝑜𝑑𝑒\mathit{pNode} was announced.

Since p0p_{0} is set to a value kk on line 237, kk was previously added to RR on line 231, 233, or 235. Consider the last time kk was added to RR. We first prove that dOpdOp is linearized after CC.

Suppose kk was last added to RR on line 231. Then it is the return value of the first embedded predecessor of some DEL node in D𝑟𝑢𝑎𝑙𝑙D_{\mathit{ruall}}. By the definition of CC, this embedded predecessor is performed after CC. So there exists a configuration after CC and during this embedded predecessor in which kSk\in S. It follows that dOpdOp is linearized after CC.

Suppose kk was last added to RR on line 233. Then some SS-modifying Insert(k)(k) operation, iOpiOp, notified 𝑝𝑁𝑜𝑑𝑒\mathit{pNode} or 𝑝𝑁𝑜𝑑𝑒\mathit{pNode}^{\prime}. Furthermore, the INS node, 𝑖𝑁𝑜𝑑𝑒\mathit{iNode}, it created is in the sequence LL. By Lemma 5.19, dOpdOp is linearized after iOpiOp notifies 𝑝𝑁𝑜𝑑𝑒\mathit{pNode} or 𝑝𝑁𝑜𝑑𝑒\mathit{pNode}^{\prime}, which is after CC.

Suppose kk was last added to RR on line 235. Then it is the return value of the second embedded predecessor of some DEL node in LL whose key is in RR. By Lemma 5.27, dOpdOp is linearized after CC.

In any case, dOpdOp is linearized after CC. If dOpdOp is linearized after pOppOp sets 𝑅𝑢𝑎𝑙𝑙𝑃𝑜𝑠𝑖𝑡𝑖𝑜𝑛\mathit{RuallPosition} to point to an update node with key less than kk, then by Lemma 5.21, I𝑢𝑎𝑙𝑙I𝑛𝑜𝑡𝑖𝑓𝑦(D𝑢𝑎𝑙𝑙D𝑟𝑢𝑎𝑙𝑙)(D𝑛𝑜𝑡𝑖𝑓𝑦D𝑟𝑢𝑎𝑙𝑙)I_{\mathit{uall}}\cup I_{\mathit{notify}}\cup(D_{\mathit{uall}}-D_{\mathit{ruall}})\cup(D_{\mathit{notify}}-D_{\mathit{ruall}}) contains an update node with key at least kk, a contradiction. So dOpdOp is linearized before pOppOp sets 𝑅𝑢𝑎𝑙𝑙𝑃𝑜𝑠𝑖𝑡𝑖𝑜𝑛\mathit{RuallPosition} to point to an update node with key less than kk. Suppose dOpdOp does not notify 𝑝𝑁𝑜𝑑𝑒\mathit{pNode} or 𝑝𝑁𝑜𝑑𝑒\mathit{pNode}^{\prime} prior to pOppOp setting 𝑅𝑢𝑎𝑙𝑙𝑃𝑜𝑠𝑖𝑡𝑖𝑜𝑛\mathit{RuallPosition} to point to an update node with key less than kk. Then 𝑑𝑁𝑜𝑑𝑒\mathit{dNode} is encountered when pOppOp traverses the RU-ALL, and hence added to D𝑟𝑢𝑎𝑙𝑙D_{\mathit{ruall}}. Then kk is removed from RR on line 236. This contradicts the fact that p0p_{0} is set to the value kk on line 237. So dOpdOp does notify 𝑝𝑁𝑜𝑑𝑒\mathit{pNode} or 𝑝𝑁𝑜𝑑𝑒\mathit{pNode}^{\prime} prior to pOppOp setting 𝑅𝑢𝑎𝑙𝑙𝑃𝑜𝑠𝑖𝑡𝑖𝑜𝑛\mathit{RuallPosition} to point to an update node with key less than kk. Since no SS-modifying Insert(k)(k) operations are linearized after dOpdOp but before CTC_{T}, 𝑑𝑁𝑜𝑑𝑒\mathit{dNode} is the last update node in LL with key kk. It follows that kk is removed from RR on line 235 and not later added back on line 233. This contradicts the fact that p0p_{0} is set to the value kk on line 237. ∎

Therefore, pOppOp's candidate return values satisfy all of the required properties. It follows from Theorem 5.10 that if pOppOp returns wU{1}w\in U\cup\{-1\}, then there exists a configuration during pOppOp in which ww is the predecessor of yy. So our implementation of a lock-free binary trie is linearizable with respect to all operations.

5.4 Amortized Analysis

In this section, we give the amortized analysis of our implementation. The amortized analysis is simple because the algorithms to update and traverse the binary trie component of our data structure are wait-free. All other steps traverse and update the lock-free linked lists.

Lemma 5.30.

Search(x)(x) operations have O(1)O(1) worst-case step complexity.

Proof.

A Search(x)(x) operation simply finds the first activated update node in latest[x]\textit{latest}[x]. From the pseudocode, this always completes in a constant number of reads. ∎
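
To illustrate the constant-cost pattern used in this proof, the following C++-style sketch reads the head of latest[x]\textit{latest}[x] and inspects at most two update nodes. This is our own illustration, not the paper's pseudocode: the type and field names (UpdateNode, activated, isInsert, next) are placeholders, and the sketch assumes that the second node in latest[x]\textit{latest}[x] is always activated.

    #include <atomic>
    #include <cstddef>

    constexpr std::size_t U = std::size_t(1) << 20;  // example universe size (assumption)

    // Hypothetical layout of an update node; field names are illustrative only.
    struct UpdateNode {
        std::atomic<bool> activated;   // assumed: set once the node is activated
        bool isInsert;                 // assumed: true for INS nodes, false for DEL nodes
        UpdateNode* next;              // next update node in latest[x]
    };

    std::atomic<UpdateNode*> latest[U];  // one list of update nodes per key

    // Search(x): report membership of x according to the first activated update
    // node in latest[x]. At most two nodes are read, so the worst-case step
    // complexity is O(1).
    bool Search(std::size_t x) {
        UpdateNode* head = latest[x].load();
        if (head->activated.load())
            return head->isInsert;
        return head->next->isInsert;     // assumed: the second node is activated
    }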

We next consider the amortized step complexity of Insert and Delete operations, uOpuOp, while ignoring the steps taken in embedded Predecessor operations.

Lemma 5.31.

Each Insert or Delete operation, uOpuOp, has O(c˙(uOp)+logu)O(\dot{c}(uOp)+\log u) amortized step complexity, ignoring all instances of NotifyPredOps and PredHelper (i.e. the embedded predecessors performed by Delete operations).

Proof.

Let uOpuOp be an Insert or Delete operation. It follows from the pseudocode of InsertBinaryTrie and DeleteBinaryTrie that uOpuOp performs a constant number of steps at each binary trie node on the path from a leaf to the root, which has length log2u+1\lceil\log_{2}u\rceil+1.

Recall that inserting into or deleting from a lock-free linked list can be done with amortized step complexity O(c˙(uOp)+L(uOp))O(\dot{c}(uOp)+L(uOp)), where L(uOp)L(uOp) is the number of nodes in the linked list at the start of uOpuOp. The number of nodes in the P-ALL, U-ALL, and RU-ALL at the start of uOpuOp is O(c˙(uOp))O(\dot{c}(uOp)), so uOpuOp can update these lock-free linked lists in O(c˙(uOp))O(\dot{c}(uOp)) amortized steps. The number of nodes that uOpuOp encounters while traversing the P-ALL and U-ALL is O(c¯(uOp))O(\bar{c}(uOp)), and it is known that, for any execution α\alpha, opαc¯(op)opα2c˙(op)\sum_{op\in\alpha}\bar{c}(op)\leq\sum_{op\in\alpha}2\dot{c}(op). So uOpuOp can traverse the lock-free linked lists in O(c˙(uOp))O(\dot{c}(uOp)) amortized steps. All other parts of Insert and Delete take a constant number of steps. ∎
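
For concreteness, here is a minimal sketch (ours, not the paper's pseudocode) of the leaf-to-root walk counted in this proof, written for a plain binary trie whose nodes hold single bits. The relaxed-trie nodes in the actual implementation carry additional per-node information, but the walk visits the same path of ⌈log₂u⌉+1 nodes and does a constant amount of work at each.

    #include <cstddef>
    #include <vector>

    // Illustrative only: a binary trie whose nodes store single bits, indexed
    // as D[i][x] for the length-i prefix x of a key (D[b] holds the leaves).
    struct BitTrie {
        int b;                               // b = ceil(log2 u)
        std::vector<std::vector<char>> D;    // D[i] has 2^i entries

        explicit BitTrie(int bits) : b(bits), D(bits + 1) {
            for (int i = 0; i <= b; ++i) D[i].assign(std::size_t(1) << i, 0);
        }

        // Leaf-to-root insert: one write per level, O(log u) steps in total.
        void insert(std::size_t x) {
            for (int i = b; i >= 0; --i) D[i][x >> (b - i)] = 1;
        }

        // Leaf-to-root delete: clear a node only while both of its children are 0.
        void remove(std::size_t x) {
            D[b][x] = 0;
            for (int i = b - 1; i >= 0; --i) {
                std::size_t p = x >> (b - i);
                if (D[i + 1][2 * p] || D[i + 1][2 * p + 1]) break;
                D[i][p] = 0;
            }
        }
    };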

We next consider the number of steps taken during instances of NotifyPredOps, which add a notify node to the notify list of every predecessor node in the P-ALL.

Lemma 5.32.

In any execution α\alpha, the total number of steps taken by instances of NotifyPredOps is O(opαc˙(op)2)O(\sum_{op\in\alpha}\dot{c}(op)^{2}).

Proof.

Let opop be an update operation invoking NotifyPredOps(𝑢𝑁𝑜𝑑𝑒)(\mathit{uNode}). Inserting a notify node at the head of 𝑝𝑁𝑜𝑑𝑒.𝑛𝑜𝑡𝑖𝑓𝑦𝐿𝑖𝑠𝑡\mathit{pNode}.\mathit{notifyList}, for some predecessor node 𝑝𝑁𝑜𝑑𝑒\mathit{pNode} created by a Predecessor operation pOppOp, takes O(c˙(C))O(\dot{c}(C)) amortized steps, where CC is the configuration immediately after the successful CAS that adds the new notify node. The operation performing the successful CAS pays for the at most c˙(C)1\dot{c}(C)-1 unsuccessful CAS steps it causes. We let opop pay for inserting into 𝑝𝑁𝑜𝑑𝑒.𝑛𝑜𝑡𝑖𝑓𝑦𝐿𝑖𝑠𝑡\mathit{pNode}.\mathit{notifyList} if pOppOp is active at the start of opop. This costs O(c˙(op)2)O(\dot{c}(op)^{2}) amortized steps in total, since there are O(c˙(op))O(\dot{c}(op)) such operations pOppOp. If pOppOp is invoked after the start of opop, pOppOp helps opop pay for inserting into 𝑝𝑁𝑜𝑑𝑒.𝑛𝑜𝑡𝑖𝑓𝑦𝐿𝑖𝑠𝑡\mathit{pNode}.\mathit{notifyList}. There are O(c˙(pOp))O(\dot{c}(pOp)) operations concurrent with pOppOp when it is invoked, so a total of O(c˙(pOp)2)O(\dot{c}(pOp)^{2}) amortized steps are charged to pOppOp. Therefore, for any execution α\alpha, the total number of steps taken by instances of NotifyPredOps is O(opαc˙(op)2)O(\sum_{op\in\alpha}\dot{c}(op)^{2}). ∎
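
The cost charged in this proof is that of the usual CAS-retry loop for pushing a node onto the head of a singly linked list. A minimal sketch of that pattern follows; the NotifyNode fields and the notifyList name are our placeholders, and the paper's NotifyPredOps also records per-notification information that is omitted here.

    #include <atomic>

    // Hypothetical notify node; fields are illustrative only.
    struct NotifyNode {
        int key;            // key of the update operation sending the notification
        NotifyNode* next;   // next notify node in pNode.notifyList
    };

    // Push nNode onto the head of a predecessor node's notify list. Each failed
    // CAS is caused by another operation's successful CAS, which is how the
    // proof charges the unsuccessful CAS steps to the operations that succeed.
    void pushNotifyNode(std::atomic<NotifyNode*>& notifyList, NotifyNode* nNode) {
        NotifyNode* head = notifyList.load();
        do {
            nNode->next = head;   // link to the current head
        } while (!notifyList.compare_exchange_weak(head, nNode));
        // compare_exchange_weak reloads 'head' on failure, so the loop retries
        // against the new head until the CAS succeeds.
    }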

Lemma 5.33.

The amortized number of steps taken by instances pOppOp of PredHelper is O(c˙(pOp)2+c~(pOp)+logu)O(\dot{c}(pOp)^{2}+\tilde{c}(pOp)+\log u).

Proof.

Adding pOppOp's predecessor node to the P-ALL takes O(c˙(pOp))O(\dot{c}(pOp)) amortized steps. Performing RelaxedPredecessor(y)(y) takes O(logu)O(\log u) steps in the worst case. Traversing the P-ALL, U-ALL, and RU-ALL takes O(c˙(pOp))O(\dot{c}(pOp)) amortized steps. Traversing its own notify list takes O(c¯(pOp))=O(c˙(pOp))O(\bar{c}(pOp))=O(\dot{c}(pOp)) amortized steps because it contains O(c¯(pOp))O(\bar{c}(pOp)) notify nodes. Recall that O(c˙(pOp)2)O(\dot{c}(pOp)^{2}) steps are charged to pOppOp by instances of NotifyPredOps. The steps in the if-block from line 217 to line 237 include traversing the 𝑛𝑜𝑡𝑖𝑓𝑦𝐿𝑖𝑠𝑡\mathit{notifyList} of some concurrent Delete operation dOpdOp (on line 222). The length of this 𝑛𝑜𝑡𝑖𝑓𝑦𝐿𝑖𝑠𝑡\mathit{notifyList} is O(c¯(dOp))O(\bar{c}(dOp)), so pOppOp takes O(c~(pOp))O(\tilde{c}(pOp)) steps to traverse it. Furthermore, pOppOp takes O(c¯(pOp))O(\bar{c}(pOp)) steps to traverse the notify list of its own predecessor node on line 225. In summary, the total amortized cost is O(c˙(pOp)2+c~(pOp)+logu)O(\dot{c}(pOp)^{2}+\tilde{c}(pOp)+\log u). ∎
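
For reference, the accounting in this proof can be summarized term by term as follows (our summary), where the O(\bar{c}(pOp)) costs of the notify-list traversals are amortized to O(\dot{c}(pOp)) as noted above:

    O(\dot{c}(pOp)) \;[\text{announcing and traversing the P-ALL, U-ALL, RU-ALL and notify lists}]
    {}+ O(\log u) \;[\text{RelaxedPredecessor}(y)]
    {}+ O(\dot{c}(pOp)^{2}) \;[\text{charged by instances of NotifyPredOps}]
    {}+ O(\tilde{c}(pOp)) \;[\text{traversing } dOp\text{'s notify list on line 222}]
    {}= O(\dot{c}(pOp)^{2} + \tilde{c}(pOp) + \log u).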

The total amortized number of steps taken by each operation opop is summarized below.

Theorem 5.34.

There is a linearizable, lock-free implementation of a binary trie for a universe of size uu supporting Search with O(1)O(1) worst-case step complexity, Delete and Predecessor with O(c˙(op)2+c~(op)+logu)O(\dot{c}(op)^{2}+\tilde{c}(op)+\log u) amortized step complexity, and Insert with O(c˙(op)2+logu)O(\dot{c}(op)^{2}+\log u) amortized step complexity.

6 Conclusion and Future Work

The main contribution of our paper is a deterministic, lock-free implementation of a binary trie using read, write, CAS, and AND operations. We prove that the implementation is linearizable. We show that it supports Search with O(1)O(1) worst-case step complexity, Delete and Predecessor with O(c˙(op)2+c~(op)+logu)O(\dot{c}(op)^{2}+\tilde{c}(op)+\log u) amortized step complexity, and Insert with O(c˙(op)2+logu)O(\dot{c}(op)^{2}+\log u) amortized step complexity.

The implementation uses a relaxed binary trie as one of its components. All update operations on the relaxed binary trie take O(logu)O(\log u) steps in the worst case. Each Predecessor operation on the relaxed binary trie also takes O(logu)O(\log u) steps in the worst case, since it can complete without helping concurrent update operations.

It is possible to extend our lock-free binary trie to support Max, which returns the largest key in SS. This can be done by extending the binary trie to represent an additional key \infty that is larger than all keys in UU, and then performing Predecessor()(\infty). By symmetry, Successor and Min can also be supported.
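
As a sketch of this reduction (the names and the Trie interface are assumptions of ours): if the data structure is built over the extended universe {0, …, u}, where the extra key u plays the role of ∞ and is never inserted, then Max is a single Predecessor call.

    // Hypothetical wrapper: Trie stands in for an implementation over the
    // extended universe {0, ..., u}, where key u acts as infinity and is
    // never inserted.
    template <typename Trie>
    long long Max(Trie& trie, unsigned long long u) {
        return trie.Predecessor(u);   // largest key in S, or -1 if S is empty
    }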

Our lock-free binary trie is in the process of being implemented. The implementation uses a version of epoch-based memory reclamation based on DEBRA [7] to avoid ABA problems when accessing dynamically allocated objects. Its performance will be compared to that of other lock-free data structures supporting Predecessor.

In our lock-free binary trie, predecessor operations get information about update operations that announce themselves in the update announcement linked list. Predecessor operations also announce themselves in the predecessor announcement linked list, so that update operations can give them information. There is an amortized cost of O(c˙(op)2)O(\dot{c}(op)^{2}) for an update operation, opop, to give information to all predecessor operations. We would like to obtain a more efficient algorithm for doing this, which would result in a more efficient implementation of a lock-free binary trie.

A sequential van Emde Boas trie supports Search, Insert, Delete, and Predecessor in O(loglogu)O(\log\log u) worst-case time. We conjecture that there is a lock-free implementation supporting operations with O(c˙(op)2+c~(op)+loglogu)O(\dot{c}(op)^{2}+\tilde{c}(op)+\log\log u) amortized step complexity. Since the challenges are similar, we believe our techniques for implementing a lock-free binary trie will be useful for implementing a lock-free van Emde Boas trie. In particular, using an implementation of a relaxed van Emde Boas trie should be a good approach.

References

  • [1] Maya Arbel-Raviv and Trevor Brown. Reuse, don’t recycle: Transforming lock-free algorithms that throw away descriptors. In 31st International Symposium on Distributed Computing, volume 91 of LIPIcs, pages 4:1–4:16, 2017.
  • [2] Greg Barnes. A method for implementing lock-free shared-data structures. In Lawrence Snyder, editor, Proceedings of the 5th Annual ACM Symposium on Parallel Algorithms and Architectures, SPAA ’93, pages 261–270. ACM, 1993.
  • [3] Paul Bieganski, John Riedl, John V. Carlis, and Ernest F. Retzel. Generalized suffix trees for biological sequence data: Applications and implementation. In 27th Annual Hawaii International Conference on System Sciences (HICSS-27), pages 35–44. IEEE Computer Society, 1994.
  • [4] Guy E. Blelloch and Yuanhao Wei. LL/SC and atomic copy: Constant time, space efficient implementations using only pointer-width CAS. In 34th International Symposium on Distributed Computing, DISC 2020, pages 5:1–5:17, 2020.
  • [5] Anastasia Braginsky and Erez Petrank. A lock-free b+tree. In 24th ACM Symposium on Parallelism in Algorithms and Architectures, SPAA ’12, pages 58–67. ACM, 2012.
  • [6] Trevor Brown. B-slack trees: Space efficient b-trees. In Algorithm Theory - SWAT 2014 - 14th Scandinavian Symposium and Workshops, volume 8503 of Lecture Notes in Computer Science, pages 122–133, 2014.
  • [7] Trevor Brown. Reclaiming memory for lock-free data structures: There has to be a better way. In Proceedings of the 2015 ACM Symposium on Principles of Distributed Computing, PODC 2015, pages 261–270, 2015.
  • [8] Trevor Brown. Techniques for Constructing Efficient Lock-Free Data Structures. PhD thesis, Department of Computer Science, University of Toronto, 2017.
  • [9] Trevor Brown and Hillel Avni. Range queries in non-blocking k-ary search trees. In Principles of Distributed Systems, 16th International Conference, OPODIS 2012, volume 7702 of Lecture Notes in Computer Science, pages 31–45. Springer, 2012.
  • [10] Trevor Brown, Faith Ellen, and Eric Ruppert. Pragmatic primitives for non-blocking data structures. In ACM Symposium on Principles of Distributed Computing, PODC ’13, pages 13–22. ACM, 2013.
  • [11] Trevor Brown, Faith Ellen, and Eric Ruppert. A general technique for non-blocking trees. In Proceedings of the Symposium on Principles and Practice of Parallel Programming (PPoPP), pages 329–342, 2014.
  • [12] Trevor Brown, Aleksandar Prokopec, and Dan Alistarh. Non-blocking interpolation search trees with doubly-logarithmic running time. In PPoPP ’20: 25th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, pages 276–291. ACM, 2020.
  • [13] Bapi Chatterjee. Lock-free linearizable 1-dimensional range queries. In Proceedings of the 18th International Conference on Distributed Computing and Networking, page 9. ACM, 2017.
  • [14] Bapi Chatterjee, Nhan Nguyen Dang, and Philippas Tsigas. Efficient lock-free binary search trees. In Proceedings of the ACM Symposium on Principles of Distributed Computing (PODC), pages 322–331, 2014.
  • [15] Mikael Degermark, Andrej Brodnik, Svante Carlsson, and Stephen Pink. Small forwarding tables for fast routing lookups. In Proceedings of the ACM SIGCOMM 1997 Conference on Applications, Technologies, Architectures, and Protocols for Computer Communication, pages 3–14. ACM, 1997.
  • [16] Dana Drachsler, Martin T. Vechev, and Eran Yahav. Practical concurrent binary search trees via logical ordering. In ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, PPoPP ’14, pages 343–356. ACM, 2014.
  • [17] Faith Ellen, Panagiota Fatourou, Joanna Helga, and Eric Ruppert. The amortized complexity of non-blocking binary search trees. In Proceedings of the Symposium on Principles of Distributed Computing (PODC), pages 332–340, 2014.
  • [18] Faith Ellen, Panagiota Fatourou, Eric Ruppert, and Franck van Breugel. Non-blocking binary search trees. In Proceedings of the 29th Annual ACM Symposium on Principles of Distributed Computing (PODC), pages 131–140, 2010.
  • [19] Panagiota Fatourou, Nikolaos D. Kallimanis, and Eleni Kanellou. An efficient universal construction for large objects. In 23rd International Conference on Principles of Distributed Systems, OPODIS 2019, volume 153 of LIPIcs, pages 18:1–18:15, 2019.
  • [20] Panagiota Fatourou, Elias Papavasileiou, and Eric Ruppert. Persistent non-blocking binary search trees supporting wait-free range queries. In The 31st ACM on Symposium on Parallelism in Algorithms and Architectures, SPAA 2019, pages 275–286. ACM, 2019.
  • [21] Mikhail Fomitchev and Eric Ruppert. Lock-free linked lists and skip lists. In Proceedings of the Twenty-Third Annual ACM Symposium on Principles of Distributed Computing (PODC), pages 50–59, 2004.
  • [22] George Giakkoupis, Mehrdad Jafari Giv, and Philipp Woelfel. Efficient randomized DCAS. In STOC ’21: 53rd Annual ACM SIGACT Symposium on Theory of Computing, pages 1221–1234, 2021.
  • [23] Wojciech M. Golab, Lisa Higham, and Philipp Woelfel. Linearizable implementations do not suffice for randomized distributed computation. In Proceedings of the 43rd ACM Symposium on Theory of Computing, STOC 2011, pages 373–382, 2011.
  • [24] Timothy L. Harris. A pragmatic implementation of non-blocking linked-lists. In Distributed Computing, 15th International Conference, DISC 2001, pages 300–314, 2001.
  • [25] Timothy L. Harris, Keir Fraser, and Ian A. Pratt. A practical multi-word compare-and-swap operation. In Distributed Computing, 16th International Conference, DISC 2002, volume 2508 of Lecture Notes in Computer Science, pages 265–279. Springer, 2002.
  • [26] Maurice Herlihy. Wait-free synchronization. ACM Trans. Program. Lang. Syst., 13(1):124–149, 1991.
  • [27] Maurice Herlihy. A methodology for implementing highly concurrent objects. ACM Trans. Program. Lang. Syst., 15(5):745–770, 1993.
  • [28] Maurice Herlihy and Jeannette M. Wing. Linearizability: A correctness condition for concurrent objects. Trans. Program. Lang. Syst., 12(3):463–492, 1990.
  • [29] Shane V. Howley and Jeremy Jones. A non-blocking internal binary search tree. In 24th ACM Symposium on Parallelism in Algorithms and Architectures, SPAA ’12, pages 161–171. ACM, 2012.
  • [30] Jeremy Ko. The Amortized Analysis of a Non-blocking Chromatic Tree. In 22nd International Conference on Principles of Distributed Systems (OPODIS 2018), volume 125, pages 8:1–8:17, 2018.
  • [31] Miguel A. Martínez-Prieto, Nieves R. Brisaboa, Rodrigo Cánovas, Francisco Claude, and Gonzalo Navarro. Practical compressed string dictionaries. Inf. Syst., 56:73–108, 2016.
  • [32] Aravind Natarajan and Neeraj Mittal. Fast concurrent lock-free binary search trees. In ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, PPoPP ’14, pages 317–328. ACM, 2014.
  • [33] Rotem Oshman and Nir Shavit. The skiptrie: low-depth concurrent search without rebalancing. In ACM Symposium on Principles of Distributed Computing, PODC ’13, pages 23–32, 2013.
  • [34] Erez Petrank and Shahar Timnat. Lock-free data-structure iterators. In Distributed Computing - 27th International Symposium, DISC 2013, volume 8205 of Lecture Notes in Computer Science, pages 224–238. Springer, 2013.
  • [35] Aleksandar Prokopec, Nathan Grasso Bronson, Phil Bagwell, and Martin Odersky. Concurrent tries with efficient non-blocking snapshots. In Proceedings of the 17th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, PPOPP 2012, pages 151–160. ACM, 2012.
  • [36] William W. Pugh. Skip lists: A probabilistic alternative to balanced trees. In Algorithms and Data Structures, Workshop WADS, volume 382 of Lecture Notes in Computer Science, pages 437–449. Springer, 1989.
  • [37] Niloufar Shafiei. Non-blocking patricia tries with replace operations. In IEEE 33rd International Conference on Distributed Computing Systems, ICDCS 2013, pages 216–225. IEEE Computer Society, 2013.
  • [38] Niloufar Shafiei. Non-blocking doubly-linked lists with good amortized complexity. In Proceedings of the 19th International Conference on Principles of Distributed Systems (OPODIS), pages 35:1–35:17, 2015.
  • [39] Ori Shalev and Nir Shavit. Split-ordered lists: lock-free extensible hash tables. In Proceedings of the Twenty-Second ACM Symposium on Principles of Distributed Computing, PODC 2003, pages 102–111. ACM, 2003.
  • [40] John D. Valois. Lock-free linked lists using compare-and-swap. In Proceedings of the Fourteenth Annual ACM Symposium on Principles of Distributed Computing, pages 214–222, 1995.
  • [41] Peter van Emde Boas. Preserving order in a forest in less than logarithmic time and linear space. Inf. Process. Lett., 6(3):80–82, 1977.
  • [42] Peter van Emde Boas, R. Kaas, and E. Zijlstra. Design and implementation of an efficient priority queue. Math. Syst. Theory, 10:99–127, 1977.
  • [43] Yuanhao Wei, Naama Ben-David, Guy E. Blelloch, Panagiota Fatourou, Eric Ruppert, and Yihan Sun. Constant-time snapshots with applications to concurrent data structures. In PPoPP ’21: 26th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, pages 31–46, 2021.
  • [44] Dan E. Willard. Log-logarithmic worst-case range queries are possible in space theta(n). Inf. Process. Lett., 17(2):81–84, 1983.