Scalable Nanophotonic-Electronic Spiking Neural Networks

Luis El Srouji, Yun-Jhu Lee, Mehmet Berkay On, Li Zhang, and S.J. Ben Yoo

Abstract

Spiking neural networks (SNN) provide a new computational paradigm capable of highly parallelized, real-time processing. Photonic devices are ideal for the design of high-bandwidth, parallel architectures matching the SNN computational paradigm. Co-integration of CMOS and photonic elements allow low-loss photonic devices to be combined with analog electronics for greater flexibility of nonlinear computational elements. As such, we designed and simulated an optoelectronic spiking neuron circuit on a monolithic silicon photonics (SiPh) process that replicates useful spiking behaviors beyond the leaky integrate-and-fire (LIF). Additionally, we explored two learning algorithms with the potential for on-chip learning using Mach-Zehnder Interferometric (MZI) meshes as synaptic interconnects. A variation of Random Backpropagation (RPB) was experimentally demonstrated on-chip and matched the performance of a standard linear regression on a simple classification task. Meanwhile, the Contrastive Hebbian Learning (CHL) rule was applied to a simulated neural network composed of MZI meshes for a random input-output mapping task. The CHL-trained MZI network performed better than random guessing but does not match the performance of the ideal neural network (without the constraints imposed by the MZI meshes). Through these efforts, we demonstrate that co-integrated CMOS and SiPh technologies are well-suited to the design of scalable SNN computing architectures.

Index Terms:

neuromorphic computing, spiking neural networks, nanophotonics, photonic integrated circuits, silicon photonics.

I Introduction

Computation using spiking neural networks (SNN) yields three major architectural advantages: (1) the sparsity of communication between elements which reduces energy cost, (2) the binarization of communication without discretization of messages (i.e. all-or-nothing spike responses), and (3) completely asynchronous operation of computational units. At the architectural level, the spiking paradigm requires several computational elements in common to the traditional artificial neural network (ANN)—weighted addition, nonlinearity, and learning algorithms—though with the additional complexity of computation spread through time. Traditional computational approaches based on the von Neumann computing architecture—including modern system architectures equipped with graphical processing units (GPUs)—are not well-suited for the use of this computational paradigm due to the fundamental separation between computing and memory units and resulting serialization of many processing tasks. In turn, the traditional computing paradigm cannot efficiently support the requisite computational elements without significant simplification or long latencies thus warranting the development of new computer architectures. Neuromorphic design operates under the general principle that evolution has already produced a successful SNN architecture for operating under real-time, low-power conditions. Approaches to replicating this design employ a variety of digital, analog, or mixed-signal circuits that can be based on electronic, photonic, or optoelectronic devices. Nonetheless, substantially more work is necessary to determine the optimal approach to abstract, apply, and improve upon this evolutionary design.

Digital neuromorphic processors (such as TrueNorth[1], Loihi[2], SpiNNaker[3], etc.) increase the parallelization of processing by including a large number of cores that allow asynchronous computation—in contrast to GPU architectures—though this approach is not unlike a specialized and monolithic form of cluster computing. Though each core completes its operation in parallel, a desire for determinism in digital electronics necessitates synchronization between simulated time steps. This, in turn, limits full asynchronous operation which may prove to be prohibitive at biological network scales. On the other hand, analog electronic meshes can provide fully parallel computation, though the capacitance of electrical wire networks causes increases to both latency and power consumption.

Photonic and optical computing efforts have sought to exploit the nearly lossless and parallel communication capabilities of optical fibers into the domain of photonic integrated circuits (PICs). A number of demonstrations have already shown matrix multiplication and convolutional processing using non-spiking photonic circuits [4, 5, 6]. These devices use a combination of wavelength-division multiplexing (WDM) and space-division multiplexing (SDM) to manage multiply-and-accumulate (MAC) operations in parallel; thus, these schemes are also compatible with spike processing in synaptic networks. Choices of nonlinearity in spiking elements varies widely from one approach to another, though a major division can be made between all-optical and optoelectronic approaches. Optical nonlinearities typically have shorter lifetimes and can potentially service higher speed computation compared to electronic nonlinearities based on electronic charges or currents. However, the manipulation of these nonlinearities is governed mainly by material properties which are fixed after fabrication. Given that biological neural networks operate over a range of time-scales, it is preferable to have programmable elements in the neuron design. Optoelectronic approaches can take advantage of recent progress in the co-integration of CMOS circuitry with photonic devices to form flexible and programmable spiking neuromorphic computers.

In addition to the architectural benefits, SNNs offer provable advantages in solving graph algorithms, constraint satisfaction, and other optimization problems [7, 8, 9, 10]. Incorporating learning and training using Hebbian[11] and spike-timing-dependent plasticity (STDP)[12] algorithms also allows for the application of SNNs in many of the same contexts as deep neural networks (DNN). These learning rules have the additional architectural advantage of using only locally available information for the updating of each synapse. In principle, this means that all weight updates within the network can be calculated completely in parallel. With the appropriate network topology and training signals, Hebbian learning has also been shown capable of error-driven learning equivalent to backpropagation in deep and convolutional neural networks of moderate size [13, 14].

In this paper we will discuss the design of a nanophotonic-electronic neuromorphic architecture for native SNN computation with on-chip learning. Sec. II will provide a brief taxonomy of existing photonic and optoelectronic approaches to spiking neuron and optical matrix multiplication. Next, Sec. III will discuss the technologies and algorithms used, while addressing scalability and remaining design challenges. Finally, Sec. IV will detail future directions and perspectives for the design of photonic neuromorphic processors.

II Background and Survey

Spiking neural networks require two primary computational elements: (i) a nonlinear spiking unit that can integrate its inputs over time (the neuron) and (ii) a reconfigurable network to service weighted connections between these elements (the synaptic network). As previously alluded, the nonlinearities exploited for the design of spiking units can vary between all-optical and optoelectronic approaches, the choice of which can limit the choice of network elements to service communications between units.

II-A Spiking Nonlinearity

Excitability describes the ability of a system to quickly and temporarily deviate from its quiescent state following small perturbations and can be rigorously described through bifurcation analysis as done by Izhikevich [15]. Biological neurons are dynamical systems and have been classified into saddle-node and Andronov-Hopf bifurcations which correspond to integrator and resonator neurons respectively. Simply put, integrator neurons integrate their inputs and will generate a spike upon reaching some dynamic threshold, while a resonator neuron undergoes some internal subthreshold oscillation with an increased response and likelihood to generate a spike for inputs that fall at specific phases of a resonant frequency.

Computationally useful spiking neurons, however, need not be entirely biologically plausible. Instead, behavior is commonly summarized by the leaky-integrate-and-fire (LIF) neuron model. In the LIF model, the membrane potential constantly undergoes exponential decay towards its resting potential with discrete jumps at each input spike. When the membrane potential reaches a fixed threshold, the spike is generated and the potential is instantaneously returned to a reset potential. LIF neurons are only able to represent integrator neurons and lose much of the complexity of behaviors seen in biological neurons. Alternatively, Izhikevich devised a neuron model which faithfully reproduces a wide range of biologically observed behaviors using only four parameters and two coupled differential equations [16]. For a brief summary of computationally relevant neuron behaviors and a comparison of neuron models see [17]. Other taxonomies exist to classify neuron types according to these behaviors, though some evidence has shown that biological neurons may flexibly switch between these types based on the history of the cell [18]. As such, an ideal hardware implementation of spiking neurons would be capable of representing a range of neuron types for maximal computational ability.

A number of semiconductor lasers have been explored which create isomorphisms between the time dynamics of material parameters of active photonic elements and the cellular mechanisms of biological neurons. Researchers have exploited the time dynamics of photocarriers, thermal diffusion, optical modes, and polarization competition to create excitable laser devices with varying degrees of faithfulness to the biology. Photonic spiking neurons can be most meaningfully divided into two categories based on whether the device can accept optical or electrical inputs—some devices can be modulated by either, but electrical input may be preferred for the advantages in system design discussed in Sec. II-B.

Optical devices can be further classified into coherent and incoherent devices based on how incoming wavelengths are used to excite the active medium. In coherent excitable semiconductor lasers [19, 20, 21, 22, 23], the incoming signal interacts with a lasing cavity mode on the same wavelength to modulate the output signal directly. Excitability is induced by disturbing the balance between competing modes or polarizations which, with sufficient input energy, temporarily drive the extinction of one mode and amplification of the other. Bandwidth for such devices is bound by the cavity Q factor, with a time constant for energy dissipation given by $\tau=Q/\omega_{0}$ . For incoherent devices [24, 25, 26, 27] the incoming signal interacts with some element within the cavity that indirectly modulates the output signal. This may take the form of optical pumping of the laser medium, or otherwise modulating the carrier populations which affect gain and saturation properties. Bandwidth for such approaches are limited by the dynamics of these carrier populations which are material dependent. Alternatively, optoelectronic approaches [28, 29] can allow for the design of analog circuitry with time-dynamics that can be fit to a variety of available neuron models, with lasers modulated by current injection in response to processed photodetector input. Optoelectronic designs are mainly limited by the total bandwidth of integrated photodetectors and electronics, though some estimates suggest that bandwidths upwards of 10 GHz can be expected; see [30] for a more in-depth review of various excitable semiconductor lasers with discussions of bifurcation paralleling Izhikevich’s analysis.

II-B Reconfigurable Networks

Given the ability of silicon waveguides to simultaneously support a wide range of wavelengths with negligible loss, on-chip optical networks are most efficiently parallelized using wavelength division multiplexing (WDM). Time-division multiplexing (TDM) offers another scheme for sharing computing resources over time, but the asynchronous and stochastic nature of SNNs is not likely to benefit from this technique. Using WDM, signals from each neuron can be routed according to wavelength and resources for matrix multiplication may potentially be used for multiple independent operations to support weight sharing and convolution. To support such architectures, different neurons must be distinguishable by output wavelength. However, the system does not need a unique wavelength for each neuron since most SNN architectures group neurons into layers that provide an additional level of hierarchy for routing structures.

Using a WDM approach, arrayed waveguide grating routers (AWGR) can be used to support all-to-all routing schemes between neural layers [31, 32, 33]. Inputs to each layer would be passed through reconfigurable optical matrix multipliers such as cross-bar networks, micro-ring resonator (MRR) banks, and mach-zehnder interferometry (MZI) meshes. MZI meshes can perform unitary matrix transformations that correspond to lossless multiplication and are thus particularly suitable for low-power neuromorphic computing. See [34] for a longer discussion on the design trade-offs between each of these devices. Sec. III-B describes our MZI mesh architecture, while Sec.III-C details algorithms for training SNNs using MZI meshes.

III Scalable Photonic SNN Technologies

III-A Towards Attojoule nanophotonic-electronic spiking neurons

Neurons provide nonlinearity and signal regeneration between each neural network layer. Our previous work [35] presents an optoelectronic neuron design with projected energy efficiency on the order of $200\,aJ/\text{spike}$ . Because the time-scales of electrical circuits are more tunable than photonic nonlinear materials, the neuron is more easily programmable while still taking advantage of low-loss communication provided by photonic interconnects. This design closely matches the behavioral characteristics of the Izhikevich neuron model to achieve a variety of neural behaviors. To move a step forward in realizing attojoule energy efficiencies, we have updated this design on a more advanced foundry platform.

Refer to caption — Figure 1: The circuit diagram of 45SPCLO neuron design. The circuit mechanism of optoelectronic neuron start with converting light input to current. The membrane potential control section will decide the neuron threshold, feedback strength to refractory feedback potential control section and send the light out from laser diode. The feedback potential control decides the refractory strength and the frequency of spiking.

Our previous foundry neuron design [35] also employs optoelectronics and a scalable MZI interconnect mesh, however, this design is not capable of the full range of neural behaviors described by the Izhikevich model. Using the GlobalFoundries (GF) 45SPCLO PDK, a new neuron was designed that can support a wider range of neural behaviors depending on applied voltage biasing. GF 45SPCLO is the successor of the GF 90WG PDK, and preserves the same CMOS-silicon photonic co-integration with a more advanced process node and additional metal routing layers. Fig. 1 shows the GF 45SPCLO neuron circuit design. The labeled red pins mark voltage biasing nodes that can be adjusted to achieve the desired neuron behavior. These nodes correspond to the control of an adjustable positive bias ( $V_{bias}$ ), spiking threshold ( $V_{th}$ ), refractory feedback rate ( $V_{leak}$ ), and adaptation rate ( $V_{leak2}$ ). The function of these node voltages is divided between membrane potential control and feedback potential control. Membrane potential controls $V_{bias}\;\&\;V_{th}$ adjust the spiking threshold and determine the current flow into membrane potential for each spike input. Feedback potential controls, $V_{leak}\;\&\;V_{leak2}$ , determine the strength of negative feedback on the membrane potential and the length of refractory period. Balanced photodetectors receive excitatory and inhibitory light input. The diode at the circuit output incorporates the I-V characteristics of the laser diode chosen for the design.

To demonstrate this design, we first simulate the basic spiking behavior in response to excitatory and inhibitory inputs simulated in Cadence Spectre (shown in Figure 2). The nodes of each measurement are matched to the color of each line in Figure 1. We include inhibitory inputs on spike #11 and #12 and can confirm from Fig. 2 that inhibitory input suppressed the membrane potential and output, which matches our expectation.

Next, we demonstrate three spiking patterns: regular spiking (RS), fast spiking (FS), and chattering (CH) in analogy to [16]. These behaviors can be achieved flexibly by modifying the voltages at each biasing pin, which allows a greater tolerance for mismatch between design and tapeout. These spiking patterns are shown in Figure 3, Figure 4, and Figure 5 respectively. Input photocurrents are simulated as step functions from $0.0\,mA$ to $0.1\,mA$ , node voltages corresponding to each behavior are set as follows:

1) Regular spiking: bias ( $V_{bias}$ ) low, threshold ( $V_{th}$ ) low, refractory feedback ( $V_{leak}$ ) low, and frequency adaptation ( $V_{leak2}$ ) low.

2) Fast spiking: bias ( $V_{bias}$ ) low, threshold ( $V_{th}$ ) high, refractory feedback ( $V_{leak}$ ) low, and frequency adaptation ( $V_{leak2}$ ) high.

3) Chattering: bias ( $V_{bias}$ ) medium, threshold ( $V_{th}$ ) medium, refractory feedback ( $V_{leak}$ ) high, and frequency adaptation ( $V_{leak2}$ ) high.

These simulations verify the ability of the neuron circuit to achieve various spiking patterns on the more advanced 45SPCLO process.

III-B Photonic MZI Mesh as Synaptic Network

The building block of an MZI Mesh is a 4-port device that consists of two 50:50 beam splitters and two-phase shifters, $\theta$ and $\phi$ as shown in Fig. 6 (b). The phase shifter $\theta$ , inside the interferometer, controls the power splitting ratio. Meanwhile, the phase shifter, $\phi$ , outside of the interferometer, controls the relative phase difference between the two coherent input ports. As demonstrated in Fig 6 (a), the tunable power splitting functionality is tested by sweeping applied DC voltage on the phase shifter $\theta$ . MZI Meshes can be arranged in several ways, with the most popular arrangements being the triangular[36] or rectangular[37] formations. Both of the formations can realize an arbitrary N $\times$ N unitary matrix. There are a variety of applications where MZI Meshes are employed, such as mode-division multiplexing [38], free-space beamforming[39], quantum computing [40], and photonic neural networks [41]. Our work utilizes MZI Meshes as synaptic interconnections for bio-inspired neural networks and aims to integrate learning algorithms on the same chip. Although calibration procedures of the MZI Meshes are well-studied [42], training MZI Meshes as neural network (NN) interconnects remains challenging. Hughes et al. [43] proposed an in-situ training to realize the traditional backpropagation algorithm for MZI meshes, and recently Pai et al. [44] experimentally demonstrated the method. This in-situ training requires additional forward and backward light propagation with power monitoring for each phase shifter element at each step. There are various approaches to monitor power. For example, Pai et al. [44] utilized power tapping and grating couplers with an infrared camera to record the emitted power from MZI Meshes. Alternatively, Morichetti et al. [45] used a non-invasive power sensing device introduced for silicon waveguides. We exploited 1:99 power taps and Ge photodetectors (PDs) which are available as an instance in Process Design Kit (PDK) elements of the active silicon photonic multi-project-wafer (MPW) runs from the AIM Photonic foundry. Fig. 6 (a) shows photocurrent changes on the monitoring PDs with respect to applied voltage on phase-shifter $\theta$ . Although we used thermo-optics as a simple and practical phase-shifting mechanism, it is possible to utilize micro-electro-mechanical systems (MEMS) for even lower power consumption [46] in future designs.

Fig. 7 (b) shows the fabricated and tested 6 $\times$ 6 rectangular MZI Mesh with power taps after each 2 $\times$ 2 MZI unit as shown in Fig. 6 (b). At each output waveguide of the 6 $\times$ 6 mesh, a micro ring resonator (MRR) add-drop filter is placed with a PD on the drop ports, allowing for output monitoring by either optical or electrical means. When the MRR is at resonance, the output can be monitored and accessed through the electrical interface during the training. Alternatively, the MRR resonance wavelength can be tuned to let the optical signal propagate after the MZI Mesh. In this way, multiple MZI Mesh layers can be cascaded for DNN-like implementation. All the components are available in AIM Photonic’s PDK v4.0. The device is wirebonded on a fanout printed-circuit board. A USB interfaced multi-channel high current output digital to analog converter (DAC) unit drives the thermo-optic heaters and MRR add-drop filters. Similarly, the photocurrents are digitized by a USB interfaced multi-channel 250kSps analog-to-digital converter (ADC), as shown in Fig. 7 (a).

III-C Training and Inference

For the on-chip training demonstration, we targeted a linear classification problem with 4-dimensional input vectors and two output classes. We used the Iris flower dataset [47], consisting of 3 classes and 150 input samples. For simplicity in the proof-of-principle demonstration, we excluded one of the classes that is linearly separable from the other two classes. Therefore, a linear regression classifier can achieve a maximum of 94 true classifications over 100 samples. We use a single-mode cleaved fiber to couple a CW tunable laser source operating at $1553.7\,nm$ to the chip. After the edge coupler, the first three 2 $\times$ 2 MZI stages act as tunable beam splitters and were used to generate coherent input vectors. First, the input generator phase-shifters are optimized adaptively to create desired 100 input samples. Next, optimum voltage values are recorded in a look-up table (LUT) to recall in the training and interference cycles.

One of the challenges of using MZI Meshes as a synaptic weight matrix is that controllable variables (phase shifters) do not explicitly map to individual weight matrix entries. In other words, adjusting a single phase shifter will affect multiple weight matrix entries. Clements et al. [37] devised a decomposition method for rectangular meshes. In machine learning, however, the optimum weight matrix is unknown at the beginning of training and the additional resources for continual adjustment and decomposition become intractable. Hughes et al. [43] demonstrated a method of differentiating the weight matrix w.r.t. each phase shifter. However, this method requires two optical propagation steps in addition to the initial inference step: one forward, and one backward. Therefore, an external controller is required to schedule each propagation, and light sources must be bidirectional. Moreover, during the additional optical propagation steps, power must be monitored for every phase shifter element. The number of phase shifters in the MZI mesh scales as $N(N-1)$ for N $\times$ N weight matrices, meaning $\mathcal{O}$ ( $N^{2}$ ) power monitoring is required. This presents remaining challenges for scalability in deep neural networks.

Here, we looked for more hardware-friendly solutions and, taking inspiration from biology, explored random backpropagation (RBP) and contrastive Hebbian learning (CHL) for MZI Meshes. In Section III-C1 we present an experimental demonstration of random backpropagation training for a linear classification task; Section III-C2 discusses the CHL algorithm and its relevance to human-like predictive error-driven learning.

III-C1 Random Backpropagation

In RBP, global error is backpropagated electrically from the end of the network. As such, RBP does not require optical backpropagation or power monitoring for each individual 2 $\times$ 2 MZI unit. An important difference between conventional BP and RBP is the direction of the gradient. BP follows the steepest gradient direction, which requires error to multiply the conjugate transpose of the forward weight matrix. In a digital computer, these forward weights are available in the memory unit, but for MZI Meshes, optical light would be physically backpropagated as discussed earlier. The original researchers demonstrated that a random backward weight matrix could also guarantee learning unless random backward weights are exactly orthogonal to the steepest backward weights [48]. Further, neuroscience studies observed that backward synaptic connections of neural networks in mammals are not fully symmetric [49, 50] giving biological credibility to the RBP algorithm. Direct feedback alignment, a variant of RBP, has also been demonstrated for MRR-based photonic weight matrices [51]. Given that tunable elements in the MRR bank have a one-to-one mapping with the synaptic weight matrix, it is computationally easier to calculate steepest gradient direction. Therefore, RBP can be more useful for MZI mesh training where this mapping is non-trivial. Nonetheless, MZI meshes are preferred for their ability to perform lossless matrix multiplication.

Appendix A summarizes our method of applying RBP on a SiPh MZI Mesh, while an illustration of our experimental setup is shown in Fig. 7 (a). The multiplication, addition, comparator, and memory buffer operations are realized in an external computer through Python scripts. Unlike conventional RBP, we draw a new random backward matrix for each iteration where the error is larger than the previous. With this modification, we empirically observed faster convergence to the classifier’s highest accuracy and the ability escape local minimums, as seen in the course search of Fig. 8 (a). Note, however, this additional operation may not be necessary for a network with a larger number of parameters and multiple synaptic layers. For example, in the papers [48, 49, 50, 51] the authors use fixed random backward weights. Future efforts will involve the real-time implementation of these operations by integrated electronic circuits within the mesh.

Fig. 8 (a) shows the interference accuracy of the SiPh MZI Mesh classifier for each epoch. In each epoch, 100 samples are forward propagated once. We use i.i.d. random backward weights uniformly distributed in the interval $[-\mu,\mu]$ . During the coarse search cycle ( $\mu=0.05$ ), the classifier searches different local minimums, and after some epochs, the interference accuracy decreases due to large variance on random weights. We defined an accuracy limit (85 true labels among 100 samples) and switched to the fine search cycle ( $\mu=0.0025$ ) when the limit was reached. As seen in Fig. 8 (a), the coarse search cycle ended when the classifier labeled 89 samples correctly, and in the fine search cycle, 92 true labels were achieved. The confusion matrix for the ideal linear regression classifier and SiPh MZI Mesh classifier are presented in Fig. 8 (b). The SiPh MZI Mesh misclassified only two samples compared to the ordinary least squares linear regression model we built in the computer via scikit-learn Python package. We also implemented a numerical simulation for the MZI Meshes on the computer. From the simulation results, we observed that the SiPh MZI Meshes achieve the same accuracy as the linear regression model. Therefore, we concluded that reason for the misclassification of two input samples related to hardware imprecisions such as noise on the output PDs, electrical wires, thermal crosstalk between the phase shifters, etc.

Intuitively, traditional BP outperforms RBP in terms of convergence speed due to the steepest gradient direction. However, RBP is more hardware-friendly given that forward weights are unavailable and since phase shifter-to-weight mapping is not explicit in the MZI Meshes. Because the steepest direction for the gradient is not calculated, RBP does not require any power monitoring inside the MZI Meshes except for the input and output stages. Therefore, the PDs can scale with $\mathcal{O}$ ( $N$ ) for N $\times$ N weight matrices. In the future, we plan to study RBP for larger SiPh MZI Meshes and more complex machine learning problems.

III-C2 Contrastive Hebbian Learning

In contrast to backpropagation where learning is based on credit towards global error, learning in biological systems is restricted to information local to a given synapse. Despite this, biological neural networks are able to autonomously develop expansive hierarchical abstractions of information useful for interpreting the environment. This represents a form of self-supervised learning that needs no explicit calculation of error, but instead relies on chemical signals marking recent spiking activity local to a synapse.

O’Reilly [13] proved that differences in activity at two distinct phases of network computation can drive a class of temporal-difference learning rules that is equivalent to backpropagation and gradient descent of errors. This equivalence, however, only holds for a multi-layer perceptron (MLP) with recurrent feedback connections between each layer as in Fig. 9 (a). The general learning rule has minor variations which have different properties, though an empirical test under common MLP tasks showed that the CHL variant often converges to a solution most quickly:

\Delta w_{ij}=\eta(a_{i}^{+}a_{j}^{+}-a_{i}^{-}a_{j}^{-})

(1)

where $a_{i}$ and $a_{j}$ are variables representing the activity of the $i$ th and $j$ th neuron, and $\eta$ dictates the rate of learning.

Superscripts denote the phase of activity that each variable represents. The minus phase of execution occurs first, and represents the network’s natural response to the given input sample. Next, in the plus phase, the target activity is imposed on the output layer and the network reaches a new equilibrium. For fastest implementation, the duration of each phase should be the minimum time required for stable output activity. Taking the product of activity of sending and receiving neurons roughly tracks their correlations during each phase. Taking the difference of this correlation in each phase forces the network to unlearn its natural response and learn the desired target activity. In a spiking network, activity in these phases can be represented by low pass filters of spike trains; however, non-spiking activity can be assumed to approximate a rate-coding of spiking activity that fits some non-linear activation function. Unlike backpropagation, however, the network architecture requires bidirectional synaptic connectivity (as shown between layer 1 and 2 of Fig. 9 (a)) such that information propagates in both directions. Because each neuron is asynchronous, recurrence does not increase computational complexity as it does on traditional computer architectures. Additionally, the locality of learning and agnosticism to the neuron nonlinearity is advantageous for spiking neuromorphic hardware.

Following the two-layer network structure depicted in Fig. 9 (a), we simulated an implementation of CHL on an ideal MZI-mesh neural network. A set of 40 input-output pairs were generated from randomly-distributed, uniform-magnitude, four-dimensional vectors. Each layer was simulated with four rate-coded neurons with a sigmoidal activation function; as such, each MZI mesh was simulated as a $4\times 4$ rectangular mesh. As in Fig. 6 (b), it is assumed that each MZI unit of each mesh contains four PDs for input and output monitoring. For simplicity, it is assumed that each neuron injects light to the mesh on a separate wavelength, and that the PD capacitance is large enough to reject the cross-term products between signals. Thus, the PD is assumed to linearly sum the power received from each wavelength. Because CHL assumes real-valued activation, phase shifter $\phi$ is neglected such that phase of each signal can be ignored. Following these assumptions, each MZI unit can be treated as a $2\times 2$ sub-network that applies the following transformation to signal amplitude at each arm:

\boldsymbol{W}=\left[\begin{matrix}w_{11}&w_{12}\\ w_{21}&w_{22}\\ \end{matrix}\right]=\left[\begin{matrix}\text{sin}(\theta/2)&\text{cos}(\theta/2)\\ \text{cos}(\theta/2)&-\text{sin}(\theta/2)\\ \end{matrix}\right]

(2)

Given that CHL is agnostic to the neural nonlinearity, Eq. 1 can be directly applied to the photodetector outputs as long as they are measured correctly at the plus and minus phase. However, as seen in Eq. 2, the MZI mesh is not able to implement any arbitrary matrix. To resolve this, we can calculate derivatives that relate how a change in $\theta$ affects each individual weight. Next, we average the contribution from each $\Delta w_{ij}$ to estimate the best overall change:

\Delta\theta=\frac{1}{4}\sum_{i,j}\left[\left(\frac{dw_{ij}}{d\theta}\right)^{-1}\Delta w_{ij}\right]

(3)

Note, we use $\left(dw_{ij}/d\theta\right)^{-1}$ because it is simpler to calculate than $d\theta/dw_{ij}$ . Assuming that the plus and minus activity of each detector is recorded locally, this rule can be applied to every MZI in each mesh all at once. Fig. 10 shows the root-mean-squared-error (RMSE) over each epoch for the aforementioned two-layer $4\times 4$ network, along with an ideal implementation (direct application of Eq. 1) and implementation with randomly selected $\Delta w_{ij}$ . Learning is applied after each sample (not batched) with $500$ epochs of training and a learning rate, $\eta=0.1$ . Each implementation is initialized to the same starting matrices.

Our MZI implementation of CHL showed an $11.21\%$ decrease in RMSE over the course of training which is indicative of learning. However, the ideal implementation showed a significantly larger decrease of RMSE at $46.53\%$ . For comparison, the randomly varying network shows an increase of $28.87\%$ in RMSE, giving more credibility to the idea that the MZI-CHL implementation is capable of learning—albeit at a much slower rate than the ideal implementation. It is clear from the stochastic nature of RMSE in Fig 10 that this implementation is prone to local minimums and instability. Nonetheless, this simple simulation illustrates the ability of the CHL rule to train local synaptic weights without regard for the other connections in the synaptic mesh and provides proof of concept for its use in MZI meshes. Additionally, the MZI mesh is restricted to unitary matrices which preserve magnitude of the input vector (before neural nonlinearity). In contrast, the ideal implementation allows independent gain and attenuation of each weight in the synaptic network. More work is needed to determine strategies for mitigating these restrictions and characterizing learning with more bio-realistic neural nonlinearities.

III-C3 Predictive Error-Driven Learning

In the biology, however, target signals can only come from the network’s own activity in response to its observations; even in the case of instructed learning, a biological brain must interpret perceptual stimuli (i.e. auditory and visual) and transform them into intelligible target signals for training. More recent work by O’Reilly et al. [52] has shown that the human brain may generate its own target training signals through cortico-thalamo loops which constantly undergo phases of prediction and observation to reduce future errors in prediction. O’Reilly et al. postulate that the alpha cycle ( $\approx 10$ Hz) in the human brain demarcates iterations of such predictive error-driven learning, where plus and minus phases are separated by a bursting skip connection between primary processing regions and prediction-carrying regions (shown as the dotted connection in Fig. 9 (b)) that fire with a $25\%$ duty-cycle within the alpha rhythm; a simplified diagram of this neural network architecture can be seen in Fig. 9 (b). Over many iterations of such prediction and observation, abstract representations can be learned that are capable of transformation-invariant object-recognition[52]. The learning rule in this model is more complicated than CHL to include additional biologically relevant terms, though the error-driven learning is captured sufficiently by the simpler rule.

Bursting is important to enforce the 25% duty-cycle and thus generate activity differences between the plus and minus phases of the CHL rule. The skip connection between first-order processing and the prediction layers allows the representation of the latter to more accurately match the ground-truth observation in the plus phase. Thus, without an explicit target signal, the network learns to better predict future inputs. Because subsequent inputs are governed by causality and are constantly occurring, the network is also constantly learning to better understand its environment. This structure can even be repeated for higher-order processing layers to hierarchically form even deeper, more abstract predictions of the input space. Future work is needed to identify an optimal implementation of the CHL rule within the MZI mesh structure and subsequently employ this style of self-supervised learning.

IV Perspectives and Future Directions

IV-A Our Future System and Benchmarking

The nanophotonic-electronic spiking neuron is composed of three main components: a photodetector, a nonlinear electrical circuit, and a laser. The photodetector receives information from the synaptic network and converts the optical signal to an electrical signal. The electrical circuit is the core of the neuron and processes the inputs to generate output spike responses. The laser output regenerates signal power after each layer to supply synaptic fanout to subsequent layers. Our team will exploit attojoule photonics with quantum impedance conversion [53] and closely integrate with low-capacitance ( $<1\,fF$ ) electronics for monolithic integration on a silicon-on-insulator (SOI) platform. Using photonic communication between each SNN layer reduces capacitive charge associated with the interconnect wires [54] in comparable electronic circuits. Additionally, the photonic platform can allow neurons to communicate with other neurons at high speeds ( $\sim$ 10 GHz) independently of communication distance.

To calculate the projected energy consumption, we can examine the composition of each component in the attojoule nanophotonic-electronic spiking neuron design. The dynamic energy cost of the nonlinear electronic circuit and laser can be calculated by examining the transistor on-state currents and associated operation voltages and frequency. Meanwhile the parasitic energy cost can be calculated from the total capacitance and the leakage current. According to our previous work[35], the electrical circuit current flow inside the maximum $10\,GHz$ spiking rate attojoule neuron is expected to be $31.27\,\mu A$ at $1.4\,V$ voltage supply when the neuron is in the ON state, while the leakage current is $10\,nA$ in the OFF state. The expected nanolaser energy consumption is 4.4 fJ per spike for a fanout of $\sim$ 80 [54]. The parasitic capacitance includes the load capacitance on the photodetector, membrane capacitor, and transistor gate capacitance. According to the IRDS2020 [55] and [56], we expect the load capacitance of the photodetector to be around 0.1fF, and the simulated membrane capacitor to be 0.5fF. By considering closely integrated nanoelectronics at 10 fJ/bit energy efficiency and a fanout of 10-100 following the concept outlined by [54], the minimum dynamic input energy to generate a spike output is projected to be 200aJ/spike.

For input, the proposed attojoule neuron design will utilize a low-Q nanophotonic crystal photodector with a Ge/Si cavity. The photonic crystal creates a resonant cavity that increases the confinement of light and reduces the size of the absorption medium [57][58]. This allows for an ultra-low capacitance ( $\sim$ 0.1fF) nano-cavity PD that can generate sufficiently large voltage without amplification when combined with a high-impedance load [59]. In addition, minimizing the electrical wiring between PDs and the nonlinear electronic circuit also reduces power consumption [54]. Similarly for spiking output, a hybrid InAs/AlGaAs quantum-dot nanolaser with photonic crystal cavity can be employed.

Aside from neuron design, the scalability on interconnect is also a critical design challenge. MZI meshes show nearly lossless multiplication that is particularly suitable for large-scale low-power neuromorphic computing. However, the number of tunable elements, $N\cdot(N-1)$ in an $N\times N$ MZI mesh, grows polynomially with the number of neurons in the layer. As such, a control circuit must be designed that scales with a minimal additional computational complexity.

IV-B Footprint Efficiency

In the previous sections, we introduced and experimentally demonstrated bio-inspired on-chip training methods which improve the scalability of the SiPh MZI meshes for synaptic networks. We also simulated optoelectronic spiking neurons in GF 45SPCLO electronic-photonic hybrid platform and envisioned a scalable attojoule nanophotonic-electronic neuron design. However, one handicap of the proposed photonic neuromorphic system remains unaddressed, footprint efficiency. From our experience with commercial SiPh foundries, a 16 $\times$ 16 MZI Meshes occupies a 12.5 $mm^{2}$ chip area. Similarly, Lightmatter introduced their 64 $\times$ 64 SiPh AI accelerator occupying a 150 $mm^{2}$ chip area [60] which incorporates billions of transistors. We propose two solutions, Tensorized Photonic Neural Networks (TPNN) and 3D Electronic-Photonic Integrated Circuits (3D EPICs) to improve footprint efficiency and enable deep and wide photonic neuromorphic systems.

IV-B1 TPNN

There are three main methods to avoid over-parameterized neural networks and relieve hardware requirements such as weight pruning, quantization, and model compression [61]. Because photonic NNs are analog computers, available bit precision is already limited. Unlike electronics, a photonic system can easily offer all-to-all connectivity through wavelength and space-division multiplexing. Therefore, the benefits of weight pruning and quantization approaches are not significant. In contrast, model compression can result in fewer hardware resources and smaller footprints. We proposed and simulated an algorithm-hardware co-design approach: photonic tensorized neural networks [62]. Tensor-Train (TT) decomposition is a multi-dimensional array processing technique to represent large matrices in a low-rank approximation [63]. Although low-rank approximation may cause decreased performance in NNs, one could train NN models in TT-decomposed format so that performance degradation is minimized [64]. For some ML problems, low-rank approximation also serves as a regularization term and improves performance [65]. Moreover, in the simulations [66], we observed that TT-decomposed MZI meshes are more resilient to noise and hardware imprecision. Our simulations and benchmarks demonstrated that TPNN could improve the footprint-energy-efficiency product by $4$ orders of magnitude by using $79\times$ fewer 2 $\times$ 2 MZI units without decreasing accuracy below 95% in image classification tasks [67]. Future work will realize a SiPh end-to-end TPNN system and provide benchmarks for footprint-energy efficiency and performance.

IV-B2 3D EPIC

3D electronic ICs (EIC) promise low energy consumption, low noise, and high density because of shorter electrical wires [68]. The main enabling technology for 3D EICs is through-silicon vias (TSV). Although thermal relief and yield are the challenges, 3D integrated high bandwidth memories show clear advantages compared to 2D EICs. Similarly, 3D electronic-photonic ICs (EPICs) can achieve high density, low loss, and high bandwidth performance. Multi-layer silicon photonic devices are already available in commercial foundries. However, they rely on evanescent vertical couplers, which require relatively long taper lengths ( $\sim 100\,\mu m$ ) and small layer distance ( $\sim 1\,\mu m$ ) [69, 70]. As an alternative, our previous work demonstrates through silicon optical vias (TSOV) [71, 72] for 3D EPICS using 45degree reflectors and silicon vias[72]. Ultrafast laser inscription also allows for freeform shaping of waveguides useful for routing in three dimensions. This technique has already been demonstrated for orbital-angular momentum multiplexing/demultiplexing and optical beam steering applications[73]. 3D EPICs provide devices to be stacked vertically allowing for greater neuron density per area and thus the design of deeper and wider photonic neural networks.

IV-C Applications for SNNs

In relation to AI and machine learning, SNNs provide several advantages over modern computing paradigms for tasks which mimic the conditions in which they naturally evolved. Because SNNs process data over time in a continuous manner, they are well-suited to applications situated in real-time environments with single inference and learning instances presented at a time (such as event-based signal processing [74]). In addition, the spread of information over time allows multiple forms of memory at different time-scales similar to the human distinction between working [75], short-term [76], and long-term memories. Neuromorphic sensing and robotics are a common direction of applications of SNNs; for example, an adaptive robotic arm controller can provide reliable motor control as actuators wear down [77]. More speculatively, future devices might exploit these properties in the context of live audio and natural language processing for voice assistants, live-captioning services, or audio separation; similarly SNNs can be used for live video and lidar processing in autonomous vehicles or surveillance systems. SNNs are not ideal for batched computation—in which multiple training samples are computed in parallel and averaged for parallelism in training—however, data centers may still make use of the increased computational parallelism in tasks like the nearest-neighbor search which can be performs in constant time, $O(1)$ , on neuromorphic chips like Loihi [78].

A major challenge of many modern DNN and reinforcement learning (RL) agents is the development of abstract, transformation-invariant representations of objects relevant to the task. In classification tasks, a neural network must transform its input space into a representation which most clearly separates each labeled class. Similarly, RL agents must be able to process their input space into a representation that best accentuates the value of potential actions. Predictive error-driven learning, modeled after the work of O’Reilly [52], has the potential to autonomously build deep hierarchies of abstraction for a given input space. For example, a learning agent could implicitly learn physical properties of the world such as gravity, buoyancy, and contact forces simply by observing its environment. In combination with complimentary learning systems for memory [79] and RL models based on the basal ganglia [80], a neuromorphic learning agent may be capable of replicating simple navigation and foraging behaviors which require the flexible application of knowledge and memory. Such a model could provide key insights for the development of self-motivated learning agents that exploit hierarchical representations to solve reinforced tasks. Developing dedicated spiking neuromorphic hardware and taking advantage of the energy-efficient and scalable photonic devices will allow the development of larger models and new computational paradigms. These developments can be applied in dynamic, noisy environments that are not well-handled by today’s machine learning efforts.

V Conclusion

We have discussed the advantages of dedicated SNN hardware and highlighted the benefits of nanophotonic-electronic design within this computational paradigm. Additionally, we argued that co-integration of photonic and electronic devices combines the high-bandwidth, low-power communication protocols of photonics with the well-established and flexible CMOS circuitry. Towards the construction of a photonic SNN computing architecture, we demonstrated an Izhikevich-inspired optoelectronic neuron design, implemented RPB on an MZI mesh, and simulated CHL on a rate-coded, MZI-mesh neural network. In addition, we proposed the construction of a powerful self-learning SNN computing architecture built from these technologies and based on predictive error-driven learning models of the human brain. Subsequently, we have discussed technologies for improving the scalability of neuron and network density through tensorization of large neural networks and 3D electronic-photonic integration. Finally, we discussed perspectives on the suitable applications of photonic SNNs and emphasized applications of interest for our own efforts.

Future work is needed to establish the optimal design for brain-inspired spiking networks. Modern ANNs have oversimplified neural nonlinearities due to the limitations of the von Neumann computing architecture. Meanwhile, the heterogeneity of neural behaviors in different regions of the human brain provide various methods of encoding information. As such, a deeper exploration of these encodings is warranted to fully leverage the computing power of SNNs. Furthermore, modern learning algorithms are designed for sequential processing that is not ideal for SNNs hardware. As such, considerable work is necessary to determine the most efficient on-chip implementation of local learning rules like CHL. Nonetheless, the design challenges are well worth the effort to provide alternative routes for continued advances in computation and signal processing in the face of slowing progress of transistor scaling. Our continued work will focus on the characterization and design of nanophotonic-electronic spiking neurons and their incorporation within scalable, MZI-based neural networks capable of on-chip local learning.

Acknowledgment

This work was funded in part by the Air Force Office of Scientific Research grant FA9550-181-1-0186.

This research is based upon work supported in part by the Office of the Director of National Intelligence (ODNI), Intelligence Advanced Research Projects Activity (IARPA), via [2021-21090200004]. The views and conclusions contained herein are those of the authors and should not be interpreted as necessarily representing the official policies, either expressed or implied, of ODNI, IARPA, or the U.S. Government. The U.S. Government is authorized to reproduce and distribute reprints for governmental purposes notwithstanding any copyright annotation therein.

The authors would like to thank GLOBALFOUNDRIES for providing silicon fabrication through the 90WG university program and for their technical assistance in 45SPCLO MPW runs.

Appendix A RBP Algorithm

Algorithm 1 Random Backprop on SiPh MZI Mesh

1:Initialize resistor values

\mathbf{R}

, accuracy limit

L

, total number of samples

N

, MZI voltages

\mathbf{v}^{-1}_{MZI}

, error

\mathbf{e}^{-1}=\infty

, coarse and fine step sizes

\mu_{c},\mu_{f}

, start with coarse search

\mu\leftarrow\mu_{c}

, , random backprop weights

\mathbf{B}\sim[-\mu,\mu]

2:for Every epoch do

3: for

k=0

through

N

4: Find input generator voltages

\mathbf{v}^{k}_{in}

for

\mathbf{x}^{k}

LUT

5: Read input generator’s PDs to verify

\mathbf{x}^{k}

6: Read output PDs

\mathbf{v}_{out}

7: Calculate photocurrent

\mathbf{i}_{out}=(v_{dd}-\mathbf{v}_{out})/\mathbf{R}

8: Normalize

\mathbf{i}_{out}

to calculate

\mathbf{\hat{y}}^{k}

9: Calculate error

\mathbf{e}^{k}=|\mathbf{\hat{y}}^{k}-\mathbf{y}^{k}|^{2}

10: if

\mathbf{e}^{k}>\mathbf{e}^{k-1}

then

11: Draw a new

\mathbf{B}\sim[-\mu,\mu]

12: end if

13: Update

\mathbf{v}^{k}_{MZI}\leftarrow\mathbf{v}^{k-1}_{MZI}+\mathbf{B}\mathbf{e}^{k}

14: end for

15: Calculate interference accuracy

a

16: for Every sample

\mathbf{x}^{k}

17: Find input generator voltages

\mathbf{v}^{k}_{in}

for

\mathbf{x}^{k}

LUT

18: Read input generator’s PDs to verify

\mathbf{x}^{k}

19: Read output PDs

\mathbf{v}_{out}

20: Calculate photocurrent

\mathbf{i}_{out}=(v_{dd}-\mathbf{v}_{out})/\mathbf{R}

21: Decide class label

\hat{l}^{k}=\underset{n}{\arg\max}\ {i}_{out}[n]

22: end for

23:

a=sum(\mathbf{\hat{l}}==\mathbf{l})

24: if

a\geq L

then

25: Switch to fine search

\mu\leftarrow\mu_{f}

26: end if

27:end for

References

[1] P. A. Merolla, J. V. Arthur, R. Alvarez-Icaza, A. S. Cassidy, J. Sawada, F. Akopyan, B. L. Jackson, N. Imam, C. Guo, Y. Nakamura, B. Brezzo, I. Vo, S. K. Esser, R. Appuswamy, B. Taba, A. Amir, M. D. Flickner, W. P. Risk, R. Manohar, and D. S. Modha, “A million spiking-neuron integrated circuit with a scalable communication network and interface,” Science, vol. 345, no. 6197, pp. 668 LP–673, 8 2014. http://science.sciencemag.org/content/345/6197/668.abstracthttps://www.science.orghttp://science.sciencemag.org/
[2] M. Davies, N. Srinivasa, T. H. Lin, G. Chinya, Y. Cao, S. H. Choday, G. Dimou, P. Joshi, N. Imam, S. Jain, Y. Liao, C. K. Lin, A. Lines, R. Liu, D. Mathaikutty, S. McCoy, A. Paul, J. Tse, G. Venkataramanan, Y. H. Weng, A. Wild, Y. Yang, and H. Wang, “Loihi: A Neuromorphic Manycore Processor with On-Chip Learning,” IEEE Micro, vol. 38, no. 1, pp. 82–99, 1 2018.
[3] E. Painkras, L. A. Plana, J. Garside, S. Temple, S. Davidson, J. Pepper, D. Clark, C. Patterson, and S. Furber, “SpiNNaker: A multi-core system-on-chip for massively-parallel neural net simulation,” Proceedings of the Custom Integrated Circuits Conference, 2012.
[4] A. Mehrabian, Y. Al-Kabani, V. J. Sorger, and T. El-Ghazawi, “PCNNA: A Photonic Convolutional Neural Network Accelerator,” in 2018 31st IEEE International System-on-Chip Conference (SOCC), vol. 2018-Septe. IEEE, 9 2018, pp. 169–173. https://ieeexplore.ieee.org/document/8618542/
[5] X. Xu, M. Tan, B. Corcoran, J. Wu, A. Boes, T. G. Nguyen, S. T. Chu, B. E. Little, D. G. Hicks, R. Morandotti, A. Mitchell, and D. J. Moss, “11 TOPS photonic convolutional accelerator for optical neural networks,” Nature, vol. 589, no. 7840, pp. 44–51, 2021. https://doi.org/10.1038/s41586-020-03063-0
[6] J. Feldmann, N. Youngblood, M. Karpov, H. Gehring, X. Li, M. Stappers, M. Le Gallo, X. Fu, A. Lukashchuk, A. S. Raja, J. Liu, C. D. Wright, A. Sebastian, T. J. Kippenberg, W. H. P. Pernice, and H. Bhaskaran, “Parallel convolutional processing using an integrated photonic tensor core,” Nature 2020 589:7840, vol. 589, no. 7840, pp. 52–58, 1 2021. https://www.nature.com/articles/s41586-020-03070-1
[7] C.-N. Chou, K.-M. Chung, and C.-J. Lu, “On the Algorithmic Power of Spiking Neural Networks,” Leibniz International Proceedings in Informatics, LIPIcs, vol. 124, 3 2018. https://arxiv.org/abs/1803.10375v2
[8] S. J. Verzi, F. Rothganger, O. D. Parekh, T. T. Quach, N. E. Miner, C. M. Vineyard, C. D. James, and J. B. Aimone, “Computing with Spikes: The Advantage of Fine-Grained Timing,” Neural Computation, vol. 30, no. 10, pp. 2660–2690, 10 2018. https://direct.mit.edu/neco/article/30/10/2660/8414/Computing-with-Spikes-The-Advantage-of-Fine
[9] J. Kwisthout and N. Donselaar, “On the computational power and complexity of Spiking Neural Networks,” ACM International Conference Proceeding Series, vol. 17, 3 2020. https://www.economist.com/technology-quarterly/2016-03-12/after-moores-law.
[10] J. B. Aimone, Y. Ho, O. Parekh, C. A. Phillips, A. Pinar, W. Severa, and Y. Wang, “Provable advantages for graph algorithms in spiking neural networks,” in Annual ACM Symposium on Parallelism in Algorithms and Architectures, 2021.
[11] W. Gerstner and W. M. Kistler, “Mathematical formulations of Hebbian learning,” Biological Cybernetics 2002 87:5, vol. 87, no. 5, pp. 404–415, 2002. https://link.springer.com/article/10.1007/s00422-002-0353-y
[12] N. Caporale and Y. Dan, “Spike Timing–Dependent Plasticity: A Hebbian Learning Rule,” http://dx.doi.org/10.1146/annurev.neuro.31.060407.125639, vol. 31, pp. 25–46, 6 2008. https://www.annualreviews.org/doi/abs/10.1146/annurev.neuro.31.060407.125639
[13] R. C. O’Reilly, “Biologically Plausible Error-Driven Learning Using Local Activation Differences: The Generalized Recirculation Algorithm,” Neural Computation, vol. 8, no. 5, pp. 895–938, 7 1996. https://doi.org/10.1162/neco.1996.8.5.895
[14] G. Amato, F. Carrara, F. Falchi, C. Gennaro, and G. Lagani, “Hebbian learning meets deep convolutional neural networks,” Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), vol. 11751 LNCS, pp. 324–334, 2019. https://link.springer.com/chapter/10.1007/978-3-030-30642-7_29
[15] E. M. Izhikevich, “Neural Excitability, Spiking and Bursting,” International Journal of Bifurcation and Chaos, vol. 10, no. 06, pp. 1171–1266, 6 2000. https://www.worldscientific.com/doi/abs/10.1142/S0218127400000840
[16] ——, “Simple model of spiking neurons,” IEEE Transactions on Neural Networks, vol. 14, no. 6, pp. 1569–1572, 11 2003.
[17] ——, “Which model to use for cortical spiking neurons?” IEEE Transactions on Neural Networks, vol. 15, no. 5, pp. 1063–1070, 9 2004.
[18] M. Steriade, “Neocortical cell classes are flexible entities,” Nature Reviews Neuroscience 2004 5:2, vol. 5, no. 2, pp. 121–134, 2004. https://www.nature.com/articles/nrn1325
[19] M. Giudici, C. Green, G. Giacomelli, U. Nespolo, and J. R. Tredicce, “Andronov bifurcation and excitability in semiconductor lasers with optical feedback,” Physical Review E, vol. 55, no. 6, p. 6414, 6 1997. https://journals.aps.org/pre/abstract/10.1103/PhysRevE.55.6414
[20] W. Coomans, L. Gelens, S. Beri, J. Danckaert, and G. Van Der Sande, “Solitary and coupled semiconductor ring lasers as optical spiking neurons,” Physical Review E - Statistical, Nonlinear, and Soft Matter Physics, vol. 84, no. 3, p. 036209, 9 2011. https://journals.aps.org/pre/abstract/10.1103/PhysRevE.84.036209
[21] M. Brunstein, A. M. Yacomotti, I. Sagnes, F. Raineri, L. Bigot, and A. Levenson, “Excitability and self-pulsing in a photonic crystal nanocavity,” Physical Review A - Atomic, Molecular, and Optical Physics, vol. 85, no. 3, p. 031803, 3 2012. https://journals.aps.org/pra/abstract/10.1103/PhysRevA.85.031803
[22] J. Dambre, K. Alexander, M. Fiers, P. Mechet, P. Bienstman, and T. V. Vaerenbergh, “Excitability in optically injected microdisk lasers with phase controlled excitatory and inhibitory response,” Optics Express, Vol. 21, Issue 22, pp. 26182-26191, vol. 21, no. 22, pp. 26 182–26 191, 11 2013. https://opg.optica.org/viewmedia.cfm?uri=oe-21-22-26182&seq=0&html=truehttps://opg.optica.org/abstract.cfm?uri=oe-21-22-26182https://opg.optica.org/oe/abstract.cfm?uri=oe-21-22-26182
[23] B. Garbin, B. Kelleher, D. Goulding, G. Huyet, S. Barland, and S. P. Hegarty, “Incoherent optical triggering of excitable pulses in an injection-locked semiconductor laser,” Optics Letters, Vol. 39, Issue 5, pp. 1254-1257, vol. 39, no. 5, pp. 1254–1257, 3 2014. https://opg.optica.org/viewmedia.cfm?uri=ol-39-5-1254&seq=0&html=truehttps://opg.optica.org/abstract.cfm?uri=ol-39-5-1254https://opg.optica.org/ol/abstract.cfm?uri=ol-39-5-1254
[24] M. A. Nahmias, B. J. Shastri, A. N. Tait, and P. R. Prucnal, “A leaky integrate-and-fire laser neuron for ultrafast cognitive computing,” IEEE Journal on Selected Topics in Quantum Electronics, vol. 19, no. 5, 2013.
[25] F. Selmi, R. Braive, G. Beaudoin, I. Sagnes, R. Kuszelewicz, and S. Barbay, “Relative refractory period in an excitable semiconductor laser,” Physical Review Letters, vol. 112, no. 18, p. 183902, 5 2014. https://journals.aps.org/prl/abstract/10.1103/PhysRevLett.112.183902
[26] A. Hurtado and J. Javaloyes, “Controllable spiking patterns in long-wavelength vertical cavity surface emitting lasers for neuromorphic photonics systems,” Applied Physics Letters, vol. 107, no. 24, p. 241103, 12 2015. https://aip.scitation.org/doi/abs/10.1063/1.4937730
[27] F. Selmi, G. Beaudoin, I. Sagnes, R. Braive, R. Kuszelewicz, and S. Barbay, “Temporal summation in a neuromimetic micropillar laser,” Optics Letters, Vol. 40, Issue 23, pp. 5690-5693, vol. 40, no. 23, pp. 5690–5693, 12 2015. https://opg.optica.org/viewmedia.cfm?uri=ol-40-23-5690&seq=0&html=truehttps://opg.optica.org/abstract.cfm?uri=ol-40-23-5690https://opg.optica.org/ol/abstract.cfm?uri=ol-40-23-5690
[28] B. Romeira, C. N. Ironside, J. M. L. Figueiredo, J. Javaloyes, O. Piro, and S. Balle, “Excitability and optical pulse generation in semiconductor lasers driven by resonant tunneling diode photo-detectors,” Optics Express, Vol. 21, Issue 18, pp. 20931-20940, vol. 21, no. 18, pp. 20 931–20 940, 9 2013. https://opg.optica.org/viewmedia.cfm?uri=oe-21-18-20931&seq=0&html=truehttps://opg.optica.org/abstract.cfm?uri=oe-21-18-20931https://opg.optica.org/oe/abstract.cfm?uri=oe-21-18-20931
[29] A. N. Tait, B. J. Shastri, M. A. Nahmias, P. R. Prucnal, and T. F. d. Lima, “Excitable laser processing network node in hybrid silicon: analysis and simulation,” Optics Express, Vol. 23, Issue 20, pp. 26800-26813, vol. 23, no. 20, pp. 26 800–26 813, 10 2015. https://opg.optica.org/viewmedia.cfm?uri=oe-23-20-26800&seq=0&html=truehttps://opg.optica.org/abstract.cfm?uri=oe-23-20-26800https://opg.optica.org/oe/abstract.cfm?uri=oe-23-20-26800
[30] ——, “Recent progress in semiconductor excitable lasers for photonic spike processing,” Advances in Optics and Photonics, Vol. 8, Issue 2, pp. 228-299, vol. 8, no. 2, pp. 228–299, 6 2016. https://opg.optica.org/viewmedia.cfm?uri=aop-8-2-228&seq=0&html=truehttps://opg.optica.org/abstract.cfm?uri=aop-8-2-228https://opg.optica.org/aop/abstract.cfm?uri=aop-8-2-228
[31] C. Mitsolidou, G. Dabos, G. T. Kanellos, J. V. Campenhout, N. Pleros, P. D. Heyn, R. Broeke, S. Pitris, and T. Alexoudi, “Silicon photonic 8 × 8 cyclic Arrayed Waveguide Grating Router for O-band on-chip communication,” Optics Express, Vol. 26, Issue 5, pp. 6276-6284, vol. 26, no. 5, pp. 6276–6284, 3 2018. https://opg.optica.org/viewmedia.cfm?uri=oe-26-5-6276&seq=0&html=truehttps://opg.optica.org/abstract.cfm?uri=oe-26-5-6276https://opg.optica.org/oe/abstract.cfm?uri=oe-26-5-6276
[32] Y. Zhang, X. Xiao, K. Zhang, S. Li, A. Samanta, Y. Zhang, K. Shang, R. Proietti, K. Okamoto, and S. J. Ben Yoo, “Foundry-Enabled Scalable All-to-All Optical Interconnects Using Silicon Nitride Arrayed Waveguide Router Interposers and Silicon Photonic Transceivers,” IEEE Journal of Selected Topics in Quantum Electronics, vol. 25, no. 5, 9 2019.
[33] X. Xiao, R. Proietti, S. Werner, P. Fotouhi, and S. J. Yoo, “Flex-LIONS: A Scalable Silicon Photonic Bandwidth-Reconfigurable Optical Switch Fabric,” OECC/PSC 2019 - 24th OptoElectronics and Communications Conference/International Conference Photonics in Switching and Computing 2019, 7 2019.
[34] L. E. Srouji, A. Krishnan, R. Ravichandran, . A. Krishnan, Y. Lee, M. On, X. Xiao, and S. J. B. Yoo, “Photonic and optoelectronic neuromorphic computing,” APL Photonics, vol. 7, no. 5, p. 051101, 5 2022. https://aip.scitation.org/doi/abs/10.1063/5.0072090
[35] Y.-J. Lee, M. B. On, X. Xiao, R. Proietti, and S. J. B. Yoo, “Photonic spiking neural networks with event-driven femtojoule optoelectronic neurons based on Izhikevich-inspired model,” Opt. Express, vol. 30, no. 11, pp. 19 360–19 389, 5 2022. http://opg.optica.org/oe/abstract.cfm?URI=oe-30-11-19360
[36] M. Reck, A. Zeilinger, H. J. Bernstein, and P. Bertani, “Experimental realization of any discrete unitary operator,” Physical Review Letters, vol. 73, no. 1, pp. 58–61, 7 1994. https://journals.aps.org/prl/abstract/10.1103/PhysRevLett.73.58
[37] W. R. Clements, P. C. Humphreys, B. J. Metcalf, W. S. Kolthammer, and I. A. Walmsley, “Optimal design for universal multiport interferometers,” Optica, vol. 3, no. 12, pp. 1460–1465, 2016.
[38] K. Choutagunta, I. Roberts, D. A. Miller, and J. M. Kahn, “Adapting Mach-Zehnder Mesh Equalizers in Direct-Detection Mode-Division-Multiplexed Links,” Journal of Lightwave Technology, vol. 38, no. 4, pp. 723–735, 2 2020.
[39] M. Milanizadeh, S. SeyedinNavadeh, F. Zanetto, V. Grimaldi, C. De Vita, C. Klitis, M. Sorel, G. Ferrari, D. A. B. Miller, A. Melloni, and F. Morichetti, “Multibeam Free Space Optics Receiver Enabled by a Programmable Photonic Mesh,” 12 2021. https://arxiv.org/abs/2112.13644
[40] X. Qiang, X. Zhou, J. Wang, C. M. Wilkes, T. Loke, S. O’Gara, L. Kling, G. D. Marshall, R. Santagati, T. C. Ralph, J. B. Wang, J. L. O’Brien, M. G. Thompson, and J. C. Matthews, “Large-scale silicon quantum photonics implementing arbitrary two-qubit processing,” Nature Photonics 2018 12:9, vol. 12, no. 9, pp. 534–539, 8 2018. https://www.nature.com/articles/s41566-018-0236-y
[41] Y. Shen, N. C. Harris, S. Skirlo, M. Prabhu, T. Baehr-Jones, M. Hochberg, X. Sun, S. Zhao, H. Larochelle, D. Englund, and M. Soljačić, “Deep learning with coherent nanophotonic circuits,” NATURE PHOTONICS —, vol. 11, 2017. www.nature.com/naturephotonics
[42] S. Pai, I. A. D. Williamson, T. W. Hughes, M. Minkov, O. Solgaard, S. Fan, and D. A. B. Miller, “Parallel fault-tolerant programming of an arbitrary feedforward photonic network,” 9 2019. http://arxiv.org/abs/1909.06179
[43] T. W. Hughes, M. Minkov, Y. Shi, and S. Fan, “Training of photonic neural networks through in situ backpropagation and gradient measurement,” Optica, vol. 5, no. 7, p. 864, 7 2018. https://doi.org/10.1364/OPTICA.5.000864
[44] S. Pai, Z. Sun, T. W. Hughes, T. Park, B. Bartlett, I. A. D. Williamson, M. Minkov, M. Milanizadeh, N. Abebe, F. Morichetti, A. Melloni, S. Fan, O. Solgaard, and D. A. B. Miller, “Experimentally realized in situ backpropagation for deep learning in nanophotonic neural networks,” Tech. Rep., 2022.
[45] F. Morichetti, S. Grillanda, M. Carminati, G. Ferrari, M. Sampietro, M. J. Strain, M. Sorel, and A. Melloni, “Non-invasive on-chip light observation by contactless waveguide conductivity monitoring,” IEEE Journal on Selected Topics in Quantum Electronics, vol. 20, no. 4, 7 2014.
[46] A. Yuji, A. Wim Bogaerts, H. Sattari, P. Edinger, A. Yuji Takabayashi, I. Zand, X. Wang, A. Ribeiro, M. Jezzini, C. Errando-Herranz, G. Talli, K. Saurav, M. Garcia Porcel, P. Verheyen, B. Abasahl, F. Niklaus, N. Quack, K. B. Gylfason, U. Khan, and W. Bogaerts, “MORPHIC: programmable photonic circuits enabled by silicon photonic MEMS,” https://doi.org/10.1117/12.2540934, vol. 11285, p. 1128503, 2 2020.
[47] R. A. FISHER, “THE USE OF MULTIPLE MEASUREMENTS IN TAXONOMIC PROBLEMS,” Annals of Eugenics, vol. 7, no. 2, pp. 179–188, 9 1936. https://onlinelibrary.wiley.com/doi/full/10.1111/j.1469-1809.1936.tb02137.xhttps://onlinelibrary.wiley.com/doi/abs/10.1111/j.1469-1809.1936.tb02137.xhttps://onlinelibrary.wiley.com/doi/10.1111/j.1469-1809.1936.tb02137.x
[48] T. P. Lillicrap, D. Cownden, D. B. Tweed, and C. J. Akerman, “Random feedback weights support learning in deep neural networks,” Tech. Rep., 2014.
[49] A. Van Schaik, M. D. Mcdonnell, E. O. Neftci, C. Augustine, S. Paul, and G. Detorakis, “Event-Driven Random Back-Propagation: Enabling Neuromorphic Deep Learning Machines,” Frontiers in Neuroscience — www.frontiersin.org, vol. 1, p. 324, 2017. www.frontiersin.org
[50] G. Detorakis, T. Bartley, and E. Neftci, “Contrastive Hebbian learning with random feedback weights,” Neural Networks, vol. 114, pp. 1–14, 6 2019.
[51] A. N. Trondheim, “Direct Feedback Alignment Provides Learning in Deep Neural Networks,” Tech. Rep.
[52] R. C. O’reilly, J. L. Russin, M. Zolfaghar, and J. Rohrlich, “Deep Predictive Learning in Neocortex and Pulvinar,” Journal of Cognitive Neuroscience, vol. 33, no. 6, pp. 1158–1196, 5 2021. https://direct.mit.edu/jocn/article/33/6/1158/98116/Deep-Predictive-Learning-in-Neocortex-and-Pulvinar
[53] F. De Leonardis, R. Soref, V. M. Passaro, Y. Zhang, and J. Hu, “Broadband Electro-Optical Crossbar Switches Using Low-Loss Ge2Sb2Se4Te1 Phase Change Material,” Journal of Lightwave Technology, vol. 37, no. 13, pp. 3183–3191, 2019.
[54] D. A. Miller, “Attojoule Optoelectronics for Low-Energy Information Processing and Communications,” Journal of Lightwave Technology, vol. 35, no. 3, pp. 346–396, 2017.
[55] “THE INTERNATIONAL ROADMAP FOR DEVICES AND SYSTEMS: 2020,” IEEE, 2020.
[56] B. J. Shastri, A. N. Tait, T. Ferreira de Lima, M. A. Nahmias, H.-T. Peng, and P. R. Prucnal, “Neuromorphic Photonics, Principles of BT - Encyclopedia of Complexity and Systems Science,” R. A. Meyers, Ed. Berlin, Heidelberg: Springer Berlin Heidelberg, 2018, pp. 1–37. https://doi.org/10.1007/978-3-642-27737-5_702-1
[57] Y. El-Batawy, F. M. Mohammedy, and M. J. Deen, “13 - Resonant cavity enhanced photodetectors: Theory, design and modeling,” B. B. T. P. Nabet, Ed. Woodhead Publishing, 2016, pp. 415–470. https://www.sciencedirect.com/science/article/pii/B9781782424451000130
[58] K. Nozaki, S. Matsuo, K. Takeda, T. Sato, E. Kuramochi, and M. Notomi, “InGaAs nano-photodetectors based on photonic crystal waveguide including ultracompact buried heterostructure,” Opt. Express, vol. 21, no. 16, pp. 19 022–19 028, 8 2013. http://opg.optica.org/oe/abstract.cfm?URI=oe-21-16-19022
[59] D. A. Miller, “Optics for low-energy communication inside digital processors: quantum detectors, sources, and modulators as efficient impedance converters,” Optics Letters, vol. 14, no. 2, pp. 146–148, 1989. http://ol.osa.org/abstract.cfm?URI=ol-14-2-146
[60] C. Ramey, “Silicon Photonics for Artificial Intelligence Acceleration : HotChips 32,” 2020 IEEE Hot Chips 32 Symposium, HCS 2020, 8 2020.
[61] S. Han, J. Pool, J. Tran, and W. J. Dally, “Learning both weights and connections for efficient neural networks,” pp. 1135–1143, 2015. https://proceedings.neurips.cc/paper/2015/hash/ae0eb3eed39d2bcef4622b2499a05fe6-Abstract.html
[62] X. Xiao and S. J. B. Yoo, “Scalable and Compact 3D Tensorized Photonic Neural Networks; Scalable and Compact 3D Tensorized Photonic Neural Networks,” Tech. Rep., 2021.
[63] I. V. Oseledets, “Tensor-train decomposition,” in SIAM Journal on Scientific Computing, vol. 33, no. 5. Society for Industrial and Applied Mathematics, 9 2011, pp. 2295–2317. https://doi.org/10.1137/20M1316639
[64] A. Novikov, D. Podoprikhin, A. Osokin, and D. Vetrov, “Tensorizing Neural Networks,” Tech. Rep.
[65] C. Hawkins and Z. Zhang, “Bayesian tensorized neural networks with automatic rank selection,” Neurocomputing, vol. 453, pp. 172–180, 9 2021.
[66] M. Berkay On, Y.-J. Lee, X. Xiao, R. Proietti, and S. Ben Yoo, “Analysis of the Hardware Imprecisions for Scalable and Compact Photonic Tensorized Neural Networks,” 2021.
[67] X. Xiao, M. B. On, T. V. Vaerenbergh, D. Liang, R. G. Beausoleil, and S. J. B. Yoo, “Large-scale and energy-efficient tensorized optical neural networks on III–V-on-silicon MOSCAP platform,” APL Photonics, vol. 6, no. 12, p. 126107, 12 2021.
[68] J. U. Knickerbocker, P. S. Andry, B. Dang, R. R. Horton, M. J. Interrante, C. S. Patel, R. J. Polastre, K. Sakuma, R. Sirdeshmukh, E. J. Sprogis, S. M. Sri-Jayantha, A. M. Stephens, A. W. Topol, C. K. Tsang, B. C. Webb, and S. L. Wright, “Three-dimensional silicon integration,” Tech. Rep., 2008.
[69] W. D. Sacher, J. C. Mikkelsen, P. Dumais, J. Jiang, D. Goodwill, X. Luo, Y. Huang, Y. Yang, A. Bois, P. L. Guo-qiang, E. Bernier, J. K. S Poon, D. Celo, D. J. Goodwill, J. Jiang, P. Dumais, C. Zhang, F. Zhao, X. Tu, S. Yan, J. He, M. Li, W. Liu, Y. Wei, D. Geng, H. Mehrvar, E. Bernier, B. Guan, R. P. Scott, C. Qin, N. K. Fontaine, T. Su, C. Ferrari, M. Cappuzzo, F. Klemens, B. Keller, M. Earnshaw, and S. J. B Yoo, “Tri-layer silicon nitride-on-silicon photonic platform for ultra-low-loss crossings and interlayer transitions,” Optics Express, Vol. 25, Issue 25, pp. 30862-30875, vol. 25, no. 25, pp. 30 862–30 875, 12 2017. https://opg.optica.org/viewmedia.cfm?uri=oe-25-25-30862&seq=0&html=truehttps://opg.optica.org/abstract.cfm?uri=oe-25-25-30862https://opg.optica.org/oe/abstract.cfm?uri=oe-25-25-30862
[70] K. Shang, S. Pathak, B. Guan, G. Liu, S. J. B Yoo, J. F. Bauters, M. J. R Heck, D. D. John, J. S. Barton, C. M. Bruinink, A. Leinse, R. G. Heideman, D. J. Blumenthal, J. E. Bowers, R. Moreira, X. Zheng, J. E. Cunningham, I. Shubin, J. Simons, M. Asghari, D. Feng, H. Lei, D. Zheng, H. Liang, C. C. Kung, J. Luff, T. Sze, D. Cohen, A. V. Krishnamoorthy, D. Dai, Z. Wang, and M. C. Tien, “Low-loss compact multilayer silicon nitride platform for 3D photonic integrated circuits,” Optics Express, Vol. 23, Issue 16, pp. 21334-21342, vol. 23, no. 16, pp. 21 334–21 342, 8 2015. https://opg.optica.org/viewmedia.cfm?uri=oe-23-16-21334&seq=0&html=truehttps://opg.optica.org/abstract.cfm?uri=oe-23-16-21334https://opg.optica.org/oe/abstract.cfm?uri=oe-23-16-21334
[71] Y. Zhang, A. Samanta, K. Shang, and S. J. B. Yoo, “Scalable 3D Silicon Photonic Electronic Integrated Circuits and Their Applications; Scalable 3D Silicon Photonic Electronic Integrated Circuits and Their Applications,” IEEE Journal of Selected Topics in Quantum Electronics, vol. 26, no. 2, 2020. https://www.ieee.org/publications/rights/index.html
[72] Y. Zhang, Y.-C. Ling, Y. Zhang, K. Shang, and S. J. B. Yoo, “High-Density Wafer-Scale 3-D Silicon-Photonic Integrated Circuits; High-Density Wafer-Scale 3-D Silicon-Photonic Integrated Circuits,” IEEE JOURNAL OF SELECTED TOPICS IN QUANTUM ELECTRONICS, vol. 24, no. 6, 2018. http://ieeexplore.ieee.org.
[73] S. J. Ben Yoo, B. Guan, and R. P. Scott, “Heterogeneous 2D/3D photonic integrated microsystems,” Microsystems & Nanoengineering 2016 2:1, vol. 2, no. 1, pp. 1–9, 8 2016. https://www.nature.com/articles/micronano201630
[74] P. Blouw and C. Eliasmith, “Event-Driven Signal Processing with Neuromorphic Computing Systems,” ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, vol. 2020-May, pp. 8534–8538, 5 2020.
[75] M. Giulioni, P. Camilleri, M. Mattia, V. Dante, J. Braun, and P. Del Giudice, “Robust Working Memory in an Asynchronously Spiking Neural Network Realized with Neuromorphic VLSI,” Frontiers in Neuroscience, vol. 5, p. 149, 2012.
[76] A. Rao, P. Plank, A. Wild, and W. Maass, “A Long Short-Term Memory for AI Applications in Spike-based Neuromorphic Hardware,” Nature Machine Intelligence 2022 4:5, vol. 4, no. 5, pp. 467–479, 5 2022. https://www.nature.com/articles/s42256-022-00480-w
[77] T. DeWolf, T. C. Stewart, J. J. Slotine, and C. Eliasmith, “A spiking neural model of adaptive arm control,” Proceedings of the Royal Society B: Biological Sciences, vol. 283, no. 1843, 11 2016. https://royalsocietypublishing.org/doi/10.1098/rspb.2016.2134
[78] E. P. Frady, G. Orchard, D. Florey, N. Imam, R. Liu, J. Mishra, J. Tse, A. Wild, F. T. Sommer, and M. Davies, “Neuromorphic Nearest Neighbor Search Using Intel’s Pohoiki Springs,” ACM International Conference Proceeding Series, 3 2020. https://doi.org/10.1145/3381755.3398695
[79] R. C. O’Reilly, R. Bhattacharyya, M. D. Howard, and N. Ketz, “Complementary learning systems,” Cognitive science, vol. 38, no. 6, pp. 1229–1248, 2014. https://pubmed.ncbi.nlm.nih.gov/22141588/
[80] D. Rasmussen, A. Voelker, and C. Eliasmith, “A neural model of hierarchical reinforcement learning,” PLOS ONE, vol. 12, no. 7, p. e0180234, 7 2017. https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0180234

Yun-Jhu Lee received the B.S. in Life Science from the National Taiwan University, Taiwan. He is currently working towards the Ph.D degree in Electrical and Computer Engineering at the University of California, Davis. Research interests include neuromorphic computing, integrated photonics, MEMS, and control system.

Mehmet Berkay On received the B.S. in Electrical and Electronics Engineering from the Bilkent University, Ankara, Turkey in 2018. He is currently working towards the Ph.D degree in Electrical and Computer Engineering at the University of California, Davis. Research interests are enerrgy-efficient photonic neuromorphic systems, RF-photonic signal processing, fiber-optic communication, and compressive sensing.

Luis El Srouji received the B.S. in Applied Physics with an emphasis in Physical Electronics from the University of California, Davis in 2020. He is currently working towards the Ph.D degree in Electrical Engineering at the University of California, Davis. Research interests include the design of bio-physically accurate analog neuron circuits, development of learning algorithms for optoelectronic spiking neural networks, and fabrication of on-chip laser sources.

Li Zhang Li Zhang received the B.S. degree in electronics and information technology and instrumentation from Zhejiang University, Hangzhou, China, in 2016. He is currently pursuing the Ph.D. degree in electrical engineering with the University of California at Davis, Davis, CA, USA. His research interests include ultra-wideband transceiver, trans-impedance amplifier and optical driver.

S. J. Ben Yoo (Fellow, IEEE and Fellow, Optica) received the B.S. degree in electrical engineering with distinction, the M.S. degree in electrical engineering, and the Ph.D. degree in electrical engineering with a minor in physics, from Stanford University, Stanford, CA, USA, in 1984, 1986, and 1991, respectively. He is currently a Distinguished Professor of electrical engineering with UC Davis, Davis, CA, USA. His research with UC Davis includes 2D/3D photonic integration for fu- ture computing, communication, imaging, and navigation systems, micro/nano systems integration, and the future Internet. Prior to joining UC Davis in 1999, he was a Senior Research Scientist with Bellcore, leading technical efforts in integrated photonics, optical networking, and systems integration. His research activities with Bellcore included the next-generation internet, reconfigurable multiwavelength optical networks (MONET), wavelength interchanging cross connects, wavelength converters, vertical-cavity lasers, and high-speed modu- lators. He led the MONET testbed experimentation efforts, and participated in ATD/MONET systems integration and a number of standardization activities. Prior to joining Bellcore in 1991, he conducted research on nonlinear optical processes in quantum wells, a four-wave-mixing study of relaxation mechanisms in dye molecules, and ultrafast diffusion-driven photodetectors with Stanford University. He is a fellow of OSA, NIAC, and was the recipient of the DARPA Award for Sustained Excellence (1997), the Bellcore CEO Award (1998), the Mid-Career Research Faculty Award (2004 UC Davis), and the Senior Research Faculty Award (2011 UC Davis).