Distortion-Aware Phase Retrieval Receiver for High-Order QAM Transmission with Carrierless Intensity-Only Measurements
Abstract
We experimentally investigate transmitting high-order quadrature amplitude modulation (QAM) signals with carrierless and intensity-only measurements with phase retrieval (PR) receiving techniques. The intensity errors during measurement, including noise and distortions, are found to be a limiting factor for the precise convergence of the PR algorithm. To improve the PR reconstruction accuracy, we propose a distortion-aware PR scheme comprising both training and reconstruction stages. By estimating and emulating the distortion caused by various channel impairments, the proposed scheme enables enhanced agreement between the estimated and measured amplitudes throughout the PR iteration, thus resulting in improved reconstruction performance to support high-order QAM transmission. With the aid of proposed techniques, we experimentally demonstrate 50-GBaud 16QAM and 32QAM signals transmitting through a standard single-mode optical fiber (SSMF) span of 40 and 80 km, and achieve bit error rates (BERs) below the 6.25% hard decision (HD)-forward error correction (FEC) and 25% soft decision (SD)-FEC thresholds for the two modulation formats, respectively. By tuning the pilot symbol ratio and applying concatenated coding, we also demonstrate that a post-FEC data rate of up to 140 Gb/s can be achieved for both distances at an optimal pilot symbol ratio of 20%.
Index Terms:
Phase retrieval, coherent communication, direct detection, digital signal processing.I Introduction
Coherent optical communication technology, originally developed for long-haul transmission systems, is now expanding into short-reach transmission applications, including metro, access, and data-center networks, to accommodate the burgeoning traffic growth in edge networks. Coherent receivers, by interfering the received optical field with a local oscillator (LO) light and directly detecting the signal-carrier beat component, yield a linear representation of the entire optical field as electrical waveforms. This naturally facilitates the detection of both amplitude and phase [1]. Transmission impairments like chromatic dispersion (CD), polarization mode dispersion (PMD), and transceiver frequency response impairments can be adaptively compensated via digital signal processing (DSP). Such capability enables the detection of advanced modulation formats, cementing coherent detection as a particularly compelling approach for a high spectral efficiency [2, 3]. Yet, the prerequisite of costly, high-stability narrow linewidth lasers prevents coherent detection from being the ideal choice for cost-sensitive short-reach interconnect scenarios. It also complicates network management as wavelength alignment is required between the transmitters and the receivers. Consequently, researchers are actively pursuing detection paradigms that meld the merits of both coherent and intensity modulation with direct detection (IM-DD) systems.
To avoid using a LO at the receiver (Rx), the optical carrier is generated inside the transmitter and co-travels with the modulated signal. This configuration gives rise to various carrier-assisted detection schemes, such as the Kramers-Kronig Rx [4, 5], Stokes vector Rx [6], and carrier-assisted differential detection [7]. The concept of carrier-assisted detection schemes is to reconstruct the optical field by harnessing the beat component between the signal and its associated carrier. Therefore, these schemes necessitate a certain level of carrier-to-signal power ratio (CSPR) to suppress the interference from signal-signal beating, which limits the transmitted signal power, since a substantial fraction of the overall transmitted power is allocated for the carrier. To eliminate the need for any form of optical carrier, various detection schemes utilizing intensity-only (phase-less) and carrierless measurements have been proposed [8, 9, 10], termed as phase retrieval (PR). In the absence of an optical carrier, signal phase information is derived indirectly from the signal-signal beat components generated during photoelectric detection. Figure 1(a) shows the schematic of a two-photodiode(PD)-based PR Rx, which maps the optical signal with an optical bandwidth of into two radio-frequency waveforms representing measured intensity with an electrical bandwidth of . denotes the dispersion value originating from the dispersive element. Since the symmetric nature of the signal constellation poses serious phase ambiguity problems, PR becomes an ill-posed problem in optical communication scenarios [11].
To solve the PR problem, a strong symbol mixing effect, usually from fiber chromatic dispersion, is preferred to transfer the symbols’ phase changes into fluctuations in the received intensity, thereby furnishing vital details about the signal phase, as shown in Fig. 1(b). Figure 1(c) presents the measured intensity fluctuations of a 50 GBaud 16-ary quadrature amplitude modulation (QAM) signal transmitting over a span of 80-km standard single-mode optical fiber (SSMF) and received by a two-PD-based PR Rx. After symbol mixing, a strong intensity peak implies the overlap symbols modulate similar phases, leading to constructive interference, while vice versa leads to destructive interference, thus leading to a barcode-like pattern in intensity distribution along the time axis. The reconstruction of the signal needs to reproduce the actual field-based interference models. In order to achieve this, multiple measurements are induced to allow the algorithm to iterate between different field projection planes. To mitigate the ill-posedness of the PR problems, prior knowledge about the signal is also induced, such as pilot symbols and spectral constraints, which avoid the algorithm to stagnate in local minima. By implementing these strategies, a polarization-multiplexed 30 GBaud quadrature phase-shift keying (QPSK) signal is successfully recovered from 4 intensity measurements after a 520-km SSMF transmission based on a modified Gerchberg-Saxton (GS) algorithm [9]. In the study presented in [10], the authors utilize a two-dimensional photodetector array to expand the intensity observation number to 32, demonstrating that PR signal reconstruction can be achieved without the help of pilot symbols, provided with abundant intensity measurements. The research in [12] indicates that by digitally incorporating a weak carrier into the signal spectrum to break the inherent symmetry of the optical field, a PR Rx can enhance its performance and save the use of pilot symbols. Despite it uses an unaltered transmitter structure due to a low CSPR of -2 dB, the expansion capability to detect over 2 spatial channels with this scheme can become challenging. These arise chiefly because the received fields containing inter-channel crosstalk may introduce undesirable interference due to the spatial channel mixing effect, and it is difficult to keep the power of carrier-signal beat components evenly distributed over all spatial channels [13].

It is further illustrated that a specially designed approach, involving the placement of two carriers at opposite ends of the signal spectrum with orthogonal polarization states, is required to avoid power fading in the received intensities for enabling polarization multiplexing in weak-carrier PR receiving configurations [14]. Conversely, carrierless PR schemes intrinsically support polarization- [9, 10] and mode-division multiplexing transmission [8, 15] by augmenting the photodetector count, since the optical power can be evenly distributed over all spatial channels. The differences distinguishing these approaches suggest that optical carriers, supplementary observations, and pilot symbols possess analogous functions in providing useful information for field reconstruction. Given the strengths and limitations intrinsic to each method, a practical PR coherent receiver needs to discern and harness the optimal mix of these elements to maximize performance.
One effective method for enhancing the data rate of PR receivers is utilizing advanced modulation formats. Such an approach not only leads to higher spectral efficiency but also optimally exploits the reconstruction accuracy offered by each converged iteration cycle. To accommodate high-order QAM transmission, recently, several works have focused on improving the convergence speed and accuracy of the PR Rx by driving the GS algorithm out of stagnation more aggressively based on adaptive intensity transformation [16] and weight decision constraint operations [17], which show distinct convergence speed improvements in both simulations and experiments. Whereas, these methods show mild improvements in the converged bit error rate (BER) levels in experiments, which cannot drive the 25-GBuad 16QAM signal below 7% forward error correction (FEC) BER threshold, implying that the reconstruction integrity issue of PR Rx to support advanced modulation formats in the presence of complicated channel impairments is not fully addressed. Transmission impairment compensation techniques in standard coherent DSP procedures cannot directly apply to PR schemes. This is because the fundamental channel linearization assumption ceases to be valid during the square-law detection process. Furthermore, cascading standard coherent DSP procedures after field reconstruction could lead to sub-optimal algorithm convergence as the signal distortion during transmission is not fully considered by the algorithm, thus resulting in performance degradation.
This paper aims to fill the existing blank in the channel impairment estimation and compensation techniques within PR receiving schemes. In this paper, we study the limiting factor of PR convergence under noisy measurements, and experimentally investigate the reconstruction performance boundary of a two-PD-based PR receiver by exploring its feasibility in transmitting 50 GBaud 16QAM and 32QAM signals over one SSMF span, given that sufficient symbol mixing and prior knowledge from pilot symbols are provided. In this case, as the intrinsic signal recovery capability of the algorithm is ensured, the main limiting factors of the final performance are the errors in the detected intensity waveforms, which do not obey the propagation rule described by the dispersion transfer equation in the GS iteration. To mitigate the intensity errors stemming from signal distortions, we propose a distortion-aware PR scheme comprising both training and reconstruction stages. By estimating and emulating the distortion caused by various channel impairments throughout the PR DSP, the proposed strategy fosters enhanced congruence between the estimated and measured amplitude during iterations, thus enabling improved reconstruction performance to support high-order QAM transmission. With the aid of the proposed techniques, for both 40- and 80-km SSMF transmission, the BERs of recovered 16QAM and 32QAM signals can reach under below the 6.25% hard decision (HD)-FEC threshold of [18] and 25% soft decision (SD)-FEC threshold of [19], respectively. Additionally, by tuning the pilot symbol ratio, we demonstrate that our proposed PR scheme can support a post-FEC data rate of up to 140 Gb/s per polarization for a single wavelength channel for both distances at an optimal pilot symbol ratio of 20%.
For the rest of the paper, Section \@slowromancapii@ delineates the principle and methodology of the proposed PR scheme. Section \@slowromancapiii@ outlines the experimental setup for proof-of-concept validation. Section \@slowromancapiv@ presents the experimental results from the aspects of channel estimation, field reconstruction, and achievable data rate. Section \@slowromancapv@ concludes the paper.
II Principle and Methodology
II-A Phase Retrieval Convergence Limit
The convergence property of the GS algorithm has been proved mathematically when it is proposed in [20], while its convergence limit, which decides the ultimate convergence accuracy of the GS-based PR receiving scheme, is not clear. In this subsection, we aim to analyze the convergence limit of GS algorithm under noisy amplitude measurements by derivations. First, we consider a standard GS-based PR iteration with two alternated projections, with its schematic shown in Fig. 1(d). The goal is to estimate the field phase distribution and with knowing the measured amplitude distribution and across time samples. The relationship of the -th sample of the fields at two projection planes is given by
(1) |
and
(2) |
where , and denote the Toeplitz matrix derived by shifting the channel impulse response. We assume that and are known beforehand. and denote the estimated phase during the iteration. By propagating the estimated field to the other projection plane, the expected amplitude of the -th sample of estimated field is
(3) |
The gradient vector of combining measured amplitude with estimated phase is given by
(4) |
If we assume the estimated phase and approximate their true values, the mathematical expectation of the amplitude squared error between the measured and estimated value would approach zero as
(5) |
Equation (5) shows that each element in and has a tendency to approach zero, implying that under no amplitude measurement error, the correct phase and become a stationary point of the algorithm.
Now we consider that the amplitudes contain zero-mean additive white Gaussian noise (AWGN) during observation given by and , where and represent independent random noise. Using these definitions, Eq. (5) can be rewritten as
(6) | ||||
where and denotes the noise variance of and . The second step in Eq. (6) is derived by expanding the square bracket and simplifying the expression by and . The third step assumes a precondition as , since the propagation operation obeys energy conservation. Compared with Eq. (5), the discrepancy in Eq. (6) shows that, whether the convergence capability of an alternative projection PR scheme is good or not, its achievable performance limit is fundamentally constrained by the noise. As the observation error limits the minimum stepsize of the gradients during iterations, the estimated field will be forced to wander around the correct values by the misleading gradients, which limits the convergence accuracy.
To achieve a more accurate approximation between the estimated and measured amplitudes, the power of and should be suppressed. While it is impossible to predict amplitude errors stemming from noise due to its nature of randomness, a portion of these errors arises due to signal distortion, which comes from the non-ideal transfer characteristic of the devices. Therefore, recognizing the precise form of the distortion could partially compensate its influences and lead to reconstruction results with higher accuracy.

II-B Distortion-aware Phase Retrieval
Figure 2(a) illustrates the schematic of the PR transmission system as well as the transceiver DSP procedures employed in the experiment. Upon completing QAM mapping, frame generation, and pulse shaping, the transmitter (Tx) DSP premixes the sequence with extra chromatic dispersion in the digital domain, ensuring pronounced intensity fluctuations on the Rx end. The premixed patterns undergo conversion via the digital to analog converter (DAC), subsequently modulated into the optical domain, and then transmitted via fiber to be captured by a PR receiver. The receiver is based on a two-PD-based PR configuration with the same structure as in Fig. 1(a). Throughout the transmission, several impairment factors, such as the electrical-to-optical (E/O) and optical-to-electrical (O/E) frequency response, in-phase and quadrature (IQ)-dependent impairments, and modulator nonlinearity, may distort the signals.
To compensate for the distortion, a distortion-aware PR scheme is proposed, which uses a training sequence to identify the channel impairments. The frame structure is depicted in Fig. 2(b). The training sequence is placed at the start of a frame, with two segments of guard symbols placed beside the training sequence to prevent unknown mixing symbols from the payload section after chromatic dispersion. These guard symbol sequences can be derived as the cyclic prefix and postfix of the training sequence to make it follow the circular convolution. The pilot symbols distribute across the payload section evenly to facilitate convergence by eliminating phase ambiguity. The Rx DSP employs the pre-known training sequence to conduct dispersion estimation, time synchronization, and determine channel impairments during the training stage. The dispersion values for the two branches are estimated using a grid-search method aimed at maximizing the correlation peak values between the propagated training sequence’s intensities and received intensities. Within the training stage, the channel impairments are estimated by minimizing errors between the distorted forward propagated training sequences’ intensities and the actual measurements. During the reconstruction stage, the proposed PR scheme empowers the algorithm to emulate distortions caused by channel impairments in forward propagation and apply reverse distortion in backward propagation throughout its iterations. Besides these procedures, the receiver DSP also encompasses a performance evaluation module to calculate metrics like BERs, generalized mutual information (GMI), and the recovered constellations.
II-C Training Stage
The whole training stage contains several procedures, including estimating Tx and Rx response, optimal direct current (DC) level, IQ-dependent impairments (including power imbalance, time skew, and phase mismatch), and modulator nonlinearity.
To start with, the Rx response estimation module is for obtaining the channel distortion induced by the frequency roll-off during the O/E process, e.g. bandwidth limitations raised from the photodiodes and high-speed radio frequency (RF) cables as well as connectors. As these impairments happen after photoelectric detection, a pair of feed-forward equalizers (FFEs) are used for compensating those linear impairments. To calibrate the FFEs, the training sequence is initially propagated to the receiver photodiodes using the estimated chromatic dispersion values. The expected waveforms of received intensities are then computed via square-law detection. Subsequently, DC components from both received and expected intensities are removed, and FFE is trained on these DC-removed intensities. After FFE equalization, the DC offset levels are reintroduced, with their values optimized to minimize the mean absolute error (MAE) between the received and expected intensities. The aforementioned procedures eliminate the impact of incorrect DC offset level under O/E frequency response, thus providing the PR algorithm an optimal baseline of complete darkness.
For the estimation of Tx response, the situation becomes intricate, given that the I- and Q-channels can exhibit independent impairments, meaning their frequency-dependent phase changes need to be estimated. Since no direct phase information about the channel is measured, this transforms into a PR problem. Several works in [9] and [10] have successfully implemented complex-valued channel estimation using intensity measurements over multiple photodiodes. These primarily rely on multiple observations as a fundamental condition to address the PR challenge. In contrast, this paper introduce a complex-valued channel estimation technique that only employs one single PD measurement, with its pseudocode provided in Algorithm 1. , , denote the complex-valued field of training sequence, the impulse response of fiber CD, and the impulse response for a bandwidth constraint filter used to eliminate out-of-band frequency components, respectively. The main concept is merging the phases of the propagated training sequence with the measured amplitudes to derive the estimated distorted field , and estimating the channel with least square (LS) algorithm iteratively until the MAE between and starts to increase or the iteration number reaches a preset maximum . The convolution operation is used for conciseness in expression but is preferred to be implemented in frequency domain to achieve fast computation. In the experiment, we set as 20 and observe that the estimation converges within 8 iterations in most cases, which can be attributed to the initial is close to its correct values. By fully leveraging the prior knowledge of the training sequence, the proposed design alleviates the effects of channel disparities across multiple measurements, including bandwidth limitation variations among PDs and additional distortion introduced by dispersive elements. This could lead to improved accuracy and versatility regardless of the number of PDs.
For the estimation of IQ-dependent impairments and modulator nonlinearity, the distortions are introduced by field-based mathematical models. The distorted optical field due to IQ-dependent impairments can be expressed as
(7) |
where variables , , denote the power imbalance coefficient, time skew, and phase error between the I- and Q-channels, respectively. and denote the undistorted real-valued I- and Q-channel signals. When considering modulator nonlinearity, the nonlinear transfer function for the I-channel is approximated using a cubic polynomial, given by
(8) |
The Q-channel follows a similar representation, with nonlinear coefficients denoted as and . Note that the modulator nonlinearity is applied before the IQ impairments as two Mach-Zehnder interferometer can have different nonlinear properties, even it is estimated at the last step. These parameters are determined by minimizing the mean absolute error between the distorted forward propagated training sequence’s intensities and the actual measured values (both are power normalized). To be specific, the optimization employs a grid-search method in a greedy algorithm manner. The search process starts with a zero value for each parameter, and then a parameter scan is conducted with a suitable search span and granularity sequentially in an order of , , , , , , over three rounds.
After completing an entire training cycle, the scheme offers an optional refinement loop to iteratively enhance estimation accuracy, as indicated by the red arrow in Fig. 2(a). This is mainly because the Rx frequency limitations can be over estimated in the first cycle, since the frequency response impairments from Tx has not been considered when training the FFEs. Meanwhile, knowledge about the distortion induced by IQ-dependent impairments and modulator nonlinearity can be employed to modify the third line of Algorithm 1 to further increase its approximation capability in the refinement loop.
II-D Reconstruction Stage
The reconstruction stage of the proposed PR scheme is illustrated in Fig. 3, which resembles the pilot symbol-assisted PR in [9, 21], albeit with several modifications.

The first modification is inducing the Tx distortion raised from Tx frequency response, IQ impairments, and modulator nonlinearity when the estimated field is forward propagated from Tx to Rx. Correspondingly, a designated module introduces reverse distortion during backward propagation of the estimated field from Tx to Rx. This reverse distortion channel is derived by inversely determining the estimated channel distortion. The reverse Tx response is determined by inverting the Tx response in the frequency domain. For the reverse IQ impairment parameters, they are ascertained by taking the negative values of the estimated parameters , , . The reverse nonlinear transfer function is deduced by computing the inverse function of the estimated cubic polynomial function representing modulator nonlinearity.
The second modification is inducing a dual-trace propagation mechanism, as highlighted in red and blue in Fig. 3. The two color arrows denote the two estimated field traces experiencing different amplitude constraint sequences. After amplitude constraints, these two traces approach the correct field via distinct trajectories, enabling higher accuracy and reducing the likelihood of convergence to local minima after field mixing in the Tx plane. Moreover, the estimated results of these traces should coincide upon reaching the correct phase estimation. Consequently, any pronounced discrepancy in their amplitude waveforms indicates potential estimation failures, thus providing reliable criterion for the selective phase reset module [21]. In the experiment, the selective phase reset provides better reconstruction results under an insufficient pilot symbol ratio, while it shows a mild influence to the reconstruction results at a high symbol ratio such as 50%.

In addition to the aforementioned modifications, the proposed PR scheme integrates a forward and backward CD premixing module to adapt digitally CD-premixed symbols. Besides, every CD propagation operation includes a bandwidth constraint filter designed to eliminate local minima with incorrect frequency structures. After implementing the pilot constraint on the Tx projection plane, the estimated field undergoes down-sampling to the symbol rate, and then a pulse shaping procedure is introduced to circumvent the estimation of sub-symbol sampling points, thereby reducing the degree of freedom for the optimization problem. Note that the pilot symbol constraint has to be conducted at the transmitter projection plane to guarantee the one-to-one mapping between ideal and estimated pilot symbols as there is no channel memory effect in this projection.
III Experimental Setup
Figure 4 presents the experimental setup for the proposed PR scheme transmitting 50-GBaud QAM signals over a 40-km or 80-km span of SSMF. At the transmitter side, a continuous wave (CW) laser, featuring a 10-kHz linewidth (Pure Photonics PPCL500) and operating at 1541.02 nm, is modulated by an IQ-Mach-Zehnder modulator (MZM) to produce the premixed QAM signal with a roll-off factor of 1%. The MZM biases are carefully set at the null point to fully suppress the carrier. The transmitted IQ patterns undergo the Tx DSP procedures as illustrated in Fig. 2(a). The payload and training symbol segments are generated by pseudorandom bit patterns independently with no correlation. The training sequence consists of QPSK symbols, positioned between two guard symbol segments, each of which has a length of 64 symbols. The payload sequence circularly repeats a segment of QAM symbols for 30 times. The digital CD for premixing is set to -3000 ps/nm, which corresponds to overlapping 60 symbols at 50 GBaud. This large amount of CD premixing aims to explore the PR Rx performance under extensive symbol mixing. In practice, as SSMF provides a unit CD of 17 ps/nm/km, the digital CD premixing can be removed when the system operates at a higher symbol rate or transmits through longer distances. An extra 0.5% clipping is applied before the digital patterns are sent to the DAC to constrain the peak-to-average power ratios (PAPRs), thus improving the power efficiency of electric amplifiers (EAs). The output modulated signal is pre-amplified by an erbium-doped fiber amplifier (EDFA), followed by a variable optical attenuator (VOA), which is for controlling the transmitting optical power to eliminate nonlinear distortion during fiber transmission, with a launch power of -2 and -2.5 dBm for the 40- and 80-km transmission, respectively.
At the receiver side, another EDFA is used to re-boost the received optical power, with its input port connected to an amplified spontaneous emission (ASE) source by a 90:10 optical coupler. The coupled ASE power can be adjusted by tuning the pump current of the ASE source, resulting in received signals with different optical signal-to-noise ratio (OSNR) values into the PR Rx. The output port of the Rx EDFA connects to a 99:1 optical splitter, with the 1% branch connected to an optical spectrum analyzer (OSA) for OSNR measurement. The OSNR is measured as the ratio of the signal power to the ASE power within 0.1 nm bandwidth. Inside the PR Rx, an optical band-pass filter (OBPF) is used to filter out-of-band noise with a bandwidth of 0.6 nm, cascaded by a 70:30 optical splitter. This power splitting ratio is chosen to balance the received optical power at the two photodiodes. The Rx EDFA pump current is adjusted to set the filter output power as 7 dBm when no extra ASE is introduced. The dispersive element used in the PR Rx is a fiber Bragg grating (FBG)-based dispersion compensation module (Proximion DCM-FB) with a measured chromatic dispersion of -1275 ps/nm at the operating wavelength. In comparison to using a span of SSMF or dispersion compensating fiber (DCF) as the dispersive element, the FBG-based approach offers several advantages: it introduces mild loss (3 dB), negligible latency (115 ns, including the delay from patch cord fiber), and zero nonlinear effects. Additionally, it can provide a distinct amount of dispersion, making it particularly suitable for PR systems that aim at a high reconstruction accuracy. In practice, as continuous dispersion can be offered over the whole C-band via FBG-based solution, a single dispersive element can be shared by multiple PR receivers operating at different wavelength channels, thus reducing the implementation cost. The received two-branch optical signals are then detected by two 50-GHz class photodiodes (u2t XPDV2120RA). The detected electrical signals are then captured by a digital storage oscilloscope with a sampling rate of 160 GSa/s. The captured waveforms experience down-sampling to 2 samples per symbol, power normalization, and then being processed offline with the Rx DSP flow illustrated in Fig. 2(a). The tap number of FFEs for Rx response estimation is set to 101, and the tap number for Tx response estimation in Algorithm 1 is adjusted to 511. The relatively high tap number for the latter could be attributed to the distortions and reflections along the RF chain connecting discrete components at the Tx, and it could also be due to that having sufficient tap coefficients helps solving the PR problem by enabling more refined distortion estimation.
IV Experimental Results
IV-A Channel Estimation Results
Figures 5(a,b) depict the Rx-side normalized power frequency response obtained from the two PDs with different refinement loop configurations. These response results are estimated from the measured intensities of the training sequence after a 40-km transmission under a 35-dB OSNR using the estimation method discussed in Subsection \@slowromancapiv@-C. The curves reveal that with increasing refinement iterations, the transfer characteristics for high-frequency components exceeding 25 GHz get improved. Consequently, the 3-dB bandwidth of both ports converges towards 50 GHz.

Figures 6(af) present the Tx response results estimated from the same intensity data as in Figs. 5(a,b) with 2 refined loops. The I- and Q-channel are estimated separately since they possess different modulator nonlinear coefficients. Figures 6(a,b) show the power-normalized time domain impulse power response for the I and Q channels. The power and frequency phase responses are shown in Figs. 6(c,d) and Figs. 6(e,f), respectively, which both exhibit structured features of modulation signals. We notice that the results obtained from the two PDs appear to be similar in overall outline, implying that the effectiveness of the estimated method is likely to be robust to CD variation.



After combining the estimated I- and Q-channel results, the Tx-side normalized power frequency response is shown in Fig. 7(a). The measured optical spectra using OSA at Tx and Rx with different OSNRs are given in Fig. 7(b). The spectrum humps within 3 GHz may arise due to the unflatness of the frequency transfer characteristics of the EAs or RF chains at the Tx, which can be clearly observed in both the measured optical and estimated electrical spectra. By calculating the division of the estimated frequency response between the dispersed branch and the undispersed branch, Figs. 7(c,d) present the estimated spectrum and frequency phase response of the dispersive element. Although the small phase variation could be raised by the phase ripple of the FBG, the flatness of both the spectrum and phase profiles within the signal bandwidth suggests that the dispersive element introduces only negligible distortion.
Figures 8(ag) illustrate the resulting changing curves of intensity error as a function of each estimated parameter for , , , , , , during IQ-dependent impairment and modulator nonlinearity estimation. The intensity error is determined by calculating the MAE between the measured and estimated intensities of the training sequence. The estimated parameter values with the minimum intensity MAEs are marked with and from two receiving photodiodes, respectively. Although the two branches have different CD values, they obtain similar estimated values for these parameters. Figures 8(hi) depict the modulator nonlinear transfer curve for the I- and Q-channels obtained by the estimated nonlinear coefficients from the dispersed branch. The black dashed line represents the linear transfer function as a reference. The orange curves denote the calculated nonlinear mapping to reverse the modulator nonlinearity.
Figures 9(ad) validate the effectiveness of the channel estimation. Figures 9(a,b) compare approximation levels between the estimated and measured intensities for the undispersed branch, considering scenarios with and without estimated channel distortion. In both figures, red dashed lines with dots depict the average intensity of 30 separate measurements. The payload section of the transmitted sequence is assumed to be distorted by estimated impairments, including Tx response, IQ impairments, and modulator nonlinearity.

Then, the sequence propagates to the Rx with estimated CD values. Consequently, the expected intensity waveform can be deduced via the square-law detection formula, represented by a black line with dots in Fig. 9(b). In contrast, a green line with dots illustrates the expected intensity without considering the channel distortion in Fig. 9(a). The intensity errors for both schemes are calculated through 30 independently measured traces and depicted using lines with different colors. The high OSNRs during measurements constrain the distribution of these intensity error waveforms. Notably, numerous spikes emerge in the intensity error curves of Fig. 9(a), while their counterparts in Fig. 9(b) display minimal distinct spikes, a consequence of having knowledge regarding the channel distortion. Figures 9(c,d) contrast the errors between scenarios with and without the incorporation of estimated distortion, by presenting the spectra of both the estimated payload intensity and its measured counterpart. In both figures, the green shades denote the spectra of the intensity error using one measured trace. The lower and flatter outline of the intensity error spectrum in Fig. 9(d) indicates the error has been mitigated after propagating a distorted payload sequence. The rest fluctuations in Fig. 9(b) and the noise floor in Fig. 9(d) could be attributed to the effective number of bits (ENOB) limitations of analog-to-digital converters (ADCs) and the residual estimation error during channel estimation. The dispersed branch intensity appears similar approximation enhancement after applying channel distortion, while it is not shown by a figure to avoid repeated illustrations. For both of the two branches, by propagating the payload sequence without channel distortions, the measured intensity and amplitude signal-to-noise ratios (SNRs) are around 14 and 16 dB. After considering the estimated channel distortions, the intensity and amplitude SNRs for both branches achieve roughly 19 and dB, respectively.
IV-B Reconstruction Results


Figures 10(af) illustrate the measured pre-FEC BERs versus OSNRs for QPSK, 16QAM, and 32QAM signals across 40- and 80-km SSMF transmission with different reconstruction configurations. In these figures, a saturated pilot symbol ratio of 50% is adopted to explore the BER lower bounds, reflecting the maximum achievable reconstruction accuracy across different OSNRs. In this case, since the BERs are constrained by noise rather than local minima, the selective phase reset operation is skipped. More than symbols are used for the computation of each data point on the BER curves. The distortion-aware PR with no refinement loop, following the procedure detailed in Subsection \@slowromancapii@-D, is delineated by red lines with squares. The same PR schemes, but including 1 and 2 refinement loops, are depicted by purple and brown lines, respectively. Blue lines denote the conventional PR that omits all distortion compensation techniques including FFE pre-equalization. Orange lines with triangles denote the conventional PR that integrates FFE pre-equalization. Green lines with diamonds represent the distortion-aware PR excluding compensation for IQ impairment and modulator nonlinearity.
In high OSNR regimes, e.g. exceeding 28 dB, the PR scheme employing all compensation techniques and refinement loops delivers the lowest BERs. As OSNR falls below 28 dB, the distortion-aware PR with no refinement loop emerges as the optimal approach, suggesting that the estimation error tends to accumulate during refinement loops under noisy measurements. Surprisingly, the 80-km transmission results reveals lower BERs in low OSNR regimes as compared to the 40-km transmission. This deviation from conventional understanding can be attributed to the pump current difference of the Rx EDFA when measuring over two distances. To avoid the power saturation issue at the highest OSNR, the 40-km transmission uses a pump current of 377 mA rather than 890 mA in the 80-km case, which leads to a received optical power decrease by a few decibels in low OSNR regime. Therefore, the thermal noise at the Rx has more impact in 40-km transmission. The simulated BER-OSNR curves, considering ASE, thermal noise, and ENOB, are given as a reference indicating the performance without any distortion from channel impairments. The simulation assumes a PD responsivity of 1 A/W and a thermal noise level at 10e-12 A/Hz1/2. The results with ENOBs of 5.8 and 8 are marked with black solid lines and blue dashed lines, respectively. In the left two columns of the figures, the error floors of the simulated black BER curves indicate that the ENOB limitation could be the bottleneck hindering the high-order formats from having lower BERs.
Figures 11(ac) depict the measured pre-FEC BER curves versus iteration number recovering QPSK, 16QAM, and 32QAM signals over a 40-km transmission under various OSNRs using the distortion-aware and conventional PR schemes. Due to an abundant pilot symbol ratio of 50%, both PR schemes achieve convergence within 40 iterations. However, their converged BER floors appear to be different and are influenced by the PR configuration and the received OSNRs. The conventional PR scheme exhibits high BER floors, failing to meet the FEC thresholds, which can be explained by the uncompensated distortion. In contrast, the BER curves employing the distortion-aware PR satisfy FEC thresholds under relatively high OSNRs, while performing an earlier and degraded convergence under low OSNRs, validating the impact of noisy measurements on PR algorithms as analyzed in Subsection \@slowromancapii@-A. The recovered constellations after convergence for both schemes are compared in Figs. 11(di). The compact clustering of the constellations demonstrates the effectiveness of the distortion-aware PR.
IV-C Achievable Data Rate Assessment
The previous subsection illustrates the achievable BER limits constrained by noise during measurements. Yet, this does not offer a direct insight into the systematic achievable capacity of the presented experimental setup. Given that pilot symbols occupy the time slots allocated for payload symbols, a trade-off exists between reconstruction accuracy and the pilot symbol ratio. Consequently, the maximum data rate may not align with the highest reconstruction accuracy.
To determine the achievable data rates, we follow a similar procedure as described in [22]. We implement encoding and decoding using low-density parity-check (LDPC) codes from the DVB-S2 standard [23] and an outer hard-decision code with 6.25% overhead [18]. The concatenated coding approach eliminates the issue of measuring a limited number of symbol in the experiments, allowing us to claim a valid data rate at post-LDPC BERs of below the outer FEC theshold rather than below . However, embedding encoded patterns in the transmitter adds inconveniences to the experiments for a significantly increased measurement number and offers no assurance against time-varying factors (e.g. modulator bias drifting due temperature fluctuation) affecting the measured outcomes. As a result, we choose to directly apply LDPC codes to the uncoded offline data by introducing a syndrome sequence and modifying the min-sum algorithm as described in [24]. The DVB-S2 LDPC codes consist of a set of different code rates with a fixed block length of 64,800 bits. LDPC code rates of 1 (only employing the outer code), 9/10, 5/6, 3/4, 2/3, and pilot symbol ratios ranging from 1/2 to 1/6 are selected. 13 code blocks are utilized in computing the pre-LDPC BERs. Due to a lack of variation in the transmitted pseudo-random pattern, we integrate a random interleaver placed after soft mapping to improve estimation precision.

Figures 12(a,b) show the resulting post-FEC net data rates versus different pilot symbol ratios for 16QAM and 32QAM signals over 40- and 80-km transmissions, evaluated at their peak OSNR values. The selective phase reset operation is enabled, and the phase reset threshold is adjusted to its optimal value to drive the algorithm out of local minima and result the lowest pre-FEC BERs. The net data rates are calculated by selecting the highest LDPC code rate that yields a post-LDPC BER below the outer FEC threshold () for each pilot symbol ratio. The GMI, reflecting the achievable data-rate upper limit employing an ideal soft-decision FEC code of infinite code length, is estimated based on the AWGN noise assumption and presented as a reference metric [25]. The total code rates after considering the outer code are given in Figs. 12(c,d) for the two transmission distances. It can be seen that the maximum post-FEC data rates of 140 Gb/s are achieved at a pilot symbol ratio of 1/5 employing 32QAM modulation format for both distances, while the pilot symbol ratio of 1/2 only yields roughly 100 Gb/s. When the pilot symbol ratio falls below 1/5, the LDPC-estimated net data rates start to decline due to the arising estimation error in the PR process. Nonetheless, the estimated GMIs continue to increase, implying that the post-FEC data rates drop due to the limited granularity of the code rates and could be further enhanced using advanced modulation formats with enhanced rate adaptability, such as probabilistic constellation shaping (PCS). We also evaluated the conventional PR without considering channel distortions. In this case, after 40-km transmission, the LDPC code rates obtaining maximum achievable post-FEC data rates are 3/4 and 1/2 at a pilot rate of 1/3 for 16QAM and 32QAM signals, respectively. In contrast, the distortion-aware PR scheme demonstrates an approximate 49% enhancement in the achievable post-FEC data rate compared with the conventional PR. This can be ascribed to its improved reconstruction accuracy, thus reducing the dependency on the pilot symbols. Figures 12(e,f) depict the number of iterations required for the reconstruction algorithm to fully converge, which is determined by the relative change rate of mean amplitude error between the estimated fields of two traces in Fig. 4. A relative change rate of less than 0.1% lasting for ten iterations is regarded as a sign of full convergence. The computational complexity of PR predominantly arises from the (inverse) fast Fourier transform (FFT/IFFT) operations associated with complex-value field propagation in the reconstruction stage. Thus, the computational complexity for each processing block can be estimated as , where , , represent the number of required iterations, the FFT/IFFT operations per iteration, and the block size, respectively. The data in Figs. 12(e,f) reveals that while high-order QAM formats do not significantly affect the required iteration count, increasing the pilot symbols tends to proportionally reduce . This suggests that under conditions of relatively low noise level, opting for an advanced QAM order choosing a high pilot symbol ratio might be more advantageous than simpler QAM formats with a low pilot symbol ratio to meet a specific data rate, since the former requires much fewer iterations to converge and thereby saves the computational complexity.
IV-D Discussion
To summarize the experimental findings, to begin with, the channel results obtained in the training stage offer useful information regarding the transmitter’s frequency response and IQ-dependent impairments as well as modulator nonlinearity. This suggests that a photodiode can function as a transmitter monitoring device without resorting to direct phase measurements. Secondly, the proposed PR approach allows enhanced optical field reconstruction after considering distortion effects from various channel impairments. The reconstruction process fully considers the propagation effect within the fiber, thus making the performance resilient to chromatic dispersion.
Regarding the performance limiting factors, in high OSNR regimes, if we assume that the thermal noise is negligible for adequate received optical power, then the reconstruction accuracy is constrained by the limited ENOB during analog-to-digital (A/D) conversion. This limitation is fundamentally linked to the principle of PR, as symbol mixing inherently generates intensity waveforms with elevated PAPRs, resulting in a relatively low signal-to-quantization-noise ratio. In the experiment, we observe PAPRs spanning from 13 to 17 dB in the captured intensity waveforms. While clipping before A/D conversion can mitigate this effect, excessive clipping degrades the PR performance by distorting the intensity waveform peaks. In the future, advancements in high-speed A/D technology may resolve the ENOB challenge.
The requirement on pilot symbol ratios is another issue. As depicted in Figs. 12(c,d), a reduction in pilot symbol ratios correlates with a decline in code rates and a proportional increase in the required iteration number. This pattern underscores that the inherent illness of a two-PD-based PR Rx is not fully addressed without sufficient pilot symbol constraints. Considering these factors, a carrierless PR approach that balances a moderate pilot symbol ratios with fast convergence and high precision might necessitate additional independent intensity measurements. As the study on space-time diversity PR [26] indicates, the BER floor issues at reduced pilot symbol ratio can be alleviated by employing four intensity measurements instead of two. The contrast between [26] and this work suggests two potential development paths for PR, both of which can benefit by utilizing the proposed distortion compensation strategy. The first prioritizes the uses of numerous PDs, advanced modulation formats, and moderate pilot symbol ratios to maximize PR’s transmission capacity. The second leverages simper, noise-resilient modulation formats and a small number of PDs, such as two, to construct a simplified receiving front-end, and primarily employs PR as a transmission scheme that is both chromatic dispersion-resistant and colorless (resilient to wavelength drift and laser phase noise).
V Conclusion
In this paper, we investigated the feasibility and limitations of receiving high-order QAM signals using carrierless, intensity-only measurements through PR receiving techniques. The errors observed in the received intensities do not follow the field propagation relationship across different measurement projections, thus limiting the converged accuracy of the PR reconstruction. To mitigate part of these errors induced by channel impairments, a distortion-aware PR scheme has been proposed and demonstrated improved reconstruction accuracy in the experiments, which enabled the successful transmission over 40- and 80-km SSMF and achieved BERs below the 6.25% HD-FEC and 25% SD-FEC thresholds for 50-GBaud 16QAM and 32QAM signals, respectively. By implementing concatenated coding schemes, the achievable post-FEC data rate reached 140 Gb/s per polarization per wavelength channel at an optimal pilot symbol ratio of 20% for both distances. Compared with carrier-assisted detection schemes, the PR approach requires no assistance from the local or signal-accompanying optical carrier. As a result, PR receivers can benefit from having a simplified front-end architecture and enhanced energy efficiency in the optical domain, as it fully uses the transmitting optical power to send the information-carrying signals. Therefore, PR could be a promising solution for high-speed short-reach optical interconnect scenarios, such as datacenter and metropolitan area networks.
References
- [1] K. Kikuchi, “Fundamentals of Coherent Optical Fiber Communications,” J. Lightw. Technol., vol. 34, no.1, pp. 157-179, 2016.
- [2] X. Zhou, R. Urata, and H. Liu, “Beyond 1 Tb/s Intra-Data Center Interconnect Technology: IM-DD OR Coherent?,” J. Lightw. Technol., vol. 38, no. 2, pp. 475-484, 2020.
- [3] D. Che et al., “First Demonstration of Erbium-Doped Waveguide Amplifier Enabled Multi-Tb/s (16×1.6T) Coherent Transmission,” in Proc. Opt. Fiber Commun. Conf., paper Th4B.3, 2023.
- [4] A. Mecozzi, C. Antonelli, and M. Shtaif, “Kramers–Kronig coherent receiver,” Optica, vol. 3, no.11, pp. 1220-1227, 2016.
- [5] X. Chen et al., “Kramers–Kronig Receivers for 100-km Datacenter Interconnects,” J. Lightw. Technol., vol. 36, no. 1, pp. 79-89, 2018.
- [6] D. Che, A. Li, X. Chen, Q. Hu, Y. Wang and W. Shieh, “Stokes Vector Direct Detection for Linear Complex Optical Channels,” J. Lightw. Technol., vol. 33, no. 3, pp. 678-684, 2015.
- [7] W. Shieh, C. Sun and H. Ji “Carrier-assisted differential detection,” Light Sci Appl, vol. 9, no. 18, 2020.
- [8] Sercan Ö. Arik and Joseph M. Kahn, “Direct-detection mode-division multiplexing in modal basis using phase retrieval,” Opt. Lett., vol. 41, no. 18, pp. 4265-4268, 2016.
- [9] H. Chen, N. K. Fontaine, J. M. Gene, R. Ryf, D. T. Neilson and G. Raybon, “Dual Polarization Full-Field Signal Waveform Reconstruction Using Intensity Only Measurements for Coherent Communications,” J. Lightw. Technol., vol. 38, no. 9, 2020.
- [10] Y. Yoshida, T. Umezawa, A. Kanno and N. Yamamoto,“A Phase-Retrieving Coherent Receiver Based on Two-Dimensional Photodetector Array,” J. Lightw. Technol., vol. 38, no. 1, pp. 90-100, 2020.
- [11] Q. Qu, Y. Zhang, Y. C. Eldar and J. Wright, “Convolutional Phase Retrieval via Gradient Descent,” IEEE Trans. Inf. Theory, vol. 66, no. 3, pp. 1785-1821, 2020.
- [12] Q. Wu, Y. Zhu and W. Hu, “Carrier-Assisted Phase Retrieval,” J. Lightw. Technol., vol. 40, no. 16, pp. 5583-5596, 2022.
- [13] H. Huang et al., “MIMO Processing With Linear Beat Interference Cancellation for Space Division Multiplexing Self-Homodyne Coherent Transmission,” J. Lightw. Technol., vol. 40, no. 13, pp. 4136-4149, 2022.
- [14] Q. Wu, Y. Zhu, Q. Zhuge and W. Hu, “Dual-Carrier-Assisted Phase Retrieval for Polarization Division Multiplexing,” J. Lightw. Technol., vol. 40, no. 22, pp. 7297-7306, 2022.
- [15] H. Chen et al., “Mode-Multiplexed Full-Field Reconstruction Using Direct and Phase Retrieval Detection,” in Proc. Opt. Fiber Commun. Conf., paper W4A.5, 2020.
- [16] M. Xiang et al., “Adaptive intensity transformation-based phase retrieval with high accuracy and fast convergence,” Opt. Lett., vol. 46, no. 13, pp. 3215-3218, 2021.
- [17] P. Zhou et al., “Weighted Decision Enabled Phase Retrieval Receiver With Adaptive Intensity Transformation,” J. Lightw. Technol., vol. 41, no. 19, pp. 6208-6217, 2023.
- [18] L. M. Zhang and F. R. Kschischang, “Staircase Codes With 6% to 33% Overhead,” J. Lightw. Technol., vol. 32, no. 10, pp. 1999-2002, 2014.
- [19] W. Idler, F. Buchali, ”Higher-Order Modulation Formats – Concepts and Enabling Devices,” Fibre Optic Communication: Key devices, H. Venghaus, N. Grote, Cham, Switzerland: Springer, 2017, pp. 291-357.
- [20] R. Gerchberg, W. Saxton, ”A practical algorithm for the determination of the phase from image and diffraction plane pictures,” Optik, vol. 35, no. 2, pp. 237–246, 1972.
- [21] H. Chen, H. Huang, N. Fontaine, and R. Ryf, “Phase retrieval with fast convergence employing parallel alternative projections and phase reset for coherent communications,” Opt. Lett., vol. 45, no. 5, 2020.
- [22] G. Rademacher et al., ”High Capacity Transmission With Few-Mode Fibers,” in Journal of Lightwave Technology, vol. 37, no. 2, pp. 425-432, 15 Jan.15, 2019, doi: 10.1109/JLT.2018.2870038.
- [23] “Digital Video Broadcasting (DVB); Second generation framing structure, channel coding and modulation systems for Broadcasting, Interactive Services, News Gathering and other broadband satellite applications (DVB-S2),” European Telecommunications Standards Institute, EN 302307-1, V1.4.1, Nov. 2014.
- [24] N. Stojanovic, Y. Zhao, D. Chang, Z. Xiao, and F. Yu, “Reusing Common Uncoded Experimental Data in Performance Estimation of Different FEC Codes,” IEEE Photon. Technol. Lett., vol. 25, no. 24, pp. 2494-2497, 2013.
- [25] A. Alvarado, T. Fehenberger, B. Chen and F. M. J. Willems, “Achievable Information Rates for Fiber Optics: Applications and Computations,” J. Lightw. Technol., vol. 36, no. 2, pp. 424-439, 2018.
- [26] H. Chen et al., “Space-Time Diversity Phase Retrieval Receiver,” in Proc. Opt. Fiber Commun. Conf., paper Th4D.3, 2021.