On Self-Adaptive Perception Loss Function for Sequential Lossy Compression
Abstract
We consider causal, low-latency, sequential lossy compression, with mean squared-error (MSE) as the distortion loss, and a perception loss function (PLF) to enhance the realism of reconstructions. As the main contribution, we propose and analyze a new PLF that considers the joint distribution between the current source frame and the previous reconstructions. We establish the theoretical rate-distortion-perception function for first-order Markov sources and analyze the Gaussian model in detail. From a qualitative perspective, the proposed metric can simultaneously avoid the error-permanence phenomenon and also better exploit the temporal correlation between high-quality reconstructions. The proposed metric is referred to as self-adaptive perception loss function (PLF-SA), as its behavior adapts to the quality of reconstructed frames. We provide a detailed comparison of the proposed perception loss function with previous approaches through both information theoretic analysis as well as experiments involving moving MNIST and UVG datasets.
For first-order Markov sources, let the information RDP region, denoted by , be the set of all tuples which satisfy the following
{IEEEeqnarray}rCl
R_1 &≥ I(X_1;X_r,1),
R_2 ≥ I(X_2;X_r,2—X_r,1),
R_3 ≥ (X_3;X_r,3—X_r,1,X_r,2)
D_j ≥ E[∥X_j-^X_j∥^2],
P_j≥ ϕ_j(P_^X_1…^X_j-1 X_j, P_^X_1…^X_j-1^X_j), j=1,2,3,
for auxiliary random variables and satisfying the following
{IEEEeqnarray}rCl
&^X_1= η_1(X_r,1), ^X_2=η_2(X_r,1,X_r,2), ^X_3=X_r,3,
X