Ridge detection for nonstationary multicomponent signals with time-varying wave-shape functions and its applications
Abstract.
We introduce a novel ridge detection algorithm for time-frequency (TF) analysis, particularly tailored for intricate nonstationary time series encompassing multiple non-sinusoidal oscillatory components. The algorithm is rooted in the distinctive geometric patterns that emerge in the TF domain due to such non-sinusoidal oscillations. We term this method shape-adaptive mode decomposition-based multiple harmonic ridge detection (SAMD-MHRD). A swift implementation is available when supplementary information is at hand. We demonstrate the practical utility of SAMD-MHRD through its application to a real-world challenge. We employ it to devise a cutting-edge walking activity detection algorithm, leveraging accelerometer signals from an inertial measurement unit across diverse body locations of a moving subject.
1. Introduction
Applying time-frequency (TF) analysis [15] to study nonstationary time series with multiple oscillatory components has gained significant attention. Unlike traditional time or frequency domain analysis in time series [5], TF analysis follows a ’divide-and-conquer’ approach. By assuming local signal stationarity, we extract and combine information to create the TF representation (TFR)111Other domains like time-scale, ambiguity, and time-lag are beyond our scope here.. TFR is a function in the TF domain, a two-dimensional space with time on the x-axis and frequency on the y-axis.
Guided by this concept, various TF analysis algorithms emerged, ranging from linear methods like short-time Fourier transform (STFT) to nonlinear approaches like synchrosqueezing transform (SST) and its variations [41], along with others [15]. An effective TF analysis algorithm accurately captures signal components across frequencies and times in the TF domain [15], enhancing encoded information use. The TFR, an extension of the Fourier transform, serves this purpose. Researchers manipulate the TFR to achieve tasks like signal decomposition, feature extraction, and denoising after obtaining it. Fig. 1 illustrates the signal processing flowchart. This paper focuses on TFR determined by STFT and SST to avoid distracting from the array of TF analysis algorithms.
An essential TFR manipulation step is ridge detection (RD), involving identifying ridges on the TFR that represent dominant signal traits.222RD can be discussed more generally [16, 35], where ridges are defined as curves on a surface. In this paper, we focus on TF analysis, where ridges are time-parametrized [7, Section III]. Mathematically, a ridge is a sequence of local maxima of the TFR along the time-indexed frequency axis, often visualized as a curve on the TF domain. Under conditions like the adaptive harmonic model [12], capturing multiple oscillatory components with slowly varying amplitude and frequency, ridges correspond to instantaneous frequencies (IF) of different components [13], as shown in Fig. 1. Ridge information enables tasks like component reconstruction [12] for signal decomposition [11] and denoising [8]. Further, phase and IF of each component can inform feature design for machine learning [1, 2].
While the TF analysis-based signal processing flow in Fig. 1 has been utilized for real-world problems extensively, the pivotal step, RD, remains challenging. Literature has witnessed various endeavors to create efficient RD algorithms, encompassing path optimization with regularity constraints [6, 8], its frequency modulation extension [17, 10, 19], Markov chain Monte Carlo (MCMC) [7], weighted ridge reconstruction (WRR) and self-paced ridge reconstruction (SPRR) [43], and a pseudo-Bayesian approach [20]. TFR preprocessing via singular value decomposition (SVD) [29] or TF domain division using the reassignment vector [22, 19] before ridge determination is also viable. Noise impact is commonly mitigated via thresholding. Most RD algorithms, except [7], detect one ridge at a time from the entire TF domain or a portion, obtaining all ridges iteratively, often using a peeling scheme.
Despite successful applications, these algorithms have persistent limitations. All are founded on the assumption of sinusoidal oscillations and often require well-separated IFs; that is, no mode mixing. This assumption is challenged when oscillatory patterns lack sinusoidal traits, yielding multiple harmonics in the TFR. Refer to Fig. 1(b)(e) for an instance involving a non-sinusoidal oscillation depicted in Fig. 1(a) using second-order SST [27]. This pattern is termed the wave-shape function (WSF) [40]. Real-world instances, particularly in biomedicine like the wrist accelerometer signal in Fig. 2, exhibit complex non-sinusoidal WSFs. Even with well-separated IFs of fundamental components, overlapping IFs of their harmonics can arise. See an example in Fig. 3. This challenge, especially relevant to digital health, where diverse biomedical signals with intricate WSFs are prevalent, challenges the applicability of existing RD algorithms. Thus, a tailored RD algorithm is urgently needed.
In this paper, we introduce an innovative RD algorithm tailored for non-sinusoidal WSFs’ influence on the TFR’s geometric structure, and show its outperformance over existing methods. The algorithm’s foundation rests on a fundamental theoretical insight. When WSFs exhibit non-sinusoidal traits, STFT and SST generate a unique geometric pattern in the TFR. This pattern, recurrent along the frequency axis at a fixed time point, is evident in Fig. 1(b)(c), denoted by red arrows. Our approach involves fitting multiple ridges simultaneously to the TFR respecting this structure. When there are multiple oscillatory components, shape-adaptive mode decomposition (SAMD) [11] is applied to replace the commonly applied peeling scheme. We term it SAMD-based multiple harmonic RD (SAMD-MHRD). Incorporating this geometric structure is akin to accessing more ridge information, akin to the principles in cepstrum [28] and de-shape STFT [21]. Fig. 1(g) depicts SAMD-MHRD yielding four harmonic fits. SAMD-MHRD not only offers an accurate RD, but also enhances SAMD’s decomposition capabilities. As a practical application, it aids accurate walking activity detection via accelerometers placed on various body areas, particularly the wrist.

The paper’s structure is as follows: Section 2 introduces the mathematical model for nonstationary multicomponent signals with time-varying WSF. This section also summarizes the application of TF analysis tools in signal processing, discusses RD as a pivotal analysis step, and highlights existing challenges. Section 3 elaborates on the proposed SAMD-MHRD algorithm. Numerical examples are presented in Section 4, while the real-world application is demonstrated in Section 5. Finally, Section 6 contains the discussion and conclusion.
2. Mathematical models and analysis tools
We begin by examining the mathematical model for non-stationary multicomponent signals with time-varying WSFs. Following that, we provide brief overviews of STFT and SST, illustrating their application in diverse signal processing tasks. Subsequently, we identify the constraints of current RD algorithms, which serve as the inspiration for this study.
2.1. Mathematical model
We consider the adaptive non-harmonic model (ANHM) to model a non-sinusoidal oscillatory signal. Fix a small constant and . Assume the signal satisfies
(1) |
where is called the -th intrinsic model type (IMT) function that satisfies
-
(C1)
is a function called the phase function of the -th IMT function. We assume ;
-
(C2)
is the instantaneous frequency (IF) of the -th IMT function so that for all and for some . When , we assume for all and ;
-
(C3)
is a function denoting the amplitude modulation (AM) of the -th IMT function so that for all ;
-
(C4)
is a smooth -period function with mean 0 and unit norm so that , which is called the wave-shape function (WSF) of the -th IMT function;
-
(C5)
is a mean stationary random process in the wide sense with finite variance;
-
(C6)
is a smooth function so that its Fourier transform, , is compactly supported in .
The model’s identifiability is discussed in [8]. A typical example adhering to (1) is the accelerometer signal, with , shown in Fig. 2. Clearly, the IMT function does not oscillate sinusoidally. This model assumes one WSF for each IMT function. In essence, the -th IMT function arises from stretching by , subsequently scaled by . Nevertheless, in practical instances, researchers have observed WSFs not to be fixed [2, 14, 39]. See Fig. 2 for an example. It is evident that the oscillatory pattern varies over time. Notably, both STFT and SST reveal that the strength of the fundamental component (indicated by the red arrow) weakens, while the third, sixth, and seventh harmonics (indicated by purple arrows) strengthen after 225 seconds. As a result, (1) lacks appropriateness as a model.

To properly model such signals, we consider the generalization of (1) in [21]. To motivate this generalization, assume to simplify the discussion. We have
(2) |
where and are determined by Fourier coefficients of . The generalization idea in [21] involves permitting time-varying characteristics for each harmonic in (2), given specific conditions. Thus, consider
(3) |
where is called the harmonic order, and
-
(C7)
behaves like in (C1)-(C2), , and for and a small constant for all ;
-
(C8)
behaves like in (C3), and for all .
Conditions (C7) and (C8) capture the time-varying WSF, assuming slow variation. This model is a simplified version of that in [21] to facilitate discussion. For , is called the -th harmonic of the -th IMT function, and is the fundamental component. When , the -th IMT function oscillates sinusoidally. In this work, we assume the knowledge of and apply [31] to estimate to avoid the distraction. Although there are tools when for all [37, 32, 19, 31], estimating in general is challenging. The slowly varying IF condition (C2) can be extended to a fast varying one [18], which requires more technical details. To stay focused, we will discuss this generalization in future work.
2.2. Analysis by TF analysis tools—a quick summary
Due to the non-stationary nature of ANHM, conventional time series methods may be limited. A common strategy to analyze such signals involves employing TF analysis algorithms [15]. The core notion behind TF analysis is a “divide-and-conquer” approach. Locally assuming a “stable”, “time-invariant”, or “stationary” structure, we estimate this structure accurately in patches. By combining local estimates, we construct a TFR, which enables solving signal processing tasks like AM and IF estimation of each IMT function, signal decomposition into essential components (IMT functions and their harmonics), denoising, and dynamic feature extraction. This paper concentrates on STFT and SST, and we refer readers to [15] for other TF analysis algorithms.
Below, we explain the overall idea. Mathematically, for a function satisfying mild conditions (e.g., a tempered distribution), the STFT is defined as
(4) |
where is the time, is the frequency, and is a chosen window that is smooth and decays sufficiently fast (e.g., the Gaussian function). The spectrogram is , yielding a function on . When , the essential support of should reside in or the spectral interference of different IMT functions might happen. Generally, any TF analysis method’s output is a function on , known as the TFR. We call the TF domain. See Fig. 1(b) for an example, where the input signal follows ANHM with , where the fundamental component is weaker compared to its second harmonic and diminishes over time, as evident in the spectrogram. Nonetheless, STFT encounters the uncertainty principle [30]; that is, the TFR appears ’blurred’ due to the chosen window. To address this, SST [12], a reassignment algorithm [3] variant, and its generalizations like second-order SST [27] were introduced. They utilize the phase information within the complex-valued TFR obtained through STFT to sharpen the STFT-based TFR. Let and denote the original SST [12] and second-order SST [27] TFRs of function , respectively.
To proceed, the conventional procedure entails extracting ridges connected to harmonic IFs. In contrast to STFT, a sharper TFR, as determined by SST, can enhance ridge estimation accuracy [24]. It has been theoretically established that when a signal adheres to ANHM with sinusoidal WSFs, the ridges closely approximate IFs of harmonics of all IMT functions [13, 12]. Once ridges are accurately extracted, IMTs can be reconstructed through a reconstruction formula. Specifically, suppose the extracted ridge approximates IF of in (3) for some and . Then, if does not intersect with and is away from IF of any other harmonics, the following formula holds:
(5) |
where and user-defined bandwidth . This formula yields estimates of through absolute value extraction and through phase unwrapping [1]. With these estimates, further signal processing tasks become feasible. Regrettably, the condition that must not intersect with or approach the IF of other IMT function’s harmonics, as required by (5), is excessively stringent and practically infeasible. Thus, an alternative approach becomes necessary. Various mode retrieval algorithms for STFT exist, such as ridge-based skeleton [6], integration approach [19], basin attractor utilization [22, 23, 19], penalization with detected ridges [7], and others. However, these methods face similar limitations posed by non-sinusoidal WSFs.
Remark.
It is worth noting that the phase can be estimated by cumulatively summing the estimated IF. However, this method is suboptimal due to accumulating discretization errors and the need for a separate initial phase estimation. This approach is only advisable when (5) or other reconstruction formulas cannot be applied.
See Section A for a discussion of the numerical implementation of STFT and SST. Below, we denote the discretized TFR of , either determined by STFT or SST, as , where is the number of the sampling points of the signal with the sampling period , and is the number of the frequency domain ticks, each bin spaced by .
2.3. Ridge detection as a key step—existing algorithms
The key step in the signal processing process outlined in Section 2.2 involves an accurate RD algorithm. RD algorithms can be broadly categorized into three groups. One involves detecting ridges individually and gathering all ridges through an iterative peeling scheme [6, 8, 10]. The second is the multiridge scheme, which detects multiple ridges concurrently [7]. The third is the preprocessing scheme, where the TF domain is segmented to include one ridge per portion or the TFR is enhanced before applying either peeling or multiridge algorithms [19]. Given that our proposed algorithm adheres to the peeling scheme and the focus is handling WSFs, our focus remains on reviewing algorithms within this category.
In the peeling scheme, once we obtain a ridge , where , from , we set iteratively the following masked TFR, :
(8) |
where , is the ridge extracted from , and and is the bandwidth selected by the user. Note that we assume the knowledge of in this work. Practically, the selection of and depends on the TFR and profoundly impacts RD performance. Several methods exist for masking the TFR. For instance, one can opt for a constant frequency bandwidth (CFB) approach, where and remain constant for all [10]. Alternatively, the bandwidth can adapt to noise levels, known as the varying frequency bandwidth (VFB) [10]. Another option involves the modulation-based (MB) approach, which relies on estimating spectral spreading due to chirp [10].
Numerous algorithms exist for fitting a ridge to the (masked) TFR in each iteration. These algorithms share the common goal of approximating a curve within the TFR to capture maximum energy while meeting specific conditions. A frequently used method is to solve the following path optimization problem [6, 8]:
(9) |
where so that for , which is the numerical differentiation of the curve , is the penalty term constraining the regularity of the fit curve , and is a normalization of the matrix . In practice, various approaches to solving the optimization problem (9) exist, including the simulated annealing algorithm [6]. However, we recommend the more efficient penalized forward-backward greedy algorithm, referred to as the single curve extraction algorithm (Single-RD) [6, 8]. Note that we could further enhance regularity by considering second-order numerical differentiation in (9), or even incorporating available noise information [6]. For simplicity, we omit these aspects from the current discussion.
Several other algorithms proposed in [17, 10, 19] could be viewed as solving variations of (9). For example, the frequency modulation (FM) approach proposed in [10] solves such that for some , where is the chirp rate estimate [10, eq. (6)]. This optimization problem could be solved by, for example, the recurring principle [19, eq. (11)], leading to the FM-RD algorithm. In [19], to handle substantial noise, the concept of a relevant ridge portion (RRP) is introduced, giving rise to the RRP-RD algorithm. RRPs correspond to the desired ridges and are employed to segment the TF domain into specific structures known as basins of attraction [23]. The spline least-square approximation is applied to RRPs for RD.
As an example, consider the simplest scenario with only one IMT function ( in (3)) that oscillates sinusoidally ( for all ). In this case, the ridge determined by any of these RD algorithms provides an accurate estimate of the associated fundamental component’s IF [12], followed by other signal processing processes. Nonetheless, this simplest scenario is rarely encountered in practice.
2.4. Challenges of existing approaches
To illustrate the challenges posed by existing RD algorithms, consider scenarios closer to real-world situations. First, when in (3) and all IMT functions oscillate sinusoidally, we can apply any of the aforementioned RD algorithms to estimate the IFs associated with all IMT functions, assuming the IF separation condition in (C2) holds. Second, when in (3) but the WSF significantly deviates from being sinusoidal, we can apply the above RD algorithms based on (C7) to estimate all ridges linked to the harmonics. The ridge associated with the lowest IF can then be assumed as the fundamental component’s IF. In these more realistic scenarios, successfully applying existing RD algorithms becomes more challenging. An example in Fig. 1(f) demonstrates this complexity, where the strength of the fundamental component is weak in the middle while the strength of the second harmonic dominates. Such shifts in strengths can easily confuse existing RD algorithms.
In real-world scenarios, more intricate situations often challenge the aforementioned RD algorithms. When in (3) and the WSFs deviate significantly from sinusoidal patterns, the TFR can become complex with multiple curves representing different harmonics of distinct IMT functions. This complexity involves interference among harmonics from different IMT functions and can lead to mode mixing. For instance, as shown in Fig. 3, two IMT functions exhibit non-sinusoidal WSFs. Here, the second harmonic of the first IMT function (red arrows) intersects with the fundamental component of the second IMT function (blue arrows), making it challenging to accurately determine the IF of the fundamental component for each IMT function. Given that numerous signal processing tasks, such as SAMD [11], rely on accurately acquiring the phase of each IMT function and considering the prevalence of such signals, particularly in biomedical applications, there is a compelling need for a new RD algorithm.

3. Proposed ridge detection algorithm
To address the aforementioned challenge, we introduce a novel RD algorithm within a peeling scheme framework, termed SAMD-based multiple harmonic RD (SAMD-MHRD). This algorithm consists of two main iterative steps. First, it involves fitting ridges for multiple harmonics of a single IMT function, referred to as the multiple harmonic RD (MHRD) component. Subsequently, the algorithm progresses by iteratively “peeling off ridges” linked to the identified harmonics of the IMT function, constituting the SAMD phase. These steps are repeated until all IMT functions have been processed. It is important to acknowledge that each of these steps possesses its own interest and can be employed independently in alternative scenarios. Algorithm 1 provides a comprehensive summary of the SAMD-MHRD procedure. Below, we elaborate on these two steps and the iterative procedure. For the purpose of reproducibility, the Matlab implementation of the proposed algorithm and codes to regenerate results in Section 4 are available at https://github.com/yanweiSu/ridges-detection.
Before detailing the algorithm’s specifics, we highlight three innovations in SAMD-MHRD. First, we utilize the geometric structure of the TFR. To illustrate the idea, suppose and in (C7). Then the STFT of can be approximated by [12]; that is, at a fixed time , the ridges associated with different harmonics are “periodic” in the frequency axis with the period . This underlying periodic structure serves as the fundamental geometric element driving the design of SAMD-MHRD. Secondly, unlike the masking approach in (8), SAMD-MHRD employs the SAMD technique for ridge peeling. The masking technique can be limited in handling non-sinusoidal WSFs due to potential harmonic IF overlaps between different IMT functions. This can lead to error accumulation during iterations. In contrast, SAMD-based peeling effectively addresses these challenges and eliminates the need to determine optimal masking bandwidth. Third, SAMD-MHRD not only identifies ridges for all IMT functions but also decomposes the IMT functions themselves. Additionally, while SAMD aids ridge peeling in each iteration, SAMD-MHRD algorithm concurrently achieves signal decomposition like the original SAMD.
3.1. Step 1: fit multiple ridges for harmonics (MHRD)
Given an input signal that satisfies (3) with , let represent the discretized TFR of , obtained from STFT or SST. Choose as the intended number of harmonics to extract simultaneously. Using the notations from (9), we fit ridges for the harmonics of an IMT function by solving the optimization problem:
(10) | ||||
where , , is a unit vector with , is the smoothness penalty as in (9), and stands for the similarity penalty that captures the time-varying wave-shape condition stated in (C7). Note that forces the estimated IF of the -th harmonic, to satisfy at time , and hence the geometric structure is respected. The output represents the IFs of the first harmonics of an IMT function of the input signal .
We simplify (10) by introducing a set
(11) | ||||
where is the chosen “bandwidth” that echos the similarity penalty in (10), so that (10) becomes
(12) |
Solving (12) directly can be inefficient. We recommend a segmentwise approach. This method leverages the slowly-varying IF’s uniform continuity by approximating the curve with a finite sequence of “thin rectangles” . First, estimate the fundamental IF by running the algorithm on over the whole frequency domain. Then, for each segment , run the algorithm over a frequency band , where is chosen properly to ensure the box covers the ridge. This process determine the fundamental IF over . Note that a smaller reduces the search space and improves efficiency, but it must not be too small to avoid missing the ridge. We suggest using and for all , based on the slowly-varying IF assumption to ensure the ridge is covered. This section of the algorithm is referred to as MHRD and is summarized in Algorithm 2.
3.2. Step 2: Peel off ridges for harmonics by SAMD
After identifying one IMT function’s harmonic ridges using MHRD, we employ the SAMD algorithm [11] to extract the corresponding IMT function ( in Algorithm 1) related to the detected ridges with its fundamental component’s IF. In short, SAMD is an optimization-based fitting algorithm that uses the estimated fundamental component of an IMT function ( in Algorithm 1 using (5) with ) as inputs. The loss function accounts for the time-varying WSF, and its minimizer provides an estimate of the IMT function. See [11] for details.
Subtracting this extracted IMT function from the input signal results in a new signal denoted as , containing IMT functions. The discretized TFR of , determined by STFT or SST, is represented as . Note that ridges related to the extracted IMT function do not exist in . This step thus serves as a replacement for (8).
Having obtained the initial IMT function and , we can proceed to iterate Step 1 (MHRD) on and to extract the harmonic ridges of one of the remaining IMT functions. Then, Step 2 is once again executed to extract the associated IMT function using the SAMD algorithm. This iterative process is repeated for a total of cycles, resulting in the complete set of harmonic ridges for all IMT functions. These initial ridge estimates, denoted as , are ordered based on the fundamental components’ IFs, from low to high.
In practice, it is not guaranteed which IMT function will be extracted in Steps 1 and 2. It is possible that the IF and phase of its associated fundamental component is affected by the harmonics of other IMT functions with lower IF. Thus, with the initial estimate , we repeat Steps 1 and 2 on and by replacing in (12) by
(13) | ||||
and estimating the phase via (5). This ridge estimate is less influenced by harmonics from other IMT functions due to the IF separation condition (C2). We then repeat Steps 1 and 2 on and with modified in the same way as (13) by considering . We continue this process iteratively with and so on until ridges of all IMT functions are extracted, denoted as .
3.3. Iteration
While it is valid to conclude the process after these iterations, it is possible to enhance the results through an additional iteration of the cycles with . This reiteration step has the potential to yield improved outcomes, and we recommend considering this option for times. It is worth noting that the output includes the decomposition of all IMT functions from the input signal .
3.4. Speed up SAMD-MHRD with auxiliary information
Solving the optimization problem (10) can be time-consuming, especially when is large and is wide. However, if the phase of the fundamental component of the targeted IMT function is known or can be well estimated in advance, the algorithm can be accelerated. Let , and be the same as (10), and be the known IF of the fundamental component; that is, the extra auxiliary information. In this case, we only need to estimate the higher-order harmonics, and hence (10) is reduced to
(14) |
where for and . The following proposition shows that the optimization problem (14) can be reduced to several single-valued curve optimization problems, where the proof can be found in Section C.
Proposition 3.1.
This proposition allows us to estimate harmonics’ IFs by solving a modified single curve extraction program with respect to for each ; that is,
(16) |
instead of solving (14) directly. This reduces the complexity of the original problem. This auxiliary information is often available in many applications. For example, when analyzing photoplethysmogram (PPG) and electrocardiogram (ECG) signals, we can use the cardiac phase from the ECG as an accurate phase estimate for the cardiac component in the PPG signal. This allows us to speed up SAMD-MHRD when analyzing the cardiac component in the PPG signal.
3.5. Parameters selection
To optimize the performance of SAMD-MHRD for a given , selecting suitable penalties is crucial. To address this, we propose a simple grid search approach with the following rationale. Assuming that the algorithm accurately detects the ridges corresponding to the true IFs, we can consider masking the extracted curves on the TFR, similar to (8), by appropriately choosing and . To simplify the discussion, we apply the CFB approach [10] below, while VFB and MB can also be considered. The resulting TFR should exhibit reduced signal components. This insight prompts us to employ the -Rényi entropy [4, 33], where , as a metric to quantify and compare the energy distribution along the frequency axis of both the masked and original TFRs.
Consider the parameter set
(17) |
where is small, satisfies , and is a finite set. Note that while looks like a high-dimensional space, it is effectively a one-dimensional space. In this context, the ordering for adheres to the slowly varying condition for . For the parameter required to satisfy the WSF condition in (11), we consider a finite subset .
Take the TFR . For each and , extract the curves by SAMD-MHRD with and , and denote the result to be . Then, “mask” by the ridges following (8), and denote the masked TFR as , where the masking size respects the one used in the reconstruction formula (5). Define a sequence by , . Finally, we choose
(18) |
as the optimal in . In practice, is chosen to range from to and is chosen to range from to , both with a uniform grid in the log scale.
4. Numerical simulation
4.1. Single IMT function ()
In this first example, we compare different RD algorithms on signals with complicated WSF dynamics. Denote the smoothed standard Brownian motion , where is the standard Brownian motion, indicates convolution, and is the Gaussian function with the standard deviation seconds. Over , set
and
where is an independent and identical copy of , are independent white Gaussian process with mean 0 and variance , seconds, and
(19) |
Note that the term models the time-varying WSF. Consider a non-stationary noise , where for , follows an ARMA process with Student’s t4 innovation process, and for , i.i.d. follows a Student’s t5 distribution. We assume is independent of and The simulated signal is defined over with the sampling rate Hz so that
(20) |
where , , is determined by the desired signal-to-noise ratio (SNR), defined as with STD standing for the standard deviation, , models the intensity of the fundamental component, and are independent random variables uniformly distributed on that are independent of and . See Figure 8 in Section B for a realization of .
Then, we realize independently over for times. For a fair comparison with existing algorithms, we utilize the provided code from the authors [10, 19], following their recommended guidelines.333The code for FM-RD is obtained through private communication with the authors. The code for RRP-RD is available at https://github.com/Nils-Laurent/RRP-RD. However, since these algorithms are not tailored to handle WSFs, we need to make minor adjustments and assume the number of harmonics is known. In the case of FM-RD, we incorporate the peeling step (8) as a post-processing procedure. Essentially, we run FM-RD iteratively and apply TFR masking three times. Among the three extracted curves, we identify the one with the lowest frequency value as the result for the fundamental component’s ridge. Regarding RRP-RD, we concurrently detect three relevant ridge portions on the TF domain and extract the corresponding ridges. From these, we choose the ridge associated with the lowest frequency value as the outcome. If RRP-RD is unsuccessful in identifying three ridge portions, we rerun it to detect two RRPs. If this attempt also fails, we proceed with a single RRP. Finally, we assign the ridge with the lowest frequency as the final outcome.
Let be the detected ridge by the algorithm , which could be S, FM, RRP or MH, standing for the Single-RD, FM-RD, RRP-RD or SAMD-MHRD. To compare different algorithms, we consider the following metric:
(21) |
We run the Wilcoxon signed-rank test on realizations to test the null hypothesis that the median of is zero, with a significance level of and Bonferroni correction.
The results are depicted in Fig. 4, illustrating the distributions of for a fixed (rows: , , ) and a constant noise level (columns: noise-free, SNR = 5 dB, SNR = 0 dB) with asterisks indicating instances where . The outcomes reveal that in the absence of noise, all algorithms yield comparable results. However, when encountering scenarios with a weak fundamental component and noise interference, FM-RD and RRP-RD exhibit limitations compared to SAMD-MHRD. More numerical results for RRP-RD with different number of ridge portions can be found in Section B.

4.2. Multiple IMT functions ()
Following the same notations used in Section 4.1, we consider the case when . Denote and as two independent copies of the smoothed standard Brownian motion and are independent of , , , and . Over , set two random processes
and
where . The second simulated signal is defined over with the sampling rate Hz so that
(22) |
where , , models the intensity of the fundamental component, and are the independent copies of and is again determined by the desired SNR. Then we realize 100 copies of for each selected SNR.
See Fig. 5 for the RD results and a decomposition example of SAMD-MHRD. Both IMT functions are successfully recovered. We then run SAMD-MHRD and evaluate the reconstruction error of the -th IMT function as
(23) |
where the superscript indicates the -th iteration. See Fig. 6 for the comparison of and under different SNRs. As SNR decreases, errors increase. Additionally, the second IMT exhibits significantly larger recovery error than the first IMT, as tested with the Wilcoxon test at a significance level of , which is reasonable due to its dependence on the harmonics of the first IMT. Iteration improvement is statistically significant except for the second IMT at high noise levels.


5. Application to walking prediction from IMU
We now apply the proposed algorithm to detect the walking activity from the IMU signal. The data obtained from the triaxial accelerometer signals are denoted as , , and , where loc indicates the sensor’s placement location, and , , and represent the axes. We focus on the physical activity signal, defined as
To detect the walking activity, we modify the ANHM model in (3) with slightly to account for . Specifically, we express it as follows:
(24) |
where , , are disjoint connected intervals in that describes the wax-and-wane effect of walking; that is, a subject might walk for a bit, stop for a while, and continue to walk. Physically, reflects the intensity of the walking activity over , with both strength and pattern exhibiting variations between different steps. The walking activity encompasses a fusion of activities across body locations, resulting in the dependence of on sensor placement and synchronization of across sensors. Even within the same location, , , and can differ between different walking bouts. For instance, the walking pattern on a flat surface during interval might differ from the pattern during stair climbing in interval . This model reduces to (3) with when , and has been used in [42] for cadence estimation.
Finally, we make an assumption regarding the durations of walking intervals. The definition of “walking” lacks universal consensus [38], leading to questions about whether a mere one or two steps qualify as “walking” and what is the number of continuous steps needed to classify an interval as “walking”. While our work does not aim to address this scientific matter, we provide a mathematical definition based on the properties of the TF analysis tools we utilize. Building upon (C1)-(C8), we introduce the following assumption:
-
(C9)
For each , we have for all and a sufficiently large .
That is, we require , as the phase advances by after each cycle. Empirical evidence suggests that, to reliably estimate the IF of an oscillatory signal using STFT or SST, the window function should span around to oscillatory cycles. Therefore, we designate a process as “walking” only if a subject performs more than continuous steps by setting . Note that can vary depending on the walking pace; faster cadence leads to shorter . This definition aligns with the concept of sustained harmonic walking (SHW) presented in [38], where SHW involves walking for at least seconds (in line with C9) with minimal variability in step frequency (in line with C2).
In practice, are unknown. The main signal processing mission we consider in this section is estimating from one realization of the random process (or recorded accelerometer signal).
5.1. Database
We analyze the Indiana University Walking and Driving Study (IUWDS)444https://physionet.org/content/accelerometry-walk-climb-drive/1.0.0/ dataset, collected in 2015 from individuals (19 females) aged 21 to 51 years. This study took place in a semi-controlled environment, where participants were directed to follow a route involving walking on flat ground and climbing stairs. Data was recorded using ActiGraph GT3X+ accelerometers at 100 Hz. Fig. 2 provides an example of this dataset. The study was ethically approved by the IRB of Indiana University, and participants provided informed written consent. The dataset comprises recordings from four locations: wrist (wr), hip (hi), left ankle (la), and right ankle (ra), alongside expert annotations.
5.2. Our proposed index for walking detection
To harness the distinctive properties of WSF, we propose employing SAMD-MHRD for devising a walking detection metric. Denote the extracted IFs of the harmonics by SAMD-MHRD as , with representing the user-specified number of harmonics. We introduce the SST walking strength index (SST-WSI) at time (where ) as follows:
where is the SST of ,
(25) | ||||
is determined by SAMD-MHRD, is the bandwidth chosen by the user, and is a normalization factor. We set so that and are sufficiently separate. An intuitive interpretation of SST-WSI is capturing the ratio of energy associated with the walking activity. Specifically, the extracted curves correspond to the most prominent features in the TFR, effectively capturing instances of walking activity. See Section D.1.1 for a justification of SST-WSI. Since the WSF depends on the subject, sensor location and the walking pattern, needed for the walking activity detection depends on these parameters. To simplify the discussion, we fix an empirical in this work.
5.3. Results
The number of subjects is , with the numbers of walking and non-walking segments in the database and . We consider four existing walking detection indices, including the SHW index, the Entropy Ratio index, the Hilbert transform based index called the Hilbert-WSI, and the index of freezing of gait called FOG-WSI [26]. Details of these indices’ implementation can be found in Section D.1 in the supplementary materials. For the SST-WSI index, we set and Hz.
We use Leave One Subject Out Cross Validation (LOSOCV) to evaluate the performance. In each round, one subject is used as the test case while the others determine the threshold. This threshold is then applied to the test subject, and performance is recorded. Results are shown in Fig. 7. For Hilbert-WSI, some validation results give zero true positives, and we define its F1-score to be 0. We use the Wilcoxon signed-rank test () with Bonferroni correction to assess significance between indices. SST-WSI and Entropy-Ratio are the top-performing indices, with SST-WSI consistently better.
Finally, we combine SST-WSI and Entropy-Ratio using a support vector machine. Fig. 7 presents the LOSOCV results, indicating that this combination outperforms other indices. Additional results for different sensor placements can be found in Section D.2.

6. Discussion and Conclusion
We introduce a novel RD algorithm, SAMD-MHRD, designed for both STFT and SST, to effectively manage nonstationary signals featuring intricate non-sinusoidal WSFs. Numerical investigations and real-world applications showcase its practical promise. This endeavor can be seen as an extension of [40], filling the RD gap toward signal processing missions.
Naturally, one may question its adaptability to other TFRs, such as the commonly used CWT. The CWT’s inherent affine structure can be limiting for oscillatory signals with non-sinusoidal WSFs, as it requires a small scale to capture higher-order harmonics while dealing with broad spectral support, leading to spectral interference among these harmonics. On the other hand, SAMD-MHRD may be applicable to other TF analysis algorithms that can separate harmonics within their TFRs. Further exploration in this area is needed.
Another strategy for handling non-sinusoidal WSFs involves using de-shape STFT [21]. This method aims to mitigate the impact of harmonics in complex situations and could potentially be used to extract the fundamental IF of each IMT function. However, its success relies on the presence of a strong fundamental component. In cases where the fundamental component is weak or absent, preprocessing steps like rectification or nonlinear transforms [36], or the application of a comb filter as explored in [38], might be necessary to enhance the fundamental IF before applying de-shape STFT. Note that the fundamental component can be absent since , where and is -periodic if the greatest common divisor of is . We plan to explore the development of an RD algorithm based on this concept in our future work.
Several challenges and open questions remain in this study. Firstly, we assume knowledge of to simplify the analysis. Extending existing tools [37, 32, 19, 31] to handle more general WSF cases is a promising avenue. Secondly, our assumption that the IFs of fundamental components do not overlap needs refinement to address the common scenario of mode mixing. Extending the RD algorithm to accommodate the synchrosqueezed chirplet transform [9] could be a solution for this challenge. The walking detection problem is essentially a change point detection problem. We can potentially generalize the existing bootstrapping-based change point detection algorithm by integrating information related to IF and IA and techniques from [31] to yield a more accurate index. These promising directions will be explored in our future work.
Acknowledgement
The authors would like to thank the anonymous reviewers for their constructive feedbacks.
References
- [1] Aymen Alian, Yu-Lun Lo, Kirk Shelley, and Hau-Tieng Wu. Reconsider phase reconstruction in signals with dynamic periodicity from the modern signal processing perspective. Foundations of Data Science, 4(3):355–393, 2022.
- [2] Aymen Alian, Kirk Shelley, and Hau-Tieng Wu. Amplitude and phase measurements from harmonic analysis may lead to new physiologic insights: lower body negative pressure photoplethysmographic waveforms as an example. J. Clin. Monit. Comput., 37(1):127–137, 2023.
- [3] François Auger and Patrick Flandrin. Improving the readability of time-frequency and time-scale representations by the reassignment method. IEEE Trans. Signal Process, 43(5):1068–1089, 1995.
- [4] Richard G Baraniuk, Patrick Flandrin, Augustus JEM Janssen, and Olivier JJ Michel. Measuring time-frequency information content using the Rényi entropies. IEEE Trans. Inf. Theory, 47(4):1391–1409, 2001.
- [5] Peter J Brockwell and Richard A Davis. Time series: theory and methods. Springer science & business media, 2009.
- [6] René A Carmona, Wen L Hwang, and Bruno Torrésani. Characterization of signals by the ridges of their wavelet transforms. IEEE Trans. Signal Process, 45(10):2586–2590, 1997.
- [7] René A Carmona, Wen L Hwang, and Bruno Torrésani. Multiridge detection and time-frequency reconstruction. IEEE Trans. Signal Process, 47(2):480–492, 1999.
- [8] Yu-Chun Chen, Ming-Yen Cheng, and Hau-Tieng Wu. Non-parametric and adaptive modelling of dynamic periodicity and trend with heteroscedastic and dependent errors. J. R. Stat. Soc. Ser. B. Stat. Methodol., 76(3):651–682, 2014.
- [9] Ziyu Chen and Hau-Tieng Wu. Disentangling modes with crossover instantaneous frequencies by synchrosqueezed chirplet transforms, from theory to application. Appl. Comput. Harmon. Anal., 62:84–122, 2023.
- [10] Marcelo A. Colominas, Sylvain Meignen, and Duong-Hung Pham. Fully adaptive ridge detection based on stft phase information. IEEE Signal Processing Letters, 27:620–624, 2020.
- [11] Marcelo A Colominas and Hau-Tieng Wu. Decomposing non-stationary signals with time-varying wave-shape functions. IEEE Trans. Signal Process, 69:5094–5104, 2021.
- [12] Ingrid Daubechies, Jianfeng Lu, and Hau-Tieng Wu. Synchrosqueezed wavelet transforms: an empirical mode decomposition-like tool. Appl. Comput. Harmon. Anal., 30(2):243–261, 2011.
- [13] Nathalie Delprat, Bernard Escudié, Philippe Guillemain, Richard Kronland-Martinet, Philippe Tchamitchian, and Bruno Torresani. Asymptotic wavelet and gabor analysis: Extraction of instantaneous frequencies. IEEE Trans. Inf. Theory, 38(2):644–664, 1992.
- [14] Anna-Maria Eid, Mohamed Elgamal, Antonio Gonzalez-Fiol, Kirk H Shelley, Hau-Tieng Wu, and Aymen Awad Alian. Using the ear photoplethysmographic waveform as an early indicator of central hypovolemia in healthy volunteers utilizing lbnp induced hypovolemia model. Physiol. Meas., 2023.
- [15] Patrick Flandrin. Time-frequency/time-scale analysis. Academic press, 1998.
- [16] Peter Hall, Wei Qian, and DM Titterington. Ridge finding from noisy data. Journal of Computational and Graphical Statistics, 1(3):197–211, 1992.
- [17] D. Iatsenko, P.V.E. McClintock, and A. Stefanovska. Extraction of instantaneous frequencies from ridges in time–frequency representations of signals. Signal Processing, 125:290–303, 2016.
- [18] Matthieu Kowalski, Adrien Meynard, and Hau-tieng Wu. Convex optimization approach to signals with fast varying instantaneous frequency. Applied and computational harmonic analysis, 44(1):89–122, 2018.
- [19] Nils Laurent and Sylvain Meignen. A novel ridge detector for nonstationary multicomponent signals: Development and application to robust mode retrieval. IEEE Trans. Signal Process, 69:3325–3336, 2021.
- [20] Quentin Legros and Dominique Fourer. A novel pseudo-bayesian approach for robust multi-ridge detection and mode retrieval. In 2021 29th European Signal Processing Conference (EUSIPCO), pages 1925–1929, 2021.
- [21] Chen-Yun Lin, Li Su, and Hau-Tieng Wu. Wave-shape function analysis–when cepstrum meets time-frequency analysis. J. Fourier Anal. Appl., 24(2):451–505, 2018.
- [22] S. Meignen, T. Gardner, and T. Oberlin. Time-frequency ridge analysis based on the reassignment vector. In 2015 23rd European Signal Processing Conference (EUSIPCO), pages 1486–1490, 2015.
- [23] Sylvain Meignen, Thomas Oberlin, Philippe Depalle, Patrick Flandrin, and Stephen McLaughlin. Adaptive multimode signal reconstruction from time–frequency representations. Philos. Trans. R. Soc. A-Math. Phys. Eng. Sci., 374(2065):20150205, 2016.
- [24] Sylvain Meignen, Duong-Hung Pham, and Stephen McLaughlin. On demodulation, ridge detection, and synchrosqueezing for multicomponent signals. IEEE Trans. Signal Process, 65(8):2093–2103, 2017.
- [25] Adrien Meynard and Hau-Tieng Wu. An efficient forecasting approach to reduce boundary effects in real-time time-frequency analysis. IEEE Transactions on Signal Processing, 69:1653–1663, 2021.
- [26] Steven T Moore, Hamish G MacDougall, and William G Ondo. Ambulatory monitoring of freezing of gait in parkinson’s disease. Journal of neuroscience methods, 167(2):340–348, 2008.
- [27] Thomas Oberlin, Sylvain Meignen, and Valérie Perrier. Second-order synchrosqueezing transform or invertible reassignment? towards ideal time-frequency representations. IEEE Trans. Signal Process, 63(5):1335–1344, 2015.
- [28] Alan V. Oppenheim and Ronald W. Schafer. From frequency to quefrency: A history of the cepstrum. IEEE Signal Processing Magazine, 21(5):95–106, 2004.
- [29] N. Ozkurt and F.A. Savaci. Determination of wavelet ridges of nonstationary signals by singular value decomposition. IEEE Trans. Circuits Syst. II, 52(8):480–485, 2005.
- [30] Benjamin Ricaud and Bruno Torrésani. A survey of uncertainty principles and some signal processing applications. Advances in Computational Mathematics, 40:629–650, 2014.
- [31] Joaquin Ruiz and Marcelo A Colominas. Wave-shape function model order estimation by trigonometric regression. Signal Processing, 197:108543, 2022.
- [32] N. Saulig, N. Pustelnik, P. Borgnat, P. Flandrin, and V. and Sucic. Instantaneous counting of components in nonstationary signals. In Proc. 21st Eur. Signal Process. Conf. 2013, pages 1–5, 2013.
- [33] Yae-Lin Sheu, Liang-Yan Hsu, Pi-Tai Chou, and Hau-Tieng Wu. Entropy-based time-varying window width selection for nonlinear-type time–frequency analysis. Int. J. Data. Sci. Anal., 3:231–245, 2017.
- [34] Matt Sourisseau, Hau-Tieng Wu, and Zhou Zhou. Asymptotic analysis of synchrosqueezing transform—toward statistical inference with nonlinear-type time-frequency analysis. The Annals of Statistics, 50(5):2694–2712, 2022.
- [35] Carsten Steger. An unbiased detector of curvilinear structures. IEEE Transactions on pattern analysis and machine intelligence, 20(2):113–125, 1998.
- [36] Stefan Steinerberger and Hau-Tieng Wu. Fundamental component enhancement via adaptive nonlinear activation functions. Appl. Comput. Harmon. Anal., 2022.
- [37] V. Sucic, N. Saulig, and B. Boashash. Estimating the number of components of a multicomponent nonstationary signal using the short-term time-frequency Rényi entropy. EURASIP Journal on Advances in Signal Processing, 2011(1):1–11, 2011.
- [38] Jacek K Urbanek, Vadim Zipunnikov, Tamara Harris, William Fadel, Nancy Glynn, Annemarie Koster, Paolo Caserotti, Ciprian Crainiceanu, and Jaroslaw Harezlak. Prediction of sustained harmonic walking in the free-living environment using raw accelerometry data. Physiol. Meas., 39(2):02NT02, 2018-02-28.
- [39] Shen-Chih Wang, Chien-Kun Ting, Cheng-Yen Chen, Chinsu Liu, Niang-Cheng Lin, Che-Chuan Loong, Hau-Tieng Wu, and Yu-Ting Lin. Arterial blood pressure waveform in liver transplant surgery possesses variability of morphology reflecting recipients’ acuity and predicting short term outcomes. J. Clin. Monit. Comput., pages 1–11, 2023.
- [40] H.-T. Wu. Instantaneous frequency and wave shape functions (I). Appl. Comput. Harmon. Anal., 35:181–199, 2013.
- [41] Hau-Tieng Wu. Current state of nonlinear-type time–frequency analysis and applications to high-frequency biomedical signals. Current Opinion in Systems Biology, 23:8–21, 2020.
- [42] Hau-Tieng Wu and Jaroslaw Harezlak. Application of de-shape synchrosqueezing to estimate gait cadence from a single-sensor accelerometer placed in different body locations. Physiol. Meas., 44(5):055009, 2023.
- [43] Xiangxiang Zhu, Zhuosheng Zhang, Jinghuai Gao, and Wenting Li. Two robust approaches to multicomponent signal reconstruction from stft ridges. Mechanical Systems and Signal Processing, 115:720–735, 2019.
Appendix A Numerical implementation of TF analysis tools
We summarize the numerical implementation of STFT and SST in this section. Suppose the signal is discretized with a sampling rate over the time interval , where , denoted as ; that is, with , where means the largest integer smaller or equal to . Denote as the discretization grid in the time domain. Denote as the frequency resolution chosen by user. Then, the discretization of STFT of is given by
(26) |
where is the total number of samples in the time domain and , where is the total number of ticks in the frequency domain. In other words, the discretization of the TFR determined by STFT is a complex matrix . Note that we skip evaluating STFT at frequency .
Numerically, this discretization is carried out in the following way. Discretize the window and denote it as , where and indicates the window length. A concrete example is discretizing the Gaussian with the standard deviation over the interval with a discretization grid size of as the window by setting
(27) |
for . Note that the discretization grid size may be different from . As the rule of thumb, we choose so that the window covers about cycles if the data is not too noisy, and choose a larger so that the window covers more, like cycles if the data is noisy. Usually, if the Gaussian function is taken as the window, we suggest to choose so that the Gaussian function is “essentially supported” on ; that is, is close to . We mention that it is possible to a time-dependent to enhance the overall performance. For example, take the Reńyi entropy as a metric to determine the optimal [33]. We skip this detail and refer readers to [33] for details. To implement , extend beyond so that when or . It is possible to extend by forecasting to avoid the boundary effect [25], but we skip this technical detail to simplify the discussion. Then, compute
(28) |
where is the index in the time domain and is the index in the frequency domain.
The numerical implementation of SST is denoted as , which is a nonlinear transform of STFT. Here we focus on its numerical implementation and refer readers to [12, 8] for its theory in the continuous setup. The main idea beyond SST is utilizing the phase information hidden in STFT. Consider the discretization of the derivative of the window function , which is denoted as . In the Gaussian window case, it is defined as
Then, compute another STFT of with the window , denoted as , by
(29) |
where is the index in the time domain and is the index in the frequency domain. Next, compute the frequency reassignment operator, denoted as , which is defined as
(32) |
where is the operator of evaluating the imaginary part and is a threshold chosen by the user. In practice, is set to avoid possible blowup situation that and is large, and can be set to times of the machine epsilon. gives the instantaneous frequency information of some IMT inside the signal at time and frequency . See [12, 8, 34] for more technical details. Finally, the SST of is computed by
(33) |
where is the index in the time domain and is the index in the frequency domain. In short, to detect if the input signal oscillates at frequency , we find all STFT coefficients sharing the same instantaneous frequency information by searching over , sum them together, and put the result in the -th entry of the -th row of . This step is called “squeezing”. Since the squeezing happens at a fixed time, this motivated the “synchro” part of the nomination SST.
In this paper, we denote the discretized TFR of , either determined by STFT or SST, as .
Appendix B More details of numerical simulation
In this section, we provide more details of numerical simulation in Section 4. A realization of the simulated signal with a signal-to-noise ratio of 0 dB, as discussed in Section IV.A, is shown in Figure 8. For RRP-RD, a key parameter is the number of ridge portions, set to in the main article. However, noise can cause ridges to split into different ridge portions. We extended the simulation to test various numbers of ridge portions, with results shown in Figure 9. We used Wilcoxon’s signed-rank test with a significance level of and Bonferroni correction to compare RRP-RD performance across different ridge portions. The results show that performance with 2 ridge portions is significantly lower than with or . Increasing from to ridge portions slightly worsens performance, particularly when is small. In some cases the difference is significant.


Appendix C Proof of Proposition 3.1
Proof.
Let so that its -th row is . Then,
where is a unit vector with . Thus,
(34) |
Since is a solution to (14), it follows that . ∎
Appendix D More details for walking prediction from IMU
D.1. Existing indices for walking detection
For a fair comparison, we consider several existing walking detection indices. In [38], the proposed method is to utilize the comb filters to detect the SHW state. The idea is to use a set of “comb teeth functions” on the spectrogram to capture the most possible fundamental step-to-step frequency that has the maximum energy allocation on its multiples. A comb teeth function is determined by its fundamental frequency , where is a set of candidate fundamental frequencies. With the help of a comb filter, they define a function , , where is the possible fundamental frequency and is time, which describes the strength of the periodic content of the accelerometry signal. Finally, the threshold that decides whether there is a SHW or not is determined by the density function of for all SHW and non-SHW periods for all subjects. At the end, the SHW index is given by
where means the prediction state is SHW and means non-SHW. For each given accelerometry signal, the STFT is computed using the Hanning window of length 10 seconds, the number of the harmonics is set to be 6, and the range of the fundamental frequency was set to be 1.2 Hz to 4.0 Hz. The kernel density estimation is applied with the normal kernel function to give the estimated densities. We implement the SHW index [38] following all parameters suggested in [38] by choosing the number of harmonics , Hanning window with window length seconds for the STFT, and candidates for the step-to-step frequency to generate the statistics .
Another common idea to detect walking activity is using the entropy ratio. This idea is similar to that used in the parameter selection in Section 3.5. After extracting the IF curves from the SST , we construct a masked TFR following (8), and compute the sequence . Similarly, we define an entropy sequence associated with . The pointwise ratio of these two sequences tells whether the harmonics exist at certain time segment or not. Hence, the Entropy Ratio index is defined as
(35) |
The notation is the median filter with a seconds bandwidth. Intuitively, the index should be small if is during a walking period, since the frequency concentration measurement of that time on the SST should be small, and the one for the curves-removed TFR, , should be relatively large. On the contrary, if the state is not in a walking period, then is expected to be close to 1. To implement the Entropy Ratio index, with the IF curve , we construct with a bandwidth of Hz and take .
The next one is based on a bandpass filter to extract the possible walking pattern via the Hilbert transform. The ratio of the energy between 0.5 and 3 Hz and the energy between 0.3 and 8 Hz is called the Hilbert-WSI. This index comes from capturing the energy associated with the fundamental frequency, and the spectral range is chosen to cover the common momentary speed of stride, which is suitable for our data. Another index used in evaluating the freezing of gait (FOG) in patients with Parkinson’s disease is defined as the ratio of the energy between 0.5 and 3 Hz and the energy between 3 and 8 Hz [26]. The index is called iFOG in [26], but for the purpose of unifying the notation, we call it FOG-WSI. Note that while FOG-WSI was not designed for the walking detection, its design can potentially be used for walking detection, and we considered it in this work.
D.1.1. Justification of SST-WSI
It is important to note that SST-WSI’s design accounts for the expansion (2), and the influence of characterizes the walking strength at time . Indeed, when and , where is from the slowly varying assumption for and , by [12, 8] we have for ,
(36) |
is the reconstruction of the -th harmonic when . Thus, the SST-WSI evaluates directly the walking strength since
which is the strength of the walking activity at time .
D.2. More results
The comparison of various walking detection indices when the accelerometer signal is recorded from the left ankle (right ankle and hip, respectively) is shown in Figure 10 (Figures 11 and 12, respectively). Note that in the HIP signal experiment, all the validation results of the Hilbert-WSI group has zero True-Positive rate, which leads to F1 (marked by the red frame).


