
A Hybrid Model and Learning-Based Adaptive Navigation Filter

Barak Or, Member, IEEE, and Itzik Klein, Senior Member, IEEE. Submitted: April 2022. First revision: June 2022. Second revision: July 2022. Accepted to IEEE TIM: August 2022. This is a preprint version. Barak Or and Itzik Klein are with the Department of Marine Technologies, Charney School of Marine Science, University of Haifa, Haifa, 3498838, Israel (e-mail: [email protected]; [email protected]).
Abstract

The fusion between an inertial navigation system and global navigation satellite systems is regularly used in many platforms, such as drones, land vehicles, and marine vessels. The fusion is commonly carried out in a model-based extended Kalman filter framework. One of the critical parameters of the filter is the process noise covariance. It is responsible for the real-time solution accuracy, as it accounts for both the vehicle dynamics uncertainty and the inertial sensors' quality. In most situations, the process noise covariance is assumed to be constant. Yet, due to vehicle dynamics and sensor measurement variations throughout the trajectory, the process noise covariance is subject to change. To cope with such situations, several adaptive model-based Kalman filters were suggested in the literature. In this paper, we propose a hybrid model and learning-based adaptive navigation filter. We rely on the model-based Kalman filter and design a deep neural network model to tune the momentary system noise covariance, based only on the inertial sensor readings. Once the process noise covariance is learned, it is plugged into the well-established model-based Kalman filter. After deriving the proposed hybrid framework, field experiment results using a quadrotor are presented and a comparison to model-based adaptive approaches is given. We show that the proposed method obtained an improvement of 25% in the position error. Furthermore, the proposed hybrid learning method can be used in any navigation filter and also in any relevant estimation problem.

Index Terms:
Adaptive Algorithm, Deep Neural Network, Global Navigation Satellite System, Inertial Measurement Unit, Inertial Navigation System, Kalman Filter, Machine Learning, Quadcopter, Supervised Learning, Unmanned Autonomous Vehicles, Vehicle Tracking.

I Introduction

Autonomous vehicles, such as autonomous underwater vehicles (AUVs) or quadrotors, are commonly equipped with an inertial navigation system (INS) and other sensors [1] to provide real-time information about their position, velocity, and orientation [2, 3, 4, 5, 6, 7]. The INS has two types of inertial sensors, namely gyroscopes and accelerometers. The former measure the angular velocity vector, and the latter measure the specific force vector. The inertial sensors are combined in an inertial measurement unit (IMU). Their use for navigation is challenging due to their high noise levels, which accumulate and lead to a drift of the navigation solution over time. To solve this problem, an external aiding sensor, such as a global navigation satellite system (GNSS) receiver, is regularly used in a navigation filter [2, 3, 6]. Such fusion is carried out using a nonlinear filter, where the error-state implementation of the extended Kalman filter (es-EKF) is usually employed [8, 1]. In the filter, the process noise covariance matrix is based on the IMU measurement characteristics and/or the vehicle dynamics, while the external aiding measurement specifications determine the measurement noise covariance matrix. These two covariance matrices have a major influence on the filter performance. A common practice is to assume that both covariances are fixed during the vehicle operation. However, as the sensor characteristics are subject to change during operation and physical constraints do not allow capturing the vehicle dynamics optimally, these covariances should vary over time, as the amount of uncertainty varies and is unknown. Thus, tuning the process or measurement noise covariances optimally can lead to a significant improvement in the filter performance [8].

To cope with this problem, several model-based attempts have been made in the literature to develop optimal adaptive filters [9, 10, 11, 12, 13, 14, 15]. The most common among them is based on the filter innovation process [16], where a measure of the new information is calculated at every iteration, leading to an update of the process noise covariance matrix. Still, the question of the optimal approach to tune the system noise covariance matrix is considered open and is addressed in detail in [17].

Recently, deep neural network (DNN) approaches were integrated into model-based pedestrian dead-reckoning (PDR) algorithms. One of the initial works in the field is the robust IMU double integration (RIDI) approach [18], where neural networks were trained to regress linear velocities from inertial sensors to constrain the accelerometer readings. Although it was the first work in the field, double integration is still needed and the error accumulates rapidly. In [19], the device orientation, in addition to the accelerometer and gyroscope readings, was also used as input to a DNN architecture to regress the user velocity in 2D. The velocity is then integrated to obtain the user position. Later, PDRNet [20] utilized a smartphone location recognition classification network followed by a change-of-heading and distance regression network to estimate the user's 2D position. One of the main challenges in model- or learning-based PDR is the estimation of the user walking direction, as the device is not always aligned with the user motion. To cope with that, in [21] a novel DNN structure was designed for extracting the motion vector in the device coordinates, using accelerometer readings. More pedestrian inertial navigation examples are described in [22], where recent databases, methods, and real-time inferences can be found.

In AUV navigation, an end-to-end DNN approach was proposed to regress missing Doppler velocity log (DVL) beam measurements to provide the AUV velocity vector, applicable only when a single beam is missing [23]. Later, in [24], a DNN approach was used to address DVL failure scenarios by predicting the DVL output.

For land vehicle navigation, a DNN model named RINS-W [25] was presented to overcome the inaccurate dynamics and observation models of wheel odometry for localization. This novel approach exploited DNNs to identify patterns in wheeled vehicle motion using the IMU sensor only. To overcome the navigation solution drift in pure inertial quadrotor navigation, QuadNet, a hybrid framework to estimate the quadrotor's three-dimensional position vector at any user-defined time rate, was proposed [26].

Focusing on the estimation process, a self-learning square-root cubature Kalman filter was proposed [27]. This hybrid navigation filter enhanced the navigation accuracy by providing observations continuously, even in GNSS-denied environments, by learning transfer rules of internal signals in the filter. In [28], recurrent neural networks are employed to learn the vehicle's geometrical and kinematic features to regress the process noise covariance in a linear Kalman filter framework. In [29], DNN-based multiple models (triggered by traffic condition classification) combined with an EKF were proposed to cope with GNSS outages.

Recently, DNN models were also proposed to cope with GNSS outages, mainly for unmanned air vehicles (UAV). In [30], a convolutional neural network model was used to denoise gyro measurements for open-loop attitude estimation. The authors obtained state-of-the-art navigation performance in terms of attitude estimation, compensating for gyro measurement errors as part of a strapdown integration approach. This trend of integrating learning approaches in various fields and applications also raises the motivation to adopt such approaches in the described problem.

In this paper, we propose a hybrid model and learning-based adaptive navigation filter. We rely on the model-based es-EKF; yet, instead of using model-based adaptive process noise tuning, we design a DNN to tune the momentary system noise covariance matrix, based only on the inertial sensor readings. Once the process noise covariance is learned, it is plugged into the well-established, model-based es-EKF. To that end, we simulated many vehicle trajectories based on six baseline trajectories to create a rich dataset containing diverse dynamic scenarios with many motion patterns and different IMU noise covariances, enabling robustness to motion dynamics and IMU noise characteristics. Based on this dataset, we adopt a supervised learning (SL) approach to regress the process noise covariance.

The main contributions of this paper are:

  1.

    Derivation of an adaptive hybrid learning algorithm to momentarily determine the process noise covariance matrix as a function of the IMU measurements. That is, instead of using model-based approaches (as has been done over the last 50 years), to the best of our knowledge, we are the first to propose an adaptive hybrid learning algorithm.

  2.

    Online integration of the proposed hybrid learning algorithm with es-EKF implementation of the navigation filter. In that manner, the proposed approach can be used with any external sensor aiding the INS.

  3.

    The proposed approach principles can be further applied to any estimation problem that requires adaptive tuning of the process noise covariance.

The advantage of the proposed approach lies in its hybrid fusion of the well-established es-EKF with learning approaches applied at the es-EKF soft spot: the adaptive determination of the process noise covariance. By using learning approaches, we leverage their ability to generalize intrinsic properties appearing in sequential datasets and, therefore, to better cope with varying conditions affecting the process noise values.

After deriving and validating the proposed framework using the simulated dataset, field experiments using a quadrotor were conducted to evaluate the robustness and performance of the proposed approach in the testing phase and to compare it to other adaptive model-based approaches. The recordings from the field experiment are used as a test dataset and also to examine our network's capability for generalization. In our setup, we used GNSS position measurements to update the INS; however, the proposed approach is suitable for any aiding sensor and for any platform.

The rest of the paper is organized as follows: Section II presents the problem formulation for the INS/GNSS fusion in an es-EKF implementation with constant and adaptive process noise covariance. Section III presents our proposed hybrid adaptive es-EKF. Section IV gives the results, and Section V presents the conclusions of this study.

II Adaptive Navigation Filter

The nonlinear nature of the INS equations requires a nonlinear filter. The most common filter for fusing the INS with external aiding sensors is the es-EKF [1], here with a 15 error-state implementation:

\delta\mathbf{x}=\left[\delta\mathbf{p}^{n}\;\;\delta\mathbf{v}^{n}\;\;\delta\boldsymbol{\varepsilon}^{n}\;\;\mathbf{b}_{a}\;\;\mathbf{b}_{g}\right]^{T}\in\mathbb{R}^{15\times 1}, (1)

where $\delta\mathbf{p}^{n}\in\mathbb{R}^{3\times 1}$ is the position error-state vector expressed in the navigation frame, $\delta\mathbf{v}^{n}\in\mathbb{R}^{3\times 1}$ is the velocity error-state vector expressed in the navigation frame, $\delta\boldsymbol{\varepsilon}^{n}\in\mathbb{R}^{3\times 1}$ is the misalignment vector state expressed in the navigation frame, $\mathbf{b}_{a}\in\mathbb{R}^{3\times 1}$ is the accelerometer bias residual vector state expressed in the body frame, and $\mathbf{b}_{g}\in\mathbb{R}^{3\times 1}$ is the gyro bias residual vector state expressed in the body frame. The navigation frame is denoted by $n$ and the body frame is denoted by $b$. The navigation frame center is located at the body center of mass, where the $x$-axis points to the geodetic north, the $z$-axis points down parallel to the local vertical, and the $y$-axis completes a right-handed orthogonal frame. The body frame center is located at the center of mass, the $x$-axis is parallel to the longitudinal axis of symmetry of the vehicle pointing forwards, the $y$-axis points right, and the $z$-axis points down such that it forms a right-handed orthogonal frame. The linearized, error-state, continuous-time model is

\delta\dot{\mathbf{x}}=\mathbf{F}\,\delta\mathbf{x}+\mathbf{G}\,\delta\mathbf{w}, (2)

where $\mathbf{F}\in\mathbb{R}^{15\times 15}$ is the system matrix, $\mathbf{G}\in\mathbb{R}^{15\times 12}$ is the shaping matrix, and $\delta\mathbf{w}=\left[\mathbf{w}_{a}\;\;\mathbf{w}_{g}\;\;\mathbf{w}_{ab}\;\;\mathbf{w}_{gb}\right]^{T}\in\mathbb{R}^{12\times 1}$ is the system noise vector consisting of the accelerometer, gyro, and their bias random walk noises, respectively [8]. The system matrix, $\mathbf{F}$, is given by

\mathbf{F}=\left[\begin{array}{ccccc}\mathbf{F}_{pp}&\mathbf{F}_{pv}&\mathbf{F}_{p\varepsilon}&\mathbf{0}_{3\times 3}&\mathbf{0}_{3\times 3}\\ \mathbf{F}_{vp}&\mathbf{F}_{vv}&\mathbf{F}_{v\varepsilon}&-\mathbf{T}_{b}^{n}&\mathbf{0}_{3\times 3}\\ \mathbf{F}_{\varepsilon p}&\mathbf{F}_{\varepsilon v}&\mathbf{F}_{\varepsilon\varepsilon}&\mathbf{0}_{3\times 3}&\mathbf{T}_{b}^{n}\\ \mathbf{0}_{3\times 3}&\mathbf{0}_{3\times 3}&\mathbf{0}_{3\times 3}&\mathbf{0}_{3\times 3}&\mathbf{0}_{3\times 3}\\ \mathbf{0}_{3\times 3}&\mathbf{0}_{3\times 3}&\mathbf{0}_{3\times 3}&\mathbf{0}_{3\times 3}&\mathbf{0}_{3\times 3}\end{array}\right], (3)

where $\mathbf{T}_{b}^{n}$ is the transformation matrix between the body and navigation frames and the sub-matrices $\mathbf{F}_{ij}\in\mathbb{R}^{3\times 3}$ can be found explicitly in the literature (see [1, 31, 32]). The shaping matrix is given by

\mathbf{G}=\left[\begin{array}{cccc}\mathbf{T}_{b}^{n}&\mathbf{0}_{3\times 3}&\mathbf{0}_{3\times 3}&\mathbf{0}_{3\times 3}\\ \mathbf{0}_{3\times 3}&-\mathbf{T}_{b}^{n}&\mathbf{0}_{3\times 3}&\mathbf{0}_{3\times 3}\\ \mathbf{0}_{3\times 3}&\mathbf{0}_{3\times 3}&\mathbf{0}_{3\times 3}&\mathbf{0}_{3\times 3}\\ \mathbf{0}_{3\times 3}&\mathbf{0}_{3\times 3}&\mathbf{I}_{3\times 3}&\mathbf{0}_{3\times 3}\\ \mathbf{0}_{3\times 3}&\mathbf{0}_{3\times 3}&\mathbf{0}_{3\times 3}&\mathbf{I}_{3\times 3}\end{array}\right], (4)

where $\mathbf{I}_{3\times 3}$ is an identity matrix.
The corresponding discrete version of the navigation model (2) (for small step sizes) is

\delta\mathbf{x}_{k+1}=\mathbf{\Phi}_{k}\,\delta\mathbf{x}_{k}+\mathbf{G}_{k}\,\delta\mathbf{w}_{k}. (5)

The transition matrix, $\mathbf{\Phi}_{k}$, is defined by a first-order approximation as

\mathbf{\Phi}_{k}\triangleq\mathbf{I}+\mathbf{F}\Delta t, (6)

where $k$ is a time index, $\Delta t$ is the time step size, $\mathbf{F}$ is the system matrix (obtained using (3) with the estimated state vector at time $k$), and $\delta\mathbf{w}_{k}$ is a zero mean white Gaussian noise vector. The discretized process noise is given by

\mathbf{Q}_{k}^{d}=\mathbf{G}\,\mathbf{Q}^{c}\,\mathbf{G}^{T}\Delta t, (7)

where $\mathbf{G}$ is defined in (4) (using the current estimated state vector at time $k$), and $\mathbf{Q}^{c}$ is the continuous process noise covariance matrix.
The discrete es-EKF, as described here, is used in the fusion process between the INS and the external measurements. Equations (8)-(12) summarize the navigation filter equations [1, 31]:

\delta\hat{\mathbf{x}}_{k}^{-}=\mathbf{0} (8)
\mathbf{P}_{k}^{-}=\mathbf{\Phi}_{k-1}\mathbf{P}_{k-1}\mathbf{\Phi}_{k-1}^{T}+\mathbf{Q}_{k-1}^{d} (9)
\mathbf{K}_{k}=\mathbf{P}_{k}^{-}\mathbf{H}_{k}^{T}\left[\mathbf{H}_{k}\mathbf{P}_{k}^{-}\mathbf{H}_{k}^{T}+\mathbf{R}_{k}^{d}\right]^{-1} (10)
\delta\hat{\mathbf{x}}_{k}=\mathbf{K}_{k}\,\delta\mathbf{z}_{k} (11)
\mathbf{P}_{k}=\left[\mathbf{I}-\mathbf{K}_{k}\mathbf{H}_{k}\right]\mathbf{P}_{k}^{-}, (12)

where $\delta\hat{\mathbf{x}}_{k}^{-}$ is the prior estimate of the error state and $\delta\hat{\mathbf{x}}_{k}$ is the posterior estimate, with corresponding covariance matrices $\mathbf{P}_{k}^{-}$ (9) and $\mathbf{P}_{k}$ (12), respectively, $\mathbf{K}_{k}$ (10) is the Kalman gain, $\delta\mathbf{z}_{k}$ is the measurement residual at time index $k$, $\mathbf{H}_{k}$ is the measurement matrix, and $\mathbf{R}_{k}^{d}$ is the measurement noise covariance matrix, assumed known and constant.
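To make the filter cycle concrete, the following minimal sketch shows one propagation and update step of (8)-(12) in Python/NumPy; the function name and the assumption that $\mathbf{\Phi}_{k}$, $\mathbf{Q}_{k}^{d}$, $\mathbf{H}_{k}$, $\mathbf{R}_{k}^{d}$, and the residual are supplied by the surrounding navigation code are ours, not part of the original implementation:

```python
import numpy as np

def es_ekf_step(P, Phi, Qd, H, Rd, dz):
    """One error-state EKF cycle, following (8)-(12).

    P   : prior error-state covariance (n x n)
    Phi : discrete transition matrix (n x n), e.g. I + F*dt as in (6)
    Qd  : discrete process noise covariance (n x n), e.g. G*Qc*G.T*dt as in (7)
    H   : measurement matrix (m x n)
    Rd  : measurement noise covariance (m x m)
    dz  : measurement residual (m,)
    """
    # (8): the error state is reset to zero after every update,
    # so only the covariance is propagated between measurements.
    P_minus = Phi @ P @ Phi.T + Qd                      # (9) covariance propagation
    S = H @ P_minus @ H.T + Rd                          # innovation covariance
    K = P_minus @ H.T @ np.linalg.inv(S)                # (10) Kalman gain
    dx_hat = K @ dz                                     # (11) posterior error state
    P_plus = (np.eye(P.shape[0]) - K @ H) @ P_minus     # (12) covariance update
    return dx_hat, P_plus
```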

II-A Measurement Model

The GNSS position measurements are available at a constant, lower frequency than that of the IMU. After processing, the GNSS receiver outputs the vehicle position vector in the navigation frame, where it is used as an update in the navigation filter. Hence, the corresponding time-invariant GNSS measurement matrix is given by

\mathbf{H}_{GNSS}=\left[\begin{array}{cc}\mathbf{I}_{3\times 3}&\mathbf{0}_{3\times 12}\end{array}\right]\in\mathbb{R}^{3\times 15}. (13)

The corresponding measurement residual is given by

\delta\mathbf{z}_{GNSS,j}=\mathbf{H}_{GNSS}\,\delta\mathbf{x}_{j}+\boldsymbol{\varsigma}_{GNSS,j}, (14)

where $\boldsymbol{\varsigma}_{GNSS,j}\sim\mathcal{N}\left(0,\mathbf{R}_{GNSS}^{d}\right)\in\mathbb{R}^{3\times 1}$ is an additive, discretized, zero mean white Gaussian noise. It is assumed that $\boldsymbol{\varsigma}_{j}$ and $\delta\mathbf{w}_{j}$ are uncorrelated.

II-B Model Based-Adaptive Noise Covariance

The es-EKF minimizes the tracking error without changing the $\mathbf{Q}$ matrix online; that is, the process noise covariance is kept constant throughout the operation. As shown in the literature ([17] and the references therein), tuning $\mathbf{Q}$ online greatly improves the navigation filter performance.
For simplicity, we assume $\mathbf{Q}_{k}^{d}$ is a diagonal matrix; thus, no correlation exists between the noise terms.

The optimal diagonal $\mathbf{Q}$ matrix at time step $k$ is defined by

\mathbf{Q}_{k}^{*}\triangleq\arg\min_{\mathbf{Q}_{k}\in\mathcal{Q}}\left\|\hat{\mathbf{x}}_{k}\left(\mathbf{Q}_{k}\right)-\mathbf{x}_{k}^{GT}\right\|_{2}^{2}, (15)

where $\mathcal{Q}$ is the admissible set of $\mathbf{Q}_{k}$ at time index $k$, $\mathbf{x}_{k}^{GT}$ is the ground truth (GT) state at time index $k$, and $\left\|\cdot\right\|_{2}$ is the Euclidean ($L_{2}$) norm.
The most common approach to estimate $\mathbf{Q}$ in an adaptive es-EKF framework was suggested in [16, 17], and is based on the innovation matrix for a window of size $\xi$:

\mathbf{C}_{k}\triangleq\frac{1}{\xi}\sum_{i=k-\xi+1}^{k}\boldsymbol{\nu}_{i}\boldsymbol{\nu}_{i}^{T}, (16)

where $\mathbf{C}_{k}$ is the innovation matrix and the innovation vector is defined by

\boldsymbol{\nu}_{k}\triangleq\mathbf{z}_{k}-\mathbf{H}\hat{\mathbf{x}}_{k}^{-}. (17)

The innovation matrix (16) is used together with the Kalman gain, $\mathbf{K}$, to adapt $\mathbf{Q}$, as follows:

\hat{\mathbf{Q}}_{k}=\mathbf{K}_{k}\mathbf{C}_{k}\mathbf{K}_{k}^{T}. (18)
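For illustration, a minimal sketch of the innovation-based adaptation (16)-(18) is given below (Python/NumPy); the class name and window buffer are our own illustrative choices:

```python
import numpy as np
from collections import deque

class InnovationAdaptiveQ:
    """Innovation-based estimate of Q, following (16)-(18)."""

    def __init__(self, window_size):
        self.window = deque(maxlen=window_size)   # keeps the last xi innovations

    def update(self, z, H, dx_prior, K):
        nu = z - H @ dx_prior                     # (17) innovation vector
        self.window.append(np.outer(nu, nu))
        C = sum(self.window) / len(self.window)   # (16) windowed innovation matrix
        return K @ C @ K.T                        # (18) adapted process noise covariance
```

The adapted $\hat{\mathbf{Q}}_{k}$ returned here would replace the constant $\mathbf{Q}_{k-1}^{d}$ used in the covariance propagation (9).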

III A Hybrid Adaptive Navigation Filter

We rely on the well-established, model-based es-EKF, yet instead of a model-based adaptive approach, a data-driven approach is used to regress the momentary process noise covariance, resulting in a hybrid adaptive es-EKF. To this end, the navigation filter equations (8)-(12) are used, but instead of using a constant process noise (7) or a model-based adaptive process noise (18), we derive a data-driven approach to estimate the time-varying continuous process noise covariance matrix $\mathbf{Q}^{c}$. Our proposed hybrid adaptive es-EKF framework, suitable for any aided INS scenario, is presented in Fig. 1.

Figure 1: A hybrid adaptive navigation filter framework with online tuning of the process noise covariance matrix.

III-A Data-Driven Based Adaptive Noise Covariance

It is assumed that the continuous process noise covariance matrix, $\mathbf{Q}_{k}^{c}$, at time $k$ is a diagonal matrix with the following structure:

\mathbf{Q}_{k}^{c}=\mathrm{diag}\left\{q_{f}^{x*},q_{f}^{y*},q_{f}^{z*},q_{\omega}^{x*},q_{\omega}^{y*},q_{\omega}^{z*},\varepsilon\mathbf{I}_{1\times 6}\right\}_{k}\in\mathbb{R}^{12\times 12}, (19)

where the variance for each of the accelerometer axes is given by $q_{f}^{i*}$ and for the gyroscope by $q_{\omega}^{i*}$ ($i\in\{x,y,z\}$). The biases are modeled by random walk processes, and their variances are set to $\varepsilon=0.001$. By taking this approach, our goal is to ease the task of a machine learning (ML) algorithm by accurately regressing only the six unknown variances in (19), and we thereby claim that these are the dominant variances needed in an adaptive framework.
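For clarity, a minimal sketch of assembling (19) from the six regressed variances could look as follows (Python/NumPy; the function and argument names are illustrative assumptions):

```python
import numpy as np

def build_Qc(q_acc, q_gyro, eps=0.001):
    """Assemble the diagonal continuous process noise covariance (19).

    q_acc  : regressed accelerometer variances (qf_x*, qf_y*, qf_z*)
    q_gyro : regressed gyroscope variances (qw_x*, qw_y*, qw_z*)
    eps    : fixed variance of the bias random walk terms
    """
    diag = np.concatenate([q_acc, q_gyro, eps * np.ones(6)])
    return np.diag(diag)          # Qc in R^{12x12}
```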

We define a general one-dimensional series of length $N$ of the inertial sensor (accelerometer or gyroscope) readings in a single axis by

\mathcal{S}_{k}=\left\{\mathbf{s}_{i}\right\}_{i=k-N}^{k-1}. (20)

The discussed optimization problem is to find $q_{k}^{*}$ such that the position error is minimized. By doing so for all six IMU channels, the es-EKF uses an optimal system noise covariance matrix and provides improved tracking performance. The power of ML enables solving many difficult and non-conventional tasks. To determine $\mathbf{Q}_{k}^{*}$ in (15), the problem is formulated as an SL problem. Formally, we search for a model that relates an instance space, $\mathcal{X}$, and a label space, $\mathcal{Y}$. We assume that there exists a target function, $\mathcal{F}$, such that $\mathcal{Y}=\mathcal{F}\left(\mathcal{X}\right)$.
Thus, the SL task for adaptive determination of the process noise covariance is to find $\mathcal{F}$, given a finite set of labeled examples, consisting of inertial sensor readings and their corresponding process noise variance values:

\left\{\mathcal{S}_{k},q_{k}^{*}\right\}_{k=1}^{M}. (21)

The SL approach aims to find a function $\tilde{\mathcal{F}}$ that best estimates $\mathcal{F}$. To that end, for the training process, a loss function, $l$, is defined to quantify the quality of $\tilde{\mathcal{F}}$ with respect to $\mathcal{F}$. The loss function is given by

\mathcal{L}\left(\mathcal{Y},\hat{\mathcal{Y}}\left(\mathcal{X}\right)\right)\triangleq\frac{1}{M}\sum_{m=1}^{M}l\left(y,\hat{y}\right)_{m}, (22)

where $M$ is the number of examples and $m$ is the example index. Minimizing $\mathcal{L}$ in a training/test procedure leads to the target function. In our problem, the loss function is given by

l_{m}\triangleq\left(q_{m}^{*}-\hat{q}_{m}\right)^{2}, (23)

where $\hat{q}_{m}$ is the estimate obtained by the learning model during the training process.
In this work, the training dataset is used to find the function $\tilde{\mathcal{F}}$ given the labeled examples (21), as described in the next section.
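As a point of reference, the empirical loss (22) with the per-example squared error (23) reduces to a mean squared error over the labeled examples; a minimal sketch (Python/NumPy, with illustrative names) is:

```python
import numpy as np

def empirical_loss(q_true, q_pred):
    """Empirical loss (22) with the per-example squared error (23)."""
    q_true = np.asarray(q_true, dtype=float)
    q_pred = np.asarray(q_pred, dtype=float)
    return np.mean((q_true - q_pred) ** 2)

# Illustrative usage: loss over a small batch of variance labels and predictions.
loss = empirical_loss([0.010, 0.005, 0.020], [0.012, 0.004, 0.019])
```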

III-B Dataset Generation

There are several existing inertial datasets, as recently summarized in [33]. Yet, none of those datasets fits our problem, as they do not provide enough IMU readings with different noise characteristics. Consequently, we generated our own dataset using a simulation (detailed in Appendix -B).

To generate the dataset, six different baseline trajectories were simulated, as presented in Fig. 2(a) and in Fig. 3. The richness of the trajectories, a result of their diversity, allows the establishment of a model able to cope with unseen trajectories. Each baseline trajectory was created by generating ideal IMU readings for a period of 400 s at a sampling rate of 100 Hz, resulting in a sequence of 6x40,000 samples for each baseline trajectory, as shown in Fig. 2.
Our task is to estimate the system noise covariance matrix, or specifically the noise variance of each IMU channel. Hence, we took each of the six IMU channels and corrupted their perfect data with additive white Gaussian noise with variance in the range $q\in[0.001,0.025]$, using 15 different values inside this interval, as shown in Fig. 2(b).
Hence, each baseline trajectory has 15 series of 6x40,000 noisy inertial samples, as shown in Fig. 2(b). The justification for this simple noise model lies in the momentary IMU measurement noise covariance sequence: a short time window of IMU measurements is considered, which allows characterizing the noise by its variance only. Next, the series length $N$ is chosen, and a corresponding labeled database is generated, as shown in Fig. 2(c). Lastly, we chose $N=200$ samples and created batches corresponding to two seconds each, with a total of $6\times 200\times 15$. These batches were then randomly divided into train and test datasets such that all baseline trajectories are included in both the train and the test sets, in a ratio of 80:20, as described in Fig. 2(d).
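To illustrate the windowing and labeling described above, the following minimal sketch (Python/NumPy) corrupts one clean IMU channel with a chosen noise variance and slices it into labeled examples of length $N$; the function names and the exact handling of the variance grid are illustrative assumptions, not the original simulation code:

```python
import numpy as np

def make_examples(clean_channel, q, N=200, seed=0):
    """Create labeled examples (S_k, q*) from one clean IMU channel.

    clean_channel : ideal 1D readings of a single IMU axis
    q             : noise variance used as the label, q in [0.001, 0.025]
    N             : window length (200 samples = 2 s at 100 Hz)
    """
    rng = np.random.default_rng(seed)
    noisy = clean_channel + rng.normal(0.0, np.sqrt(q), size=clean_channel.shape)
    # non-overlapping windows of length N, each labeled by its noise variance
    windows = [noisy[i:i + N] for i in range(0, len(noisy) - N + 1, N)]
    return [(w, q) for w in windows]

# Illustrative usage: build examples for one channel and split 80:20.
examples = make_examples(np.zeros(40000), q=0.01)
idx = np.random.default_rng(1).permutation(len(examples))
split = int(0.8 * len(examples))
train_set = [examples[i] for i in idx[:split]]
test_set = [examples[i] for i in idx[split:]]
```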

Figure 2: Dataset generation and pre-processing phase: (a) six different simulated baseline trajectories; each baseline trajectory was created by tuning the IMU signals for a period of 400 s at a frequency of 100 Hz, resulting in a series of 6x40,000 samples per baseline trajectory, obtained in phase (b). (c) Example generation: given a series length, $N$, examples are created and stored with their labels. (d) The database is split into a train set and a test set with a ratio of 80:20.
Figure 3: Our six baseline trajectories.

III-C Architecture design

Several architectures were designed and evaluated. They included recurrent DNNs, and, in particular, long short-term memory (LSTM) architectures for time series data. However, in our problem they did not perform well and failed to generalize the noise variance property. One of the architectures we examined included a bi-directional LSTM layer followed by two linear layers. The training failed, as the temporal information was not relevant for this task. On the other hand, an architecture with only convolution layers achieved satisfactory results: it minimized the test loss very quickly and obtained a low position root mean squared error (PRMSE) (the metric is defined in (25)). This result is due to the spatial information contained in the noisy IMU signals, which allows capturing the dynamics and statistical properties of the measurements. This information structure was easily captured by the spatial convolution operators in the convolutional layers. As the goal of this paper is to lay the foundations for a hybrid adaptive navigation filter approach, we describe here only the selected DNN architecture, presented in Fig. 4. Our DNN model has the following layers:

  1.

    Linear layer: This is the simplest layer structure; it applies a linear transformation to the incoming data from the previous layer. The input features are received in the form of a flattened 1D vector and are multiplied by the weighting matrix.

  2.

    Conv1D layer: A convolutional, one-dimensional (Conv1D) layer creates a convolution kernel that is convolved with the layer input over a single spatial dimension to produce a vector of outputs. During the training phase, the DNN learns the optimal weights of the kernels. In our DNN model, the input layer is followed by a chain of three Conv1D layers. There are five filters in the first layer and three filters in each of the second and third layers. The kernel size decreases as the input goes deeper: the first Conv1D layer has a kernel size of twenty, the second Conv1D layer has a kernel size of ten, and the third Conv1D layer has a kernel size of five.

  3.

    Global average pooling layer: Pooling layers help with better generalization capability, as they down-sample the feature map. In this way, the robustness of the DNN grows; for example, it better handles changes in the position of features inside the 1D input vector. For the DNN architecture, we selected the global average pooling method, which calculates the average of each input channel and flattens the data so it can pass from the Conv1D layers to the linear layers.

  4.

    Leaky ReLU: The leaky rectified linear unit [34] is a nonlinear activation function that, for an input $\alpha$, produces the following output:

    f\left(\alpha\right)=\begin{cases}\alpha&\alpha>0\\ 0.01\alpha&\alpha\leq 0\end{cases}. (24)

    The main advantage of the leaky ReLU over the classical ReLU (which outputs 0 for $\alpha\leq 0$) is the small positive gradient when the unit is not active, which better handles the dying ReLU problem [35]. A leaky ReLU function was applied after each layer in the DNN model.

  5.

    Layer normalization: One of the challenges of DNNs is that the gradients with respect to the weights in one layer are highly dependent on the outputs of the neurons in the previous layer, especially if these outputs change in a highly correlated way [36]. Batch normalization, and in particular layer normalization, was proposed to reduce such an undesirable covariate shift. A layer normalization was added after every Conv1D layer (together with the leaky ReLU layer).

The parameters of the above layers are as follows: a series of 200x1 samples is passed to a one-dimensional convolutional layer (Conv1D) with a kernel size of 20 and five filters. Then, a leaky ReLU activation function and a normalization layer are applied, adding nonlinearity for better generalization capability of the DNN. We duplicated this structure twice more with three filters each and smaller kernels: the second with a kernel size of ten and the third with a kernel size of five. The output is passed to a global average pooling layer followed by four linear layers, with leaky ReLU activation functions between them, with 100, 80, 50, and 20 weights, respectively. Finally, the DNN outputs the process noise variance.
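The following is a minimal PyTorch sketch of such an architecture. It is not the authors' original implementation (which was developed in MATLAB), and details that are not fully specified in the text, such as the layer normalization shapes and the final output layer, are assumptions:

```python
import torch
import torch.nn as nn

class NoiseCovarianceNet(nn.Module):
    """Sketch of the Conv1D regressor described in Section III-C.

    Input: a window of N=200 samples from a single IMU channel,
    shaped (batch, 1, 200). Output: the regressed noise variance.
    """

    def __init__(self):
        super().__init__()
        self.conv = nn.Sequential(
            nn.Conv1d(1, 5, kernel_size=20),   # output shape (batch, 5, 181)
            nn.LeakyReLU(0.01),
            nn.LayerNorm(181),                 # normalized shape is an assumption
            nn.Conv1d(5, 3, kernel_size=10),   # (batch, 3, 172)
            nn.LeakyReLU(0.01),
            nn.LayerNorm(172),
            nn.Conv1d(3, 3, kernel_size=5),    # (batch, 3, 168)
            nn.LeakyReLU(0.01),
            nn.LayerNorm(168),
        )
        self.head = nn.Sequential(
            nn.Linear(3, 100), nn.LeakyReLU(0.01),
            nn.Linear(100, 80), nn.LeakyReLU(0.01),
            nn.Linear(80, 50), nn.LeakyReLU(0.01),
            nn.Linear(50, 20), nn.LeakyReLU(0.01),
            nn.Linear(20, 1),                  # final variance output (assumed layer)
        )

    def forward(self, x):
        x = self.conv(x)
        x = x.mean(dim=-1)                     # global average pooling over time
        return self.head(x)

# Illustrative usage: regress the variance of a batch of 200-sample windows.
model = NoiseCovarianceNet()
q_hat = model(torch.randn(32, 1, 200))         # shape (32, 1)
```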

Figure 4: DNN architecture: Three Conv1D layers are followed by four linear layers and a global average pooling layer. Leaky ReLU activation functions are used after every layer to add nonlinearity and achieve better generalization capability.

III-D Hybrid Learning Approach

In this work, GNSS position updates are employed to aid the INS and demonstrate our hybrid learning approach. Applying the suggested adaptive tuning approach in an online setting involves integrating the INS/GNSS with the regressor, as presented in Fig. 5. Algorithm 1 describes the INS/GNSS fusion with process noise covariance learning in a real-time setup. The IMU signals are inserted both into the INS/GNSS filter (8)-(12) and into the DNN model. The regressed continuous process noise covariance is plugged into (7) to obtain the discrete one, which in turn is substituted into (9).

Algorithm 1 Hybrid adaptive filter applied to INS/GNSS
Require: $\boldsymbol{\omega}_{ib}, \mathbf{f}_{b}, \mathbf{v}_{Aiding}, \Delta t_{0}, \Delta\tau, T, tuningRate$
Ensure: $\mathbf{v}^{n}, \boldsymbol{\varepsilon}^{n}$. Initialization: $\mathbf{v}_{0}^{n}, \boldsymbol{\varepsilon}_{0}^{n}$
1:  for $t=0$ to $T$ do
2:     obtain $\boldsymbol{\omega}_{ib}, \mathbf{f}_{b}$
3:     solve the navigation equations (29)-(31)
4:     if $\mathrm{mod}(t,\Delta\tau)=0$ then
5:        obtain $\mathbf{p}_{GNSS}$
6:        update the navigation state using the es-EKF (8)-(14)
7:     end if
8:     calculate the DNN model and predict $\mathbf{Q}_{k+1}^{c}$
9:     if $\mathrm{mod}(t,tuningRate)=0$ then
10:        $\mathbf{Q}_{k+1}^{c}=\tilde{\mathcal{F}}_{NN}\left(\mathcal{S}_{k}\right)$
11:     end if
12:  end for
Figure 5: Our hybrid adaptive filter applied to the INS/GNSS fusion process to tune the process noise covariance matrix, $\hat{\mathbf{Q}}_{k}$. Both IMU and GNSS measurements are plugged into the INS/GNSS filter, while the six IMU channels are also inserted into the DNN for predicting the process noise covariance matrix.
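To illustrate the learning part of Algorithm 1 (steps 8-10) together with the discretization (7), the following minimal sketch (Python/NumPy) regresses the six IMU noise variances from a buffer of recent measurements and forms the discrete process noise covariance; the helper names are illustrative, and a naive variance estimator stands in for the trained DNN:

```python
import numpy as np

def hybrid_q_update(imu_buffer, regressor, G, dt, eps=0.001):
    """Regress the six IMU noise variances and discretize Q, following (19) and (7).

    imu_buffer : array of shape (6, N), the last N samples of each IMU channel
    regressor  : callable mapping a length-N window to its noise variance
                 (the trained DNN of Section III-C in the real system)
    G          : shaping matrix (15 x 12) from (4)
    dt         : IMU step size
    eps        : fixed variance of the bias random walk terms
    """
    q = np.array([regressor(channel) for channel in imu_buffer])   # six variances
    Qc = np.diag(np.concatenate([q, eps * np.ones(6)]))            # (19)
    return G @ Qc @ G.T * dt                                       # (7) discrete Qd

# Illustrative usage with a naive variance estimator standing in for the DNN
# and an identity body-to-navigation DCM inside the shaping matrix (4).
rng = np.random.default_rng(0)
buffer = rng.normal(0.0, 0.1, size=(6, 200))      # last 2 s of IMU data at 100 Hz
G = np.zeros((15, 12))
G[0:3, 0:3] = np.eye(3)
G[3:6, 3:6] = -np.eye(3)
G[9:12, 6:9] = np.eye(3)
G[12:15, 9:12] = np.eye(3)
Qd = hybrid_q_update(buffer, regressor=np.var, G=G, dt=0.01)
```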

IV Analysis and Results

We employ the following two metrics to evaluate the position accuracy of the proposed approach:

  • Position root mean squared error ($L_{2}$ norm) over all three axes:

    PRMSE=\sqrt{\frac{1}{c}\sum_{k=1}^{c}\sum_{j\in\left\{x,y,z\right\}}\delta\hat{p}_{jk}^{2}}. (25)

  • Position mean absolute error over all three axes:

    PMAE=\frac{1}{c}\sum_{k=1}^{c}\sum_{j\in\left\{x,y,z\right\}}\left|\delta\hat{p}_{jk}\right|. (26)

where $c$ is the number of samples, $k$ is a running index over the samples, $j$ is the running index over the Cartesian coordinates, and $\delta\hat{p}_{jk}$ is the position error term.
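A minimal sketch of these two metrics (Python/NumPy, with an illustrative function name):

```python
import numpy as np

def position_metrics(p_est, p_gt):
    """PRMSE (25) and PMAE (26) from estimated and ground-truth positions.

    p_est, p_gt : arrays of shape (c, 3), positions along the x, y, z axes
    """
    err = p_est - p_gt
    prmse = np.sqrt(np.mean(np.sum(err ** 2, axis=1)))   # (25)
    pmae = np.mean(np.sum(np.abs(err), axis=1))          # (26)
    return prmse, pmae
```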

IV-A Train Dataset Analysis

The proposed DNN architecture, presented in Section III-C, was trained on the dataset described in Section III-B. After 30 epochs, it achieved an RMSE of 0.0032 on the test set, showing the proposed DNN's generalization capability and thereby establishing the trained DNN regressor model. The training process was done in mini-batches of 500 examples each. The ADAM optimizer [37] was used with an initial learning rate of 0.001 and a learning-rate schedule with a drop factor of 0.1 after 20 epochs. In our working environment (Intel i7-6700HQ CPU @ 2.60 GHz, 16 GB RAM, MATLAB), the training for an input length of 200 samples took about three hours. The average inference time is 0.02 s. Fig. 6 presents the performance for N=200 samples, where the red line is the desired one, representing equality between the GT value and the DNN-predicted value. The gray points represent the performance of the test set with the trained DNN model. The mean values of the test set (blue points) lie near the red line in most cases.

Figure 6: GT values versus predicted DNN values on the test set. All values are presented by gray crosses. The blue points represent the mean for each value, and the dashed red line (true value = predicted value) represents the DNN generalization capability.

The INS/GNSS fusion is mostly performed under real-time conditions, where latency in the position computation might degrade performance. Thus, the influence of the length of the IMU sequence input was examined. The system noise covariance matrix was learned based on series of length $N$. As $N\gg 1$, the probability of learning the correct terms grows, since the DNN can capture the signal's intrinsic properties more easily using more data; however, the latency grows. Considering this trade-off, we trained the chosen architecture with various $N$ values and calculated the RMSE, as shown in Fig. 7. As expected, $N=400$ obtained the minimum RMSE, yet for the rest of the analysis we chose $N=200$ (similar RMSE) to receive the regression result in a shorter time period. For example, in the simulated train and test dataset the sampling rate is 100 Hz; thus, working with $N=200$ gives the regression result every two seconds instead of every four seconds ($N=400$).

Figure 7: DNN RMSE vs. input length (N).

IV-B Field experiment

A field experiment with the DJI Matrice 300 [38] was performed (Fig. 8), and the resulting trajectory is shown in Fig. 9. A figure-eight trajectory was flown with some additional approximately straight line segments. The experiment parameters, including the GNSS RTK GT values and the IMU, are given in Table I. The quadrotor dataset was published in [39] and is publicly available at https://github.com/ansfl/Navigation-Data-Project.

Figure 8: DJI Matrice 300 in our field experiment.
Figure 9: The trajectory of the quadrotor used to evaluate the proposed approach.
TABLE I: Experiment parameters
Description Symbol Value
GNSS noise (var.) - horizontal | $R_{11}, R_{22}$ | $0.01\,[m]^{2}$
GNSS noise (var.) - vertical | $R_{33}$ | $0.02\,[m]^{2}$
GNSS step size | $\Delta\tau$ | $0.2\,[s]$
IMU step size | $\Delta t$ | $0.001\,[s]$
Accelerometer noise (MPU-9250) | $q_{1:3}^{*}$ | $0.02^{2}\,[m/s]^{2}$
Gyroscope rate noise (MPU-9250) | $q_{4:6}^{*}$ | $0.002^{2}\,[rad/s]^{2}$
Experiment duration | $T$ | $60\,[s]$
Initial velocity | $\mathbf{v}_{0}^{n}$ | $[0,0,0]^{T}\,[m/s]$
Initial position | $\mathbf{p}_{0}^{n}$ | $[32.1^{\circ},34.8^{\circ},0]^{T}$

Equations (2)-(12) were used to implement a model-based es-EKF with a constant process noise covariance matrix. Three different cases of the model-based es-EKF with a constant process noise covariance matrix (MB-EKF-CQ) were examined:

  • MB-EKF-CQ1: $q_{1:3}=0.002,\ q_{4:6}=0.02$.

  • MB-EKF-CQ2: $q_{1:3}=0.001,\ q_{4:6}=0.01$.

  • MB-EKF-CQ3: $q_{1:3}=0.002,\ q_{4:6}=0.01$.

In addition, using (16)-(18), three different cases (window size lengths) of the model-based es-EKF with an adaptive process noise covariance matrix (MB-EKF-AQ) were examined:

  • MB-EKF-AQ1: $\xi=1$.

  • MB-EKF-AQ2: $\xi=3$.

  • MB-EKF-AQ3: $\xi=5$.

All six approaches were compared to our proposed hybrid adaptive EKF (HB-AEKF) using the trained network, as described in the previous section. Results in terms of PMAE and PRMSE are presented in Table II. The MB-EKF-AQ model-based adaptive filters obtained better performance than the MB-EKF-CQ constant process noise filters. In particular, MB-EKF-CQ3 achieved the best performance among the constant process noise cases, with a PMAE of 2.2 m and a PRMSE of 1.6 m, which MB-EKF-AQ3 managed to improve by 18% and 19%, respectively. Our proposed hybrid adaptive approach, HB-AEKF, with an input length of 200 samples, obtained the best performance, with 1.7 m PMAE and 1.2 m PRMSE, improving upon MB-EKF-AQ3 by 5.5% and 7.7%, respectively.

The time evolution of the $q$ parameters, obtained using our HB-AEKF approach, is shown in Fig. 10 for both the accelerometers and the gyroscopes. Their behavior shows fluctuations in the first 25 seconds and then steady-state behavior. This can be attributed to the convergence of the es-EKF (8)-(12) once it obtains enough data and captures the IMU noise statistics.

TABLE II: Experiment results showing PMAE and PRMSE performance
Approach PMAE [m] PRMSE [m]
HB-AEKF (ours) 1.7 1.2
MB-EKF-AQ1 2.3 1.6
MB-EKF-AQ2 2.0 1.4
MB-EKF-AQ3 1.8 1.3
MB-EKF-CQ1 2.4 1.8
MB-EKF-CQ2 2.4 1.7
MB-EKF-CQ3 2.2 1.6
Figure 10: The process noise parameters as a function of time as obtained using our HB-AEKF approach. The upper plot shows the accelerometer noise variance for each axis as detected by the DNN model, and the lower plot shows the same for the gyroscope.

Fig. 11 summarizes the positioning performance by providing two cumulative density function (CDF) graphs, one for the PRMSE and the other for the PMAE. In those plots, all three MB-EKF-AQ and all three MB-EKF-CQ approaches are presented, as well as our proposed HB-AEKF hybrid approach (black lines). Both graphs clearly show that the error is lower for HB-AEKF than for all other approaches. The PRMSE of the HB-AEKF is lower than 1.7 m and its PMAE lower than 2.8 m for the entire scenario. In comparison, with the MB-EKF-AQ1 approach the maximum PRMSE and the maximum PMAE grow to 3.7 m and 4.0 m, respectively.

Figure 11: PRMSE and PMAE CDF graphs.

V Conclusions

The proper choice of the system noise covariance matrix is critical for accurate INS/GNSS fusion. In this paper, a hybrid learning and model-based approach was suggested in an adaptive filter framework. To that end, a novel DNN-based approach to learn and tune online the system noise covariance was developed. The input to the DNN is only the IMU readings. This learning approach was then combined with the model-based es-EKF, resulting in a hybrid adaptive filter.

The DNN model was trained only once, on a simulated database consisting of six different baseline trajectories. Analysis on this dataset showed a trade-off between accuracy and the latency of the regression solution, where accuracy improves as the input length, and thus the latency, increases. Relying on this analysis, an input length of 200 samples was chosen.

To validate the performance of the proposed approach, it was compared to six model-based approaches representing both constant and adaptive process noise covariance selection. Quadrotor field experiments were conducted and used as an additional test dataset for the trained network. We obtained state-of-the-art results, for each noise value, compared to classical adaptive KF approaches; a PRMSE lower than 1.2 m and a PMAE lower than 1.7 m were obtained using our suggested approach, compared to a PRMSE of 1.6 m and a PMAE of 2.3 m obtained using the innovation-based adaptive approach and classical tuning. This amounts to a total PRMSE improvement of 24% and a PMAE improvement of 27%.

This improvement also demonstrates the robustness and generalization properties of the proposed DNN architecture, enabling it to cope with unseen data from different IMU sensors. In addition, together with the train dataset results, it supports our hypothesis that regressing only the first six diagonal elements of the continuous process noise covariance matrix is sufficient.

Although demonstrated for quadrotor INS/GNSS fusion, the proposed approach can be extended to any external sensor aiding the INS and to any type of platform.

Although the present study demonstrated a modest improvement over model-based methods on the quadrotor INS/GNSS test dataset, it introduces a hybrid framework for combining deep learning with an adaptive navigation filter. This approach can be applied to other platforms, dynamics, and external sensors aiding the INS, and can serve as a foundation for future work in the field.

-A Abbreviations

TABLE III: Abbreviation and Description
Abbreviation Description
AUV autonomous underwater vehicle
CDF cumulative density function
Conv1D one-dimensional convolution
DNN deep neural network
DVL Doppler velocity log
EKF extended Kalman filter
es error state
GNSS global navigation satellite system
GT ground truth
HB-AEKF hybrid based adaptive EKF
INS inertial navigation system
LSTM long short-term memory
MB-EKF-AQ model based EKF adaptive Q
MB-EKF-CQ model based EKF constant Q
ML machine learning
NED North-East-Down
PDR pedestrian dead-reckoning
PMAE position mean absolute error
PRMSE position RMSE
RMSE root mean squared error
SL supervised learning
UAV unmanned air vehicles

-B Dataset Generation

-B1 INS Equations of Motion

The INS equations of motion include the rate of change of the position, the velocity, and the transformation between the navigation and body frames, as shown in Fig. 12.

The position vector is given by

\mathbf{p}^{n}=\left[\phi\;\;\lambda\;\;h\right]^{T}\in\mathbb{R}^{3\times 1}, (27)

where $\phi$ is the latitude, $\lambda$ is the longitude, and $h$ is the altitude. The velocity vector is Earth-referenced and expressed in the North-East-Down (NED) coordinate system:

\mathbf{v}^{n}=\left[v_{N}\;\;v_{E}\;\;v_{D}\right]^{T}\in\mathbb{R}^{3\times 1}, (28)

where $v_{N}, v_{E}, v_{D}$ denote the velocity vector components in the north, east, and down directions, respectively. The rate of change of the position is given by [1]

\dot{\mathbf{p}}^{n}=\left[\begin{array}{c}\dot{\phi}\\ \dot{\lambda}\\ \dot{h}\end{array}\right]=\left[\begin{array}{c}\frac{v_{N}}{R_{M}+h}\\ \frac{v_{E}}{\cos\left(\phi\right)\left(R_{N}+h\right)}\\ -v_{D}\end{array}\right], (29)

where $R_{M}$ and $R_{N}$ are the meridian radius and the normal radius of curvature, respectively. The rate of change of the velocity vector is given by [1]

\dot{\mathbf{v}}^{n}=\mathbf{T}_{b}^{n}\mathbf{f}^{b}+\mathbf{g}^{n}-\left(\left[\boldsymbol{\omega}_{en}^{n}\times\right]+2\left[\boldsymbol{\omega}_{ie}^{n}\times\right]\right)\mathbf{v}^{n}, (30)

where $\mathbf{T}_{b}^{n}\in\mathbb{R}^{3\times 3}$ is the transformation matrix from the body frame to the navigation frame, $\mathbf{f}^{b}\in\mathbb{R}^{3\times 1}$ is the specific force vector measured by the accelerometers and expressed in the body frame, and $\mathbf{g}^{n}\in\mathbb{R}^{3\times 1}$ is the gravity vector expressed in the navigation frame. $\boldsymbol{\omega}_{en}^{n}$ is the angular velocity vector between the Earth-centered Earth-fixed (ECEF) frame and the navigation frame. The angular velocity vector between the ECEF frame and the inertial frame is given by $\boldsymbol{\omega}_{ie}^{n}$, and the rate of change of the transformation matrix is given by [1]

\dot{\mathbf{T}}_{b}^{n}=\mathbf{T}_{b}^{n}\left(\left[\boldsymbol{\omega}_{ib}^{b}\times\right]-\left[\boldsymbol{\omega}_{in}^{b}\times\right]\right), (31)

where $\boldsymbol{\omega}_{ib}^{b}=\left[p\;\;q\;\;r\right]^{T}\in\mathbb{R}^{3\times 1}$ is the angular velocity vector as obtained by the gyroscopes and $\boldsymbol{\omega}_{in}^{b}$ is the angular velocity vector between the navigation frame and the inertial frame expressed in the body frame. The angular velocity between the navigation frame and the inertial frame expressed in the navigation frame is given by $\boldsymbol{\omega}_{in}^{n}$. The alignment between the body frame and the navigation frame can be obtained from $\mathbf{T}_{b}^{n}$ as follows:

\boldsymbol{\varepsilon}=\left[\begin{array}{c}\varphi\\ \theta\\ \psi\end{array}\right]=\left[\begin{array}{c}\mathrm{atan2}\left(\mathbf{T}_{n,31}^{b},\mathbf{T}_{n,32}^{b}\right)\\ \arccos\left(\mathbf{T}_{n,33}^{b}\right)\\ -\mathrm{atan2}\left(\mathbf{T}_{n,13}^{b},\mathbf{T}_{n,23}^{b}\right)\end{array}\right]\in\mathbb{R}^{3\times 1}, (32)

where $\varphi$ is the roll angle, $\theta$ is the pitch angle, and $\psi$ is the yaw angle. These three angles are called the Euler angles.

Figure 12: Autonomous underwater vehicle and quadcopter with their body axes and angular rates.

-B2 IMU readings

The GT trajectories were created by tuning the vehicle angular rates, $\mathcal{P}$, and the vehicle accelerations, $\mathbf{a}$, as follows [40]:

\mathcal{P}=\left[\dot{\varphi}\;\;\dot{\theta}\;\;\dot{\psi}\right]^{T}\in\mathbb{R}^{3\times 1}, (33)

and

\mathbf{a}=\left[a_{x}\;\;a_{y}\;\;a_{z}\right]^{T}\in\mathbb{R}^{3\times 1}. (34)

The transformation between the desired motion and the INS framework for the gyroscope readings is given by [40]

\boldsymbol{\omega}_{ib}^{b}=\mathbf{T}_{n}^{b}\left(\boldsymbol{\omega}_{en}^{n}+\boldsymbol{\omega}_{ie}^{n}\right)+\overline{\boldsymbol{\omega}_{ib}^{b}}, (35)

where

\overline{\boldsymbol{\omega}_{ib}^{b}}=\left[\begin{array}{ccc}1&0&-\sin\theta\\ 0&\cos\varphi&\sin\varphi\cos\theta\\ 0&-\sin\varphi&\cos\varphi\cos\theta\end{array}\right]\mathcal{P}\in\mathbb{R}^{3\times 1}, (36)

and from (30) the accelerometer readings are

\mathbf{f}_{b}=\mathbf{T}_{n}^{b}\left[\mathbf{a}-\mathbf{g}^{n}+\left(\boldsymbol{\omega}_{en}^{n}\times+2\,\boldsymbol{\omega}_{ie}^{n}\times\right)\mathbf{v}^{n}\right]. (37)

Once the GT IMU readings (35) and (37) are obtained, noise with the characteristics described in Section III is added to create our dataset.
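As an illustration of (36) and (37), the following minimal sketch (Python/NumPy) generates noise-free gyroscope and accelerometer readings from commanded Euler-angle rates and accelerations; the transport and Earth-rate terms $\boldsymbol{\omega}_{en}^{n}$ and $\boldsymbol{\omega}_{ie}^{n}$ are neglected here for brevity, and the function names are illustrative:

```python
import numpy as np

def gyro_readings(euler_rates, phi, theta):
    """Noise-free gyro readings from Euler-angle rates, following (36)
    (with the omega_en and omega_ie terms of (35) neglected)."""
    A = np.array([[1.0, 0.0, -np.sin(theta)],
                  [0.0, np.cos(phi), np.sin(phi) * np.cos(theta)],
                  [0.0, -np.sin(phi), np.cos(phi) * np.cos(theta)]])
    return A @ euler_rates

def accel_readings(a_n, T_bn, g_n=np.array([0.0, 0.0, 9.81])):
    """Noise-free specific force readings, following (37) with the
    Coriolis/transport terms neglected; T_bn is the body-to-navigation DCM
    and g_n is the gravity vector in NED (down positive)."""
    return T_bn.T @ (a_n - g_n)

# Illustrative usage for a level, non-accelerating vehicle turning about the z-axis.
w_ib = gyro_readings(np.array([0.0, 0.0, 0.1]), phi=0.0, theta=0.0)
f_b = accel_readings(np.zeros(3), T_bn=np.eye(3))
```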

References

  • [1] J. Farrell, Aided navigation: GPS with high rate sensors.   McGraw-Hill, Inc., 2008, pp: 393-395.
  • [2] D. Wang, Y. Dong, Z. Li, Q. Li, and J. Wu, “Constrained mems-based gnss/ins tightly coupled system with robust kalman filter for accurate land vehicular navigation,” IEEE Transactions on Instrumentation and Measurement, vol. 69, no. 7, pp. 5138–5148, 2019.
  • [3] G. Wang, X. Xu, J. Wang, and Y. Zhu, “An enhanced ins/gnss tightly coupled navigation system using time-differenced carrier phase measurement,” IEEE Transactions on Instrumentation and Measurement, vol. 69, no. 7, pp. 5208–5218, 2019.
  • [4] L. Paull, S. Saeedi, M. Seto, and H. Li, “AUV navigation and localization: A review,” IEEE Journal of Oceanic Engineering, vol. 39, no. 1, pp. 131–149, 2013.
  • [5] S. Gupte, P. I. T. Mohandas, and J. M. Conrad, “A survey of quadrotor unmanned aerial vehicles,” in 2012 Proceedings of IEEE Southeastcon.   IEEE, 2012, pp. 1–6.
  • [6] Q. Xu, X. Li, and C.-Y. Chan, “Enhancing localization accuracy of mems-ins/gps/in-vehicle sensors integration during gps outages,” IEEE Transactions on Instrumentation and Measurement, vol. 67, no. 8, pp. 1966–1978, 2018.
  • [7] Y. Cui, S. Liu, J. Yao, and C. Gu, “Integrated positioning system of unmanned automatic vehicle in coal mines,” IEEE Transactions on Instrumentation and Measurement, vol. 70, pp. 1–13, 2021.
  • [8] Y. Bar-Shalom, X. R. Li, and T. Kirubarajan, Estimation with applications to tracking and navigation: theory algorithms and software.   John Wiley & Sons, 2004.
  • [9] X. Lyu, B. Hu, K. Li, and L. Chang, “An adaptive and robust UKF approach based on Gaussian process regression-aided variational Bayesian,” IEEE Sensors Journal, vol. 21, no. 7, pp. 9500–9514, 2021.
  • [10] C. H. Kang, C. G. Park, and J. W. Song, “An adaptive complementary kalman filter using fuzzy logic for a hybrid head tracker system,” IEEE Transactions on Instrumentation and Measurement, vol. 65, no. 9, pp. 2163–2173, 2016.
  • [11] N. Davari and A. Gholami, “An asynchronous adaptive direct Kalman filter algorithm to improve underwater navigation system performance,” IEEE Sensors Journal, vol. 17, no. 4, pp. 1061–1068, 2016.
  • [12] D.-J. Jwo and S.-H. Wang, “Adaptive fuzzy strong tracking extended Kalman filtering for gps navigation,” IEEE Sensors Journal, vol. 7, no. 5, pp. 778–789, 2007.
  • [13] M. A. K. Jaradat and M. F. Abdel-Hafez, “Enhanced, delay dependent, intelligent fusion for INS/GPS navigation system,” IEEE Sensors Journal, vol. 14, no. 5, pp. 1545–1554, 2014.
  • [14] X. Tong, Z. Li, G. Han, N. Liu, Y. Su, J. Ning, and F. Yang, “Adaptive ekf based on hmm recognizer for attitude estimation using mems marg sensors,” IEEE Sensors Journal, vol. 18, no. 8, pp. 3299–3310, 2017.
  • [15] J. He, C. Sun, B. Zhang, and P. Wang, “Adaptive error-state kalman filter for attitude determination on a moving platform,” IEEE Transactions on Instrumentation and Measurement, vol. 70, pp. 1–10, 2021.
  • [16] R. Mehra, “On the identification of variances and adaptive Kalman filtering,” IEEE Transactions on Automatic Control, vol. 15, no. 2, pp. 175–184, 1970.
  • [17] L. Zhang, D. Sidoti, A. Bienkowski, K. R. Pattipati, Y. Bar-Shalom, and D. L. Kleinman, “On the identification of noise covariances and adaptive Kalman filtering: A new look at a 50 year-old problem,” IEEE Access, vol. 8, pp. 59 362–59 388, 2020.
  • [18] H. Yan, Q. Shan, and Y. Furukawa, “RIDI: Robust IMU double integration,” in Proceedings of the European Conference on Computer Vision (ECCV), 2018, pp. 621–636.
  • [19] S. Herath, H. Yan, and Y. Furukawa, “Ronin: Robust neural inertial navigation in the wild: Benchmark, evaluations, & new methods,” in 2020 IEEE International Conference on Robotics and Automation (ICRA).   IEEE, 2020, pp. 3146–3152.
  • [20] O. Asraf, F. Shama, and I. Klein, “Pdrnet: A deep-learning pedestrian dead reckoning framework,” IEEE Sensors Journal, 2021.
  • [21] A. Manos, T. Hazan, and I. Klein, “Walking direction estimation using smartphone sensors: a deep network-based framework,” IEEE Transactions on Instrumentation and Measurement, 2022.
  • [22] C. Chen, P. Zhao, C. X. Lu, W. Wang, A. Markham, and N. Trigoni, “Deep-learning-based pedestrian inertial navigation: Methods, data set, and on-device inference,” IEEE Internet of Things Journal, vol. 7, no. 5, pp. 4431–4441, 2020.
  • [23] M. Yona and I. Klein, “Compensating for partial doppler velocity log outages by using deep-learning approaches,” in 2021 IEEE International Symposium on Robotic and Sensors Environments (ROSE).   IEEE, 2021, pp. 1–5.
  • [24] D. Li, J. Xu, H. He, and M. Wu, “An underwater integrated navigation algorithm to deal with dvl malfunctions based on deep learning,” IEEE Access, vol. 9, pp. 82 010–82 020, 2021.
  • [25] M. Brossard and S. Bonnabel, “Learning wheel odometry and imu errors for localization,” in 2019 International Conference on Robotics and Automation (ICRA).   IEEE, 2019, pp. 291–297.
  • [26] A. Shurin and I. Klein, “Quadnet: A hybrid framework for quadrotor dead reckoning,” Sensors, vol. 22, no. 4, p. 1426, 2022.
  • [27] C. Shen, Y. Zhang, X. Guo, X. Chen, H. Cao, J. Tang, J. Li, and J. Liu, “Seamless gps/inertial navigation system based on self-learning square-root cubature kalman filter,” IEEE Transactions on Industrial Electronics, vol. 68, no. 1, pp. 499–508, 2020.
  • [28] B. Or and I. Klein, “Learning vehicle trajectory uncertainty,” arXiv preprint arXiv:2206.04409, 2022.
  • [29] J. Liu and G. Guo, “Vehicle localization during gps outages with extended kalman filter and deep learning,” IEEE Transactions on Instrumentation and Measurement, vol. 70, pp. 1–10, 2021.
  • [30] M. Brossard, S. Bonnabel, and A. Barrau, “Denoising imu gyroscopes with deep learning for open-loop attitude estimation,” IEEE Robotics and Automation Letters, vol. 5, no. 3, pp. 4796–4803, 2020.
  • [31] P. D. Groves, Principles of GNSS, Inertial, and Multisensor Integrated Navigation Systems 2nd ed.   Boston, MA: Artech House, 2013.
  • [32] R. G. Brown and P. Y. Hwang, Introduction to random signals and applied Kalman filtering.   Wiley New York, Third Edtion, 1992.
  • [33] A. Shurin, A. Saraev, M. Yona, Y. Gutnik, S. Faber, A. Etzion, and I. Klein, “The autonomous platforms inertial dataset (table 1),” IEEE Access, vol. 10, pp. 10 191–10 201, 2022.
  • [34] A. L. Maas, A. Y. Hannun, A. Y. Ng et al., “Rectifier nonlinearities improve neural network acoustic models,” in Proc. icml, vol. 30, no. 1.   Citeseer, 2013, p. 3.
  • [35] L. Lu, Y. Shin, Y. Su, and G. E. Karniadakis, “Dying relu and initialization: Theory and numerical examples,” arXiv preprint arXiv:1903.06733, 2019.
  • [36] J. L. Ba, J. R. Kiros, and G. E. Hinton, “Layer normalization,” arXiv preprint arXiv:1607.06450, 2016.
  • [37] D. P. Kingma and J. Ba, “Adam: A method for stochastic optimization,” arXiv preprint arXiv:1412.6980, 2014.
  • [38] DJI, “Matrice 300 RTK, Aircraft Specifications,” Website source: https://www.dji.com/matrice-300/specs, 2021.
  • [39] A. Shurin, A. Saraev, M. Yona, Y. Gutnik, S. Faber, A. Etzion, and I. Klein, “The autonomous platforms inertial dataset,” IEEE Access, vol. 10, pp. 10 191–10 201, 2022.
  • [40] D.-J. Jwo, J.-H. Shih, C.-S. Hsu, and K.-L. Yu, “Development of a strapdown inertial navigation system simulation platform,” Journal of Marine Science and Technology, vol. 22, no. 3, pp. 381–391, 2014.
Barak Or (Member, IEEE) received a B.Sc. degree in aerospace engineering from the Technion–Israel Institute of Technology, Haifa, Israel, a B.A. degree (cum laude) in economics and management, and an M.Sc. degree in aerospace engineering from the Technion–Israel Institute of Technology in 2016 and 2018. He is currently pursuing a Ph.D. degree with the University of Haifa, Haifa.
His research interests include navigation, deep learning, sensor fusion, and estimation theory.
Itzik Klein (Senior Member, IEEE) received B.Sc. and M.Sc. degrees in Aerospace Engineering from the Technion-Israel Institute of Technology, Haifa, Israel in 2004 and 2007, and a Ph.D. degree in Geo-information Engineering from the Technion-Israel Institute of Technology, in 2011. He is currently an Assistant Professor, heading the Autonomous Navigation and Sensor Fusion Lab, at the Hatter Department of Marine Technologies, University of Haifa. His research interests include data-driven based navigation, novel inertial navigation architectures, autonomous underwater vehicles, sensor fusion, and estimation theory.