Sage Journals: Discover world-class research

Abstract

High-speed flight in GPS-denied environments is currently an important frontier in the research on autonomous flight of micro air vehicles. Autonomous drone races stimulate the advances in this area by representing a very challenging case with tight turns, texture-less floors, and dynamic spectators around the track. These properties hamper the use of standard visual odometry approaches and imply that the micro air vehicles will have to bridge considerable time intervals without position feedback. To this end, we propose an approach to trajectory estimation for drone racing that is computationally efficient and yet able to accurately estimate a micro air vehicle’s state (including biases) and parameters based on sparse, noisy observations of racing gates. The key concept of the approach is to optimize unknown and difficult-to-observe state variables so that the observations of the racing gates best fit with the known control inputs, estimated attitudes, and the quadrotor dynamics and aerodynamics during a time window. It is shown that a gradient-descent implementation of the proposed approach converges ∼4 times quicker to (approximately) correct bias values than a state-of-the-art 15-state extended Kalman filter. Moreover, it reaches a higher accuracy, as the predicted end-point of an open-loop turn is on average only ∼20 cm away from the actual end-point, while the extended Kalman filter and the gradient descent method with kinematic model only reach an accuracy of ∼50 cm. Although the approach is applied here to drone racing, it generalizes to other settings in which a micro air vehicle may only have sparse access to velocity and/or position measurements.

Keywords

Autonomous drone race quadrotor modeling bias estimation gradient descent

Introduction

Quadrotors have received considerable attention in recent years, thanks to their mechanical simplicity and good maneuverability combined with hover properties. They have offered new possibilities in a variety of fields like aerial photography, inspection and even transportation. With recent advances in on-board computation and sensor technology, aggressive maneuvering has come within reach of many applications. To further stimulate aggressive and fast flight, autonomous drone racing is gaining interest. The first ever autonomous drone race was held by the International Conference on Intelligent Robots and Systems (IROS) in 2016.¹ A track consisting of gates had to be flown autonomously in a pre-specified order. The robot had to achieve this as fast as possible, while only relying on onboard sensors and processing. Figure 1 illustrates the setup of the 2016 indoor track.

Figure 1.

The map of the IROS 2016 drone race. In this drone race, the UAVs have to fly through orange gates in a pre-specified order as fast as possible.

Autonomous indoor drone racing brings many new challenges to the fields of quadrotor navigation and control. One initial challenge is the navigation without any external positioning system like VICON, Optitrack or GPS. Typical approaches to this problem make use of on-board cameras and use Visual Inertial Odometry to integrate position. This type of algorithms rely on integrating inertial information, tracking visual features over several frames and solving an optimization problem to retrieve the most likely solution. In autonomous drone racing, on top of this position estimation algorithm, gate detection is often needed when the position of gates is not precisely known, or when gates contain moving parts—as is the case in the IROS competitions. With the limited computational resources of small indoor drones, to achieve the fast speeds needed in drone racing, this paper proposes a navigation solution based solely on gate detection, augmented with inertial measurements and an aerodynamic model. To cope with the sometimes sparse and noisy non-Gaussian visual observations, we formulate the navigation solution as an optimization problem. We then solve it using a gradient descent method. The resulting method provides online estimation of the quadrotor position, velocity and inertial biases using less computational resources than traditional Visual Inertial Odometry. The proposed approach also estimates aerodynamic properties of the quadcopter—which become increasingly important in the case of fast aggressive control. Finally, the approach scales favorably with increasing flight speeds as it keeps performing well even with very few position updates. As a comparison, we use the Kalman filter, which is currently still the default choice for navigation. Since the extended Kalman filter (EKF) is significantly less computationally complex than the unscented Kalman filter (UKF),² in this paper we select the EKF as a benchmark. We compare the results with EKF, which is shown to be much more sensitive to visual outliers or other non-Gaussian effects.

In the section Related work, an overview of studies on aerodynamics modeling and state estimation methods is given. The section Quadrotor model will describe the quadrotor model parameters that will be solved. The section State estimation proposes two different approaches for the visual state estimation. First a classic 15-state EKF is developed as benchmark. Then the novel FMINCON-based gradient descent optimization method is proposed to solve the model parameters and states. In the section Experiment setup and result, both algorithms are compared on flight test data and Conclusion summarizes the conclusions.

Related work

Several researchers have already proposed aerodynamics models for quadrotors.^3–7 The main object of their studies is to derive a nonlinear quadrotor aerodynamics model to improve the control performance by compensating for the nonlinear terms. In some studies, a detailed aerodynamic model is analyzed through theory and fitted by experimental data.³ Simplified aerodynamic models are also established from experiments.^5,6 It should be noted that their models are all obtained off-line using external measurements, such as GPS, VICON and thrust test beds. Aerodynamic models can also be combined with on-board measurements, for instance from computer vision,⁸ in order to better estimate the velocity of the drone on-line. In this article, we employ a simplified aerodynamic model in the trajectory estimation exactly for this purpose.

Quadrotor control heavily relies on attitude estimation from an attitude and heading reference system (AHRS). This system is typically based on inertial sensors (accelerometers and gyroscopes), but also relies on orientation sensors (magnetometer) and/or positioning sensors (GPS, VICON) to estimate inertial sensor biases and compensating for long term drift. Sensor biases become increasingly important as the drone will have to fly longer or temporarily perform feedforward control maneuvers in the absence of sensor measurements. Hence, for drone racing, it is important to estimate them accurately. Here we briefly discuss the sensors and then the filtering employed in estimating both attitude and position or velocity on micro air vehicles (MAVs).

Most systems intended for outdoor environments utilize the magnetometer and GPS-measurements.^9–13 The indoor equivalent is the use of a motion tracking system such as VICON or Optitrack.¹⁴ In many applications—like autonomous drone racing, it is required to have accurate state estimation without the help of external systems. The necessary position or velocity measurements can be obtained from multiple sensors. One early option is to use laser scanners.^15,16 But a laser scanner contains sensitive optics and mirrors, which are susceptible to shock and vibration problems.¹⁷ Another choice for on-board navigation is RGB-D devices.^18–20 The main drawback of these RGB-D devices is that their maximum depth perception range is limited to a few meters.^21,22 This is why light-weight and inexpensive on-board cameras which are more robust to vibration and shock, have attracted interest of researchers for the navigation of drones. Generally, visual odometry (VO) algorithms²³ using a stereo camera or monocular camera are used for estimation of the MAV’s translation and rotation between frames.^24–28 However, generic visual odometry approaches necessitate detecting features, matching corresponding features and estimating motions, which leads to a heavy demand for on-board computational resources and low-frequency estimation. In the meantime, aggressive maneuvers may introduce blur into generic visual odometry and seriously affect the accuracy of estimation. Moreover, in complicated environments like drone racing, dynamic spectators may also interfere visual odometry. Less generic but computationally efficient methods are employed in some specific environments, for instance, using detection of known visual markers to determine position.^29,30 However, these methods cannot cope with other generic environments.

Concerning filtering, with white-Gaussian position measurement, Kalman filter and its variants are widely used. It is well-known that nonlinearities in the state update or observation equations can be handled by an EKF^11,12,22,31 and that heavy nonlinearities are often handled better by a UKF.^30,32,33 Also, there are factor graph-based smoothing methods which can handle nonlinearity and allows multi-rate, asynchronous, and possibly delayed measurements, which have similar performance with an EKF.^34,35 We hypothesize that when these measurements get sparser, and their noise distribution moves further away from the Gaussian distribution, it will be better to estimate the attitude, heading, and trajectory in general as an optimization problem that uses more data at a time. In particular, we want to optimize the trajectory and parameters such as sensor biases, given a specified time-window with the corresponding sensor measurements, control inputs, and knowledge of the aerodynamic model. Our approach will be explained below, starting with our dynamic quadrotor model.

Quadrotor model

Dynamic model of quadrotor

Before deriving the dynamic model for quadrotor, two reference frames are introduced (Figure 4).

Earth frame E. The origin of the local tangent earth frame is on the ground, the x-axis x^E points to north, the y-axis y^E points east and the z-axis z^E points down.

Body frame B. The origin of the body frame is at the center of mass. Its x-axis x^B is in the symmetry plane of the drone and points forward. Its z-axis z^B also lies in the symmetry plane and points downward. The y-axis y^B is directed to the right, perpendicular to the symmetry plane.

The relative relation between two frames can be expressed by three successive rotations along three axes. In this paper, we use z–y–x sequence to rotate one frame to the other. The corresponding angle of rotation is defined by $ϕ_{E}^{B}, θ_{E}^{B}$ , and $ψ_{E}^{B}$ which are also called Euler angles. Given the Euler angles between the two frames, the rotation matrix between two frames can be expressed by

R_{E}^{B} = [\begin{matrix} \begin{matrix} C_{θ} C_{ψ} & C_{θ} S_{ψ} & - S_{θ} \\ S_{ϕ} S_{θ} C_{ψ} - C_{ϕ} S_{ψ} & S_{ϕ} S_{θ} S_{ψ} + C_{ϕ} C_{ψ} & S_{ϕ} C_{θ} \\ C_{ϕ} S_{θ} C_{ψ} + S_{ϕ} S_{ψ} & C_{ϕ} S_{θ} S_{ψ} - S_{ϕ} C_{ψ} & C_{ϕ} C_{θ} \end{matrix} \end{matrix}]

(1)

where C_X and S_X denote the cosine and sine of X, respectively. The control of the quadrotor is often divided in to two loops which can be independently developed, namely a high level translation loop and a faster low-level attitude loop. For the attitude loop, the inputs of the system are the four rotor speeds and the output consists of the three Euler angles. For the translation loop, the inputs of system are three Euler angles and the output is position. Since quadrotor attitude control is a well-developed topic, in this work we only derive the translational model and have used INDI from Smeur et al.³⁶ as innerloop.

According to Newton’s laws of motion, the motion of quadrotor can be described as

m \dot{V} = m g + F

(2)

where m is mass of the drone, g is gravity vector and F is the specific force vector. The change in position can be described by the kinematic equation

\dot{X} = V

(3)

In equation (2), the specific force F can be expressed in Body frame B as

F^{B} = [\begin{array}{l} F_{x}^{B} \\ F_{y}^{B} \\ F_{z}^{B} \end{array}]

(4)

Gravity acting on the center of mass and expressed in Earth frame is

m g^{E} = m [\begin{array}{l} 0 \\ 0 \\ g \end{array}]

(5)

Combining all forces yields the equations of motion in inertial frame

\begin{array}{l} [\begin{array}{l} \dot{x} \\ \dot{y} \\ \dot{z} \end{array}] = [\begin{array}{l} v_{x} \\ v_{y} \\ v_{z} \end{array}] \\ [\begin{array}{l} {\dot{v}}_{x} \\ {\dot{v}}_{y} \\ {\dot{v}}_{z} \end{array}] = [\begin{array}{l} 0 \\ 0 \\ g \end{array}] + R_{B}^{E} [\begin{array}{l} a_{x}^{B} \\ a_{y}^{B} \\ a_{z}^{B} \end{array}] \end{array}

(6)

where

[\begin{array}{l} a_{x}^{B} \\ a_{y}^{B} \\ a_{z}^{B} \end{array}] = [\begin{array}{l} F_{x}^{B} \\ F_{y}^{B} \\ F_{z}^{B} \end{array}] / m

(7)

In the system above, we have six states $x = {[x, y, z, v_{x}, v_{y}, v_{z}]}^{T}$ and four inputs $u = {[ϕ, θ, ψ, a_{z}^{B}]}^{T}$ . In equation (6), the specific force is a nonlinear function of velocity, attitude, angular rates and other factors. It can be expressed as $F = f_{a} (V, ϕ, θ, ψ, \dots)$ . This system is a multiple input multiple output nonlinear system.

IMU misalignment

Equation (6) reveals that rotation matrix $R_{E}^{B}$ is an essential part of the model. However, in the real world, many aspects can contribute to attitude estimation errors. A first reason is the misalignment of the IMU (see Figure 2). Assembly inaccuracy can cause the measurements of the IMU to differ from the real states in body frame. Rotor misalignment can also affect the performance of quadrotor. In an ideal quadrotor, the four rotors should be perpendicular to x_BOy_B plane. In practice however, due to installation errors or deformation of rotors or axes, the thrust produced by the rotors is not perfectly perpendicular to the x_BOy_B plane.

Figure 2.

When a quadrotor hovers, usually the average attitude of the quadrotor and reading of the AHRS are not zero. This is caused by the misalignment of both the IMU and the rotors.

Both factors lead to non-zero required attitude during hover: $ϕ_{E}^{B} \neq 0 °$ and $θ_{E}^{B} \neq 0 °$ . In order to model this misalignment error, we introduce a new frame. The IMU frame I is an orthogonal frame whose three axis coincide with three axes of the accelerometers. The rotation between the IMU frame I and the body frame B can be described by Euler angles $Φ_{I}^{B} = {[ϕ_{I}^{B}, θ_{I}^{B}, ψ_{I}^{B}]}^{T}$ . The rotation matrix between the IMU frame I and the body frame B is $R_{I}^{B} (Φ_{I}^{B})$ . Since the IMU frame is physically attached to the body frame, we have the assumption

{\begin{array}{l} {\dot{ϕ}}_{I}^{B} (t) = 0 \\ {\dot{θ}}_{I}^{B} (t) = 0 \\ {\dot{ψ}}_{I}^{B} (t) = 0 \end{array}

(8)

Aerodynamic model

There are many factors that can affect the quadrotor’s aerodynamics. Some examples are the quadrotor’s velocity V, its angle of attack $α$ , the thrust T, the rotor speed ω, the angular velocity q and so on. Accurate and complete quadrotors models can be complicated and nonlinear.^37,38 Moreover, accurate modeling also requires many more parameters to be estimated and this leads to heavier computations. In the context of autonomous drone racing we opted for a faster approach using a minimal model that covers the most important aerodynamic effects, hereby maximizing the yield for a given computational load. In particular, many drag factors—such as induced drag, translation drag and blade flapping drag—can be approximated as linear functions of body velocity $v_{x}^{b}$ and $v_{y}^{b}$ with the assumption that wind is still.⁶ This results in the following simple lumped parameter model

{\begin{array}{l} a_{x}^{B} = K_{x} v_{x}^{B} \\ a_{y}^{B} = K_{y} v_{y}^{B} \end{array}

(9)

where

[\begin{array}{l} v_{x}^{B} \\ v_{y}^{B} \end{array}] = R_{E}^{B} (3; 3) [\begin{array}{l} v_{x} \\ v_{y} \end{array}]

(10)

$a_{x}^{B}, a_{y}^{B}$ are the acceleration caused by drag in the body frame. K_x, K_y are first-order drag coefficients in body frame coordinates B and have units 1/s.

AHRS bias model

When positioning information is available, the mainstream approach for estimating attitude is merging information from gyro, accelerometer and the positioning system. For instance, the classic 15-state Kalman filter uses accelerometer and gyro measurements to predict states along with GPS measurement updates. It can provide non-biased optimal attitude by estimating the gyro and accelerometer biases as states.

When no continuous external positioning information is available, like in our experiment, a compromise is to neglect kinematic accelerations in the attitude filter. In this case, the biases of accelerometers cannot be estimated.

In the case of attitude determination with constant sensor biases and small angles, the Kalman gain in the Kalman filter typically converges to an almost constant value. To avoid the computational overhead of computing the Kalman gain, complementary filters can be used with very similar results. The structure of the complementary attitude determination filter implemented in this work can be found in Figure 3. In Figure 3, $Ω_{m} = [p_{m}, q_{m}, r_{m}]$ are the gyro measurements. $a_{m} = [a_{x}^{m}, a_{y}^{m}, a_{z}^{m}]$ contains the accelerometer measurements and

\begin{array}{l} R^{'} = (\begin{array}{l} 1 & \tan θ \sin ϕ & \tan θ \cos ϕ \\ 0 & \cos ϕ & - \sin θ \\ 0 & \frac{\sin ϕ}{\cos θ} & \frac{\cos ϕ}{\cos θ} \end{array}) \end{array}

(11)

Figure 3.

Complementary filter for attitude determination.

Figure 3 shows that the gyroscopes are integrated and the accelerometer is used as feedback to determine attitude. The high-frequency vibrations and centripetal forces which are measured by the accelerometers cancel out on the long term when no constant non-zero accelerations are present. On the long term, the resulting attitude estimation therefore converges to

[\begin{array}{l} {\hat{ϕ}}_{a} (t) \\ {\hat{θ}}_{a} (t) \end{array}] = [\begin{array}{l} \arctan \frac{- a_{x}^{m}}{- a_{z}^{m}} \\ \arctan \frac{- \cos {\hat{ϕ}}_{a} (t) a_{x}^{m}}{- a_{z}^{m}} \end{array}]

(12)

where

a_{x}^{m}, a_{y}^{m}

, and

a_{z}^{m}

are measurements of the accelerometer in three axes.

The gyroscopes measure angular velocity in the three axes of the body frame. Because they are integrated, even small biases cause drift over time, and in this filter the gyro biases $b_{g} = {[b_{p}, b_{q}, b_{r}]}^{T}$ are accounted for by the k_I/s term in the filter.

Accelerometers unfortunately also suffer from biases, which is denoted by $b_{a} = {[b_{a_{x}}, b_{a_{y}}, b_{a_{z}}]}^{T}$ , for instance caused by temperature changes. Fortunately, the biases of the accelerometers only change slowly. Everything combined, the AHRS has an erroneous representation of where earth is, which is referred to as coordinate frame $E^{'}$ and is shown in Figure 4. The AHRS attitude is then defined as the rotation between $E^{'}$ and I and is denoted as $Φ_{E^{'}}^{I} = {[ϕ_{E^{'}}^{I}, θ_{E^{'}}^{I}, ψ_{E^{'}}^{I}]}^{T}$ . The corresponding rotation matrix is written as $R_{E^{'}}^{I} (Φ_{E^{'}}^{I})$ .

Figure 4.

AHRS estimation errors can be represented by an erroneous Earth reference frame $E^{'}$ .

The rotation between the real earth E and $E^{'}$ can be expressed by three Euler angles $Φ_{E}^{E^{'}} = {[ϕ_{E}^{E^{'}}, θ_{E}^{E^{'}}, ψ_{E}^{E^{'}}]}^{T}$ . Based on the assumption that the AHRS error changes slowly, we can assume

{\begin{array}{l} {\dot{ϕ}}_{E}^{E^{'}} (t) \approx 0 \\ {\dot{θ}}_{E}^{E^{'}} (t) \approx 0 \\ {\dot{ψ}}_{E}^{E^{'}} (t) \approx 0 \end{array}

(13)

With this assumption, on the short term the rotation matrix $R_{E}^{E^{'}} (Φ_{E}^{E^{'}})$ is a constant matrix.

Four reference frames have been introduced, namely E, $E^{'}$ , I and B. The rotation matrix $R_{E}^{B}$ in equation (6) can now be expressed as

R_{E}^{B} (Φ_{E}^{B}) = R_{I}^{B} (Φ_{I}^{B}) R_{E^{'}}^{I} (Φ_{E^{'}}^{I}) R_{E}^{E^{'}} (Φ_{E}^{E^{'}})

(14)

where

R_{I}^{B} (Φ_{I}^{B})

and

R_{E}^{E^{'}} (Φ_{E}^{E^{'}})

are constant matrices and

R_{E^{'}}^{I} (Φ_{E^{'}}^{I})

represents the attitude as determined by the AHRS.

Full model

Combining equations (6), (9), and (14), we obtain the full model as

\dot{x} = {\begin{cases} v_{x} \\ v_{y} \\ v_{z} \\ [\begin{matrix} 0 \\ 0 \\ g \end{matrix}] + R_{B}^{E} [\begin{matrix} 0 \\ 0 \\ a_{z}^{B} \end{matrix}] + [\begin{matrix} K_{x} & 0 & 0 \\ 0 & K_{y} & 0 \\ 0 & 0 & 0 \end{matrix}] R_{E}^{B} [\begin{matrix} v_{x} \\ v_{y} \\ v_{z} \end{matrix}] \end{cases}

\begin{array}{l} R_{E}^{B} (Φ_{E}^{B}) = R_{I}^{B} (Φ_{I}^{B}) R_{E^{'}}^{I} (Φ_{E^{'}}^{I}) R_{E}^{E^{'}} (Φ_{E}^{E^{'}}) R_{B}^{E} = R_{E}^{B} T \\ a_{z}^{B} = a_{z}^{m} - b_{a_{z}} \end{array}

(15)

The model in equation (15) contains the following parameters, which are assumed to be constant over short periods of time

Θ = {[K_{x}, K_{y}, b_{a_{z}}, ϕ_{E}^{E^{'}}, θ_{E}^{E^{'}}, ψ_{E}^{E^{'}}, ϕ_{I}^{B}, θ_{I}^{B}, ψ_{I}^{B}]}^{T}

(16)

State estimation

To estimate the states of the model from the section Quadrotor model, two approaches are derived. As a benchmark, an EKF is developed. Secondly, a novel gradient descent based optimization method to estimate the states is proposed.

Vision-based EKF

The attitude determination Kalman filter uses the inertial sensors as inputs to predict the states of the system, then uses different observations to revise the predictions. When the system is linear, observable and the noise is white Gaussian, then it can be mathematically proven that the Kalman filter provides the optimal solution. If the system is nonlinear, it can be linearized at every time step, which is referred to as the EKF. A classic 15-state EKF is implemented as found in Gross’s work,² the difference being that we use vision measurements instead of GPS as positioning information. The following states are used

\begin{array}{l} X = {[x, y, z]}^{T} \\ V = {[v_{x}, v_{y}, v_{z}]}^{T} \\ Φ = {[ϕ, θ, ψ]}^{T} \\ b_{a} = {[b_{a_{x}}, b_{a_{y}}, b_{a_{z}}]}^{T} \\ b_{g} = {[b_{p}, b_{q}, b_{r}]}^{T} \end{array}

(17)

with as inputs

\begin{array}{l} Ω_{m} = {[p^{m}, q^{m}, r^{m}]}^{T} \\ a_{m} = {[a_{x}^{m}, a_{y}^{m}, a_{z}^{m}]}^{T} \end{array}

(18)

and as observation

y = h (x) = [\begin{array}{l} x \\ y \\ z \end{array}]

(19)

The process equation is

{\begin{array}{l} \dot{X} = V \\ \dot{V} = g + R_{B}^{E} (a_{m} + b_{a}) \\ \dot{Φ} = R^{'} (Ω_{m} + b_{g}) \\ {\dot{b}}_{a} = 0 \\ {\dot{b}}_{g} = 0 \end{array}

(20)

This forms a standard nonlinear system expression

{\dot{x}}^{'} = f (x^{'}, u)

(21)

where

x^{'} = {[X, V, Φ, b_{a}, b_{g}]}^{T}

and

f (x^{'}, u) = [\begin{array}{l} V \\ g + R_{E}^{B} (a_{m} + b_{a}) \\ R^{'} (Ω_{m} + b_{g}) \\ 0 \\ 0 \end{array}]

(22)

The EKF follows five steps:

(1) Predict the states based on equation (20)

{\hat{X}}_{k | k - 1} = {\hat{X}}_{k - 1} + f ({\hat{X}}_{k - 1}, u_{k - 1}) T

(23)

where T is sampling time.

(2) Linearize and discretize the system

\begin{array}{l} F_{k - 1} = \frac{\partial}{\partial x} f (x (t), u (t)) |_{x (t) = {\hat{x}}_{k - 1}} \\ Φ_{k | k - 1} \approx I + F_{k - 1} T \\ H_{k} = \frac{\partial}{\partial x} h (x (t)) |_{x (t) = {\hat{x}}_{k - 1}} \end{array}

(24)

(3) Propagate the covariance matrix $P_{k | k - 1}$

P_{k | k - 1} = Φ_{k | k - 1} P_{k - 1} Φ_{k | k - 1}^{T} + Q_{k - 1}

(25)

where

Q_{k - 1}

is system noise covariance matrix.

(4) Calculate the Kalman gain and update the prediction.

\begin{array}{l} δ {\hat{X}}_{k} = K_{k} {Z_{k} - h [{\hat{X}}_{k | k - 1}, k]} \\ K_{k} = P_{k | k - 1} H_{k}^{T} {[H_{k} P_{k | k - 1} H_{k}^{T} + R_{k}]}^{- 1} \\ {\hat{X}}_{k} = {\hat{X}}_{k | k - 1} + δ {\hat{X}}_{k} \end{array}

(26)

where

R_{k}

is sensor noise covariance matrix.

(5) Update the covariance matrix of the state estimation error

P_{k} = (I - K_{k} H_{k}) P_{k / k - 1} {(I - K_{k} H_{k})}^{T} + K_{k} R_{k} K_{k}^{T}

(27)

Vision-based gradient descent method

According to the gate detection algorithm we used in IROS 2016 autonomous drone race, the vision-based position used as observation in the Kalman filter (equation (19)) has very non-Gausian noise, which can significantly affect the estimation accuracy of Kalman filters. The vision measurement model will be discussed later. Therefore the state prediction is rewritten as a parameter optimization problem in the form of a trajectory matching problem.

Unlike the Kalman filter which estimates continuously varying states like pitch and roll for any moment in time, the proposed gradient descent using the model from equation (15) in essence estimates corrections on top of attitude estimates provided by an external complementary attitude filter.

Since most model parameters like drag and AHRS error are integrated twice to arrive at position, observing the trajectory over a period of time allows for extremely fine observations of these parameters. For instance, a sub-degree attitude error is hard to identify in noisy raw accelerometer measurements. However integrating the consequence of this small angle error, which causes a percentage of gravity to be erroneously double-integrated in the lateral position after several seconds, becomes very easily observable.

The observed trajectory is obtained from the vision pipeline and expressed as a list of n noisy measurements. The predicted trajectory is based on integrating the model presented in equation (15) using attitude from the AHRS and given a set of model parameters $\hat{Θ}$ . The resulting trajectory becomes

F (Θ) = \int_{0}^{t} f (Θ, u (t), t) d t = [\begin{array}{l} \hat{x} (Θ, u (t), t) \\ \hat{y} (Θ, u (t), t) \\ \hat{z} (Θ, u (t), t) \\ {\hat{v}}_{x} (Θ, u (t), t) \\ {\hat{v}}_{y} (Θ, u (t), t) \\ {\hat{v}}_{z} (Θ, u (t), t) \end{array}]

(28)

The error between the predicted integrated trajectory and the vision measurements is found as

J (Θ) = \sum_{i = i}^{n} ‖ [\begin{array}{l} \hat{x} (Θ, u (t_{i}), t_{i}) \\ \hat{y} (Θ, u (t_{i}), t_{i}) \\ \hat{z} (Θ, u (t_{i}), t_{i}) \end{array}] - [\begin{array}{l} x_{i}^{m} \\ y_{i}^{m} \\ z_{i}^{m} \end{array}] ‖

(29)

where

x_{i}^{m}, y_{i}^{m}, z_{i}^{m}

are position measurements obtained from onboard computer vision. Now the state estimation has become a nonlinear parameter optimization problem that finds a set of optimal parameters

Θ^{*}

to minimize the value of

J (Θ)

which can be expressed as

\begin{array}{l} \min_{Θ} J (Θ) \\ s . t . \dot{x} (t) = f (x (t), u (t)) \end{array}

(30)

To solve the problem formulated by equation (30), we can apply many types of nonlinear optimization methods to find the optimal parameters $Θ^{*}$ . In this paper, we propose the gradient descent method, which is iteratively searching for optimal values in negative gradient direction until it finds the minimum point

Θ_{k + 1} = Θ_{k} + α \nabla J (Θ_{k})

(31)

where α is learning rate and

\nabla J (Θ_{k}) = {[\begin{array}{l} \frac{\partial}{\partial Θ_{1}} J (Θ) & \dots & \frac{\partial}{\partial Θ_{n}} J (Θ) \end{array}]}^{T} |_{Θ = Θ_{k}}

(32)

is the gradient of

J (Θ)

Figure 5 shows an example of the gradient descent approach. The propagation in time of the model from equation (15) for various parameters $Θ$ is compared to the ground-truth measured by a passive external positioning system. The gradient descent starts with an initial guess of $Θ_{0}$ , and gradually gets the predicted trajectory closer to the real trajectory until an optimal set $Θ^{*}$ is found. In this example, we directly use Optitrack data as measurements which better illustrate how the predicted trajectories converge to the ground-truth trajectory (measured by Optitrack).

Figure 5.

A gradient descent method optimizes a set of parameters $Θ$ to best fit a predicted trajectory through a measured trajectory (blue). During the fitting phase, the gradient descent method converges to the ground-truth trajectory.

Experiment setup and result

Experiment setup

In order to study the performance of state estimation methods, a hippodrome shaped track is used with end circles with radius of 1.5 m and straights of 3 m as shown in Figure 6. Onboard flight data are recorded while flying without computer vision but based on Optitrack position. The data are then analyzed in MATLAB. A Bebop 1 (Figure 7) from Parrot is used as experiment platform. It is equipped with three gyros, three accelerometers, one sonar, one barometer, a front camera and a bottom camera. Only the front camera and IMU are used and the original stock flight-code in the drone is replaced by open-source software from the Paparazzi-UAV project.³⁹ The AHRS runs on-board and consists of the complementary filter discussed in previous section. The flight time of the test runs is about 100s and the average flight velocity is about 1.8m/s, resulting in about 15 circles of the hippodrome. An overview of data gathered is presented in Table 1.

Figure 6.

The top view of the experiment track.

Figure 7.

The Parrot Bebop 1 hardware is used as experiment platform. All flight code is replaced with open-source Paparazzi-UAV flight code.

Figure 8.

Based on the vision measurement model (equation (33)), simulated vision measurement points (red) are generated around the real trajectory (blue). During the autonomous drone race, only the visual measurement points are available.

Figure 9.

When vision measurements are not available, the quadrotor can only rely on model predictions based on model information and inertial data. This prediction will diverge in time. The better the model prediction is, the smaller the end point prediction error Δ _f becomes.

Table 1.

Data gathered during the experiment.

Parameter	Symbol	Frequency (Hz)	Source
Acceleration	${\hat{a}}^{m}$	512	IMU
Angular velocity	${\hat{p}}^{m}$	512	IMU
Attitude	${\hat{Φ}}^{m}$	512	AHRS
Position	${\hat{x}}^{m}$	120	Optitrack
Velocity	${\hat{v}}^{m}$	120	Optitrack
Altitude	z^m	512	Sonar

During the IROS 2016 autonomous drone race, we used the bebop 1 onboard camera to detect the gates and provide the position measurements for navigation. In this work, however, noisy vision measurements are generated simulating on-board vision-based gate detections with various levels of accuracy. Along the straight part trajectory, n random points P _i are randomly sampled ( $15 < n < 20, i \in [1, n]$ ). For each sampled point P _i , we calculate the distance between P _i and the gate which is denoted by ${\hat{x}}_{i} - x_{g}$ . Then, the noise $Δ P_{i}$ is generated depending on the distance to the gate that $Δ P_{i}$ is larger when the gate is further away. Finally, $Δ P_{i}$ is added to P _i to get the simulated measurements $P_{i}^{v}$ . This process can be described by equation (33) (Figure 8)

\begin{array}{l} P_{i}^{v} = P_{i}^{m} + Δ P_{i} \\ Δ P_{i} \sim N (0, S_{i}) \\ S_{i} = [\begin{array}{l} σ_{i}^{2} & 0 & 0 \\ 0 & σ_{i}^{2} & 0 \\ 0 & 0 & σ_{i}^{2} \end{array}] \\ σ_{i} = 0.1 ({\hat{x}}_{i} - x_{g}) \end{array}

(33)

The test flights consist of two distinct phases which are shown in Figure 6.

During the straight part (blue line), the gates are in the field of view of the quadrotor and vision-based position measurements are available. The vision-based EKF (VEKF) can run both prediction and update loops. The vision-based gradient descent method (VGD) searches for parameters $Θ$ that make the prediction best fit the noisy measurements.

During the arc (purple line), no position measurements are available but an open-loop coordinated turn is performed. The VEKF can only rely on model prediction and the gradient descent method uses the last estimated parameters and on-board inertial data to propagate the states of the quadrotor. This phase must be limited in time as the open-loop integration is diverging as can be seen in Figure 9.

The test track is designed to resemble an autonomous drone race track, where it is not possible to keep gates in sight at all times. When using fast gate detection as sole means of position information, some maneuvers need to be performed open-loop. But even when gates are in-sight, better model prediction allows the estimation of more accurate trajectories through the noisy visual data. Therefore, as a performance index we selected the prediction error Δ _f at the final point of the open-loop arc to evaluate the performance of both algorithms.

Δ_{f} = ‖ [\begin{array}{l} x_{f} \\ y_{f} \end{array}] - [\begin{array}{l} {\hat{x}}_{f} \\ {\hat{y}}_{f} \end{array}] ‖

(34)

where x_f and y_f, which are from Optitrack, form the ground truth of the end point of the arc, while

{\hat{x}}_{f}

and

{\hat{y}}_{f}

are the filter prediction of the end point.

Analysis of VGD

In this section, we use the on-board flight data and generated vision measurements to analyze the VGD using a MATLAB implementation of gradient descent, FMINCON.

The performance of the gradient descent method is affected by the size of the training data. It is important to investigate how the size of the dataset used to search for $Θ^{*}$ affects the estimation performance. We use the notation γ ( $1 \leq γ \leq 5$ ) to represent the size of the history used by FMINCON. In other words, γ is the number of straight lines whose corresponding vision measurement is used by FMINCON. Too short γ will contain very few visual measurements and the approach is at risk of over-fitting the gate detection noise. Too long γ will violate the constant parameter constraint like for instance equation (13). Figure 10 shows an example where γ = 2. For each step, we use an array of flight data and vision measurements of size γ in FMINCON to search for $Θ^{*}$ . Then, $Θ^{*}$ is used to estimate the trajectory of next arc, which is given by id τ ( $1 \leq τ \leq 15$ ). Finally the final point error $_{γ}^{τ} Δ_{f}$ can be calculated using equation (34)

γ τ Δ_{f} = ‖ [\begin{array}{l} x_{f} \\ y_{f} \end{array}] - [\begin{array}{l} {\hat{x}}_{f} \\ {\hat{y}}_{f} \end{array}] ‖

(35)

Figure 10.

Example test flight data showing the x position in function of time and illustrating the prediction strategy when γ = 2. First, the data of straight lines 1 and 2 are used to estimate $Θ^{*}$ . Then the identified model parameters are used to predict the second turn. Finally, the final point error after the second arc $_{2}^{2} Δ_{f}$ is calculated. Here, subscript 2 means the data from 2 straight lines are used and superscript 2 means second arc’s trajectory prediction is used. This procedure is repeated by using data of straight lines 2 and 3 and predicting the trajectory of third arc and so forth. (a) Final point error $_{γ}^{τ} Δ_{f}$ in function of γ for various parts of the run τ. (b) Number of FMINCON iterations based on stopping criteria (equation (36)) in function of γ for various parts of the run τ.

The stopping criteria used in the FMINCON optimization is

\frac{‖ J (Θ_{k}) - J (Θ_{k - 1}) ‖}{‖ J (Θ_{k}) ‖} \leq 10^{- 4}

(36)

With different combinations of τ and γ, a set of 70 $_{γ}^{τ} Δ_{f}$ is gathered. The prediction accuracy results, $_{γ}^{τ} Δ_{f}$ , and the number of iterations based on the stopping criteria from equation (36) are shown in Figure 11.

Figure 11.

Influence of the history length γ on the prediction accuracy $_{γ}^{τ} Δ_{f}$ and required number of iterations.

Figure 11(a) shows the prediction accuracy $_{γ}^{τ} Δ_{f}$ as a function of the history length γ. Each gray dot represents an individual arc estimation τ on another part of the data while the blue dots give the average for a given γ. Similarly, in Figure 11(b) the required number of iterations based on the stopping criteria is shown. The figures show that the prediction error $_{γ}^{τ} Δ_{f}$ keeps decreasing up to γ = 4. This means that fitting more than one straight part helps improving the accuracy of state estimation. Figure 11(b) shows that the average number of iterations is about 19 and the maximum is only 25, which means this VGD quickly converges and is not very computationally expensive.

Comparison between VEKF, VGD and VGD-kinematic

In this section, in order to show the different performance of the gradient descent between the kinematic model and model from equation (15), we introduce a new method called Vision-based gradient descent method with kinematic model (VGD-kinematic). This method has the same principle as VGD except that it is using a kinematic model 38 as prediction model.

{\begin{array}{l} \dot{X} = V \\ \dot{V} = g + R_{B}^{E} (a_{m} + b_{a}) \\ \dot{Φ} = R^{'} (Ω_{m} + b_{g}) \end{array}

(37)

In this case, the parameters to be estimated are the bias of accelerometers and gyros, which can be written as

Θ = {[b_{a_{x}}, b_{a_{y}}, b_{a_{z}}, b_{p}, b_{q}, b_{r}]}^{T}

(38)

To compare the performance of the of three methods, all three methods are tested using the same on-board data and the same generated vision measurements. In both VGD and VGD-kinematic, γ was set to 3, which means that the flight data of the last 3 straights is used in the estimation of $Θ^{*}$ . Note that during the first two arcs of the flight, there is not yet enough flight data, and γ will be smaller than 3.

The resulting full flight is shown in Figure 13. In Figure 13, the orange dots are the generated vision measurements from the straight parts of the track. The magenta curve is the estimation result of the VEKF. In the VEKF, $R = diag ([{2.5}^{2}, {2.5}^{2}, {2.5}^{2}]), Q = diag ([(2 e - 6, 2 e - 6, 5 e - 6, e - 5, 5 e - 6, 3 e - 5, 3 e - 8, 3 e - 9, 3 e - 9, 0, 0, 0, 0, 0, 0])$ and $P_{0} = 10 \times I_{15 \times 15}$ . The blue curve is the estimation result of the VGD and the red curve is the result of VGD-kinematic. To test the sensitivity of the VGD and the VGD-kinematic algorithm, the initial parameters $Θ_{0}$ are selected randomly within some ranges which can be found in Table 2. It can be seen that while the VEKF clearly converges to the measurements. The long prediction horizon combined with few and noisy measurement updates challenges the filter to its limit. On the other hand, the VGD managed to find parameters that fit the model very well through the noisy measurements and is not sensitive to the initial parameters. Even large measurement noise does not affect the prediction too much as the dynamics of the quadrotor cannot explain them.

Figure 12.

The final point error $γ τ Δ_{f}$ when using the VEKF, VGD and VGD-kinematic. The VGD has the most stable performance and least $γ τ Δ_{f}$ compared to the EKF and VGD-kinematic. (a) Position estimation; (b) Velocity estimation; (c) Estimation of accelerometer bias; (d) Estimation of gyro bias.

Figure 13.

Comparison of the position and velocity estimation results of the EKF and the FMINCON-based gradient descent method using inertial sensors and discrete low frequency noise vision based position measurements from gate detections.

Table 2.

The range of $Θ_{0}$ in VGD and VGD-kinematic.

$Θ_{0}$	Range	$Θ_{0}$	Range
$K_{*}^{0}$	$[- 1, 0]$	$ϕ_{}^{} 0$	$[- 3^{°}, 3^{°}]$
$b_{a}^{0}$	$[- 1 m / s^{2}, 1 m / s^{2}]$	$θ_{}^{} 0$	$[- 3^{°}, 3^{°}]$
$b_{g}^{0}$	$[- 3^{°} / s, - 3^{°} / s]$	$ψ_{}^{} 0$	$[- 3^{°}, 3^{°}]$

The final point prediction error $_{γ}^{τ} Δ_{f}$ after each turn of the three algorithms is shown in Figure 12. The VEKF requires several laps (3rd arc, or about 20 s of flight) to converge to sub-meter prediction accuracy. During the rest of the flight, the EKF can predict the 180° turns with a final point prediction error of around 0.5 m. The VGD-kinematic uses the derived kinematic model as prediction model and utilizes multiple vision measurements for parameter estimation. It has similar performance when compared with the VEKF. Overall, the VGD, which uses the same measurements as the VGD-kinematic but performs a bias and aerodynamics model estimation, is shown to find the best estimates of all parameters. It even finds good model parameters for the first arc, using only 1 straight line’s flight data. During the whole flight, $_{γ}^{τ} Δ_{f}$ of the VGD is kept around 0.2 m.

Conclusion

Accurate state and parameter estimation is essential for quadrotor control, especially when they perform aggressive maneuver. However, in the environment where only sparse and noisy position measurements are available, a classic Kalman filter can struggle to provide accurate state and model parameter estimation results. In this paper, we presented a novel method that only uses sparse vision measurements to estimate the AHRS error and select aerodynamic parameters of the quadrotor using a gradient descent method. The experiment result shows that our VGD could increase the accuracy of state estimation when compared to a classic Kalman filter in environments where only sparse noisy position measurements are available.

Footnotes

Declaration of conflicting interests

The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.

Funding

The author(s) received no financial support for the research, authorship, and/or publication of this article.

References

Moon

Sun

Baltes

et al . The IROS 2016 Competitions [Competitions]. IEEE Robot Autom Mag 2017; 24: 20–29. DOI:10.1109/MRA.2016.2646090.

Gross

Rhudy

et al . Flight-test evaluation of sensor fusion algorithms for attitude estimation. IEEE Trans Aerosp Electron Syst 2012; 48: 2128–2139.

Hoffmann

Huang

Waslander

et al . Quadrotor helicopter flight dynamics and control: theory and experiment. In: AIAA guidance, navigation and control conference and exhibit. Reston: American Institute of Aeronautics and Astronautics, Aug, 2007, pp. 1–20

Huang

Hoffmann

Waslander

et al . Aerodynamics and control of autonomous quadrotor helicopters in aggressive maneuvering. In: 2009 IEEE international conference on robotics and automation. Piscataway: IEEE, pp. 3277–3282.

Bangura

Mahony

Nonlinear dynamic modeling for high performance control of a quadrotor. In: 2012 Proceedings Australasian conference on robotics and automation, pp.1–10.

Svacha

Mohta

Kumar

Improving quadrotor trajectory tracking by compensating for aerodynamic effects. In: 2017 International conference on unmanned aircraft systems (ICUAS). Piscataway: IEEE, pp. 860–866.

Kai

Allibert

Hua

et al . Nonlinear feedback control of quadrotors exploiting first-order drag effects. In: IFAC World Congress, Toulouse, France, 2017, pp. 8189–8195

Bristeau

Callou

Vissiere

et al . The navigation and control technology inside the AR.Drone micro UAV. IFAC Proc Vol 2011; 44: 1477–1484.

Yoo

CSYCS

Ahn

IKAIK.

Low cost GPS/INS sensor fusion system for UAV navigation. In: Digital avionics systems conference, 2003. DASC’03. The 22nd, vol. 2. Piscataway: IEEE, pp. 8–A.

10.

Brown

AK.

GPS/INS uses low-cost MEMS IMU. IEEE Aerosp Electron Syst Mag 2005; 20: 3–10.

11.

Shi

An improved real-time adaptive Kalman filter for low-cost integrated GPS/INS navigation. In: 2012 International conference on measurement, information and control (MIC), vol. 2. Piscataway: IEEE, pp. 1093–1098.

12.

Lopes

Kampen

Chu

Attitude determination of highly dynamic fixed-wing UAVs with GPS/MEMS-AHRS integration. In: 2012 AIAA guidance, navigation, and control conference. Reston: American Institute of Aeronautics and Astronautics, p. 4460.

13.

Kingston

Beard

RW.

Real-time attitude and position estimation for small UAVS using low-cost sensors. In: AIAA 3rd unmanned unlimited technical conference, workshop and exhibit. Reston: American Institute of Aeronautics and Astronautics, pp. 2004–6488.

14.

Mellinger

Kumar

Minimum snap trajectory generation and control for quadrotors. In: 2011 IEEE international conference on robotics and automation (ICRA). Piscataway: IEEE, pp. 2520–2525.

15.

Bry

Richter

Bachrach

et al . Aggressive flight of fixed-wing and quadrotor aircraft in dense indoor environments. Int J Rob Res 2015; 34: 969–1002.

16.

Tomic

Schmid

Lutz

et al . Toward a fully autonomous UAV: research platform for indoor and outdoor urban search and rescue. IEEE Robot Autom Mag 2012; 19: 46–56.

17.

Hrabar

An evaluation of stereo and laser-based range sensing for rotorcraft unmanned aerial vehicle obstacle avoidance.

J Field Robot 2012; 29: 215–239.

18.

Valenti

Dryanovski

Jaramillo

et al . Autonomous quadrotor flight using onboard RGB-D visual odometry. In: 2014 IEEE international conference on robotics and automation (ICRA). Piscataway: IEEE, pp. 5233–5238.

19.

Bachrach

Prentice

et al . Estimation, planning, and mapping for autonomous flight using an RGB-D camera in GPS-denied environments. Int J Rob Res 2012; 31: 1320–1343.

20.

Sampedro

Bavle

Rodríguez-Ramos

et al . A fully-autonomous aerial robotic solution for the 2016 international micro air vehicle competition. In: 2017 International conference on unmanned aircraft systems (ICUAS). Piscataway: IEEE, pp. 989–998.

21.

Mostegel

Wendel

Bischof

Active monocular localization: towards autonomous monocular exploration for multirotor MAVs. In: 2014 IEEE international conference on robotics and automation (ICRA). Piscataway: IEEE, pp. 3848–3855.

22.

Huh

Shim

Kim

Integrated navigation system using camera and gimbaled laser scanner for indoor and outdoor autonomous flight of UAVS. In: 2013 IEEE/RSJ international conference on intelligent robots and systems (IROS). Piscataway: IEEE, pp. 3158–3163.

23.

Nistér

Naroditsky

Bergen

Visual odometry. In: 2004 Proceedings of the 2004 IEEE computer society conference on computer vision and pattern recognition. CVPR 2004, vol. 1. Piscataway: IEEE, pp. I–I.

24.

Andert

Ammann

Puschel

et al . On the safe navigation problem for unmanned aircraft: visual odometry and alignment optimizations for UAV positioning. In: 2014 International conference on unmanned aircraft systems (ICUAS). Piscataway: IEEE, pp. 734–743.

25.

Strydom

Thurrowgood

Srinivasan

MV.

Visual odometry: autonomous UAV navigation using optic flow and stereo. In: Australasian conference on robotics and automation (ACRA). Australian Robotics and Automation Association, pp. 1–10.

26.

Mondragón

Olivares-Méndez

Campoy

et al . Unmanned aerial vehicles UAVs attitude, height, motion estimation and control using visual systems. Auton Robot 2010; 29: 17–34.

27.

Rodolfo García Carrillo

Enrique Dzul López

Lozano

et al . Combining stereo vision and inertial navigation system for a quad-rotor UAV. J Intell Robot Syst 2012; 65: 373–387.

28.

Martínez-Carranza

Calway

Efficient visual odometry using a structure-driven temporal map. In: 2012 IEEE international conference on robotics and automation (ICRA). Piscataway: IEEE, pp. 5210–5215.

29.

Falanga

Mueggler

Faessler

et al . Aggressive quadrotor flight through narrow gaps with onboard sensing and computing using active vision. In: 2017 IEEE international conference on robotics and automation (ICRA). Piscataway: IEEE, pp. 5774–5781.

30.

Loianno

Brunner

McGrath

et al . Estimation, control, and planning for aggressive flight with a small quadrotor with a single camera and IMU. IEEE Robot Autom Lett 2017; 2: 404–411.

31.

Jang

Liccardo

Small UAV automation using MEMS.

IEEE Aerosp Electron Syst Mag 2007; 22: 30–34.

32.

Crassidis

JL.

Sigma-point Kalman filtering for integrated GPS and inertial navigation.

IEEE Trans Aerosp Electron Syst 2006; 42: 750–756.

33.

Zhang

Milios

et al . Navigation with IMU/GPS/digital compass with unscented Kalman filter. In: 2005 IEEE international conference on mechatronics and automation, vol. 3. Piscataway: IEEE, pp. 1497–1502.

34.

Lange

Sünderhauf

Protzel

Incremental smoothing vs. filtering for sensor fusion on an indoor UAV. In: 2013 IEEE international conference on robotics and automation (ICRA). Piscataway: IEEE, pp. 1773–1778.

35.

Indelman

Williams

Kaess

et al . Factor graph based incremental smoothing in inertial navigation systems. In: 2012 15th International conference on information fusion (FUSION). Piscataway: IEEE, pp. 2154–2161.

36.

Smeur

Chu

de Croon

GC.

Adaptive incremental nonlinear dynamic inversion for attitude control of micro air vehicles. J Guid Control Dyn 2015; 38(12): 450–461.

37.

Hoffmann

Huang

Waslander

et al . Quadrotor helicopter flight dynamics and control: theory and experiment. In: Proceedings of the AIAA guidance, navigation, and control conference, vol. 2. Reston: American Institute of Aeronautics and Astronautics, p. 4.

38.

Bangura

Mahony

et al . Nonlinear dynamic modeling for high performance control of a quadrotor. In: Australasian conference on robotics and automation. pp. 1–10.

39.

Gati

Open source autopilot for academic research – the paparazzi system. In: American control conference (ACC), 2013. Washington, DC: IEEE, pp. 1478–1481.

In-flight model parameter and state estimation using gradient descent for high-speed flight

Abstract

Keywords

Introduction

Related work

Quadrotor model

Dynamic model of quadrotor

IMU misalignment

Aerodynamic model

AHRS bias model

Full model

State estimation

Vision-based EKF

Vision-based gradient descent method

Experiment setup and result

Experiment setup

Analysis of VGD

Comparison between VEKF, VGD and VGD-kinematic

Conclusion

Footnotes

Declaration of conflicting interests

Funding

References