Abstract
The exploration of large-scale unknown environments can benefit from the deployment of multiple robots for collaborative mapping. Each robot explores a section of the environment and communicates onboard pose estimates and maps to a central server to build an optimized global multi-robot map. Naturally, inconsistencies can arise between onboard and server estimates due to onboard odometry drift, failures, or degeneracies. The mapping server can correct and overcome such failure cases using computationally expensive operations such as inter-robot loop closure detection and multi-modal mapping. However, the individual robots do not benefit from the collaborative map if the mapping server provides no feedback. Although server updates from the multi-robot map can greatly alleviate the robotic mission strategically, most existing work lacks them, due to their associated computational and bandwidth-related costs. Motivated by this challenge, this paper proposes a novel collaborative mapping framework that enables global mapping consistency among robots and the mapping server. In particular, we propose graph spectral analysis, at different spatial scales, to detect structural differences between robot and server graphs, and to generate necessary constraints for the individual robot pose graphs. Our approach specifically finds the nodes that correspond to the drift’s origin rather than the nodes where the error becomes too large. We thoroughly analyze and validate our proposed framework using several real-world multi-robot field deployments where we show improvements of the onboard system up to 90% and can recover the onboard estimation from localization failures and even from the degeneracies within its estimation.
1. Introduction
Over recent years, an abundance of localization and mapping frameworks have been proposed and successfully deployed in various robotic scenarios. As part of this research, many traditional SLAM challenges have been fully or partially addressed. Despite this development, new challenges readily arise with the need for more robotic autonomy and the deployment of heterogeneous robotic teams in large-scale environments. In particular, an increase in the number of deployed robots and autonomy requires a higher degree of robustness and efficiency. At the same time, scalability and persistence across all systems become a pertaining issue. While it is difficult to maintain a consistent estimate of the environment across all employed systems, it is an essential prerequisite for operating robotic teams in applications like disaster response or search and rescue.
With the recent advent of high-bandwidth mobile networks such as 5G networks, collaborative robotic approaches have received increased attention in the robotics community due to their improved practical feasibility. A promising research direction is to employ a centralized mapping approach. Centralized servers running in the local network or a remote cloud environment have more computational capacity than individual robots. Therefore, they can perform expensive operations such as global optimizations, loop closing, and exploitation of all available sensor data to improve accuracy and overcome onboard failures.
Most collaborative mapping approaches focus on building accurate maps on the server and ignore the use of global multi-robot information to provide localization corrections to individual robots. Especially in centralized settings without feedback, pose estimation discrepancies may arise between robots during large missions leading to severe drift between robot and server maps resulting in increased optimization time at the server for collaborative mapping. Therefore, it is desirable to provide additional constraints to improve onboard estimation and collaborative mapping performance for large-scale multi-robot missions.
Furthermore, multi-robot missions often deploy a heterogeneous set of robots,
This paper proposes a novel multi-robot pose graph consistency approach independent of the underlying robot pose estimation processes. Our proposed approach relies only on a sparse abstraction of the estimated poses in • Graph spectral analysis of pose graphs to identify discrepancies between onboard and server pose graphs. • Automatic adaptive inference of multi-scale constraints to correct onboard estimation failures. • Comparison against current state-of-the-art approaches on datasets and a thorough quantitative analysis on large-scale multi-robot field deployments are presented to validate the proposed approach.
2. Related work
In this section, we review the state-of-the-art collaborative multi-robot localization and mapping approaches as well as the current applications of graph signal processing and degeneracy and failure detection.
2.1. Collaborative multi-robot mapping
Collaborative multi-robot approaches can be distinguished into centralized (Deutsch et al., 2016; Karrer et al., 2018; Schmuck and Chli, 2019) and distributed solutions (Cunningham et al., 2013; Dong et al., 2015). Deutsch et al. (2016) proposed a vision-based centralized multi-robot SLAM approach where a mapping server performs loop closures and replaces robot pose graphs with corrected graphs. A similar approach was proposed by Schmuck and Chli (2019) in which robots send local maps to a mapping server which then returns optimized keyframes and landmarks to each robot to include in their onboard optimizations, thus increasing the bandwidth requirements for real-world robot deployments. The work of Van Opdenbosch and Steinbach (2019) proposes an encoding and decoding of visual features during the transmission of the maps to reduce the required bandwidth. CoSLAM (Zou and Tan, 2013) proposes to make use of GPU computing to circumvent the need for large computational processes and improve the speed of onboard optimizing tasks, hence requiring a GPU onboard individual robots. Zhang et al. (2021) presents an incremental and centralized solver for multi-robot SLAM based on a Bayes tree.
Different from vision-only approaches, LAMP (Chang et al., 2022; Ebadi et al., 2020) proposes a large-scale collaborative multi-modal SLAM framework. However, their proposed approach does not provide any pose corrections from the centralized server to the individual robots.
In contrast to centralized approaches, distributed approaches require each robot to run a full onboard SLAM solution (Dong et al., 2015) and share marginalized information with other robots (Cunningham et al., 2013), thus making full information available to each robot. Additionally, they have the advantage of scaling well to large swarms of robotic systems (Ziegler et al., 2021) but typically increase the onboard compute requirements significantly. Tian and How (2022) proposes a distributed algorithm for communication-efficient rotation and translation estimation in multi-robot SLAM by collaboratively solving a Laplacian system.
A crucial aspect of multi-robot SLAM is the ability to incorporate inter-robot loop closures. Kim et al. (2010) aims to achieve consistent maps across multiple robots independently of the employed sensing modalities by detecting loop closures between robots and connecting their pose graphs. In the same direction, Mangelson et al. (2018, 2019) aim to robustly select inter-robot loop closure candidates by maintaining pair-wise consistent measurements. More recently Lajoie et al. (2020) proposed a distributed system with distributed loop closure detection.
The more robots are deployed for a specific task, the more information needs to be processed, potentially leading to delays or longer processing times, especially for components such as the factor graph optimization. Recently, COVINS (Schmuck et al., 2021) demonstrated a collaborative deployment of 12 individual agents while maintaining a reasonable collaborative trajectory error. Although their system propagates optimized poses from the centralized server back to individual agents, the poses are only used for drift quantization by comparing the optimized to the onboard estimate. Thus, the onboard pose estimations are not corrected.
Concluding, many existing approaches are limited to a single modality only (Deutsch et al., 2016; Karrer et al., 2018; Lajoie et al., 2020). These methods are often incorporated in tightly coupled multi-robot frameworks, exchanging large data structures such as descriptors (Tian et al., 2022), partial or complete (Schmuck and Chli, 2019) factor graphs. As a consequence, the systems become less flexible and maintain little versatility for the application of different robotic tasks. Conversely, this paper proposes to detect discrepancies between the robot graphs using spectral analysis and a sparse abstraction of the server graph to generate an individual set of constraints for each robot. This has the benefit that only the pose information is shared between the robots and we can retain a low profile by means of bandwidth requirements. Moreover, since our framework only transmits the poses, our approach does not require any expensive marginalization procedures. Hence, the proposed approach achieves high accuracy and mapping consistency while maintaining low network and compute requirements.
2.2. Failure and degeneracy detection
Pose estimation from onboard sensors is subject to drift (accumulation of small errors) and to degeneracies (errors due to specific sensor modality’s deficiency). Recognizing such errors enables corrective actions to avoid possible catastrophic losses (
We approach the problem differently by taking into account the underlying graph structure, precisely its spectral properties. Thus, making our approach independent of the employed sensor system and enabling us to evaluate the discrepancies at multiple scales to be more precise when resolving the spurious estimations.
2.3. Spectral graph and graph signal processing
Spectral graph theory is an active research area and has gained popularity in the past years in the context of robotics. Spectral graph theory approaches have been proposed for robotic mapping (Brunskill et al., 2007), planning (Indelman, 2018), and more recently, in combination with graph neural networks for various robotic tasks (Chandra et al., 2020; Moon and Lee, 2020). In general, graphs are irregular structures that do not depend on the underlying manifold of the nodes. Thus, graphs are perfectly capable of modeling large, complex, or distributed problems (Mateos et al., 2019). Khosoussi et al. (2019) evaluates the correlation between the Fisher information matrix and the Laplacian matrix in the context of SLAM.
Furthermore, Egilmez and Ortega (2014) proposes an anomaly detection for spatial proximity of graph nodes using spectral graph filtering. Furthermore, graph signal processing aims at applying signal processing techniques on graph structures, thus allowing the use of existing concepts such as the Laplacian operator (Sandryhaila and Moura, 2014) and multi-scale analysis (Hammond et al., 2011, 2019). Similarly, Donnat et al. (2018) aims to learn a multi-scale structural embedding using graph wavelets by treating the wavelet coefficients as a probability distribution. A good introduction and overview of graph signal processing are presented in Ortega et al. (2018). Recently, Doherty et al. (2022) proposes a spectral sparsification of pose graphs to deal with the limited computational capacity in long-term SLAM.
Our approach performs a structural analysis of graph signals using a spectral representation to detect discrepancies between two factor graphs. Consequently, our approach fits in a centralized multi-robot SLAM scenario where the robots are susceptible to drift. Our approach exploits graph Wavelets, localized on the individual nodes in the graph to perform a multi-scale analysis of the structural properties. Therefore, our approach enables estimating the severity of the inconsistency of the individual pose graphs.
3. Preliminaries
This section introduces the fundamental and necessary concepts for analyzing and comparing graph structures in the graph spectral domain. We first introduce the underlying methods to use graphs for modeling complex problems. Next, the analysis of harmonic signals in the Euclidean and graph domain are covered.
3.1. Fundamental graph theory review
In this work, we exploit the graph structure that serves as the primary foundation for most modern SLAM backends (Cadena et al., 2016). In particular, we extract the pose information of the factor graphs,
A graph
Finally, signals in traditional signal processing are often expressed as functions over time, such as
3.2. Euclidean and graph spectral analysis review
Integral transformations such as the Fourier transform project a signal onto a Hilbert space enabling the analysis of the characteristic properties of that signal. The data in the projected space is a compact representation denoting the information that is more or less prevalent in the input data. However, different aspects, such as the duration, bandwidth, and discretization, need to be carefully considered for a correct analysis of band-limited signals and to avoid unfavorable effects such as leakage and Gibbs ringing.
The Fourier transform is a fundamental tool for various applications in many different fields ranging from image-processing to data science (Rao et al., 2010). Specifically, the Fourier transform expresses a signal
The Laplace operator is a linear operation that expresses the divergence of a given function’s gradient. Likewise, a similar intuition can be applied for deriving a Laplace operator for irregular graph domains. In concrete, the Laplace operator Δ is replaced by the Laplacian matrix
In particular, the graph Fourier transform of a graph function
The construction of the graph Fourier transform using Equations (3) and (4) implies that all the spectral properties are given by the connection of the nodes. Thus, the modeling of the relationship between the individual nodes constitutes a crucial component of the system.
4. Collaborative multi-robot mapping
This section details the proposed method, for which an overview is presented in Figure 1. Overall, the aim is to identify graph nodes with high drift that will lead to large errors and correct them with only a few constraints. The proposed approach comprises the following core components: (i) Onboard localization and mapping, (ii) Mapping server at the base station, and (iii) Pose graph comparison and correction. Overview of our approach. We consider multiple robots simultaneously exploring an environment and sending incremental maps to a centralized mapping server. The server accumulates all robot maps and jointly optimizes them. A relaxation of the collaborative multi-robot map is sent back to the robots, where a multi-scale graph spectral analysis is performed to identify discrepancies onboard and server maps and to generate necessary constraints for making them consistent.
4.1. Centralized mapping and localization
Each robot performs onboard mapping and localization to provide an odometry estimate of its current position as well as to create a local map of the environment. Particularly, we don’t anticipate that the onboard system must perform any loop closure detection or re-localization methods to reduce their drift. These methods typically require large computational capacities and might, in real-world deployments, inhibit crucial processes like control, odometry, and navigation. Hence, in our setup, each robot solely focuses on estimating the relative motion and building an onboard map without optimizing it. In more detail, the maps are factor graphs with visual landmarks and IMU constraints in our setup. Each map is keyframed before sending it to the mapping server. The onboard map including visual descriptors and pose estimates are then incrementally serialized and sent to a centralized mapping server for additional processing. Figure 2 provides an overview of the employed graph structures. Different employed graphs: (a) Onboard visual-inertial graph, which will be incorporated into the global server map. (b) Graph abstractions, including Kron reduction. (c) Synchronized graphs. (d) Multi-scale spectral graph analysis. With the results from (d) the system queries poses in 
More specifically, the mapping server takes the role of a high-performance computing resource and acts as a centralized communications hub for all the robots. Hence, the mapping server constitutes the core component of our collaborative mapping approach. In particular, all incoming robot maps are accumulated, merged into a single collaborative multi-robot map, loop closed, and globally optimized (see Figure 2(a)). Consequently, all the computational-intense multi-robot operations are delegated to the central mapping server while the robots only perform the initial map building.
Since the onboard maps are sent incrementally, the mapping server needs to ensure continuous operation on the global multi-robot map,
Furthermore, at the end of each iteration, the mapping server constructs a global multi-robot graph from the optimized pose information and broadcasts it on the network (see Figure 2(b)). Notably, the sent global graph is an undirected graph that does not contain any sort of constraints such as IMU factors, visual terms, or LiDAR terms but rather only comprises the positional and rotational information of the nodes in the form of vertices. Furthermore, we create additional edges to model the relationships between the nodes.
Each time a robot receives a global graph, it replaces the previously received graph with the newly received graph. Thus, the comparison of the onboard estimation and the globally optimized solution of the server happens at the individual robots using the pose information in SE(3).
4.1.1. Global multi-robot graph
The global graph sent to the robots encapsulates the global knowledge of the environment in a compact representation and contains crucial information such as the last known positions of all robots. The global graph is built by defining representative nodes for each incoming submap. The representative nodes can be freely chosen but ought to reflect the robot trajectories to some degree. We found that keyframing heuristics such as minimum distance and rotation between consecutive nodes work well in our experiments.
Moreover, since the global map is continuously optimized, the graph is not immediately built after every operation, but only a reference to each node is maintained. When an update is triggered after the global optimization, the
This graph captures the relationship between all the different vertex within the factor graph. Since we cannot only rely on connectivity information such as odometry constraints or co-visible landmarks to build this graph, a radius search is performed around each vertex in the graph. The weight of adjacent nodes is calculated using a squared exponential function where the weight decreases with increasing distance Δ,
The distance function Δ(
In our previous work (Bernreiter et al., 2022), we employed an abstraction from the factor graph, the so-called proxy graph, that only consisted of the positional information in
Therefore, in this paper, we employ a combination of spatial proximity and viewpoint coherence using a distance metric in
Moreover, it is important to note that other realizations are possible distance metrics,
4.1.2. Graph reduction
Our framework allows for multiple different reduction schemes, for which all approaches operate on the server graph
When the global graph
Moreover, it is important to note that the larger the server graph
The choice of which nodes to keep in the reduced graph should rely upon the information content and how prominent a specific node is. Most interestingly, the magnitude of the eigenvalues of
It is important to note that the Kron reduction introduces more edges to the graph as a result of preserving the spectral properties of that graph. Despite more edges in the graph, the required bandwidth for transmission is still little as nodes require significantly more memory while adding more edges solely changes the adjacency matrix. Therefore, communication-wise, our proposed approach benefits from having fewer entries in the adjacency matrix and fewer poses to transmit.
As a next step, we perform a graph spectral analysis using
4.2. Spectral analysis of graph signals
The analysis of spectral components of band-limited signals is a well-established and widely used technique in engineering and research. This paper benefits from the spectral representation by exploiting the theory of spectral analysis and signal processing defined on graphs to improve collaborative multi-robot mapping.
Figure 3 denotes an overview of our proposed spectral analysis of factor graphs by showing an exemplary comparison. In contrast to the standard spectral analysis that operates, Overview of the proposed spectral analysis of the factor graphs, one from the mapping server (blue) and one from the onboard estimation (red). (A) Shows the server graph with eight nodes and corresponding graph abstractions along with the signal 
4.2.1. Graph comparison
After the robot has received an update containing a global multi-robot map, chronological synchronization of the estimates is necessary due to the keyframing at the centralized server. We assume that the robots have synchronized clocks with the mapping server, thus, the synchronization results in a one-to-one mapping of the global graph
Generally, Wavelets are a well-known time-frequency representation and a very efficient and flexible for a variety of different tasks in signal processing problems Hammond et al. (2019), such as compression (Finder et al., 2022), image-processing (Huang and Dragotti, 2022), and classification (Gorgel et al., 2009). In traditional Wavelet analysis, a signal
Analogously, the graph Wavelet transform can be derived using the graph Fourier transform and a Wavelet filter kernel on
Next, the graph Wavelet coefficients are computed by expanding the graph signals into the basis of
We employ the Meyer Wavelet due to its good localization in the graph and frequency domain. Generally, larger values for
The graph signal
In summary, we compute Wavelet coefficients
4.3. Correcting onboard estimation
We propose to compute a similarity metric between the graph Wavelet coefficients of the individual signals to identify the differences between the estimations. Therefore, for a node
Intuitively, since graph Wavelets are localized at a specific node
Our approach combines several scale levels by accumulating the distances in equation (14) over multiple scales. Accordingly, three separate cases are distinguished: a large difference in the lower, mid, and higher scales of the coefficients ( Illustration of three different relative constraint types. Based on the scale of the structural difference, additional constraints are added to correct (a) adjacent, (b) close neighborhood, (c) between submaps, or (d) on degenerate poses.
4.3.1. Updating the onboard graph
Each robot runs an onboard incremental graph optimization proposed by Kaess et al. (2012) to incorporate the odometry constraints as well as the additional constraints provided by our algorithm.
Since the mapping server will continuously optimize the global multi-robot map and incorporate new inter- and intra-robot loop closures, the constraints can change over time,
New constraints are not directly added to the onboard graph, but we rather buffer all incoming constraints and apply them at once in a batch. This avoids the graph being accessed for every constraint that is identified.
Additionally, since the global multi-robot graph is modified over time, new constraints are also generated if nodes that were previously labeled as drifting have changed but have not been identified as drifting nodes during the current comparison. Consequently, we check the orientation and position of each labeled node and provide an updated constraint accordingly. The onboard graph system replaces old constraints when new constraints are provided. This is important since loop closures at the mapping server can significantly alter the global graph. Accordingly, the newly generated constraints could contradict the already existing constraints in the onboard graph and, therefore, need to be updated. Otherwise, the onboard optimization could diverge or produce impaired results due to conflicting constraints.
Lastly, to make our approach more flexible, we introduce a stopping heuristic when the feature vector
5. Experiments
We thoroughly evaluate the proposed framework with its different configurations and demonstrate its real-world application using various datasets comprising aerial and legged robots. First, we validate our approach and compare its performance to the current state-of-the-art methods using the EuRoC (Burri et al., 2016) dataset sequences to simulate multi-robot deployments. Next, we demonstrate the real-world performance of our framework during a multi-robot autonomous exploration and mapping mission conducted in an underground tunnel system using ANYmal (Hutter et al., 2017) legged robots. Finally, a multi-robot experiment conducted in indoor and outdoor environments demonstrates the localization recovery for an individual agent in case of onboard localization failure.
For all experiments, we use a radius search of 7 m around the nodes in the global multi-robot map to construct the global graph using equation (5). The root-mean-square of the absolute trajectory error denoted as RMSE, is used as an evaluation metric for all experiments. Moreover, unless otherwise stated, our framework is continuously running along with the onboard estimation and performs the proposed spectral analysis every 20 s. In all experiments, the weighted adjacency matrix of the global multi-robot map is built using the
It is important to note that each robot also employs a
Finally, the mapping server provides an updated multi-robot graph to the robots after performing one cycle of operations.
5.1. EuRoC dataset: Validation and comparison
We compare the performance of our proposed approach against the current state-of-the-art collaborative mapping frameworks (Campos et al., 2021; Karrer et al., 2018; Schmuck and Chli, 2019) using the Machine Hall (MH) sequences from the EuRoC dataset to evaluate and validate our single- and multi-robot performance. For each sequence, ROVIO (Bloesch et al., 2017) is used to provide monocular visual-inertial odometry for individual aerial agents.
RMSE Comparison for the EuRoC Dataset. The top part shows the results of single and collaborative approaches, while the bottom row shows the individual corrected results.
aSingle robot results from Qin et al.(2018). Collaborative result from Schmuck and Chli (2019)
bMonocular visual-inertial results from Campos et al. (2021)
cAs reported in Schmuck and Chli (2019)
dResults from our previous work Bernreiter et al. (2022)
Onboard pose RMSE after adding constraints from the centralized server for different dataset combinations.
Onboard RMSE after correction w.r.t. different kron reduction levels. The right value denotes the number of nodes in the server graph after the reduction. With the increasing level of reduction, the efficiency of the algorithm decreases due to having less constrained nodes in the onboard factor graph.
After the reduction, the server graph contains fewer nodes that are sent to the robots. Thus, it requires less bandwidth for the transmission to the individual robots. However, the synchronization of the onboard estimate with
5.2. Analysis of robotic drift recovery
We conducted an experiment in a particularly challenging environment for LiDAR localization due to the absence of surrounding geometric structure to demonstrate the utility of collaborative mapping towards localization recovery for an individual robot in case of an onboard estimation failure. Two ANYmal robots were simultaneously deployed in an indoor office environment connected to an outdoor rooftop terrace. Each robot is equipped with a Velodyne VLP-16 LiDAR and a Sevensense Alphasense visual-inertial sensor. Both sensors are synchronized within the onboard computer.
Onboard robot odometry estimation and mapping are performed by CompSLAM (Khattak et al., 2020) and, along with the required visual and pointcloud data, are sent to the mapping server whenever the robots are within the communication range. The first robot performs a loop indoors while the second robot transitions outdoors through a narrow doorway, navigates a rectangular path, and returns indoors.
5.2.1. Localization recovery
Due to the absence of surrounding structures on the outdoor terrace, the onboard localization drifts significantly, skewing the onboard robot map. Nevertheless, the collaborative mapping approach is able to generate a consistent map of the environment, as shown in Figure 5, due to its inter- and intra-robot loop closure capabilities. Mapping results for the indoor/outdoor dataset. Cyan denotes the ground truth trajectory, red the onboard estimation, and green the corrected trajectory. The robot map of ANYmal 1 (red) is misaligned due to the onboard localization failure (A) and (B), but can be fixed using constraints provided by the centralized mapping server.
We created a ground truth map using a Leica RTC360 scanner to evaluate the proposed collaborative mapping framework’s performance and quantify the effect of the integration of collaborative corrections on individual robot pose accuracy. Ground truth robot poses were then computed by registering individual robot pointclouds against the ground truth map following the approach of Ramezani et al. (2020).
Comparison of the RMSE of the onboard estimation before and after the supplying additional constraints. The timings refer to the time needed to perform the final onboard factor graph optimization.
In contrast, our approach adds not only fewer constraints to the optimization but also achieves higher accuracy. In particular, the multi-scale spectral analysis enables our approach to identify and construct different types of constraints for nodes that have a higher significance on the reduction of the drift and, thus, also on the overall error. In more detail, the submap constraints especially have a great potential to improve the accuracy but need to be placed carefully at drifting nodes to avoid additional computational overhead. Moreover, the submap constraints are notably useful for correcting large-scale drifts, while the smaller constraint types perform a local refinement. Consequently, the submap constraints play the most influential role in the correction and recovery of the onboard drift.
Lastly, we present an evaluation of the runtime per individual component in the graph client in Figure 6. The graph building component includes the building of the initial graph structure as well as the Wavelet estimation. The total size of this graph was 190 nodes, and six Wavelet scales were computed during the comparison. The Wavelet estimation always computes the Wavelet coefficients from scratch and does not reuse older computations. We leave it for future work how these coefficients can be computed incrementally by reusing the previous result (Andreopoulos and Van der Schaar, 2008). Additionally, it is important to note that this step only has to be performed once per graph update from the server, while the other component can run multiple times per graph. Consequently, the increased runtime for the graph building has important practical implications since it is significantly higher than the runtime required for adding the constraints and optimizing the resulting graph. As a result, the deployment of our proposed solution on robotic systems requires the evaluation and configuration of different components, Evaluation of the runtime performance per individual component. These timings represent the result when using the final full graph of the indoor/outdoor dataset. It is evident that the graph generation and the wavelet estimation takes the biggest portion of the runtimes.
5.2.2. Analysis of the constraint generation
In this experiment, we investigate which and where constraints are generated to fix the onboard estimation. We additionally compare the performance of different distance metrics and construction approaches of the graphs in comparison to our previous work (Bernreiter et al., 2022).
In more detail, we configure our framework to perform the signal and edge weight calculations in different manifolds. Specifically, we investigate three different instances of our proposed framework where we use: (i) L2-Norm in
The resulting constraints for each selection strategy are presented in Figure 7. It is evident that selecting only the top constraint candidates leads to primarily adjacent constraints for Illustration of the different types of constraints added by using (A) 
Evaluation of different graph construction and signal generation strategies for the ANYmal 1 indoor/outdoor dataset.
Notably, the top 10 constraints for
Furthermore, the results also indicate superior performance of the
Moreover, we present an analysis of the spectral components in Figure 8. Here, we compare the estimated discrepancies when using Illustration of the discrepancies in the Wavelet coefficients mapped onto each node in the graph. The left image shows the discrepancies for the case of 
5.3. Large-scale multi-robot subterranean exploration
We demonstrate the suitability of our approach for complex real-world applications by utilizing it during an autonomous multi-robot exploration (Dang et al., 2020) and mapping mission conducted at the Hagerbach underground facility in Switzerland. In this experiment, three ANYmal quadrupedal robots were deployed during an hour-long mission and autonomously navigated distances of roughly 1.2 km, 1.1 km, and 600m, respectively. Each robot is equipped with the same sensor payload as described in the previous experiment.
The robots individually explore the environment and have multiple overlapping regions with inter-robot loop closures. In this experiment, the onboard estimation is stable but is subject to drift, especially given the large scale of the exploration. The collaborative multi-robot map and individual robot maps are shown in Figure 9, with quantitative results presented in Table 6. We demonstrate that the collaborative mapping approach greatly benefits from the feedback constraints by substantially reducing the onboard pose estimation error, leading to a globally consistent map between all the individual robotic systems. In particular, the combination of all three types of constraints facilitates the accurate reduction of onboard estimation errors. Hence, our spectral comparison and corrections especially promote long-term robotic mission where the onboard localization and mapping approach is susceptible for drift. Illustration of the multi-robot map built by our approach. The left image (A) shows the global multi-robot map at the mapping server comprising three individual robot missions. The grayscale images depict exemplary regions using the onboard camera of the robots. The right image (B) shows a top-down view of the individual corrected onboard maps. RMSE Comparison for the Original onboard, Server, and Corrected onboard Graph. We additionally Provide Insights Into the Sparsity of the Problem by Showing the Total Number of Components (N) and the Number of Non-zero Components (NNZ) per Dataset.
In this experiment, we configured our approach to provide as many constraints as possible to achieve the lowest possible onboard trajectory error. It is important to note that there is a trade-off between the incorporation of the constraints into the onboard graph and the resulting improvement it yields. Table 6 also lists the number of factors in the optimization problems along with how sparse the problem is.
Specifically, when incorporating more constraints in the onboard graph, it is evident that the optimization has to deal with more factors in it. In addition, the optimization problem becomes less sparse as there are more interrelations between the individual factors. Hence, when using an incremental solver, such as iSAM2 (Kaess et al., 2012), as it is running on each robot, it might happen that a large portion of the newly added constraints are factors between nodes that are significantly far away in the internal state matrix, invalidating the locality assumption of the solver along with all its performance optimizations. As a consequence, the optimization needs to perform more frequently expensive operations and can take a longer time to incorporate the constraints of our approach into the optimization problem. This is a particular problem for long-term and large-scale applications where our approach needs to be configured such that the number of constraints is at a reasonable level and does not introduce additional computational burden at the onboard systems.
5.4. Degeneracy in the state estimation
Next, we investigate the application of our approach in the case of a degenerate condition in the onboard state estimation. In particular, each robot employs a CompSLAM as a LiDAR-based state estimation (Khattak et al., 2020) in this investigation. Additionally, we incorporated for comparison a degeneracy check for the state hessian matrix (Zhang et al., 2016) during the scan-to-scan and scan-to-submap steps. More specifically, we evaluate the eigenvalues of the state hessian matrix and, if they are above a threshold of 30, consider the robot as degenerate, allowing us to compare the constructed constraints at the time of degeneracy.
The experiment took place in a subterranean cave in Switzerland and comprised two legged ANYmal robots and one flying tricopter (Tranzatto et al., 2022). Similar to the other experiments, all robots deploy the same sensor payload. The flying tricopter as well as one of the ANYmal robots, explore the cave environment without any issues. However, one of the ANYmal robots enters a long narrow tunnel where its onboard estimation system degenerates for a short period, causing shifted maps in the onboard mapping system. This is particularly critical as a safe exploration of the cave cannot be guaranteed any longer for this robot, and it needs to be grounded.
Since our approach is not directly applicable to the detection of degenerate states, we investigate whether, by providing additional constraints, the onboard estimation can recover from it. Our approach employed a constraint generation strategy of the top 15 constraints and can seamlessly recover from the incorrect state by only using a few constraints. The global multi-robot map comprises all three robots and, along with individually corrected robot trajectories, is shown in Figure 10 together with the degenerate and corrected onboard estimation. Illustration of the global multi-robot map with corrected robot trajectories (A). The grayscale images depict exemplary regions using the onboard camera of the robots. As the environment is not illuminated the robots also carry onboard illumination. The state estimation of ANYmal 2 became degenerate during the exploration of a long tunnel (B). The multi-modal mapping server was able to overcome the degeneracy and, by providing additional constraints, fixed the onboard estimation (C).
The degeneracy affects the relative position estimation in the direction of the tunnel,
Although the multi-modal mapping server requires at least a few optimization cycles to repair the broken map, our approach can still recover the onboard map reasonably. Hence, enabling the robot to continue the exploration of the underground environment.
In addition, Figure 11 visualizes in detail which constraints our proposed approach sent to the onboard graph in comparison with the detected degeneracy using the approach proposed by Zhang et al. (2016). Most interestingly, the degeneracy in the state estimation of the robot leads to that the prevalence of constructed constraints being between adjacent nodes in the affected area. Only a few constraints are added as 5-hop constraints between the nodes. Since the degeneracy does not properly estimate the transformation between multiple consecutive steps, the small scales are more predominant than the other scales. Illustration of the employed constraints constructed by our proposed approach at the degenerate region. The degeneracy was detected after the turn on the way back into the cave.
5.5. DARPA subterranean challenge
Finally, to show our approach’s pertinence to a real-world autonomous multi-robot search and rescue mission, we employ it on the DARPA Subterranean (SubT) dataset of Team CERBERUS (Tranzatto et al., 2022). The SubT Challenge was an international robotics competition for fast and autonomous exploration in complex underground environments, where team CERBERUS won the final event. Each participating team had to deploy a robotic team that explores an unknown environment reporting the location of specific artifacts in it.
Team CERBERUS’ dataset comprises four ANYmal legged robots covering roughly 2 km of semi-autonomous exploration within a one hour mission. During the exploration, multiple robots communicated with a central mapping server, providing visual feedback to a human supervisor. The environment of the final SubT Challenge consisted of three individual regions: (i) a tunnel, (ii) an urban (iii) and a cave environment making it particularly challenging for onboard localization and mapping modules.
Although the proposed approach was not deployed during the competition, we show qualitative, post-processed results of the corrected onboard estimations in Figure 12 with respect to the ground truth map. Corrected robot trajectories using our approach with respect to the ground truth map.
Comparison of the RMSE of the onboard estimation before and after the supplying additional constraints. Using our proposed approach, the corrected trajectories achieve similar accuracy as the globally optimized server trajectories.
It is important to note that Team CERBERUS scored 23 points along with the second-best team and won due to a tie-breaker rule. During the exploration of ANYmal 4, it reported an incorrect location of a cell phone (artifact ID:
During the final run, the reported position of artifact Illustration of the detection of artifact 
6. Conclusions
This paper proposed a novel collaborative multi-robot framework for updating the pose graph of individual robots with constraints from a centralized mapping server. Our approach benefits from spectral representations of the graph Laplacian matrix. In this context, we presented a graph-based spectral Wavelet analysis of the robot and server graphs to identify the underlying structural differences in the onboard estimation. In particular, our approach computes signals in
The presented results demonstrate the real-world potential of the proposed approach using small- and large-scale multi-robot field deployments in challenging environments. Additionally, we show results for datasets of up to four robots simultaneously, yielding important implications for both research and industry.
We intend to continue our research in two directions. First, we will investigate the construction of graph hierarchies to circumvent the accuracy trade-off for the reduction levels. Second, we plan to explore the possibility of keeping the onboard graph sparse using a minimal spanning tree that keeps only a limited set of the most prominent constraints in the onboard graph.
Footnotes
Acknowledgments
The authors are thankful to Marco Tranzatto, Patrick Pfreundschuh, Samuel Zimmermann and Timon Homberger, Gabriel Waibel for their assistance with field experiments.
Declaration of conflicting interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This work was supported as a part of NCCR Robotics, a National Centre of Competence in Research, funded by the Swiss National Science Foundation (grant number 51NF40_185543).
