Sage Journals: Discover world-class research

Abstract

For safe and effective grasping in a dynamic environment, planning algorithms need real-time to deal with changing the target’s movement and obstacles. This paper proposes a new sequential Sense-Plan-Act (SeqSPA) dynamic grasping framework to generate a robot’s real-time and smooth grasping trajectory. Specifically, we cluster all stable grasps of the target, transform the clustering centers into pregrasps, and predict the future motion of the moving objects by using the observed values. The trajectory optimization algorithm constructing the approximative joint space gradient field can generate a smooth trajectory for a 6-DOF industrial robot arm within 2 ms. Our method generates trajectories for multiple pregrasps and selects the time-optimal trajectory for execution. Simulation comparison and actual experiments verify that our framework can immediately respond to environmental changes and efficiently find a grasping trajectory of the near-optimal time. The trajectory optimization algorithm in the framework can also be used alone to generate a real-time grasping trajectory when the prediction module cannot accurately predict the target motion.

Keywords

Dynamic grasping motion planning online trajectory generation manipulation

Introduction

Many automated tasks require robots to operate objects. Usually, robots operate targets in a static environment with fixed trajectories. However, the robot must generate trajectory autonomously when the environment changes dynamically. For example, in dynamic assembly, grasping and human-machine collaboration, targets and obstacles interacting with the manipulator may move with unknown motion.

Allowing robots to manipulate objects in a dynamic environment can improve the system’s overall efficiency but present more significant challenges. First, motion prediction of dynamic objects becomes extremely important, requiring higher real-time and robustness for visual tracking and prediction. Secondly, the existing grasping planning algorithms filter and sort the grasp poses and only use one grasp to plan a trajectory. In a dynamic environment, especially in the case of moving obstacles, the optimal grasp pose cannot be determined only by a grasping planning algorithm. Due to the change in the external environment, the non-optimal grasp pose may be more conducive to dynamic grasping when considering the whole capturing trajectory. Finally, continuous changes require online and fast motion replanning. With the change in the motion state of the target, we must update the trajectory of the manipulator and grasping pose in real time. Sampling-based algorithms generate random, unstable and fluctuating paths in each replanning, which does not meet these requirements. Optimization-based algorithms consume more time in cluttered scenes and are unsuitable for replanning in a dynamic environment.

Aiming at the problem of existing algorithms, we propose a new dynamic grasping framework in this paper. First, Bézier trajectories are used to predict target and obstacle trajectories. Secondly, ideally, the framework plans trajectories for all grasp poses and selects the optimal trajectory for execution. However, the calculation is heavy, and the trajectories of the similar grasp poses are also close. Therefore, it is necessary to cluster the grasp poses, take each clustering center as the target for trajectory planning, and select the optimal trajectory for execution. Finally, this paper simplified our earlier motion planning algorithm.¹ The trajectory planning module calculates the intersection time, use B-spline curves to represent joint trajectories, and generates smooth, collision-free and dynamically feasible trajectories after nonlinear optimization. The trajectory planner has good real-time performance and can be used in the whole grasping framework or alone. The grasping process for a moving object is shown in Figure 1.

Figure 1.

A mechanical arm is grasping a moving object. We online predict the object motion, plan trajectories for multiple pregrasps, and execute the shortest time of the trajectory. The gripper reaches the pregrasp position and then grasps and lifts the object.

The main contributions of this paper are summarized as follows:

This paper makes the motion planner¹ suitable for multi-pregrasps real-time planning. After predicting the intersection time, we construct a joint space gradient field and use nonlinear optimization to calculate the joint trajectory quickly.

This paper proposes a new dynamic grasping framework, which includes a polynomial trajectory prediction module, a grasping planning module, and a trajectory planning module. The framework clusters many grasps into a small number of pregrasps, generates the trajectories of all pregrasps, and selects the optimal time trajectory for execution.

We systematically evaluate the performance of the dynamic grasping framework through simulation and actual grasping experiments, which are multi-target and multi-task, and demonstrate the feasibility and robustness of the framework.

Related work

This section summarizes relevant work in four parts: Grasping in Dynamic Environments, Robotic Grasping, Object Motion Prediction and Motion Planning.

Grasping in dynamic environments

The two mainly alternative system architectures for grasping in dynamic environments are Locally Reactive Control and Sense-Plan-Act.^2,3

Locally Reactive Control: Such system architectures do not use global motion planning modules but local strategies to generate real-time reactive motions.² They respond immediately to change and are robust to uncertainties in sensors and object motion. Wholly based on feedback control systems^4–6 are used in grasping in dynamic environments for a long time. Recently, many studies are based on deep reinforcement learning.^7–9 The robotic control policy neural network that uses raw input from a camera is established in Levine et al.⁷ A mobile operation control framework⁸ based on multi-task reinforcement learning is proposed to achieve universal dynamic target tracking and grasping. In Pham et al.,⁹ the neural network predicts the action that meets the constraint closest to the unsafe action so that the arm can grasp the moving object while avoiding the obstacle. However, local reactive control may fall into a local minimum in a complex workspace.

Sense-Plan-Act: In the high degree of the freedom robot system, constructing sensing, planning, and action components through strong modularity is still the primary mode. In this system, the perception module builds a complete environment model, the planning module generates a collision-free and optimal trajectory, and finally, it is executed by the precise controller. Systems built around Sense-Plan-Act (SPA) are well suited to structure and well-defined environments.² The planner based on graph search¹⁰ is used to generate the trajectory. After the manipulator reaches the pregrasp, the controller is used to grasp in Cartesian space. A search-based kinodynamic motion planning algorithm is presented to generate time-parameterized trajectories to grasp moving objects smoothly in Menon et al.¹¹ The RRT-Connect method in time configuration space is used in Yang et al.¹² to generate a path to pick up the moving target. Still, it cannot cope with the uncertainty and changing environment. Due to the SPA’s lack of environmental feedback during movement execution, researchers have proposed extensions such as Sense-Plan-Act (SeqSPA). Constant-Time Motion Planning (CTMP)¹³ uses real-time heuristic search planning to grasp moving objects in the finite horizon. The Reachability-Aware Grasping and Motion-Aware Grasping are used in Akinola et al.¹⁴ to filter reachable grasps and incorporated the path from the previous timestep to seed the search process to achieve a quicker and smoother trajectory. Compared with SPA, SeqSPA systems still have better-grasping results when there is uncertainty in sensor signal and motion state.

However, the methods in Islam et al.¹³ and Akinola et al.¹⁴ cannot quickly cope with the dynamic obstacles. The core reason is that a motion planner requires a longer computing time than a local motion controller, which makes it impossible to react quickly in a dynamic environment. This paper develops a new SeqSPA dynamic grasping framework. The global planner in this paper is fast and can be used as a local motion controller. Moreover, the prediction module increases the reliable length of each trajectory.

Robotic grasping

Methods based on deep learning can directly extract the grasp pose of the target from an RGB-D view^15,16 or Point Clouds.¹⁷ These methods collect expensive grasp data sets in the real world to train a neural network. The physics-based simulation method is used in Eppner et al.¹⁸ to evaluate the grasping quality of the parallel jaw and establish a dense data set. The method in Akinola et al.¹⁴ generates robust grasping data sets through physical simulation, and uses pre-calculated reachability space¹⁹ and motion intention neural network to evaluate feasible grasp poses. In this paper, a stable grasping dataset is generated by physical simulation, the same as Eppner et al,¹⁸ and the clustering algorithm reduces the similar grasp poses.

Object motion prediction

The use time of the optimized trajectory can be improved by estimating the future motion of the target through the prediction module. The target pose detection system uses Kalman and Buey filtering methods^20–22 to filter the noisy pose results into more stable values. However, it is unreliable to use rough motion models to estimate the motion of future targets. The Covariant Optimization is utilized in Jeon et al.²³ to consider the target motion prediction of collision, and the collision term needs much time to calculate. The polynomial regression to predict object motion and generates a corresponding trajectory through the constraint-free QP optimizer is adopted in Chen et al.²⁴ Fast-Tracker^25,26 improves this method by using Bezier curves to represent the target prediction trajectory, and adding dynamic constraints and time-varying confidence in the optimization. We use this method to predict the movement of all objects.

Motion planning

The process of motion planning usually consists of path search and trajectory optimization. Sampling-based methods^12,27,28 and search-based methods^{10,11,13,14,29} are the main ways of front-end path search. The front-end path is jerky and leads to wavy motion. The back-end trajectory optimization algorithm calculates a smooth trajectory with time parameters based on the front-end path. The back-end methods of the manipulator^30,31 can optimize a satisfied constrained trajectory, but the optimization process does not consider the obstacles. CHOMP algorithm³² introduces the Euclidean Signed Distance Field (ESDF) information into motion planning to optimize trajectory using workspace gradient information. STOMP algorithm³³ uses sampling technology to generate multiple alternative trajectories and obtains the optimal trajectory through multiple iterations and evaluation of the cost of candidate trajectories. Machine learning techniques^34,35 are used to optimize trajectories. The clearance estimation is used for motion planning in Chase Kew et al.,³⁴ but the algorithm cannot quickly and adaptively handle dynamic obstacles. The method in Fujii and Pham³⁵ can quickly smooth the trajectory while processing dynamic obstacles but must obtain an initial collision-free path in advance. EGO algorithm³⁶ establishes a gradient field for a UAV without ESDF and uses the properties of the B-spline curve for rapid trajectory optimization.

Problem definition and algorithm framework

The task simulates a scenario in which multiple objects are placed on the conveyor belt in a warehouse.¹⁴ The robot arm must avoid colliding with non-target objects on the conveyor belt or static obstacles around it and pick up the target object. In grasping, there will be an external human being or robot as a moving obstacle or the conveyor belt speed mutation. We assume that we know the models of the target objects and obstacles, but their motions are unknown. The robot must catch and lift the target without colliding with surrounding obstacles.

We propose the manipulator dynamic grasping system, as shown in Figure 2. The SeqSPA Dynamic Grasping Frame is presented in Alg.1. Our method first extracts the pre-computed grasp data $G_{DB}$ of the target object O. By the K-Means algorithm, the close grasps in $G_{DB}$ are combined, and the centers of each class are taken to form $G_{K}$ . The grasp poses $G_{K}$ retreat a distance b along the grasp direction to generate pregrasp poses $G'_{K}$ . We assume the obstacle models are known and predict all obstacles’ future trajectories. If the obstacle is static, the predicted trajectory is a point. In the dynamic grasping loop, through visual sensors, the MotionPrediction Module records previous and evaluates current poses of the target and obstacles, and predicts the future target motion trajectory $g_{traj}$ and obstacle trajectories $o_{traj}$ . According to $g_{traj}$ and $G'_{K}$ , the pose of the manipulator $G_{J}$ when it arrives at the pregrasp is predicted through the IntersectionPoints module. We also obtain the initial trajectories $Ini t_{traj}$ and the intersection time T. The calculation process of the IntersectionPoints module is related to the position and speed of the end-effector and the target.

Figure 2.

Dynamic grasping system architecture diagram. The predicted trajectories of the target and obstacles generated by the motion prediction module and the pregrasp poses generated by the grasp planning are input into the trajectory planning module to generate the optimal trajectory. When using the motion prediction model, if there is a significant error between the actual position of the target and the predicted position, or the obstacle’s latest predicted trajectory will collide with the manipulator, the framework carries out the planner again. When not using the prediction module, the planner uses the current target and obstacles positions to generate the trajectory at a specific frequency.

Convert all initial trajectories to the B-spline control points $Q_{all}$ . All initial paths are optimized through the gradient-based TrajectoryOptimization module to obtain multiple collision-free and smooth trajectories $Q'_{all}$ . The optimal trajectory $obtimiz e_{traj}$ is selected from multiple alternative trajectories for execution, and the prediction module predicts the trajectory of the objects in real-time. When the difference between the observed position of the target and the predicted position is greater than one threshold value, or the obstacle may collide with the optimal trajectory at some point in the future, the trajectory optimization is carried out again. After the manipulator reaches the pregrasp $g_{j}$ , we get the pose of the target object after $t = 1 s$ , use the newly target predicted pose $p_{f}$ to generate grasp $g_{c}$ . The arm is then moved to $g_{c}$ . After moving the end-effector to $g_{c}$ , the gripper moves with the target for $t' = 1 s$ , closes the gripper, and then observes whether the target is lifted.

Grasp planning

We used an approach similar to Akinola et al.¹⁴ and Eppner et al.,¹⁸ where a grasping database is precomputed while all target objects remain static, to collect data in the physical simulation. First, many dense samples are taken to generate the grasp poses on the object’s surface. Then the object is lifted 20 times in the physical simulation environment, with random noise added each time. The success rate is the selection measure of the reliability of each grasp pose. The first 600 grasps are used as the grasping dataset.

We use the widely used 4-dimensional grasp representation in the physical simulation to represent the object grasp configuration. The four dimensions are: grip width w, opening size l, grasping position g and orientation u. We use a model the same width w as the actual gripper in the simulation. We sample on the target object’s surface, and the intersection point with the other side of the object is found through the normal line corresponding to the sampling point. The two points’ connection length is l, and the center point coordinate value of the segment is g. The gripper takes the line between two points as the rotation axis and samples in [−pi/3, pi/3] around the axis as u.

Our evaluation metric is based on distances between grasps and does not consider w and l. A grip pose $T (g, u) \in SE (3)$ consists of the grasp position $g \in R^{3}$ and the orientation represented as unit quaternion $u \in S^{3}$ . We use a weighted metric,¹⁸ the distance between T and $T'$ is defined as:

ρ (T, T^{'}) = ω {‖ g - g^{'} ‖}_{2} + \arccos (| 〈 u, u^{'} 〉 |)

(1)

where $ω \in R$ is a relative metric ratio that relates to rotation and translation. We define that a pure translation of 1 mm equals a pure rotation of 1°.

We use the K-Means to aggregate the robust grasps into k classes, where k = 10. In K-Means, the initial cluster center is randomly generated, and the distance between each sample and each cluster center is calculated by $ρ$ . The samples are assigned to the nearest cluster center to form the sample groups. After obtaining the new groups, the average value of all g and u in one group is calculated respectively to generate the new center. The sample closest to the new center is taken as the cluster center. Recalculated the distance between the samples and the cluster centers, the cluster centers are generated again. The algorithm is calculated several times until convergence, and the final clustering centers as the candidate grasps are obtained. All the candidate grasps are converted to the pregrasp set by each grasp pose moving d distance along the opposite direction of the unit quaternion u.

The transform relationship between the grasp and the pregrasp is shown in Figure 3. The center of the gripper at the grasp pose with closed gripper is g, and the center of the gripper after opening is $g'$ , as shown in Figure 3(a) and (b), respectively. $Δ b$ is the distance from g to the object boundary along the opposite direction of u, which is a parameter to generate robust grasp. The distance from g to $g'$ is $Δ b'$ , which can be calculated from the geometry of the gripper. The center of the gripper in the pregrasp is $g ″$ , and the distance from $g ″$ to the object along the direction of u is a fixed value $b_{0} = 0.06 m$ . $b = b_{0} + Δ b - Δ b'$ is the distance from $g ″$ to $g'$ , that is, the movement distance of the gripper center relative to the target within t. The grasp is closed using $t'$ , and the center point of the gripper is from $g'$ to g.

Figure 3.

The transformation relation between grasp and pregrasp. (a) is the position after the gripper is closed, which is one grasp in the grasps database including grasping orientation u, gripper center g, and the distance $Δ b$ from g to the edge of the object along the opposite direction of u; (b) is the grasp pose before the claw is closed; (c) is the pregrasp pose of the gripper: (a) grasp with closed gripper, (b) grasp with open gripper, and (c) pregrasp pose.

Since there are moving obstacles around the target, we conduct collision detection on the gripper, part of the arm and the obstacle, in which the gripper is in the pregrasp, as shown in Figure 4. We use a metric to rank the collision-free pregrasps. Our metric can be defined as:

f (T_{e}, T_{w}) = ρ (T_{e}, T_{w}) + W

(2)

Figure 4.

The mechanical arm components for collision detection in the grasp planning.

where $T_{e}$ is the gripper pose of the mechanical arm at initial time, $T_{w}$ is the pregrasp which is converted from the goal coordinate to the world coordinate. W = 0, if the inverse kinematics (IK) can be solved; W = 10,000, if IK cannot be solved.

Using f to arrange the candidate data from small to large, the first data is the best pregrasp at this moment.

Motion prediction

The planner needs time to calculate trajectories, quickly becoming obsolete as the object moves. Therefore, predicting the future pose of the target can improve the task’s overall success rate. We adopt the Bézier curve $B (t)$ to indicate the object predicted trajectory.²⁶

The method in Pan et al.²⁶ maintains a queue $Q = [q_{1}, q_{2}, \dots, q_{L}]$ , where $q_{i} = {p_{t_{i}}, t_{i}}$ . $p_{t_{i}}$ is the instantaneous pose of the moving object, and $t_{i}$ is the corresponding timestamp. Queue length is L and $t_{L}$ is the current time. The prediction module runs in real-time. We add the new target observation data to the queue and generate the new target prediction trajectory by fitting the queue data. Our predicted target movement period is $(t_{L}, t_{p}]$ . $t_{p}$ is the end time of the Bézier curve.

The cost function minimizes the residual distance between the observation data and the prediction trajectory²⁶:

J_{pre} = \sum_{i = 1}^{L} w_{t_{i}} ‖ B (t_{i}) - p_{t_{i}} ‖_{2}^{2} + w_{p} L \int_{t_{1}}^{t_{p}} {‖ B^{(2)} (t) ‖}_{2}^{2} dt

(3)

Where, $w_{t_{i}}$ is the weight term to distinguish the confidence of the observed values at different times. $w_{p}$ represents the weight of the regularizer to avoid over-fitting.

The Object-oriented software for Quadratic Programming³⁷ is used to solve the optimization problem of the Bézier curve that meets the velocity and acceleration constraints in Han et al.²⁵ and Pan et al.²⁶

Our complete motion predicted trajectory is shown as follows:

f (t) = {\begin{matrix} B (t), t_{L} \leq t \leq t_{p} \\ B (t_{p}) + B' (t_{p}) • (t - t_{p}), t > t_{p} \end{matrix}

(4)

We set $t_{p} - t_{L} = 1.5 s$ . When the time exceeds the period of the Bézier curve, assuming that the object is moving uniformly, the object’s position is calculated using the velocity corresponding to $t_{p}$ .

Motion generation

When grasping a dynamic object, the movement of the end-effector can be divided into four stages: pregrasping, grasping, tracking, and lifting. The manipulator reaches the pregrasp pose, approaches the grasp pose, closes the gripper while tracking the target, and lifts it. Our previous algorithm framework¹ is the SPA system. The trajectory generation algorithm to reach the pregrasp position contains sampling-based path search and nonlinear trajectory optimization. To further improve the computational efficiency of trajectory generation in our framework, this paper removed the time-consuming front-end sampling algorithm and trajectory feasibility adjustment module. When a trajectory falls to a minimum or the velocity and acceleration of the trajectory are not feasible, we remove it directly. This paper carries out real-time trajectory planning with multiple grasping poses. Removing several infeasible trajectories does not affect the grasping task.

We first estimate the intersection position using the rapid generation of motion primitive algorithm in Mueller et al.³⁸ according to the position and speed of the end-effector and the target. Then the planner generates multiple trajectories using the gradient-based method and selects the optimal trajectory for execution. We use the Cartesian controller to move the mechanical arm during the grasping, tracking and lifting stages.

Estimate intersection point

Assuming no obstacles, we need to calculate an optimal intersection trajectory to grasp the moving target. The total initial trajectory cost function $J_{Σ}$ of H + 1 dimensional joint space is defined as¹:

J_{Σ} = \sum_{k = 0}^{H} J_{k} + ρ T, where J_{k} = \frac{1}{T} \int_{0}^{T} j_{k} {(t)}^{2} dt .

(5)

Where, $J_{k}$ is the cost function of the $k^{th}$ joint, $ρ$ is the time factor, T is the trajectory time, and $j_{k} (t)$ is the third derivative of the $k^{th}$ joint curve.

By using Pontryagin’s minimum principle, fixed trajectory end angle and angular velocity, the optimal state trajectory can be straightforwardly solved³⁸:

\begin{matrix} s^{*} (t) = [\begin{matrix} θ^{*} (t) \\ ω^{*} (t) \\ 0 \end{matrix}] \\ = [\begin{matrix} \frac{α}{120} t^{5} + \frac{β}{24} t^{4} + \frac{γ}{6} t^{3} + \frac{a_{0}}{2} t^{2} + ω_{0} t + θ_{0} \\ \frac{α}{24} t^{4} + \frac{β}{6} t^{3} + \frac{γ}{2} t^{2} + a_{0} t + ω_{0} \\ α t^{2} + 2 β t + 2 γ \end{matrix}] . \end{matrix}

(6)

Assuming the initial angle, angular velocity and angular acceleration of the $k^{th}$ joint trajectory is $s (0) = (θ_{0}, ω_{0}, a_{0})$ , and motion parameters at intersection time is $s (T) = (θ_{f}, ω_{f})$ . Then, the unknowns $α$ , $β$ , and $γ$ can be solved.

[\begin{matrix} α \\ β \\ γ \end{matrix}] = \frac{1}{T^{5}} [\begin{matrix} \begin{matrix} 320 \\ - 200 T \\ 40 T^{2} \end{matrix} & \begin{matrix} - 120 T \\ 72 T^{2} \\ - 12 T^{3} \end{matrix} \end{matrix}] [\begin{matrix} Δ θ \\ Δ ω \end{matrix}]

(7)

Where

[\begin{matrix} Δ θ \\ Δ ω \end{matrix}] = [\begin{matrix} θ_{f} - θ_{0} - ω_{0} T - \frac{1}{2} a_{0} T^{2} \\ ω_{f} - ω_{0} - a_{0} T \end{matrix}] .

(8)

Each joint cost value³⁸ can be calculated as follows:

J_{k} = γ^{2} + β γ T + \frac{1}{3} β^{2} T^{2} + \frac{1}{3} α γ T^{2} + \frac{1}{4} α β T^{3} + \frac{1}{20} α^{2} T^{4} .

(9)

The only variable in the expression of $α$ , $β$ , $γ$ is T. We will substitute $α$ , $β$ , $γ$ into $J_{Σ}^{*} (T)$ . To find the optimal time T, we set $\partial J_{Σ}^{*} (T) / \partial T = 0$ to solve the flight time T, and then solve $α$ , $β$ , $γ$ . We substitute $α$ , $β$ , $γ$ into $s^{*} (t)$ to obtain the optimal angle trajectory $θ^{*} (t)$ and angular velocity trajectory $ω^{*} (t)$ .

We sample the path points along the target’s predicted trajectory starting from the starting point. The target center sampling point $[p, v]$ at time $T_{move}$ , which is the time from the starting point to the sampling point, and the pregrasp pose $[x_{c}, y_{c}, z_{c}, θ_{α}, θ_{β}, θ_{γ}]$ that change with the object’s center coordinate are transformed into robot arm joint space as ${θ_{f_{0}}, \dots, θ_{f_{H}}, ω_{f_{0}}, \dots, ω_{f_{H}}}$ . We calculate the total cost function $J_{Σ}$ to obtain the trajectory running time $T_{angle}$ . The time constraint $T_{angle} < T_{move}$ and angular velocity constraint $max_{t \subset [0 T_{angle}]} | ω^{*} (t) | < ω_{max}$ of each joint trajectory must be satisfied, where $ω_{max}$ is the maximum angular velocity. $p^{*}$ is the first intersection point to satisfy the constraints and the related $θ^{*} (t)$ is the initial path of the trajectory optimization in joint space. The intersection time is $T^{*} = T_{move}$ .

Gradient-based trajectory optimization

Due to ignoring obstacles’ information in the manipulator workspace when calculating the initial joint trajectory, the initial trajectory is close to or through obstacles. Therefore, we transform ESDF information into the joint space and use nonlinear optimization to generate smooth, safe and kinematically feasible trajectories in a short time.

This paper uses the method in Wei et al.¹ to establish the approximate repulsive gradient field generated by obstacles in high-dimensional space. Thus, we establish the distance and gradient expressions between the path point and a single obstacle in joint space for the collision penalty.

Our planner uses robotic control points to optimize the trajectory and the oriented bounding boxes (OBB) for collision detection. The distribution is shown in Figure 5. As shown in Figure 6, $r_{i}$ is radius of the control point $c_{i}$ . The distance between control point $c_{i}$ and obstacle $O_{j}$ is $D_{ij}$ . $s_{f}$ is the obstacle safety threshold constant. The repulsive function is:

F_{ij} = \frac{2 b}{e^{k (a + {d'}_{ij})} + 1}

(10)

Figure 5.

The distribution of control points (green points) and oriented bounding boxes (transparent cubes).

Figure 6.

The repulsive force generated by the obstacle on the control point.

where $F_{ij}$ is repulsive force, $d'_{ij} = D_{ij} - r_{i}$ , and the parameters $a, b, k$ determine the smooth function shape. Assuming the maximum repulsion constant is $F_{max}$ , $d'_{ij} = s_{f}$ and $F_{ij} = 0.01$ , so we can figure out $a, b, k$ , where $b = F_{max} / F_{max} 2 2$ .

We calculate the repulsion force of each control point, convert the repulsion force to the origin of the corresponding connecting rod coordinate, and combine the force and torque values. Repulsive force and torque at the origin of the coordinate affect the joint coordinates from this link to the base. The force and torque at the origin of each link coordinate system are recursively deduced to the base coordinate system by using the recursive formula of force and torque in the series mechanism of robotics. In this manipulator configuration, the torque values ${τ_{0}, τ_{1}, \dots, τ_{H}}$ of the rotation axis of each joint are the repulsion torques generated by the repulsion field of the obstacle.

Repulsive torques will move the manipulator away from the obstacle. ${τ_{0}, τ_{1}, \dots, τ_{H}}$ is converted to unit vector $[{τ^{'}}_{0}, {τ^{'}}_{1}, \dots, {τ^{'}}_{H}]$ , and the unit vector is approximated as the direction of joint angle change. The configuration $Q = {θ_{0}, θ_{1}, \dots, θ_{H}}$ moves one step in the unit vector to obtain the new robotic configuration $Q^{'} = {{θ^{'}}_{0}, {θ^{'}}_{1}, \dots, {θ^{'}}_{H}}$ , and the formula¹ is: $θ'_{i} = θ_{i} + τ'_{i} \cdot step, i \in {0, 1, \dots, H}$ .

Approximate repulsive gradient field was represented by ${p, v}$ pairs and the establishing process is shown in Figure 7. Each waypoint of trajectory is calculated to obtain a ${p_{ij}, v_{ij}}$ pair. The initial configuration is $Q_{i}$ . $p_{ij}$ represents the critical point where the repulsion torque generated by obstacle $O_{j}$ to configuration $Q_{i}$ is zero. $v_{ij}$ represents the corresponding gradient direction, which is the unit vector from $Q_{i}$ to $p_{ij}$ . The distance between $Q_{i}$ and $O_{j}$ is $d_{ij} = (Q_{i} - p_{ij}) \cdot v_{ij}$ .

Figure 7.

The establishment process of ${p, v}$ pairs.¹

We use nonlinear optimization method to plan uniform B-spline curve control points.^39,40 The derivative of a B-spline curve is also a B-spline curve, whose convex hull characteristics can ensure dynamic feasibility and safety. The cost function is as follow:

min_{Q} J = λ_{s} J_{s} + λ_{c} J_{c} + λ_{d} J_{d} + λ_{e} J_{e}

(11)

where $J_{s}$ , $J_{c}$ , $J_{d}$ are smoothness penalty, collision penalty and feasibility penalty, which are the same cost function of.¹ $J_{e}$ is endpoint penalty. $λ_{s}, λ_{c}, λ_{d}, λ_{e}$ are the coefficient of each penalty term.

In order to successfully grasp the moving object, the gripper needs to reach the pregrasp accurately and has the same speed as the target. When no end-point penalty is imposed, optimizing the penalty function of the first several terms will change the control points for determining the terminal position and velocity. So, we add the end trajectory point constraint to reduce the deviation of the manipulator reaching the pregrasp pose after optimization. The function is defined as follows:

J_{e} = k_{θ} (θ^{*} (T^{*}) - θ_{e})^{2} + k_{ω} (ω^{*} (T^{*}) - ω_{e})^{2}

(12)

where $θ^{*} (T^{*})$ and $w^{*} (T^{*})$ are the values of initial angle, angular velocity trajectory at intersection time $T^{*}$ , respectively. $k_{θ}$ and $k_{ω}$ are the weighting parameters. $θ_{e}$ and $ω_{e}$ are the endpoint angle, angular velocity in the optimization process, which are expressed as some control points and defined as follows:

\begin{matrix} θ_{e} = m_{0, 0} Q_{N - p_{b} - 1} + m_{0, 1} Q_{N - p_{b} - 2} + . . . + m_{0, k - 2} Q_{N}, \\ ω_{e} = m_{1, 0} Q_{N - p_{b} - 1} + m_{1, 1} Q_{N - p_{b} - 2} + . . . + m_{1, k - 2} Q_{N} . \end{matrix}

where

k = p_{b} + 1,

(13)

m_{i, j} = \frac{1}{(k - 1)!} C_{k - 1}^{k - 1 - i} \sum_{s = j}^{k - 1} {(- 1)}^{s - j} C_{k}^{s - j} {(k - s - 1)}^{k - 1 - i},

C_{n}^{i} = \frac{n!}{i! (n - i)!} .

(14)

Implementation details and results

We experimented with the simulation environment and the actual robot to evaluate our method. In the simulation, we evaluate the performance of our algorithm in different tasks for different targets. The simulation environment includes the target’s random linear or nonlinear movement, whether there are static obstacles, obstacle with the same speed as the target, and external moving obstacles. Then, the algorithm’s feasibility is verified by typical working conditions in the real world with different targets.

L-BFGS⁴¹ is a quasi-Newton Algorithm to solve the nonlinear optimization problem. The dynamic grasping framework runs on a standard desktop computer with a single-core CPU of 3.20 GHz, Intel Core i7-8700, through Ubuntu 18.04 and ROS melodic.

Experimental setup

According to the actual task and previous experience,¹⁴ we design seven working conditions to evaluate our method on the UR5-Robotiq robots, and the specific settings are as follows:

Linear Motion: The target moves uniformly in a straight line on the conveyor belt with no obstacles around, as shown in Figure 8(a). We random set $0.3 m \leq r \leq 0.7 m$ , $v = 0.1 m / s$ and $0.6 m \leq l \leq 1.0 m$ .

Linear with Static Obstacles: In the Linear Motion scene, we divide the near region surrounding the robot into three regions, as shown in Figure 8(b). We place one obstacle in each area and randomly locate the obstacle’s position in each region.

Linear with Moving Obstacle on The Belt: In the Linear Motion scene, we place an obstacle around the target that moves at the same speed as the target, as shown in Figure 8(c).

Linear with External Moving Obstacle: As the arm grasps the target in the Linear Motion scene, the external moving obstacle sweeps over the conveyor belt, as shown in Figure 8(d). The moving obstacle speed is about 0.05–0.15 m/s and changes dramatically some times. The height of the obstacle track to the conveyor belt surface is 0.2–0.8 m.

Linear with All Obstacles: In the Linear Motion scene, we add one static obstacle, one obstacle at the same speed as the target, and an external moving obstacle, as shown in Figure 8(e). The distribution and velocity of the three obstacles are the same as tasks (2)−(4).

Linear with Varying Speed: In the Linear Motion scene, the target object’s speed will change dramatically 3–5 times, and the linear velocity range is 0.05–0.1 m/s.

Circular Motion: The object moves uniformly around the manipulator’s base, as shown in Figure 8(f). The target trajectory radius is $0.3 m \leq r \leq 0.7 m$ , the object velocity is 0.05–0.1 m/s, and it moves in a single direction.

Figure 8.

Schematic diagram of random experimental motion parameters in seven working conditions. Schematic diagrams of working conditions 1–5 are (a)–(e), random parameters of working conditions 6 and 1 are the same, and (f) corresponds to working conditions 7. The distance between the target trajectory and the base coordinate is r. l is the projected length of the target starting point to the base along the conveyor belt direction. v is the target motion velocity. When the target moves in the x-axis’s positive direction, v is positive.

We selected five grasping targets and generated pregrasps, as shown in Figure 9. Each target’s initial stable grasp poses are about 600, and the clustering algorithm sets 10 classes. The grasps, after clustering, are transformed into pregrasps. All objects are used for the simulation and the actual robot experiment. Five kinds of targets are captured in each simulation experiment scene of each method. We run 40 times for each object in one scene and count the average success rate and capture time of 200 tests.

Figure 9.

The graspable objects and corresponding pregrasps. The five graspable objects from the YCB Object Database⁴² are shown in (a). Many stable grasp poses for five targets are shown in (b). In (c), the grasp poses are clustered, and 10 grasps are generated for each target. The grasps transform into the pregrasps in (d): (a) five graspable objects, (b) stable grasps, (c) grasps after clustering, and (d) pregrasps.

We compare the performance of the below methods in the experiment:

OUR: This is our complete framework. We removed the trajectories with the speed and acceleration exceeding the limit in the obtained trajectories and selected the trajectory with the shortest time for execution.

One grasp: Unlike OUR, this method only uses one grasp with the lowest score for each plan and runs the calculated trajectory. When the first planning fails, we rerank the pregrasps and carry out the trajectory optimization again.

No prediction: This method removes the prediction module to study its importance. It does not predict the trajectory of the target and obstacles but directly plans the trajectory of all pregrasps at the moment and selects the trajectory with the shortest time for execution. The planner takes an average of 2 ms to generate trajectories from the starting position. When the end-effector is close to the target, the trajectory length will decrease, so each planning time of a single trajectory is less than 2 ms in the grasping process. Therefore, the maximum time for each plan should not exceed 20 ms. The planner is used as a controller, and the control rate is about 50 Hz.

No prediction + One grasp: This method removes the prediction module and uses one grasp with the lowest score. Only one trajectory is generated per planning. The planner is used as a controller, and we set the control rate to be 100 Hz.

Experimental results

The simulation results are shown in Table 1. The bold entries are the best results of the success rate and dynamic gripping time among the four methods Under working conditions 1 and 5, planning trajectories generated by different algorithms are shown in Figures 10 and 11, respectively.

Table 1.

The average success rate (%) and dynamic gripping time(s) of simulation experiments under each condition.

Method	Linear	Linear with static obstacles	Linear with moving obstacle on the belt	Linear with external obstacle	Linear with all obstacles	Linear with Varying Speed	Circular Motion
Our	100, 5.32	100, 5.28	100, 5.33	100, 6.76	95, 7.32	100, 8.11	–
One grasp	100, 5.95	97, 5.96	100, 5.87	96, 8.26	91, 7.40	98, 8.73	–
No prediction	92, 7.01	86, 7.69	90, 6.81	80, 8.12	77, 9.49	96, 7.18	86, 6.87
No prediction + One grasp	96, 5.97	86, 5.54	96, 6.24	86, 9.01	84, 9.31	100, 7.76	88, 8.09

Figure 10.

Trajectories generated by four algorithms in working condition 1. The yellow arrows near the object are the selected pregrasps. In task 1, the target is moving uniformly on the conveyor belt. Algorithms 1,2 contain the prediction module and plan the trajectory at one time. The green trajectory at the geometric center of the target object is the prediction trajectory, and the blue trajectory is the historical trajectory, as shown in (a)–(b). Algorithms 3, 4 do not include the prediction module and carry out real-time trajectory planning, as shown in (c)–(d). Algorithms 1 and 3 plan trajectories for all grasp poses and generate multiple feasible trajectories.

Figure 11.

Trajectories generated by four algorithms in working condition 5. The target and the obstacle in front are on the conveyor belt in uniform motion, one obstacle is above the target and moves in a straight line with variable speed along the conveyor belt, and one static obstacle is on the right side of the manipulator. Algorithms 1, 2 contain the prediction module to predict the trajectory of all moving objects and generate a long-term trajectory. However, when the velocity of the outside moving obstacle changes to lead original trajectory colliding with the obstacle in the future, the planner carries out again to generate a new trajectory, as shown in (a)–(b). Algorithms 3 and 4 carry out real-time trajectory planning, as shown in (c)–(d). Before the planning, the grasping planner sorted the grasps and eliminated the grasps where collisions occurred. Therefore, in (a) and (c), the target only has eight pregrasp poses. In (d), real-time trajectory planning selected the nearest pregrasp to the gripping jaw, so the pregrasp changes in real-time.

In working conditions 1–5, the conveyor belt velocity is unknown but fixed, and the trajectory prediction module can better predict the target trajectory. Among them, in conditions 4 and 5, the movement state of the obstacle changes in real-time. Algorithm 1 generates multiple trajectories and finds the one with the shortest execution time. The success rate of algorithm 1 is slightly higher than algorithm 2, much higher than algorithms 3 and 4, and the grasping time of algorithm 1 is closer to the global minimum. Compared with Algorithm 3, Algorithm 4 uses only one pregrasp, so the trajectory update frequency of Algorithm 4 is faster, and the success rate is higher. When the environment is entirely predictable, each execution of the local shortest path is not the entire shortest path. Therefore, algorithms 3, 4, which updated the trajectory in real-time, has a longer grasping time than algorithms 1, 2. In condition 5, with the most obstacles, algorithms 3 and 4 are only planned for the current environment, so they easily fall into local extremums and have the lowest grasping success rate.

In case 6, the speed of the conveyor belt will mutate several times, and algorithm 1 can only calculate the shortest grasping time at the speed of the conveyor belt at this moment but cannot calculate the global shortest time. The grasping time of algorithms 3, 4 is shorter than 1, 2. Thus, the real-time trajectory update algorithm has a shorter grasp time in an unpredictable environment. The success rate of algorithms 1, 2 and 3, 4 is similar, demonstrating our overall algorithm framework’s real-time performance and robustness.

The target moves in a circle, and its speed constantly changes in condition 7. In algorithms 1 and 2, the manipulator’s predicted intersection point is not on the target trajectory circle. The Bezier curve used only has a predicted length of 1.5 s. When the time is more significant than 1.5 s, the position can be predicted according to the speed at the end of the Bezier curve and the predicted trajectory is a straight line, as shown in equation (4). When the initial intersection time is greater than 1.5 s, the intersection point will be on the straight line instead of the Bezier curve and the prediction error is large. So the grasping success rate is low, which is not compared in Table 1. Algorithms 3 and 4 generate real-time trajectories and can respond to changes immediately, and they also have a high grasping success rate in condition 7. In working conditions 6 and 7, the environment cannot be completely predicted. Algorithm 3 updates the trajectory in real time and executes the shortest one among multiple trajectories each time, so the overall time is the shortest.

In summary, when the target trajectory can be predicted, our algorithm framework can respond immediately to environmental changes and has the highest success rate. When the target trajectory cannot be accurately predicted, algorithms 3 and 4 generate the real-time trajectory, while algorithm 4 has a faster calculation speed and higher success rate. However, algorithms 3 and 4 may fall into local extreme values in a complex environment with many obstacles, such as task 5.

Real robot grasp

As shown in Figure 12, the experimental platform consists of an AUBO-i5 manipulator, a parallel gripper, a conveyor belt and a Kinect2 depth camera. The camera in front of the conveyor belt is used to collect visual information. Based on the simulation experiment results, we selected algorithms 1 (Our) and 4 (No prediction + One grasp) for the real-world experiment, and the target objects are the same in the simulation experiment. We add a position based visual servoing (PBVS) controller for comparison of algorithms. Based on the target information collected by the camera, the difference between the end pose of the current manipulator and the target pose is obtained. The servo controller is used to calculate the motion speed of the manipulator. Then the motion speed of each joint of the manipulator is obtained to drive the manipulator to approach the target.

Figure 12.

The experimental platform.

We use three tasks in the actual experiment. In the Linear Motion and Linear with Moving Obstacle on The Belt tasks, the conveyor belt speed does not exceed 0.1 m/s. In the Linear with Varying Speed task, the linear velocity range is 0.02 to 0.1 m/s, changing the belt speed three times. We test each object 10 times in a task, resulting in 50 times for each algorithm.

The experimental results are shown in Table 2. The bold entries are the best results of the success rate and dynamic gripping time among the three methods. The conveyor belt velocity is unknown in the first two tasks but does not occur mutation. In the Linear Motion task, algorithm 1 uses a prediction module to generate a long trajectory with a higher success rate. As the trajectory with the shortest time is executed every time, the grasping time is the fastest. The PBVS method responds quickly and has the highest success rate. In the Linear with Moving Obstacle on The Belt task, algorithm 1 has the highest success rate and takes the least time. Due to the not complicated environment, algorithm 4 will not fall into the extreme value. However, the PBVS method moves directly to the target in the workspace and cannot avoid obstacles, so the success rate is the lowest. In the Linear with Varying Speed task, The PBVS algorithm achieves the highest success rate and takes the least time. Algorithms 1 and 4 have similar success rates.

Table 2.

The average success rate (%) and dynamic gripping time(s) of real-world experiments under each condition.

Method	Linear	Linear with moving obstacle on the belt	Linear with varying speed
Our	88, 5.67	80, 6.13	76, 8.91
No prediction + One grasp	82, 6.83	74, 7.24	78, 7.86
PBVS	92, 6.24	56, 6.82	83, 7.27

Overall, our algorithmic framework (Algorithms 1) generates complete trajectories and works best in the presence of obstacles. When the target moves regularly, algorithm 1 calculates the intersection position in advance, making the grasping time the shortest. PBVS is a local reactive control method that works best when there are no obstacles, and the target velocity varies greatly. Algorithm 4 generates the complete trajectory in real time, and the performance is between algorithm 1 and PBVS. In the Linear with moving obstacle on the belt task, the dynamic grasping process of the target moving at a speed of 0.09 m/s on the conveyor belt is shown in Figure 13.

Figure 13.

The actual robot captures the moving object while avoiding an obstacle. We place the target and obstacle on the unknown speed conveyor belt. The prediction module records the target’s location information for a period to predict the target trajectory, as shown in (a)–(b). The robot arm executes the corresponding trajectory and arrives at the pregrasp pose in (c)–(e). After reaching the grasp pose, the robot closes the gripper and lifts the target object in (f)–(h).

Conclusions

This paper proposes a new dynamic grasping framework for effective grasping in a dynamic environment. Many grasp poses of the target are clustered to determine multiple pregrasp poses, and we use the Bézier curve motion prediction module to generate predicted trajectories of all moving objects. The trajectory optimization algorithm constructing an approximate gradient field in joint space can generate a smooth trajectory. Simulation comparison and actual experiments verify the dynamic grasping framework’s high performance and feasibility. The algorithm framework is robust to environmental changes and has a high success rate in complex environments. When the target’s motion state can be accurately predicted, we can achieve fast and robust moving target grasping using the prediction module and trajectory planning for all grasp poses. When the environment is not easy to predict, we only quickly plan for the current optimal grasp pose and do not use the prediction module, which can also get good results.

In the future, we will use deep learning to establish the mapping relationship between the deep image features and the manipulator’s local trajectory and use deep reinforcement learning to train the dynamic grasping framework. The new method will combine the prediction and planner modules to achieve more efficient dynamic grasping.

Footnotes

Declaration of conflicting interests

The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.

Funding

The author(s) received no financial support for the research, authorship, and/or publication of this article.

ORCID iD

Bo Liu

References

Wei

Liu

Yao

, et al. Efficient online motion planning method for the robotic arm to pick-up moving objects smoothly with temporal constraints. Proc IMechE, Part C: J Mechanical Engineering Science 2022; 236: 8650–8662.

Kappler

Meier

Issac

, et al. Real-time perception meets reactive motion generation. IEEE Robot Autom Lett 2018; 3(3): 1864–1871.

Eppner

Höfer

Jonschkowski

, et al. Lessons from the amazon picking challenge: Four aspects of building robotic systems. Robotics 2016; 12. DOI: 10.15607/rss.2016.xii.036

Aghili

A prediction and motion-planning scheme for visually guided robotic capturing of free-floating tumbling objects with uncertain dynamics. IEEE Trans Robot 2012; 28(3): 634–649.

Dong

Zhu

ZH.

Position-based visual servo control of autonomous robotic manipulators. Acta Astronaut 2015; 115: 291–302.

Gong

Qiu

Tao

, et al. Tracking and grasping of moving target based on accelerated geometric particle filter on colored image. Sci China Technol Sci 2021; 64(4): 755–766.

Levine

Finn

Darrell

, et al. End-to-end training of deep visuomotor policies. J Mach Learn Res 2016; 17: 1–40.

Wang

Zhang

Wang

, et al. Multi-task reinforcement learning based mobile manipulation control for dynamic object tracking and grasping. 2020; 1–6. http://arxiv.org/abs/2006.04271.

Pham

De Magistris

Tachibana

OptLayer - practical constrained optimization for deep reinforcement learning in the real world. In: IEEE international conference on robotics and automation, 2018, pp.6236–6243. New York: IEEE.

10.

Cowley

Cohen

Marshall

, et al. Perception and motion planning for pick-and-place of dynamic objects. In: IEEE international conference on intelligent robots and systems, 2013, pp.816–823. New York: IEEE.

11.

Menon

Cohen

Likhachev

Motion planning for smooth pickup of moving objects. In: IEEE international conference on robotics and automation, 2014, pp.453–460. New York: IEEE.

12.

Yang

Merkt

Ivan

, et al. Planning in time-configuration space for efficient pick-and-place in non-static environments with temporal constraints. In: IEEE-RAS international conference on humanoid robots, 2019, vol. 2018-November, pp.893–900. New York: IEEE.

13.

Islam

Salzman

Agarwal

, et al. Provably constant-time planning and replanning for real-time grasping objects off a conveyor belt. Int J Rob Res 2020; 40(12-14): 1370–1384. DOI: 10.15607/rss.2020.xvi.025

14.

Akinola

Song

, et al. Dynamic grasping with reachability and motion awareness. 2021 International Conference on Intelligent Robots and Systems (IROS). IEEE, 2021, pp. 9422–9429.

15.

Morrison

Leitner

Corke

Closing the loop for robotic grasping: A real-time, generative grasp synthesis approach. Robotics 2018. MIT Press-Journals, 2018.

16.

Lenz

Lee

Saxena

Deep learning for detecting robotic grasps. Int J Rob Res 2015; 34(4-5): 705–724.

17.

ten Pas

Gualtieri

Saenko

, et al. Grasp pose detection in point clouds. Int J Rob Res 2017; 36(13-14): 1455–1473.

18.

Eppner

Mousavian

Fox

A billion ways to grasp: an evaluation of grasp sampling schemes on a dense, physics-based grasp data set. Robotics Research: The 19th International Symposium ISRR. Cham: Springer International Publishing, 2022, pp. 890–905.

19.

Akinola

Varley

Chen

, et al. Workspace aware online grasp planning. In: IEEE international conference on intelligent robots and systems, 2018, pp.2917–2924. New York: IEEE.

20.

Kalman

Buey

A new approach to linear filtering and prediction theory. Trans ASME J Basic Eng 1961; 83(Series D): 95–108.

21.

Kendall

Salvapantula

Stol

KA.

On-board object tracking control of a quadcopter with monocular vision. In: 2014 international conference on unmanned aircraft systems (ICUAS), Orlando, USA. IEEE, 2014, pp.404–411.

22.

Nageli

Alonso-Mora

Domahidi

, et al. Real-time motion planning for aerial videography with dynamic obstacle avoidance and viewpoint optimization. IEEE Robot Autom Lett 2017; 2(3): 1696–1703.

23.

Jeon

Lee

Kim

HJ.

Integrated motion planner for real-time aerial videography with a drone in a dense environment. In: IEEE international conference on robotics and automation, 2020, pp.1243–1249. New York: IEEE.

24.

Chen

Liu

Shen

Tracking a moving target in cluttered environments using a quadrotor. In: IEEE international conference on intelligent robots and systems, 2016, vol. 2016-Novem, pp. 446–453. New York: IEEE.

25.

Han

Zhang

Pan

, et al. Fast-tracker: A robust aerial system for tracking agile target in cluttered environments. In: 2021 IEEE international conference on robotics and automation (ICRA), 2021, pp.328–334. New York: IEEE.

26.

Pan

Zhang

Yang

, et al. Fast-Tracker 2.0: Improving autonomy of aerial tracking with active vision and human location regression. IET Cyber Syst Robot 2021; 3(4): 292–301.

27.

Karaman

Frazzoli

Incremental sampling-based algorithms for optimal motion planning. Robot Sci Syst 2011; 6: 267–274.

28.

Kunz

Stilman

Probabilistically complete kinodynamic planning for robot manipulators with acceleration limits. In: 2014 IEEE/RSJ international conference on intelligent robots and systems, 2014, pp.3713–3719. New York: IEEE.

29.

Coleman

Sucan

Moll

, et al. Experience-based planning with sparse roadmap spanners. In: IEEE international conference on robotics and automation, 2015, pp.900–905. New York: IEEE.

30.

Zhao

Zhang

Optimum time-energy-jerk trajectory planning for serial robotic manipulators by reparameterized quintic NURBS curves. Proc IMechE, Part C: J Mechanical Engineering Science 2021; 235: 4382–4393.

31.

Liu

Zhang

Trajectory optimization for manipulators based on external archives self-searching multi-objective particle swarm optimization. Proc IMechE, Part C: J Mechanical Engineering Science 2022; 236: 1188–1201.

32.

Ratliff

Zucker

Bagnell

, et al. CHOMP: Gradient optimization techniques for efficient motion planning, 2009, pp.489–494.

33.

Kalakrishnan

Chitta

Theodorou

, et al. STOMP: Stochastic trajectory optimization for motion planning. In: IEEE international conference on robotics and automation, 2011, pp.4569–4574. New York: IEEE.

34.

Chase

Kew

Ichter

, et al. Neural collision clearance estimator for batched motion planning. Proc. Adv. Robot 2021; 17: 73–89.

35.

Fujii

Pham

Q-C

. Realtime trajectory smoothing with neural nets2022 International Conference on Robotics and Automation (ICRA). IEEE, 2022, pp.7248–7254.

36.

Zhou

Wang

, et al. EGO-Planner: an ESDF-free gradient-based local planner for quadrotors. IEEE Robot Autom Lett 2021; 6(2): 478–485.

37.

Gertz

Wright

SJ.

Object-oriented software for quadratic programming. ACM Trans Math Softw 2003; 29(1): 58–81.

38.

Mueller

Hehn

D'Andrea

. A computationally efficient motion primitive for quadrocopter trajectory generation. IEEE Trans Robot 2015; 31(6): 1294–1310.

39.

Zhou

Gao

Wang

, et al. Robust and efficient quadrotor trajectory generation for fast autonomous flight. IEEE Robot Autom Lett 2019; 4(4): 3529–3536.

40.

Usenko

von Stumberg

Pangercic

, et al. Real-time trajectory replanning for MAVs using uniform B-splines and a 3D circular buffer. 2017 IEEE/RSJ international conference on intelligent robots and systems (IROS), 2017, pp.215–222. New York: IEEE.

41.

Calli

Walsman

Singh

, et al. Benchmarking in manipulation research: The YCB object and model set and benchmarking protocols, 2015.

42.

Qin

General matrix representations for B-splines. In: Pacific conference on computer graphics and applications, 1998, vol. 69473010, pp.37–43.

Dynamic grasping of manipulator based on realtime smooth trajectory generation

Abstract

Keywords

Introduction

Related work

Grasping in dynamic environments

Robotic grasping

Object motion prediction

Motion planning

Problem definition and algorithm framework

Grasp planning

Motion prediction

Motion generation

Estimate intersection point

Gradient-based trajectory optimization

Implementation details and results

Experimental setup

Experimental results

Real robot grasp

Conclusions

Footnotes

Declaration of conflicting interests

Funding

ORCID iD

References