Reinforcement learning for versatile,dynamic,and robust bipedal locomotion control

Abstract

This paper presents a comprehensive study on using deep reinforcement learning (RL) to create dynamic locomotion controllers for bipedal robots. Going beyond focusing on a single locomotion skill, we develop a general control solution that can be used for a range of dynamic bipedal skills, from periodic walking and running to aperiodic jumping and standing. Our RL-based controller incorporates a novel dual-history architecture, utilizing both a long-term and short-term input/output (I/O) history of the robot. This control architecture, when trained through the proposed end-to-end RL approach, consistently outperforms other methods across a diverse range of skills in both simulation and the real world. The study also delves into the adaptivity and robustness introduced by the proposed RL system in developing locomotion controllers. We demonstrate that the proposed architecture can adapt to both time-invariant dynamics shifts and time-variant changes, such as contact events, by effectively using the robot’s I/O history. Additionally, we identify task randomization as another key source of robustness, fostering better task generalization and compliance to disturbances. The resulting control policies can be successfully deployed on Cassie, a torque-controlled human-sized bipedal robot. This work pushes the limits of agility for bipedal robots through extensive real-world experiments. We demonstrate a diverse range of locomotion skills, including: robust standing, versatile walking, fast running with a demonstration of a 400-meter dash, and a diverse set of jumping skills, such as standing long jumps and high jumps.

Keywords

Humanoid and bipedal locomotion humanoids and animaloids reinforcement learning robot learning legged robots whole-body motion planning and control

Get full access to this article

View all access options for this article.

References

Agrawal

(2022) Model-based design for legged robots: predictive control and reinforcement learning. PhD Thesis. Berkeley: UC Berkeley.

Annaswamy

(2023) Adaptive control and intersections with reinforcement learning. Annual Review of Control, Robotics, and Autonomous Systems 6: 65–93.

Bogdanovic

Khadiv

Righetti

(2022) Model-free reinforcement learning for robust locomotion using demonstrations from trajectory optimization. Frontiers in Robotics and AI 9: 854212.

Boroujeni

Daneshman

Righetti

, et al. (2021) A unified framework for walking and running of bipedal robots. In: 2021 20th International Conference on Advanced Robotics (ICAR), Ljubljana, Slovenia, 06–10 December 2021, pp. 396–403, IEEE.

Bouyarmane

Kheddar

(2011) Using a multi-objective controller to synthesize simulated humanoid robot motion with changing contact configurations. In: 2011 IEEE/RSJ International Conference on Intelligent Robots and Systems, San Francisco, CA, USA, 25–30 September 2011, pp. 4414–4419, IEEE.

Caron

Kheddar

Tempier

(2019) Stair climbing stabilization of the hrp-4 humanoid robot using whole-body admittance control. In: 2019 International Conference on Robotics and Automation (ICRA), Montreal, QC, Canada, 20–24 May 2019, pp. 277–283, IEEE.

Castillo

Weng

Zhang

, et al. (2022) Reinforcement learning-based cascade motion policy design for robust 3d bipedal locomotion. IEEE Access 10: 20135–20148.

Chen

Zhang

Mueller

, et al. (2023) Learning torque control for quadrupedal locomotion. In: 2023 IEEE-RAS 22nd International Conference on Humanoid Robots (Humanoids), Austin, TX, USA, 12–14 December 2023, pp. 1–8.

Cheng

Kumar

Pathak

(2023) Legs as manipulator: pushing quadrupedal agility beyond locomotion. In: 2023 IEEE International Conference on Robotics and Automation (ICRA), 29 May–02 June 2023, London, UK, pp. 5106–5112.

10.

Chignoli

Kim

Stanger-Jones

, et al. (2021) The mit humanoid robot: Design, motion planning, and control for acrobatic behaviors. In: 2020 IEEE-RAS 20th International Conference on Humanoid Robots (Humanoids), Munich, Germany, 19–21 July 2021, pp. 1–8, IEEE.

11.

Crowley

Dao

Duan

, et al. (2023) Optimizing bipedal locomotion for the 100m dash with comparison to human running. In: 2023 IEEE International Conference on Robotics and Automation (ICRA), London, UK, 29 May–02 June 2023, 12205–12211, IEEE.

12.

Harib

Hartley

, et al. (2016) From 2d design of underactuated bipedal gaits to 3d implementation: walking with speed tracking. IEEE Access 4: 3469–3478.

13.

Dai

Valenzuela

Tedrake

(2014) Whole-body motion planning with centroidal dynamics and full kinematics. In: 2014 IEEE-RAS International Conference on Humanoid Robots, Madrid, Spain, 18-20 November 2014, pp. 295–302, IEEE.

14.

Daneshmand

Khadiv

Grimminger

, et al. (2021) Variable horizon mpc with swing foot dynamics for bipedal walking control. IEEE Robotics and Automation Letters 6(2): 2349–2356.

15.

Dao

Green

Duan

, et al. (2022) Sim-to-real learning for bipedal locomotion under unsensed dynamic loads. In: 2022 International Conference on Robotics and Automation (ICRA), Philadelphia, PA, USA, 23–27 May 2022, pp. 10449–10455, IEEE.

16.

Deits

Tedrake

(2014) Footstep planning on uneven terrain with mixed-integer convex optimization. In: 2014 IEEE-RAS International Conference on Humanoid Robots, Madrid, Spain, 18–20 November 2014, pp. 279–286, IEEE.

17.

Deits

Kuindersma

Kelly

, et al. (2022) Robot movement and online trajectory optimization. US Patent App. 17/358,628.

18.

DRL (2023) cassie-mujoco-sim. https://github.com/osudrl/cassie-mujoco-sim.

19.

Drnach

Zhao

(2021) Robust trajectory optimization over uncertain terrain with stochastic complementarity. IEEE Robotics and Automation Letters 6(2): 1168–1175.

20.

Escontrela

Peng

, et al. (2022) Adversarial motion priors make good substitutes for complex reward functions. In: 2022 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), Kyoto, Japan, 23–27 October 2022, pp. 25–32, IEEE.

21.

Feng

Zhang

, et al. (2023) Genloco: generalized locomotion controllers for quadrupedal robots. In: Conference on Robot Learning, Auckland, New Zealand, 14 December 2022, pp. 1893–1903, PMLR.

22.

Fernbach

Tonneau

Stasse

, et al. (2020) C-croc: continuous and convex resolution of centroidal dynamic trajectories for legged robots in multicontact scenarios. IEEE Transactions on Robotics 36(3): 676–691.

23.

Fevre

Wensing

Schmiedeler

(2020) Rapid bipedal gait optimization in casadi. In: 2020 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), Las Vegas, NV, USA, 24 October 2020–24 January 2021, pp. 3672–3678, IEEE.

24.

Kumar

Malik

, et al. (2021) Minimizing energy consumption leads to the emergence of gaits in legged robots. In: 5th Annual Conference on Robot Learning, London, UK, 08 November 2021.

25.

Cheng

Pathak

(2023) Deep whole-body control: learning a unified policy for manipulation and locomotion. In: Conference on Robot Learning, Auckland, New Zealand, 14 December 2022, pp. 138–149, PMLR.

26.

Gong

Grizzle

(2022) Zero dynamics, pendulum models, and angular momentum in feedback control of bipedal locomotion. Journal of Dynamic Systems, Measurement, and Control 144(12): 121006.

27.

Gong

Hartley

, et al. (2019) Feedback control of a cassie bipedal robot: walking, standing, and riding a segway. In: 2019 American Control Conference (ACC), Philadelphia, PA, USA, 10–12 July 2019, pp. 4559–4566, IEEE.

28.

Goswami

Vadakkepat

(2009) Planar bipedal jumping gaits with stable landing. IEEE Transactions on Robotics 25(5): 1030–1046.

29.

Haarnoja

Zhou

, et al. (2019) Learning to walk via deep reinforcement learning. In: Robotics: Science and Systems (RSS), Freiburg im Breisgau, 22–26 June 2019.

30.

Hereid

Hubicki

Cousineau

, et al. (2018) Dynamic humanoid locomotion: a scalable formulation for hzd gait optimization. IEEE Transactions on Robotics 34(2): 370–387.

31.

Hereid

Harib

Hartley

, et al. (2019) Rapid trajectory optimization using c-frost with illustration on a cassie-series dynamic walking biped. In: 2019 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), Macau, China, 03–08 November 2019. pp. 4722–4729.

32.

Huang

Xiang

, et al. (2023) Creating a dynamic quadrupedal robotic goalkeeper with reinforcement learning. In: 2023 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), Detroit, MI, USA, 1–5 October 2023, pp. 2715–2722.

33.

Huang

Chi

Wang

, et al. (2024) Diffuseloco: real-time legged locomotion control with diffusion from offline datasets. arXiv preprint arXiv:2404.19264 .

34.

Hwangbo

Lee

Dosovitskiy

, et al. (2019) Learning agile and dynamic motor skills for legged robots. Science Robotics 4(26): eaau5872.

35.

Ibanez

Bidaud

Padois

(2014) Emergence of humanoid walking behaviors from mixed-integer model predictive control. In: 2014 IEEE/RSJ International Conference on Intelligent Robots and Systems, Chicago, IL, USA, 14–18 September 2014, pp. 4014–4021, IEEE.

36.

Mun

Kim

, et al. (2022) Concurrent training of a control policy and a state estimator for dynamic and robust legged locomotion. IEEE Robotics and Automation Letters 7(2): 4630–4637.

37.

Kajita

Kanehiro

Kaneko

, et al. (2001) The 3d linear inverted pendulum mode: a simple modeling for a biped walking pattern generation. In: Proceedings 2001 IEEE/RSJ International Conference on Intelligent Robots and Systems. Expanding the Societal Role of Robotics in the the Next Millennium (Cat. No. 01CH37180), volume 1, Maui, HI, USA, 29 October–03 November 2001, pp. 239–246, IEEE.

38.

Kalashnikov

Varley

Chebotar

, et al. (2021) Mt-opt: continuous multi-task robotic reinforcement learning at scale. arXiv preprint arXiv:2104.08212 .

39.

Kim

Berseth

Schwartz

, et al. (2023) Torque-based deep reinforcement learning for task-and-robot agnostic learning on bipedal robots using sim-to-real transfer. IEEE Robotics and Automation Letters 8: 6251.

40.

Kojima

Kojio

Ishikawa

, et al. (2019) A robot design method for weight saving aimed at dynamic motions: design of humanoid jaxon3-p and realization of jump motions. In: 2019 IEEE-RAS 19th International Conference on Humanoid Robots (Humanoids), Toronto, ON, Canada, 15–17 October 2019, pp. 586–593, IEEE.

41.

Kuindersma

Permenter

Tedrake

(2014) An efficiently solvable quadratic program for stabilizing dynamic locomotion. In: 2014 IEEE International Conference on Robotics and Automation (ICRA), Hong Kong, China, 31 May–07 June 2014, pp. 2589–2594, IEEE.

42.

Kumar

Pathak

, et al. (2021) Rma: rapid motor adaptation for legged robots. Robotics: Science and Systems.

43.

Kumar

Zeng

, et al. (2022) Adapting rapid motor adaptation for bipedal robots. In: 2022 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), Kyoto, Japan, 23–27 October 2022, pp. 1161–1168.

44.

Landau

Lozano

M’Saad

, et al. (2011) Adaptive Control: Algorithms, Analysis and Applications. Berlin, Germany: Springer Science & Business Media.

45.

Landry

Lorenzetti

Manchester

, et al. (2022) Bilevel optimization for planning through contact: a semidirect method. Robotics Research: The 19th International Symposium ISRR. Berlin, Germany: Springer, 789–804.

46.

Le Cleac’h

Howell

Yang

, et al. (2024) Fast contact-implicit model predictive control. IEEE Transactions on Robotics 40: 1617.

47.

Lee

Sun

Somasundaram

, et al. (2018) Composing complex skills by learning transition policies. In: International Conference on Learning Representations, New Orleans, Louisiana, USA, 6–9 May 2019.

48.

Lee

Hwangbo

Wellhausen

, et al. (2020) Learning quadrupedal locomotion over challenging terrain. Science Robotics 5(47): eabc5986.

49.

Cummings

Sreenath

(2020) Animated cassie: a dynamic relatable robotic character. In: 2020 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), Las Vegas, NV, USA, 24 October 2020 - 24 January 2021, pp. 3739–3746.

50.

Cheng

Peng

, et al. (2021) Reinforcement learning for robust parameterized locomotion control of bipedal robots. In: 2021 IEEE International Conference on Robotics and Automation (ICRA), Xi'an, China, 30 May–05 June 2021, pp. 2811–2817, IEEE.

51.

, et al. (2023a) Learning agile bipedal motions on a quadrupedal robot. arXiv preprint arXiv:2311.05818 .

52.

Peng

Abbeel

, et al. (2023b) Robust and versatile bipedal jumping control through reinforcement learning. In: Robotics: Science and Systems XIX, Daegu, Republic of Korea, 10–14 July 2023.

53.

Lim

Kim

Cha

, et al. (2023) Proprioceptive external torque learning for floating base robot and its applications to humanoid locomotion. In: 2023 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), Detroit, MI, USA, 01–05 October 2023.

54.

Liu

Huang

, et al. (2023) Continual vision-based reinforcement learning with group symmetries. In: Conference on Robot Learning, Atlanta, USA, 06 November 2023, pp, 222–240, PMLR.

55.

Ljung

(1998) System identification. Signal Analysis and Prediction. Berlin, Germany: Springer, 163–173.

56.

Kolathaya

Ambrose

, et al. (2017) Bipedal robotic running with durus-2d: bridging the gap between theory and experiment. In: Proceedings of the 20th international conference on hybrid systems: computation and control, Pittsburgh, PA, USA, April 18-20, 2017, pp. 265–274.

57.

Marcucci

Gabiccini

Artoni

(2016) A two-stage trajectory optimization strategy for articulated bodies with unscheduled contact sequences. IEEE Robotics and Automation Letters 2(1): 104–111.

58.

Margolis

Agrawal

(2023) Walk these ways: tuning robot control for generalization with multiplicity of behavior. In: Conference on Robot Learning, Auckland, New Zealand, 14 December 2022, pp. 22–31, PMLR.

59.

Margolis

Yang

Paigwar

, et al. (2022) Rapid locomotion via reinforcement learning. In: Robotics: Science and Systems, New York City, NY, USA, June 27–1 July 2022.

60.

Meduri

Shah

Viereck

, et al. (2023) Biconmp: a nonlinear model predictive control framework for whole body motion planning. IEEE Transactions on Robotics 39(2): 905–922.

61.

Meuleau

Peshkin

Kim

, et al. (1999) Learning finite-state controllers for partially observable environments. In: Proceedings of the Fifteenth conference on Uncertainty in artificial intelligence, Stockholm Sweden, July 30–1 August 1999, pp. 427–436.

62.

Miki

Lee

Hwangbo

, et al. (2022) Learning robust perceptive locomotion for quadrupedal robots in the wild. Science Robotics 7(62): eabk2822.

63.

Moro

Sentis

(2019) Whole-body control of humanoid robots. In: Humanoid Robotics: A Reference. Dordrecht: Springer.

64.

Orin

Goswami

Lee

(2013) Centroidal dynamics of a humanoid robot. Autonomous Robots 35: 161–176.

65.

Peng

Andrychowicz

Zaremba

, et al. (2018) Sim-to-real transfer of robotic control with dynamics randomization. In: 2018 IEEE international Conference on Robotics and Automation (ICRA), Brisbane, QLD, Australia, 21–25 May 2018, pp. 3803–3810.

66.

Peng

Coumans

Zhang

, et al. (2020) Learning agile robotic locomotion skills by imitating animals. In: Robotics: Science and Systems (RSS), held virtually, 12–16 July 2020.

67.

Peng

Abbeel

, et al. (2021) Amp: adversarial motion priors for stylized physics-based character control. ACM Transactions on Graphics 40(4): 1–20.

68.

Posa

Cantu

Tedrake

(2014) A direct method for trajectory optimization of rigid bodies through contact. The International Journal of Robotics Research 33(1): 69–81.

69.

Pratt

Koolen

De Boer

, et al. (2012) Capturability-based analysis and control of legged locomotion, part 2: application to m2v2, a lower-body humanoid. The International Journal of Robotics Research 31(10): 1117–1133.

70.

Chen

, et al. (2023) Vertical jump of a humanoid robot with cop-guided angular momentum control and impact absorption. IEEE Transactions on Robotics 39: 3154–3166.

71.

Radosavovic

Xiao

Zhang

, et al. (2024) Real-world humanoid locomotion with reinforcement learning. Science Robotics 9(89): eadi9579.

72.

Raibert

Brown

Jr Chepponis

(1984) Experiments in balance with a 3d one-legged hopping machine. The International Journal of Robotics Research 3(2): 75–92.

73.

Reher

Ames

(2021) Control lyapunov functions for compliant hybrid zero dynamic walking. arXiv preprint arXiv:2107.04241 .

74.

Rodriguez

Behnke

(2021) Deepwalk: omnidirectional bipedal gait by deep reinforcement learning. In: 2021 IEEE international conference on robotics and automation (ICRA), Xi'an, China, 30 May–05 June 2021, pp. 3033–3039.

75.

Rummel

Blum

Maus

, et al. (2010) Stable and robust walking with compliant legs. In: 2010 IEEE International Conference on Robotics and Automation, Anchorage, AK, USA, 3–7 May 2010, pp. 5250–5255, IEEE.

76.

Schulman

Wolski

Dhariwal

, et al. (2017) Proximal policy optimization algorithms. arXiv preprint arXiv:1707.06347 .

77.

Sentis

Khatib

(2006) A whole-body control framework for humanoids operating in human environments. In: Proceedings 2006 IEEE International Conference on Robotics and Automation, 2006. ICRA 2006, Orlando, FL, USA, 15–19 May 2006, pp. 2641–2648, IEEE.

78.

SFU (2018) Sfu motion capture database. https://mocap.cs.sfu.ca/.

79.

Shao

Jin

Liu

, et al. (2021) Learning free gait transition for quadruped robots via phase-guided controller. IEEE Robotics and Automation Letters 7(2): 1230–1237.

80.

Siekmann

Valluri

Dao

, et al. (2020) Learning memory-based control for human-scale bipedal locomotion. In: Robotics Science and Systems, Corvalis, Oregon, USA, 12–16 July, 2020.

81.

Siekmann

Godse

Fern

, et al. (2021a) Sim-to-real learning of all common bipedal gaits via periodic reward composition. In: 2021 IEEE International Conference on Robotics and Automation (ICRA), Xi'an, China, 30 May–05 June 2021, pp. 7309–7315, IEEE.

82.

Siekmann

Green

Warila

, et al. (2021b) Blind bipedal stair traversal via sim-to-real reinforcement learning. In: Robotics: Science and Systems, Held Virtually, July 12–16, 2021.

83.

Singh

Xie

Gergondet

, et al. (2023) Learning bipedal walking for humanoids with current feedback. IEEE Access.

84.

Smith

Kew

Peng

, et al. (2022) Legged robots that keep on learning: fine-tuning locomotion policies in the real world. In: 2022 International Conference on Robotics and Automation (ICRA), Philadelphia, PA, USA, 23–27 May 2022, pp. 1593–1599, IEEE.

85.

Smith

Kostrikov

Levine

(2023) Demonstrating a walk in the park: learning to walk in 20 minutes with model-free reinforcement learning. Robotics: Science and Systems (RSS) Demo 2(3): 4.

86.

Spaan

(2012) Partially observable markov decision processes. Reinforcement learning: state-of-the-art. Berlin, Germany: Springer, 387–414.

87.

Sreenath

Park

Poulakakis

, et al. (2011) A compliant hybrid zero dynamics controller for stable, efficient and fast bipedal walking on mabel. The International Journal of Robotics Research 30(9): 1170–1193.

88.

Sreenath

Park

Poulakakis

, et al. (2013) Embedding active force control within the compliant hybrid zero dynamics to achieve stable, fast running on mabel. The International Journal of Robotics Research 32(3): 324–345.

89.

Takenaka

Matsumoto

Yoshiike

(2009a) Real time motion generation and control for biped robot-1 st report: walking gait pattern generation. In: 2009 IEEE/RSJ International Conference on Intelligent Robots and Systems, St. Louis, MO, USA, 10–15 October 2009, pp. 1084–1091, IEEE.

90.

Takenaka

Matsumoto

Yoshiike

, et al. (2009b) Real time motion generation and control for biped robot-2 nd report: running gait pattern generation. In: 2009 IEEE/RSJ International Conference on Intelligent Robots and Systems, St. Louis, MO, USA, 10–15 October 2009, pp. 1092–1099, IEEE.

91.

Todorov

Erez

Tassa

(2012) Mujoco: a physics engine for model-based control. In: 2012 IEEE/RSJ international conference on intelligent robots and systems, Vilamoura-Algarve, Portugal, 07–12 October 2012, pp. 5026–5033.

92.

van Marum

Shrestha

Duan

, et al. (2024) Revisiting reward design and evaluation for robust humanoid standing and walking. arXiv preprint arXiv:2404.19173.

93.

Vukobratović

Borovac

(2004) Zero-moment point—thirty five years of its life. International Journal of Humanoid Robotics 01(01): 157–173.

94.

Wensing

Orin

(2016) Improved computation of the humanoid centroidal dynamics and application for whole-body control. International Journal of Humanoid Robotics 13(01): 1550039.

95.

Wensing

Posa

, et al. (2023) Optimization-based control for dynamic legged robots. IEEE Transactions on Robotics 40: 43.

96.

Westenbroek

Castaneda

Agrawal

, et al. (2022) Lyapunov design for robust and efficient robotic reinforcement learning. In: 6th Annual Conference on Robot Learning, Auckland, New Zealand, 14 December 2022.

97.

Westervelt

Grizzle

Koditschek

(2003) Hybrid zero dynamics of planar biped walkers. IEEE Transactions on Automatic Control 48(1): 42–56.

98.

Xue

(2023a) Learning multiple gaits within latent space for quadruped robots. arXiv preprint arXiv:2308.03014 .

99.

Escontrela

Hafner

, et al. (2023b) Daydreamer: world models for physical robot learning. In: Conference on Robot Learning, Auckland, New Zealand, 14 December 2022, pp. 2226–2240, PMLR.

100.

Xie

Berseth

Clary

, et al. (2018) Feedback control for cassie with deep reinforcement learning. In: 2018 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), Madrid, Spain, 1–5 October 2018, 1241–1246, IEEE.

101.

Xie

Clary

Dao

, et al. (2020) Learning locomotion skills for cassie: iterative design and sim-to-real. Proceedings of the Conference on Robot Learning 100: 317–329.

102.

Xie

Van de Panne

, et al. (2021) Dynamics randomization revisited: a case study for quadrupedal locomotion. In: 2021 IEEE International Conference on Robotics and Automation (ICRA), Xi’an, China, 30 May–05 June 2021, pp. 4955–4961, IEEE.

103.

Xiong

Ames

(2018) Bipedal hopping: reduced-order model embedding via optimization-based control. In: 2018 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), Madrid, Spain, 1–5 October 2018, pp. 3821–3828, IEEE.

104.

Xiong

Ames

(2022) 3-d underactuated bipedal walking via h-lip based gait synthesis and stepping stabilization. IEEE Transactions on Robotics 38(4): 2405–2425.

105.

Yang

Posa

(2021) Impact invariant control with applications to bipedal locomotion. In: 2021 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), Prague, Czech Republic, 2021, pp. 5151–5158, IEEE.

106.

Yang

Posa

(2023) Impact-invariant control: maximizing control authority during impacts. arXiv preprint arXiv:2303.00817 .

107.

Yang

Zeng

, et al. (2022) Bayesian optimization meets hybrid zero dynamics: safe parameter learning for bipedal locomotion control. In: 2022 International Conference on Robotics and Automation (ICRA), Philadelphia, PA, USA, 23–27 May 2022, pp. 10456–10462, IEEE.

108.

Kumar

Turk

, et al. (2019) Sim-to-real transfer for biped locomotion. In: 2019 IEEE/RSJ international conference on intelligent robots and systems (IROS), Macau, China, 03–08 November 2019. pp. 3503–3510.

109.

Batke

Dao

, et al. (2022) Dynamic bipedal turning through sim-to-real reinforcement learning. In: 2022 IEEE-RAS 21st International Conference on Humanoid Robots (Humanoids), Ginowan, Japan, 28–30 November 2022, pp. 903–910, IEEE.

110.

Yunt

Glocker

(2006) Trajectory optimization of mechanical hybrid systems using sumt. In: 9th IEEE International Workshop on Advanced Motion Control, 2006, Istanbul, Turkey, 27–29 March 2006, pp. 665–671, IEEE.

111.

Zhu

Pan

Hauser

(2021) Contact-implicit trajectory optimization with learned deformable contacts using bilevel optimization. In: 2021 IEEE International Conference on Robotics and Automation (ICRA), Xi'an, China, 30 May–05 June 2021, pp. 9921–9927.

Supplementary Material

Please find the following supplemental material available below.

For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.

For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.

0.00 MB