Sage Journals: Discover world-class research

Abstract

Damage recovery is critical for autonomous robots that need to operate for a long time without assistance. Most current methods are complex and costly because they require anticipating potential damage in order to have a contingency plan ready. As an alternative, we introduce the T-resilience algorithm, a new algorithm that allows robots to quickly and autonomously discover compensatory behavior in unanticipated situations. This algorithm equips the robot with a self-model and discovers new behavior by learning to avoid those that perform differently in the self-model and in reality. Our algorithm thus does not identify the damaged parts but it implicitly searches for efficient behavior that does not use them. We evaluate the T-resilience algorithm on a hexapod robot that needs to adapt to leg removal, broken legs and motor failures; we compare it to stochastic local search, policy gradient and the self-modeling algorithm proposed by Bongard et al. The behavior of the robot is assessed on-board thanks to an RGB-D sensor and a SLAM algorithm. Using only 25 tests on the robot and an overall running time of 20 min, T-resilience consistently leads to substantially better results than the other approaches.

Keywords

Damage recovery evolutionary algorithm fault tolerance hexapod locomotion learning long-term autonomy resilience transferability

Get full access to this article

View all access options for this article.

References

Argall

Chernova

Veloso

. (2009) A survey of robot learning from demonstration. Robotics and Autonomous Systems 57(5): 469–483.

Barfoot

Earon

D’Eleuterio

(2006) Experiments in learning distributed control for a hexapod robot. Robotics and Autonomous Systems 54(10): 864–872.

Bellingham

Rajan

(2007) Robotics in remote and hostile environments. Science 318(5853): 1098–1102.

Berenson

Estevez

Lipson

(2005) Hardware evolution of analog circuits for in-situ robotic fault-recovery. In: Proceedings of NASA/DoD conference on evolvable hardware, pp. 12–19.

Bongard

(2007) Action-selection and crossover strategies for self-modeling machines. In: Proceedings of genetic and evolutionary computation conference (GECCO), pp. 198–205.

Bongard

Lipson

(2005) Nonlinear system identification using coevolution of models and tests. IEEE Transactions on Evolutionary Computation 9(4): 361–384.

Bongard

Zykov

Lipson

(2006) Resilient machines through continuous self-modeling. Science 314(5802): 1118–1121.

Caccavale

Villani

(2002) Fault Diagnosis and Fault Tolerance for Mechatronic Systems: Recent Advances. New York, NY: Springer.

Cantu-Paz

(2000) Efficient and Accurate Parallel Genetic Algorithms. Norwell, MA: Kluwer Academic Publishers.

10.

Chang

Lin

(2011) Libsvm: A library for support vector machines. ACM Transactions on Intelligent Systems and Technology 2(3): 27.

11.

Chernova

Veloso

(2004) An evolutionary approach to gait learning for four-legged robots. In: Proceedings of IEEE/RSJ international conference on intelligent robots and systems (IROS), pp. 2562–2567.

12.

Clune

Stanley

Pennock

. (2011) On the performance of indirect encoding across the continuum of regularity. IEEE Transactions on Evolutionary Computation 15(3): 346–367.

13.

Connell

Mahadevan

(1993) Robot Learning. New York, NY: Springer.

14.

Corbato

(2007) On building systems that will fail. ACM Turing Award Lectures 34(9): 72–81.

15.

Cully

Mouret

(2013 a) Behavioral repertoire learning in robotics. In: Proceedings of genetic and evolutionary computation conference (GECCO). pp. 175–182.

16.

Cully

Mouret

(2013 b) Learning to walk in every direction. Available at: http://arxiv.org/abs/1308.3689.

17.

De Jong

(2006) Evolutionary Computation: A Unified Approach. Cambridge, MA: MIT Press.

18.

Deb

(2001) Multi-Objective Optimization Using Evolutionary Algorithms. New York, NY: John Wiley and Sons.

19.

Deb

Pratap

Agarwal

. (2002) A fast and elitist multiobjective genetic algorithm: NSGA-II. IEEE Transactions on Evolutionary Computation 6(2): 182–197.

20.

Delcomyn

(1971) The locomotion of the cockroach Periplaneta americana. Journal of Experimental Biology 54(2): 443–452.

21.

Ding

Wang

Rovetta

. (2010) Locomotion analysis of hexapod robot. Proceedings of conference on climbing and walking robots (CLAWAR), pp. 291–310.

22.

Doncieux

Mouret

J-B

Bredeche

. (2011) Evolutionary robotics: Exploring new horizons. In: Doncieux

Bredèche

Mouret

J-B

(eds) New Horizons in Evolutionary Robotics: Extended Contributions from the 2009 EvoDeRob Workshop. New York, NY: Springer, pp. 3–25.

23.

Endres

Hess

Engelhard

. (2012) An evaluation of the RGB-D SLAM system. In: Proceedings of the IEEE international conference on robotics and automation (ICRA).

24.

Goldberg

Chen

(2001) Collaborative control of robot motion: Robustness to error. In: Proceedings of IEEE/RSJ international conference on intelligent robots and systems (IROS).

25.

Görner

Hirzinger

(2010) Analysis and evaluation of the stability of a biologically inspired, leg loss tolerant gait for six- and eight-legged walking robots. In: Proceedings of the IEEE international conference on robotics and automation (ICRA), pp. 4728–4735.

26.

Grefenstette

Schultz

Moriarty

(1999) Evolutionary algorithms for reinforcement learning. Journal of Artificial Intelligence Research 11: 241–276.

27.

Hartland

Bredeche

(2006) Evolutionary robotics, anticipation and the reality gap. In: IEEE international conference on robotics and biomimetics, pp. 1640–1645.

28.

Heidrich-Meisner

Igel

(2009) Neuroevolution strategies for episodic reinforcement learning. Journal of Algorithms 64(4): 152–168.

29.

Hemker

Stelzer

Von Stryk

. (2009) Efficient walking speed optimization of a humanoid robot. The International Journal of Robotics Research 28(2): 303–314.

30.

Hoffmann

Marques

Arieta

. (2010) Body schema in robotics: A review. IEEE Transactions on Autonomous Mental Development 2(4): 304–324.

31.

Holland

Goodman

(2003) Robots with internal models: A route to machine consciousness? Journal of Consciousness Studies 10(4–5): 4–5.

32.

Hoos

Stützle

(2005) Stochastic Local Search: Foundations And Applications. Burlington, MA: Morgan Kaufmann.

33.

Hornby

Takamura

Yamamoto

. (2005) Autonomous evolution of dynamic gaits with two quadruped robots. IEEE Transactions on Robotics 21(3): 402–410.

34.

Hornby

Lohn

Linden

(2011) Computer-automated evolution of an X-band antenna for NASA’s Space Technology 5 mission. Evolutionary Computation 19(1): 1–23.

35.

Jakimovski

Maehle

(2010) In situ self-reconfiguration of hexapod robot OSCAR using biologically inspired approaches. In: Miripour

(ed.) Climbing and Walking Robots. Rijeka, Croatia: InTech.

36.

Jakobi

Husbands

Harvey

(1995) Noise and the reality gap: The use of simulation in evolutionary robotics. Proceedings of the European conference on artificial life (ECAL), pp. 704–720.

37.

Kajita

Espiau

(2008) Legged robots. In: Siciliano

Khatib

(eds) Handbook of Robotics. New York, NY: Springer, pp. 361–389.

38.

Katić

Vukobratović

(2003) Survey of intelligent control techniques for humanoid robots. Journal of Intelligent & Robotic Systems 37(2): 117–141.

39.

Kimura

Yamashita

Kobayashi

(2001) Reinforcement learning of walking behavior for a four-legged robot. In: Proceedings of IEEE conference on decision and control (CDC), pp. 411–416.

40.

Klaus

Glette

Tørresen

(2012) A comparison of sampling strategies for parameter estimation of a robot simulator. Simulation, Modeling, and Programming for Autonomous Robots 7628: 173–184.

41.

Kober

Peters

(2010) Imitation and reinforcement learning – practical learning algorithms for motor primitives in robotics. IEEE Robotics and Automation Magazine 17(2): 1–8.

42.

Kober

Peters

(2012) Reinforcement learning in robotics: A survey. In: Adaptation, Learning, and Optimization (ALO). New York, NY: Springer, pp. 579–610.

43.

Kohl

Stone

(2004) Policy gradient reinforcement learning for fast quadrupedal locomotion. In: Proceedings of the IEEE international conference on robotics and automation (ICRA), pp. 2619–2624.

44.

Koos

Mouret

(2011) Online discovery of locomotion modes for wheel-legged hybrid robots: A transferability-based approach. In: Proceedings of the conference on climbing and walking robots (CLAWAR), pp. 70–77.

45.

Koos

Mouret

Doncieux

(2013) The transferability approach: Crossing the reality gap in evolutionary robotics. IEEE Transactions on Evolutionary Computation 17(1): 22–145.

46.

Koren

Krishna

(2007) Fault-Tolerant Systems. Burlington, MA: Morgan Kaufmann.

47.

Lin

Chen

(2007) Robust fault-tolerant control for a biped robot using a recurrent cerebellar model articulation controller. Systems, Man, and Cybernetics, Part B: Cybernetics 37(1): 110–123.

48.

Mahdavi

Bentley

(2003) An evolutionary approach to damage recovery of robot motion with muscles. Advances in Artificial Life 2801: 248–255.

49.

Mahdavi

Bentley

(2006) Innately adaptive robotics through embodied evolution. Autonomous Robots 20(2): 149–163.

50.

Metzinger

(2004) Being No One: The Self-Model Theory Of Subjectivity. Cambridge, MA: MIT Press.

51.

Metzinger

(2007) Self models. Scholarpedia 2(10): 4174.

52.

Moore

(1975) Progress in digital integrated electronics. In: International electron devices meeting, pp. 11–13.

53.

Mostafa

Tsai

Her

(2010) Alternative gaits for multiped robots with leg failures to retain maneuverability. International Journal of Advanced Robotic Systems 7(4): 31.

54.

Mouret

Doncieux

(2010) Sferes: Evolvin’ in the multi-core world. In: Proceedings of the IEEE congress on evolutionary computation (CEC), pp. 4079–4086.

55.

Mouret

Doncieux

(2012) Encouraging behavioral diversity in evolutionary robotics: An empirical study. Evolutionary Computation 20(1): 91–133.

56.

Mouret

Koos

Doncieux

(2012) Crossing the reality gap: A short introduction to the transferability approach. In: Proceedings of ALIFE’s workshop ‘Evolution in physical systems‘, pp. 1–7.

57.

Mulder

Hochstenbach

Dijkstra

. (2008) Born to adapt, but not in your dreams. Consciousness and Cognition 17(4): 1266–71.

58.

Nakamura

Mori

Sato

. (2007) Reinforcement learning for a biped robot based on a CPG-actor-critic method. Neural Networks 20(6): 723–735.

59.

Nelson

Barlow

Doitsidis

(2009) Fitness functions in evolutionary robotics: A survey and analysis. Robotics and Autonomous Systems 57(4): 345–370.

60.

Nguyen-Tuong

Peters

(2011) Model learning for robot control: A survey. Cognitive Processing 12(4): 319–340.

61.

Palmer

Miller

Blackwell

(2009) An evolved neural controller for bipedal walking: Transitioning from simulator to hardware. In: Proceedings of IROS workshop on exploring new horizons in evolutionary design of robots.

62.

Parker

(2009) Punctuated anytime learning to evolve robot control for area coverage. Design and Control of Intelligent Robotic Systems 177: 255–277.

63.

Peters

(2010) Policy gradient methods. Scholarpedia 5(10): 3698.

64.

Peters

Schaal

(2008) Reinforcement learning of motor skills with policy gradients. Neural Networks 21(4): 682–697.

65.

Prassler

Kosuge

(2008) Domestic robotics. In: Siciliano

Khatib

(eds.) Handbook of Robotics. New York, NY: Springer, pp. 1253–1281.

66.

Pretorius

du Plessis

Cilliers

(2012) Simulating robots without conventional physics: A neural network approach. Journal of Intelligent & Robotic Systems 71: 319–348.

67.

Ihlefeld

Jin

. (2003) Robust fault-tolerant self-recovering control of nonlinear uncertain systems. Automatica 39(10): 1763–1771.

68.

Quigley

Conley

Gerkey

. (2009) ROS: An open-source robot operating system. In: Proceedings of ICRA’s workshop on open source software.

69.

Ramachandran

Hirstein

(1998) The perception of phantom limbs. Brain 121(9): 1603–1630.

70.

Saranli

Buehler

Koditschek

(2001) Rhex: A simple and highly mobile hexapod robot. The International Journal of Robotics Research 20(7): 616–631.

71.

Schleyer

Russell

(2010) Adaptable gait generation for autotomised legged robots. In: Proceedings of Australasian conference on robotics and automation (ACRA).

72.

Schmitz

Dean

Kindermann

. (2001) A biologically inspired controller for hexapod walking: Simple solutions by exploiting physical properties. The Biological Bulletin 200(2): 195–200.

73.

Smola

Schölkopf

(2004) A tutorial on support vector regression. Statistics and Computing 14(3): 199–222.

74.

Smola

Vapnik

(1997) Support vector regression machines. Advances in Neural Information Processing Systems 9: 155–161.

75.

Sproewitz

Moeckel

Maye

. (2008) Learning to move in modular robots using central pattern generators and online optimization. The International Journal of Robotics Research 27(3–4): 423–443.

76.

Steingrube

Timme

Wörgötter

. (2010) Self-organized adaptation of a simple neural circuit enables complex robot behaviour. Nature Physics 6(3): 224–230.

77.

Sturm

Plagemann

Burgard

(2008) Adaptive body scheme models for robust robotic manipulation. In: Robotics: Science and systems.

78.

Sutton

McAllester

Singh

. (2000) Policy gradient methods for reinforcement learning with function approximation. Advances In Neural Information Processing Systems 12(22): 1057–1063.

79.

Sutton

Barto

(1998) Introduction to Reinforcement Learning. Cambridge, MA: MIT Press.

80.

Tedrake

Zhang

Seung

(2005) Learning to walk in 20 minutes. In: Proceedings of Yale workshop on adaptive and learning systems.

81.

Toffolo

Benini

(2003) Genetic diversity as an objective in multi-objective evolutionary algorithms. Evolutionary Computation 11(2): 151–167.

82.

Togelius

Schaul

Wierstra

. (2009) Ontogenetic and phylogenetic reinforcement learning. Kuenstliche Intelligenz 23(3): 30–33.

83.

Turing

(1950) Computing machinery and intelligence. Mind 59(236): 433–460.

84.

Visinsky

Cavallaro

Walker

(1994) Robotic fault detection and fault tolerance: A survey. Reliability Engineering & System Safety 46(2): 139–158.

85.

Vogeley

Kurthen

Falkai

. (1999) Essential functions of the human self model are implemented in the prefrontal cortex. Consciousness and Cognition 8(3): 343–63.

86.

Weingarten

Lopes

Buehler

. (2004) Automated gait adaptation for legged robots. In: Proceedings of the IEEE international conference on robotics and automation (ICRA), pp. 2153–2158.

87.

Whiteson

(2012) Evolutionary computation for reinforcement learning. In: Wiering

van Otterlo

(eds) Reinforcement Learning: State of the Art. Berlin: Springer, pp. 326–355.

88.

Wilson

(1966) Insect walking. Annual Review of Entomology 11(1): 103–122.

89.

Yosinski

Clune

Hidalgo

. (2011) Evolving robot gaits in hardware: The HyperNEAT generative encoding vs. parameter optimization. In: Proceedings of the 20th European conference on artificial life (ECAL), pp. 11–18.

90.

Zagal

Delpiano

Ruiz-del Solar

(2009) Self-modeling in humanoid soccer robots. Robotics and Autonomous Systems 57(8): 819–827.

91.

Zagal

Ruiz-del Solar

Vallejos

(2004) Back to reality: Crossing the reality gap in evolutionary robotics. In: Proceedings of IFAC symposium on intelligent autonomous vehicles (IAV).

92.

Zykov

(2008) Morphological and behavioral resilience against physical damage for robotic systems. PhD Thesis, Cornell University, NY, USA.

93.

Zykov

Bongard

Lipson

(2004) Evolving dynamic gaits on a physical robot. In: Proceedings of genetic and evolutionary computation conference, late breaking paper (GECCO).

Supplementary Material

Please find the following supplemental material available below.

For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.

For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.

0.00 MB

Fast damage recovery in robotics with the T-resilience algorithm

Abstract

Keywords

Get full access to this article

References

Supplementary Material