Sage Journals: Discover world-class research

Abstract

We develop a technique to automatically generate a control policy for a robot moving in an environment that includes elements with unknown, randomly changing behavior. The robot is required to achieve a surveillance mission, in which a certain request needs to be serviced repeatedly, while the expected time inbetween consecutive services is minimized and additional temporal logic constraints are satisfied. We define a fragment of linear temporal logic to describe such a mission and formulate the problem as a temporal logic game. Our approach is based on two main ideas. First, we extend results in automata learning to detect patterns of the unknown behavior of the elements in the environment. Second, we employ an automata–theoretic method to generate the control policy. We show that the obtained control policy converges to an optimal one when the partially unknown behavior patterns are fully learned. In addition, we illustrate the method in an experimental setup, in which an unmanned ground vehicle, with the help of a cooperating unmanned aerial vehicle (UAV), satisfies a temporal logic requirement in a partitioned environment whose regions are controlled by barriers with unknown behavior.

Keywords

learning and adaptive systems cognitive robotics autonomous agents AI reasoning methods

Get full access to this article

View all access options for this article.

References

Angluin

(1982) Inference of reversible languages. Journal of the ACM 29(3): 741–765.

Antoniotti

Mishra

(1995) Discrete event models + temporal logic = supervisory controller: Automatic synthesis of locomotion controllers. In: IEEE international conference on robotics and automation (ICRA 21–27 May, 1995), Nagoya, Japan, pp. 1441–1446. Piscataway: IEEE Press.

Baier

Katoen

(2008) Principles of Model Checking. Cambridge: MIT Press.

Becerra-Bonache

Dediu

Tîrnauca

(2006) Learning DFA from correction and equivalence queries. In: Sakakibara

Kobayashi

Sato

Nishino

Tomita

(eds) Grammatical Inference: Algorithms and Applications. Berlin: Springer, vol. 4201, pp. 281–292.

Bhatia

Kavraki

Vardi

(2010) Sampling-based motion planning with temporal goals. In: IEEE International conference on robotics and automation, 3-8 May, (ICRA ‘10), Anchorage, USA, pp. 2689–2696. Piscataway: IEEE Press.

Chatterjee

Henzinger

(2011) Faster and dynamic algorithms for maximal end-component decomposition and related graph problems in probabilistic verification. In: 22nd annual ACM-SIAM symposium on discrete algorithms, 23–25 January, Philadelphia. (SODA ‘11), San Francisco, USA, pp. 1318–1336. SIAM

Chen

Tumova

Belta

(2012a) LTL robot motion control based on automata learning of environmental dynamics. In: IEEE international conference on robotics and automation, 14–18 May, (ICRA ‘12), Saint Paul, USA. pp. 5177–5182. Piscataway: IEEE Press.

Chen

Ding

Stefanescu

Belta

(2012b) Formal approach to the deployment of distributed robotic teams. IEEE Transactions on Robotics 28(1): 158–171.

Courcoubetis

Yannakakis

(1995) The complexity of probabilistic verification. Journal of the ACM 42(4): 857–907.

10.

Ding

Smith

Belta

Rus

(2011a) MDP optimal control under temporal logic constraints. In: IEEE conference on decision and control and european control conference 12–15 December. Orlando, USA, pp. 532–538. Piscataway: IEEE Press.

11.

Ding

Smith

Belta

Rus

(2011b) LTL control in uncertain environments with probabilistic satisfaction guarantees. In: 18th IFAC world congress, 28 August - 2 September, Milano, Italy, vol. 8, pp. 3515–3520. Amsterdam: Elsevier.

12.

Garcia

Vidal

Oncina

(1990) Learning locally testable languages in the strict sense In: workshop on algorithmic learning theory, 8–10 October, Tokyo, Japan. Ohmsha Ltd, Tokyo. pp. 325–338.

13.

Grädel

Thomas

Wilke

(2002) Automata, Logics, and Infinite Games: A Guide to Current Research. Berlin: Springer, vol. 2500.

14.

Heinz

(2010) String extension learning. In: 48th annual meeting of the association for computational linguistics, 11–16 July, (ACL ‘10), Upsalla, Sweden, pp. 897–906. Stroudsburg: ACL.

15.

Horning

(1969) A study of grammatical inference. PhD Thesis, Stanford University, USA.

16.

Karaman

Frazzoli

(2008) Complex mission optimization for multiple-UAVs using linear temporal logic. In: 2008 american control conference, 11–13 June, Seattle, USA, pp. 2003–2009. Piscataway: IEEE Press.

17.

Klein

Baier

(2006) Experiments with deternimistic ω-automata for formulas of linear temporal logic. Theoretical Computer Science 363(2): 182–195.

18.

Kloetzer

Belta

(2008) Dealing with nondeterminism in symbolic control. In: Egerstedt

Mishra

(eds) Hybrid Systems: Computation and Control. Berlin: Springer, vol. 4981, pp. 287–300.

19.

Kloetzer

Belta

(2010) Automatic deployment of distributed teams of robots from temporal logic motion specifications. IEEE Transactions on Robotics 26(1): 48–61.

20.

Kress-Gazit

Conner

Choset

Rizzi

Pappas

(2008) Courteous cars. IEEE Robotics and Automation Magazine 15(1): 30–38.

21.

LaValle

(2006) Planning Algorithms. Cambridge: Cambridge University Press.

22.

Rawal

Tanner

Heinz

(2011) (Sub)regular robotic languages. In: IEEE mediterranean conference on control and automation, 20–23 June, Piscataway. pp. 321–326. Piscataway: IEEE Press.

23.

Tumova

Yordanov

Belta

Cerna

Barnat

(2010) A symbolic approach to controlling piecewise affine systems. In: IEEE conference on decision and control and european control conference, 15–17 December, Piscataway, Atlanta, USA. Piscataway: IEEE Press.

24.

Wongpiromsarn

Topcu

Murray

(2009) Receding horizon temporal logic planning for dynamical systems. In: IEEE conference on decision and control and chinese control conference, 16–18 December, Shanghai, China, pp. 5997–6004. Piscataway: IEEE Press.

Supplementary Material

Please find the following supplemental material available below.

For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.

For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.

0.00 MB

Temporal logic robot control based on automata learning of environmental dynamics

Abstract

Keywords

Get full access to this article

References

Supplementary Material