RRT-CoLearn: Towards kinodynamic planning without numerical trajectory optimization

Wouter Wolfslag; Mukunda Bharatheesha; Thomas Moerland; Martijn Wisse

doi:10.1109/LRA.2018.2801470

RRT-CoLearn: Towards kinodynamic planning without numerical trajectory optimization

Wouter Wolfslag, Mukunda Bharatheesha, Thomas Moerland, Martijn Wisse

Research output: Contribution to journal › Article › Scientific › peer-review

27 Citations (Scopus)

Abstract

Sampling-based kinodynamic planners, such as Rapidly-exploring Random Trees (RRTs), pose two fundamental challenges: computing a reliable (pseudo-)metric for the distance between two randomly sampled nodes, and computing a steering input to connect the nodes. The core of these challenges is a Two Point Boundary Value Problem, which is known to be NP-hard. Recently, the distance metric has been approximated using supervised learning, reducing computation time drastically. The previous work on such learning RRTs use direct optimal control to generate the data for supervised learning. This paper proposes to use indirect optimal control instead, because it provides two benefits: it reduces the computational effort to generate the data, and it provides a low dimensional parametrization of the action space. The latter allows us to learn both the distance metric and the steering input to connect two nodes. This eliminates the need for a local planner in learning RRTs. Experimental results on a pendulum swing up show 10-fold speed-up in both the offline data generation and the online planning time, leading to at least a 10-fold speed-up in the overall planning time.

Original language	English
Pages (from-to)	1655-1662
Journal	IEEE Robotics and Automation Letters
Volume	3
Issue number	3
DOIs	https://doi.org/10.1109/LRA.2018.2801470
Publication status	Published - 2018

Access to Document

10.1109/LRA.2018.2801470

Cite this

@article{8c26584fcf75444f93665daddf1471fc,

title = "RRT-CoLearn: Towards kinodynamic planning without numerical trajectory optimization",

abstract = "Sampling-based kinodynamic planners, such as Rapidly-exploring Random Trees (RRTs), pose two fundamental challenges: computing a reliable (pseudo-)metric for the distance between two randomly sampled nodes, and computing a steering input to connect the nodes. The core of these challenges is a Two Point Boundary Value Problem, which is known to be NP-hard. Recently, the distance metric has been approximated using supervised learning, reducing computation time drastically. The previous work on such learning RRTs use direct optimal control to generate the data for supervised learning. This paper proposes to use indirect optimal control instead, because it provides two benefits: it reduces the computational effort to generate the data, and it provides a low dimensional parametrization of the action space. The latter allows us to learn both the distance metric and the steering input to connect two nodes. This eliminates the need for a local planner in learning RRTs. Experimental results on a pendulum swing up show 10-fold speed-up in both the offline data generation and the online planning time, leading to at least a 10-fold speed-up in the overall planning time.",

author = "Wouter Wolfslag and Mukunda Bharatheesha and Thomas Moerland and Martijn Wisse",

year = "2018",

doi = "10.1109/LRA.2018.2801470",

language = "English",

volume = "3",

pages = "1655--1662",

journal = "IEEE Robotics and Automation Letters",

issn = "2377-3766",

publisher = "Institute of Electrical and Electronics Engineers (IEEE)",

number = "3",

}

TY - JOUR

T1 - RRT-CoLearn

T2 - Towards kinodynamic planning without numerical trajectory optimization

AU - Wolfslag, Wouter

AU - Bharatheesha, Mukunda

AU - Moerland, Thomas

AU - Wisse, Martijn

PY - 2018

Y1 - 2018

N2 - Sampling-based kinodynamic planners, such as Rapidly-exploring Random Trees (RRTs), pose two fundamental challenges: computing a reliable (pseudo-)metric for the distance between two randomly sampled nodes, and computing a steering input to connect the nodes. The core of these challenges is a Two Point Boundary Value Problem, which is known to be NP-hard. Recently, the distance metric has been approximated using supervised learning, reducing computation time drastically. The previous work on such learning RRTs use direct optimal control to generate the data for supervised learning. This paper proposes to use indirect optimal control instead, because it provides two benefits: it reduces the computational effort to generate the data, and it provides a low dimensional parametrization of the action space. The latter allows us to learn both the distance metric and the steering input to connect two nodes. This eliminates the need for a local planner in learning RRTs. Experimental results on a pendulum swing up show 10-fold speed-up in both the offline data generation and the online planning time, leading to at least a 10-fold speed-up in the overall planning time.

AB - Sampling-based kinodynamic planners, such as Rapidly-exploring Random Trees (RRTs), pose two fundamental challenges: computing a reliable (pseudo-)metric for the distance between two randomly sampled nodes, and computing a steering input to connect the nodes. The core of these challenges is a Two Point Boundary Value Problem, which is known to be NP-hard. Recently, the distance metric has been approximated using supervised learning, reducing computation time drastically. The previous work on such learning RRTs use direct optimal control to generate the data for supervised learning. This paper proposes to use indirect optimal control instead, because it provides two benefits: it reduces the computational effort to generate the data, and it provides a low dimensional parametrization of the action space. The latter allows us to learn both the distance metric and the steering input to connect two nodes. This eliminates the need for a local planner in learning RRTs. Experimental results on a pendulum swing up show 10-fold speed-up in both the offline data generation and the online planning time, leading to at least a 10-fold speed-up in the overall planning time.

U2 - 10.1109/LRA.2018.2801470

DO - 10.1109/LRA.2018.2801470

M3 - Article

SN - 2377-3766

VL - 3

SP - 1655

EP - 1662

JO - IEEE Robotics and Automation Letters

JF - IEEE Robotics and Automation Letters

IS - 3

ER -

RRT-CoLearn: Towards kinodynamic planning without numerical trajectory optimization

Abstract

Access to Document

Fingerprint

Cite this