Abstract
Goal-finding in an unknown maze is a challenging problem for a Reinforcement Learning agent, because the corresponding state space can be large, if not intractable, and the agent does not usually have a model of the environment. Hierarchical Reinforcement Learning has been shown in the past to improve the tractability and learning time of complex problems, as well as to facilitate learning a coherent transition model for the environment. Nonetheless, considerable time is still needed to learn the transition model, so that initially the agent can perform poorly, getting trapped in dead ends and colliding with obstacles. This paper proposes a strategy for maze exploration that, by means of sequential tasking and off-line training on an abstract environment, provides the agent with a minimal level of performance from the very beginning of exploration. In particular, this approach allows the agent to avoid collisions with obstacles, thus enforcing a safety restraint on the agent.
Original language | English
---|---
Title of host publication | 2016 IEEE Symposium Series on Computational Intelligence
Subtitle of host publication | Athens, Greece
Editors | Y. Jin, S. Kollias
Publisher | IEEE
Number of pages | 8
DOIs |
Publication status | E-pub ahead of print - 2016
Event | 2016 IEEE Symposium Series on Computational Intelligence, Athens, Greece, 6 Oct 2016 → 9 Oct 2016, http://ssci2016.cs.surrey.ac.uk/
Conference
Conference | 2016 IEEE Symposium Series on Computational Intelligence
---|---
Abbreviated title | SSCI 2016
Country/Territory | Greece
City | Athens
Period | 6/10/16 → 9/10/16
Internet address | http://ssci2016.cs.surrey.ac.uk/