Vision-based navigation using deep reinforcement learning

Jonáš Kulhánek; Erik Derner; Tim De Bruin; Robert Babuska

doi:10.1109/ECMR.2019.8870964

Learning & Autonomous Control

Research output: Chapter in Book/Conference proceedings/Edited volume › Conference contribution › Scientific › peer-review

43 Citations (Scopus)

97 Downloads (Pure)

Abstract

Deep reinforcement learning (RL) has been successfully applied to a variety of game-like environments. However, the application of deep RL to visual navigation with realistic environments is a challenging task. We propose a novel learning architecture capable of navigating an agent, e.g. a mobile robot, to a target given by an image. To achieve this, we have extended the batched A2C algorithm with auxiliary tasks designed to improve visual navigation performance. We propose three additional auxiliary tasks: predicting the segmentation of the observation image and of the target image and predicting the depth-map. These tasks enable the use of supervised learning to pre-train a major part of the network and to reduce the number of training steps substantially. The training performance has been further improved by increasing the environment complexity gradually over time. An efficient neural network structure is proposed, which is capable of learning for multiple targets in multiple environments. Our method navigates in continuous state spaces and on the AI2-THOR environment simulator surpasses the performance of state-of-the-art goal-oriented visual navigation methods from the literature.
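As a rough illustration of how auxiliary-task losses are commonly folded into an actor-critic objective, the sketch below adds weighted auxiliary terms to an A2C-style loss. The function name, the coefficient defaults, and the task names are assumptions made for illustration; they are not taken from the paper.

```python
from typing import Mapping


def total_loss(
    policy_loss: float,
    value_loss: float,
    entropy: float,
    aux_losses: Mapping[str, float],
    aux_weights: Mapping[str, float],
    value_coef: float = 0.5,      # assumed default, not from the paper
    entropy_coef: float = 0.01,   # assumed default, not from the paper
) -> float:
    """Combine an A2C-style loss with weighted auxiliary-task losses."""
    # Weighted sum over auxiliary tasks (e.g. depth and segmentation prediction).
    aux_term = sum(aux_weights[name] * loss for name, loss in aux_losses.items())
    # Standard actor-critic form: the entropy bonus is subtracted to encourage exploration.
    return policy_loss + value_coef * value_loss - entropy_coef * entropy + aux_term


# Hypothetical per-batch values for the three auxiliary tasks named in the abstract.
example = total_loss(
    policy_loss=1.0,
    value_loss=2.0,
    entropy=0.5,
    aux_losses={"depth": 0.4, "seg_observation": 0.2, "seg_target": 0.1},
    aux_weights={"depth": 0.1, "seg_observation": 0.1, "seg_target": 0.1},
)
```

Because the auxiliary terms are ordinary supervised losses, the same terms could in principle be minimized on their own for pre-training before the reinforcement learning terms are switched on.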

Original language	English
Title of host publication	Proceedings of the European Conference on Mobile Robots (ECMR 2019)
Editors	Libor Preucil, Sven Behnke, Miroslav Kulich
Place of Publication	Piscataway, NJ, USA
Publisher	IEEE
Number of pages	8
ISBN (Electronic)	978-1-7281-3605-9
DOIs	https://doi.org/10.1109/ECMR.2019.8870964
Publication status	Published - 2019
Event	ECMR 2019: European Conference on Mobile Robots, Prague, Czech Republic; duration: 4 Sept 2019 → 6 Sept 2019

Conference

Conference	ECMR 2019: European Conference on Mobile Robots
Country/Territory	Czech Republic
City	Prague
Period	4/09/19 → 6/09/19

Bibliographical note

Accepted Author Manuscript

Keywords

Actor-critic
Auxiliary tasks
Deep reinforcement learning
Robot navigation

Access to Document

10.1109/ECMR.2019.8870964

Accepted author manuscript (AAM), 3.62 MB

Cite this

@inproceedings{1b6cf0139b29408298fa0d9409f9b13a,
title = "Vision-based navigation using deep reinforcement learning",
abstract = "Deep reinforcement learning (RL) has been successfully applied to a variety of game-like environments. However, the application of deep RL to visual navigation with realistic environments is a challenging task. We propose a novel learning architecture capable of navigating an agent, e.g. a mobile robot, to a target given by an image. To achieve this, we have extended the batched A2C algorithm with auxiliary tasks designed to improve visual navigation performance. We propose three additional auxiliary tasks: predicting the segmentation of the observation image and of the target image and predicting the depth-map. These tasks enable the use of supervised learning to pre-train a major part of the network and to reduce the number of training steps substantially. The training performance has been further improved by increasing the environment complexity gradually over time. An efficient neural network structure is proposed, which is capable of learning for multiple targets in multiple environments. Our method navigates in continuous state spaces and on the AI2-THOR environment simulator surpasses the performance of state-of-the-art goal-oriented visual navigation methods from the literature.",
keywords = "Actor-critic, Auxiliary tasks, Deep reinforcement learning, Robot navigation",
author = "Jon{\'a}{\v s} Kulh{\'a}nek and Erik Derner and {De Bruin}, Tim and Robert Babuska",
note = "Accepted Author Manuscript; ECMR 2019: European Conference on Mobile Robots; Conference date: 04-09-2019 through 06-09-2019",
year = "2019",
doi = "10.1109/ECMR.2019.8870964",
language = "English",
editor = "Libor Preucil and Sven Behnke and Miroslav Kulich",
booktitle = "Proceedings of the European Conference on Mobile Robots (ECMR 2019)",
publisher = "IEEE",
address = "Piscataway, NJ, USA",
}

Kulhánek, J, Derner, E, De Bruin, T & Babuska, R 2019, Vision-based navigation using deep reinforcement learning. in L Preucil, S Behnke & M Kulich (eds), Proceedings of the European Conference on Mobile Robots (ECMR 2019). IEEE, Piscataway, NJ, USA, ECMR 2019: European Conference on Mobile Robots, Prague, Czech Republic, 4/09/19. https://doi.org/10.1109/ECMR.2019.8870964

Vision-based navigation using deep reinforcement learning. / Kulhánek, Jonáš; Derner, Erik; De Bruin, Tim et al.
Proceedings of the European Conference on Mobile Robots (ECMR 2019). ed. / Libor Preucil; Sven Behnke; Miroslav Kulich. Piscataway, NJ, USA: IEEE, 2019.

TY - GEN
T1 - Vision-based navigation using deep reinforcement learning
AU - Kulhánek, Jonáš
AU - Derner, Erik
AU - De Bruin, Tim
AU - Babuska, Robert
N1 - Accepted Author Manuscript
PY - 2019
Y1 - 2019
N2 - Deep reinforcement learning (RL) has been successfully applied to a variety of game-like environments. However, the application of deep RL to visual navigation with realistic environments is a challenging task. We propose a novel learning architecture capable of navigating an agent, e.g. a mobile robot, to a target given by an image. To achieve this, we have extended the batched A2C algorithm with auxiliary tasks designed to improve visual navigation performance. We propose three additional auxiliary tasks: predicting the segmentation of the observation image and of the target image and predicting the depth-map. These tasks enable the use of supervised learning to pre-train a major part of the network and to reduce the number of training steps substantially. The training performance has been further improved by increasing the environment complexity gradually over time. An efficient neural network structure is proposed, which is capable of learning for multiple targets in multiple environments. Our method navigates in continuous state spaces and on the AI2-THOR environment simulator surpasses the performance of state-of-the-art goal-oriented visual navigation methods from the literature.
AB - Deep reinforcement learning (RL) has been successfully applied to a variety of game-like environments. However, the application of deep RL to visual navigation with realistic environments is a challenging task. We propose a novel learning architecture capable of navigating an agent, e.g. a mobile robot, to a target given by an image. To achieve this, we have extended the batched A2C algorithm with auxiliary tasks designed to improve visual navigation performance. We propose three additional auxiliary tasks: predicting the segmentation of the observation image and of the target image and predicting the depth-map. These tasks enable the use of supervised learning to pre-train a major part of the network and to reduce the number of training steps substantially. The training performance has been further improved by increasing the environment complexity gradually over time. An efficient neural network structure is proposed, which is capable of learning for multiple targets in multiple environments. Our method navigates in continuous state spaces and on the AI2-THOR environment simulator surpasses the performance of state-of-the-art goal-oriented visual navigation methods from the literature.
KW - Actor-critic
KW - Auxiliary tasks
KW - Deep reinforcement learning
KW - Robot navigation
UR - http://www.scopus.com/inward/record.url?scp=85074442868&partnerID=8YFLogxK
U2 - 10.1109/ECMR.2019.8870964
DO - 10.1109/ECMR.2019.8870964
M3 - Conference contribution
BT - Proceedings of the European Conference on Mobile Robots (ECMR 2019)
A2 - Preucil, Libor
A2 - Behnke, Sven
A2 - Kulich, Miroslav
PB - IEEE
CY - Piscataway, NJ, USA
T2 - ECMR 2019: European Conference on Mobile Robots
Y2 - 4 September 2019 through 6 September 2019
ER -
