Reinforcement learning based compensation methods for robot manipulators

Yudha P. Pane; Subramanya P. Nageshrao; Jens Kober; Robert Babuška

doi:10.1016/j.engappai.2018.11.006

Reinforcement learning based compensation methods for robot manipulators

Yudha P. Pane, Subramanya P. Nageshrao^*, Jens Kober, Robert Babuška

^*Corresponding author for this work

Learning & Autonomous Control

Research output: Contribution to journal › Article › Scientific › peer-review

77 Citations (Scopus)

67 Downloads (Pure)

Abstract

Smart robotics will be a core feature while migrating from Industry 3.0 (i.e., mass manufacturing) to Industry 4.0 (i.e., customized or social manufacturing). A key characteristic of a smart system is its ability to learn. For smart manufacturing, this means incorporating learning capabilities into the current fixed, repetitive, task-oriented industrial manipulators, thus rendering them ‘smart’. In this paper we introduce two reinforcement learning (RL) based compensation methods. The learned correction signal, which compensates for unmodeled aberrations, is added to the existing nominal input with an objective to enhance the control performance. The proposed learning algorithms are evaluated on a 6-DoF industrial robotic manipulator arm to follow different kinds of reference paths, such as square or a circular path, or to track a trajectory on a three dimensional surface. In an extensive experimental study we compare the performance of our learning-based methods with well-known tracking controllers, namely, proportional-derivative (PD), model predictive control (MPC), and iterative learning control (ILC). The experimental results show a considerable performance improvement thanks to our RL-based methods when compared to PD, MPC, and ILC.

Original language	English
Pages (from-to)	236-247
Journal	Engineering Applications of Artificial Intelligence
Volume	78
DOIs	https://doi.org/10.1016/j.engappai.2018.11.006
Publication status	Published - 2019

Bibliographical note

Green Open Access added to TU Delft Institutional Repository 'You share, we take care!' - Taverne project https://www.openaccess.nl/en/you-share-we-take-care

Otherwise as indicated in the copyright section: the publisher is the copyright holder of this work and the author uses the Dutch legislation to make this work public.

Keywords

Actor-critic scheme
Reinforcement learning
Robotics
Tracking control

Access to Document

10.1016/j.engappai.2018.11.006

1-s2.0-S0952197618302446-mainFinal published version, 1.58 MB

Cite this

@article{f8288f92b1a842ae8208cbf4da569efe,

title = "Reinforcement learning based compensation methods for robot manipulators",

abstract = "Smart robotics will be a core feature while migrating from Industry 3.0 (i.e., mass manufacturing) to Industry 4.0 (i.e., customized or social manufacturing). A key characteristic of a smart system is its ability to learn. For smart manufacturing, this means incorporating learning capabilities into the current fixed, repetitive, task-oriented industrial manipulators, thus rendering them {\textquoteleft}smart{\textquoteright}. In this paper we introduce two reinforcement learning (RL) based compensation methods. The learned correction signal, which compensates for unmodeled aberrations, is added to the existing nominal input with an objective to enhance the control performance. The proposed learning algorithms are evaluated on a 6-DoF industrial robotic manipulator arm to follow different kinds of reference paths, such as square or a circular path, or to track a trajectory on a three dimensional surface. In an extensive experimental study we compare the performance of our learning-based methods with well-known tracking controllers, namely, proportional-derivative (PD), model predictive control (MPC), and iterative learning control (ILC). The experimental results show a considerable performance improvement thanks to our RL-based methods when compared to PD, MPC, and ILC.",

keywords = "Actor-critic scheme, Reinforcement learning, Robotics, Tracking control",

author = "Pane, {Yudha P.} and Nageshrao, {Subramanya P.} and Jens Kober and Robert Babu{\v s}ka",

note = "Green Open Access added to TU Delft Institutional Repository 'You share, we take care!' - Taverne project https://www.openaccess.nl/en/you-share-we-take-care Otherwise as indicated in the copyright section: the publisher is the copyright holder of this work and the author uses the Dutch legislation to make this work public.",

year = "2019",

doi = "10.1016/j.engappai.2018.11.006",

language = "English",

volume = "78",

pages = "236--247",

journal = "Engineering Applications of Artificial Intelligence",

issn = "0952-1976",

publisher = "Elsevier",

}

TY - JOUR

T1 - Reinforcement learning based compensation methods for robot manipulators

AU - Pane, Yudha P.

AU - Nageshrao, Subramanya P.

AU - Kober, Jens

AU - Babuška, Robert

N1 - Green Open Access added to TU Delft Institutional Repository 'You share, we take care!' - Taverne project https://www.openaccess.nl/en/you-share-we-take-care Otherwise as indicated in the copyright section: the publisher is the copyright holder of this work and the author uses the Dutch legislation to make this work public.

PY - 2019

Y1 - 2019

N2 - Smart robotics will be a core feature while migrating from Industry 3.0 (i.e., mass manufacturing) to Industry 4.0 (i.e., customized or social manufacturing). A key characteristic of a smart system is its ability to learn. For smart manufacturing, this means incorporating learning capabilities into the current fixed, repetitive, task-oriented industrial manipulators, thus rendering them ‘smart’. In this paper we introduce two reinforcement learning (RL) based compensation methods. The learned correction signal, which compensates for unmodeled aberrations, is added to the existing nominal input with an objective to enhance the control performance. The proposed learning algorithms are evaluated on a 6-DoF industrial robotic manipulator arm to follow different kinds of reference paths, such as square or a circular path, or to track a trajectory on a three dimensional surface. In an extensive experimental study we compare the performance of our learning-based methods with well-known tracking controllers, namely, proportional-derivative (PD), model predictive control (MPC), and iterative learning control (ILC). The experimental results show a considerable performance improvement thanks to our RL-based methods when compared to PD, MPC, and ILC.

AB - Smart robotics will be a core feature while migrating from Industry 3.0 (i.e., mass manufacturing) to Industry 4.0 (i.e., customized or social manufacturing). A key characteristic of a smart system is its ability to learn. For smart manufacturing, this means incorporating learning capabilities into the current fixed, repetitive, task-oriented industrial manipulators, thus rendering them ‘smart’. In this paper we introduce two reinforcement learning (RL) based compensation methods. The learned correction signal, which compensates for unmodeled aberrations, is added to the existing nominal input with an objective to enhance the control performance. The proposed learning algorithms are evaluated on a 6-DoF industrial robotic manipulator arm to follow different kinds of reference paths, such as square or a circular path, or to track a trajectory on a three dimensional surface. In an extensive experimental study we compare the performance of our learning-based methods with well-known tracking controllers, namely, proportional-derivative (PD), model predictive control (MPC), and iterative learning control (ILC). The experimental results show a considerable performance improvement thanks to our RL-based methods when compared to PD, MPC, and ILC.

KW - Actor-critic scheme

KW - Reinforcement learning

KW - Robotics

KW - Tracking control

UR - http://www.scopus.com/inward/record.url?scp=85058243585&partnerID=8YFLogxK

U2 - 10.1016/j.engappai.2018.11.006

DO - 10.1016/j.engappai.2018.11.006

M3 - Article

AN - SCOPUS:85058243585

SN - 0952-1976

VL - 78

SP - 236

EP - 247

JO - Engineering Applications of Artificial Intelligence

JF - Engineering Applications of Artificial Intelligence

ER -

Reinforcement learning based compensation methods for robot manipulators

Abstract

Bibliographical note

Keywords

Access to Document

Other files and links

Fingerprint

Cite this