Incremental approximate dynamic programming for nonlinear adaptive tracking control with partial observability

Ye Zhou; Erik Jan Van Kampen; Qi Ping Chu

doi:10.2514/1.G003472

Incremental approximate dynamic programming for nonlinear adaptive tracking control with partial observability

Ye Zhou, Erik Jan Van Kampen, Qi Ping Chu

Control & Simulation

Research output: Contribution to journal › Article › Scientific › peer-review

25 Citations (Scopus)

Abstract

Approximate dynamic programming is a class of reinforcement learning, which solves adaptive, optimal control problems and tackles the curse of dimensionality with function approximators. Within this category, linear approximate dynamic programming provides a model-free control method by systematically using a quadratic cost-to-go function. Although efficient, linear approximate dynamic programming methods are difficult to apply to nonlinear systems or time-varying systems. To overcome the above limitations, this paper proposes an adaptive nonlinear tracking control method based on incremental approximate dynamic programming, which combines the advantages of linear approximate dynamic programming and incremental nonlinear control techniques. This is a model-free method for unknown, nonlinear systems and time-varying references. The trait of separating the local model information from the cost function approximation makes this method an option for partially observable control problems. This paper, therefore, proposes two reference tracking controllers for different observability conditions: the direct measurement of the full state, and the partially observable tracking error. In each condition, two algorithms are developed for off-line learning and online learning, respectively. These algorithms are applied to attitude control of a spacecraft disturbed by internal liquid sloshing. The results demonstrate that the proposed algorithms accurately deal with the unknown, time-varying internal dynamics while retaining efficient control, even with only partial observability.

Original language	English
Pages (from-to)	2554-2567
Number of pages	14
Journal	Journal of Guidance, Control, and Dynamics
Volume	41
Issue number	12
DOIs	https://doi.org/10.2514/1.G003472
Publication status	Published - 2018

Access to Document

10.2514/1.G003472

Cite this

@article{fe47d013aecf4cfd97afdc4705e2f373,

title = "Incremental approximate dynamic programming for nonlinear adaptive tracking control with partial observability",

abstract = "Approximate dynamic programming is a class of reinforcement learning, which solves adaptive, optimal control problems and tackles the curse of dimensionality with function approximators. Within this category, linear approximate dynamic programming provides a model-free control method by systematically using a quadratic cost-to-go function. Although efficient, linear approximate dynamic programming methods are difficult to apply to nonlinear systems or time-varying systems. To overcome the above limitations, this paper proposes an adaptive nonlinear tracking control method based on incremental approximate dynamic programming, which combines the advantages of linear approximate dynamic programming and incremental nonlinear control techniques. This is a model-free method for unknown, nonlinear systems and time-varying references. The trait of separating the local model information from the cost function approximation makes this method an option for partially observable control problems. This paper, therefore, proposes two reference tracking controllers for different observability conditions: the direct measurement of the full state, and the partially observable tracking error. In each condition, two algorithms are developed for off-line learning and online learning, respectively. These algorithms are applied to attitude control of a spacecraft disturbed by internal liquid sloshing. The results demonstrate that the proposed algorithms accurately deal with the unknown, time-varying internal dynamics while retaining efficient control, even with only partial observability.",

author = "Ye Zhou and {Van Kampen}, {Erik Jan} and Chu, {Qi Ping}",

year = "2018",

doi = "10.2514/1.G003472",

language = "English",

volume = "41",

pages = "2554--2567",

journal = "Journal of Guidance, Control, and Dynamics",

issn = "0731-5090",

publisher = "American Institute of Aeronautics and Astronautics Inc. (AIAA)",

number = "12",

}

TY - JOUR

T1 - Incremental approximate dynamic programming for nonlinear adaptive tracking control with partial observability

AU - Zhou, Ye

AU - Van Kampen, Erik Jan

AU - Chu, Qi Ping

PY - 2018

Y1 - 2018

N2 - Approximate dynamic programming is a class of reinforcement learning, which solves adaptive, optimal control problems and tackles the curse of dimensionality with function approximators. Within this category, linear approximate dynamic programming provides a model-free control method by systematically using a quadratic cost-to-go function. Although efficient, linear approximate dynamic programming methods are difficult to apply to nonlinear systems or time-varying systems. To overcome the above limitations, this paper proposes an adaptive nonlinear tracking control method based on incremental approximate dynamic programming, which combines the advantages of linear approximate dynamic programming and incremental nonlinear control techniques. This is a model-free method for unknown, nonlinear systems and time-varying references. The trait of separating the local model information from the cost function approximation makes this method an option for partially observable control problems. This paper, therefore, proposes two reference tracking controllers for different observability conditions: the direct measurement of the full state, and the partially observable tracking error. In each condition, two algorithms are developed for off-line learning and online learning, respectively. These algorithms are applied to attitude control of a spacecraft disturbed by internal liquid sloshing. The results demonstrate that the proposed algorithms accurately deal with the unknown, time-varying internal dynamics while retaining efficient control, even with only partial observability.

AB - Approximate dynamic programming is a class of reinforcement learning, which solves adaptive, optimal control problems and tackles the curse of dimensionality with function approximators. Within this category, linear approximate dynamic programming provides a model-free control method by systematically using a quadratic cost-to-go function. Although efficient, linear approximate dynamic programming methods are difficult to apply to nonlinear systems or time-varying systems. To overcome the above limitations, this paper proposes an adaptive nonlinear tracking control method based on incremental approximate dynamic programming, which combines the advantages of linear approximate dynamic programming and incremental nonlinear control techniques. This is a model-free method for unknown, nonlinear systems and time-varying references. The trait of separating the local model information from the cost function approximation makes this method an option for partially observable control problems. This paper, therefore, proposes two reference tracking controllers for different observability conditions: the direct measurement of the full state, and the partially observable tracking error. In each condition, two algorithms are developed for off-line learning and online learning, respectively. These algorithms are applied to attitude control of a spacecraft disturbed by internal liquid sloshing. The results demonstrate that the proposed algorithms accurately deal with the unknown, time-varying internal dynamics while retaining efficient control, even with only partial observability.

UR - http://www.scopus.com/inward/record.url?scp=85059523543&partnerID=8YFLogxK

U2 - 10.2514/1.G003472

DO - 10.2514/1.G003472

M3 - Article

AN - SCOPUS:85059523543

SN - 0731-5090

VL - 41

SP - 2554

EP - 2567

JO - Journal of Guidance, Control, and Dynamics

JF - Journal of Guidance, Control, and Dynamics

IS - 12

ER -

Incremental approximate dynamic programming for nonlinear adaptive tracking control with partial observability

Abstract

Access to Document

Other files and links

Fingerprint

Cite this