Online policy iterations for optimal control of input-saturated systems

Simone Baldi; Giorgio Valmorbida; Antonis Papachristodoulou; Elias B. Kosmatopoulos

doi:10.1109/ACC.2016.7526568

Online policy iterations for optimal control of input-saturated systems

Simone Baldi, Giorgio Valmorbida, Antonis Papachristodoulou, Elias B. Kosmatopoulos

Team Bart De Schutter

Research output: Chapter in Book/Conference proceedings/Edited volume › Conference contribution › Scientific › peer-review

2 Citations (Scopus)

37 Downloads (Pure)

Abstract

This work proposes an online policy iteration procedure for the synthesis of sub-optimal control laws for uncertain Linear Time Invariant (LTI) Asymptotically Null-Controllable with Bounded Inputs (ANCBI) systems. The proposed policy iteration method relies on: a policy evaluation step with a piecewise quadratic Lyapunov function in both the state and the deadzone functions of the input signals; a policy improvement step which guarantees at the same time close to optimality (exploitation) and persistence of excitation (exploration). The proposed approach guarantees convergence of the trajectory to a neighborhood around the origin. Besides, the trajectories can be made arbitrarily close to the optimal one provided that the rate at which the the value function and the control policy are updated is fast enough. The solution to the inequalities required to hold at each policy evaluation step can be efficiently implemented with semidefinite programming (SDP) solvers. A numerical example illustrates the results.

Original language	English
Title of host publication	Proceedings of the 2016 American Control Conference (ACC 2016)
Editors	George Chiu, Katie Johnson, Danny Abramovitch
Place of Publication	Piscataway, NJ, USA
Publisher	IEEE
Pages	5734-5739
ISBN (Electronic)	978-1-4673-8682-1
DOIs	https://doi.org/10.1109/ACC.2016.7526568
Publication status	Published - 2016
Event	American Control Conference (ACC), 2016 - Boston, MA, United States Duration: 6 Jul 2016 → 8 Jul 2016

Conference

Conference	American Control Conference (ACC), 2016
Abbreviated title	ACC 2016
Country/Territory	United States
City	Boston, MA
Period	6/07/16 → 8/07/16

Bibliographical note

Accepted Author Manuscript

Keywords

Optimal control
Linear systems
Convergence
Asymptotic stability
Lyapunov methods
Estimation
Trajectory

Access to Document

10.1109/ACC.2016.7526568

Sat_resub_ACC3_finalAccepted author manuscript, 130 KB

Cite this

@inproceedings{54f0e7693fbc4d5cbe5f337321526f70,

title = "Online policy iterations for optimal control of input-saturated systems",

abstract = "This work proposes an online policy iteration procedure for the synthesis of sub-optimal control laws for uncertain Linear Time Invariant (LTI) Asymptotically Null-Controllable with Bounded Inputs (ANCBI) systems. The proposed policy iteration method relies on: a policy evaluation step with a piecewise quadratic Lyapunov function in both the state and the deadzone functions of the input signals; a policy improvement step which guarantees at the same time close to optimality (exploitation) and persistence of excitation (exploration). The proposed approach guarantees convergence of the trajectory to a neighborhood around the origin. Besides, the trajectories can be made arbitrarily close to the optimal one provided that the rate at which the the value function and the control policy are updated is fast enough. The solution to the inequalities required to hold at each policy evaluation step can be efficiently implemented with semidefinite programming (SDP) solvers. A numerical example illustrates the results.",

keywords = "Optimal control, Linear systems, Convergence, Asymptotic stability, Lyapunov methods, Estimation, Trajectory",

author = "Simone Baldi and Giorgio Valmorbida and Antonis Papachristodoulou and Kosmatopoulos, {Elias B.}",

note = "Accepted Author Manuscript; American Control Conference (ACC), 2016, ACC 2016 ; Conference date: 06-07-2016 Through 08-07-2016",

year = "2016",

doi = "10.1109/ACC.2016.7526568",

language = "English",

pages = "5734--5739",

editor = "George Chiu and Katie Johnson and Danny Abramovitch",

booktitle = "Proceedings of the 2016 American Control Conference (ACC 2016)",

publisher = "IEEE",

address = "United States",

}

Baldi, S, Valmorbida, G, Papachristodoulou, A & Kosmatopoulos, EB 2016, Online policy iterations for optimal control of input-saturated systems. in G Chiu, K Johnson & D Abramovitch (eds), Proceedings of the 2016 American Control Conference (ACC 2016). IEEE, Piscataway, NJ, USA, pp. 5734-5739, American Control Conference (ACC), 2016, Boston, MA, United States, 6/07/16. https://doi.org/10.1109/ACC.2016.7526568

Online policy iterations for optimal control of input-saturated systems. / Baldi, Simone; Valmorbida, Giorgio; Papachristodoulou, Antonis et al.
Proceedings of the 2016 American Control Conference (ACC 2016). ed. / George Chiu; Katie Johnson; Danny Abramovitch. Piscataway, NJ, USA: IEEE, 2016. p. 5734-5739.

Research output: Chapter in Book/Conference proceedings/Edited volume › Conference contribution › Scientific › peer-review

TY - GEN

T1 - Online policy iterations for optimal control of input-saturated systems

AU - Baldi, Simone

AU - Valmorbida, Giorgio

AU - Papachristodoulou, Antonis

AU - Kosmatopoulos, Elias B.

N1 - Accepted Author Manuscript

PY - 2016

Y1 - 2016

N2 - This work proposes an online policy iteration procedure for the synthesis of sub-optimal control laws for uncertain Linear Time Invariant (LTI) Asymptotically Null-Controllable with Bounded Inputs (ANCBI) systems. The proposed policy iteration method relies on: a policy evaluation step with a piecewise quadratic Lyapunov function in both the state and the deadzone functions of the input signals; a policy improvement step which guarantees at the same time close to optimality (exploitation) and persistence of excitation (exploration). The proposed approach guarantees convergence of the trajectory to a neighborhood around the origin. Besides, the trajectories can be made arbitrarily close to the optimal one provided that the rate at which the the value function and the control policy are updated is fast enough. The solution to the inequalities required to hold at each policy evaluation step can be efficiently implemented with semidefinite programming (SDP) solvers. A numerical example illustrates the results.

AB - This work proposes an online policy iteration procedure for the synthesis of sub-optimal control laws for uncertain Linear Time Invariant (LTI) Asymptotically Null-Controllable with Bounded Inputs (ANCBI) systems. The proposed policy iteration method relies on: a policy evaluation step with a piecewise quadratic Lyapunov function in both the state and the deadzone functions of the input signals; a policy improvement step which guarantees at the same time close to optimality (exploitation) and persistence of excitation (exploration). The proposed approach guarantees convergence of the trajectory to a neighborhood around the origin. Besides, the trajectories can be made arbitrarily close to the optimal one provided that the rate at which the the value function and the control policy are updated is fast enough. The solution to the inequalities required to hold at each policy evaluation step can be efficiently implemented with semidefinite programming (SDP) solvers. A numerical example illustrates the results.

KW - Optimal control

KW - Linear systems

KW - Convergence

KW - Asymptotic stability

KW - Lyapunov methods

KW - Estimation

KW - Trajectory

UR - http://resolver.tudelft.nl/uuid:54f0e769-3fbc-4d5c-be5f-337321526f70

UR - http://www.scopus.com/inward/record.url?scp=84992150913&partnerID=8YFLogxK

U2 - 10.1109/ACC.2016.7526568

DO - 10.1109/ACC.2016.7526568

M3 - Conference contribution

AN - SCOPUS:84992150913

SP - 5734

EP - 5739

BT - Proceedings of the 2016 American Control Conference (ACC 2016)

A2 - Chiu, George

A2 - Johnson, Katie

A2 - Abramovitch, Danny

PB - IEEE

CY - Piscataway, NJ, USA

T2 - American Control Conference (ACC), 2016

Y2 - 6 July 2016 through 8 July 2016

ER -

Online policy iterations for optimal control of input-saturated systems

Abstract

Conference

Bibliographical note

Keywords

Access to Document

Other files and links

Fingerprint

Cite this