TY - GEN
T1 - Safe Curriculum Learning for Optimal Flight Control of Unmanned Aerial Vehicles with Uncertain System Dynamics
AU - Pollack, Tijmen
AU - van Kampen, Erik-Jan
PY - 2020
Y1 - 2020
N2 - Reinforcement learning (RL) enables the autonomous formation of optimal, adaptive control laws for systems with complex, uncertain dynamics. This process generally requires a learning agent to directly interact with the system in an online fashion. However, if the system is safety-critical, such as an Unmanned Aerial Vehicle (UAV), learning may result in unsafe behavior. Moreover, irrespective of the safety aspect, learning optimal control policies from scratch can be inefficient and therefore time-consuming. In this research, the safe curriculum learning paradigm is proposed to address the problems of learning safety and efficiency simultaneously. Curriculum learning makes the process of learning more tractable, thereby allowing the intelligent agent to learn desired behavior more effectively. This is achieved by presenting the agent with a series of intermediate learning tasks, where the knowledge gained from earlier tasks is used to expedite learning in succeeding tasks of higher complexity. This framework is united with views from safe learning to ensure that safety constraints are adhered to during the learning curriculum. This principle is first investigated in the context of optimal regulation of a generic mass-spring-damper system using neural networks and is subsequently applied in the context of optimal attitude control of a quadrotor UAV with uncertain dynamics.
UR - http://www.scopus.com/inward/record.url?scp=85092404949&partnerID=8YFLogxK
U2 - 10.2514/6.2020-2100
DO - 10.2514/6.2020-2100
M3 - Conference contribution
T3 - AIAA Scitech 2020 Forum
BT - AIAA Scitech 2020 Forum
PB - American Institute of Aeronautics and Astronautics Inc. (AIAA)
T2 - AIAA Scitech 2020 Forum
Y2 - 6 January 2020 through 10 January 2020
ER -