Comparison of Model Predictive Control and Proximal Policy Optimization for a 1-DOF Helicopter System

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

This study conducts a comparative analysis of Model Predictive Control (MPC) and Proximal Policy Optimization (PPO), a Deep Reinforcement Learning (DRL) algorithm, applied to a 1-Degree of Freedom (DOF) Quanser Aero 2 system. Classical control techniques such as MPC and Linear Quadratic Regulator (LQR) are widely used due to their theoretical foundation and practical effectiveness. However, with advancements in computational techniques and machine learning, DRL approaches like PPO have gained traction in solving optimal control problems through environment interaction. This paper systematically evaluates the dynamic response characteristics of PPO and MPC, comparing their performance, computational resource consumption, and implementation complexity. Experimental results show that while LQR achieves the best steady-state accuracy, PPO excels in rise-time and adaptability, making it a promising approach for applications requiring rapid response and adaptability. Additionally, we have established a baseline for future RL-related research on this specific testbed. We also discuss the strengths and limitations of each control strategy, providing recommendations for selecting appropriate controllers for real-world scenarios.
Original languageEnglish
Title of host publication2024 IEEE 22nd INDIN
PublisherIEEE Industrial Electronics Society
Publication statusPublished - Aug 2024

Keywords

  • Linear-quadratic regulator
  • Model Predictive Control
  • Proximal Policy Optimization

Classification according to Österreichische Systematik der Wissenschaftszweige (ÖFOS 2012)

  • 202034 Control engineering

Applied Research Level (ARL)

  • ARL Level 4 - Experimental setup in laboratory-like conditions

Research focus/foci

  • Industrial Informatics

Fingerprint

Dive into the research topics of 'Comparison of Model Predictive Control and Proximal Policy Optimization for a 1-DOF Helicopter System'. Together they form a unique fingerprint.

Cite this