Skip to main navigation Skip to search Skip to main content

Deep Q-Learning versus Proximal Policy Optimization: Performance Comparison in a Material Sorting Task

  • Department of Artificial Intelligence and Human Interfaces, University of Salzburg

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

This paper presents a comparison between two well-known deep Reinforcement Learning (RL) algorithms: Deep Q-Learning (DQN) and Proximal Policy optimization (PPO) in a simulated production system. We utilize a Petri Net (PN)-based simulation environment, which was previously proposed in related work. The performance of the two algorithms is compared based on several evaluation metrics, including average percentage of correctly assembled and sorted products, average episode length, and percentage of successful episodes. The results show that PPO outperforms DQN in terms of all evaluation metrics. The study highlights the advantages of policy-based algorithms in problems with high-dimensional state and action spaces. The study contributes to the field of deep RL in context of production systems by providing insights into the effectiveness of different algorithms and their suitability for different tasks. © 2023 IEEE.
Original languageEnglish
Title of host publication2023 IEEE 32nd International Symposium on Industrial Electronics (ISIE)
PublisherInstitute of Electrical and Electronics Engineers Inc.
ISBN (Electronic)979-8-3503-9971-4
ISBN (Print)979-8-3503-9972-1
DOIs
Publication statusPublished - 2023
Event32nd IEEE International Symposium on Industrial Electronics, ISIE 2023 - Aalto University, Helsinki-Espoo, Finland
Duration: 19 Jun 202321 Jun 2023
https://2023.ieee-isie.org/

Conference

Conference32nd IEEE International Symposium on Industrial Electronics, ISIE 2023
Abbreviated titleISIE 2023
Country/TerritoryFinland
CityHelsinki-Espoo
Period19/06/2321/06/23
Internet address

Keywords

  • Deep Q-Learning
  • Material Flow System
  • Petri Nets
  • Proximal Policy optimization
  • Reinforcement Learning
  • Deep learning
  • Learning systems
  • Reinforcement learning
  • Deep Q-learning
  • Evaluation metrics
  • Material flow system
  • Performance comparison
  • Policy optimization
  • Production system
  • Proximal policy optimization
  • Q-learning
  • Reinforcement learning algorithms
  • Reinforcement learnings
  • Petri nets

Fingerprint

Dive into the research topics of 'Deep Q-Learning versus Proximal Policy Optimization: Performance Comparison in a Material Sorting Task'. Together they form a unique fingerprint.

Cite this