Deep Reinforcement Learning Via Nonlinear Model Predictive Control for Thermal Process with Variable Longtime Delay

Kevin Marlon Soza Mamani; Óscar Camacho; Alvaro Prado

doi:10.1109/andescon61840.2024.10755797

Deep Reinforcement Learning Via Nonlinear Model Predictive Control for Thermal Process with Variable Longtime Delay

dc.contributor.author	Kevin Marlon Soza Mamani
dc.contributor.author	Óscar Camacho
dc.contributor.author	Alvaro Prado
dc.coverage.spatial	Bolivia
dc.date.accessioned	2026-03-22T14:26:18Z
dc.date.available	2026-03-22T14:26:18Z
dc.date.issued	2024
dc.description	Citaciones: 3
dc.description.abstract	The main concern in thermal process control revolves around uncertainties and disturbances, arising from external processes, unmodeled dynamics, or simplified characteristics, to name a few. For instance, a primary source of uncertainties involves disturbances and long-time delays, which typically lead to loose robust control performance. This paper develops a robust control technique based on Reinforcement-Learning (RL) strategies via Deep Deterministic Policy Gradient (DDPG), integrating Nonlinear Model Predictive Control (NMPC). The NMPC works as a policy generator and the DDPG strategy is devoted to evaluating the learning process. While NMPC was able to approach tracking performance, the combined scheme with DDPG allowed further robust performance in terms of adaptation to changing thermal process conditions such as external disturbances and variations to internal model parameters. Indeed, combining strategies (NMPC-based DDPG) rendered unnecessary offline design of a terminal cost and constraints typically required in traditional robustified NMPC strategies. The RL agent was trained, tested, and validated in a simulation environment using a thermal process with longtime delay. Results demonstrated that the proposed NMPC-based DDPG technique achieved nearly similar tracking performance compared to traditional NMPC strategies, even maintaining control objectives. However, the proposed control strategy exhibited enhanced adaptivity regarding NMPC under the presence of disturbances and model parameter variations. The latter findings are expected to have an impact on the energy resources of real thermal processes in the industry.
dc.identifier.doi	10.1109/andescon61840.2024.10755797
dc.identifier.uri	https://doi.org/10.1109/andescon61840.2024.10755797
dc.identifier.uri	https://andeanlibrary.org/handle/123456789/46510
dc.language.iso	en
dc.source	Universidad La Salle
dc.subject	Reinforcement learning
dc.subject	Model predictive control
dc.subject	Computer science
dc.subject	Nonlinear system
dc.subject	Process (computing)
dc.subject	Nonlinear model
dc.subject	Variable (mathematics)
dc.subject	Control theory (sociology)
dc.subject	Control (management)
dc.subject	Process control
dc.title	Deep Reinforcement Learning Via Nonlinear Model Predictive Control for Thermal Process with Variable Longtime Delay
dc.type	article

Collections

Artículo Científico Publicado

Deep Reinforcement Learning Via Nonlinear Model Predictive Control for Thermal Process with Variable Longtime Delay

Files

Collections