Deep Reinforcement Learning Via Nonlinear Model Predictive Control for Thermal Process with Variable Longtime Delay
| dc.contributor.author | Kevin Marlon Soza Mamani | |
| dc.contributor.author | Óscar Camacho | |
| dc.contributor.author | Alvaro Prado | |
| dc.coverage.spatial | Bolivia | |
| dc.date.accessioned | 2026-03-22T14:26:18Z | |
| dc.date.available | 2026-03-22T14:26:18Z | |
| dc.date.issued | 2024 | |
| dc.description | Citaciones: 3 | |
| dc.description.abstract | The main concern in thermal process control revolves around uncertainties and disturbances, arising from external processes, unmodeled dynamics, or simplified characteristics, to name a few. For instance, a primary source of uncertainties involves disturbances and long-time delays, which typically lead to loose robust control performance. This paper develops a robust control technique based on Reinforcement-Learning (RL) strategies via Deep Deterministic Policy Gradient (DDPG), integrating Nonlinear Model Predictive Control (NMPC). The NMPC works as a policy generator and the DDPG strategy is devoted to evaluating the learning process. While NMPC was able to approach tracking performance, the combined scheme with DDPG allowed further robust performance in terms of adaptation to changing thermal process conditions such as external disturbances and variations to internal model parameters. Indeed, combining strategies (NMPC-based DDPG) rendered unnecessary offline design of a terminal cost and constraints typically required in traditional robustified NMPC strategies. The RL agent was trained, tested, and validated in a simulation environment using a thermal process with longtime delay. Results demonstrated that the proposed NMPC-based DDPG technique achieved nearly similar tracking performance compared to traditional NMPC strategies, even maintaining control objectives. However, the proposed control strategy exhibited enhanced adaptivity regarding NMPC under the presence of disturbances and model parameter variations. The latter findings are expected to have an impact on the energy resources of real thermal processes in the industry. | |
| dc.identifier.doi | 10.1109/andescon61840.2024.10755797 | |
| dc.identifier.uri | https://doi.org/10.1109/andescon61840.2024.10755797 | |
| dc.identifier.uri | https://andeanlibrary.org/handle/123456789/46510 | |
| dc.language.iso | en | |
| dc.source | Universidad La Salle | |
| dc.subject | Reinforcement learning | |
| dc.subject | Model predictive control | |
| dc.subject | Computer science | |
| dc.subject | Nonlinear system | |
| dc.subject | Process (computing) | |
| dc.subject | Nonlinear model | |
| dc.subject | Variable (mathematics) | |
| dc.subject | Control theory (sociology) | |
| dc.subject | Control (management) | |
| dc.subject | Process control | |
| dc.title | Deep Reinforcement Learning Via Nonlinear Model Predictive Control for Thermal Process with Variable Longtime Delay | |
| dc.type | article |