Koopman-Operator-based Reinforcement Learning Control of Partial Differential Equations
This project aims to enhance reinforcement learning for large-scale engineering systems by developing performance-guaranteed controllers, addressing safety in energy-efficient technologies.
Projectdetails
Introduction
An unprecedented energy crisis is looming over us. In order to transition to a greener and more energy-efficient society, existing technologies need to be improved and novel techniques such as nuclear fusion developed. This requires the stabilization of aerodynamics, heat transfer, or combustion and fusion processes and thus, the development of efficient control strategies for large-scale dynamical systems.
Challenges with Reinforcement Learning
In recent years, reinforcement learning (RL) has emerged as a highly promising data-driven technique. Unfortunately, we cannot trust RL to handle our most important and complex systems, since the resulting controllers do not possess performance guarantees.
Limitations of Current Approaches
Certifiable RL approaches such as linear or kernel methods tend to scale poorly, such that their applicability is limited to toy examples. In contrast to other application areas, this is a complete show-stopper for safety-critical engineering. Moreover, the training is extremely data-hungry and costly, due to which RL itself contributes to the energy crisis.
Project Vision
The vision of this project is to develop new foundational methods to equip RL controllers for large-scale engineering systems with performance guarantees by exploiting system knowledge and systematically reducing the complexity.
Major Breakthroughs
To achieve this, I will target three major breakthroughs:
- Global linearization of the dynamics via the Koopman operator framework.
- The extension of certified Q-learning to continuous action spaces via control quantization.
- The detection and exploitation of symmetries in the system dynamics.
Required Advancements
The project requires significant joint advancements in several challenging areas such as control, approximation theory, and machine learning.
Potential Impact
In the case of success, the resulting controllers will provide a massive advancement of RL towards safety-critical engineering applications and significantly contribute to the challenge of meeting the future energy demands of our society.
Financiële details & Tijdlijn
Financiële details
Subsidiebedrag | € 1.499.000 |
Totale projectbegroting | € 1.499.000 |
Tijdlijn
Startdatum | 1-1-2025 |
Einddatum | 31-12-2029 |
Subsidiejaar | 2025 |
Partners & Locaties
Projectpartners
- TECHNISCHE UNIVERSITAT DORTMUNDpenvoerder
Land(en)
Vergelijkbare projecten binnen European Research Council
Project | Regeling | Bedrag | Jaar | Actie |
---|---|---|---|---|
Deep Bayesian Reinforcement Learning -- Unifying Perception, Planning, and ControlDevelop an algorithmic framework using deep learning and Bayesian reinforcement learning to enhance robotic manipulation in unstructured environments by effectively managing uncertainty. | ERC Starting... | € 1.500.000 | 2022 | Details |
Projection-based Control: A Novel Paradigm for High-performance SystemsPROACTHIS aims to develop a novel projection-based control paradigm to enhance performance in future engineering systems through innovative design and optimization techniques. | ERC Advanced... | € 2.498.516 | 2022 | Details |
REinforcement TWInning SysTems: from collaborative digital twins to model-based reinforcement learningThe Re-Twist project aims to develop a novel Reinforcement Twinning framework that integrates machine learning with engineering to optimize systems like wind turbines and drones for societal benefits. | ERC Starting... | € 1.500.000 | 2025 | Details |
Model-based Reinforcement Learning for Versatile Robots in the Real WorldREAL-RL aims to create versatile autonomous robots that learn from experience using a model-based approach for efficient task adaptation and behavior planning. | ERC Consolid... | € 1.998.500 | 2023 | Details |
Continual and Sequential Learning for Artificial IntelligenceDevelop a Continual and Sequential Learning AI that integrates RL with advanced data gathering to autonomously adapt and explore in dynamic environments for scientific breakthroughs. | ERC Starting... | € 1.259.375 | 2025 | Details |
Deep Bayesian Reinforcement Learning -- Unifying Perception, Planning, and Control
Develop an algorithmic framework using deep learning and Bayesian reinforcement learning to enhance robotic manipulation in unstructured environments by effectively managing uncertainty.
Projection-based Control: A Novel Paradigm for High-performance Systems
PROACTHIS aims to develop a novel projection-based control paradigm to enhance performance in future engineering systems through innovative design and optimization techniques.
REinforcement TWInning SysTems: from collaborative digital twins to model-based reinforcement learning
The Re-Twist project aims to develop a novel Reinforcement Twinning framework that integrates machine learning with engineering to optimize systems like wind turbines and drones for societal benefits.
Model-based Reinforcement Learning for Versatile Robots in the Real World
REAL-RL aims to create versatile autonomous robots that learn from experience using a model-based approach for efficient task adaptation and behavior planning.
Continual and Sequential Learning for Artificial Intelligence
Develop a Continual and Sequential Learning AI that integrates RL with advanced data gathering to autonomously adapt and explore in dynamic environments for scientific breakthroughs.
Vergelijkbare projecten uit andere regelingen
Project | Regeling | Bedrag | Jaar | Actie |
---|---|---|---|---|
Reinforcement Learning & Solver Racing in simulatieversnellingenHet project onderzoekt het gebruik van reinforcement learning en solver racing om de efficiëntie van computersimulaties te verbeteren. | Mkb-innovati... | € 20.000 | 2024 | Details |
Reinforcement Learning & Solver Racing in simulatieversnellingen
Het project onderzoekt het gebruik van reinforcement learning en solver racing om de efficiëntie van computersimulaties te verbeteren.