Dynamics-Aware Theory of Deep Learning
This project aims to create a robust theoretical framework for deep learning, enhancing understanding and practical tools to improve model performance and reduce complexity in various applications.
Projectdetails
Introduction
The recent advances in deep learning (DL) have transformed many scientific domains and have had major impacts on industry and society. Despite their success, DL methods do not obey most of the wisdoms of statistical learning theory, and the vast majority of the current DL techniques mainly stand as poorly understood black-box algorithms.
Theoretical Gaps
Even though DL theory has been a very active research field in the past few years, there is a significant gap between the current theory and practice:
- The current theory often becomes vacuous for models with a large number of parameters (which is typical in DL).
- It cannot capture the interaction between data, architecture, training algorithm, and its hyper-parameters, which can have drastic effects on the overall performance.
Due to this lack of theoretical understanding, designing new DL systems has been dominantly performed by ad-hoc, 'trial-and-error' approaches.
Objectives
The main objective of this proposal is to develop a mathematically sound and practically relevant theory for DL, which will ultimately serve as the basis of a software library that provides practical tools for DL practitioners. In particular:
- We will develop error bounds that closely reflect the true empirical performance by explicitly incorporating the dynamics aspect of training.
- We will develop new model selection, training, and compression algorithms with reduced time/memory/storage complexity by exploiting the developed theory.
Theoretical Framework
To achieve the expected breakthroughs, we will develop a novel theoretical framework, which will enable tight analysis of learning algorithms in the lens of dynamical systems theory. The outcomes will help relieve DL from being a black-box system and avoid the heuristic design process.
Software Development
We will produce comprehensive open-source software tools adapted to all popular DL libraries and test the developed algorithms on a wide range of real applications arising in computer vision, audio/music/natural language processing.
Financiële details & Tijdlijn
Financiële details
Subsidiebedrag | € 1.498.410 |
Totale projectbegroting | € 1.498.410 |
Tijdlijn
Startdatum | 1-10-2022 |
Einddatum | 30-9-2027 |
Subsidiejaar | 2022 |
Partners & Locaties
Projectpartners
- INSTITUT NATIONAL DE RECHERCHE EN INFORMATIQUE ET AUTOMATIQUEpenvoerder
Land(en)
Vergelijkbare projecten binnen European Research Council
Project | Regeling | Bedrag | Jaar | Actie |
---|---|---|---|---|
Understanding Deep LearningThe project aims to establish a solid theoretical foundation for deep learning by investigating optimization, statistical complexity, and representation, enhancing understanding and algorithm development. | ERC Starting... | € 1.499.750 | 2022 | Details |
Algorithmic Bias Control in Deep learningThe project aims to develop a theory of algorithmic bias in deep learning to improve training efficiency and generalization performance for real-world applications. | ERC Starting... | € 1.500.000 | 2022 | Details |
The Complexity of Dynamic Matrix ProblemsThis project aims to enhance dynamic data structures for efficient matrix operations, optimizing algorithms in both convex and non-convex settings, particularly for deep neural networks and AI applications. | ERC Starting... | € 1.439.413 | 2022 | Details |
Uniting Statistical Testing and Machine Learning for Safe PredictionsThe project aims to enhance the interpretability and reliability of machine learning predictions by integrating statistical methods to establish robust error bounds and ensure safe deployment in real-world applications. | ERC Starting... | € 1.500.000 | 2024 | Details |
Verifiably Safe and Correct Deep Neural NetworksThis project aims to develop scalable verification techniques for large deep neural networks to ensure their safety and correctness in critical systems, enhancing reliability and societal benefits. | ERC Starting... | € 1.500.000 | 2023 | Details |
Understanding Deep Learning
The project aims to establish a solid theoretical foundation for deep learning by investigating optimization, statistical complexity, and representation, enhancing understanding and algorithm development.
Algorithmic Bias Control in Deep learning
The project aims to develop a theory of algorithmic bias in deep learning to improve training efficiency and generalization performance for real-world applications.
The Complexity of Dynamic Matrix Problems
This project aims to enhance dynamic data structures for efficient matrix operations, optimizing algorithms in both convex and non-convex settings, particularly for deep neural networks and AI applications.
Uniting Statistical Testing and Machine Learning for Safe Predictions
The project aims to enhance the interpretability and reliability of machine learning predictions by integrating statistical methods to establish robust error bounds and ensure safe deployment in real-world applications.
Verifiably Safe and Correct Deep Neural Networks
This project aims to develop scalable verification techniques for large deep neural networks to ensure their safety and correctness in critical systems, enhancing reliability and societal benefits.