Provable Scalability for high-dimensional Bayesian Learning
This project develops a mathematical theory for scalable Bayesian learning methods, integrating computational and statistical insights to enhance algorithm efficiency and applicability in high-dimensional models.
Projectdetails
Introduction
As the scale and complexity of available data increase, developing a rigorous understanding of the computational properties of statistical procedures has become a key scientific priority of our century. In line with such priority, this project develops a mathematical theory of computational scalability for Bayesian learning methods, with a focus on extremely popular high-dimensional and hierarchical models.
Methodological Integration
Unlike most recent literature, we will integrate computational and statistical aspects in the analysis of Bayesian learning algorithms. This approach will provide novel insight into the interaction between commonly used model structures and fitting algorithms.
Key Breakthroughs
Key methodological breakthroughs will include:
- A novel connection between computational algorithms for hierarchical models and random walks on the associated graphical models.
- The use of statistical asymptotics to derive computational scalability statements.
- A novel understanding of the computational implications of model misspecification and data heterogeneity.
Results and Applications
We will derive a broad collection of results for popular Bayesian computation algorithms, especially Markov chain Monte Carlo ones, in a variety of modeling frameworks, such as:
- Random-effect models
- Shrinkage models
- Hierarchical models
- Nonparametric models
These algorithms are routinely used for various statistical tasks, such as multilevel regression, factor analysis, and variable selection in various disciplines ranging from political science to genomics.
Implications for Computational Schemes
Our theoretical results will have direct implications on the design of novel and more scalable computational schemes, as well as on the optimization of existing ones. Focus will be given to developing algorithms with provably linear overall cost both in the number of datapoints and unknown parameters.
Conclusion
The above contributions will dramatically reduce the gap between theory and practice in Bayesian computation and allow us to fully benefit from the huge potential of the Bayesian paradigm.
Financiële details & Tijdlijn
Financiële details
Subsidiebedrag | € 1.488.673 |
Totale projectbegroting | € 1.488.673 |
Tijdlijn
Startdatum | 1-5-2023 |
Einddatum | 30-4-2028 |
Subsidiejaar | 2023 |
Partners & Locaties
Projectpartners
- UNIVERSITA COMMERCIALE LUIGI BOCCONIpenvoerder
Land(en)
Vergelijkbare projecten binnen European Research Council
Project | Regeling | Bedrag | Jaar | Actie |
---|---|---|---|---|
The missing mathematical story of Bayesian uncertainty quantification for big dataThis project aims to enhance scalable Bayesian methods through theoretical insights, improving their accuracy and acceptance in real-world applications like medicine and cosmology. | ERC Starting... | € 1.492.750 | 2022 | Details |
Scalable Learning for Reproducibility in High-Dimensional Biomedical Signal Processing: A Robust Data Science FrameworkScReeningData aims to develop a scalable learning framework to enhance statistical robustness and reproducibility in high-dimensional data analysis, reducing false positives across scientific domains. | ERC Starting... | € 1.500.000 | 2022 | Details |
High-dimensional mathematical methods for LargE Agent and Particle systemsThis project aims to develop a new mathematical framework for efficient simulation of high-dimensional particle and agent systems, enhancing predictive insights across various scientific fields. | ERC Starting... | € 1.379.858 | 2023 | Details |
Advanced Numerics for Uncertainty and Bayesian Inference in ScienceANUBIS aims to enhance quantitative scientific analysis by unifying probabilistic numerical methods with machine learning and simulation, improving efficiency and uncertainty management in data-driven insights. | ERC Consolid... | € 1.997.250 | 2024 | Details |
High-dimensional nonparametric Bayesian causal inferenceDevelop Bayesian nonparametric methods for high-dimensional causal inference to enhance variable selection and uncertainty quantification, enabling reliable causal conclusions across various fields. | ERC Starting... | € 1.499.770 | 2023 | Details |
The missing mathematical story of Bayesian uncertainty quantification for big data
This project aims to enhance scalable Bayesian methods through theoretical insights, improving their accuracy and acceptance in real-world applications like medicine and cosmology.
Scalable Learning for Reproducibility in High-Dimensional Biomedical Signal Processing: A Robust Data Science Framework
ScReeningData aims to develop a scalable learning framework to enhance statistical robustness and reproducibility in high-dimensional data analysis, reducing false positives across scientific domains.
High-dimensional mathematical methods for LargE Agent and Particle systems
This project aims to develop a new mathematical framework for efficient simulation of high-dimensional particle and agent systems, enhancing predictive insights across various scientific fields.
Advanced Numerics for Uncertainty and Bayesian Inference in Science
ANUBIS aims to enhance quantitative scientific analysis by unifying probabilistic numerical methods with machine learning and simulation, improving efficiency and uncertainty management in data-driven insights.
High-dimensional nonparametric Bayesian causal inference
Develop Bayesian nonparametric methods for high-dimensional causal inference to enhance variable selection and uncertainty quantification, enabling reliable causal conclusions across various fields.