Machine Learning and Mass Spectrometry for Structural Elucidation of Novel Toxic Chemicals

LearningStructurE aims to enhance the discovery of novel toxic chemical structures by integrating chromatography, mass spectrometry, and machine learning to explore unknown chemical spaces in environmental samples.

Subsidie
€ 1.867.187
2024

Projectdetails

Introduction

Nearly half a million known chemicals have been deemed relevant for exposure studies, and an even larger number of their transformation products are likely to co-occur in the environment. This mind-blowing number of possible chemical structures makes it impossible to in-silico generate all these structures, let alone synthesize and analytically confirm them, thereby limiting the discovery of novel chemicals.

Current Limitations

Today, the structural elucidation of chemicals detected with high-resolution mass spectrometry relies on databases and machine learning models trained on the known chemical space. Both are fundamentally ill-suited for discovering novel chemical structures. As a result, only a few percent of the toxic activity of the environmental samples is explained by the currently known and monitored chemicals.

Importance of Novel Chemical Space

It is crucial to access the novel chemical space to improve our understanding of the origin, fate, and impact of these chemicals.

Project Aim

The aim of LearningStructurE is to turn the discovery of novel chemical structures from serendipity to routine. As a steppingstone in this pursuit, I will combine the fundamental understanding of chromatography and high-resolution mass spectrometry with machine learning to pinpoint novel toxic chemical structures based on their empirical analytical information.

Methodology

To significantly advance the predictive power of machine learning models for empirical analytical information, I will take advantage of the candidate structures as a sample-specific training set for machine learning models. The improved predictive power will feed into in-silico structure generation, allowing elucidation of the structure directly from the empirical analytical information.

Expected Outcomes

LearningStructurE will pave the way for exploration of the unknown chemical space detected from environmental samples, and thereby improve our understanding of the emissions, chemical processes transforming the emitted chemicals, and close the gap in measured and explained toxicity.

Financiële details & Tijdlijn

Financiële details

Subsidiebedrag€ 1.867.187
Totale projectbegroting€ 1.867.187

Tijdlijn

Startdatum1-1-2024
Einddatum31-12-2028
Subsidiejaar2024

Partners & Locaties

Projectpartners

  • STOCKHOLMS UNIVERSITETpenvoerder

Land(en)

Sweden

Vergelijkbare projecten binnen European Research Council

ERC Starting...

Machine Learning Combined with Spectral Imaging for Inferring the Toxicity of Micro- and Nanoplastics

The project aims to assess micro- and nanoplastics' risks to gastrointestinal health by integrating spectral imaging, experimental bioassays, and machine learning for predictive toxicity modeling.

€ 1.499.949
ERC Starting...

Deep learning of chemical reactions

This project aims to develop advanced deep learning frameworks for modeling organic and enzymatic reactions to enhance predictions of selectivity and enable sustainable synthesis.

€ 1.499.285
ERC Consolid...

dAta-dRiven integrated approaches to CHemIcal safety assessMEnt and Drug dEvelopment

The ARCHIMEDES project aims to revolutionize chemical and drug development by integrating toxicogenomics, AI, and a Knowledge Graph to enhance safety and innovation in a regulatory-compliant manner.

€ 2.000.000
ERC Advanced...

Energy Transfer Catalysis: A Highway to Molecular Complexity

HighEnT aims to innovate synthetic methodologies using visible light-mediated EnT catalysis to create complex organic molecules for pharmacological applications, enhancing chemical space and reaction design.

€ 2.499.250
ERC Consolid...

Explainable Machine Learning for Identifying the Full Heterogeneity of Peptidoforms and Proteoforms

explAInProt aims to enhance proteomics by developing explainable, end-to-end machine learning models to identify undetected protein variants and improve clinical applications through advanced sequencing methods.

€ 1.992.500

Vergelijkbare projecten uit andere regelingen

EIC Pathfinder

QUANTUM-TOX - Revolutionizing Computational Toxicology with Electronic Structure Descriptors and Artificial Intelligence

This project aims to revolutionize computational toxicology by developing interpretable quantum mechanics-based descriptors (ESigns) for accurate toxicity predictions across the entire chemical space.

€ 1.994.770
EIC Transition

Digital Discovery Platform for Organic Electronics Materials

Develop a digital platform to streamline the discovery and commercialization of molecular materials for organic electronics, enhancing efficiency through virtual screening and supply chain integration.

€ 1.294.000