DEep COgnition Learning for LAnguage GEneration
This project aims to enhance NLP models by integrating machine learning, cognitive science, and structured memory to improve out-of-domain generalization and contextual understanding in language generation tasks.
Projectdetails
Introduction
In recent years, transformer-based deep learning models such as BERT or GPT-3 have led to impressive results in many natural language processing (NLP) tasks, exhibiting transfer and few-shot learning capabilities.
Limitations of Current Models
However, despite faring well in benchmarks, current deep learning models for NLP often fail badly in the wild. The main issues include:
- Poor out-of-domain generalization
- Inability to exploit contextual information
- Poor calibration
- Non-traceable memory
These limitations stem from their monolithic architectures, which are good for perception but unsuitable for tasks requiring higher-level cognition.
Project Goals
In this project, I attack these fundamental problems by bringing together tools and ideas from various fields:
- Machine Learning
- Sparse Modeling
- Information Theory
- Cognitive Science
This interdisciplinary approach will involve several key strategies:
Utility-Guided Controlled Generation
First, I will use uncertainty and quality estimates for utility-guided controlled generation. This will combine the control mechanism with the efficient encoding of contextual information and integration of multiple modalities.
Sparse and Structured Memory Models
Second, I will develop sparse and structured memory models, along with attention descriptive representations aimed at conscious processing.
Mathematical Models for Sparse Communication
Third, I will build mathematical models for sparse communication that reconcile discrete and continuous domains. These models will support end-to-end differentiability and enable a shared workspace where multiple modules and agents can communicate.
Application of Innovations
I will apply the innovations mentioned above to highly challenging language generation tasks, including:
- Machine translation
- Open dialogue
- Story generation
Collaboration and Impact
To reinforce interdisciplinarity and maximize technological impact, collaborations are planned with cognitive scientists and with a scale-up company in the crowd-sourcing translation industry.
Financiële details & Tijdlijn
Financiële details
Subsidiebedrag | € 1.999.595 |
Totale projectbegroting | € 1.999.595 |
Tijdlijn
Startdatum | 1-8-2023 |
Einddatum | 31-7-2028 |
Subsidiejaar | 2023 |
Partners & Locaties
Projectpartners
- INSTITUTO DE TELECOMUNICACOESpenvoerder
- UNBABEL UNIPESSOAL, LDA
Land(en)
Vergelijkbare projecten binnen European Research Council
Project | Regeling | Bedrag | Jaar | Actie |
---|---|---|---|---|
Next-Generation Natural Language GenerationThis project aims to enhance natural language generation by integrating neural models with symbolic representations for better control, adaptability, and reliable evaluation across various applications. | ERC Starting... | € 1.420.375 | 2022 | Details |
Towards an Artificial Cognitive ScienceThis project aims to establish a new field of artificial cognitive science by applying cognitive psychology to enhance the learning and decision-making of advanced AI models. | ERC Starting... | € 1.496.000 | 2024 | Details |
Natural Language Understanding for non-standard languages and dialectsDIALECT aims to enhance Natural Language Understanding by developing algorithms that integrate dialectal variation and reduce bias in data and labels for fairer, more accurate language models. | ERC Consolid... | € 1.997.815 | 2022 | Details |
Tensors and Neural Networks for Computational CreativityThis project aims to develop unsupervised language models using tensor constructs and advanced neural networks to enhance creativity in natural language generation. | ERC Consolid... | € 1.988.500 | 2024 | Details |
Controlling Large Language ModelsDevelop a framework to understand and control large language models, addressing biases and flaws to ensure safe and responsible AI adoption. | ERC Starting... | € 1.500.000 | 2024 | Details |
Next-Generation Natural Language Generation
This project aims to enhance natural language generation by integrating neural models with symbolic representations for better control, adaptability, and reliable evaluation across various applications.
Towards an Artificial Cognitive Science
This project aims to establish a new field of artificial cognitive science by applying cognitive psychology to enhance the learning and decision-making of advanced AI models.
Natural Language Understanding for non-standard languages and dialects
DIALECT aims to enhance Natural Language Understanding by developing algorithms that integrate dialectal variation and reduce bias in data and labels for fairer, more accurate language models.
Tensors and Neural Networks for Computational Creativity
This project aims to develop unsupervised language models using tensor constructs and advanced neural networks to enhance creativity in natural language generation.
Controlling Large Language Models
Develop a framework to understand and control large language models, addressing biases and flaws to ensure safe and responsible AI adoption.
Vergelijkbare projecten uit andere regelingen
Project | Regeling | Bedrag | Jaar | Actie |
---|---|---|---|---|
Nederlandstalige GPT-2 aanvul AI-programmatuurDeepdesk onderzoekt de haalbaarheid van het toepassen van GPT-2 voor contextafhankelijke tekstgeneratie in het Nederlands om de efficiëntie van correspondentie in contactcenters te verbeteren. | Mkb-innovati... | € 20.000 | 2021 | Details |
EdionHet project ontwikkelt een geautomatiseerd systeem voor vraaggeneratie in de natuurkunde om docenten tijd te besparen en studenten een boeiendere leerervaring te bieden. | Mkb-innovati... | € 20.000 | 2023 | Details |
Haalbaarheidsonderzoek naar AIPerLearn (AI-Powered Personalized Learning)STARK Learning onderzoekt de toepassing en training van AI-modellen om het ontwikkelen van gepersonaliseerde lesmaterialen te automatiseren en de kwaliteit en validatie te waarborgen. | Mkb-innovati... | € 20.000 | 2023 | Details |
Intenties van tekst herkennen middels neurale netwerkenMaxwell Labs en Xomnia ontwikkelen een intelligente cognitieve engine voor natural language processing van Europese talen, gericht op verbeterde intent herkenning via neurale netwerken. | Mkb-innovati... | € 199.960 | 2019 | Details |
Nederlandstalige GPT-2 aanvul AI-programmatuur
Deepdesk onderzoekt de haalbaarheid van het toepassen van GPT-2 voor contextafhankelijke tekstgeneratie in het Nederlands om de efficiëntie van correspondentie in contactcenters te verbeteren.
Edion
Het project ontwikkelt een geautomatiseerd systeem voor vraaggeneratie in de natuurkunde om docenten tijd te besparen en studenten een boeiendere leerervaring te bieden.
Haalbaarheidsonderzoek naar AIPerLearn (AI-Powered Personalized Learning)
STARK Learning onderzoekt de toepassing en training van AI-modellen om het ontwikkelen van gepersonaliseerde lesmaterialen te automatiseren en de kwaliteit en validatie te waarborgen.
Intenties van tekst herkennen middels neurale netwerken
Maxwell Labs en Xomnia ontwikkelen een intelligente cognitieve engine voor natural language processing van Europese talen, gericht op verbeterde intent herkenning via neurale netwerken.