Data-Driven and User-Centered Content Moderation
DEDUCE aims to revolutionize content moderation by developing personalized interventions based on user characteristics, enhancing effectiveness and fairness through scientific evaluation methods.
Project details
Introduction
Online platforms apply moderation interventions (MIs) to mitigate misbehavior. Today, MIs are one-size-fits-all: each intervention is applied in the same way to all users. However, users differ in their demographics, ideologies, and personalities.
Limitations of Current Approaches
This naive approach to content moderation is platform-centered and neglects user differences. Moreover, content moderation resembles art more than science. The design of MIs is based on common sense and intuition, and progress is sought via trial-and-error rather than via a rigorous scientific process.
The inevitable consequence is that current MIs vary in effectiveness, are highly unreliable, and fall short of moderation needs.
Project Goals
The ambitious goal of DEDUCE is to initiate a paradigm shift in content moderation by building the theoretical and methodological foundations to move from intuition-driven approaches enforced via one-size-fits-all MIs to science-driven strategies grounded in personalized moderation interventions (PMIs). The project proceeds in three steps:
- We will develop causal methods and indicators to evaluate the effectiveness and fairness of current content moderation practices.
- Then, we will study how user characteristics influence the outcomes of moderation.
- Finally, we will leverage the acquired knowledge to design and evaluate PMIs, a first-of-its-kind endeavor.
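As a rough illustration of the first two steps, the sketch below estimates the effect of a single MI separately for each user group with a difference-in-differences comparison. The project description does not say which causal methods DEDUCE will use; difference-in-differences is swapped in here only as a familiar example, and it is valid only under a parallel-trends assumption. The function name, the record fields, the group labels, and all numbers are hypothetical toy values, not project data.

```python
from collections import defaultdict
from statistics import mean


def diff_in_diff_by_group(records):
    """Estimate the effect of an intervention per user group.

    Each record is a dict with (hypothetical) keys:
      group   - label of a user characteristic, e.g. a personality cluster
      treated - True if the user received the moderation intervention
      before  - misbehavior count in the period before the intervention
      after   - misbehavior count in the period after the intervention
    Returns {group: effect}, where a negative effect means that misbehavior
    dropped more among treated users than among untreated ones.
    """
    changes = defaultdict(lambda: {True: [], False: []})
    for r in records:
        changes[r["group"]][r["treated"]].append(r["after"] - r["before"])

    effects = {}
    for group, by_arm in changes.items():
        # An estimate needs both treated and untreated users in the group.
        if by_arm[True] and by_arm[False]:
            effects[group] = mean(by_arm[True]) - mean(by_arm[False])
    return effects


# Synthetic toy data (assumption: invented for illustration only).
records = [
    {"group": "A", "treated": True, "before": 10, "after": 4},
    {"group": "A", "treated": True, "before": 8, "after": 4},
    {"group": "A", "treated": False, "before": 9, "after": 8},
    {"group": "A", "treated": False, "before": 7, "after": 6},
    {"group": "B", "treated": True, "before": 10, "after": 9},
    {"group": "B", "treated": True, "before": 6, "after": 7},
    {"group": "B", "treated": False, "before": 8, "after": 7},
    {"group": "B", "treated": False, "before": 6, "after": 5},
]

effects = diff_in_diff_by_group(records)
print(effects)
```

In this toy data the same intervention reduces misbehavior in group A but not in group B, which is the kind of heterogeneity that would motivate personalized interventions.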
Data-Driven Approach
Our data-driven approach will enable us to evaluate the effects of many MIs in advance (what-if analyses) and to plan their application ahead of time, rather than assessing and correcting them afterwards.
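One simple way to picture a what-if analysis is a comparison of candidate plans over previously estimated effects. The sketch below assumes a table of estimated per-user effects for each user group and intervention, then compares the best one-size-fits-all choice with a per-group choice. The intervention names, group labels, group sizes, and effect values are all hypothetical, and the source does not describe how DEDUCE will actually perform such analyses.

```python
def best_uniform(effects, group_sizes):
    """Pick the single intervention with the lowest total estimated misbehavior change."""
    interventions = next(iter(effects.values())).keys()
    totals = {
        mi: sum(effects[g][mi] * group_sizes[g] for g in effects)
        for mi in interventions
    }
    chosen = min(totals, key=totals.get)
    return chosen, totals[chosen]


def best_personalized(effects, group_sizes):
    """Pick, for each group, the intervention with the lowest estimated change."""
    plan = {g: min(by_mi, key=by_mi.get) for g, by_mi in effects.items()}
    total = sum(effects[g][plan[g]] * group_sizes[g] for g in effects)
    return plan, total


# Hypothetical estimates: change in misbehavior per user (negative is better).
effects = {
    "A": {"warning": -4, "temporary_ban": -1},
    "B": {"warning": 1, "temporary_ban": -3},
}
group_sizes = {"A": 100, "B": 50}  # hypothetical number of users per group

uniform_mi, uniform_total = best_uniform(effects, group_sizes)
plan, personalized_total = best_personalized(effects, group_sizes)
print(uniform_mi, uniform_total)
print(plan, personalized_total)
```

With these invented numbers the per-group plan yields a larger total reduction than any single intervention applied to everyone. Real estimates would carry uncertainty that a plan like this would need to account for.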
Expected Outcomes
The high-gain nature of DEDUCE is evident, as it will open new directions of research (e.g., the design of PMIs), while also providing major practical and social benefits.
Our results will yield groundbreaking advancements in the theory and practice of content moderation, and will be embodied in:
- Practical guidelines for moderators and policymakers.
- An open-source proof-of-concept system to support both human and automated moderation.
Financial details & Timeline
Financial details
Grant amount | €1,494,775 |
Total project budget | €1,494,775 |
Timeline
Start date | 1 April 2024 |
End date | 31 March 2029 |
Grant year | 2024 |
Partners & Locations
Project partners
- CONSIGLIO NAZIONALE DELLE RICERCHE (coordinator)
- UNIVERSITA DI PISA
Country(ies)
Similar projects within the European Research Council
Project | Description | Scheme | Amount | Year |
---|---|---|---|---|
Designing Social Media Recommendation Algorithms for Societal Good | The project aims to enhance social media algorithms by integrating civic discourse values to reduce risks to social cohesion while balancing freedom of expression through participatory design and risk assessment. | ERC Starting... | €2,037,464 | 2025 |
Social Media: Measuring Effects and Mitigating Downsides | This project aims to investigate the causal effects of social media on political engagement and mental health, while evaluating interventions to mitigate its negative impacts on users and society. | ERC Starting... | €1,494,625 | 2023 |
Measuring and Mitigating Risks of AI-driven Information Targeting | This project aims to assess the risks of AI-driven information targeting on individuals, algorithms, and platforms, and propose protective measures through innovative measurement methodologies. | ERC Starting... | €1,499,953 | 2022 |
Improving Digital Mental Health Interventions: ENGAGEment as Mechanism of Impact | ENGAGE aims to enhance Digital Mental Health Interventions by developing a comprehensive understanding of real-time engagement to improve individual outcomes through personalized strategies. | ERC Starting... | €1,499,590 | 2023 |
Development and Mass-dissemination of Intervention to Mobilize Pro-social Bystander Reactions to Hostile Content on Social Media | The STANDBYCOMMS project aims to enhance and disseminate a pro-social bystander intervention to combat online hostility through collaboration and scalable field testing. | ERC Proof of... | €150,000 | 2023 |
Similar projects from other schemes
Project | Description | Scheme | Amount | Year |
---|---|---|---|---|
Lasso Moderation | Develop an AI-driven content moderation platform with a user-friendly interface, aimed at efficiently moderating millions of messages for a variety of companies. | Mkb-innovati... | €20,000 | 2023 |
eXplainable AI in Personalized Mental Healthcare | This project develops an innovative AI platform that involves users in improving algorithms through feedback loops, aimed at transparency and reliability in mental healthcare. | Mkb-innovati... | €350,000 | 2022 |