TRAILS - Trustworthy and Inclusive Machines

Natural language processing (NLP) has demonstrated impressive performance in some human tasks. To achieve such performance, current neural models need to be pre-trained on huge amounts of raw text data. This dependence on uncurated data has at least four indirect and unintended consequences:

Uncurated data tends to be linguistically and culturally non-diverse due to the statistical dominance of major languages and dialects in online texts (English vs. North Frisian, US English vs. UK English, etc.).
Pre-trained neural models such as the ubiquitous pre-trained language models (PLM) reproduce the features present in the data, including human biases.
Rare phenomena (or languages) in the “long tail” are often not sufficiently taken into account in model evaluation, leading to an underestimation of model performance, especially in real-world application scenarios.
The focus on achieving state-of-the-art results through the use of transfer learning with giant PLMs such as GPT4 or mT5 often underestimates alternative methods that are more accessible, efficient and sustainable.

As inclusion and trust are undermined by these problems, in TRAILS we focus on three main research directions to address such problems: (i) inclusion of underrepresented languages and cultures through multilingual and culturally sensitive NLP, (ii) robustness and fairness with respect to long-tail phenomena and classes and “trustworthy content”, and (iii) robust and efficient NLP models that enable training and deployment of models for (i) and (ii). We also partially address economic inequality by aiming for more efficient models (objective (iii)), which directly translates into a lower resource/cost footprint.

TRAILS is funded by the German Federal Ministry of Research, Technology and Space (BMFTR) under the funding code 16IW24005.

Principal Investigators

Sebastian Möller

Professor for Quality and Usability, TU Berlin and Department Head, DFKI

Simon Ostermann

Senior Researcher

Researchers

Yusser Al Ghussin

PhD Student

Tatiana Anikina

PhD Student

Tanja Bäumel

PhD Student

Daniil Gurgurov

PhD Student

David Harbecke

PhD Student

Josef van Genabith

Professor at German Research Center for Artificial Intelligence (DFKI)

Günter Neumann

Professor at German Research Center for Artificial Intelligence (DFKI)

Arne Binder

PhD Student

Cristina España i Bonet

Senior Researcher

Aleksandra Gabryszak

PhD Student

Leonhard Hennig

Senior Researcher

News

Three papers by TRAILS authors accepted to IJCNLP-AACL 2025

Three papers from researchers in the TRAILS project have been accepted at the 2025 International Joint Conference on Natural Language Processing & Asia-Pacific Chapter of the Association for Computational Linguistics (IJCNLP-AACL 2025).

25. Nov 2025 1 min read

Five papers by TRAILS authors accepted to EMNLP 2025

Five papers from researchers in the TRAILS project have been accepted as Main and Findings papers at the 2025 Conference on Empirical Methods in Natural Language Processing (EMNLP 2025), and one has been accepted as best paper at the Workshop on Multilingual Representation Learning.

30. Sep 2025 2 min read

Three papers by TRAILS authors accepted to NAACL 2025 and associated workshops, and one to COLING 2025

Two papers originating from research in the TRAILS project have been accepted to the Main Track and Findings of the 2025 Annual Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics (NAACL).

3. Feb 2025 2 min read

Recent Publications

Modular Arithmetic: Language Models Solve Math Digit by Digit

While recent work has begun to uncover the internal strategies that Large Language Models (LLMs) employ for simple arithmetic tasks, a …

Tanja Bäumel, Daniil Gurgurov, Yusser Al Ghussin, Josef van Genabith, Simon Ostermann

Language Arithmetics: Towards Systematic Language Neuron Identification and Manipulation

Large language models (LLMs) exhibit strong multilingual abilities, yet the neural mechanisms behind language-specific processing …

Daniil Gurgurov, Katharina Trinley, Yusser Al Ghussin, Tanja Bäumel, Josef van Genabith, Simon Ostermann

Multilingual Political Views of Large Language Models: Identification and Steering

Large language models (LLMs) are increasingly used in everyday tools and applications, raising concerns about their potential influence …

Daniil Gurgurov, Katharina Trinley, Ivan Vykopal, Josef van Genabith, Simon Ostermann, Roberto Zamparelli

On Multilingual Encoder Language Model Compression for Low-Resource Languages

In this paper, we combine two-step knowledge distillation, structured pruning, truncation, and vocabulary trimming for extremely …

Daniil Gurgurov, Michal Gregor, Josef van Genabith, Simon Ostermann

TenseLoC: Tense Localization and Control in a Multilingual LLM

Multilingual language models excel across languages, yet how they internally encode grammatical tense remains largely unclear. We …

Ariun-Erdene Tumurchuluun, Yusser Al Ghussin, David Marecek, Josef van Genabith, Koel Dutta Chowdhury

Saarland-Groningen at NADI 2025 Shared Task: Effective Dialectal Arabic Speech Processing under Data Constraints

We present our systems for the NADI 2025 shared task on multidialectal Arabic speech processing, participating in both spoken dialect …

Badr M. Abdullah, Yusser Al Ghussin, Zena Al-Khalili, Ömer Tarik Özyilmaz, Matias Valdenegro-Toro, Simon Ostermann, Dietrich Klakow

The Lookahead Limitation: Why Multi-Operand Addition is Hard for LLMs

Autoregressive large language models (LLMs) exhibit impressive performance across various tasks but struggle with simple arithmetic, …

Tanja Bäumel, Josef van Genabith, Simon Ostermann

A Rigorous Evaluation of LLM Data Generation Strategies for Low-Resource Languages

Large Language Models (LLMs) are increasingly used to generate synthetic textual data for training smaller specialized models. However, …

Tatiana Anikina, Jan Cegin, Jakub Simko, Simon Ostermann

Building Common Ground in Dialogue: A Survey

Common ground plays a crucial role in human communication since it helps to establish shared knowledge. However, common ground is also …

Tatiana Anikina, Alina Leippert, Simon Ostermann

Large Language Models for Multilingual Previously Fact-Checked Claim Detection

In our era of widespread false information, human fact-checkers often face the challenge of duplicating efforts when verifying claims …

Ivan Vykopal, Matus Pikuliak, Simon Ostermann, Tatiana Anikina, Michal Gregor, Marian Simko

Multilingual Datasets for Custom Input Extraction and Explanation Requests Parsing in Conversational XAI Systems

Conversational explainable artificial intelligence (ConvXAI) systems based on large language models (LLMs) have garnered considerable …

Qianli Wang, Tatiana Anikina, Nils Feldhus, Simon Ostermann, Fedor Splitt, Jiaao Li, Yoana Tsoneva, Sebastian Möller, Vera Schmitt

PolBiX: Detecting LLMs' Political Bias in Fact-Checking through X-phemisms

Large Language Models are increasingly used in applications requiring objective assessment, which could be compromised by political …

Charlott Jakob, David Harbecke, Patrick Parschan, Pia Wenzel Neves, Vera Schmitt

PolBiX: Detecting LLMs' Political Bias in Fact-Checking through X-phemisms

Automatic Fact-checking in English and Telugu

False information poses a significant global challenge, and manually verifying claims is a time-consuming and resource-intensive …

Ravi Kiran Chikkala, Tatiana Anikina, Natalia Skachkova, Ivan Vykopal, Rodrigo Agerri, Josef van Genabith

dfkinit2b at CheckThat! 2025: Leveraging LLMs and Ensemble of Methods for Multilingual Claim Normalization

The rapid spread of misinformation on social media across languages presents a major challenge for fact-checking efforts. Social media …

Tatiana Anikina, Ivan Vykopal, Sebastian Kula, Ravi Chikkala, Natalia Skachkova, Jing Yang, Veronika Solopova, Vera Schmitt, Simon Ostermann

Cross-Lingual Fact Verification: Analyzing LLMs Performance Patterns Across Languages

Fact verification has emerged as a critical task in combating misinformation, yet most research remains focused on English-language …

Hanna Shcharbakova, Tatiana Anikina, Natalia Skachkova, Josef van Genabith

See all publications

TRAILS - Sponsored by the Federal Ministry of Education and Research