Paper-Conference

Modular Arithmetic: Language Models Solve Math Digit by Digit

While recent work has begun to uncover the internal strategies that Large Language Models (LLMs) employ for simple arithmetic tasks, a …

Tanja Bäumel, Daniil Gurgurov, Yusser Al Ghussin, Josef van Genabith, Simon Ostermann

Language Arithmetics: Towards Systematic Language Neuron Identification and Manipulation

Large language models (LLMs) exhibit strong multilingual abilities, yet the neural mechanisms behind language-specific processing …

Daniil Gurgurov, Katharina Trinley, Yusser Al Ghussin, Tanja Bäumel, Josef van Genabith, Simon Ostermann

Multilingual Political Views of Large Language Models: Identification and Steering

Large language models (LLMs) are increasingly used in everyday tools and applications, raising concerns about their potential influence …

Daniil Gurgurov, Katharina Trinley, Ivan Vykopal, Josef van Genabith, Simon Ostermann, Roberto Zamparelli

On Multilingual Encoder Language Model Compression for Low-Resource Languages

In this paper, we combine two-step knowledge distillation, structured pruning, truncation, and vocabulary trimming for extremely …

Daniil Gurgurov, Michal Gregor, Josef van Genabith, Simon Ostermann

TenseLoC: Tense Localization and Control in a Multilingual LLM

Multilingual language models excel across languages, yet how they internally encode grammatical tense remains largely unclear. We …

Ariun-Erdene Tumurchuluun, Yusser Al Ghussin, David Marecek, Josef van Genabith, Koel Dutta Chowdhury

Saarland-Groningen at NADI 2025 Shared Task: Effective Dialectal Arabic Speech Processing under Data Constraints

We present our systems for the NADI 2025 shared task on multidialectal Arabic speech processing, participating in both spoken dialect …

Badr M. Abdullah, Yusser Al Ghussin, Zena Al-Khalili, Ömer Tarik Özyilmaz, Matias Valdenegro-Toro, Simon Ostermann, Dietrich Klakow

The Lookahead Limitation: Why Multi-Operand Addition is Hard for LLMs

Autoregressive large language models (LLMs) exhibit impressive performance across various tasks but struggle with simple arithmetic, …

Tanja Bäumel, Josef van Genabith, Simon Ostermann

A Rigorous Evaluation of LLM Data Generation Strategies for Low-Resource Languages

Large Language Models (LLMs) are increasingly used to generate synthetic textual data for training smaller specialized models. However, …

Tatiana Anikina, Jan Cegin, Jakub Simko, Simon Ostermann

Building Common Ground in Dialogue: A Survey

Common ground plays a crucial role in human communication since it helps to establish shared knowledge. However, common ground is also …

Tatiana Anikina, Alina Leippert, Simon Ostermann

Large Language Models for Multilingual Previously Fact-Checked Claim Detection

In our era of widespread false information, human fact-checkers often face the challenge of duplicating efforts when verifying claims …

Ivan Vykopal, Matus Pikuliak, Simon Ostermann, Tatiana Anikina, Michal Gregor, Marian Simko