The MultiplEYE COST Action aims to foster an interdisciplinary network of research groups working on collecting eye tracking data from reading in many languages. The goal is to support the development of a large multilingual eye tracking corpus and enable researchers to collect data by sharing infrastructure and their knowledge between various fields, including linguistics, psychology, and computer science. This data collection can then be used to study human language processing from a psycholinguistic perspective as well as to improve and evaluate computational language processing from a machine learning perspective.
The MultiplEYE COST Action has three core goals:
1. To provide a platform for discussing the desiderata and reaching a common ground between psycholinguists and computational linguists for a multilingual eye-tracking and self-paced reading data collection. This includes developing and reaching a consensus concerning experiment design, stimulus selection, stimulus layout, experimental procedure, and data preprocessing.
2. To enable discussions on the psycholinguistic research questions that can be addressed with multilingual eye movement data and providing a broad network to initiate collaborations focusing on cross-linguistic and multilingual projects.
3. To advance the natural language processing and machine learning applications that leverage eye-tracking data and improve their cross-linguistic generalization abilities by bringing researchers from psycholinguistics and computational linguistics closer together.
In this meeting, participants discussed and tested the MultiplEYE preprocessing pipeline. The pipeline is supposed to be applied to all MultiplEYE datasets and beyond. The focus was therefore on the usability of the pipeline in general, as well as extending it to other datasets.
The MultiplEYE meeting in Bucharest brought together the project teams to review annotation progress and coordinate upcoming work. The sessions covered sentence and word tokenization, POS tagging, lemmatization, syllable counting, named entity recognition, and coreference resolution. Particular emphasis was placed on sharing experiences with language-specific annotation challenges.
MultiplEYE is co-organising the Tenth Summer School on Statistical Methods for Linguistics and Psychology which will take place in Potsdam, Germany, on 24-28 August 2026.
Please send an email to jskn@hum.ku.dk to subscribe to our newsletter.
Your email address is used solely to send you our newsletter and information concerning our activities (2–3 times per year). You can unsubscribe any time by simply replying to the newsletter.
MultiplEYE members do not have to subscribe to receive the newsletter.