Saturday, May 16, 2026 | Palma de Mallorca, Spain
| 09:00 - 10:30 | Session A - Room 4 |
| 09:10 - 09:20 | A Bolu: A Structured Dataset for the Computational Analysis of Sardinian Improvisational Poetry Università di Pisa, "L'Orientale" University of Naples |
| 09:20 - 09:30 | Saar-Voice: A Multi-Speaker Saarbrücken Dialect Speech Corpus Saarland University |
| 09:30 - 09:40 | MD_NLP: Reconstructing an Australian English Heritage Dialect Corpus from the Mitchell–Delbridge Recordings through LLM-Assisted Speaker Attribution University of Oulu |
| 09:40 - 09:50 | Challenges in the Detection of Dialect for Historical Languages; the Case of Old Irish Text Resources University of Galway |
| 10:00 - 10:30 | 2-minute poster presentations |
| 10:30 - 11:00 | Poster Session & coffee break (Running in parallel) |
| 11:00 - 13:00 | Session B - Room 4 |
| 11:00 - 11:10 | Phonologically-aware Automatic Speech Recognition Evaluation of Low-Resource Languages: The Case of Basque Dialects University of the Basque Country UPV/EHU |
| 11:10 - 11:20 | Systematic Normalization of Spoken Mixed-Language, Mixed-Dialect Data The University of Texas at Austin |
| 11:20 - 11:30 | Evaluating Cross-Dialect Syntactic Variation: a Theory-Driven Web Resource Università di Modena e Reggio Emilia, Università di Padova, University of York |
| 11:30 - 11:40 | Can LLM Agents Identify Spoken Dialects like a Linguist? University of Bonn, Fraunhofer IAIS, Philipps-Universität Marburg |
| 11:45 - 12:30 | Invited Talk Prof. Barbara Plank, LMU Munich, Visiting Prof ITU Copenhagen |
| 12:30 - 13:00 | Community Discussion |
| Beyond Accuracy: Analyzing Dialect Confusion in Automatic Speech-Based Dialect Classification |
| FLEURS-Kobani: Extending FLEURS dataset for Northern Kurdish |
| Exploring the reusability of Northern Kurdish resources for Badini speech recognition |
| Wancho Dialectometry: Community-created data and the Living Dictionaries project |
| Dialectometry and Evaluation of the ePark Corpus for Low-Resource Formosan Language Dialects |
| A Dialectal Corpus for Ukrainian: Collection, Classification, and Standardization |
| German Dialects Across Situations, Generations, and Regions: The REDE corpus as an Oral Resource for NLP |
| A Catalog of Basque Dialectal Resources: Online Collections and Standard-to-Dialectal Adaptations |
| WoVis: Interactive Visualization of Word Embeddings for Semantic Change in Historical and Dialectal Language Resources |
| Speaker Normalization via Voice Conversion Reveals a Human-Machine Dissociation in Dialect Classification |
| South Tyrolean Dialect-to-Standard Speech Translation |
| TransVar – the Corpus for Variation and Change Study of the Historical Transcarpathian lects |
| The Generator-Eraser Paradox: Community Guidelines for Responsible LLM-Assisted Dialect Resource Creation |
| The Texas German Dialect Project Corpus as a Diachronic Resource for Investigating Language Contact |
| Pontic Greek in the Caucasus: an online corpus |
| Meaning Over Morphology: A Multi-Metric Benchmark of LLMs for Bangla Dialect Translation |
| Sociolinguistic aspects of crowdsourcing for a vocal corpus of Alsatian |
| HeptaTAX: A Neuro-Symbolic Pipeline and Benchmark for Classifying 16th-Century Heptanesian Notarial Acts |
| Towards Semantic Access and Interoperability in Digital Dialectal Atlases. A Case Study |
| A CLDF-Compliant Lexical Database for Modern Greek Dialects: Resource Design and Dialectometric Analysis |
| A Speech Resource for the Pontic Greek Dialect: Transcription Choices and Baseline ASR Evaluation |
| First Steps in ASR for Cypriot Greek: Challenges and Insights |
| Structural Divergence under Shared Language-Level Specification: Griko in Universal Dependencies |
| Digital Preservation of Aromanian Through Knowledge Management and Automatic Speech Recognition Evaluation |
| A Novel Typology of Mutually Intelligible Words: The Case of Slavic Languages |
| Transfer Learning for an Endangered Slavic Variety: Dependency Parsing in Pomak Across Contact-Shaped Dialects |