For an updated list of publications, please visit my profile on Google Scholar. In the list below, asterisks denote equal contribution.

Conference Proceedings and Findings

Esther Ploeger, Paola Saucedo, Johannes Bjerva, Ross Deans Kristensen-McLachlan & Heather Lent (2025). Tokenization on Trial: The Case of Kalaallisut–Danish Legal Machine Translation. Proceedings of the Joint 25th Nordic Conference on Computational Linguistics and 11th Baltic Conference on Human Language Technologies (NoDaLiDa/BalticHLT 2025). [link]

Esther Ploeger*, Wessel Poelman*, Miryam de Lhoneux & Johannes Bjerva (2024). What is “Typological Diversity” in NLP? Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing (EMNLP 2024). [link]

Esther Ploeger, Huiyuan Lai, Rik van Noord & Antonio Toral (2024). Towards Tailored Recovery of Lexical Diversity in Literary Machine Translation. Proceedings of the 25th Annual Conference of the European Association for Machine Translation (EAMT 2024). [link]

Emi Baylor*, Esther Ploeger* & Johannes Bjerva (2024). Multilingual Gradient Word-Order Typology from Universal Dependencies. Proceedings of the 18th Conference of the European Chapter of the Association for Computational Linguistics (EACL 2024). [link]

Emi Baylor*, Esther Ploeger* & Johannes Bjerva* (2023). The Past, Present and Future of Typological Databases in NLP. Findings of the 2023 Conference on Empirical Methods in Natural Language Processing (EMNLP Findings 2023). [link]

Journal Papers and Submitted Manuscripts

Esther Ploeger, Wessel Poelman, Andreas Holck Høeg-Petersen, Anders Schlichtkrull, Miryam de Lhoneux & Johannes Bjerva (2024). A Principled Framework for Evaluating on Typologically Diverse Languages. Currently under review. [link]

Pieter Fivez, Walter Daelemans, Tim Van de Cruys, Yury Kashnitsky, Savvas Chamezopoulos, Hadi Mohammadi, Anastasia Giachanou, Ayoub Bagheri, Wessel Poelman, Juraj Vladika, Esther Ploeger, Johannes Bjerva, Florian Matthes & Hans van Halteren (2024). The CLIN33 Shared Task on the Detection of Text Generated by Large Language Models. Computational Linguistics in the Netherlands Journal, 13, pp. 233-259. [link]

Heather Lent, Kushal Tatariya, Raj Dabre, Yiyi Chen, Marcell Fekete, Esther Ploeger, Li Zhou, Hans Erik Heje, Diptesh Kanojia, Paul Belony, Marcel Bollmann, Loïc Grobol, Miryam de Lhoneux, Daniel Hershcovich, Michel DeGraff, Anders Søgaard & Johannes Bjerva (2024). CreoleVal: Multilingual Multitask Benchmarks for Creoles. Transactions of the Association for Computational Linguistics (TACL). [link]

Pre-prints

Huiyuan Lai, Esther Ploeger, Rik van Noord & Antonio Toral (2024). Multi-perspective Alignment for Increasing Naturalness in Neural Machine Translation. Currently under review. [link]

Kushal Tatariya, Artur Kulmizev, Wessel Poelman, Esther Ploeger, Marcel Bollmann, Johannes Bjerva, Jiaming Luo, Heather Lent & Miryam de Lhoneux (2024). How Good is Your Wikipedia? Currently under review. [link]

Workshop Papers, Shared Task Papers and Extended Abstracts

Wessel Poelman*, Esther Ploeger*, Miryam de Lhoneux & Johannes Bjerva (2024). A Call for Consistency in Reporting Typological Diversity. Proceedings of the 6th Workshop on Research in Computational Linguistic Typology and Multilingual NLP (SIGTYP 2024). [link]

Frank van den Berg*, Gijs Danoe*, Esther Ploeger*, Wessel Poelman*, Lukas Edman & Tommaso Caselli (2022). RUG-1-pegasussers at SemEval-2022 Task 3: Data generation methods to improve recognizing appropriate taxonomic word relations. Proceedings of the 16th International Workshop on Semantic Evaluation (SemEval 2022). [link]

Andreas van Cranenburgh, Esther Ploeger, Frank van den Berg & Remi Thüss (2021). A Hybrid Rule-Based and Neural Coreference Resolution System with an Evaluation on Dutch Literature. Proceedings of the Fourth Workshop on Computational Models of Reference, Anaphora and Coreference (CRAC 2021). [link]

Large-scale Collaborations

For the papers below, which include the publication of large multilingual datasets, I collected, created and curated data for Dutch.

Angelika Romanou, Negar Foroutan, Anna Sotnikova, Sree Harsha Nelaturu, Shivalika Singh, Rishabh Maheshwary, Micol Altomare, Zeming Chen, Mohamed A. Haggag, Snegha A, Alfonso Amayuelas, Azril Hafizi Amirudin, Danylo Boiko, Michael Chang, Jenny Chim, Gal Cohen, Aditya Kumar Dalmia, Abraham Diress, Sharad Duwal, Daniil Dzenhaliou, Daniel Fernando Erazo Florez, Fabian Farestam, Joseph Marvin Imperial, Shayekh Bin Islam, Perttu Isotalo, Maral Jabbarishiviari, Börje F. Karlsson, Eldar Khalilov, Christopher Klamm, Fajri Koto, Dominik Krzemiński, Gabriel Adriano de Melo, Syrielle Montariol, Yiyang Nan, Joel Niklaus, Jekaterina Novikova, Johan Samir Obando Ceron, Debjit Paul, Esther Ploeger, Jebish Purbey, Swati Rajwal, Selvan Sunitha Ravi, Sara Rydell, Roshan Santhosh, Drishti Sharma, Marjana Prifti Skenduli, Arshia Soltani Moakhar, Bardia Soltani Moakhar, Ayush Kumar Tarun, Azmine Toushik Wasi, Thenuka Ovin Weerasinghe, Serhan Yilmaz, Mike Zhang, Imanol Schlag, Marzieh Fadaee, Sara Hooker & Antoine Bosselut (2025). INCLUDE: Evaluating Multilingual Language Understanding with Regional Knowledge. Accepted to ICLR 2025. [link]

Margaret Mitchell, Giuseppe Attanasio, Ioana Baldini, Miruna Clinciu, Jordan Clive, Pieter Delobelle, Manan Dey, Sil Hamilton, Timm Dill, Jad Doughman, Ritam Dutt, Avijit Ghosh, Jessica Zosa Forde, Carolin Holtermann, Lucie-Aimée Kaffee, Tanmay Laud, Anne Lauscher, Roberto L Lopez-Davila, Maraim Masoud, Nikita Nangia, Anaelia Ovalle, Giada Pistilli, Dragomir Radev, Beatrice Savoldi, Vipul Raheja, Jeremy Qin, Esther Ploeger, Arjun Subramonian, Kaustubh Dhole, Kaiser Sun, Amirbek Djanibekov, Jonibek Mansurov, Kayo Yin, Emilio Villa Cueva, Sagnik Mukherjee, Jerry Huang, Xudong Shen, Jay Gala, Hamdan Al-Ali, Tair Djanibekov, Nurdaulet Mukhituly, Shangrui Nie, Shanya Sharma, Karolina Stanczak, Eliza Szczechla, Tiago Timponi Torrent, Deepak Tunuguntla, Marcelo Viridiano, Oskar van der Wal, Adina Yakefu, Aurélie Névéol, Mike Zhang, Sydney Zink & Zeerak Talat (2025). SHADES: Towards a Multilingual Assessment of Stereotypes in Large Language Models. Accepted to NAACL 2025. [link]