Marcos Zampieri

Assistant Professor
School of Computing
George Mason University
Fairfax, VA, USA

email   linkedin  Google Scholar
Headshot

About

I am an Assistant Professor at the School of Computing at George Mason University and currently a Visiting Assistant Professor at the Department of Computer Science at Duke University.

My research interests are in Computational Linguistics and Natural Language Processing. Drawing on variational and applied linguistics, I investigate language variation at both systemic and individual levels and develop computational methods to improve the safety, robustness, and accessibility of NLP systems in the age of LLMs.

I am Program Chair for SemEval (2025–2026) and co-organizer of the VarDial workshop since 2024. I have served as Tutorial Chair for ACL 2022 and Faculty Advisor for NAACL SRW 2024, alongside regular area chair and senior area chair roles at all major NLP venues.


Recent Selected Publications

For a full list of publications please check Google Scholar.

Narrate2Nav: Real-Time Visual Navigation with Implicit Language Reasoning in Human-Centric Environments
Amirreza Payandeh, Anuj Pokhrel, Daeun Song, Marcos Zampieri, Xuesu Xiao
ICRA (2026) pdf

TigerLLM - A Family of Bangla Large Language Models
Nishat Raihan, Marcos Zampieri
ACL (2025) pdf

Tracing L1 Interference in English Learner Writing: A Longitudinal Corpus with Error Annotations
Poorvi Acharya, J. Elizabeth Liebl, Dhiman Goswami, Kai North, Marcos Zampieri, Antonios Anastasopoulos
EMNLP (2025) pdf

mHumanEval - A Multilingual Benchmark to Evaluate Large Language Models for Code Generation
Nishat Raihan, Antonios Anastasopoulos, Marcos Zampieri
NAACL (2025) pdf

Bayelemabaga: Creating Resources for Bambara NLP
Allahsera Auguste Tapo, Kevin Assogba, Christopher M Homan, M. Mustafa Rafique, Marcos Zampieri
NAACL (2025) pdf

Large Language Models in Computer Science Education: A Systematic Literature Review
Nishat Raihan, Mohammed Latif Siddiq, Joanna CS Santos, Marcos Zampieri
SIGCSE (2025) pdf

Annotator Reliability Through In-Context Learning
Sujan Dutta, Deepak Pandita, Tharindu Weerasooriya, Marcos Zampieri, Christopher Homan, Ashiqur KhudaBukhsh
AAAI (2025) pdf

A Survey of Multimodal Sarcasm Detection
Shafkat Farabi, Tharindu Ranasinghe, Diptesh Kanojia, Yu Kong, Marcos Zampieri
IJCAI (2024) pdf

Language Variety Identification with True Labels
Marcos Zampieri, Kai North, Tommi Jauhiainen, Mariano Felice, Neha Kumari, Nishant Nair, Yash Bangera
LREC-COLING (2024) pdf

Native Language Identification in Texts: A Survey
Dhiman Goswami, Sharanya Thilagan, Kai North, Shervin Malmasi, Marcos Zampieri
NAACL (2024) pdf

Features of Lexical Complexity: Insights from L1 and L2 Speakers
Kai North, Marcos Zampieri
Frontiers in Artificial Intelligence (2023) url

Lexical Complexity Prediction: An Overview
Kai North, Matthew Shardlow, Marcos Zampieri
ACM Computing Surveys (2023) url

ALEXSIS-PT: A New Resource for Portuguese Lexical Simplification
Kai North, Marcos Zampieri, Tharindu Ranasinghe
COLING (2022) pdf

Handling Extreme Class Imbalance in Technical Logbook Datasets
Farhad Akhbardeh, Cecilia O. Alm, Marcos Zampieri, Travis Desell
ACL (2021) pdf


Books

Automatic Language Identification in Texts

Automatic Language Identification in Texts

Tommi Jauhiainen, Marcos Zampieri, Timothy Baldwin, Krister Lindén
Synthetisis Lectures on Human Language Technologies
Springer (2024)


Similar Languages, Varieties, and Dialects

Similar Languages, Varieties, and Dialects: A Computational Perspective

Marcos Zampieri, Preslav Nakov (Editors)
Studies in Natural Language Processing
Cambridge University Press (2021)


Last Updated: June 2026 | Template: Plain Academic