Abigail Walsh

Postdoctoral Researcher

A Survey of MWE Identification Experiments: The Devil is in the Details


Journal article


Carlos Ramisch, Abigail Walsh, Thomas Blanchard, Shiva Taslimipoor
Workshop on Multiword Expressions, 2023


APA
Ramisch, C., Walsh, A., Blanchard, T., & Taslimipoor, S. (2023). A Survey of MWE Identification Experiments: The Devil is in the Details. Workshop on Multiword Expressions.


Chicago/Turabian
Ramisch, Carlos, Abigail Walsh, Thomas Blanchard, and Shiva Taslimipoor. “A Survey of MWE Identification Experiments: The Devil Is in the Details.” Workshop on Multiword Expressions (2023).


MLA
Ramisch, Carlos, et al. “A Survey of MWE Identification Experiments: The Devil Is in the Details.” Workshop on Multiword Expressions, 2023.


BibTeX

@inproceedings{carlos2023a,
  title = {A Survey of MWE Identification Experiments: The Devil is in the Details},
  year = {2023},
  booktitle = {Workshop on Multiword Expressions},
  author = {Ramisch, Carlos and Walsh, Abigail and Blanchard, Thomas and Taslimipoor, Shiva}
}

Abstract

Multiword expression (MWE) identification has been the focus of numerous research papers, especially in the context of the DiMSUM and PARSEME Shared Tasks (STs). This survey analyses 40 MWE identification papers with experiments on data from these STs. We look at corpus selection, pre- and post-processing, MWE encoding, evaluation metrics, statistical significance, and error analyses. We find that these aspects are usually considered minor or omitted altogether in the literature. However, they may considerably impact the results and the conclusions drawn from them. Therefore, we advocate for more systematic descriptions of experimental conditions to reduce the risk of misleading conclusions drawn from poorly designed experimental setups.