Proceedings chapter
English

Modeling the Non-Substitutability of Multiword Expressions with Distributional Semantics and a Log-Linear Model

Presented at: Berlin (Germany), 7-12 August 2016
Publisher: Association for Computational Linguistics
Publication date: 2016
Abstract

Non-substitutability is a property of Multiword Expressions (MWEs) that often causes lexical rigidity and is relevant for most types of MWEs. Identifying this property efficiently can therefore help identify MWEs efficiently. In this work we propose using distributional semantics, in the form of word embeddings, to identify candidate substitutions for a candidate MWE and model its substitutability. We use our models to rank MWEs based on their lexical rigidity and study their performance in comparison with association measures. We also study the interaction between our models and association measures. We show that one of our models significantly improves over the association measure baselines at identifying collocations.
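
The substitution idea in the abstract can be illustrated with a small sketch. It is not the authors' log-linear model: the embeddings, the bigram counts, and the function names (`neighbors`, `substitutions`, `rigidity`) are all hypothetical, and the score is a simple count ratio chosen only to show the pipeline of finding neighbors, generating substituted variants, and ranking candidates.

```python
import math

# Hypothetical toy embeddings; a real setup would load pretrained word vectors.
EMBEDDINGS = {
    "strong": [0.90, 0.10, 0.00],
    "powerful": [0.85, 0.15, 0.05],
    "tea": [0.10, 0.90, 0.10],
    "coffee": [0.15, 0.85, 0.20],
    "red": [0.00, 0.20, 0.90],
    "blue": [0.05, 0.20, 0.90],
    "car": [0.30, 0.10, 0.80],
    "truck": [0.30, 0.15, 0.80],
}

# Hypothetical bigram counts standing in for corpus frequencies.
BIGRAM_COUNTS = {
    ("strong", "tea"): 50, ("powerful", "tea"): 1, ("strong", "coffee"): 45,
    ("red", "car"): 30, ("blue", "car"): 28, ("red", "truck"): 25,
}


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def neighbors(word, k=1):
    """Return the k words closest to `word` in embedding space."""
    others = [w for w in EMBEDDINGS if w != word]
    others.sort(key=lambda w: cosine(EMBEDDINGS[word], EMBEDDINGS[w]), reverse=True)
    return others[:k]


def substitutions(bigram, k=1):
    """Replace one component at a time with each of its k nearest neighbors."""
    first, second = bigram
    variants = [(n, second) for n in neighbors(first, k)]
    variants += [(first, n) for n in neighbors(second, k)]
    return variants


def rigidity(bigram, k=1):
    """Log ratio of the candidate's count to the mean count of its variants
    (add-one smoothed). An assumed heuristic, not the paper's model."""
    variants = substitutions(bigram, k)
    mean_alt = sum(BIGRAM_COUNTS.get(v, 0) for v in variants) / len(variants)
    return math.log((BIGRAM_COUNTS.get(bigram, 0) + 1) / (mean_alt + 1))


candidates = [("red", "car"), ("strong", "tea")]
ranked = sorted(candidates, key=rigidity, reverse=True)
print(ranked)
```

With these made-up numbers, "strong tea" ranks above "red car" because its substituted variants are, on average, much rarer relative to the original than those of "red car".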

Citation (ISO format)
FARAHMAND, Meghdad, HENDERSON, James. Modeling the Non-Substitutability of Multiword Expressions with Distributional Semantics and a Log-Linear Model. In: Proceedings of the 12th Workshop on Multiword Expressions. Berlin (Germany). [s.l.] : Association for Computational Linguistics, 2016. p. 61–66. doi: 10.18653/v1/W16-1809

Technical information

Creation: 14/08/2020 17:13:00
First validation: 14/08/2020 17:13:00
Update time: 15/03/2023 22:25:07
Status update: 15/03/2023 22:25:06
Last indexation: 31/10/2024 19:25:54
All rights reserved by Archive ouverte UNIGE and the University of Geneva