Proceedings chapter
Open access
English

Modeling the Non-Substitutability of Multiword Expressions with Distributional Semantics and a Log-Linear Model

Presented at Berlin (Germany), 7-12 August 2016
Publisher: Association for Computational Linguistics
Publication date: 2016
Abstract

Non-substitutability is a property of Multiword Expressions (MWEs) that often causes lexical rigidity and is relevant for most types of MWEs. Identifying this property efficiently can therefore lead to efficient identification of MWEs. In this work we propose using distributional semantics, in the form of word embeddings, to identify candidate substitutions for a candidate MWE and to model its substitutability. We use our models to rank MWEs by their lexical rigidity and compare their performance with that of association measures. We also study the interaction between our models and association measures. We show that one of our models significantly improves over the association measure baselines at identifying collocations.
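
The substitution idea described in the abstract can be illustrated with a minimal sketch. This is not the authors' log-linear model: the toy embeddings, the bigram counts, the function names, and the smoothed log-ratio score are all assumptions made for illustration only. The sketch finds the nearest embedding neighbour of each component word, builds the substituted variants, and scores a candidate as more rigid when it is much more frequent than its variants.

```python
import math

# Toy 2-d "embeddings" (hypothetical values); a real setup would load
# pretrained word vectors instead.
EMBEDDINGS = {
    "strong": (0.9, 0.1),
    "powerful": (0.85, 0.2),
    "weak": (-0.8, 0.1),
    "tea": (0.1, 0.9),
    "coffee": (0.2, 0.85),
    "car": (0.9, -0.5),
}

# Toy bigram counts (hypothetical numbers, not from any corpus).
BIGRAM_COUNTS = {
    ("strong", "tea"): 50,
    ("powerful", "tea"): 1,
    ("strong", "coffee"): 40,
    ("powerful", "coffee"): 2,
}


def cosine(u, v):
    """Cosine similarity between two vectors given as tuples."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm


def nearest_neighbors(word, k=1):
    """Return the k words most similar to `word` in embedding space."""
    target = EMBEDDINGS[word]
    others = [w for w in EMBEDDINGS if w != word]
    others.sort(key=lambda w: cosine(target, EMBEDDINGS[w]), reverse=True)
    return others[:k]


def substitution_variants(bigram, k=1):
    """Replace each component in turn with its nearest neighbours."""
    w1, w2 = bigram
    variants = [(n, w2) for n in nearest_neighbors(w1, k)]
    variants += [(w1, n) for n in nearest_neighbors(w2, k)]
    return variants


def rigidity_score(bigram, k=1):
    """Assumed score: log ratio of the add-one smoothed count of the
    candidate to the mean smoothed count of its substituted variants."""
    original = BIGRAM_COUNTS.get(bigram, 0) + 1
    variants = substitution_variants(bigram, k)
    mean_variant = sum(BIGRAM_COUNTS.get(v, 0) + 1 for v in variants) / len(variants)
    return math.log(original / mean_variant)


if __name__ == "__main__":
    for candidate in [("strong", "tea"), ("powerful", "coffee")]:
        print(candidate, substitution_variants(candidate), rigidity_score(candidate))
```

With these toy numbers, "strong tea" receives a positive score because it is far more frequent than its substituted variants, while "powerful coffee" receives a negative one. Ranking candidates by such a score is one simple way to approximate ranking by lexical rigidity.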

Citation (ISO format)
FARAHMAND, Meghdad, HENDERSON, James. Modeling the Non-Substitutability of Multiword Expressions with Distributional Semantics and a Log-Linear Model. In: Proceedings of the 12th Workshop on Multiword Expressions. Berlin (Germany). [s.l.] : Association for Computational Linguistics, 2016. p. 61–66. doi: 10.18653/v1/W16-1809

Technical information

Creation: 08/14/2020 5:13:00 PM
First validation: 08/14/2020 5:13:00 PM
Update time: 03/15/2023 10:25:07 PM
Status update: 03/15/2023 10:25:06 PM
Last indexation: 08/30/2023 11:18:55 PM
All rights reserved by Archive ouverte UNIGE and the University of Geneva