Doctoral thesis
OA Policy
English

Computational models of learning the idiosyncrasy of multiword expressions

ContributorsFarahmand, Meghdad
Defense date2017-07-17
Abstract

Idiosyncrasy is an important property of the language that enables it to be productive and at the same time prevents it from growing infinitely large. Idiosyncrasymeans having a peculiar statistical, semantic or syntactic behavior. Idiosyncratic phrases are commonly referred to as Multiword Expressions (MWEs) and have application in most natural language processing (NLP) tasks. The ability to identify and generate MWEs is essential for an NLP system designed to interact in and understand human language. Presently,most models of identifying idiosyncrasy suffer from a low precision. In order to improve the quality of MWE-related systems, more formal definitions of idiosyncrasy as well as more complex computational models need to be developed. This work attempts to define idiosyncrasy on statistical and distributional grounds and study it froma computational perspective. It also presents various models for identifying different types ofMWEs with a focus on nominal MWEs.

Citation (ISO format)
FARAHMAND, Meghdad. Computational models of learning the idiosyncrasy of multiword expressions. Doctoral Thesis, 2017. doi: 10.13097/archive-ouverte/unige:96989
Main files (1)
Thesis
accessLevelPublic
Identifiers
981views
450downloads

Technical informations

Creation25/09/2017 15:29:00
First validation25/09/2017 15:29:00
Update time15/03/2023 03:02:52
Status update15/03/2023 03:02:52
Last indexation31/10/2024 08:51:24
All rights reserved by Archive ouverte UNIGE and the University of GenevaunigeBlack