Computational models of learning the idiosyncrasy of multiword expressions

Farahmand, Meghdad

doi:10.13097/archive-ouverte/unige:96989

Doctoral thesis

English

Computational models of learning the idiosyncrasy of multiword expressions

ContributorsFarahmand, Meghdad

DirectorsMarchand-Maillet, Stéphane

; Falquet, Gilles

Defense date2017-07-17

Abstract

Idiosyncrasy is an important property of the language that enables it to be productive and at the same time prevents it from growing infinitely large. Idiosyncrasymeans having a peculiar statistical, semantic or syntactic behavior. Idiosyncratic phrases are commonly referred to as Multiword Expressions (MWEs) and have application in most natural language processing (NLP) tasks. The ability to identify and generate MWEs is essential for an NLP system designed to interact in and understand human language. Presently,most models of identifying idiosyncrasy suffer from a low precision. In order to improve the quality of MWE-related systems, more formal definitions of idiosyncrasy as well as more complex computational models need to be developed. This work attempts to define idiosyncrasy on statistical and distributional grounds and study it froma computational perspective. It also presents various models for identifying different types ofMWEs with a focus on nominal MWEs.

Affiliation entities

Citation (ISO format)

FARAHMAND, Meghdad. Computational models of learning the idiosyncrasy of multiword expressions. Doctoral Thesis, 2017. doi: 10.13097/archive-ouverte/unige:96989

Thesis

Identifiers

PID : unige:96989
DOI : 10.13097/archive-ouverte/unige:96989
URN : urn:nbn:ch:unige-969896
Thesis number : Sc. 5103

1222views

534downloads

Creation25/09/2017 13:29:00

First validation25/09/2017 13:29:00

Update time13/10/2025 19:47:03

Status update15/03/2023 02:02:52

Last indexation03/12/2025 07:39:43

Archive ouverte UNIGE

Computational models of learning the idiosyncrasy of multiword expressions

Technical informations